Highlights
Starred repositories
🦜🔗 Build context-aware reasoning applications
Free Data Engineering course!
A guidance language for controlling large language models.
A small set of Python functions to draw pretty maps from OpenStreetMap data. Based on osmnx, matplotlib and shapely libraries.
Learn ML engineering for free in 4 months!
This repository contains demos I made with the Transformers library by HuggingFace.
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑🔬
An annotated implementation of the Transformer paper.
Evidently is an open-source ML and LLM observability framework. Evaluate, test, and monitor any AI-powered system or data pipeline. From tabular data to Gen AI. 100+ metrics.
Easily train or fine-tune SOTA computer vision models with one open source training library. The home of Yolo-NAS.
Solve puzzles. Improve your pytorch.
Pytorch implementations of various Deep NLP models in cs-224n(Stanford Univ)
NMA Computational Neuroscience course
This is a Phi-3 book for getting started with Phi-3. Phi-3, a family of open sourced AI models developed by Microsoft. Phi-3 models are the most capable and cost-effective small language models (SL…
Efficient few-shot learning with Sentence Transformers
MTEB: Massive Text Embedding Benchmark
New ways of breaking app-integrated LLMs
Python class that generates pixel art from images
Interpretable ML package 🔍 for concise, transparent, and accurate predictive modeling (sklearn-compatible).
Python code for part 2 of the book Causal Inference: What If, by Miguel Hernán and James Robins
Project page for "The Fuzzing Book"
Public runnable examples of using John Snow Labs' NLP for Apache Spark.
What would you do with 1000 H100s...
Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard and designing lighteval!
The book every data scientist needs on their desk.
The website for the CMU Language Technologies Institute low resource NLP bootcamp 2020
Graph Machine Learning course, Xavier Bresson, 2023
BioWordVec & BioSentVec: pre-trained embeddings for biomedical words and sentences