Stars
A bibliography and survey of the papers surrounding o1
Topological Data Analysis (TDA) for Natural Language Processing (NLP) Applications
Lisp code for the textbook "Paradigms of Artificial Intelligence Programming"
A list of tech-related Bluesky starter packs
Machine Learning Engineering Open Book
Large language models (LLMs) made easy: EasyLM is a one-stop solution for pre-training, fine-tuning, evaluating, and serving LLMs in JAX/Flax.
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Lexical relations data extracted from AO-CHILDES
Pipeline to generate the Standardized Project Gutenberg Corpus
Lecture materials for Cornell CS5785 Applied Machine Learning (Fall 2024)
Llama 3 implementation, one matrix multiplication at a time
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
Materials for a language modeling class, broadly construed
Pure-Python library for adding annotations to PDFs
This repository accompanies the book "Grokking Deep Learning"
CiteME is a benchmark designed to test the abilities of language models in finding papers that are cited in scientific texts.
An open-source library for contamination detection in NLP datasets and Large Language Models (LLMs).
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
A central, open resource for data and tools related to chain-of-thought reasoning in large language models. Developed @ Samwald research group: https://samwald.info/
Enforce the output format (JSON Schema, regex, etc.) of a language model
Natural Language Inference is fundamental to many Natural Language Processing applications such as semantic search and question answering. The task of NLI has gained significant attention in the re…
[ACL 2024] A Survey of Chain of Thought Reasoning: Advances, Frontiers and Future
Arrakis is a library to conduct, track and visualize mechanistic interpretability experiments.
A collection of word lists in machine-readable, web-native (.yml and .json) formats