Starred repositories
Official repository for "Craw4LLM: Efficient Web Crawling for LLM Pretraining"
Scalable data pre processing and curation toolkit for LLMs
Fully open reproduction of DeepSeek-R1
[TMLR] A curated list of language modeling researches for code (and other software engineering activities), plus related datasets.
Machine Learning Engineering Open Book
A lightweight script for processing HTML page to markdown format with support for code blocks
Heuristic filtering framework for RefineCode
The Open Cookbook for Top-Tier Code Large Language Model
This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."
🐙 Guides, papers, lecture, notebooks and resources for prompt engineering
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
🎉 Repo for LaWGPT, Chinese-Llama tuned with Chinese Legal knowledge. 基于中文法律知识的大语言模型
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
Hackable and optimized Transformers building blocks, supporting a composable construction.
Code and documentation to train Stanford's Alpaca models, and generate the data.
Ongoing research training transformer models at scale
Code and Models for the paper "End-to-End Training of Multi-Document Reader and Retriever for Open-Domain Question Answering" (NeurIPS 2021)
Distributional Generalization in NLP. A roadmap.
Awesome papers on Language-Model-as-a-Service (LMaaS)
Code for the book "High Performance Python 2e" by Micha Gorelick and Ian Ozsvald with OReilly
Authors' implementation of EMNLP-IJCNLP 2019 paper "Answering Complex Open-domain Questions Through Iterative Query Generation"
This is an implementation of Hearst patterns, for finding hyponyms, written in Python.
Dense Passage Retriever - is a set of tools and models for open domain Q&A task.
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Python implementation of algorithms from Russell And Norvig's "Artificial Intelligence - A Modern Approach"
An Open-Source Package for Textual Adversarial Attack.