Stars
A framework for few-shot evaluation of language models.
Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]
Let your Claude able to think
DeepSeek LLM: Let there be answers
Code and documentation to train Stanford's Alpaca models, and generate the data.
RECOMP: Improving Retrieval-Augmented LMs with Compression and Selective Augmentation.
This repository collects papers for "A Survey on Knowledge Distillation of Large Language Models". We break down KD into Knowledge Elicitation and Distillation Algorithms, and explore the Skill & V…
A comprehensive library for implementing LLMs, including a unified training pipeline and comprehensive model evaluation.
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
A playbook for effectively prompting post-trained LLMs
Writing AI Conference Papers: A Handbook for Beginners
A Latex style and template for paper preprints (based on NIPS style)
Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.
Understanding Deep Learning - Simon J.D. Prince
Awesome LLM Benchmarks to evaluate the LLMs across text, code, image, audio, video and more.
ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型