Stars
Welcome to the Llama Cookbook! This is your go-to guide for building with Llama: getting started with inference, fine-tuning, and RAG. We also show you how to solve end-to-end problems using Llama mode…
Hydragen: High-Throughput LLM Inference with Shared Prefixes
File Parser optimised for LLM Ingestion with no loss. Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
We write your reusable computer vision tools.
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
Zero Bubble Pipeline Parallelism
MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.
Retrieval and Retrieval-augmented LLMs
Tools for merging pretrained large language models.
Awesome-LLM: a curated list of Large Language Models
Build context-aware reasoning applications
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
A series of large language models trained from scratch by developers @01-ai
Robust recipes to align language models with human and AI preferences
Task-oriented embedding tuning for BERT, CLIP, etc.
A programming framework for agentic AI. PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/autogen-officehour
Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".
Fast inference engine for Transformer models
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
Type less, code more: Cody is an AI code assistant that uses advanced search and codebase context to help you write and fix code.
[NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning
LlamaIndex is the leading framework for building LLM-powered agents over your data.
Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)
H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs. Documentation: https://docs.h2o.ai/h2o-llmstudio/
A modular RL library to fine-tune language models to human preferences
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)