Starred repositories
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
OpenAI GPT2 pre-training and sequence prediction implementation in Tensorflow 2.0
The related works and background techniques about Openai o1
🦜🔗 Build context-aware reasoning applications
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.
llama3 implementation one matrix multiplication at a time
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
📚200+ Tensor/CUDA Cores Kernels, ⚡️flash-attn-mma, ⚡️hgemm with WMMA, MMA and CuTe (98%~100% TFLOPS of cuBLAS/FA2 🎉🎉).
Large Language Model (LLM) Systems Paper List
📖A curated list of Awesome LLM/VLM Inference Papers with codes: WINT8/4, FlashAttention, PagedAttention, MLA, Parallelism, etc. 🎉🎉
Awesome-LLM: a curated list of Large Language Model
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
Llama中文社区,Llama3在线体验和微调模型已开放,实时汇总最新Llama3学习资料,已将所有代码更新适配Llama3,构建最好的中文Llama大模型,完全开源可商用
The official GitHub page for the survey paper "A Survey of Large Language Models".
A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)
Robust recipes to align language models with human and AI preferences
中文LLaMA-2 & Alpaca-2大模型二期项目 + 64K超长上下文模型 (Chinese LLaMA-2 & Alpaca-2 LLMs with 64K long context models)
A series of large language models developed by Baichuan Intelligent Technology
A 13B large language model developed by Baichuan Intelligent Technology
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. 训练医疗大模型,实现了包括增量预训练(PT)、有监督微调(SFT)、RLHF、DPO、ORPO、GRPO。
中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)
BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…