Stars
A framework for few-shot evaluation of language models.
A high-throughput and memory-efficient inference and serving engine for LLMs
The official GitHub page for the survey paper "A Survey of Large Language Models".
Fully open reproduction of DeepSeek-R1
This is a replicate of DeepSeek-R1-Zero and DeepSeek-R1 training on small models with limited data
The most comprehensive database of Chinese poetry 🧶最全中华古诗词数据库, 唐宋两朝近一万四千古诗人, 接近5.5万首唐诗加26万宋诗. 两宋时期1564位词人,21050首词。
Ongoing research training transformer models at scale
本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
[AAAI 2025] Follow-Your-Click: This repo is the official implementation of "Follow-Your-Click: Open-domain Regional Image Animation via Short Prompts"
AiMed面向中文医学的人工智能大语言模型期望实现有效处理医学知识问答、医学论文阅读、医学文献检索等任务和在医学科研中的应用。
A large model-based chatbot builder that can quickly integrate AI models (including ChatGPT, Claude, Gemini) into various software applications (such as Telegram, Gmail, Slack, and websites).
[AAAI 2024] Follow-Your-Pose: This repo is the official implementation of "Follow-Your-Pose : Pose-Guided Text-to-Video Generation using Pose-Free Videos"