Lists (1)
Sort Name ascending (A-Z)
Stars
Scaling Deep Research via Reinforcement Learning in Real-world Environments.
Search-o1: Agentic Search-Enhanced Large Reasoning Models
Qihoo360 / 360-LLaMA-Factory
Forked from hiyouga/LLaMA-Factoryadds Sequence Parallelism into LLaMA-Factory
大模型算法岗面试题(含答案):常见问题和概念解析 "大模型面试题"、"算法岗面试"、"面试常见问题"、"大模型算法面试"、"大模型应用基础"
总结梳理自然语言处理工程师(NLP)需要积累的各方面知识,包括面试题,各种基础知识,工程能力等等,提升核心竞争力
Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL
Awesome RL-based LLM Reasoning
Fully open reproduction of DeepSeek-R1
verl: Volcano Engine Reinforcement Learning for LLMs
Clean, minimal, accessible reproduction of DeepSeek R1-Zero
LiveBench: A Challenging, Contamination-Free LLM Benchmark
An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & LoRA & vLLM & RFT)
Arena-Hard-Auto: An automatic LLM benchmark.
WildEval / ZeroEval
Forked from allenai/WildBenchA simple unified framework for evaluating LLMs
The official evaluation suite and dynamic data release for MixEval.
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Robust recipes to align language models with human and AI preferences
A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.
A snappy, keyboard-centric terminal user interface for interacting with large language models. Chat with ChatGPT, Claude, Llama 3, Phi 3, Mistral, Gemma and more.
[NeurIPS 2023] LLM-Pruner: On the Structural Pruning of Large Language Models. Support Llama-3/3.1, Llama-2, LLaMA, BLOOM, Vicuna, Baichuan, TinyLlama, etc.
Awesome LLM compression research papers and tools.
Simple frontend for LLMs built in react-native.
🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.
Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, Mistral Small 3.1 and other large language models.