Stars
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
Image Polygonal Annotation with Python (polygon, rectangle, circle, line, point and image-level flag annotation).
Train transformer language models with reinforcement learning.
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support
Example models using DeepSpeed
Firefly: 大模型训练工具,支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.
800,000 step-level correctness labels on LLM solutions to MATH problems
A library for advanced large language model reasoning
ToRA is a series of Tool-integrated Reasoning LLM Agents designed to solve challenging mathematical reasoning problems by interacting with tools [ICLR'24].
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
GAOKAO-Bench is an evaluation framework that utilizes GAOKAO questions as a dataset to evaluate large language models.
PaL: Program-Aided Language Models (ICML 2023)
Retrieval-Augmented Theorem Provers for Lean
Syntax-Aware Network for Handwritten Mathematical Expression Recognition
Mix of Minimal Optimal Sets (MMOS) of dataset has two advantages for two aspects, higher performance and lower construction costs on math reasoning.
https://albertqjiang.github.io/Portal-to-ISAbelle/
Llemma formal2formal (tactic prediction) theorem proving experiments