-
IIE, UCAS
- Beijing
- sbwww.github.io
-
Math-Verify Public
Forked from huggingface/Math-VerifyPython Apache License 2.0 UpdatedFeb 6, 2025 -
-
Qwen2.5-Math Public
Forked from QwenLM/Qwen2.5-MathA series of math-specific large language models of our Qwen2 series.
Python UpdatedJan 11, 2025 -
lm-evaluation-harness Public
Forked from EleutherAI/lm-evaluation-harnessA framework for few-shot evaluation of autoregressive language models.
Python MIT License UpdatedJan 6, 2025 -
LLaMA-Factory Public
Forked from hiyouga/LLaMA-FactoryUnified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
Python Apache License 2.0 UpdatedDec 24, 2024 -
OpenRLHF Public
Forked from OpenRLHF/OpenRLHFAn Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention)
Python Apache License 2.0 UpdatedDec 24, 2024 -
-
TransAct-pruning Public
Please refer to https://github.com/XiaoMi/transact-pruning for the code
UpdatedNov 28, 2024 -
-
flash-attention Public
Forked from Dao-AILab/flash-attentionFast and memory-efficient exact attention
Python BSD 3-Clause "New" or "Revised" License UpdatedNov 20, 2024 -
transformers Public
Forked from huggingface/transformers🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Python Apache License 2.0 UpdatedOct 18, 2024 -
ring-flash-attention Public
Forked from zhuzilin/ring-flash-attentionRing attention implementation with flash attention
Python MIT License UpdatedOct 8, 2024 -
GPU-Puzzles Public
Forked from srush/GPU-PuzzlesSolve puzzles. Learn CUDA.
Jupyter Notebook MIT License UpdatedSep 1, 2024 -
quiet-star Public
Forked from ezelikman/quiet-starCode for Quiet-STaR
Python Apache License 2.0 UpdatedAug 21, 2024 -
rebiber Public
Forked from yuchenlin/rebiberA simple tool to update bib entries with their official information (e.g., DBLP or the ACL anthology).
Python MIT License UpdatedAug 18, 2024 -
turndown Public
Forked from mixmark-io/turndown🛏 An HTML to Markdown converter written in JavaScript
HTML MIT License UpdatedJul 30, 2024 -
-
lighteval Public
Forked from huggingface/lightevalLightEval is a lightweight LLM evaluation suite that Hugging Face has been using internally with the recently released LLM data processing library datatrove and LLM training library nanotron.
Python MIT License UpdatedMay 8, 2024 -
tvm Public
Forked from apache/tvmOpen deep learning compiler stack for cpu, gpu and specialized accelerators
Python Apache License 2.0 UpdatedMay 6, 2024 -
mlc-llm Public
Forked from mlc-ai/mlc-llmEnable everyone to develop, optimize and deploy AI models natively on everyone's devices.
Python Apache License 2.0 UpdatedMay 6, 2024 -
img2dataset Public
Forked from rom1504/img2datasetEasily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
Python MIT License UpdatedApr 29, 2024 -
math-evaluation-harness Public
Forked from ZubinGou/math-evaluation-harnessA simple toolkit for benchmarking LLMs on mathematical reasoning tasks. 🧮✨
Python MIT License UpdatedApr 26, 2024 -
-
StableDiffusionOnDevice Public
Forked from XiaoMi/StableDiffusionOnDevice本项目是一个通过文字生成图片的项目,基于开源模型Stable Diffusion V1.5生成可以在手机的CPU和NPU上运行的模型,包括其配套的模型运行框架。
C++ MIT License UpdatedMar 29, 2024 -
Medusa Public
Forked from FasterDecoding/MedusaMedusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads
Jupyter Notebook Apache License 2.0 UpdatedMar 7, 2024 -
LLM-Shearing Public
Forked from princeton-nlp/LLM-ShearingPreprint: Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning
-
trlx Public
Forked from CarperAI/trlxA repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
Python MIT License UpdatedJan 8, 2024 -
Learn-CUDA-Programming Public
Forked from PacktPublishing/Learn-CUDA-ProgrammingLearn CUDA Programming, published by Packt
Cuda MIT License UpdatedDec 30, 2023 -
mlx Public
Forked from ml-explore/mlxMLX: An array framework for Apple silicon
C++ MIT License UpdatedDec 6, 2023 -
ChineseNLPCorpus Public
Forked from InsaneLife/ChineseNLPCorpus中文自然语言处理数据集,平时做做实验的材料。欢迎补充提交合并。
Python UpdatedNov 21, 2023