Stars
🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans (cyclegan, stylegan2, ...), reinforcement learning (ppo, dqn), and more
Finetune Llama 3.3, DeepSeek-R1 & Reasoning LLMs 2x faster with 70% less memory! 🦥
Implementation of Nougat: Neural Optical Understanding for Academic Documents
A framework for few-shot evaluation of language models. (A usage sketch follows this list.)
KAG is a logical form-guided reasoning and retrieval framework based on the OpenSPG engine and LLMs. It is used to build logical reasoning and factual Q&A solutions for professional domain knowledge bases.
A replication of DeepSeek-R1-Zero and DeepSeek-R1 training on small models with limited data
A system for agentic LLM-powered data processing and ETL
Empowering RAG with a memory-based data interface for all-purpose applications!
A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models. (A usage sketch follows this list.)
Recipes to train reward models for RLHF.
The official implementation of RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval
RAGEN is the first open-source reproduction of DeepSeek-R1 for training LLM agents.
Detect the programming language of source code
ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)
GPT-4-based personalized arXiv paper assistant bot
[EMNLP 2024] LongAlign: A Recipe for Long Context Alignment of LLMs
An automated pipeline for evaluating the role-playing abilities of LLMs.
PantheonRL is a package for training and testing multi-agent reinforcement learning environments; it supports cross-play, fine-tuning, ad-hoc coordination, and more.
A simple extension on top of vLLM to speed up reasoning models without training.
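
The few-shot evaluation entry above appears to describe EleutherAI's lm-evaluation-harness. A minimal sketch of a single-task run, assuming the package's `simple_evaluate` entry point; the checkpoint and task names are illustrative placeholders, not a definitive invocation.

```python
# Hedged sketch, not a definitive invocation: evaluate one checkpoint on one
# task via lm-evaluation-harness's Python entry point.
import lm_eval

results = lm_eval.simple_evaluate(
    model="hf",                                      # Hugging Face model backend
    model_args="pretrained=EleutherAI/pythia-160m",  # placeholder checkpoint
    tasks=["hellaswag"],                             # placeholder benchmark task
    num_fewshot=5,                                   # in-context examples per prompt
)
print(results["results"])                            # per-task metric dictionary
```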
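The unified reranking entry above matches the `rerankers` package. A minimal sketch of cross-encoder reranking, with the class and method names taken from that package's README; the model name and result attributes are assumptions, not a definitive API reference.

```python
# Hedged sketch of a unified reranking API: score candidate documents against
# a query with a cross-encoder model, then read them back best-first.
from rerankers import Reranker

ranker = Reranker("cross-encoder/ms-marco-MiniLM-L-6-v2", model_type="cross-encoder")

results = ranker.rank(
    query="How can I speed up LLM inference?",
    docs=[
        "vLLM increases throughput with paged attention.",
        "A recipe for sourdough bread.",
        "Speculative decoding accelerates autoregressive generation.",
    ],
)
for r in results.top_k(2):           # highest-scoring documents first (assumed ordering)
    print(r.score, r.document.text)  # attribute names assumed from the README
```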