Skip to content
View zypan0's full-sized avatar

Block or report zypan0

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
26 results for source starred repositories written in Python
Clear filter

🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…

Python 58,716 5,974 Updated Aug 24, 2024

Finetune Llama 3.3, DeepSeek-R1 & Reasoning LLMs 2x faster with 70% less memory! 🦥

Python 31,537 2,103 Updated Feb 22, 2025

Implementation of Nougat Neural Optical Understanding for Academic Documents

Python 9,259 599 Updated Feb 21, 2025

Go ahead and axolotl questions

Python 8,683 961 Updated Feb 22, 2025

A framework for few-shot evaluation of language models.

Python 7,897 2,126 Updated Feb 21, 2025

KAG is a logical form-guided reasoning and retrieval framework based on OpenSPG engine and LLMs. It is used to build logical reasoning and factual Q&A solutions for professional domain knowledge ba…

Python 5,375 347 Updated Feb 21, 2025

This is a replicate of DeepSeek-R1-Zero and DeepSeek-R1 training on small models with limited data

Python 2,886 215 Updated Feb 19, 2025

A system for agentic LLM-powered data processing and ETL

Python 1,680 152 Updated Feb 19, 2025

Empowering RAG with a memory-based data interface for all-purpose applications!

Python 1,638 113 Updated Nov 28, 2024

A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.

Python 1,300 75 Updated Feb 21, 2025
Python 1,285 97 Updated Feb 15, 2025

Recipes to train reward model for RLHF.

Python 1,186 84 Updated Feb 9, 2025

The official implementation of RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval

Python 1,106 156 Updated Sep 3, 2024

RAGEN is the first open-source reproduction of DeepSeek-R1 on AGENT training.

Python 890 64 Updated Feb 23, 2025
Python 845 140 Updated Sep 15, 2024

Detect the programming language of a source code

Python 822 118 Updated Mar 4, 2024

ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)

Python 575 44 Updated Jan 20, 2025

Functional programming for Python

Python 574 31 Updated Feb 22, 2025

GPT4 based personalized ArXiv paper assistant bot

Python 506 133 Updated Mar 26, 2024

A bagel, with everything.

Python 316 31 Updated Apr 11, 2024

[EMNLP 2024] LongAlign: A Recipe for Long Context Alignment of LLMs

Python 242 20 Updated Dec 16, 2024

An automated pipeline for evaluating LLMs for role-playing.

Python 158 8 Updated Sep 14, 2024

PantheonRL is a package for training and testing multi-agent reinforcement learning environments. PantheonRL supports cross-play, fine-tuning, ad-hoc coordination, and more.

Python 138 21 Updated Nov 6, 2023

Simple extension on vLLM to help you speed up reasoning model without training.

Python 90 10 Updated Feb 21, 2025

Critique-out-Loud Reward Models

Python 52 4 Updated Oct 18, 2024

文本/图像的恶意检测.

Python 3 1 Updated Feb 29, 2024