zypan0

Stars

MineDojo / MineDojo

Building Open-Ended Embodied Agents with Internet-Scale Knowledge

Java 1,885 169 Updated Mar 18, 2024

RLHFlow / RLHF-Reward-Modeling

Recipes to train reward models for RLHF.

Python 1,181 84 Updated Feb 9, 2025

hao-ai-lab / Dynasor

A simple extension of vLLM that helps you speed up reasoning models without training.

Jupyter Notebook 78 9 Updated Feb 19, 2025

git-disl / awesome-LLM-game-agent-papers

A Survey on Large Language Model-Based Game Agents

469 19 Updated Feb 20, 2025

Stanford-ILIAD / PantheonRL

PantheonRL is a package for training and testing multi-agent reinforcement learning environments. PantheonRL supports cross-play, fine-tuning, ad-hoc coordination, and more.

Python 138 21 Updated Nov 6, 2023

Om-Alve / smolGPT

Python 1,279 98 Updated Feb 15, 2025

ZihanWang314 / RAGEN

RAGEN is the first open-source reproduction of DeepSeek-R1 on AGENT training.

Python 878 62 Updated Feb 20, 2025

hkust-nlp / simpleRL-reason

This is a replication of DeepSeek-R1-Zero and DeepSeek-R1 training on small models with limited data.

Python 2,840 212 Updated Feb 19, 2025

DefTruth / Awesome-LLM-Inference

📖 A curated list of Awesome LLM/VLM Inference Papers with code: WINT8/4, Flash-Attention, Paged-Attention, Parallelism, etc. 🎉🎉

3,463 236 Updated Feb 19, 2025

wasiahmad / Awesome-LLM-Synthetic-Data

A reading list on LLM-based Synthetic Data Generation 🔥

1,150 67 Updated Feb 20, 2025

qhjqhj00 / MemoRAG

Empowering RAG with a memory-based data interface for all-purpose applications!

Python 1,633 112 Updated Nov 28, 2024

axolotl-ai-cloud / axolotl

Go ahead and axolotl questions

Python 8,666 960 Updated Feb 21, 2025

tatsu-lab / gpt_paper_assistant

GPT-4-based personalized arXiv paper assistant bot

Python 506 133 Updated Mar 26, 2024

facebookresearch / nougat

Implementation of Nougat Neural Optical Understanding for Academic Documents

Python 9,254 598 Updated Apr 16, 2024

OpenSPG / KAG

KAG is a logical form-guided reasoning and retrieval framework based on OpenSPG engine and LLMs. It is used to build logical reasoning and factual Q&A solutions for professional domain knowledge ba…

Python 5,315 340 Updated Feb 21, 2025