Skip to content
View zypan0's full-sized avatar

Block or report zypan0

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Building Open-Ended Embodied Agents with Internet-Scale Knowledge

Java 1,885 169 Updated Mar 18, 2024

Recipes to train reward model for RLHF.

Python 1,181 84 Updated Feb 9, 2025

Simple extension on vLLM to help you speed up reasoning model without training.

Jupyter Notebook 78 9 Updated Feb 19, 2025

A Survey on Large Language Model-Based Game Agents

469 19 Updated Feb 20, 2025

PantheonRL is a package for training and testing multi-agent reinforcement learning environments. PantheonRL supports cross-play, fine-tuning, ad-hoc coordination, and more.

Python 138 21 Updated Nov 6, 2023
Python 1,279 98 Updated Feb 15, 2025

RAGEN is the first open-source reproduction of DeepSeek-R1 on AGENT training.

Python 878 62 Updated Feb 20, 2025

This is a replicate of DeepSeek-R1-Zero and DeepSeek-R1 training on small models with limited data

Python 2,840 212 Updated Feb 19, 2025

📖A curated list of Awesome LLM/VLM Inference Papers with codes: WINT8/4, Flash-Attention, Paged-Attention, Parallelism, etc. 🎉🎉

3,463 236 Updated Feb 19, 2025

A reading list on LLM based Synthetic Data Generation 🔥

1,150 67 Updated Feb 20, 2025

Empowering RAG with a memory-based data interface for all-purpose applications!

Python 1,633 112 Updated Nov 28, 2024

Go ahead and axolotl questions

Python 8,666 960 Updated Feb 21, 2025

GPT4 based personalized ArXiv paper assistant bot

Python 506 133 Updated Mar 26, 2024

Implementation of Nougat Neural Optical Understanding for Academic Documents

Python 9,254 598 Updated Apr 16, 2024

KAG is a logical form-guided reasoning and retrieval framework based on OpenSPG engine and LLMs. It is used to build logical reasoning and factual Q&A solutions for professional domain knowledge ba…

Python 5,315 340 Updated Feb 21, 2025

Finetune Llama 3.3, DeepSeek-R1 & Reasoning LLMs 2x faster with 70% less memory! 🦥

Python 31,061 2,065 Updated Feb 20, 2025

[EMNLP 2024] LongAlign: A Recipe for Long Context Alignment of LLMs

Python 241 18 Updated Dec 16, 2024

A system for agentic LLM-powered data processing and ETL

Python 1,677 151 Updated Feb 19, 2025
GDScript 6 Updated Dec 28, 2024

Functional programming for Python

Python 572 31 Updated Feb 20, 2025

The official implementation of RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval

Python 1,104 155 Updated Sep 3, 2024

ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)

Python 574 44 Updated Jan 20, 2025

A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.

Python 1,296 75 Updated Feb 13, 2025

Critique-out-Loud Reward Models

Python 52 4 Updated Oct 18, 2024

📘 OpenAPI/Swagger-generated API Reference Documentation

TypeScript 24,032 2,320 Updated Feb 12, 2025

东南大学《知识图谱》研究生课程

4,065 1,121 Updated Apr 29, 2024

Access models from OpenAI, Groq, local Ollama, and others by setting llm-router as Cursor's Base URL

Go 176 11 Updated Apr 30, 2024

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

6,473 362 Updated Feb 21, 2025

An automated pipeline for evaluating LLMs for role-playing.

Python 158 8 Updated Sep 14, 2024

A bagel, with everything.

Python 316 31 Updated Apr 11, 2024
Next