ZhaoyangLiu-Leo

Zhaoyang Liu ZhaoyangLiu-Leo

Works at Alibaba Group, focusing on LLMs, Quant, recommender systems, and causal inference.

31 followers · 33 following

Alibaba
Shanghai

Starred repositories

modelcontextprotocol / python-sdk

The official Python SDK for Model Context Protocol servers and clients

Python 3,458 323 Updated Mar 12, 2025

microsoft / OmniParser

A simple screen-parsing tool toward a pure vision-based GUI agent

Jupyter Notebook 20,151 1,637 Updated Mar 13, 2025

SsmallSong / R1-Searcher

R1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning

Python 270 18 Updated Mar 10, 2025

Open-Reasoner-Zero / Open-Reasoner-Zero

Official Repo for Open-Reasoner-Zero

Python 1,585 74 Updated Mar 5, 2025

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 4,667 446 Updated Mar 13, 2025

openpsi-project / ReaLHF

Super-Efficient RLHF Training of LLMs with Parameter Reallocation

Python 236 14 Updated Jan 13, 2025

THUDM / T1

Advancing Language Model Reasoning through Reinforcement Learning and Inference Scaling

89 Updated Jan 23, 2025

zaidmukaddam / scira

Scira (Formerly MiniPerplx) is a minimalistic AI-powered search engine that helps you find information on the internet. Powered by Vercel AI SDK! Search with models like Grok 2.0.

TypeScript 7,269 851 Updated Mar 8, 2025

2XUID / Deepseek-goes-from-beginner-to-master.PDF

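PDF version of the widely circulated "Deepseek: From Beginner to Master", from the Metaverse Culture Lab, New Media Research Center, School of Journalism and Communication, Tsinghua University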

348 134 Updated Feb 14, 2025

opendilab / awesome-RLHF

A curated list of reinforcement learning with human feedback resources (continually updated)

3,795 232 Updated Feb 19, 2025

GaryYufei / AlignLLMHumanSurvey

Aligning Large Language Models with Human: A Survey

724 32 Updated Sep 11, 2023

thunlp / UltraChat

Large-scale, Informative, and Diverse Multi-round Chat Data (and Models)

Python 2,474 123 Updated Mar 13, 2024

karpathy / llm.c

LLM training in simple, raw C/CUDA

Cuda 26,005 2,982 Updated Oct 2, 2024

eddycmu / demystify-long-cot

Python 243 16 Updated Feb 6, 2025

allenai / open-instruct

AllenAI's post-training codebase

Python 2,789 358 Updated Mar 12, 2025

jina-ai / node-DeepResearch

Keeps searching, reading webpages, and reasoning until it finds the answer (or exceeds the token budget)

TypeScript 3,392 314 Updated Mar 13, 2025

hkust-nlp / simpleRL-reason

A replication of DeepSeek-R1-Zero and DeepSeek-R1 training on small models with limited data

Python 3,141 234 Updated Feb 19, 2025

huggingface / open-r1

Fully open reproduction of DeepSeek-R1

Python 22,693 2,038 Updated Mar 13, 2025

GAIR-NLP / auto-j

Generative Judge for Evaluating Alignment

Python 230 15 Updated Jan 18, 2024

allenai / reward-bench

RewardBench: the first evaluation tool for reward models.

Python 521 61 Updated Feb 27, 2025

Ironclad / rivet

The open-source visual AI programming environment and TypeScript library

TypeScript 3,581 296 Updated Mar 10, 2025

MiniMax-AI / MiniMax-01

Python 2,331 165 Updated Mar 6, 2025

thu-coai / BPO

Python 308 16 Updated Jun 24, 2024

Jiayi-Pan / TinyZero

Clean, minimal, accessible reproduction of DeepSeek R1-Zero

Python 11,140 1,416 Updated Mar 10, 2025

pingcap / autoflow

pingcap/autoflow is a Graph RAG based and conversational knowledge base tool built with TiDB Serverless Vector Storage. Demo: https://tidb.ai

TypeScript 2,440 137 Updated Mar 12, 2025

MoonshotAI / Kimi-k1.5

3,205 192 Updated Mar 7, 2025

yongchao98 / PROMST

Automatic prompt optimization framework for multi-step agent tasks.

PDDL 28 2 Updated Nov 12, 2024

OpenSPG / KAG

KAG is a logical form-guided reasoning and retrieval framework based on OpenSPG engine and LLMs. It is used to build logical reasoning and factual Q&A solutions for professional domain knowledge ba…

Python 5,949 398 Updated Mar 13, 2025

bklieger-groq / g1

g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains

Python 4,195 377 Updated Jan 27, 2025

sunnynexus / Search-o1

Search-o1: Agentic Search-Enhanced Large Reasoning Models

Python 702 77 Updated Mar 4, 2025
