Timothyxxx

🧑‍💻

struggle with paradox

Tianbao Xie Timothyxxx

PhD student at the University of Hong Kong @xlang-ai @HKUNLP. Previously at @HIT-SCIR. Not a typical NLP researcher.

532 followers · 424 following

The University of Hong Kong
Hong Kong, SAR
tianbaoxie.com
@TianbaoX

Highlights

Developer Program Member

Lists (4)

code_base

🔮 Future ideas

14 repositories

✨ Inspiration

mixi food

Stars

FanbinLu / STEVE-R1

R1-like Computer-use Agent

Python 63 4 Updated Mar 21, 2025

BytedTsinghua-SIA / DAPO

An Open-source RL System from ByteDance Seed and Tsinghua AIR

945 38 Updated Mar 27, 2025

QwenLM / QwQ

QwQ is the reasoning model series developed by Qwen team, Alibaba Cloud.

Python 406 9 Updated Mar 27, 2025

electronicarts / CnC_Generals_Zero_Hour

Command and Conquer: Generals - Zero Hour

C++ 3,916 1,264 Updated Feb 27, 2025

electronicarts / CnC_Red_Alert

Command and Conquer: Red Alert

C++ 6,106 1,169 Updated Feb 27, 2025

hiyouga / EasyR1

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 1,790 116 Updated Mar 27, 2025

MoonshotAI / Moonlight

Muon is Scalable for LLM Training

994 39 Updated Mar 28, 2025

DigiRL-agent / digiq

Python 84 3 Updated Feb 25, 2025

HKUDS / Auto-Deep-Research

"Your Fully-Automated Personal AI Assistant, and Open-Source & Cost-Efficient Alternative to OpenAI's Deep Research"

Python 844 112 Updated Feb 23, 2025

HKUNLP / critic-rl

Code for Paper: Teaching Language Models to Critique via Reinforcement Learning

Python 85 5 Updated Feb 17, 2025

NVIDIA / NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 13,495 2,760 Updated Mar 31, 2025

EvolvingLMMs-Lab / open-r1-multimodal

A fork to add multimodal model training to open-r1

Python 1,146 58 Updated Feb 8, 2025

All-Hands-AI / open-operator

Open-source resources on agents for computer use.

297 22 Updated Jan 26, 2025

simplescaling / s1

s1: Simple test-time scaling

Python 6,091 712 Updated Mar 6, 2025

Deep-Agent / R1-V

Witness the aha moment of VLM with less than $3.

Python 3,437 272 Updated Mar 1, 2025

Jiayi-Pan / TinyZero

Clean, minimal, accessible reproduction of DeepSeek R1-Zero

Python 11,433 1,445 Updated Mar 10, 2025

hkust-nlp / simpleRL-reason

Simple RL training for reasoning

Python 3,343 247 Updated Mar 31, 2025

OS-Copilot / OS-Genesis

Code and data for OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis

Jupyter Notebook 118 8 Updated Mar 29, 2025

bytedance / UI-TARS

3,643 239 Updated Feb 17, 2025

MoonshotAI / Kimi-k1.5

3,256 202 Updated Mar 7, 2025

chujiezheng / chat_templates

Chat Templates for 🤗 HuggingFace Large Language Models

Jinja 639 58 Updated Dec 13, 2024

GNOME / gimp

Read-only mirror of https://gitlab.gnome.org/GNOME/gimp

C 5,290 721 Updated Mar 31, 2025

psunlpgroup / VisOnlyQA

This repository contains the code and data for the paper "VisOnlyQA: Large Vision Language Models Still Struggle with Visual Perception of Geometric Information"

Python 22 1 Updated Mar 29, 2025

TaylorAndTony / swm-auto-tool

Shawarma (沙威玛) helper tool: a script that simulates mouse operations, controlled by keyboard shortcuts

Python 32 5 Updated Nov 8, 2024

PRIME-RL / PRIME

Scalable RL solution for advanced reasoning of language models

Python 1,448 89 Updated Mar 18, 2025

AriaUI / Aria-UI

Open-sourced, Fast and Context-aware Action Grounding from GUI Instructions for GUI/Computer-use Agents

Python 344 33 Updated Feb 8, 2025

HKUDS / GraphAgent

"GraphAgent: Agentic Graph Language Assistant"

Jupyter Notebook 292 40 Updated Feb 8, 2025

ant-8 / GUI-Grounding-via-Iterative-Narrowing

Code for paper "Improved GUI Grounding via Iterative Narrowing"

Jupyter Notebook 9 Updated Mar 5, 2025

GAIR-NLP / PC-Agent

PC Agent: While You Sleep, AI Works - A Cognitive Journey into Digital World

Python 218 19 Updated Dec 25, 2024

SWE-Gym / SWE-Gym

Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym

Jupyter Notebook 410 26 Updated Mar 10, 2025