Skip to content
View Timothyxxx's full-sized avatar
🧑‍💻
struggle with paradox
🧑‍💻
struggle with paradox

Sponsors

@ZeonLap
@ludunjie1219

Organizations

@MetaMind @HKUNLP @xlang-ai @OpenLemur

Block or report Timothyxxx

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

R1-like Computer-use Agent

Python 63 4 Updated Mar 21, 2025

An Open-source RL System from ByteDance Seed and Tsinghua AIR

945 38 Updated Mar 27, 2025

QwQ is the reasoning model series developed by Qwen team, Alibaba Cloud.

Python 406 9 Updated Mar 27, 2025

Command and Conquer: Generals - Zero Hour

C++ 3,916 1,264 Updated Feb 27, 2025

Command and Conquer: Red Alert

C++ 6,106 1,169 Updated Feb 27, 2025

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 1,790 116 Updated Mar 27, 2025

Muon is Scalable for LLM Training

994 39 Updated Mar 28, 2025
Python 84 3 Updated Feb 25, 2025

"Your Fully-Automated Personal AI Assistant, and Open-Source & Cost-Efficient Alternative to OpenAI's Deep Research"

Python 844 112 Updated Feb 23, 2025

Code for Paper: Teaching Language Models to Critique via Reinforcement Learning

Python 85 5 Updated Feb 17, 2025

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 13,495 2,760 Updated Mar 31, 2025

A fork to add multimodal model training to open-r1

Python 1,146 58 Updated Feb 8, 2025

Open-source resources on agents for computer use.

297 22 Updated Jan 26, 2025

s1: Simple test-time scaling

Python 6,091 712 Updated Mar 6, 2025

Witness the aha moment of VLM with less than $3.

Python 3,437 272 Updated Mar 1, 2025

Clean, minimal, accessible reproduction of DeepSeek R1-Zero

Python 11,433 1,445 Updated Mar 10, 2025

Simple RL training for reasoning

Python 3,343 247 Updated Mar 31, 2025

Code and data for OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis

Jupyter Notebook 118 8 Updated Mar 29, 2025

Chat Templates for 🤗 HuggingFace Large Language Models

Jinja 639 58 Updated Dec 13, 2024

Read-only mirror of https://gitlab.gnome.org/GNOME/gimp

C 5,290 721 Updated Mar 31, 2025

This repository contains the code and data for the paper "VisOnlyQA: Large Vision Language Models Still Struggle with Visual Perception of Geometric Information"

Python 22 1 Updated Mar 29, 2025

沙威玛辅助工具 键盘快捷键控制的模拟鼠标操作脚本

Python 32 5 Updated Nov 8, 2024

Scalable RL solution for advanced reasoning of language models

Python 1,448 89 Updated Mar 18, 2025

Open-sourced, Fast and Context-aware Action Grounding from GUI Instructions for GUI/Computer-use Agents

Python 344 33 Updated Feb 8, 2025

"GraphAgent: Agentic Graph Language Assistant"

Jupyter Notebook 292 40 Updated Feb 8, 2025

Code for paper "Improved GUI Grounding via Iterative Narrowing"

Jupyter Notebook 9 Updated Mar 5, 2025

PC Agent: While You Sleep, AI Works - A Cognitive Journey into Digital World

Python 218 19 Updated Dec 25, 2024

Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym

Jupyter Notebook 410 26 Updated Mar 10, 2025
Next
Showing results