- The University of Hong Kong
- Hong Kong, SAR
- tianbaoxie.com
- @TianbaoX
Stars
An Open-source RL System from ByteDance Seed and Tsinghua AIR
QwQ is the reasoning model series developed by Qwen team, Alibaba Cloud.
Command and Conquer: Generals - Zero Hour
EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL
"Your Fully-Automated Personal AI Assistant, and Open-Source & Cost-Efficient Alternative to OpenAI's Deep Research"
Code for Paper: Teaching Language Models to Critique via Reinforcement Learning
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
A fork to add multimodal model training to open-r1
Open-source resources on agents for computer use.
Witness the aha moment of VLM with less than $3.
Clean, minimal, accessible reproduction of DeepSeek R1-Zero
Code and data for OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis
Chat Templates for 🤗 HuggingFace Large Language Models
Read-only mirror of https://gitlab.gnome.org/GNOME/gimp
This repository contains the code and data for the paper "VisOnlyQA: Large Vision Language Models Still Struggle with Visual Perception of Geometric Information"
Scalable RL solution for advanced reasoning of language models
Open-sourced, Fast and Context-aware Action Grounding from GUI Instructions for GUI/Computer-use Agents
"GraphAgent: Agentic Graph Language Assistant"
Code for paper "Improved GUI Grounding via Iterative Narrowing"
PC Agent: While You Sleep, AI Works - A Cognitive Journey into Digital World
Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym