View yuyaxiong's full-sized avatar

A highly optimized LLM inference acceleration engine for Llama and its variants.

C++ 885 104 Updated Apr 14, 2025

The Next Step Forward in Multimodal LLM Alignment

Python 145 4 Updated Mar 5, 2025

A collection of MCP servers.

38,772 2,751 Updated Apr 16, 2025

A GUI Agent application based on UI-TARS (Vision-Language Model) that allows you to control your computer using natural language.

TypeScript 11,640 917 Updated Apr 16, 2025

Train your AI self, amplify you, bridge the world

Python 10,785 746 Updated Apr 17, 2025

Everything about the SmolLM2 and SmolVLM family of models

Python 2,187 127 Updated Mar 31, 2025

Build resilient language agents as graphs.

Python 11,591 1,926 Updated Apr 17, 2025

The only reliable agent framework built on top of the latest OpenAI Assistants API.

Python 3,634 923 Updated Apr 14, 2025

🐝 GPTSwarm: LLM agents as (Optimizable) Graphs

Python 826 62 Updated Jan 3, 2025

Official implementation of AppAgentX: Evolving GUI Agents as Proficient Smartphone Users

Python 322 38 Updated Apr 15, 2025

Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.

Python 296,510 49,297 Updated Dec 2, 2024

A live stream development of RL tuning for LLM agents

Python 2,416 323 Updated Apr 16, 2025

Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 9,816 688 Updated Apr 10, 2025

[ECCV 2024] Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models.

Python 1,824 145 Updated Dec 30, 2024

Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Python 7,455 650 Updated Feb 10, 2025

HumanOmni

Python 151 7 Updated Mar 10, 2025
Python 829 48 Updated Mar 24, 2025

Research and development (R&D) is crucial for the enhancement of industrial productivity, especially in the AI era, where the core aspects of R&D are mainly focused on data and models. We are commi…

Python 4,153 353 Updated Apr 17, 2025

Learn how to use CUA (our Computer Using Agent) via the API on multiple computer environments.

Python 756 166 Updated Apr 8, 2025

A lightweight, powerful framework for multi-agent workflows

Python 8,780 1,105 Updated Apr 17, 2025

R1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning

Python 451 29 Updated Mar 23, 2025

A collection of projects designed to help developers quickly get started with building deployable applications using the Anthropic API

TypeScript 8,654 1,475 Updated Apr 6, 2025

Make websites accessible for AI agents

Python 56,324 6,030 Updated Apr 17, 2025

Muon is Scalable for LLM Training

1,023 41 Updated Mar 28, 2025

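🚀 KIMI AI long-context LLM reverse-engineered API [specialty: long-text interpretation and summarization]. Supports high-speed streaming output, agent conversations, web search, the exploration edition, the K1 thinking model, long-document interpretation, image parsing, multi-turn dialogue, zero-configuration deployment, multiple tokens, and automatic cleanup of session traces. For testing only; for commercial use, please go to the official open platform.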

TypeScript 4,435 747 Updated Dec 30, 2024

MM-EUREKA: Exploring Visual Aha Moment with Rule-based Large-scale Reinforcement Learning

Python 548 22 Updated Apr 16, 2025

🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation

Python 15,617 1,845 Updated Apr 15, 2025

No fortress, purely open ground. OpenManus is Coming.

Python 43,455 7,456 Updated Apr 16, 2025