Stars
A highly optimized LLM inference acceleration engine for Llama and its variants.
The Next Step Forward in Multimodal LLM Alignment
A GUI Agent application based on UI-TARS(Vision-Language Model) that allows you to control your computer using natural language.
Train your AI self, amplify you, bridge the world
Everything about the SmolLM2 and SmolVLM family of models
Build resilient language agents as graphs.
The only reliable agent framework built on top of the latest OpenAI Assistants API.
🐝 GPTSwarm: LLM agents as (Optimizable) Graphs
Official implementation of AppAgentX: Evolving GUI Agents as Proficient Smartphone Users
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
A live stream development of RL tunning for LLM agents
Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
[ECCV 2024] Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models.
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
Research and development (R&D) is crucial for the enhancement of industrial productivity, especially in the AI era, where the core aspects of R&D are mainly focused on data and models. We are commi…
Learn how to use CUA (our Computer Using Agent) via the API on multiple computer environments.
A lightweight, powerful framework for multi-agent workflows
R1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning
A collection of projects designed to help developers quickly get started with building deployable applications using the Anthropic API
Make websites accessible for AI agents
🚀 KIMI AI 长文本大模型逆向API【特长:长文本解读整理】,支持高速流式输出、智能体对话、联网搜索、探索版、K1思考模型、长文档解读、图像解析、多轮对话,零配置部署,多路token支持,自动清理会话痕迹,仅供测试,如需商用请前往官方开放平台。
MM-EUREKA: Exploring Visual Aha Moment with Rule-based Large-scale Reinforcement Learning
🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation
No fortress, purely open ground. OpenManus is Coming.