Starred repositories
Automate browser-based workflows with LLMs and Computer Vision
A GUI agent application based on UI-TARS (a vision-language model) that lets you control your computer using natural language.
My ComfyUI workflows collection
A lightweight, powerful framework for multi-agent workflows
Serverless AI Workflows for Data & ML Teams
Make websites accessible for AI agents
Playwright is a framework for Web Testing and Automation. It allows testing Chromium, Firefox and WebKit with a single API.
Python version of the Playwright testing and automation library.
Collection of open-source libraries and tools for Robotic Process Automation (RPA), designed to be used with both Robot Framework and Python
Like Manus, Computer Use Agent (CUA), and Omniparser, we are computer-using agents. An AI-driven local automation assistant that uses natural language to make computers work by themselves.
Desktop app for prototyping and debugging LangGraph applications locally.
🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation
No fortress, purely open ground. OpenManus is Coming.
The most powerful and modular diffusion model GUI, API, and backend with a graph/nodes interface.
Robust Speech Recognition via Large-Scale Weak Supervision
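An AI model interface management and distribution system. It converts multiple large models to a unified calling format, supports OpenAI, Claude, and other formats, and can be used by individuals or within enterprises to manage and distribute channels. This project is a derivative work based on One API. 🍥 The next-generation LLM gateway and AI asset management system supports multiple languages.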
[CVPR 2025] Magma: A Foundation Model for Multimodal AI Agents
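A chatbot built on large language models that integrates with WeChat Official Accounts, WeCom (Enterprise WeChat) apps, Feishu, DingTalk, and others. It offers a choice of GPT4.1/GPT-4o/GPT-o1/DeepSeek/Claude/ERNIE Bot/iFlytek Spark/Tongyi Qianwen/Gemini/GLM-4/Kimi/LinkAI, handles text, voice, and images, can access the operating system and the internet, and supports building customized enterprise customer-service assistants on top of your own knowledge base.
This project is a downstream fork of chatgpt-on-wechat. It additionally integrates the LLMOps platform Dify and supports gewechat, which is more stable than itchat.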
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
[CVPR 2025] EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation