Stars
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
Create Customized Software using Natural Language Idea (through LLM-powered Multi-Agent Collaboration)
"LightRAG: Simple and Fast Retrieval-Augmented Generation"
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI workflow, RAG, Agent, Unified model management, Evaluation,…
A UI-Focused Agent for Windows OS Interaction.
The open source platform for AI-native application development.
Nexa SDK is a comprehensive toolkit for supporting GGML and ONNX models. It supports text generation, image generation, vision-language models (VLM), Audio Language Model, auto-speech-recognition (…
Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance
Hallo2: Long-Duration and High-Resolution Audio-driven Portrait Image Animation
The next generation deep reinforcement learning toolkit
Mobile-Agent: The Powerful Mobile Device Operation Assistant Family
An open-source multimodal large language model that can hear and talk while thinking, featuring real-time end-to-end speech input and streaming audio output conversational capabilities.
SDG is a specialized framework designed to generate high-quality structured tabular data.
Your Automatic Prompt Engineering Assistant for GenAI Applications
Accelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval.
LLM-based data scientist, AI-native data application. AI-driven infinite thinking redefines BI.
[ECCV 2024] Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models.
Application self-hosting and DevOps platform for running open-source software; a web-based Linux panel and lite PaaS
Towards Open-source GPT-4o with Vision, Speech and Duplex Capabilities.
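airda (Air Data Agent) is a multi-agent system for data analysis that understands data development and data analysis requirements, understands the data, and generates SQL and Python code for tasks such as data querying, data visualization, and machine learning.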
D-FINE: Redefine Regression Task of DETRs as Fine-grained Distribution Refinement 💥💥💥
Build multimodal language agents for fast prototype and production
OMG-LLaVA and OMG-Seg codebase [CVPR-24 and NeurIPS-24]
[ICLR 2024] Official implementation of "TimeMixer: Decomposable Multiscale Mixing for Time Series Forecasting"
Real-time and accurate open-vocabulary end-to-end object detection
An intelligent assistant serving the entire software development lifecycle, powered by a Multi-Agent Framework, working with DevOps Toolkits, Code&Doc Repo RAG, etc.