Stars
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
Create Customized Software using Natural Language Idea (through LLM-powered Multi-Agent Collaboration)
"LightRAG: Simple and Fast Retrieval-Augmented Generation"
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
An elegant PyTorch deep reinforcement learning library.
BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI workflow, RAG, Agent, Unified model management, Evaluation,…
A UI-Focused Agent for Windows OS Interaction.
All-in-One Development Tool based on PaddlePaddle(飞桨低代码开发工具)
Nexa SDK is a comprehensive toolkit for supporting GGML and ONNX models. It supports text generation, image generation, vision-language models (VLM), Audio Language Model, auto-speech-recognition (…
Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance
This Inventory management system is the currently Ford Asia Pacific after-sales logistics warehousing supply chain process . After I leave Ford , I start this project . You can share your vacant wa…
Recommendation Algorithm大规模推荐算法库,包含推荐系统经典及最新算法LR、Wide&Deep、DSSM、TDM、MIND、Word2Vec、Bert4Rec、DeepWalk、SSR、AITM,DSIN,SIGN,IPREC、GRU4Rec、Youtube_dnn、NCF、GNN、FM、FFM、DeepFM、DCN、DIN、DIEN、DLRM、MMOE、PLE、ESM…
FEDML - The unified and scalable ML library for large-scale distributed training, model serving, and federated learning. FEDML Launch, a cross-cloud scheduler, further enables running any AI jobs o…
Hallo2: Long-Duration and High-Resolution Audio-driven Portrait Image Animation
The next generation deep reinforcement learning tookit
🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.
open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.
DAMO-YOLO: a fast and accurate object detection method with some new techs, including NAS backbones, efficient RepGFPN, ZeroHead, AlignedOTA, and distillation enhancement.
Tool Learning for Big Models, Open-Source Solutions of ChatGPT-Plugins
Align Anything: Training All-modality Model with Feedback
Large-scale, Informative, and Diverse Multi-round Chat Data (and Models)
awesome game security [Welcome to PR]
SDG is a specialized framework designed to generate high-quality structured tabular data.
Accelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval.
Build multimodal language agents for fast prototype and production
Your Automatic Prompt Engineering Assistant for GenAI Applications
LLM based data scientist, AI native data application. AI-driven infinite thinking redefines BI.