Stars
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone
Llama中文社区,Llama3在线体验和微调模型已开放,实时汇总最新Llama3学习资料,已将所有代码更新适配Llama3,构建最好的中文Llama大模型,完全开源可商用
"LightRAG: Simple and Fast Retrieval-Augmented Generation"
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI workflow, RAG, Agent, Unified model management, Evaluation,…
The open source platform for AI-native application development.
Nexa SDK is a comprehensive toolkit for supporting GGML and ONNX models. It supports text generation, image generation, vision-language models (VLM), Audio Language Model, auto-speech-recognition (…
Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance
Recommendation Algorithm大规模推荐算法库,包含推荐系统经典及最新算法LR、Wide&Deep、DSSM、TDM、MIND、Word2Vec、Bert4Rec、DeepWalk、SSR、AITM,DSIN,SIGN,IPREC、GRU4Rec、Youtube_dnn、NCF、GNN、FM、FFM、DeepFM、DCN、DIN、DIEN、DLRM、MMOE、PLE、ESM…
This Inventory management system is the currently Ford Asia Pacific after-sales logistics warehousing supply chain process . After I leave Ford , I start this project . You can share your vacant wa…
Hallo2: Long-Duration and High-Resolution Audio-driven Portrait Image Animation
The next generation deep reinforcement learning tookit
Mobile-Agent: The Powerful Mobile Device Operation Assistant Family
🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.
open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.
DAMO-YOLO: a fast and accurate object detection method with some new techs, including NAS backbones, efficient RepGFPN, ZeroHead, AlignedOTA, and distillation enhancement.
⚡LLM Zoo is a project that provides data, models, and evaluation benchmark for large language models.⚡
Tool Learning for Big Models, Open-Source Solutions of ChatGPT-Plugins
Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)
SDG is a specialized framework designed to generate high-quality structured tabular data.
Large-scale, Informative, and Diverse Multi-round Chat Data (and Models)
Your Automatic Prompt Engineering Assistant for GenAI Applications
Accelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval.
Project Page for "LISA: Reasoning Segmentation via Large Language Model"
LLM based data scientist, AI native data application. AI-driven infinite thinking redefines BI.
[ECCV 2024] Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models.