Stars
Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.
The official repo of MiniMax-Text-01 and MiniMax-VL-01, large-language-model & vision-language-model based on Linear Attention
AuctionNet: A Novel Benchmark for Decision-Making in Large-Scale Games
HLLM: Enhancing Sequential Recommendations via Hierarchical Large Language Models for Item and User Modeling
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…
[NeurIPS 2024] The implementation of paper "On Softmax Direct Preference Optimization for Recommendation"
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
Repository hosting code for "Actions Speak Louder than Words: Trillion-Parameter Sequential Transducers for Generative Recommendations" (https://arxiv.org/abs/2402.17152).
💬 MaxKB is an open-source AI assistant for enterprise. It seamlessly integrates RAG pipelines, supports robust workflows, and provides MCP tool-use capabilities.
[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ult…
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
💬 An extensive collection of exceptional resources dedicated to the captivating world of talking face synthesis! ⭐ If you find this repo useful, please give it a star! 🤩
Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions
Large World Model -- Modeling Text and Video with Millions Context
Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with …
LlamaIndex is the leading framework for building LLM-powered agents over your data.
Official code of CVPR '23 paper "StyleSync: High-Fidelity Generalized and Personalized Lip Sync in Style-based Generator"
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Train and Infer Powerful Sentence Embeddings with AnglE | 🔥 SOTA on STS and MTEB Leaderboard