Stars
DeepEP: an efficient expert-parallel communication library
Official codebase for "Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling".
Multilingual large voice generation model, providing full-stack inference, training, and deployment capabilities.
Qwen2.5 is the large language model series developed by the Qwen team, Alibaba Cloud.
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
Clean, minimal, accessible reproduction of DeepSeek R1-Zero
Janus-Series: Unified Multimodal Understanding and Generation Models
Fully open reproduction of DeepSeek-R1
LLM powered retrieval engine designed to process a ton of sources to collect a comprehensive list of entities.
Large Concept Models: Language modeling in a sentence representation space
Prompt Engineering | Prompt Versioning | Use GPT or other prompt-based models to get structured output. Join our Discord for prompt engineering, LLMs, and other recent research.
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by the OpenAI Solution team.
Use LLMs to dig out what you care about from massive amounts of information and a variety of sources daily.
MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone
InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥
Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation
Python-based web automation tool. Powerful and elegant.
Official Repository for "Eureka: Human-Level Reward Design via Coding Large Language Models" (ICLR 2024)
Model for the MDX23 music separation contest
GUI for a Vocal Remover that uses Deep Neural Networks.
Code for the paper Hybrid Spectrogram and Waveform Source Separation