Stars
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Easily train a good VC model with voice data <= 10 mins!
Official implementation of "ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis"
Grounded Tracking for Streaming Videos
R&D playground to play with agents and OpenBB
Investment Research for Everyone, Everywhere.
21 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/
🤯 Lobe Chat - an open-source, modern-design AI chat framework. Supports Multi AI Providers (OpenAI / Claude 3 / Gemini / Ollama / Azure / DeepSeek), Knowledge Base (file upload / knowledge manageme…
Official repo for paper "MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions"
A general fine-tuning kit geared toward diffusion models.
Text- and image-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
Official inference repo for FLUX.1 models
Large Language-and-Vision Assistant for Biomedicine, built towards multimodal GPT-4 level capabilities.
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion
PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
GRUtopia: Dream General Robots in a City at Scale
Code for Fast Training of Diffusion Models with Masked Transformers
[ICCV2023] UniVTG: Towards Unified Video-Language Temporal Grounding
Integrating ChatGPT into your browser deeply, everything you need is here
Generate summary of any video 📺 anywhere and anytime
Distributed LLM and StableDiffusion inference for mobile, desktop and server.
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.