🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 34,454 4,171 Updated Aug 16, 2024

Easily train a good VC model with voice data <= 10 mins!

Python 23,630 3,514 Updated Sep 5, 2024
Python 6,108 456 Updated Oct 4, 2024

Official implementation of "ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis"

Python 763 24 Updated Sep 23, 2024

Grounded Tracking for Streaming Videos

Jupyter Notebook 34 3 Updated Aug 15, 2024

R&D playground to play with agents and OpenBB

Python 453 67 Updated Jul 22, 2024

Investment Research for Everyone, Everywhere.

Python 32,099 2,931 Updated Oct 8, 2024

21 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/

Jupyter Notebook 63,204 32,236 Updated Oct 7, 2024

🤯 Lobe Chat - an open-source, modern-design AI chat framework. Supports Multi AI Providers (OpenAI / Claude 3 / Gemini / Ollama / Azure / DeepSeek), Knowledge Base (file upload / knowledge manageme…

TypeScript 42,240 9,538 Updated Oct 9, 2024

Official repo for paper "MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions"

Python 352 9 Updated Sep 2, 2024

A general fine-tuning kit geared toward diffusion models.

Python 1,626 145 Updated Oct 9, 2024
Jupyter Notebook 306 35 Updated Sep 14, 2024

Text- and image-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 7,922 744 Updated Oct 9, 2024

Official inference repo for FLUX.1 models

Python 14,650 1,052 Updated Oct 8, 2024

Large Language-and-Vision Assistant for Biomedicine, built towards multimodal GPT-4 level capabilities.

Python 1,485 184 Updated Aug 13, 2024

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 11,436 980 Updated Oct 8, 2024

📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion

Python 1,220 92 Updated Aug 22, 2024

PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation

Python 1,628 78 Updated Aug 5, 2024

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Python 6,091 540 Updated May 31, 2024

GRUtopia: Dream General Robots in a City at Scale

Python 476 23 Updated Sep 5, 2024
Python 1,440 253 Updated Apr 19, 2024
Python 678 165 Updated Jul 23, 2024

Code for Fast Training of Diffusion Models with Masked Transformers

Python 357 14 Updated May 15, 2024

[ICCV2023] UniVTG: Towards Unified Video-Language Temporal Grounding

Python 315 28 Updated May 8, 2024

Integrating ChatGPT into your browser deeply, everything you need is here

JavaScript 9,921 748 Updated Aug 14, 2024

Generate summary of any video 📺 anywhere and anytime

Python 259 68 Updated Dec 8, 2022

Distributed LLM and StableDiffusion inference for mobile, desktop and server.

Rust 2,520 132 Updated Aug 30, 2024

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything

Jupyter Notebook 14,904 1,381 Updated Sep 5, 2024

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Python 25,463 5,270 Updated Oct 9, 2024
Python 116 3 Updated Jun 23, 2024