
PhotoMaker [CVPR 2024]

Jupyter Notebook 9,727 774 Updated Oct 31, 2024

for Data Science class on Coursera

472 138 Updated Nov 26, 2019

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 36,984 4,582 Updated Aug 16, 2024

Official repo for VideoComposer: Compositional Video Synthesis with Motion Controllability

Python 917 84 Updated Nov 11, 2023
Jupyter Notebook 114 9 Updated Dec 19, 2023

Generative Models by Stability AI

Python 25,108 2,783 Updated Sep 4, 2024
Jupyter Notebook 3,208 302 Updated May 14, 2024

[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding

Python 2,887 265 Updated Jun 4, 2024

Medium Articles Notebooks and Media Files

Jupyter Notebook 14 4 Updated Apr 11, 2024

Easily compute clip embeddings and build a clip retrieval system with them

Jupyter Notebook 2,476 217 Updated Apr 15, 2024

Python package to corrupt arbitrary images.

Python 420 69 Updated Oct 20, 2024

T2I-Adapter

Python 3,562 215 Updated Jun 21, 2024

SoftVC VITS Singing Voice Conversion

Python 26,347 4,894 Updated Nov 11, 2023

Official PyTorch implementation of "MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation" (ICML 2023)

Jupyter Notebook 1,010 60 Updated Sep 21, 2023
Jupyter Notebook 568 90 Updated Oct 18, 2024

A feature-rich command-line audio/video downloader

Python 97,232 7,618 Updated Jan 19, 2025

A machine learning image inpainting task that removes watermarks from images, producing results indistinguishable from the ground-truth image

Python 2,254 333 Updated Aug 16, 2024

Removes watermarks from videos using a watermark subtraction method; fast but imperfect

Python 333 79 Updated Sep 4, 2018

[ICML 2023] Reflected Diffusion Models (https://arxiv.org/abs/2304.04740)

Python 157 6 Updated Oct 19, 2023

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 48,575 5,738 Updated Sep 18, 2024

VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models

Python 4,658 355 Updated Jul 10, 2024

Auto1111 extension implementing text2video diffusion models (like ModelScope or VideoCrafter) using only Auto1111 webui dependencies

Python 1,299 108 Updated Jul 14, 2024

Finetune ModelScope's Text To Video model using Diffusers 🧨

Python 675 107 Updated Dec 14, 2023

Large-scale text-video dataset. 10 million captioned short videos.

Python 617 39 Updated Aug 14, 2024

[ICCV 2023] Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation

Python 4,290 388 Updated Oct 25, 2023

Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval [ICCV'21]

Python 355 43 Updated May 19, 2022

[ICCV 2023 Oral] Text-to-Image Diffusion Models are Zero-Shot Video Generators

Python 4,106 358 Updated May 6, 2023

High-Resolution Image Synthesis with Latent Diffusion Models

Python 39,808 5,116 Updated Oct 10, 2024

An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.

Python 834 158 Updated Oct 10, 2023

Chinese Text-to-Speech web service

Python 309 86 Updated Apr 11, 2021