Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 8,836 686 Updated Mar 3, 2025

IAHispano / Applio

A simple, high-quality voice conversion tool focused on ease of use and performance.

Python 2,207 358 Updated Mar 23, 2025

rlawjdghek / StableVITON

[CVPR2024] StableVITON: Learning Semantic Correspondence with Latent Diffusion Model for Virtual Try-On

Python 1,123 181 Updated Jan 20, 2025

levihsu / OOTDiffusion

[AAAI 2025] Official implementation of "OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on"

Python 6,165 884 Updated May 13, 2024

bytedance / res-adapter

[AAAI 2025] Official codes of "ResAdapter: Domain Consistent Resolution Adapter for Diffusion Models".

Python 740 25 Updated Mar 9, 2025

mickasmt / next-mobbin-clone

Replicate the design of mobbin.com to test the new components of shadcn-ui. #builtinpublic

TypeScript 101 11 Updated Feb 13, 2024

genforce / freecontrol

Official implementation of CVPR 2024 paper: "FreeControl: Training-Free Spatial Control of Any Text-to-Image Diffusion Model with Any Condition"

Python 463 15 Updated Oct 21, 2024

cvlab-kaist / DreamMatcher

Official implementation of "DreamMatcher: Appearance Matching Self-Attention for Semantically-Consistent Text-to-Image Personalization" (CVPR 2024)

Python 166 6 Updated Feb 27, 2024

Haoming02 / comfyui-diffusion-cg

Custom Nodes for ComfyUI that perform color grading based on the latent tensor value range

Python 94 11 Updated Oct 12, 2024

Bing-su / adetailer

Auto detecting, masking and inpainting with detection model.

Python 4,397 337 Updated Mar 17, 2025

ZHO-ZHO-ZHO / ComfyUI-BRIA_AI-RMBG

Unofficial implementation of BRIA RMBG Model for ComfyUI

Python 774 61 Updated May 22, 2024

metavoiceio / metavoice-src

Foundational model for human-like, expressive TTS

Python 4,074 683 Updated Jul 30, 2024

lucia-auth / lucia

Authentication, simple and clean

10,012 518 Updated Feb 23, 2025

RVC-Boss / GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 42,990 4,792 Updated Mar 5, 2025

streamich / react-use

React Hooks — 👍

TypeScript 42,721 3,213 Updated Mar 19, 2025

salesforce / LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

Jupyter Notebook 10,376 1,010 Updated Nov 18, 2024

ali-vilab / AnyDoor

Official implementations for paper: Anydoor: zero-shot object-level image customization

Python 4,119 367 Updated Apr 8, 2024

Steralys

Lists (1)

🔮 Future ideas

Stars

fudan-generative-vision / hallo2

SWivid / F5-TTS

withastro / astro

shadcn-ui / ui

birobirobiro / awesome-shadcn-ui

nanostores / nanostores

fudan-generative-vision / hallo

warmshao / FasterLivePortrait

KwaiVGI / LivePortrait

FusionBrainLab / HairFastGAN

TencentARC / CustomNet

fudan-generative-vision / champ

TIGER-AI-Lab / AnyV2V

open-mmlab / Amphion