A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 13,310 2,728 Updated Mar 12, 2025

comfyanonymous / ComfyUI

The most powerful and modular diffusion model GUI, API, and backend with a graph/nodes interface.

Python 70,649 7,634 Updated Mar 12, 2025

NVIDIA / flownet2-pytorch

PyTorch implementation of FlowNet 2.0: Evolution of Optical Flow Estimation with Deep Networks

Python 3,197 743 Updated May 28, 2023

princeton-vl / RAFT

Python 3,483 644 Updated Dec 5, 2023

NVIDIA / NeMo-Curator

Scalable data preprocessing and curation toolkit for LLMs

Jupyter Notebook 816 112 Updated Mar 11, 2025

clovaai / CRAFT-pytorch

Official implementation of Character Region Awareness for Text Detection (CRAFT)

Python 3,206 915 Updated Jul 16, 2024

MCG-NJU / EMA-VFI

[CVPR 2023] Extracting Motion and Appearance via Inter-Frame Attention for Efficient Video Frame Interpolation

Python 446 44 Updated May 29, 2023

Breakthrough / PySceneDetect

🎥 Python and OpenCV-based scene cut/transition detection program & library.

Python 3,651 422 Updated Mar 10, 2025

Tencent / HunyuanVideo

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 9,194 759 Updated Mar 12, 2025

facebookresearch / sapiens

High-resolution models for human tasks.

Python 4,879 290 Updated Nov 18, 2024

THUDM / CogVideo

Text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 10,940 1,047 Updated Mar 3, 2025

xizaoqu / MOFT

[NeurIPS 2024] Video Diffusion Models are Training-free Motion Interpreter and Controller

Jupyter Notebook 34 1 Updated Feb 15, 2025

geekyutao / Inpaint-Anything

Inpaint anything using Segment Anything and inpainting models.

Jupyter Notebook 6,980 595 Updated Feb 29, 2024

black-forest-labs / flux

Official inference repo for FLUX.1 models

Python 20,746 1,461 Updated Feb 6, 2025

jdh-algo / JoyVASA

Diffusion-based Portrait and Animal Animation

Python 701 64 Updated Mar 5, 2025

31sy / AIParsing

PyTorch code for AIParsing: Anchor-Free Instance-Level Human Parsing

Python 18 2 Updated May 27, 2023

chendatouha / dt_tryon

Python 84 6 Updated May 27, 2024

harlanhong / awesome-talking-head-generation

1,625 118 Updated Feb 8, 2025

facebookresearch / Mask2Former

Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"

Python 2,700 414 Updated Jul 29, 2024

RuoyuFeng / CCEdit

CCEdit: Creative and Controllable Video Editing via Diffusion Models

Python 106 6 Updated Jun 11, 2024

jy0205 / Pyramid-Flow

Code of Pyramidal Flow Matching for Efficient Video Generative Modeling

Python 2,828 280 Updated Dec 21, 2024

Photoroom / fast-foreground-estimation

Official repository for the paper Approximate Fast Foreground Colour Estimation. ICIP 2021.

Jupyter Notebook 69 10 Updated Jul 5, 2023

smartcameras / EdgeFool

PyTorch implementation of EdgeFool: An Adversarial Image Enhancement Filter, ICASSP 2020

Python 27 8 Updated Feb 14, 2020

lOJQKA

Stars

Tencent / HunyuanVideo-I2V

Wan-Video / Wan2.1

ToTheBeginning / PuLID

Open-Magic-Video / Magic-1-For-1

stepfun-ai / Step-Video-T2V

MFaceTech / InstantID

instantX-research / InstantID

NVIDIA / NeMo