Stars
HunyuanVideo-I2V: A Customizable Image-to-Video Model based on HunyuanVideo
Wan: Open and Advanced Large-Scale Video Generative Models
[NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment
InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Pytorch implementation of FlowNet 2.0: Evolution of Optical Flow Estimation with Deep Networks
Scalable data pre processing and curation toolkit for LLMs
Official implementation of Character Region Awareness for Text Detection (CRAFT)
[CVPR 2023] Extracting Motion and Appearance via Inter-Frame Attention for Efficient Video Frame Interpolatio
🎥 Python and OpenCV-based scene cut/transition detection program & library.
HunyuanVideo: A Systematic Framework For Large Video Generation Model
High-resolution models for human tasks.
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
[Neurips 2024] Video Diffusion Models are Training-free Motion Interpreter and Controller
Inpaint anything using Segment Anything and inpainting models.
Official inference repo for FLUX.1 models
The pytorch code of AIParsing: Anchor-Free Instance-Level Human Parsing
Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"
CCEdit: Creative and Controllable Video Editing via Diffusion Models
Code of Pyramidal Flow Matching for Efficient Video Generative Modeling
Official repository for the paper Approximate Fast Foreground Colour Estimation. ICIP 2021.
PyTorch implementation of EdgeFool: An Adversarial Image Enhancement Filter, ICASSP2020