Stars
MARS5 speech model (TTS) from CAMB.AI
Python Audio Separator in Real Time using MDX-NET model
Real-time image and video processing library similar to GPUImage, with built-in beauty filters, achieving commercial-grade beauty effects. Written in C++11 and based on OpenGL/ES.
[NeurIPS 24] CV-VAE: A Compatible Video VAE for Latent Generative Video Models
Official implementation for ACM MM 2023 paper '360-Degree Panorama Generation from Few Unregistered NFoV Images'
A repository for storing models that have been inter-converted between various frameworks. Supported frameworks are TensorFlow, PyTorch, ONNX, OpenVINO, TFJS, TFTRT, TensorFlowLite (Float32/16/INT8…
(Siggraph Asia 2023) Official code of "HyperDreamer: Hyper-Realistic 3D Content Generation and Editing from a Single Image"
[ECCV2024] API code for T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy
[ECCV'22] Official PyTorch Implementation of "Cross-Attention of Disentangled Modalities for 3D Human Mesh Recovery with Transformers"
[NeurIPS 2023] PGDiff: Guiding Diffusion Models for Versatile Face Restoration via Partial Guidance
Code for Few-Shot Head Swapping in the Wild (CVPR 2022 Oral)
podgorskiy / dnnlib
Forked from NVlabs/stylegandnnlib extracted from official StyleGAN implementation
A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.
Unofficial PyTorch implementation of the Try-On algorithm (tryondiffusion, sd+cn, PIDM).
Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".
Prompt-Free Diffusion: Taking "Text" out of Text-to-Image Diffusion Models, arxiv 2023 / CVPR 2024
Official implementation of our CVPR2023 paper "A Unified Pyramid Recurrent Network for Video Frame Interpolation"
Official code for "AMT: All-Pairs Multi-Field Transforms for Efficient Frame Interpolation" (CVPR2023)
[CVPR 2023] Extracting Motion and Appearance via Inter-Frame Attention for Efficient Video Frame Interpolatio
A Collection of Papers and Codes in CVPR2023/2022 about low level vision
The most easy-to-understand tutorial for using LoRA (Low-Rank Adaptation) within diffusers framework for AI Generation Researchers🔥
Agent techniques to augment your LLM and push it beyong its limits
Evaluation tool for LLM QA chains