Lists (23)
Sort Name ascending (A-Z)
3d
3D(Nerf)
ai conferences
CLIP
corresponding learning
cs61.c
riscv learningdiffusion model
DINO
efficient models
Foudation Models
human image translation
image inpainting
✨ Inspiration
non-autoregression
pose transfer
human pose/Motion transfer collectionsUseful Tools
vq-related
vqvae,vqgan系列论文VTON
virtual try on图像生成/合成/迁移
常用网络
常用网络组件或者网络结构有用的仓库
有用的工具
视觉工具
human parsing, keypoints estimation, etc.Starred repositories
Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…
A generative world for general-purpose robotics & embodied AI learning.
Code for the the ICLR 2024 paper "∞-Diff: Infinite Resolution Diffusion with Subsampled Mollified States"
Equilibrated Diffusion: Frequency-aware Textual Embedding for EquilibratedImage Customization. https://maple-aigc.github.io/EqDiff/
[NeurIPS 2024] Motion Consistency Model: Accelerating Video Diffusion with Disentangled Motion-Appearance Distillation
A project page template for academic papers. Demo at https://eliahuhorwitz.github.io/Academic-project-page-template/
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
Enjoy the magic of Diffusion models!
🎁 5,400,000+ Unsplash images made available for research and machine learning
[NeurIPS 2024] CV-VAE: A Compatible Video VAE for Latent Generative Video Models
Latte: Latent Diffusion Transformer for Video Generation.
Stable Video Diffusion Training Code and Extensions.
[CSUR] A Survey on Video Diffusion Models
Lumina-T2X is a unified framework for Text to Any Modality Generation
An Open-source Toolkit for LLM Development
Repository of notes, code and notebooks in Python for the book Pattern Recognition and Machine Learning by Christopher Bishop
A simple tool to update bib entries with their official information (e.g., DBLP or the ACL anthology).
Open-Sora: Democratizing Efficient Video Production for All
Official PyTorch implementation of LongVideoGAN
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
Official Pytorch Implementation for "Space-Time Diffusion Features for Zero-Shot Text-Driven Motion Transfer""
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Generative Models by Stability AI
Open source implementation of "Vision Transformers Need Registers"