Nanjing University
Stars
Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models
WebGazer.js: Scalable Webcam EyeTracking Using User Interactions
[CVPR2025] We present StableAnimator, the first end-to-end ID-preserving video diffusion framework, which synthesizes high-quality videos without any post-processing, conditioned on a reference ima…
[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation
High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance
Official implementation of "MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling"
[AAAI 2024] Follow-Your-Pose: This repo is the official implementation of "Follow-Your-Pose: Pose-Guided Text-to-Video Generation using Pose-Free Videos"
💥 Blazing fast terminal file manager written in Rust, based on async I/O.
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
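Welcome to LLM-Dojo, an open-source learning ground for large language models. It uses concise, readable code to build a model training framework (supporting mainstream models such as Qwen, Llama, and GLM), an RLHF framework (DPO/CPO/KTO/PPO), and other features. 👩🎓👨🎓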
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
Access latex source of any arxiv.org paper directly on overleaf
5D Diplomacy With Multiverse Time Travel
This repository is dedicated to Track 2 of the W-CODA 2024 Workshop, "Multimodal Perception and Comprehension of Corner Cases in Autonomous Driving," held at ECCV 2024.
Official implementation of Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse Controls to Any Diffusion Model (ICLR 2025 Oral)
[ICCV 2023] StreamPETR: Exploring Object-Centric Temporal Modeling for Efficient Multi-View 3D Object Detection
[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
Official code of *DOME: Taming Diffusion Model into High-Fidelity Controllable Occupancy World Model*
Comparison between Frechet Video Distance implementation from StyleGAN-V and the original paper
OccSora: 4D Occupancy Generation Models as World Simulators for Autonomous Driving
[arXiv 2025] Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation Control
Official PyTorch implementation of SegFormer
Eagle Family: Exploring Model Designs, Data Recipes and Training Strategies for Frontier-Class Multimodal LLMs
[NeurIPS 2024] SCube: Instant Large-Scale Scene Reconstruction using VoxSplats
PyTorch3D is FAIR's library of reusable components for deep learning with 3D data
[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ult…