Stars
A collection of CS PhD application fee waivers for schools in North America
A Python library for self-supervised learning on images.
Open source AI/ML capabilities for the FiftyOne ecosystem
Refine high-quality datasets and visual AI models
Text-to-image generation: CogView3-Plus and CogView3 (ECCV 2024)
Official GitHub repo for the NeurIPS 2024 paper Immiscible Diffusion: Accelerating Diffusion Training with Noise Assignment
✨✨ Latest Advances on Multimodal Large Language Models
A collection of papers tracing the ongoing line of work that started with World Models.
Latte: Latent Diffusion Transformer for Video Generation.
A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.
LVBench: An Extreme Long Video Understanding Benchmark
🗄️ Solutions to Database System Concepts Seventh Edition
A high-resolution face dataset for face editing
Official implementation of Inf-DiT: Upsampling Any-Resolution Image with Memory-Efficient Diffusion Transformer
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
A latent text-to-image diffusion model
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and Flax.
SDXL-based ControlNet implementation
CLIP (Contrastive Language-Image Pretraining): predict the most relevant text snippet given an image