Stars
[ICLR 2025] Outlier Synthesis via Hamiltonian Monte Carlo for Out-of-Distribution Detection
Custom Diffusion: Multi-Concept Customization of Text-to-Image Diffusion (CVPR 2023)
Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) with Stable Diffusion
[IJCV] FastComposer: Tuning-Free Multi-Subject Image Generation with Localized Attention
A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.
[NeurIPS 2024] "Can Language Models Perform Robust Reasoning in Chain-of-thought Prompting with Noisy Rationales?"
Real-time voice-interactive digital human, supporting end-to-end voice solutions (GLM-4-Voice - THG) and cascaded solutions (ASR-LLM-TTS-THG). Appearance and voice are customizable with no training required; voice cloning is supported, and first-packet latency is as low as 3s.
Pix2Seq codebase: multi-tasks with generative modeling (autoregressive and diffusion)
A PyTorch implementation of the SHMC algorithm from the paper "Spherical Hamiltonian Monte Carlo for Constrained Target Distributions"
Text- and image-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
aandyw / diffusers
Forked from huggingface/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
Out-of-distribution detection, robustness, and generalization resources. The repository contains a curated list of papers, tutorials, books, videos, articles, open-source libraries, etc.
Official PyTorch implementation for paper: Energy-Based Sliced Wasserstein Distance
An implementation of MALA (Metropolis-adjusted Langevin algorithm), GD, SGD, SGLD (Stochastic Gradient Langevin Dynamics), and ULA (Unadjusted Langevin algorithm) on the RFM (Random Feature Model)
Metropolis-adjusted Langevin algorithm and Hybrid (Hamiltonian) Monte Carlo
Model your data with the Von Mises-Fisher distribution in Python
Sampling with gradient-based Markov Chain Monte Carlo approaches
[NeurIPS 2024] OmniTokenizer: one model and one weight for image-video joint tokenization.
Latte: Latent Diffusion Transformer for Video Generation.
CleanDiffuser: An Easy-to-use Modularized Library for Diffusion Models in Decision Making
[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)
[NeurIPS 2024 Spotlight] Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models
[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model
Benchmarking Generalized Out-of-Distribution Detection
🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.
ICML 2024: Improving Factuality and Reasoning in Language Models through Multiagent Debate