Skip to content
View WilliamsToTo's full-sized avatar

Block or report WilliamsToTo

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A curated list of papers & resources linked to data poisoning, backdoor attacks and defenses against them (no longer maintained)

213 19 Updated Jul 19, 2024

High-Performance Symbolic Regression in Python and Julia

Python 2,546 224 Updated Jan 10, 2025

Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…

Python 4,800 267 Updated Jan 9, 2025

A generative world for general-purpose robotics & embodied AI learning.

Python 22,245 1,792 Updated Jan 9, 2025

CoTracker is a model for tracking any point (pixel) on a video.

Jupyter Notebook 4,058 269 Updated Dec 19, 2024

Kolmogorov Arnold Networks

Jupyter Notebook 15,307 1,428 Updated Dec 11, 2024
Jupyter Notebook 1,682 163 Updated Sep 27, 2024

Anole: An Open, Autoregressive and Native Multimodal Models for Interleaved Image-Text Generation

Python 704 36 Updated Aug 5, 2024

Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.

Python 1,126 48 Updated Dec 26, 2024

Let us control diffusion models!

Python 31,157 2,791 Updated Feb 25, 2024

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 1,298 67 Updated Nov 13, 2024

Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.

Python 1,898 112 Updated Jul 29, 2024

High-resolution models for human tasks.

Python 4,732 269 Updated Nov 18, 2024

Official implementation of AnimateDiff.

Python 10,833 881 Updated Jul 31, 2024

[NeurIPS 2024]OmniTokenizer: one model and one weight for image-video joint tokenization.

Python 281 7 Updated Jul 9, 2024

MoVQGAN - model for the image encoding and reconstruction

Jupyter Notebook 211 14 Updated Oct 31, 2023

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Python 1,444 58 Updated Aug 15, 2024

InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models

Python 3,531 384 Updated Jan 3, 2025

Code for 3D-LLM: Injecting the 3D World into Large Language Models

Python 992 59 Updated Jun 6, 2024

Research code for ACL2024 paper: "Synchronized Video Storytelling: Generating Video Narrations with Structured Storyline"

Python 20 5 Updated Dec 27, 2024

[NeurIPS 2023] Scalable 3D Captioning with Pretrained Models

Python 243 14 Updated Apr 25, 2024

VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models

Python 4,645 354 Updated Jul 10, 2024

Official codes of VEnhancer: Generative Space-Time Enhancement for Video Generation

Python 486 27 Updated Sep 16, 2024

Code repository for T2V-Turbo and T2V-Turbo-v2

Python 279 17 Updated Oct 21, 2024

Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models

Python 669 18 Updated Sep 18, 2024

[CVPR2024 Highlight] VBench - We Evaluate Video Generation

Python 675 34 Updated Jan 6, 2025

[IJCV 2024] LaVie: High-Quality Video Generation with Cascaded Latent Diffusion Models

Python 906 59 Updated Nov 13, 2024
Next