Video Generation, Physical Commonsense, Semantic Adherence, VideoCon-Physics

Python 73 5 Updated Oct 11, 2024

A suite of image and video neural tokenizers

Jupyter Notebook 1,535 65 Updated Jan 19, 2025

MoVQGAN - model for the image encoding and reconstruction

Jupyter Notebook 215 15 Updated Oct 31, 2023
Python 88 5 Updated Aug 16, 2024

SERL: A Software Suite for Sample-Efficient Robotic Reinforcement Learning

Python 440 49 Updated Dec 6, 2024

Efficiently apply modification functions to RLDS/TFDS datasets.

Python 10 10 Updated Jun 19, 2024

Paper list in the survey paper: Toward General-Purpose Robots via Foundation Models: A Survey and Meta-Analysis

397 28 Updated Jan 23, 2025
Jupyter Notebook 324 23 Updated Sep 26, 2024

ReKep: Spatio-Temporal Reasoning of Relational Keypoint Constraints for Robotic Manipulation

Python 621 69 Updated Aug 30, 2024

DROID Policy Learning and Evaluation

Python 159 13 Updated Dec 21, 2024

Official PyTorch implementation of TATS: A Long Video Generation Framework with Time-Agnostic VQGAN and Time-Sensitive Transformer (ECCV 2022)

Python 275 17 Updated May 1, 2024

Blend Between Multiple Images in JupyterLab.

Jupyter Notebook 111 11 Updated Dec 5, 2024

A sample html to compare two videos with slider animation using

HTML 4 Updated Jan 27, 2023

Official inference repo for FLUX.1 models

Python 19,898 1,392 Updated Jan 31, 2025

Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA

Python 1,500 75 Updated Sep 25, 2024

A collection of my personal dotfiles

Lua 575 94 Updated Feb 1, 2025

This repo contains the code for 1D tokenizer and generator

Jupyter Notebook 677 35 Updated Jan 25, 2025

Official repository for "AM-RADIO: Reduce All Domains Into One"

Jupyter Notebook 904 37 Updated Jan 21, 2025

The official implementation of Autoregressive Image Generation using Residual Quantization (CVPR '22)

Jupyter Notebook 828 90 Updated Jan 3, 2024

A PyTorch native library for large model training

Python 3,238 260 Updated Feb 3, 2025

OpenVLA: An open-source vision-language-action model for robotic manipulation.

Python 1,803 237 Updated Dec 11, 2024

Deep Fourier Upsampling

Python 67 3 Updated Mar 26, 2024

Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.

Python 3,873 349 Updated Aug 7, 2024

Implementation of CoCa, Contrastive Captioners are Image-Text Foundation Models, in PyTorch

Python 1,100 88 Updated Dec 12, 2023

A minimal, but effective implementation of CLIP (Contrastive Language-Image Pretraining) in PyTorch

Jupyter Notebook 27 5 Updated Feb 14, 2024

Democratization of RT-2 "RT-2: New model translates vision and language into action"

Python 405 60 Updated Jul 26, 2024

[ICRA 2023] A Joint Modeling of Vision-Language-Action for Target-oriented Grasping in Clutter

Python 114 16 Updated May 19, 2024

My implementation of "Patch n’ Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution"

Python 210 10 Updated Jan 27, 2025

MeshNet: Mesh Neural Network for 3D Shape Representation (AAAI 2019)

Python 348 61 Updated Jul 25, 2024

Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model

Python 3,405 343 Updated Nov 3, 2024