Jinwei Gu jwgu

Senior Research Scientist

39 followers · 10 following

NVIDIA
http://www.gujinwei.org

Stars

Hritikbansal / videophy

Video Generation, Physical Commonsense, Semantic Adherence, VideoCon-Physics

Python · 73 stars · 5 forks · Updated Oct 11, 2024

NVIDIA / Cosmos-Tokenizer

A suite of image and video neural tokenizers

Jupyter Notebook · 1,535 stars · 65 forks · Updated Jan 19, 2025

ai-forever / MoVQGAN

MoVQGAN - model for the image encoding and reconstruction

Jupyter Notebook · 215 stars · 15 forks · Updated Oct 31, 2023

bytedance / IRASim

Python · 88 stars · 5 forks · Updated Aug 16, 2024

rail-berkeley / serl

SERL: A Software Suite for Sample-Efficient Robotic Reinforcement Learning

Python · 440 stars · 49 forks · Updated Dec 6, 2024

moojink / rlds_dataset_mod

Forked from kpertsch/rlds_dataset_mod

Efficiently apply modification functions to RLDS/TFDS datasets.

Python · 10 stars · 10 forks · Updated Jun 19, 2024

JeffreyYH / Awesome-Generalist-Robots-via-Foundation-Models

Paper list in the survey paper: Toward General-Purpose Robots via Foundation Models: A Survey and Meta-Analysis

397 stars · 28 forks · Updated Jan 23, 2025

google-research / rlds

Jupyter Notebook · 324 stars · 23 forks · Updated Sep 26, 2024

huangwl18 / ReKep

ReKep: Spatio-Temporal Reasoning of Relational Keypoint Constraints for Robotic Manipulation

Python · 621 stars · 69 forks · Updated Aug 30, 2024

droid-dataset / droid_policy_learning

DROID Policy Learning and Evaluation

Python · 159 stars · 13 forks · Updated Dec 21, 2024

songweige / TATS

Official PyTorch implementation of TATS: A Long Video Generation Framework with Time-Agnostic VQGAN and Time-Sensitive Transformer (ECCV 2022)

Python · 275 stars · 17 forks · Updated May 1, 2024

Octoframes / jupyter_compare_view

Blend Between Multiple Images in JupyterLab.

Jupyter Notebook · 111 stars · 11 forks · Updated Dec 5, 2024

chenyuntc / video-comparison-slider

A sample html to compare two videos with slider animation using

HTML · 4 stars · Updated Jan 27, 2023

black-forest-labs / flux

Official inference repo for FLUX.1 models

Python · 19,898 stars · 1,392 forks · Updated Jan 31, 2025

dvlab-research / ControlNeXt

Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA

Python · 1,500 stars · 75 forks · Updated Sep 25, 2024

hamvocke / dotfiles

A collection of my personal dotfiles

Lua · 575 stars · 94 forks · Updated Feb 1, 2025

bytedance / 1d-tokenizer

This repo contains the code for 1D tokenizer and generator

Jupyter Notebook · 677 stars · 35 forks · Updated Jan 25, 2025

NVlabs / RADIO

Official repository for "AM-RADIO: Reduce All Domains Into One"

Jupyter Notebook · 904 stars · 37 forks · Updated Jan 21, 2025

kakaobrain / rq-vae-transformer

The official implementation of Autoregressive Image Generation using Residual Quantization (CVPR '22)

Jupyter Notebook · 828 stars · 90 forks · Updated Jan 3, 2024

pytorch / torchtitan

A PyTorch native library for large model training

Python · 3,238 stars · 260 forks · Updated Feb 3, 2025

openvla / openvla

Forked from TRI-ML/prismatic-vlms

OpenVLA: An open-source vision-language-action model for robotic manipulation.

Python · 1,803 stars · 237 forks · Updated Dec 11, 2024

manman1995 / Deep-Fourier-Upsampling

Deep Fourier Upsampling

Python · 67 stars · 3 forks · Updated Mar 26, 2024

rom1504 / img2dataset

Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.

Python · 3,873 stars · 349 forks · Updated Aug 7, 2024

lucidrains / CoCa-pytorch

Implementation of CoCa, Contrastive Captioners are Image-Text Foundation Models, in Pytorch

Python · 1,100 stars · 88 forks · Updated Dec 12, 2023

filipbasara0 / simple-clip

A minimal, but effective implementation of CLIP (Contrastive Language-Image Pretraining) in PyTorch

Jupyter Notebook · 27 stars · 5 forks · Updated Feb 14, 2024

kyegomez / RT-2

Democratization of RT-2 "RT-2: New model translates vision and language into action"

Python · 405 stars · 60 forks · Updated Jul 26, 2024

xukechun / Vision-Language-Grasping

[ICRA 2023] A Joint Modeling of Vision-Language-Action for Target-oriented Grasping in Clutter

Python · 114 stars · 16 forks · Updated May 19, 2024

kyegomez / NaViT

My implementation of "Patch n’ Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution"

Python · 210 stars · 10 forks · Updated Jan 27, 2025

iMoonLab / MeshNet

MeshNet: Mesh Neural Network for 3D Shape Representation (AAAI 2019)

Python · 348 stars · 61 forks · Updated Jul 25, 2024

NExT-GPT / NExT-GPT

Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model

Python · 3,405 stars · 343 forks · Updated Nov 3, 2024
