Skip to content
View zhengsipeng's full-sized avatar

Block or report zhengsipeng

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Hand-object interaction Pretraining From Videos

Python 70 6 Updated Oct 28, 2024

This is the official release for the paper "EFM3D A Benchmark for Measuring Progress Towards 3D Egocentric Foundation Models" (https//arxiv.org/abs/2406.10224).

Python 128 8 Updated Jan 16, 2025

[3DV2025] SPAFormer: Sequential 3D Part Assembly with Transformers

Python 9 2 Updated Mar 13, 2024
Python 425 32 Updated Jan 20, 2025

Official implementation of "SUGAR: Pre-training 3D Visual Representations for Robotics" (CVPR'24).

Python 33 1 Updated Jul 15, 2024

Official Code for "SMPLer-X: Scaling Up Expressive Human Pose and Shape Estimation"

Python 1,051 77 Updated Oct 13, 2024

Bimanual Dexterous Teleoperation with Real-Time Retargeting using VisionPro

Python 238 15 Updated Sep 18, 2024
Python 1,784 55 Updated Jun 28, 2024

Paper repo for publication: "Steve-Eye: Equiping LLM-based Embodied Agents with Visual Perception in Open Worlds".

7 Updated Dec 7, 2023

Mobile manipulation research tools for roboticists

Python 981 133 Updated Jun 8, 2024
Python 59 6 Updated Jul 13, 2024
Python 232 13 Updated Aug 18, 2022

Repository to identify Lego bricks automatically only using images

Python 100 23 Updated Apr 30, 2021

Vector (and Scalar) Quantization, in Pytorch

Python 2,860 232 Updated Jan 10, 2025

Fast, Accurate, Lightweight Python library to make State of the Art Embedding

Python 1,704 118 Updated Jan 28, 2025

[NeurIPS'23 Spotlight] "Mind2Web: Towards a Generalist Agent for the Web"

Jupyter Notebook 765 105 Updated Jul 30, 2024

Official JAX implementation of MAGVIT: Masked Generative Video Transformer

Python 970 43 Updated Jan 17, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 35,147 5,347 Updated Jan 28, 2025

The RedPajama-Data repository contains code for preparing large datasets for training large language models.

Python 4,625 349 Updated Dec 7, 2024

3D LEGO models and mosaics from images using R and #tidyverse

R 424 57 Updated Nov 27, 2023

A modular RL library to fine-tune language models to human preferences

Python 2,261 192 Updated Mar 1, 2024

Making large AI models cheaper, faster and more accessible

Python 39,031 4,357 Updated Jan 24, 2025
Python 2 Updated Jun 4, 2023
Python 1 Updated Dec 7, 2019

mPLUG-Owl: The Powerful Multi-modal Large Language Model Family

Python 2,400 178 Updated Jan 23, 2025

An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.

Python 8,336 833 Updated Jan 26, 2025
Python 15 4 Updated Jun 4, 2023

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything

Jupyter Notebook 15,634 1,442 Updated Sep 5, 2024

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 36,394 4,211 Updated Jan 28, 2025
Next