zc2023
🏡
Work from home

Highlights

  • Pro

[NeurIPS 2024] A Unified Framework for 3D Scene Understanding

Python 127 3 Updated Nov 28, 2024

[CVPR2024] OneFormer3D: One Transformer for Unified Point Cloud Segmentation

Python 384 34 Updated Oct 23, 2024

Code&Data for Grounded 3D-LLM with Referent Tokens

Python 98 2 Updated Jan 5, 2025

A PyTorch implementation of the Transformer model in "Attention is All You Need".

Python 8,987 1,993 Updated Apr 16, 2024

Build your own translator from Chinese to English with a seq2seq model in PyTorch😄

Python 3 Updated Apr 4, 2021

A Gradio web UI for Large Language Models with support for multiple inference backends.

Python 41,748 5,436 Updated Jan 23, 2025

A modern Bot protocol-side implementation based on NTQQ

TypeScript 3,117 225 Updated Jan 22, 2025

Official implementation of the paper "Unifying 3D Vision-Language Understanding via Promptable Queries"

Python 59 6 Updated Aug 2, 2024

Mask3D predicts accurate 3D semantic instances achieving state-of-the-art on ScanNet, ScanNet200, S3DIS and STPLS3D.

Python 578 112 Updated Oct 29, 2023

Minkowski Engine is an auto-diff neural network library for high-dimensional sparse tensors

Python 2,547 370 Updated Mar 5, 2024

10-class animal classification with the ResNet convolutional neural network

Python 27 2 Updated Oct 2, 2022

MikuDance: Animating Character Art with Mixed Motion Dynamics

118 2 Updated Jan 10, 2025

Autoregressive Policy for Robot Learning

Python 97 6 Updated Dec 6, 2024

Improving 3D Large Language Model via Robust Instruction Tuning

45 3 Updated Oct 2, 2024

A Simple yet Effective Pathway to Empowering LLaVA to Understand and Interact with 3D World

Python 200 6 Updated Nov 29, 2024

Official implementation of ECCV24 paper "SceneVerse: Scaling 3D Vision-Language Learning for Grounded Scene Understanding"

Python 222 3 Updated Dec 30, 2024

[ICML 2024] Official code repository for 3D embodied generalist agent LEO

Python 402 36 Updated Jan 20, 2025
Python 48 1 Updated Oct 3, 2024

[NIPS'24] Voxel Mamba: Group-Free State Space Models for Point Cloud based 3D Object Detection

Python 106 7 Updated Sep 26, 2024

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 39,166 4,419 Updated Jan 18, 2025

[CVPR2023] Official Implementation of "DSVT: Dynamic Sparse Voxel Transformer with Rotated Sets"

Python 395 31 Updated Sep 4, 2024

VMamba: Visual State Space Models; code is based on Mamba

Python 2,365 155 Updated Oct 28, 2024

OpenPCDet Toolbox for LiDAR-based 3D Object Detection.

Python 4,799 1,312 Updated Aug 8, 2024

MambaOut: Do We Really Need Mamba for Vision?

Python 2,120 36 Updated Oct 22, 2024

OpenMMLab's next-generation platform for general 3D object detection.

Python 5,465 1,569 Updated Jul 10, 2024
Python 191 45 Updated Nov 21, 2022

Code for a series of work in LiDAR perception, including SST (CVPR 22), FSD (NeurIPS 22), FSD++ (TPAMI 23), FSDv2, and CTRL (ICCV 23, oral).

Python 819 102 Updated Dec 22, 2024

Code for the paper "Masked Autoencoders for Self-Supervised Learning on Automotive Point Clouds"

Python 77 7 Updated Mar 9, 2023

[NeurIPS 2024] PointMamba: A Simple State Space Model for Point Cloud Analysis

Python 387 28 Updated Oct 11, 2024

This is an unofficial implementation of the Point Transformer paper.

Python 530 101 Updated Apr 19, 2022