Skip to content
View zc2023's full-sized avatar
🏡
Work from home
🏡
Work from home

Highlights

  • Pro

Block or report zc2023

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
32 stars written in Python
Clear filter

A Gradio web UI for Large Language Models with support for multiple inference backends.

Python 41,752 5,436 Updated Jan 23, 2025

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 39,166 4,419 Updated Jan 18, 2025

Machine Learning From Scratch. Bare bones NumPy implementations of machine learning models and algorithms with a focus on accessibility. Aims to cover everything from linear regression to deep lear…

Python 24,210 4,614 Updated Oct 15, 2023

Mamba SSM architecture

Python 13,831 1,191 Updated Jan 18, 2025

A PyTorch implementation of the Transformer model in "Attention is All You Need".

Python 8,987 1,993 Updated Apr 16, 2024

OpenMMLab's next-generation platform for general 3D object detection.

Python 5,465 1,569 Updated Jul 10, 2024

OpenPCDet Toolbox for LiDAR-based 3D Object Detection.

Python 4,799 1,312 Updated Aug 8, 2024

Minkowski Engine is an auto-diff neural network library for high-dimensional sparse tensors

Python 2,547 370 Updated Mar 5, 2024

VMamba: Visual State Space Models,code is based on mamba

Python 2,365 155 Updated Oct 28, 2024

MambaOut: Do We Really Need Mamba for Vision?

Python 2,120 36 Updated Oct 22, 2024

Code for a series of work in LiDAR perception, including SST (CVPR 22), FSD (NeurIPS 22), FSD++ (TPAMI 23), FSDv2, and CTRL (ICCV 23, oral).

Python 819 102 Updated Dec 22, 2024

[ICCV 2023] StreamPETR: Exploring Object-Centric Temporal Modeling for Efficient Multi-View 3D Object Detection

Python 618 70 Updated Jun 26, 2024

Mask3D predicts accurate 3D semantic instances achieving state-of-the-art on ScanNet, ScanNet200, S3DIS and STPLS3D.

Python 578 112 Updated Oct 29, 2023

This is an unofficial implementation of the Point Transformer paper.

Python 530 101 Updated Apr 19, 2022

[ICML 2024] Official code repository for 3D embodied generalist agent LEO

Python 402 36 Updated Jan 20, 2025

[CVPR2023] Official Implementation of "DSVT: Dynamic Sparse Voxel Transformer with Rotated Sets"

Python 395 31 Updated Sep 4, 2024

[NeurIPS 2024] PointMamba: A Simple State Space Model for Point Cloud Analysis

Python 387 28 Updated Oct 11, 2024

[CVPR2024] OneFormer3D: One Transformer for Unified Point Cloud Segmentation

Python 384 34 Updated Oct 23, 2024

Official implementation of ECCV24 paper "SceneVerse: Scaling 3D Vision-Language Learning for Grounded Scene Understanding"

Python 222 3 Updated Dec 30, 2024

A Simple yet Effective Pathway to Empowering LLaVA to Understand and Interact with 3D World

Python 200 6 Updated Nov 29, 2024
Python 191 45 Updated Nov 21, 2022

[NeurIPS 2024] A Unified Framework for 3D Scene Understanding

Python 127 3 Updated Nov 28, 2024

[NIPS'24] Voxel Mamba: Group-Free State Space Models for Point Cloud based 3D Object Detection

Python 106 7 Updated Sep 26, 2024

Code&Data for Grounded 3D-LLM with Referent Tokens

Python 98 2 Updated Jan 5, 2025

Autoregressive Policy for Robot Learning

Python 97 6 Updated Dec 6, 2024

Code for the paper "Masked Autoencoders for Self-Supervised Learning on Automotive Point Clouds"

Python 77 7 Updated Mar 9, 2023

Official implementation of the paper "Unifying 3D Vision-Language Understanding via Promptable Queries"

Python 59 6 Updated Aug 2, 2024
Python 48 1 Updated Oct 3, 2024

The Experiment Code for Swin3D

Python 33 Updated Mar 6, 2024

卷积神经网络ResNet进行动物10分类

Python 27 2 Updated Oct 2, 2022
Next