1Tsinghua University 2ETH Zurich 3Google 4Microsoft
Segment3D (right) predicts accurate segmentation masks, outperforming fully-supervised 3D segmentation methods like Mask3D (left), without requiring manually labeled 3D training data.
[Project Webpage] [Paper]
We follow the main dependencies of Mask3D:
python: 3.10.9
cuda: 11.3
You can set up a conda environment as follows:
# Some users experienced issues on Ubuntu with an AMD CPU
# Install libopenblas-dev (issue #115, thanks WindWing)
# sudo apt-get install libopenblas-dev
export TORCH_CUDA_ARCH_LIST="6.0 6.1 6.2 7.0 7.2 7.5 8.0 8.6"
conda env create -f environment.yml
conda activate mask3d_cuda113
pip3 install torch==1.12.1+cu113 torchvision==0.13.1+cu113 --extra-index-url https://download.pytorch.org/whl/cu113
pip3 install torch-scatter -f https://data.pyg.org/whl/torch-1.12.1+cu113.html
pip3 install 'git+https://github.com/facebookresearch/detectron2.git@710e7795d0eeadf9def0e7ef957eea13532e34cf' --no-deps
mkdir third_party
cd third_party
git clone --recursive "https://github.com/NVIDIA/MinkowskiEngine"
cd MinkowskiEngine
git checkout 02fc608bea4c0549b0a7b00ca1bf15dee4a0b228
python setup.py install --force_cuda --blas=openblas
cd ../Segmentator
git checkout 3e5726500896748521a6ceb81271b0f5b2c0e7d2
make
cd ../pointnet2
python setup.py install
cd ../../
pip3 install pytorch-lightning==1.7.2
conda install cuml
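Optionally, you can verify the installation from Python before continuing. This is just a small sanity check for the environment set up above:

```python
# Optional sanity check: run inside the activated environment to confirm that
# the CUDA builds of PyTorch and MinkowskiEngine import correctly.
import torch
import MinkowskiEngine as ME

print(torch.__version__, torch.version.cuda)   # expect 1.12.1 and 11.3
print("CUDA available:", torch.cuda.is_available())
print("MinkowskiEngine:", ME.__version__)
```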
To set up the segment-anything model:
cd third_party
git clone [email protected]:facebookresearch/segment-anything.git
Replace automatic_mask_generator.py with the provided version to enable multi-scale mask generation, and move process.py to third_party/segment-anything/scripts/:
cd segment-anything
pip install -e .
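The provided automatic_mask_generator.py is a drop-in replacement for SAM's generator. For orientation only, this is a rough sketch of how SAM's automatic mask generator is normally driven; the image path and parameter values are illustrative and not necessarily what process.py uses:

```python
# Illustrative only: standard usage of SAM's automatic mask generator.
# The provided automatic_mask_generator.py extends this with multi-scale masks.
import cv2
from segment_anything import sam_model_registry, SamAutomaticMaskGenerator

sam = sam_model_registry["vit_h"](checkpoint="checkpoints/sam_vit_h_4b8939.pth")
sam.to("cuda")

mask_generator = SamAutomaticMaskGenerator(
    sam,
    points_per_side=32,  # density of the point-prompt grid
    crop_n_layers=1,     # extra image crops yield finer masks (illustrative value)
)

# Expects an HxWx3 RGB uint8 image; the path is a placeholder.
image = cv2.cvtColor(cv2.imread("path/to/color/0.jpg"), cv2.COLOR_BGR2RGB)
masks = mask_generator.generate(image)  # list of dicts: 'segmentation', 'area', ...
```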
- Download the pre-trained checkpoint and demo data. Place the checkpoint in checkpoints/ and the data in demo_test/.
- (Optional) To use your own test data:
  - Apply the Graph-Based Image Segmentation algorithm to your test scenes (see the original repository).
  - Place the mesh and segmentations in demo_test/TEST_SCENE/.
- Run the demo:
  bash scripts/run_demo.sh TEST_SCENE
- Visualize the results with PyViz3D (a minimal sketch of how such a visualization is built follows this list):
  python -m http.server 6008
  Open a browser and navigate to localhost:6008.
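For reference, this is roughly how a PyViz3D visualization is assembled and saved before being served over HTTP. The point data and output folder here are placeholders, not the exact objects written by the demo script:

```python
# Minimal PyViz3D sketch: build a point-cloud visualization and save it,
# then serve the output folder with `python -m http.server 6008`.
import numpy as np
import pyviz3d.visualizer as viz

points = np.random.rand(10000, 3).astype(np.float32)        # xyz positions
colors = (np.random.rand(10000, 3) * 255).astype(np.uint8)  # per-point RGB

v = viz.Visualizer()
v.add_points("segmentation", points, colors, point_size=25)
v.save("demo_visualization")  # writes an HTML scene into this folder
```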
Download the 2D data from ScanNet to data/processed/scannet. Your folder structure should look like this:
scannet
├── scene1
│ ├── color
│ ├── depth
│ ├── intrinsic
│ ├── pose
├── scene2
├── ...
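For reference, a single exported frame can be read back as sketched below. This assumes the standard ScanNet export conventions (JPEG color, 16-bit depth in millimeters, 4x4 camera-to-world poses, and an intrinsic_depth.txt file), so adjust the file names to your export:

```python
# Sketch of reading one ScanNet frame from the layout above.
# File names and units follow the usual ScanNet export and may need adjusting.
import cv2
import numpy as np

scene_dir = "data/processed/scannet/scene1"
frame_id = 0

color = cv2.cvtColor(cv2.imread(f"{scene_dir}/color/{frame_id}.jpg"), cv2.COLOR_BGR2RGB)
depth = cv2.imread(f"{scene_dir}/depth/{frame_id}.png", cv2.IMREAD_UNCHANGED) / 1000.0  # meters
pose = np.loadtxt(f"{scene_dir}/pose/{frame_id}.txt")                 # 4x4 camera-to-world
intrinsic = np.loadtxt(f"{scene_dir}/intrinsic/intrinsic_depth.txt")  # 4x4 depth intrinsics
```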
Download the SAM checkpoints from here and follow their instructions to generate 2D masks:
cd third_party/segment-anything
python scripts/process.py --checkpoint "PATH_TO_SAM_CHECKPOINT" --model-type vit_h
Train Mask3D on the ScanNet dataset:
bash scripts/train_stage1.sh
Download the 3D data from ScanNet and preprocess the datasets:
python -m datasets.preprocessing.scannet_preprocessing preprocess \
--data_dir="PATH_TO_RAW_SCANNET_DATASET" \
--save_dir="data/processed/scannet200" \
--git_repo="PATH_TO_SCANNET_GIT_REPO" \
--scannet200=true
Generate 3D masks for 3D data:
bash scripts/generate_masks_trainset.sh
bash scripts/generate_masks_valset.sh
Select confident masks as pseudo labels to finetune the 3D segmentation model:
bash scripts/train_stage2.sh
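The confidence-based selection is handled inside the provided scripts. Purely to illustrate the idea, a simple threshold filter over per-mask confidence scores could look like the following; the file paths, dictionary keys, and threshold are assumptions, not the values the scripts use:

```python
# Illustration only: keep masks whose confidence exceeds a threshold and
# store them as pseudo labels. Paths, keys, and threshold are hypothetical.
import torch

CONFIDENCE_THRESHOLD = 0.9  # assumed cut-off

preds = torch.load("generated_masks/scene1.pt")   # hypothetical mask-generation output
keep = preds["scores"] > CONFIDENCE_THRESHOLD     # per-mask confidence scores
pseudo_labels = preds["masks"][keep]              # binary per-point masks
torch.save({"masks": pseudo_labels}, "pseudo_labels/scene1.pt")
```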
Download the 3D data from ScanNet++ and preprocess the datasets:
python -m datasets.preprocessing.scannetpp_preprocessing preprocess \
--data_dir="PATH_TO_RAW_SCANNETPP_DATASET" \
--save_dir="data/processed/scannetpp"
Evaluate the model:
bash scripts/eval.sh
@inproceedings{Huang2023Segment3D,
  author    = {Huang, Rui and Peng, Songyou and Takmaz, Ayca and Tombari, Federico and Pollefeys, Marc and Song, Shiji and Huang, Gao and Engelmann, Francis},
  title     = {Segment3D: Learning Fine-Grained Class-Agnostic 3D Segmentation without Manual Labels},
  booktitle = {European Conference on Computer Vision (ECCV)},
  year      = {2024}
}
We borrow source code from Mask3D and segment-anything-langsplat; we sincerely thank the authors for their efforts.