Skip to content

A Moving Object Semantic Segmentation Model Based on the Bird's eye view - jxLiang's Bachelor Thesis Code

Notifications You must be signed in to change notification settings

jxLiang666/MotionBEVpp

 
 

Repository files navigation

MotionBEV++: A Moving Object Semantic Segmentation Model Based on the Bird's eye view

Developing From MotionBEV

Thanks for xiekkki's wonderful job!

Prerequisites

Take a reference of MotionBEV

my environment:

  • Ubuntu:18.04 CUDA:11.6 CUDNN:8.3 python:3.9
  • torch==1.13.1+cu116 torch-scatter==2.1.1
  • dropblock==0.3.0 tqdm==4.66.1 numpy==1.21.6

(how to install pytorch : take a reference in pytorch.org)

ensure that open3d = 0.15.2, if you want to visualize the pointcloud

conda create -n mosbev python=3.9

DATA

Take a reference of MotionBEV

Only need to prepare SemanticKITTI (KITTI-road if you want to use)

How to use it

Generate the residual image

python utils/generate_residual/utils/auto_gen_polar_sequential_residual_images-mp.py

Infer

there are two pretrained models:

MotionBEVpp-valid-77.58.pt

MotionBEVpp-test-77.88.pt

  • modify data_path, data_path_test, residual_path and model_load_path in the infer.yaml

    data_path : the path to kittidataset

    data_path_test : the path to the test part of kittidataset

    residual_path : the path to the residual image path

    model_load_path : the pretrained model path

  • also if you want to infer in the test part, change the parameter of test to True in infer.yaml

  • then run

python infer.py

Train

python train.py

Evaluate

  • run
python utils/evaluate.py -l $your_dataset_path -s 'valid'

Visualize

  • modify seq, dataset_path, gt_label_path and prediction_label_path in the viz.sh

    seq : sequence

    dataset_path : the path to dataset velodyne

    gt_label_path : the path to dataset groundtrue label

    prediction_label_path : the path to the prediction

  • then run

cd open3d-semantickitti-mos-visualize
bash viz.sh

Press key d to show next frame.

Press key a to show last frame.

Press key z to auto play.

Press key x to stop auto play.

Press key s to save the view.

Press key f and input number to the frame you want.

Press key p to get the screen capture.

About

A Moving Object Semantic Segmentation Model Based on the Bird's eye view - jxLiang's Bachelor Thesis Code

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 98.7%
  • GLSL 1.1%
  • Shell 0.2%