Semantic Segmentation Tutorial
Below we provide more detailed instructions of utilizing MCD dataset for semantic segmentation.
1. Data organization and pre-processing
We mainly follow the data organization of SemanticKITTI so that most existing backbones tested on SemanticKITTI can be easily adopted on MCD. After downloading the annotated lidar, we suggest users to follow the detailed organization instruction as in this tutorial to pre-process the data.
2. Training/Testing with Different Backbones
In the following section, we provide the implementation details of training/testing different 3D semantic segmentation methods with MCD dataset. For all methods, we provide their corresponding Docker images to facilitate their implementations. For each repo, you can either build the corresponding Docker image by docker build -t IMAGE_NAME .
in each repo with the Dockerfile we provided, or pull the images we built in advance from Dockerhub.
MinkowskiNet (MK-Net) and SPVCNN
For both MK-Net and SPVCNN, we leverage the re-implementation of from 2DPass with some minor modifications on configs and data I/O.
- Repo: https://github.com/AronCao49/2DPASS.
- To pull built image, run
docker pull aroncao49/2dpass:latest
. - To train MK-Net/SPVCNN with MCD dataset, please follow the train/test instruction of
README.md
in the repo.
SalsaNext
For SalsaNext, our implementation is mostly based on the official code.
- Repo: https://github.com/AronCao49/SalsaNext.
- To pull built image, run
docker pull aroncao49/salsanext:latest
. - Follow the updated
README.md
to train/test with MCD dataset. - For training configuration, we made the following changes to adapt SalsaNext to MCD:
- LiDAR fov, updated to -35.2 to 35.2 degree.
- Range image resolution, updated to 256x256.
- Range image standardization, with mean & std computed from MCD dataset.
Cylinder3D
TBD
SDSeg3D
TBD
WaffleIron
TBD
S-Former
TBD