Mask4Former: Mask Transformer for 4D Panoptic Segmentation (Renamed from MASK4D)

Kadir Yilmaz, Jonas Schult, Alexey Nekrasov, Bastian Leibe

RWTH Aachen University

Mask4Former is a transformer-based model for 4D Panoptic Segmentation, achieving a new state-of-the-art performance on the SemanticKITTI test set.

[Project Webpage] [arXiv]

News

2023-01-29: Mask4Former accepted to ICRA 2024
2023-09-28: Mask4Former on arXiv

Dependencies

The main dependencies of the project are the following:

python: 3.8
cuda: 11.7

You can set up a conda environment as follows

conda create --name mask4former python=3.8
conda activate mask4former

pip install torch==1.13.0+cu117 torchvision==0.14.0+cu117 --extra-index-url https://download.pytorch.org/whl/cu117

pip install -r requirements.txt --no-deps

pip install git+https://github.com/NVIDIA/MinkowskiEngine.git -v --no-deps

pip install git+https://github.com/facebookresearch/pytorch3d.git@v0.7.5 --no-deps

Data preprocessing

After installing the dependencies, we preprocess the SemanticKITTI dataset.

python -m datasets.preprocessing.semantic_kitti_preprocessing preprocess \
--data_dir "PATH_TO_RAW_SEMKITTI_DATASET" \
--save_dir "data/semantic_kitti"

python -m datasets.preprocessing.semantic_kitti_preprocessing make_instance_database \
--data_dir "PATH_TO_RAW_SEMKITTI_DATASET" \
--save_dir "data/semantic_kitti"

Training and testing

Train Mask4Former:

python main_panoptic.py

In the simplest case the inference command looks as follows:

python main_panoptic.py \
general.mode="validate" \
general.ckpt_path='PATH_TO_CHECKPOINT.ckpt'

Or you can use DBSCAN to boost the scores even further:

python main_panoptic.py \
general.mode="validate" \
general.ckpt_path='PATH_TO_CHECKPOINT.ckpt' \
general.dbscan_eps=1.0

Trained checkpoint

Mask4Former

The provided model, trained after the submission, achieves 71.1 LSTQ without DBSCAN and 71.5 with DBSCAN post-processing.

BibTeX

@inproceedings{yilmaz24mask4former,
  title     = {{Mask4Former: Mask Transformer for 4D Panoptic Segmentation}},
  author    = {Yilmaz, Kadir and Schult, Jonas and Nekrasov, Alexey and Leibe, Bastian},
  booktitle = {{International Conference on Robotics and Automation (ICRA)}},
  year      = {2024}
}

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
conf		conf
data		data
datasets		datasets
docs		docs
models		models
scripts		scripts
trainer		trainer
utils		utils
.gitignore		.gitignore
README.md		README.md
main_panoptic.py		main_panoptic.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

conf

conf

data

data

datasets

datasets

docs

docs

models

models

scripts

scripts

trainer

trainer

utils

utils

.gitignore

.gitignore

README.md

README.md

main_panoptic.py

main_panoptic.py

requirements.txt

requirements.txt

Repository files navigation

Mask4Former: Mask Transformer for 4D Panoptic Segmentation (Renamed from MASK4D)

News

Dependencies

Data preprocessing

Training and testing

Trained checkpoint

BibTeX

About

Contributors 2

Languages

YilmazKadir/Mask4Former

Folders and files

Latest commit

History

Repository files navigation

Mask4Former: Mask Transformer for 4D Panoptic Segmentation (Renamed from MASK4D)

News

Dependencies

Data preprocessing

Training and testing

Trained checkpoint

BibTeX

About

Topics

Resources

Stars

Watchers

Forks

Languages