Improved PSMNet for Deep Stereo Disparity Estimation

This is the code repository of Improved-PSMNet. Implemented by PyTorch.

See our project report for details.

Authors:

Ganlin Zhang (@zhangganlin) Haokai Pang (@hkkkpang) Xinyu Shen(@ucabxs0) Yunying Zhu(@yunyingzhu)

Structure

Improved-PSMNet is designed to estimate disparity from stereo image pairs. Here is the structure of Improved-PSMNet:

Detailed structure in the green dashed box is displayed below:

Example

Left and right input images:

Disparity groundtruth and our estimation:

Conda Virtual Environment

conda env create -f env.yaml

Activate the conda environment before run the code.

conda activate IPSM

Dataset

KITTI stereo 2015

(http://www.cvlibs.net/datasets/kitti/eval_scene_flow.php)
Scene Flow (driving part)

(https://lmb.informatik.uni-freiburg.de/resources/datasets/SceneFlowDatasets.en.html)

Only left image, right image and groundtruth disparity are needed.

Download these two datasets and extract them into dataset folder. The folder structure should be as follow:

dataset
├── data_scene_flow_2015
│   ├── testing
│   │   ├── image_2
│   │   └── image_3
│   └── training
│       ├── disp_occ_0
│       ├── image_2
│       └── image_3
├── driving_disparity
│   └── 35mm_focallength
│       └── scene_forwards
│           ├── fast
│           │   ├── left
│           │   └── right
│           └── slow
│               ├── left
│               └── right
└── driving_frame_cleanpass
    └── 35mm_focallength
        └── scene_forwards
            ├── fast
            │   ├── left
            │   └── right
            └── slow
                ├── left
                └── right

Semantic Segmentation Model

cd into semantic_segmentation folder and download pretrained semantic segmentation model from MIT CSAIL Computer Vision

cd semantic_segmentation
bash download_pretrained_model.sh
cd ..

Training Improved-PSMNet on SceneFlow dataset

bash train.sh

Finetuning Improved-PSMNet

Notice that our model is only trained on SceneFlow, if you want it to have better performance on KITTI, you can finetune it after the training phase.

bash finetune.sh

Getting output disparity image

After training/finetuning the network, we can use it to generate estimated disparity image.

bash test_img.sh

Pretrained model

We also provide pretrained model, so that you can use it directly to test the network.

Extract it put it into trained folder. The folder structure should be as follow:

trained
├── dilated
├── dilated_gwc_seg
├── dilated_seg
├── gwc
├── gwc_dilated
├── gwc_seg
├── new_psm
└── seg

Euler Cluster

Some notes about how to train the network on ETHZ's Euler Cluster are listed in how-to-hand-in-job.md

Name		Name	Last commit message	Last commit date
Latest commit History 93 Commits
dataloader		dataloader
dataset		dataset
figures		figures
models		models
semantic_segmentation		semantic_segmentation
test_result		test_result
trained		trained
utils		utils
.gitignore		.gitignore
Improved-PSMNet-for-Deep-Stereo-Disparity-Estimation.pdf		Improved-PSMNet-for-Deep-Stereo-Disparity-Estimation.pdf
LICENSE		LICENSE
README.md		README.md
Test_img.py		Test_img.py
env.yaml		env.yaml
finetune.py		finetune.py
finetune.sh		finetune.sh
generate_seg.py		generate_seg.py
how-to-hand-in-job.md		how-to-hand-in-job.md
main.py		main.py
test_img.sh		test_img.sh
train.sh		train.sh

License

zhangganlin/Improved-PSMNet-for-Deep-Stereo-Disparity-Estimation

Folders and files

Latest commit

History

Repository files navigation

Improved PSMNet for Deep Stereo Disparity Estimation

Structure

Example

Conda Virtual Environment

Dataset

Semantic Segmentation Model

Training Improved-PSMNet on SceneFlow dataset

Finetuning Improved-PSMNet

Getting output disparity image

Pretrained model

Euler Cluster

About

Topics

Resources

License

Stars

Watchers

Forks

Languages