A PyTorch implementation of the paper: [Adaptive Illumination Based Depth Sensing Using Deep Superpixel and Soft Sampling Approximation] [Arxiv Preprint] [IEEE TCI]
The proposed pipeline contains two submodules: adaptive depth sampling (NetM) and depth reconstruction (NetE). The binary adaptive sampling mask is generated from the RGB image. The LiDAR then samples the scene according to this mask and produces a sampled sparse depth map. Finally, both the RGB image and the sampled sparse depth map are used to estimate the dense depth map.
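The two-stage pipeline can be sketched as below. The module bodies here are single-convolution placeholders, not the repo's actual architectures (those live in the training scripts); the sketch only shows how the mask from NetM gates the depth map that NetE consumes.

```python
import torch
import torch.nn as nn

class NetM(nn.Module):
    """Placeholder: predicts a per-pixel sampling probability from RGB."""
    def __init__(self):
        super().__init__()
        self.conv = nn.Conv2d(3, 1, kernel_size=3, padding=1)

    def forward(self, rgb):
        return torch.sigmoid(self.conv(rgb))

class NetE(nn.Module):
    """Placeholder: estimates dense depth from RGB + sampled sparse depth."""
    def __init__(self):
        super().__init__()
        self.conv = nn.Conv2d(4, 1, kernel_size=3, padding=1)

    def forward(self, rgb, sparse_depth):
        return self.conv(torch.cat([rgb, sparse_depth], dim=1))

rgb = torch.rand(1, 3, 64, 64)
scene_depth = torch.rand(1, 1, 64, 64)   # stands in for the true scene depth

prob = NetM()(rgb)                       # per-pixel sampling probabilities
mask = (prob > 0.5).float()              # binarized adaptive sampling mask
sparse_depth = mask * scene_depth        # simulated LiDAR sampling
dense_depth = NetE()(rgb, sparse_depth)  # reconstructed dense depth map
```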
LiDAR systems can capture accurate but sparse depth maps (bottom). Reducing the number of samples increases the capture frame rate. The RGB image (top) can be fused with the captured sparse depth data to estimate a dense depth map. We demonstrate that choosing the sampling locations is important to the accuracy of the estimated depth map. At a 0.25% sampling rate (with respect to the RGB image) and with the same depth estimation method, the depth map estimated from the adaptively sampled sparse depth (third row) is more accurate than the one estimated from random samples (second row).
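For reference, the random-sampling baseline at a 0.25% rate amounts to drawing that fraction of pixel locations uniformly. A minimal sketch (the image size here is an assumed KITTI-like crop, not taken from the repo):

```python
import numpy as np

rng = np.random.default_rng(0)
H, W = 352, 1216          # assumed KITTI-like image size for illustration
rate = 0.0025             # 0.25% of the RGB image's pixels

# Random baseline: pick round(rate * H * W) pixel locations uniformly.
n_samples = int(round(rate * H * W))
flat_idx = rng.choice(H * W, size=n_samples, replace=False)
mask = np.zeros(H * W, dtype=np.uint8)
mask[flat_idx] = 1
mask = mask.reshape(H, W)  # binary sampling mask for the LiDAR
```

NetM replaces this uniform draw with an image-dependent mask that concentrates the same sample budget where depth is hardest to infer.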
conda install -c conda-forge opencv
pip install tensorboard_logger
The train and val datasets are sampled from KITTI-depth-completion.
The train set contains 42949 images and the val set contains 3426 images. We use a traditional image inpainting algorithm to densify the semi-dense ground-truth depth data.
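The exact inpainting algorithm is not specified here; one simple "traditional" choice that illustrates the densification step is nearest-valid-pixel filling via a distance transform (an assumption for illustration, not necessarily the repo's method):

```python
import numpy as np
from scipy import ndimage

def densify(semi_dense, valid):
    """Fill invalid pixels with the depth of the nearest valid pixel.
    semi_dense: (H, W) depth map with holes; valid: (H, W) bool mask.
    One simple traditional-inpainting choice, used here for illustration."""
    # For every invalid pixel, find the indices of the nearest valid pixel.
    _, (iy, ix) = ndimage.distance_transform_edt(~valid, return_indices=True)
    return semi_dense[iy, ix]

# Tiny semi-dense example: two valid depth values, rest are holes.
depth = np.zeros((4, 6), dtype=np.float32)
depth[1, 2] = 5.0
depth[3, 5] = 9.0
dense = densify(depth, depth > 0)
```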
Download the datasets from here,
and then extract them into the $data
directory. Modify the path of the $data
directory in line#34/35 of file train_SparseD2DenseD.py, line#49/50 of file train_SuperPixelFCN_color.py, and line#XX/XX of file train_NetM.py.
Run
python train_RGBSparseD2DenseD.py
to train the RGB and Sparse Depth Completion network NetE.
Run
python train_SuperPixelFCN_color.py
to train the FCN superpixel network using the SLIC loss. NetFCN is used to initialize NetM.
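A SLIC-style loss for an FCN superpixel network typically scores soft pixel-to-superpixel assignments by how well the resulting superpixel centers reconstruct the pixel features. The formulation below is an assumed generic sketch, not the repo's exact loss; `features` and `assoc` are hypothetical names:

```python
import torch
import torch.nn.functional as F

def slic_loss(features, assoc):
    """SLIC-style soft clustering loss (assumed generic form).
    features: (B, C, H, W) pixel features (e.g. Lab color + xy position).
    assoc:    (B, K, H, W) logits assigning each pixel to K superpixels."""
    B, C, H, W = features.shape
    q = F.softmax(assoc, dim=1)                 # soft assignments (B, K, H, W)
    q_flat = q.reshape(B, -1, H * W)            # (B, K, N)
    f_flat = features.reshape(B, C, H * W)      # (B, C, N)
    # Superpixel centers: assignment-weighted mean of pixel features.
    weights = q_flat.sum(dim=2, keepdim=True).transpose(1, 2)      # (B, 1, K)
    centers = torch.bmm(f_flat, q_flat.transpose(1, 2)) / (weights + 1e-8)
    # Reconstruct each pixel from its softly assigned centers.
    recon = torch.bmm(centers, q_flat)          # (B, C, N)
    return F.mse_loss(recon, f_flat)

loss = slic_loss(torch.rand(2, 5, 16, 16), torch.randn(2, 9, 16, 16))
```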
After NetE and NetFCN are trained, set the file name of the trained NetE model in line#29 of file train_NetM.py and run
python train_NetM_SP_RGBSparseD2Dense.py
to train the adaptive image sampling network NetM.
To visualize the training process, run
tensorboard --logdir tensorBoardRuns