Bird's Eye View layout prediction: roads and cars

Final project for Deep Learning course (DS-GA 1008, NYU Center for Data Science)

Top-10 overall rank in road layout prediction and car bounding boxes prediction tasks

Kawshik Kannan, Hsin-Rung Chou, Dipika Rajesh

Report | [Video](add link for video)

Abstract

In this project we focus on Bird's Eye View (BEV) prediction based on monocular photos taken by the cameras on top of the car. We experiment with Determinisitic autoencoders, stochastic variational autoencoders, generative adversarial networks for generating Bird's eye view road layout and Bird's eye view of vehicles on the road indirectly. THe best performing models on the training set use GANs whereas the maximum test performance was from the deterministic model. Our models achieve 0.904 val threat score on the road layout prediction task and 0.044 val threat score on the BB prediction task.

Usage

Generate and save labels

Use generate_labels.py to generate

vehicles mask
road mask
warped and glued photos

Road Layout Prediction and Bounding Boxes Prediction

Refer to src/ for code used to train and test road layout prediction models.

GANs src/GANmodels
Deterministic models and Retinanet src/SupModels
training and validation scripts src/trainer
training and validation scripts src/trainer

Self-supervised learning

Implemented PIRL and SIMCLR SSL techniques in src/SSLmodels.py

Libraries used

Papers and useful links:

simclr https://arxiv.org/abs/2002.05709
PIRL https://arxiv.org/abs/1912.01991
retinanet https://arxiv.org/abs/1708.02002
rotation based object detection https://arxiv.org/pdf/1911.08299.pdf

Name		Name	Last commit message	Last commit date
Latest commit History 75 Commits
helper_code		helper_code
notebooks		notebooks
results		results
src		src
.gitattributes		.gitattributes
.gitignore		.gitignore
DL_final_report.pdf		DL_final_report.pdf
README.md		README.md
run.sh		run.sh
sbatch_script.s		sbatch_script.s

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

helper_code

helper_code

notebooks

notebooks

results

results

src

src

.gitattributes

.gitattributes

.gitignore

.gitignore

DL_final_report.pdf

DL_final_report.pdf

README.md

README.md

run.sh

run.sh

sbatch_script.s

sbatch_script.s

Repository files navigation

Bird's Eye View layout prediction: roads and cars

Final project for Deep Learning course (DS-GA 1008, NYU Center for Data Science)

Top-10 overall rank in road layout prediction and car bounding boxes prediction tasks

Kawshik Kannan, Hsin-Rung Chou, Dipika Rajesh

Report | [Video](add link for video)

Abstract

Usage

Generate and save labels

Road Layout Prediction and Bounding Boxes Prediction

Self-supervised learning

Papers and useful links:

About

Releases

Packages

Contributors 2

Languages

kawshik8/Birds-eye-view-layout-prediction

Folders and files

Latest commit

History

Repository files navigation

Bird's Eye View layout prediction: roads and cars

Final project for Deep Learning course (DS-GA 1008, NYU Center for Data Science)

Top-10 overall rank in road layout prediction and car bounding boxes prediction tasks

Kawshik Kannan, Hsin-Rung Chou, Dipika Rajesh

Report | [Video](add link for video)

Abstract

Usage

Generate and save labels

Road Layout Prediction and Bounding Boxes Prediction

Self-supervised learning

Papers and useful links:

About

Resources

Stars

Watchers

Forks

Languages