Skip to content

kawshik8/Birds-eye-view-layout-prediction

Repository files navigation

Bird's Eye View layout prediction: roads and cars

Final project for Deep Learning course (DS-GA 1008, NYU Center for Data Science)

Top-10 overall rank in road layout prediction and car bounding boxes prediction tasks

Kawshik Kannan, Hsin-Rung Chou, Dipika Rajesh

Report | [Video](add link for video)

Abstract

In this project we focus on Bird's Eye View (BEV) prediction based on monocular photos taken by the cameras on top of the car. We experiment with Determinisitic autoencoders, stochastic variational autoencoders, generative adversarial networks for generating Bird's eye view road layout and Bird's eye view of vehicles on the road indirectly. THe best performing models on the training set use GANs whereas the maximum test performance was from the deterministic model. Our models achieve 0.904 val threat score on the road layout prediction task and 0.044 val threat score on the BB prediction task.


Usage

Generate and save labels

Use generate_labels.py to generate

  • vehicles mask
  • road mask
  • warped and glued photos

Road Layout Prediction and Bounding Boxes Prediction

Refer to src/ for code used to train and test road layout prediction models.

  • GANs src/GANmodels
  • Deterministic models and Retinanet src/SupModels
  • training and validation scripts src/trainer
  • training and validation scripts src/trainer

Self-supervised learning

Implemented PIRL and SIMCLR SSL techniques in src/SSLmodels.py


Libraries used

Papers and useful links:

About

Deep Learning Course Project

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published