Skip to content

tomasamado/cityscapes-image-segmentation

Repository files navigation

Neural Networks: Implementation and Application - Computer Vision Project

Abstract

Image segmentation has important applications in different fields such as medical imaging (diagnosis, surgery planning,...), recognition tasks (face recognition) and computer vision. Segmentation methods based on deep neural networks have been evolving and achieving higher performance in different tasks over the last years. In this report, we summarize our work within the Computer Vision project of the \textit{Neural Networks: Theory and Implementation} (WiSem2020/21) course at Saarland University, which focuses on understanding and implementing different neural network models for the mentioned methods. During this project, we familiarised ourselves with two image segmentation datasets: the well known PASCAL Visual Object Classes (PASCAL VOC) and the Cityscapes datasets. We successfully implemented a version of the R2U-Net model used in medical image segmentation, named R2U-Net64, that can be used in multi-class pixel-level segmentation tasks with the Cityscapes dataset and we improved this model's performance by implementing and including a module based on height-driven attention networks (HANet) into the architecture.

Notebooks

Task 1: SimpleSegNet, PascalVOC dataset

Task 2: R2-Unet, Cityscapes datase

Task 3: R2-Unet + HANet, Cityscapes dataset

About

Implementation of R2U-Net and a custom model using the main module from HANet + R2U-Net for image segmentation of urban scenes on the Cityscapes dataset

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published