PEDESTRIAN CROSSING DETECTION WITH FRCNN(faster rcnn)

Problem Defination:

The continuous monitoring of street signals is important to regulate the traffic smoothly and to avoid accidents. There are challanges in maintaining such signals throughout since those involve plannning,time and labour. One of the major challanges is to maintain the pedestrian crossings. Pedestrian crossings may eventaully degrade because of increased traffic , weathering, lack of maintainace etc. It is really difficult to maintain pedestrian crossings from human source information because this causes delay in maintenance and there are high chances that several degraded signals are left out.

City of Helsinki,Finland is also facing similar situations, which needs immediate attention. For this problem, we are using traditional machine learning approach and deep learning techniques(frcnn,mask rcnn and retinanet) to detect the pedestrian crossing and monitor it's condition from orthophotos(aerial images).

Discussing Machine Leaning Approach:

Every machine learning task needs the data to learn from it. We fetched the pedestrian crossing data from https://kartta.hel.fi/ using wms api. The code for generating the labelled data is avilable in fetching_pedestrian_crossing_images.ipynb

The images fetched from their database not only contained the crossroad images but also contained the images of houses roof, anomalous gray and white stripes, and noisy images(mostly deterioted crossing paintings, shadows etc). Some of the images from the database are shown below. Several experimentation on those images data including anomaly detection and simple classification tasks are included in https://github.com/sangamdeuja/helsinki_crosswalk.

As we checked the maps, we observed some pedestrian crossing that were not actually not part of the database. This is the main motivation of implementing deep neural architecures for pedestrian crossing detection in the first place. Here we discuss about implementation of frcnn.

Deep Learning Approach

Problems with Convnets

The length of output layer is not fixed because the occurrences of the object in an image is not fixed
The object could be of any size and at any spatial position, that disables the idea of taking different region of interest followed by classification.

RCNN (Regional CNN, Ross Girshick et al. https://arxiv.org/abs/1311.2524)

Uses selective search algorithm to extract just 2000 regions from the image as region proposals
The regions from the selective search are fed to CNN to extract the features
The extracted features are fed to SVM classifier (Hinge loss/ huber loss) and Bounding box regressor (sum of square error SSE)

Fast RCNN ,Ross Girshick et al. https://arxiv.org/abs/1504.08083

Faster than RCNN
Images are fed to convnets to obtain features, identify the region of proposal, apply ROI pooling layer to convert to fixed size for each proposal, apply fully connected layers with SoftMax and bounding box regressor

Faster RCNN ,Shaoqing Ren et al. https://arxiv.org/abs/1506.01497

Faster than both above variations
Eliminates the selective search from Fast RCNN, and adds region proposal network (RPN)
RPN ranks region boxes(anchors) and proposes the most likely object containing region
By default, there are 9 anchors at a position in an image(eg: 3 scales 128,256,512 and 3 aspect ratios of 1:1,1:2,2:1) generates 9 anchors.
There are a huge number of anchors generated. In order to reduce them, cross boundary anchors are ignored and the non-maximum suppression(IOU) is applied.

Dataset generation and labelling

Image patches of helsinki city were generated using the code generate_datasets.ipynb. The training images were manually labelled using labelImg (https://github.com/tzutalin/labelImg)

FRCNN Implementation And Results

Frcnn implementation is based on https://github.com/kbardool/keras-frcnn . Annotation file was parsed in the form of text file. The xml file generated from labelImg is converted to csv file using create_annotate_csvfile.ipynb. The code to generate the annotation file in my context(csv to txt) is available in create_annotate_textfile.ipynb. The parameters settings is available in config.pickle

Extracting latitude and longitude

If we have the geo-coordinates of image of interest and the size of image, it is easy to extract the cordinates of the pedestrian crossing by mapping the predicted bounding box. The code for extracting the coordinates of crossing is available in extract_coordinate.ipynb

Applications

Identification of unlablled pedestrian crossing. This helps keeping database update.
Monitor the conditions if the painting is required.
Estimating the price incurring for repainting within specific area/city

Similar work using Mask rcnn is available in https://github.com/jyotipba/Hesinki_crossing_detection and using retinanet is availale in https://github.com/bhuwanKarki/keras_retinanet

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
keras_frcnn		keras_frcnn
miscel_images		miscel_images
results_imgs		results_imgs
test_images		test_images
README.md		README.md
annotate.txt		annotate.txt
annotations.csv		annotations.csv
config.pickle		config.pickle
create_annotate_textfile.ipynb		create_annotate_textfile.ipynb
create_annotation_csvfile.ipynb		create_annotation_csvfile.ipynb
extract_coordinate.ipynb		extract_coordinate.ipynb
fetching_pedestrian_crossing_images.ipynb		fetching_pedestrian_crossing_images.ipynb
generate_datasets.ipynb		generate_datasets.ipynb
measure_map.py		measure_map.py
requirements.txt		requirements.txt
test_frcnn.py		test_frcnn.py
train_frcnn.py		train_frcnn.py

sangamdeuja/Helsinki_pedestrian_crossing_detection

Folders and files

Latest commit

History

Repository files navigation

PEDESTRIAN CROSSING DETECTION WITH FRCNN(faster rcnn)

Problem Defination:

Discussing Machine Leaning Approach:

Deep Learning Approach

Problems with Convnets

RCNN (Regional CNN, Ross Girshick et al. https://arxiv.org/abs/1311.2524)

Fast RCNN ,Ross Girshick et al. https://arxiv.org/abs/1504.08083

Faster RCNN ,Shaoqing Ren et al. https://arxiv.org/abs/1506.01497

Dataset generation and labelling

FRCNN Implementation And Results

Extracting latitude and longitude

Applications

About

Resources

Stars

Watchers

Forks

Languages