mrcopicat/dask-parallel-image-Segmentation

Dask Parallel Image Segmentation & Preprocessing :


A parallel segmentation algorithm for a flowers dataset, built with the Dask library in Python.

The problem addressed is a classic image segmentation task. In DaskFlowersSegmentation.py, I apply some basic preprocessing (grayscale filters, for example) and then run a parallel pipeline that preprocesses and segments each flower image in the dataset.
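The preprocess-then-segment pipeline described above can be sketched with dask.delayed. This is a minimal illustration, not the code in DaskFlowersSegmentation.py: the helper names (to_grayscale, count_segments, make_toy_image, segment_all), the fixed threshold, the connected-region counting used as a stand-in segmentation step, and the synthetic images are all assumptions made for the example.

```python
import dask
import numpy as np


def to_grayscale(rgb):
    """Convert an (H, W, 3) RGB array to grayscale using standard luma weights."""
    return rgb[..., :3] @ np.array([0.299, 0.587, 0.114])


def count_segments(gray, threshold=0.5):
    """Count 4-connected bright regions after a fixed threshold.

    Hypothetical stand-in for the segmentation step; the project's script
    may use a different method.
    """
    mask = gray > threshold
    seen = np.zeros_like(mask, dtype=bool)
    height, width = mask.shape
    segments = 0
    for r in range(height):
        for c in range(width):
            if not mask[r, c] or seen[r, c]:
                continue
            # New unvisited foreground pixel: flood-fill its whole region.
            segments += 1
            seen[r, c] = True
            stack = [(r, c)]
            while stack:
                y, x = stack.pop()
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                    inside = 0 <= ny < height and 0 <= nx < width
                    if inside and mask[ny, nx] and not seen[ny, nx]:
                        seen[ny, nx] = True
                        stack.append((ny, nx))
    return segments


def make_toy_image(n_blobs, size=32):
    """Build a synthetic RGB image with n_blobs separated white squares."""
    img = np.zeros((size, size, 3), dtype=float)
    for i in range(n_blobs):
        img[2:6, 2 + 6 * i:6 + 6 * i, :] = 1.0
    return img


def segment_all(images):
    """Create one delayed preprocess-then-segment task per image, then run them together."""
    tasks = [
        dask.delayed(count_segments)(dask.delayed(to_grayscale)(img))
        for img in images
    ]
    return list(dask.compute(*tasks))


# Synthetic stand-ins for the flower images.
images = [make_toy_image(n) for n in (1, 2, 3, 2)]
segment_counts = segment_all(images)
print(segment_counts)
```

Because each image gets its own independent task chain, Dask is free to schedule the chains concurrently; real images would typically be loaded inside a delayed function as well, so that reading from disk is also parallelised.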

The final output is a histogram showing the distribution of the number of segments found in the images of the flowers dataset. This result can be used for further analysis, for example to identify clusters in the data.

[Figure: hist-results, the histogram of segment counts]
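As a rough sketch of how such a distribution might be tabulated once each image has a segment count, the snippet below uses only the standard library. The function names (segment_histogram, render) and the sample counts are hypothetical and made up for illustration; they are not results from the project.

```python
from collections import Counter


def segment_histogram(segment_counts):
    """Map each segment count in the observed range to the number of images with that count."""
    if not segment_counts:
        return {}
    counts = Counter(segment_counts)
    # Include empty bins so gaps in the distribution stay visible.
    return {k: counts.get(k, 0) for k in range(min(counts), max(counts) + 1)}


def render(hist):
    """Render the histogram as text, one row per bin."""
    return "\n".join(f"{k:>3} | {'#' * v} ({v})" for k, v in hist.items())


# Made-up segment counts, one per image (illustrative only).
sample_counts = [3, 5, 3, 4, 7, 3, 5]
hist = segment_histogram(sample_counts)
print(render(hist))
```

The same dictionary can be handed to a plotting library to produce a bar chart; the text rendering is used here only to keep the example dependency-free.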

This algorithm was implemented as an academic project for the Big Data lecture given by Prof. Jean-Marc Gratien.

The data used in this project comes from the Oxford Flowers dataset.

Execution :


This ML pipeline can be executed with: python DaskFlowersSegmentation.py

Requirements :


  • Python (version > 3.6)
  • dask


Author : Benhari Abdessalam

Date : 25/02/2021
