Applying-Deep-Learning-vs-Machine-Learning-models-to-reproduce-dry-spell-sequences

Large parts of the world rely on rainfed agriculture for their food security. In Africa, 90% of the agricultural yields rely only on precipitation for irrigation purposes and approximately 80% of the population’s livelihood is highly dependent on its food production. Parts of Ghana are prone to droughts and flood events due to increasing variability of precipitation phenomena. Crop growth is sensitive to the wet- and dry spell phenomena during the rainy season. To support rural communities and small farmer in their efforts to adapt to climate change and natural variability, it is crucial to have good predictions of rainfall and related dry/wet spell indices.

This research constitutes an attempt to assess the dry spell patterns in the northern region of Ghana, near Burkina Faso. We aim to develop a model which by exploiting satellite products overcomes the poor temporal and spatial coverage of existing ground precipitation measurements. For this purpose 14 meteorological stations featuring different temporal coverage are used together with satellite-based precipitation products. Conventional machine-learning and deep-learning algorithms were compared in an attempt to establish a link between satellite products and field rainfall data for dry spell assessment.

The deep-learning architecture used should be able to efficiently manipulate spatial data. Hence, Convolutional Neural Networks were used in order to detect spatial patterns in the satellite data. Using these models we will attempt to exploit the long temporal coverage of the satellite products in order to overcome the poor temporal and spatial coverage of existing ground precipitation measurements. Doing that, our final objective is to enhance our knowledge about the dry spell characteristics and, thus, provide more reliable climatic information to the farmers in the area of Northern Ghana. Defining the optimal ANN architecture for each application depends on several factors. In our case, we are looking for an ANN architecture that would actually use as input the satellite-based images of precipitation at a fixed extent around any gauge (e.g. 32x32 pixels), while the labels will be the ground observations, constituting a typical supervised-learning modeling case. This process can be efficiently handled by a CNN. The input satellite precipitation images are passed through the feature extraction part and, then, the classifier of the CNN and the output is mapped in [0, 1]. For a more detailed analysis of the technical characteristics of the proposed CNN (convolutional layers, pooling layers, activation functions, padding etc.) refer to the code.

The years where the satellite-based data and the point measurements overlap will be used to train and test the model. The number of input nodes depends on the amount of spatial and temporal information we choose to use. The output node will feature a cost function related to the discrepancy between the modeled output of a sigmoid function and observed wet/dry day label. Based on that, the Stochastic Gradient Descent (SGD) will optimize the values of all the trainable parameters of the ANN. The CNN architecture is chosen to perform this task because of its efficiency in dealing with gridded data and detecting spatial patterns. There are many features that give an advantage to CNNs when handling spatial data (e.g. in image multi-class classification tasks) compared to conventional ANN architectures. One of them is parameter (or weight) sharing. Parameter sharing introduces local connectivity in the convolutional layers of the feature extraction part of the CNN, resulting to each neuron being connected only to a subset of the input image. Thus, the total number of trainable parameters is drastically reduced, minimizing the computational expense and the complexity of the network. Apart from this, with parameter sharing the convolutional layer gains the ability to detect similar spatial patterns in different locations within the input image. Another concept of CNN that makes it suitable for this particular problem is spatial inductive bias. When referred to inductive bias of an ANN in computer vision, we refer to an ensemble of assumptions that assist the model to generate unknown output. In the case of gridded data handled by a CNN, spatial inductive bias assumes a certain spatial structure of the data. Two similar but not identical concepts of CNNs are also relevant to our classification task: transitional invariance and transitional equivariance. Translational invariance expresses the ability of the CNN to generate robust output even when the initial input is translated. Translational equivariance expresses the ability of a CNN to detect the location of the spatial pattern of interest in the image without it being in a certain pre-known position.

Even though the aforementioned concepts give a clear advantage to CNNs when dealing with spatial data, the potential of a conventional FNN is also being examined. The number and type of hidden layers, as well as the overall architecture of the FNN, is defined after trial-and-error processes and is a subject closely related to the spatiotemporal characteristics of the data used. Overall, the process scheme of all the modeling strategies used in this research are:

Four benchmark ML classifiers (Logistic Regression, Random Forest, Gaussian Naïve Bayes and Support Vector Machine) and one benchmark FNN;
The main CNN classifier;
An alternative of the CNN classifier which also incorporates the stations latitude as an input variable;
Different modeling approaches attempting to confront class imbalance;
Several “sophisticated” ensembles of models built in an attempt to combine the information of different satellite precipitation products, featuring majority vote models, models that stack different satellite data in parallel input channels and multi-resolution models;
Three state-of-art classifiers used for multi-class classification fine-tuned and tested for this binary classification task.

Complete report at https://repository.tudelft.nl/islandora/object/uuid%3Adbe50c7f-643e-4f9c-b6b0-8a20ddf7262c

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
Abstract		Abstract
Code		Code
Data		Data
Environment		Environment
.gitattributes		.gitattributes
.gitignore		.gitignore
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Abstract

Abstract

Code

Code

Data

Data

Environment

Environment

.gitattributes

.gitattributes

.gitignore

.gitignore

README.md

README.md

Repository files navigation

Applying-Deep-Learning-vs-Machine-Learning-models-to-reproduce-dry-spell-sequences

About

Releases

Packages

Languages

Pargo18/Applying-Deep-Learning-vs-Machine-Learning-models-to-reproduce-dry-spell-sequences

Folders and files

Latest commit

History

Repository files navigation

Applying-Deep-Learning-vs-Machine-Learning-models-to-reproduce-dry-spell-sequences

About

Topics

Resources

Stars

Watchers

Forks

Languages