Merge pull request #131 from innovationOUtside/wip

Update week 8
innovationOUtside · Nov 13, 2020 · b41e06a · b41e06a
2 parents e701749 + d4c6df4
commit b41e06a
Show file tree

Hide file tree

Showing 13 changed files with 2,324 additions and 4,095 deletions.
diff --git a/...i-agent systems/.md/08.1 Introducing remote services and multi-agent systems.md b/...i-agent systems/.md/08.1 Introducing remote services and multi-agent systems.md
diff --git a/...nt systems/.md/08.2 Collecting digit image and class data from the simulator.md b/...nt systems/.md/08.2 Collecting digit image and class data from the simulator.md
@@ -0,0 +1,259 @@
+---
+jupyter:
+  jupytext:
+    formats: ipynb,.md//md
+    text_representation:
+      extension: .md
+      format_name: markdown
+      format_version: '1.2'
+      jupytext_version: 1.6.0
+  kernelspec:
+    display_name: Python 3
+    language: python
+    name: python3
+---
+
+## 2. Collecting digit image and class data from the simulator
+
+If we wanted to collect image data from the background and then train a network using those images, we would need to generate the training label somehow. We could do this manually, looking at each image and then by observation recording the digit value, associating it with the image location co-ordinates. But could we also encode the digit value explicitly somehow?
+
+If you look carefully at the *MNIST_Digits* background in the simulator, you will see that alongside each digit is a solid coloured area. This area is a greyscale value that represents the value of the digit represented by the image. That is, it represents a training label for the digit.
+
+<!-- #region tags=["alert-success"] -->
+*The greyscale encoding is quite a crude encoding method that is perhaps subject to noise. Another approach might be to use a simple QR code to encode the digit value.*
+<!-- #endregion -->
+
+As usual, load in the simulator in the normal way:
+
+```python
+from nbev3devsim.load_nbev3devwidget import roboSim, eds
+
+%load_ext nbev3devsim
+```
+
+Clear the datalog just to ensure we have a clean datalog to work with:
+
+```python
+%sim_data --clear
+```
+
+The solid greyscale areas are arranged so that when the left light sensor is over the image, the right sensor is over the training label area.
+
+```python
+%%sim_magic_preloaded -b MNIST_Digits -O -R -AH -x 400 -y 50
+
+#Sample the light sensor reading
+sensor_value = colorLeft.reflected_light_intensity
+
+# This is essentially a command invocation
+# not just a print statement!
+print("image_data both")
+```
+
+We can retrieve the last pair of images from the `roboSim.image_data()` dataframe using the `get_sensor_image_pair()` function:
+
+```python
+from nn_tools.sensor_data import zoom_img
+from nn_tools.sensor_data import get_sensor_image_pair
+
+# The sample pair we want from the logged image data
+pair_index = -1
+
+left_img, right_img = get_sensor_image_pair(roboSim.image_data(),
+                                            pair_index)
+
+zoom_img(left_img), zoom_img(right_img)
+
+```
+
+<!-- #region tags=["alert-success"] -->
+The image labels are encoded as follows:
+
+`greyscale_value = 25 * digit_value`
+<!-- #endregion -->
+
+One way of decoding the label is as follows:
+
+- divide each of the greyscale pixel values collected from the right hand sensor array by 25;
+- take the median of these values and round to the nearest integer; *in a noise free environment, using the median should give a reasonable estimate of the dominant pixel value in the frame.*
+- ensure we have an integer by casting the result to an integer.
+
+The *pandas* package has some operators that can help us with that if we put all the data into a *pandas* *Series* (essentially, a single column dataframe):
+
+```python
+import pandas as pd
+
+def get_training_label_from_sensor(img):
+    """Return a training class label from a sensor image."""
+    # Get the pixels data as a pandas series
+    # (similar to a single column dataframe)
+    image_pixels = pd.Series(list(img.getdata()))
+
+    # Divide each value in the first column (name: 0) by 25
+    image_pixels = image_pixels / 25
+
+    # Find the median value
+    pixels_median = image_pixels.median()
+
+    # Find the nearest integer and return it
+    return int( pixels_median.round(0))
+
+# Try it out
+get_training_label_from_sensor(right_img)
+```
+
+The following function will grab the right and left images from the data log, decode the label from the right hand image, and return the handwritten digit from the left light sensor along with the training label:
+
+```python
+def get_training_data(raw_df, pair_index):
+    """Get training image and label from raw data frame."""
+
+    # Get the left and right images
+    # at specified pair index
+    left_img, right_img = get_sensor_image_pair(raw_df,
+                                            pair_index)
+
+    # Find the training label value as the median
+    # value of the right habd image.
+    # Really, we should properly try to check that
+    # we do have a proper training image, for example
+    # by encoding a recognisable pattern 
+    # such as a QR code
+    training_label = get_training_label_from_sensor(right_img)
+    return training_label, left_img
+
+
+# Try it out
+label, img = get_training_data(roboSim.image_data(),
+                               pair_index)
+print(f'Label: {label}')
+zoom_img(img)
+```
+
+<!-- #region tags=["alert-danger"] -->
+We're actually taking quite a lot on trust in extracting the data from the dataframe in this way. Ideally, we would have a unique identifiers that reliably associate the left and right images as having been sampled from the same location. As it is, we assume the left and right image datasets appear in that order, one after the other, so we can count back up the dataframe to collect different pairs of data.
+<!-- #endregion -->
+
+Load in our previously trained MLP classifier:
+
+```python
+# Load model
+from joblib import load
+
+MLP = load('mlp_mnist14x14.joblib')
+```
+
+We can now test that image against the classifier:
+
+```python
+from nn_tools.network_views import image_class_predictor
+
+image_class_predictor(MLP, img)
+```
+
+<!-- #region activity=true -->
+### 2.3.1 Activity — Testing the ability to recognise images slight off-center in the image array
+
+Write a simple program to collect sample data at a particular location and then display the digit image and the decoded label value.
+
+Modify the x or y co-ordinates used to locate the robot by by a few pixel values away from the sampling point origins and test the ability of the network to recognise digits that are lightly off-center in the image array.
+
+How well does the network perform?
+
+*Hint: when you have run your program to collect the data in the simulator, run the `get_training_data()` with the `roboSim.image_data()` to generate the test image and retrieve its decoded training label.*
+
+*Hint: use the `image_class_predictor()` function with the test image to see if the classifier can recognise the image.*
+
+*Hint: if you seem to have more data in the dataframe than you thought you had collected, did you remember to clear the datalog before collecting your data?*
+<!-- #endregion -->
+
+```python
+# Your code here
+```
+
+<!-- #region student=true -->
+*Record your observations here.*
+<!-- #endregion -->
+
+<!-- #region activity=true -->
+### 2.3.2 Activity — Collecting image sample data from the *MNIST_Digits* background (optional)
+
+In this activity, you will need to collect a complete set of sample data from the simulator to test the ability of the network to correctly identify the handwritten digit images.
+
+Recall that the sampling positions are arranged along rows 100 pixels apart, starting at x=100 and ending at x=2000;
+along columns 100 pixels apart, starting at y=50 and ending at y=1050.
+
+Write a program to automate the collection of data at each of these locations.
+
+How would you then retrieve the hand written digit image and it's decoded training label?
+
+*Hint: import the `time` package and use the `time.sleep` function to provide a short delay between each sample collection. You may also find it convenient to import the `trange` function to provide a progress bar indicator when iterating through the list of collection locations: `from tqdm.notebook import trange`.*
+<!-- #endregion -->
+
+<!-- #region student=true -->
+*Your program design notes here.*
+<!-- #endregion -->
+
+```python student=true
+# Your program code
+```
+
+<!-- #region student=true -->
+*Describe here how you would retrieve the hand written digit image and it's decoded training label.*
+<!-- #endregion -->
+
+<!-- #region activity=true heading_collapsed=true -->
+#### Example solution
+
+*Click on the arrow in the sidebar or run this cell to reveal an example solution.*
+<!-- #endregion -->
+
+<!-- #region activity=true hidden=true -->
+To collect the data, I use two `range()` commands, one inside the other, to iterate through the *x* and *y* coordinate values. The outer loop generates the *x* values and the inner loop generates the *y* values:
+<!-- #endregion -->
+
+```python activity=true hidden=true
+# Make use of the progress bar indicated range
+from tqdm.notebook import trange
+import time
+
+# Clear the datalog so we know it's empty
+%sim_data --clear
+
+
+# Generate a list of integers with desired range and gap
+min_value = 50
+max_value = 1050
+step = 100
+
+for _x in trange(100, 501, 100):
+    for _y in range(min_value, max_value+1, step):
+
+        %sim_magic -R -x $_x -y $_y
+        # Give the data time to synchronise
+        time.sleep(1)
+```
+
+<!-- #region activity=true hidden=true -->
+We can now grab and view the data we have collected:
+<!-- #endregion -->
+
+```python activity=true hidden=true
+training_df = roboSim.image_data()
+training_df
+```
+
+<!-- #region activity=true hidden=true -->
+The `get_training_data()` function provides a convenient way of retrieving the handwritten digit image and the decoded training label.
+<!-- #endregion -->
+
+```python activity=true hidden=true
+label, img = get_training_data(training_df, pair_index)
+zoom_img(img), label
+```
+
+## 2.4 Summary
+
+In this notebook, you have automated the collection of hand-written digit and encoded label image data from the simulator ad seen how this can be used to generate training data made up of scanned handwritten digit and image label pairs. In principle, we could use the image and test label data collected in this way as a training data set for an MLP or convolutional neural network.
+
+The next notebook in the series is optional and demonstrates the performance of a CNN on the MNIST dataset. The required content continues with a look at how we can start to collect image data using the simulated robot whilst it is on the move.
diff --git a/...onvolutional neural network (optional).md → ...onvolutional neural network (optional).md b/...onvolutional neural network (optional).md → ...onvolutional neural network (optional).md
@@ -7,7 +7,7 @@ jupyter:
       extension: .md
       format_name: markdown
       format_version: '1.2'
-      jupytext_version: 1.5.2
+      jupytext_version: 1.6.0
   kernelspec:
     display_name: Python 3
     language: python
@@ -20,14 +20,14 @@ __This notebook contains optional study material. You are not required to work t
 *This notebook demonstrates the effectiveness of a pre-trained convolutional neural network (CNN) at classifying MLP handwritten digit images.*
 <!-- #endregion -->
 
-# 2 Recognising digits using a convolutional neural network (optional)
+# 3 Recognising digits using a convolutional neural network (optional)
 
 In the previous notebook, you saw how we could collect image data sampled by the robot within the simulator into the notebook environment and then test the collected images against an "offboard" pre-trained multilayer perceptron run via the notebook's Python environment. However, even with an MLP tested on "jiggled" images, the network's classification performance degrades when "off-center" images are presented to it.
 
 In this notebook, you will see how we can use a convolutional neural network running in the notebook's Python environment to classify images retrieved from the robot in the simulator.
 
 
-## 2.1 Using a pre-trained convolutional neural network
+## 3.1 Using a pre-trained convolutional neural network
 
 Although training a convolutional neural network can take quite a lot of time, and a *lot* of computational effort, off-the-shelf pre-trained models are also increasingly available. However, whilst this means you may be able to get started on a recognition task without the requirement to build your own model, you would do well to remember the phrase *caveat emptor*: buyer beware.
 
@@ -42,7 +42,7 @@ However, you should be aware when using third party models that they may incorpo
 The following example uses a pre-trained convolutional neural network model implemented as a TensorFlow Lite model. [*TensorFlow Lite*](https://www.tensorflow.org/lite/) is a framework developed to support the deployment of TensorFlow Model on internet of things (IoT) devices. As such, the models are optimised to be as small as possible and to be evaluated as computationally quickly and efficiently as possible.
 
 
-### 2.1.1 Loading the CNN
+### 3.1.1 Loading the CNN
 
 The first thing we need to do is to load in the model. The actual TensorFlow Lite framework code is a little bit fiddly in places, so we'll use some convenience functions to make using the framework slightly easier.
 
@@ -62,10 +62,10 @@ from nn_tools.network_views import cnn_get_details
 cnn_get_details(cnn, report=True)
 ```
 
-The main take away from this report are the items the describe the structure of the input and output arrays. In particular, we have an input array of a single 28x28 pixel greyscale image array, and an output of 10 classification classes. Each output gives the probability with which the CNN believes the image represents the corresponding digit.
+The main take away from this report are the items that describe the structure of the input and output arrays. In particular, we have an input array of a single 28x28 pixel greyscale image array, and an output of 10 classification classes. Each output gives the probability with which the CNN believes the image represents the corresponding digit.
 
 
-### 2.1.2 Testing the network
+### 3.1.2 Testing the network
 
 We'll test the network with images retrieved from the simulator.
 
@@ -162,7 +162,7 @@ Let's test this offset image to see if our convolutional neural network can stil
 cnn_test_with_image(cnn, img, rank=2)
 ```
 
-### 2.1.3 Activity — Testing the CNN using robot collected image samples
+### 3.1.3 Activity — Testing the CNN using robot collected image samples
 
 The `ipywidget` powered end user application defined in the code cell below will place the robot at a randomly selected digit location and display and then test the image grabbed from *the previous location* using the CNN.
 
@@ -200,7 +200,7 @@ def random_MNIST_location(location_noise = False):
     cnn_test_with_image(cnn, img, rank=3)
 ```
 
-## 2.2 Summary
+## 3.2 Summary
 
 In this notebook, you have seen how we can use a convolutional neural network to identify handwritten digits scanned by the robot in the simulator.