Deploy, and Consume a Deep Learning Model

Ever wonder how AI is infused into applications, or how custom built AI can be deployed and consumed offline on the edge of a network or in the Cloud? The following example showcases just how to accomplish this task.

We will create and deploy two different applications in docker containers. The first container will contain a deep learning model, wrapped by a Python Flask Web app exposing a REST API to make predictions using the deep learning model. The second container will host a React web application that utilizes the API hosted by the first container to create a web based photo manipulation tool backed by deep learning. The complete application will not call any external services, and can be run offline on the edges of a network.

Use-case

Build an application that can run offline or in the cloud, utilizing a deep learning model to provide insights. We want to simplify the task of extracting objects from images utilizing image segmentation.

Prerequisites

docker: The Docker command-line iterface. Follow the installations instructions for your system. or use PlayWithDocker online
IBM Cloud Account
(Optional) Promo Code for Free Kubernetes Clusters
(Optional) IBM Cloud CLI Installation Instructions
(Optional) IBM Cloud Kubernetes Service Installation Instructions

Steps

Deploy the Image Segmentation Model Web Application and API to IBM's Kubernetes Service
Finding or building the right deep learning model
Looking closely at the code
Consuming the Image Segmentation API
Start up the Image Segmentation Web Application

1. Deploy the Image Segmentation Model API to IBM's Kubernetes Service

Clone this GitHub repository https://github.com/justinmccoy/max-meetup provides instructions yaml file for deploying the deep learning model as a REST API and web application on IBM's Kubernetes Service.

git clone https://github.com/justinmccoy/max-meetup
cd max-meetup

** Note you need a working ibmcloud cli, and a Kubernetes Cluster created

Create a new Kubernetes Cluster in US South Region

Spin up the IBM CLI Development Docker Container:

# -v option below is mounting current directory to /workspace on container
docker run -ti -v ./:/workspace ibmcom/ibm-cloud-developer-tools-amd64

Login into IBM Cloud account

# use --sso if logging onto account with Federated ID
ibmcloud login

Setup to use the IBM South Region (Free Cluster Access):

ibmcloud ks region-set us-south

Download and export the kubectl configuration for interacting with your newly created cluster:

ibmcloud ks cluster-config YOUR_CLUSTER_NAME

List details about Kubernetes cluster, note the external IP address

ibmcloud ks workers --cluster YOUR_CLUSTER_NAME

Create a new deployment of the MAX-Image-Segmentation REST API

kubectl create -f max-image-segmenter.yaml

Show Kubernetes Deployments on cluster:

kubectl get deployments

Show Kubernetes Services on cluster

kubectl get services -o wide

Show details about Kubernetes service and deployment

kubectl describe deployment max-image-segmenter
kubectl describe service max-image-segmenter

At this point you've deployed your the deep learning model's REST API and a React Web App to Kubernetes, and exposed it to the internet through the defined NodePorts of 30050 30030. You can now reach the API at the external IP address noted above, at port 30050, or the web application at the external IP address at port 30030.

For example my Web App is deployed to http://184.172.250.9:30030

2. Finding or building the right deep learning model

Before we can enable an application with insights from a machine learning model we first must figure out what we want to accomplish, find a model, or build one for ourselves.

There are several different common usecases, including Image Classification, Image Segmentation, Object Detection, Generative Models, etc.

Building a new machine learning model or deep learning model is a big undertaking, one that requires some specialized expertise and a lot of data, before going down that road there are several places to look for pre trained models. IBM has been working to train and release deep learning and machine learning models across several usage domains available in the MAX Model Exchange.

Image Segmentation Model

Clone the models GitHub repo

git clone https://github.com/IBM/MAX-Image-Segmenter
cd MAX-Image-Segmenter

This Image Segmenation model has been trained on 20 different objects.

The segmentation map returns an integer between 0 and 20 that corresponds to one of the labels below for each pixel in the input image. The first nested array corresponds to the top row of pixels in the image and the first element in that array corresponds to the pixel at the top left hand corner of the image. NOTE: the image will be resized and the segmentation map refers to pixels in the resized image, not the original input image.

Id	Label	Id	Label	Id	Label
0	background	7	car	14	motorbike
1	aeroplane	8	cat	15	person
2	bicycle	9	chair	16	pottedplant
3	bird	10	cow	17	sheep
4	boat	11	diningtable	18	sofa
5	bottle	12	dog	19	train
6	bus	13	horse	20	tv

To learn how to build you own models see the following code patterns:

Looking closely at the code

Clone the following GitHub Repo: https://github.com/IBM/MAX-Image-Segmenter

git clone https://github.com/IBM/MAX-Image-Segmenter

Great, we've found a model, and when using a model from MAX it's already packaged and fronted with an Flask app providing an API interface to the trained machine learning model, let's take a closer look at what's happening to host and front a deep learning image model as an API.

Our implementation is using Python Flask to front the deep learning model as a REST API, defining the endpoints and hosting the application as a web service. Bundled within the Python web service and /predict API is an application that loads the trained deep learning image segmentation model using Tensorflow for Python, and wraps the model with some helper methods to simplify prediction when called from our Flask application.

Flask Web Service exposing two HTTP endpoints

POST /model/predict GET /model/metadata

Calling POST on the /model/predict endpoint creates a new instance of the ModelWrapper class where the deep learning image segmentation model is loaded for inference. Calling predict endpoint with an image returns a mapping of pixel segments where an object has been detected, along with the resized version of the image.

Let's dig into the code a bit:

How is the container built?
What application is started when loading the container?
What's happening when the /predict URI is invoked
Where is the deep learning model coming from?
How is the model being loaded?
How do you call the model?

Consuming the Image Segmentation API

Using the Juypter Notebook container we can quickly setup an environment to test the API

Run the following docker container with a Juypter Notebook and Python libraries installed:

docker run -it --rm -p 8888:8888 jupyter/tensorflow-notebook

One up and running go to the url return from the command above, and import the docs/selfie.jpg and the demo.ipynb notebook.

Run the demo.ipynb notebook to test the deployed deep learning REST API.

Start up the Image Segmentation Web Application

The MAX-Image-Segmentation Code Pattern has a prebuilt React Web Application that utilizes the API to extract out object identified by the image segmentation deep learning model.

The details and code for this web application are available on GitHub but we need to configure it differently to connect to the REST API deployed on the IBM Cloud Kubernetes Cluster

Connect to deployed Deep Learning Model REST API

docker run -it -e REACT_APP_DEPLOY_TYPE='KUBE' -e REACT_APP_KUBE_IP="<IP_ADDRESS_OF_YOUR_KUBERNETES_CLUSTER>" -e REACT_APP_KUBE_MODEL_PORT='30050' -p 3000:3000 codait/max-image-segmenter-web-app

Open up your browser and go to http://localhost:3000

License

Component	License	Link
This repository	Apache 2.0	LICENSE

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

docs

docs

LICENSE

LICENSE

README.md

README.md

demo.ipynb

demo.ipynb

max-image-segmenter.yaml

max-image-segmenter.yaml

Repository files navigation

Deploy, and Consume a Deep Learning Model

Use-case

Prerequisites

Steps

1. Deploy the Image Segmentation Model API to IBM's Kubernetes Service

Create a new Kubernetes Cluster in US South Region

2. Finding or building the right deep learning model

Looking closely at the code

Consuming the Image Segmentation API

Start up the Image Segmentation Web Application

License

Additional Resources

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
docs		docs
LICENSE		LICENSE
README.md		README.md
demo.ipynb		demo.ipynb
max-image-segmenter.yaml		max-image-segmenter.yaml

License

justinmccoy/max-meetup

Folders and files

Latest commit

History

Repository files navigation

Deploy, and Consume a Deep Learning Model

Use-case

Prerequisites

Steps

1. Deploy the Image Segmentation Model API to IBM's Kubernetes Service

2. Finding or building the right deep learning model

Looking closely at the code

Consuming the Image Segmentation API

Start up the Image Segmentation Web Application

License

Additional Resources

About

Topics

Resources

License

Stars

Watchers

Forks

Languages