Model_Deployment_Sagemaker_Docker_HuggingFace:

1. Reason for model selection - >

I chose DistilBERT base uncased distilled SQuAD as my Huggingface model. DistilBERT is a transformers model, smaller and faster than BERT, which was pretrained on the same corpus in a self-supervised fashion, using the BERT base model as a teacher. This means it was pretrained on the raw texts only, with no humans labelling them in any way (which is why it can use lots of publicly available data) with an automatic process to generate inputs and labels from those texts using the BERT base model. More precisely, it was pretrained with three objectives:

i. Distillation loss: the model was trained to return the same probabilities as the BERT base model.

ii. Masked language modeling (MLM): this is part of the original training loss of the BERT base model. When taking a sentence, the model randomly masks 15% of the words in the input then run the entire masked sentence through the model and has to predict the masked words. This is different from traditional recurrent neural networks (RNNs) that usually see the words one after the other, or from autoregressive models like GPT which internally mask the future tokens. It allows the model to learn a bidirectional representation of the sentence.

iii. Cosine embedding loss: the model was also trained to generate hidden states as close as possible as the BERT base model. This way, the model learns the same inner representation of the English language than its teacher model, while being faster for inference or downstream tasks.

2. Model_Deployment_Docker.ipynb: Post to Container endpoint & prints out the response - >

i. Install Sagemaker SDK, Transformers in Python3

ii. Install necessary Libraries

iii. Create SageMaker endpoint with the chosen model

iv. Create Hugging Face Model Class

v. Deploy model to SageMaker Inference

vi. Test with Sample Input

vii. Run Predictor

3. Dockerfile - >

i. Create Dockerfile

ii. Pull image from Dockerhub

iii. Set nginx- server components to support multiple parallel incoming requests

iv. Install necessary dependencies for SageMaker Inference Toolkit

v. Install pip requirements

vi. Install Hugging Face libraries and its dependencies

vii. Copy entrypoint script to the image

viii. Copy the default custom service file to handle incoming data and inference requests

ix. Build Docker Image

x. Define an entrypoint script for the docker image

xi. Define command to be passed to the entrypoint

xii. Run Inference and Check Docker Image in 'Docker Desktop'

Name		Name	Last commit message	Last commit date
Latest commit History 23 Commits
Dockerfile		Dockerfile
Model_Deployment_Docker.ipynb		Model_Deployment_Docker.ipynb
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Dockerfile

Dockerfile

Model_Deployment_Docker.ipynb

Model_Deployment_Docker.ipynb

README.md

README.md

requirements.txt

requirements.txt

Repository files navigation

Model_Deployment_Sagemaker_Docker_HuggingFace:

1. Reason for model selection - >

2. Model_Deployment_Docker.ipynb: Post to Container endpoint & prints out the response - >

3. Dockerfile - >

About

Releases

Packages

Languages

SaifurRR/Model-Deployment-using-AWS-Sagemaker-Docker

Folders and files

Latest commit

History

Repository files navigation

Model_Deployment_Sagemaker_Docker_HuggingFace:

1. Reason for model selection - >

2. Model_Deployment_Docker.ipynb: Post to Container endpoint & prints out the response - >

3. Dockerfile - >

About

Topics

Resources

Stars

Watchers

Forks

Languages