Skip to content

SpringerNLP/Chapter5

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Chapter 5 - Embeddings

Requirements

Data

We will leverage the Open American National Corpus, which consists of roughly 15 million spoken and written words from a variety of sources. Specifically, we will be using the subcorpus which consists of 4531 Slate magazine articles from 1996 to 2000 (approximately 4.2 million words).

Running the Docker Image

The docker images for this case study are located on dockerhub. Running the commands below will automatically download and start a jupyter notebook.

Run the Docker image:

docker run -p 8888:8888 --rm springernlp/chapter_5:latest

Building the Docker image

docker build -t chapter_5:latest .

Book Reference

More information can be found at: Deep Learning for NLP and Speech Recognition by Springer