Chapter 5 - Embeddings

Requirements

Data

This case study uses the Open American National Corpus (OANC), which consists of roughly 15 million words of spoken and written American English from a variety of sources. Specifically, it uses the subcorpus of 4,531 Slate magazine articles published from 1996 to 2000 (approximately 4.2 million words).
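As a quick sanity check on the corpus size, the document and word counts can be approximated with a few lines of Python. This is only a minimal sketch: the function name `corpus_stats` is hypothetical, and it assumes the articles are stored as plain-text `.txt` files somewhere under a directory such as `data`, which may not match the actual layout of this repository. Whitespace tokenization will also give a slightly different total than the figure quoted above.

```python
from pathlib import Path


def corpus_stats(corpus_dir, pattern="*.txt"):
    """Return (number of files, total whitespace-delimited tokens).

    Assumes one article per plain-text file; both the directory layout
    and the file extension are assumptions, not taken from the repository.
    """
    n_docs = 0
    n_tokens = 0
    # rglob walks the directory tree recursively, so nested folders are included.
    for path in sorted(Path(corpus_dir).rglob(pattern)):
        text = path.read_text(encoding="utf-8", errors="ignore")
        n_docs += 1
        # str.split() with no arguments splits on any run of whitespace.
        n_tokens += len(text.split())
    return n_docs, n_tokens
```

Calling `corpus_stats("data")` from the repository root would then return a pair that can be compared against the article and word counts given above.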

Running the Docker Image

The Docker images for this case study are hosted on Docker Hub. Running the command below will automatically download the image and start a Jupyter notebook server on port 8888.

Run the Docker image:

docker run -p 8888:8888 --rm springernlp/chapter_5:latest

Building the Docker Image

docker build -t chapter_5:latest .

Book Reference

More information can be found in Deep Learning for NLP and Speech Recognition, published by Springer.
