subvoc

This project was created by me to scratch my own itch. I love to watch movies and am always keen to expand my vocabulary. But it's difficult to notice an unknown word during a movie without spoiling the experience. That's where subvoc comes in: search for a movie and discover its vocabulary.

Online Demo

Visit https://subvoc.stephanbehnke.com (hosted on Heroku, takes a few moments to start sometimes).

NOTE: The external API can be flaky - you can visit a cached analysis in this case.

To get a quick impression, here are some screenshots:

Homepage	Find Movie	List of words	Word details

How it works

When you select a movie, the OpenSubtitles API is queried for its subtitles. Then, the result is parsed, tokenized and analyzed sentence by sentence, word by word with the help of the Python Natural Language Toolkit. The difficulty of a word is determined by its relative frequency in the English language, assuming that more difficult words are simply used less.

Features

Development

(requires Docker)

run server with scripts/dev-py.sh
build client scripts/dev-js.sh
run tests with scripts/test-py.sh and scripts/test-js.sh

License

MIT (see LICENSE).

Name		Name	Last commit message	Last commit date
Latest commit History 304 Commits
.vscode		.vscode
api		api
corpora		corpora
domain		domain
fixtures		fixtures
scripts		scripts
static		static
templates		templates
web		web
.babelrc		.babelrc
.coveragerc		.coveragerc
.eslintrc		.eslintrc
.flake8		.flake8
.gitignore		.gitignore
.nltk_packages		.nltk_packages
.travis.yml		.travis.yml
Dockerfile		Dockerfile
LICENSE		LICENSE
Procfile		Procfile
README.md		README.md
config_dev.py		config_dev.py
config_prod.py		config_prod.py
config_test.py		config_test.py
dev-requirements.txt		dev-requirements.txt
main.py		main.py
package.json		package.json
requirements.txt		requirements.txt
rollup.config.js		rollup.config.js
run.py		run.py
setup.cfg		setup.cfg

License

stephanos/subvoc

Folders and files

Latest commit

History

Repository files navigation

subvoc

Online Demo

How it works

Features

Development

License

About

Topics

Resources

License

Stars

Watchers

Forks

Languages