Document classification by skip gram (Negative sampling)

Perform embedding of words in the text so that it has the highest relationship with the embedding of the document label.

Running

Create skip gram dataset for training

python data/generated_data/generate_data.py

Train skip gram model

python train_word2vec.py

Train classifier

python classifier.py

Classification results

Train	Test	Validation
`1.0`	`0.973`	`0.979`

Reference

Pythonic Excursions: Optimize Computational Efficiency of Skip-Gram with Negative Sampling

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
checkpoint		checkpoint
data		data
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
classifier.py		classifier.py
model.py		model.py
train_word2vec.py		train_word2vec.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

checkpoint

checkpoint

data

data

.gitignore

.gitignore

LICENSE

LICENSE

README.md

README.md

classifier.py

classifier.py

model.py

model.py

train_word2vec.py

train_word2vec.py

utils.py

utils.py

Repository files navigation

Document classification by skip gram (Negative sampling)

Running

Classification results

Reference

About

Releases

Packages

Languages

License

hautran7201/skip_gram_for_document_classification

Folders and files

Latest commit

History

Repository files navigation

Document classification by skip gram (Negative sampling)

Running

Classification results

Reference

About

Topics

Resources

License

Stars

Watchers

Forks

Languages