Skip to content

Latest commit

 

History

History
12 lines (7 loc) · 484 Bytes

README.md

File metadata and controls

12 lines (7 loc) · 484 Bytes

Description

Implementation of SIF sentence embedding in Java.

  • Use pre-computed tfidf weights & word2vec to compute weighted sentence embedding.
  • Use PCA to compute principle components for all pairs of sentences and remove it.

For usage see test cases.

Data

Please download word2vec to data. I used glove.6B.50d.txt from Glove.