Universal Sentence Representations

This project experiments with different ways to learn universal sentence representations. Four neural models are implemented to encode sentences. Each model is trained on the Stanford Natural Language Inference (SNLI) corpus to classify the relation between a pair of sentences (entailment, contradiction, or neutral). The learned sentence representations are then evaluated with the SentEval framework.
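To make the sentence-pair setup concrete: the paper by Conneau et al. listed under References combines the two sentence vectors u and v into one feature vector [u, v, |u - v|, u * v] before a classifier predicts the relation. The sketch below shows that combination in plain Python. The function name `combine_pair` and the toy vectors are hypothetical, and whether classifier.py in this repository builds its input the same way is an assumption.

```python
from typing import List


def combine_pair(u: List[float], v: List[float]) -> List[float]:
    """Concatenate u, v, |u - v| and u * v (element-wise), as described by Conneau et al."""
    if len(u) != len(v):
        raise ValueError("both sentence vectors must have the same size")
    abs_diff = [abs(a - b) for a, b in zip(u, v)]
    product = [a * b for a, b in zip(u, v)]
    return u + v + abs_diff + product


# Toy 3-dimensional sentence vectors (hypothetical values, not model output).
premise = [1.0, 2.0, 0.5]
hypothesis = [0.5, 2.0, 1.5]
features = combine_pair(premise, hypothesis)
print(len(features))  # 12, i.e. 4 times the sentence vector size
```

The resulting feature vector is four times the size of a single sentence vector, which is what a downstream classifier would take as input under this assumption.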

Project Organization

The project is organized as follows:

models/ ............... Model Architectures
- embedding_encoder.py
- uni_lstm.py
- bi_lstm.py
- bi_lstm_pool.py
- classifier.py
configs/ .............. Experiment Configurations
- baseline.py
- uni_lstm.py
- bi_lstm.py
- bi_lstm_pool.py
checkpoints/ .......... Trained Models
results/ .............. Experiment Logs
trainer.py ............ Training Logic
utils.py .............. Utilities
main.py ............... Run Experiments
demo.ipynb ............ Demo & Analysis
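The file names under models/ suggest that the encoders differ mainly in how they turn per-token vectors into a single sentence vector. The sketch below contrasts two common choices in plain Python: averaging the token vectors (a typical embedding baseline) and max-pooling over time (as in the BiLSTM with max pooling from Conneau et al.). Mapping these to embedding_encoder.py and bi_lstm_pool.py is an assumption, and the function names are hypothetical.

```python
from typing import List

Vector = List[float]


def mean_pool(token_vectors: List[Vector]) -> Vector:
    """Average each dimension over all tokens in the sentence."""
    if not token_vectors:
        raise ValueError("need at least one token vector")
    n = len(token_vectors)
    return [sum(dim) / n for dim in zip(*token_vectors)]


def max_pool(token_vectors: List[Vector]) -> Vector:
    """Take the maximum of each dimension over all tokens in the sentence."""
    if not token_vectors:
        raise ValueError("need at least one token vector")
    return [max(dim) for dim in zip(*token_vectors)]


# Three tokens with 2-dimensional vectors (hypothetical values).
tokens = [[1.0, -2.0], [3.0, 0.0], [2.0, 5.0]]
print(mean_pool(tokens))  # [2.0, 1.0]
print(max_pool(tokens))   # [3.0, 5.0]
```

Both functions return a vector of the same size regardless of sentence length, which is the property a fixed-size sentence representation needs.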

How to run?

Create and activate the conda environment

 conda env create -f environment.yml
 conda activate prod

Download the pre-trained checkpoints from Google Drive
Install SentEval
Run the demonstration and analysis notebook: demo.ipynb
To run our experiments:

python3 main.py --config=configs.baseline --train=True --test=True
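The command above passes the configuration as a dotted module path and the train/test switches as `True`/`False` strings. A minimal sketch of how such arguments could be parsed follows; it is not taken from main.py. The helper names `str2bool`, `parse_args`, and `load_config` are hypothetical, and so is the assumption that each config module exposes a dictionary named `config`.

```python
import argparse
import importlib


def str2bool(value: str) -> bool:
    """Parse 'True'/'False' strings; argparse's type=bool would treat 'False' as True."""
    lowered = value.lower()
    if lowered in ("true", "1", "yes"):
        return True
    if lowered in ("false", "0", "no"):
        return False
    raise argparse.ArgumentTypeError(f"expected a boolean, got {value!r}")


def parse_args(argv=None) -> argparse.Namespace:
    """Parse the three flags used in the command above."""
    parser = argparse.ArgumentParser(description="Run an experiment")
    parser.add_argument("--config", required=True,
                        help="dotted module path, e.g. configs.baseline")
    parser.add_argument("--train", type=str2bool, default=False)
    parser.add_argument("--test", type=str2bool, default=False)
    return parser.parse_args(argv)


def load_config(module_path: str) -> dict:
    """Import the module and return a copy of its `config` dict (attribute name assumed)."""
    module = importlib.import_module(module_path)
    return dict(module.config)


args = parse_args(["--config=configs.baseline", "--train=True", "--test=False"])
print(args.config, args.train, args.test)  # configs.baseline True False
```

The explicit `str2bool` converter matters here because `bool("False")` is `True` in Python, so a naive `type=bool` would silently enable a stage that was meant to be skipped.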

To design your own experiment, use the configuration dictionaries in the configs directory as follows:

{
  "exp_name": "exp001",   # str, experiment name; used to name the saved checkpoints and CSV results
  "device": "cuda:0",     # str, device to run the model on, e.g. "cuda:0" or "cpu"
  "lr": 1e-4,             # float, learning rate
  "epochs": 10,           # int, number of training epochs
  "batch_size": 128,      # int, number of examples per batch
  "print_freq": 1000,     # int, how often to print metrics for the training set
  "eval_freq": 500,       # int, how often to evaluate the model and print metrics for the validation set
  "seed": 42,             # int, random seed for reproducibility
}
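When writing a new configuration, it can help to check it before starting a long run. The sketch below validates that a dictionary contains the keys listed above with the documented types. The names `EXPECTED_TYPES` and `validate_config` are hypothetical and not part of this repository.

```python
# Expected keys and types, taken from the configuration description above.
EXPECTED_TYPES = {
    "exp_name": str,
    "device": str,
    "lr": float,
    "epochs": int,
    "batch_size": int,
    "print_freq": int,
    "eval_freq": int,
    "seed": int,
}


def validate_config(config: dict) -> dict:
    """Raise if a key is missing or has the wrong type; otherwise return the config."""
    for key, expected in EXPECTED_TYPES.items():
        if key not in config:
            raise KeyError(f"missing config key: {key}")
        if not isinstance(config[key], expected):
            raise TypeError(
                f"{key} should be {expected.__name__}, "
                f"got {type(config[key]).__name__}"
            )
    return config


config = {
    "exp_name": "exp001",
    "device": "cpu",
    "lr": 1e-4,
    "epochs": 10,
    "batch_size": 128,
    "print_freq": 1000,
    "eval_freq": 500,
    "seed": 42,
}
validate_config(config)  # passes silently for a well-formed config
```

A check like this catches a mistyped key or a learning rate written as a string at start-up rather than partway through training.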

References

Conneau, Alexis, Douwe Kiela, Holger Schwenk, Loic Barrault, and Antoine Bordes. “Supervised Learning of Universal Sentence Representations from Natural Language Inference Data.” ArXiv:1705.02364 [Cs], July 8, 2018. http://arxiv.org/abs/1705.02364.
Conneau, Alexis, and Douwe Kiela. “SentEval: An Evaluation Toolkit for Universal Sentence Representations.” ArXiv:1803.05449 [Cs], March 14, 2018. http://arxiv.org/abs/1803.05449.

AmanDaVinci/Universal-Sentence-Representations
