GitHub - BDBC-KG-NLP/NGCSE: Official implementation for the paper *Narrowing the Gap between Supervised and Unsupervised Sentence Representation Learning with Large Language Model*

Environment

Run the following command to create the required conda environment:

conda env create -f environment.yml -n your_new_environment_name

How we hold out 10% of the training data and how some data augmentations are performed are shown in tools.py.

Train with final performance in the "Wiki.STS_HT" training setting :
```
bash scripts/train_bert_wiki_sts.sh
```
Train with final performance in the "NLI.STS_HT" training setting :
```
bash scripts/train_bert_nli_sts.sh
```

The scripts to perform experiments before Final Performance section are listed in scripts/data_domain.

bash scripts/evaluation.sh path_to_the_result

How we plot figures in the paper are shown in plot.py.

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
SentEval		SentEval
data		data
figure		figure
ngcse		ngcse
scripts		scripts
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
appendix.pdf		appendix.pdf
environment.yml		environment.yml
evaluation.py		evaluation.py
plot.py		plot.py
tools.py		tools.py
train.py		train.py