Add XLSum evaluation / unify eval script #12

Open · wants to merge 166 commits into sentence_retrieval_eval
Conversation

haileyschoelkopf (Collaborator)

Submitting a PR from a fork because I may not have edit access to this repo.

In this PR: added adapters_eval.py, a script that can be used to evaluate on XLSum or XNLI based on the 'dataset' flag.
I'm also working on adding DeepSpeed compatibility via the Hugging Face Trainer / command line.
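
Roughly, the dispatch works like the sketch below (flag names and dataset configs here are illustrative placeholders, not necessarily the ones adapters_eval.py uses):

```python
import argparse
from datasets import load_dataset  # Hugging Face `datasets`

def main():
    parser = argparse.ArgumentParser(description="Evaluate on XLSum or XNLI")
    # Flag and config names below are hypothetical, for illustration only.
    parser.add_argument("--dataset", choices=["xlsum", "xnli"], required=True)
    parser.add_argument("--xlsum_lang", default="english")  # XL-Sum configs use language names
    parser.add_argument("--xnli_lang", default="de")         # XNLI configs use language codes
    args = parser.parse_args()

    if args.dataset == "xlsum":
        # Summarization: evaluation runs model.generate and scores with ROUGE.
        data = load_dataset("csebuetnlp/xlsum", args.xlsum_lang, split="test")
    else:
        # NLI: 3-way classification, typically scored with accuracy.
        data = load_dataset("xnli", args.xnli_lang, split="test")

    print(f"Loaded {len(data)} {args.dataset} test examples")

if __name__ == "__main__":
    main()
```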

TODO/needs checking:

  • The ROUGE compute_metrics function could be wrong; I will try to check this (a reference sketch follows this list).
  • Make sure the logic in load_model for adding adapters and setting them as trainable is correct.
  • Has the FIXME in adapters_xnli_de.py been dealt with?
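
On the ROUGE point, this is a minimal sketch of the kind of compute_metrics I have in mind, assuming the Trainer hands back generated token ids (the GPT-2 tokenizer here is a stand-in; whether eval_pred carries token ids or logits depends on how generation is wired up):

```python
import numpy as np
import evaluate
from transformers import AutoTokenizer

# Stand-in setup for the sketch; the real script loads its own tokenizer.
tokenizer = AutoTokenizer.from_pretrained("gpt2")
tokenizer.pad_token = tokenizer.eos_token  # GPT-2 has no pad token by default
rouge = evaluate.load("rouge")

def compute_metrics(eval_pred):
    # eval_pred is (predictions, labels); here predictions are generated token ids.
    predictions, labels = eval_pred
    # -100 marks ignored label positions and cannot be decoded.
    labels = np.where(labels != -100, labels, tokenizer.pad_token_id)
    decoded_preds = tokenizer.batch_decode(predictions, skip_special_tokens=True)
    decoded_labels = tokenizer.batch_decode(labels, skip_special_tokens=True)
    scores = rouge.compute(
        predictions=decoded_preds,
        references=decoded_labels,
        use_stemmer=True,
    )
    # `evaluate`'s rouge returns plain floats (rouge1/rouge2/rougeL/rougeLsum).
    return {k: round(v * 100, 2) for k, v in scores.items()}
```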

@yongzx (Collaborator) commented May 11, 2022

Thanks Hailey!

(Referring to #11) I will resolve this PR once Vassilina and I have finalized our evaluation script for XNLI. Apologies for the delay.

@yongzx (Collaborator) commented Jul 7, 2022

@haileyschoelkopf Can you help review b0a23c5? Thank you!
I've tested it, and training and evaluation (on the baseline BLOOM and GPT-2 models) are working. The only minor issue is that the evaluation using model.generate takes quite a long time (even with num_beams = 1).
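
One common way to cut the generation time is to batch the model.generate calls with greedy decoding rather than generating one example at a time; a rough sketch is below (model name and prompts are placeholders, not what the script actually uses):

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "gpt2"  # placeholder; the PR evaluates BLOOM and GPT-2 baselines
tokenizer = AutoTokenizer.from_pretrained(model_name)
tokenizer.pad_token = tokenizer.eos_token  # GPT-2 has no pad token by default
tokenizer.padding_side = "left"            # left-pad so generation continues the prompt
model = AutoModelForCausalLM.from_pretrained(model_name).eval()
device = "cuda" if torch.cuda.is_available() else "cpu"
model.to(device)

@torch.no_grad()
def generate_batch(prompts, max_new_tokens=64):
    inputs = tokenizer(prompts, return_tensors="pt", padding=True).to(device)
    out = model.generate(
        **inputs,
        max_new_tokens=max_new_tokens,
        num_beams=1,       # greedy decoding; extra beams multiply the cost
        do_sample=False,
        pad_token_id=tokenizer.pad_token_id,
    )
    # Strip the prompt tokens and decode only the generated continuation.
    gen = out[:, inputs["input_ids"].shape[1]:]
    return tokenizer.batch_decode(gen, skip_special_tokens=True)

# Placeholder prompts just to show the call shape.
print(generate_batch(["Summarize: example document one.",
                      "Summarize: example document two."]))
```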

@haileyschoelkopf (Collaborator, Author)

Yes, I can! I might only get to it tomorrow, though.
