
Training script/configuration of T5 generator for GenQ #141

Open
jihyukkim-nlp opened this issue Apr 20, 2023 · 2 comments

Comments

@jihyukkim-nlp

Hello, thank you for sharing this repo and useful hugging face model cards!

I am interested in T5 generators for query generation, and I am trying to extend this to other datasets/tasks.
To do so, I would like to reproduce the T5 generators, specifically BeIR/query-gen-msmarco-t5-large-v1.

I am wondering if the training script and training configurations for the generators can be shared,
including

  • T5 initial checkpoint (Did you use google/t5-v1_1-large or google/flan-t5-large?),
  • maximum source/target length,
  • batch size,
  • optimizer,
  • learning rate,
  • learning rate scheduling,
  • warmup steps,
  • and total training steps.

Best regards,
Jihyuk Kim

@thakur-nandan (Member)

Hi @jihyukkim-nlp,

Sadly, I do not remember the exact training details for the question generator models. @nreimers could you help me here?

Regarding the first two points:

  • We used the T5-Large model as the initial checkpoint (not the Flan-T5 large model).
  • I believe the maximum length for the target question is 64 tokens and the source is 350 tokens.
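As a purely illustrative sketch of those length budgets: a doc2query-style generator is trained on (passage, query) pairs, truncating the source to roughly 350 tokens and the target query to 64. The snippet below uses whitespace tokenization as a stand-in for the real T5 tokenizer, and the helper names (`truncate`, `make_pair`) are hypothetical, not from the BEIR codebase:

```python
# Illustrative only: build (source, target) training pairs for a
# doc2query-style T5 generator, using the length budgets mentioned above.
# Whitespace splitting stands in for the actual T5 subword tokenizer.

MAX_SOURCE_TOKENS = 350  # passage budget (per the reply above)
MAX_TARGET_TOKENS = 64   # query budget (per the reply above)

def truncate(text: str, max_tokens: int) -> str:
    """Keep at most max_tokens whitespace-delimited tokens."""
    return " ".join(text.split()[:max_tokens])

def make_pair(passage: str, query: str) -> tuple:
    """Build one (source, target) training example."""
    return truncate(passage, MAX_SOURCE_TOKENS), truncate(query, MAX_TARGET_TOKENS)

src, tgt = make_pair("word " * 400, "what is python " * 30)
print(len(src.split()), len(tgt.split()))  # -> 350 64
```

With a real tokenizer the truncation would of course operate on subword IDs rather than whitespace tokens, but the source/target budgets apply the same way.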

@jihyukkim-nlp (Author)

Hi @thakur-nandan,

Thank you for the information!
I have tried a few different configurations, e.g., different learning rates (1e-5, 3e-5, 5e-5, 1e-4), with and without warmup steps, but I have failed to reproduce the original results.

It worked relatively well for MS MARCO, but not for BEIR.
At this point, I am also wondering if other datasets, such as NQ, were used for training the generator by any chance?

It would be really helpful if @nreimers could chime in.
