training size #6

IreneSucameli · 2022-02-17T13:27:53Z

Hi, I have a question about the training set of AlBERTo.
I've read that the pre-trained lower cased model is based on 200M of tweets (191GB of raw data) but what about the training set only? Could you please specify how large the training set is?
Thanks a lot.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

training size #6

training size #6

IreneSucameli commented Feb 17, 2022

training size #6

training size #6

Comments

IreneSucameli commented Feb 17, 2022