You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi, I have a question about the training set of AlBERTo.
I've read that the pre-trained lower cased model is based on 200M of tweets (191GB of raw data) but what about the training set only? Could you please specify how large the training set is?
Thanks a lot.
The text was updated successfully, but these errors were encountered:
Hi, I have a question about the training set of AlBERTo.
I've read that the pre-trained lower cased model is based on 200M of tweets (191GB of raw data) but what about the training set only? Could you please specify how large the training set is?
Thanks a lot.
The text was updated successfully, but these errors were encountered: