Should mMiniLMv2 be paired with the tokenizer of mMiniLMv1? #1493

wencan · 2024-03-30T10:23:20Z

I downloaded mMiniLMv2. The compressed package only contains the model file and no tokenizer information. However, from the shape of the embedding, it seems that mMiniLMv2 and mMiniLMv2 may use the same tokenizer.

like this:

from transformers import XLMRobertaTokenizer
tokenizer = XLMRobertaTokenizer.from_pretrained("microsoft/Multilingual-MiniLM-L12-H384")

The text was updated successfully, but these errors were encountered:

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Should mMiniLMv2 be paired with the tokenizer of mMiniLMv1? #1493

Should mMiniLMv2 be paired with the tokenizer of mMiniLMv1? #1493

wencan commented Mar 30, 2024

Should mMiniLMv2 be paired with the tokenizer of mMiniLMv1? #1493

Should mMiniLMv2 be paired with the tokenizer of mMiniLMv1? #1493

Comments

wencan commented Mar 30, 2024