get_sequence_output is not contextualized #264

Open
maziyarpanahi opened this issue Apr 27, 2020 · 1 comment

Comments

@maziyarpanahi
Hi,

I finally managed to use `get_sequence_output` to get word embeddings, after dealing with random embeddings caused by dropout, random seeds, etc.

However, `get_sequence_output()` doesn't seem to be contextualized. If you take a string that says `Bank river.` and get the embeddings for `Bank`, and then try another one with `Bank robber.`, the embeddings for `Bank` are identical in both tests. In BERT and other contextualized transformers, `Bank` gets a different vector in each case since the context is not the same.
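To make the test concrete, this is roughly the check I run (`embed()` is a stand-in for my own helper that tokenizes a sentence, feeds it through the model, and returns one vector per token from `get_sequence_output()`; it is not a function from this repo):

```python
import numpy as np

def cosine(a, b):
    # Cosine similarity between two token vectors.
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))

# embed() is my hypothetical tokenize-and-run helper, not part of the repo.
# Assuming "Bank" maps to the first token in both sentences:
vecs_river = embed("Bank river.")
vecs_robber = embed("Bank robber.")

# A contextual model should give a similarity noticeably below 1.0 here;
# instead I get exactly 1.0, i.e. identical vectors.
print(cosine(vecs_river[0], vecs_robber[0]))
```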

I tried to play around with the mask, segments, etc., but it's always the same embedding for a given word in different contexts. I followed the advice and some examples, and these are my configs:

```python
xlnet_config = XLNetConfig(FLAGS=None, json_path=json_path)
run_config = RunConfig(
    is_training=False,
    use_tpu=False,
    use_bfloat16=False,
    dropout=0.0,
    dropatt=0.0
)
```
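For completeness, here is a sketch of how I wire these configs into the model, following the usage shown in the upstream README (the placeholder shapes are my assumption; the repo expects time-major `[seq_len, batch_size]` inputs, and the checkpoint is restored into the session separately, e.g. with `tf.train.Saver`, before evaluating anything):

```python
import tensorflow as tf
import xlnet

seq_len, batch_size = 128, 1

# Tokenized inputs; 0 in input_mask marks real tokens, 1 marks padding.
input_ids = tf.placeholder(tf.int32, [seq_len, batch_size])
seg_ids = tf.placeholder(tf.int32, [seq_len, batch_size])
input_mask = tf.placeholder(tf.float32, [seq_len, batch_size])

# Construct the model from the config objects above.
xlnet_model = xlnet.XLNetModel(
    xlnet_config=xlnet_config,
    run_config=run_config,
    input_ids=input_ids,
    seg_ids=seg_ids,
    input_mask=input_mask)

# Per-token hidden states, shape [seq_len, batch_size, d_model].
seq_out = xlnet_model.get_sequence_output()
```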

I've seen some examples that use 0.1 for dropout, such as https://github.com/amansrivastava17/embedding-as-service/tree/master/server/embedding_as_service/text/xlnet, but those suffer from the random-embeddings issue.

Are my XLNet config and run config correct to use the pre-trained weights/checkpoints?

@maziyarpanahi
Author

Unfortunately, I couldn't find any solution. It seems that for some reason (it could be entirely my mistake) the XLNet pre-trained models are not aware of their surrounding tokens, so unlike BERT, no matter what you put before or after a word, it will always generate the same vectors.
