
For sentence classification using BERT, PAD token is used in IG/Deeplift? #1269

Open
lkqnaruto opened this issue Apr 10, 2024 · 1 comment

@lkqnaruto

For a sentence classification task using BERT, is the PAD token used as the baseline in IG/DeepLift? Or the unknown token? Or can it be customized?

@EldadTalShir

The default baseline in IG is a zero scalar for each input tensor (effectively the PAD token for BERT, whose pad_token_id is 0). It can be customized by setting the baselines parameter when calling the attribute function. For example, to use UNK as the reference (assuming seq_len is the number of tokens in your input):

# Custom reference token for IG
import torch
from transformers import AutoTokenizer
from captum.attr import TokenReferenceBase

tokenizer = AutoTokenizer.from_pretrained('sentence-transformers/all-MiniLM-L6-v2')  # Load your model's tokenizer
ref_token_id = tokenizer.unk_token_id  # Pick the id of your desired reference token; tokenizer.all_special_tokens lists all special tokens your model supports
token_reference = TokenReferenceBase(reference_token_idx=ref_token_id)  # Captum builds a reference sequence from the number of tokens in your input
device = torch.device("cuda:0" if torch.cuda.is_available() else "cpu")
ref = token_reference.generate_reference(seq_len, device=device).unsqueeze(0)

Then, when you call attribute, set baselines=ref. You can also follow this tutorial: https://captum.ai/tutorials/IMDB_TorchText_Interpret
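Conceptually, all generate_reference does is build a sequence of seq_len copies of the reference token id, which then stands in for the real input during attribution. A minimal pure-Python sketch of that behavior (no torch needed; the id 100 is BERT's usual unk_token_id, used here purely for illustration):

```python
def generate_reference(seq_len, ref_token_id):
    # Mirrors the shape of Captum's TokenReferenceBase.generate_reference:
    # a baseline sequence of length seq_len filled with the reference token id.
    return [ref_token_id] * seq_len

# A 6-token input with UNK (id 100 in bert-base-uncased) as the reference:
baseline = generate_reference(6, 100)
print(baseline)  # [100, 100, 100, 100, 100, 100]
```

This makes clear why the default zero baseline behaves like PAD for BERT: a baseline of all zeros is exactly this sequence with ref_token_id = 0, which is BERT's pad_token_id.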
