KeyError: '!' during conditional generation #17

cmwilson252 · 2023-10-27T21:22:47Z

Hello, I am experiencing a key error attempting to use the code 1 for 1 from the example notebook for conditional generation:
https://github.com/microsoft/evodiff/blob/main/examples/evodiff.ipynb

from evodiff.pretrained import MSA_OA_DM_MAXSUB
from evodiff.generate_msa import generate_query_oadm_msa_simple
import re

checkpoint = MSA_OA_DM_MAXSUB()
model, collater, tokenizer, scheme = checkpoint

path_to_msa = 'bfd_uniclust_hits.a3m'
n_sequences=64 # number of sequences in MSA to subsample
seq_length=256 # maximum sequence length to subsample
selection_type='random' # or 'MaxHamming'; MSA subsampling scheme

tokeinzed_sample, generated_sequence  = generate_query_oadm_msa_simple(path_to_msa, model, tokenizer, n_sequences, seq_length, device='cpu', selection_type=selection_type)
print("New sequence (no gaps, pad tokens)", re.sub('[!-]', '', generated_sequence[0][0],))

The error can be traced back to:

evodiff/utils.py, line 247, in
return np.array([self.a_to_i[a] for a in seq[0]]) # for nested lists

The alphabet seems to not know how to handle ! which should be the padding token. This alphabet appears to be imported from sequence_models.constants as MSA_ALPHABET.

Also this is much less important but I noticed there's three instances of "tokeinzed_sample" as a variable name in the example notebook that almost certainly are meant to be "tokenized_sample"

The text was updated successfully, but these errors were encountered:

sherryliu987 · 2024-02-10T23:18:12Z

If you're struggling to install EvoDiff locally, feel free to try https://www.tamarind.bio/evodiff, a website which offers a no-code interface for bioinformatics tools including protein design with EvoDiff for free.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

KeyError: '!' during conditional generation #17

KeyError: '!' during conditional generation #17

cmwilson252 commented Oct 27, 2023 •

edited

sherryliu987 commented Feb 10, 2024

KeyError: '!' during conditional generation #17

KeyError: '!' during conditional generation #17

Comments

cmwilson252 commented Oct 27, 2023 • edited

sherryliu987 commented Feb 10, 2024

cmwilson252 commented Oct 27, 2023 •

edited