Running NER on tokenized data only: KeyError: "Parameter 'E' for model 'hashembed' has not been allocated yet." #10507

This is essentially the same issue as #10508, just with a slightly different (and admittedly confusing) error message.

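I'm not sure exactly what you're trying to do, but you probably want to keep the ner component from en_ner_bc5cdr_md rather than excluding it and trying to replace it with a new uninitialized/untrained one. A freshly added ner has no allocated weights yet, which is what the KeyError about parameter 'E' is reporting.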

The Doc construction is fine, but it's unlikely that en_ner_bc5cdr_md was trained on BPE/wordpiece-style tokens like ##na, so you might not see good results. Is there a particular reason that you're using this tokenization?
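To make the two points above concrete, here is a minimal sketch, not a definitive recipe. It assumes the input uses the common WordPiece convention where continuation pieces start with `##`. The helper names `merge_wordpieces` and `run_ner_on_words` are hypothetical, made up for illustration. The first helper glues the pieces back into whole words, which is one possible workaround if nothing forces you to keep the subword tokenization. The second builds a Doc from the words and runs the components of an already loaded pipeline, including its trained ner, over it.

```python
from typing import Iterable, List


def merge_wordpieces(tokens: Iterable[str], prefix: str = "##") -> List[str]:
    """Join WordPiece continuation tokens (e.g. "##na") onto the preceding token."""
    words: List[str] = []
    for token in tokens:
        if token.startswith(prefix) and words:
            # Continuation piece: append it to the previous word, minus the prefix.
            words[-1] += token[len(prefix):]
        else:
            # Regular token, or a leading continuation piece with nothing to attach to.
            words.append(token)
    return words


def run_ner_on_words(nlp, words: List[str]):
    """Build a Doc from pre-tokenized words and run the pipeline's components on it."""
    # Imported here so merge_wordpieces stays usable without spaCy installed.
    from spacy.tokens import Doc

    # Without explicit `spaces`, every token is assumed to be followed by a space.
    doc = Doc(nlp.vocab, words=words)
    # Apply each component of the loaded pipeline in order (including the trained
    # "ner"); the tokenizer is skipped because the Doc is already tokenized.
    for _name, component in nlp.pipeline:
        doc = component(doc)
    return doc
```

Assuming en_ner_bc5cdr_md is installed and loaded with `nlp = spacy.load("en_ner_bc5cdr_md")` (without excluding ner), something like `run_ner_on_words(nlp, merge_wordpieces(pieces))` should return a Doc whose `doc.ents` come from the trained component. Whether merged words match what the model saw in training closely enough is something you'd need to check on your own data.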

Answer selected by Lenala39
Labels: third-party (Third-party packages and services), feat / ner (Feature: Named Entity Recognizer), feat / doc (Feature: Doc, Span and Token objects)