Skip to content

Yuanhy1997/PartialKB-EL

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 
 
 

Repository files navigation

PartialKB-EL

This is the official code repository for ACL-BioNLP 2023 paper

Exploring Partial Knowledge Base Inference in Biomedical Entity Linking (ACL Anthology) or ArXiv-version

Abstract

Biomedical entity linking (EL) consists of named entity recognition (NER) and named entity disambiguation (NED). EL models are trained on corpora labeled by a predefined KB. However, it is a common scenario that only entities within a subset of the KB are precious to stakeholders. We name this scenario partial knowledge base inference; training an EL model with one KB and inferring on the part of it without further training. In this work, we give a detailed definition and evaluation procedures for this practically valuable but significantly understudied scenario and evaluate methods from three representative EL paradigms. We construct partial KB inference benchmarks and witness a catastrophic degradation in EL performance due to dramatically precision drop.Our findings reveal these EL paradigms can not correctly handle unlinkable mentions (NIL), so they are not robust to partial KB inference. We also propose two simple-and-effective redemption methods to combat the NIL issue with little computational overhead.

Poster @ ACL2023 for paper overview

You can view our poster@ACL2023 for a quick view of our work.

Citation

If you find our work useful, please cite:

@inproceedings{yuan-etal-2023-exploring,
    title = "Exploring Partial Knowledge Base Inference in Biomedical Entity Linking",
    author = "Yuan, Hongyi  and
      Lu, Keming  and
      Yuan, Zheng",
    booktitle = "The 22nd Workshop on Biomedical Natural Language Processing and BioNLP Shared Tasks",
    month = jul,
    year = "2023",
    address = "Toronto, Canada",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2023.bionlp-1.3",
    pages = "37--49",
    abstract = "Biomedical entity linking (EL) consists of named entity recognition (NER) and named entity disambiguation (NED). EL models are trained on corpora labeled by a predefined KB. However, it is a common scenario that only entities within a subset of the KB are precious to stakeholders. We name this scenario partial knowledge base inference; training an EL model with one KB and inferring on the part of it without further training. In this work, we give a detailed definition and evaluation procedures for this practically valuable but significantly understudied scenario and evaluate methods from three representative EL paradigms. We construct partial KB inference benchmarks and witness a catastrophic degradation in EL performance due to dramatically precision drop.Our findings reveal these EL paradigms can not correctly handle unlinkable mentions (NIL), so they are not robust to partial KB inference. We also propose two simple-and-effective redemption methods to combat the NIL issue with little computational overhead.",
}

About

Exploring Partial Knowledge Base Inference in Biomedical Entity Linking [ACL-BioNLP 2023]

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published