Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How does entity-fishing manage to assign a "Domain" to an entity ? #127

Open
aa303554 opened this issue May 11, 2021 · 1 comment
Open
Labels

Comments

@aa303554
Copy link

In the example, Switzerland has the domains: Geology, Oceanography, Earth. Does this domain correspond to a wikipedia category or a wikidata category or are the domains obtained in another way ?

image

@kermitt2
Copy link
Owner

Hello @aa303554 !

The domain is assigned based on a mapping using the high level category hierarchy of the English Wikipedia. If I remember well the mapping is there -> https://github.com/kermitt2/entity-fishing/blob/master/data/wikipedia/mapping.txt

The motivation is that we have around 1 million categories in the English Wikipedia and it's very hard to get a clue about the domains. Domain information is also found in many ontologies and it can help mapping between knowledge base.

Using the domain could be also a feature interesting to use for the disambiguation by the decision tree, but it turned out that no, it doesn't help.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

3 participants