Collection of Python scripts to build a Solr index from selected Dutch and English DBpedia dumps.

KBNLresearch/dbpedia-indexer

DBpedia indexer

Collection of Python scripts to build a Solr index from a subset of the Dutch and English DBpedia dumps for use with our entity linker. The dumps were taken from the 2016-04 Downloads dataset and loaded into a local Virtuoso triple store.

Usage

Building the Solr index from the Virtuoso graph is a two-step process:

  1. Generating lists of the Dutch and English resource URIs that are to be indexed: ./get_uris.py

  2. Retrieving the data for the URIs on the lists and sending it to Solr: ./index.py. The relevant fields are extracted from the Virtuoso response(s) for each resource in record.py.
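
The second step can be illustrated with a minimal sketch. It is not the code in index.py or record.py. It assumes that Virtuoso returns results in the standard SPARQL JSON results format and that each Solr document is a flat dictionary keyed by field name. The names `FIELD_MAP`, `build_query` and `to_solr_doc`, as well as the chosen predicates and field names, are hypothetical and serve only as illustration.

```python
from collections import defaultdict

# Hypothetical mapping from RDF predicates to Solr field names.
# The actual field selection lives in record.py and may differ.
FIELD_MAP = {
    "http://www.w3.org/2000/01/rdf-schema#label": "label",
    "http://www.w3.org/2000/01/rdf-schema#comment": "comment",
}


def build_query(uri):
    """Return a SPARQL query selecting all predicate/object pairs of one resource."""
    return "SELECT ?p ?o WHERE { <%s> ?p ?o }" % uri


def to_solr_doc(uri, sparql_json, field_map=FIELD_MAP):
    """Flatten a SPARQL JSON result for one resource into a Solr-style document.

    Predicates that are not in field_map are ignored; every mapped field
    is stored as a list so that multi-valued properties are preserved.
    """
    fields = defaultdict(list)
    for binding in sparql_json["results"]["bindings"]:
        name = field_map.get(binding["p"]["value"])
        if name is not None:
            fields[name].append(binding["o"]["value"])
    doc = {"id": uri}
    doc.update(fields)
    return doc
```

In a full pipeline, the query would be sent to the Virtuoso SPARQL endpoint for every URI on the lists from step 1, and the resulting documents would be posted to Solr in batches.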
