Heritage Connector

Transforming text into data to extract meaning and make connections. In development.

See also our paper, Heritage connector: A machine learning framework for building linked open data from museum collections, at https://doi.org/10.1002/ail2.23.

A set of tools to:

load tabular collection data to a knowledge graph
find links between collection entities and Wikidata
perform NLP to create more links in the graph (also see hc-nlp)
explore and analyse a collection graph in ways that aren't possible in existing collections systems
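
As a rough illustration of the first step above (tabular data to a graph), the sketch below turns the rows of a small CSV table into subject-predicate-object triples using only the Python standard library. It does not use the heritageconnector API: the function name `rows_to_triples`, the `https://example.org/object/` namespace, the column names and the sample records are all hypothetical and chosen purely for illustration.

```python
import csv
import io

# Hypothetical namespace for record URIs (an assumption, not a real endpoint).
BASE_URI = "https://example.org/object/"


def rows_to_triples(csv_text, id_column="id"):
    """Turn each non-empty cell of a CSV table into a (subject, predicate, object) triple.

    The row's ID becomes the subject URI, the column name becomes the predicate,
    and the cell value becomes the object. Empty cells produce no triple.
    """
    triples = []
    for row in csv.DictReader(io.StringIO(csv_text)):
        subject = BASE_URI + row[id_column]
        for column, value in row.items():
            if column == id_column or not value:
                continue
            triples.append((subject, column, value))
    return triples


# Made-up sample records; the second row has no maker recorded.
SAMPLE = """id,title,maker
1,Example engine,Example Maker
2,Example loom,
"""

triples = rows_to_triples(SAMPLE)
for triple in triples:
    print(triple)
```

In a real pipeline the column names would typically be mapped to RDF predicates and the triples written to a triple store. This sketch stops at plain Python tuples to keep the idea visible.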

Collections as tabular data (left) vs knowledge graphs (right)

For Developers (TODO: put in docs)

Python 3
Create a new branch / Pull Request for each new feature / unit of functionality

Installation

We use pipenv for dependency management. You can also install dependencies from requirements.txt and dev dependencies from requirements_dev.txt.

Optional dependencies (for experimental features):

torch, dgl, dgl-ke: KG embeddings
spacy-nightly: export to spaCy KnowledgeBase for Named Entity Linking

Running tests

Run python -m pytest, optionally adding --cov=heritageconnector for a coverage report.

We use pytest for tests, and all tests are in ./test.

Running

To run the web app (in development): python -m heritageconnector.web.app

Citation

Cite as:

Dutia, K, Stack, J. Heritage connector: A machine learning framework for building linked open data from museum collections. Applied AI Letters. 2021;e23. https://doi.org/10.1002/ail2.23
