Skip to content

PatristicTextArchive/pta_metadata

Repository files navigation

Patristic Text Archive (Meta-)Data

DOI

This repository contains (meta-)data information used in the frontend of PTA at https://pta.bbaw.de and information for Linked Open Data (see folder LOD). The scripts used to generate this data are in the folder scripts.

pta_data.json

This file is built from metadata files conforming to the Capitains guidelines. It is converted from __cts__.xml-files in the pta_data-Respository to json. It contains the metadata for all text groups, works, and versions of works.

pta_statistics.json

This file is generated using the CLTK lemmatizer. It contains statistical data on words (and their number of occurences), lemmata (and their number of occurences), POS tags (and their number of occurences), morphological analysis (and its number of occurences), dependency analysis (and its number of occurences) in each work (identified by its urn); currently only the Greek and Latin texts are analyzed.

pta_manuscripts.json

The file is built from metadata in the teiHeader of the files in the pta_manuscripts-Respository. It contains the metadata for all manuscript descriptions.

pta_biblereferences.json

This file is built from all annotated biblical references in the PTA and enriched

pta_groups.json

This file is built from the PTA persons registry. Data from these sources is extracted and mashed up for the group index of PTA.

pta_persons.json

This file is built from the TIPNR (Translators Individualised Proper Names with all References) and from the PTA persons registry. Data from these sources is extracted and mashed up for the person index of PTA.

pta_places.json

This file is built from the Pleiades Gazetteer and the TIPNR (Translators Individualised Proper Names with all References). Data from these sources is extracted and mashed up for the place index of PTA.