Skip to content

marcusklang/docria

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

86 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Docria

Documentation Status

Semi-structured document storage model library.

Why?

To provide the ability of sharing, processing and transforming large amounts of natural text from heterogeneous sources.

Most commonly available formats such as TSV and JSON are too flexible and are not self-describing, does not natively define graph concepts such as Nodes and Edges or provide a language independent solution for text spans which are capable of unicode offsetts which are retained over the language boundary (compare Java and Python 3).

Implementations