Skip to content
This repository has been archived by the owner on Oct 19, 2023. It is now read-only.
/ datavark Public archive

An automated information acquisition and extraction platform, for domain-specific data.

Notifications You must be signed in to change notification settings

UplandsDynamic/datavark

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 

Repository files navigation

DatavArk

IMPORTANT: This appliction is for reference use only. It is NOT maintained and contains references to now outdated libraries that include security vulnerabilities.

DatavArk is an automated, domain-specific information acquisition and extraction platform.

This prototype was developed to gather data in the domain of Unexplained Anomalous Phenomena (UAP). The app ingests unstructured textual reports submitted to NUFORC.org and posted on Reddit.com. Extracted entities are recorded in a PostGIS SQL database.

Natural Language Processing (NLP) is implemented through a custom-trained, transformer-based machine learning model, deployed through the spaCy Python library. The web app is written using the Python Django framework.

Project author

The project was solely authored by Dan Bright, dan@uplandsdynamic.com