This repository has been archived by the owner on Aug 8, 2020. It is now read-only.

sp1thas/nlp_geofetures_extraction

NLP - Formal and Geolocation Features Extraction

This is a Python 2 project on Natural Language Processing, focused on extracting geolocation features from a corpus. It uses NLTK and a few other Python modules. The input is an annotated corpus with the same structure as the corpus included in this project.

This script extracts general features for Natural Language Processing, for example the number of characters or symbols per document. For more details, see the script.
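As an illustration of what per-document counts like these look like, here is a minimal sketch. It is not taken from the project's script: the function name `general_features`, the returned keys, and the choice to treat ASCII punctuation as "symbols" are all assumptions made for the example. It uses only the standard library and runs on both Python 2 and Python 3.

```python
# -*- coding: utf-8 -*-
# Illustrative sketch only: names and the feature set are assumptions,
# not the project's actual implementation.
from __future__ import division, print_function

import string


def general_features(document):
    """Return a few simple counts for a single document (a string)."""
    chars = len(document)
    # Assumption: "symbols" means ASCII punctuation characters.
    symbols = sum(1 for ch in document if ch in string.punctuation)
    # Whitespace-delimited tokens, a rough stand-in for a real tokenizer.
    words = len(document.split())
    return {
        "chars": chars,
        "symbols": symbols,
        "words": words,
        # Ratio guarded against empty documents.
        "symbols_per_char": symbols / chars if chars else 0.0,
    }


if __name__ == "__main__":
    print(general_features("Hello, world!"))
```

A real feature extractor would likely replace `str.split` with an NLTK tokenizer and compute these counts for every document in the annotated corpus.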

This script extracts geolocation features from English documents. The author's nationality is required.

This script extracts both general and geolocation features.

Prerequisites

  • NLTK
  • A dataset with correct annotations

Python 2 Modules Installation

Run as root:

pip install -r requirements.txt

Author

Simakis Panagiotis (Initial Work)

Licence

This project is licensed under the GNU General Public License version 3. See the LICENSE file for details.