NLP - Formal and Geolocation Features Extraction

This is a Python 2 project about Natural Language Processing and specially about geolocation features extraction from a corpus. We use the known NLTK and some other python modules. We used an annotated corpus as input. Corpus has the same structure with the corpus in this project.

geolocation.py

This scripts extract general features for Natural Language Processing. For example calculates chars per document or symbols per document etc. For more info check the script.

general.py

This script extract geolocation features for english documents. Author's nationality is necessary.

both.py

This script extract both general and geolocation features.

Pre-requirements

NLTK
Dataset with correct annotations.

Python 2 modules Installation

run as root:

pip install -r requirements.txt

Author

Simakis Panagiotis (Initial Work)

Licence

This project is licensed under the GNU General Public License version 3 - see the LICENSE file for details

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
LICENCE		LICENCE
README.md		README.md
both.py		both.py
general.py		general.py
geolocation.py		geolocation.py
requirments.txt		requirments.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

LICENCE

LICENCE

README.md

README.md

both.py

both.py

general.py

general.py

geolocation.py

geolocation.py

requirments.txt

requirments.txt

Repository files navigation

NLP - Formal and Geolocation Features Extraction

Pre-requirements

Python 2 modules Installation

Author

Licence

About

Releases

Packages

Languages

License

sp1thas/nlp_geofetures_extraction

Folders and files

Latest commit

History

Repository files navigation

NLP - Formal and Geolocation Features Extraction

Pre-requirements

Python 2 modules Installation

Author

Licence

About

Topics

Resources

License

Stars

Watchers

Forks

Languages