These three notebooks accompany the materials for the CAS at FHNW in Spatial Statistics and the module on GIR.
Basic NLP
This notebook demonstrates some simple NLP on German texts.
TFIDF
TF-IDF is a very basic way of ranking documents. This notebook implements tf-idf from scratch, and allows you to experiment with it.
Finding placenames This project illustrates an application to find new placenames in a dataset, by performing NER on Geograph image descriptions and comparing those with an Ordnance Survey gazetteer. It uses a simple local index to compare gazetteer entries, and only allows exact matching.