
Predicting future winners of the Ingeborg-Bachmann-Preis with the help of NLP. Has a Flask app and a Rasa chatbot!


senzelden/bachmannpreis


Project: Bachmannpreis

ER model bachmann database

Background

The 'Tage der deutschsprachigen Literatur' in Klagenfurt, Austria, is a major literary festival and essentially the only one broadcast on German television. The event consists of 30-minute readings by invited writers, whose texts are then evaluated by a panel of critics. It culminates in a live vote by the critics to determine which writers receive an award, most notably the Ingeborg-Bachmann-Preis. For several German-speaking writers, the event has marked the start or the end of their literary career.

Structure

This project consists of five parts:

1. Getting the data / Wrangling the data / Storing the data
2. NLP
  • Sentiment analysis of the jury discussions (with spaCy and SentiWS)
  • NER and POS tagging of the texts using flair
  • Topic modeling with NMF and TF-IDF
3. Feature Engineering / Prediction
  • Feature engineering for prediction with pandas
  • Predicting winning authors with a Random Forest model using auto-sklearn
4. Web application
5. Chatbot
  • Deploy chatbot for user interaction (Rasa)
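
To give a rough idea of the sentiment step in part 2, here is a minimal sketch of lexicon-based scoring in plain Python. It is not the project's actual pipeline, which uses spaCy together with SentiWS. The lexicon `TOY_LEXICON`, its weights, and the helper names `tokenize` and `sentiment_score` are assumptions made up for illustration. The sketch also skips lemmatization, so only exact word forms are matched.

```python
import re

# Hypothetical mini-lexicon in the spirit of SentiWS: word -> polarity weight in [-1, 1].
# The weights below are invented for illustration and are NOT taken from SentiWS.
TOY_LEXICON = {
    "gut": 0.4,
    "großartig": 0.7,
    "gelungen": 0.5,
    "schlecht": -0.6,
    "langweilig": -0.5,
    "misslungen": -0.7,
}


def tokenize(text):
    """Lowercase the text and split it into word tokens."""
    return re.findall(r"\w+", text.lower())


def sentiment_score(text, lexicon=TOY_LEXICON):
    """Return the mean polarity of all tokens found in the lexicon (0.0 if none match)."""
    weights = [lexicon[token] for token in tokenize(text) if token in lexicon]
    return sum(weights) / len(weights) if weights else 0.0


if __name__ == "__main__":
    # Made-up jury comments, used only to exercise the function.
    comments = [
        "Der Text ist großartig und gelungen.",
        "Langweilig und misslungen.",
    ]
    for comment in comments:
        print(f"{sentiment_score(comment):+.2f}  {comment}")
```

A score above zero suggests a predominantly positive comment, a score below zero a negative one. In a real pipeline the tokens would be lemmatized first, so that inflected forms such as "gelungener" also match their lexicon entry.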

rasa_bot