German paraphraser which combines different paraphrasing methods to generate as many different paraphrases as possible for NLU evaluations.

j0st/german-paraphraser

German Paraphrasing Tool

Introduction

This is a modified version of the code used in my bachelor's thesis, "Der Einfluss von automatisch generierten Paraphrasen auf die NLU-Performance am Beispiel des Miele Alexa Skills" ("The Influence of Automatically Generated Paraphrases on NLU Performance, Using the Miele Alexa Skill as an Example", 2021). The project generates German paraphrases using several methods that operate at the lexical and syntactic levels. The resulting paraphrases can be used, for example, in NLU evaluations.

Method

Paraphrases were automatically generated in the following ways:

  • Word Substitution
  • Pivot Translation
  • T5 (Transformer Model)
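
To make the first two methods concrete, here is a minimal sketch of how word substitution and pivot translation can be combined into one candidate list. It is an illustration under stated assumptions, not the project's implementation: `TOY_SYNONYMS` is a hand-written stand-in for a lexical resource such as OdeNet, the two translator callables are placeholders for whatever translation system is used, and all function names are hypothetical.

```python
from typing import Callable, Dict, List

# Hand-written synonym table (assumption): stands in for a real lexical
# resource such as OdeNet. The entries are illustrative only.
TOY_SYNONYMS: Dict[str, List[str]] = {
    "Starte": ["Beginne"],
    "schnell": ["rasch"],
}


def substitute_words(sentence: str, synonyms: Dict[str, List[str]]) -> List[str]:
    """Return one variant per (word, synonym) pair, replacing one word at a time."""
    # Split off a final punctuation mark so it does not block dictionary lookups.
    if sentence and sentence[-1] in ".!?":
        body, end = sentence[:-1], sentence[-1]
    else:
        body, end = sentence, ""
    tokens = body.split()
    variants = []
    for i, token in enumerate(tokens):
        for synonym in synonyms.get(token, []):
            replaced = tokens[:i] + [synonym] + tokens[i + 1:]
            variants.append(" ".join(replaced) + end)
    return variants


def pivot_translate(
    sentence: str,
    to_pivot: Callable[[str], str],
    from_pivot: Callable[[str], str],
) -> str:
    """Translate into a pivot language and back; the round trip is the paraphrase."""
    return from_pivot(to_pivot(sentence))


def combine(
    sentence: str,
    synonyms: Dict[str, List[str]],
    to_pivot: Callable[[str], str],
    from_pivot: Callable[[str], str],
) -> List[str]:
    """Collect candidates from both methods, dropping duplicates and the input itself."""
    candidates = substitute_words(sentence, synonyms)
    candidates.append(pivot_translate(sentence, to_pivot, from_pivot))
    seen = {sentence}
    unique = []
    for candidate in candidates:
        if candidate not in seen:
            seen.add(candidate)
            unique.append(candidate)
    return unique
```

Passing the translators in as callables keeps the sketch independent of any particular translation service. Filtering out the input sentence matters because a round trip through the pivot language often returns the original unchanged.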

Getting Started

  1. Clone the project
git clone https://github.com/j0st/german-paraphraser
  2. Install the requirements (OdeNet must be installed directly from https://github.com/hdaSprachtechnologie/odenet)
pip install -r requirements.txt
pip install git+https://github.com/hdaSprachtechnologie/odenet
  3. Import paraphraser.py and create an instance of the Paraphraser class
from paraphraser import Paraphraser
paraphraser = Paraphraser()
  4. Generate paraphrases from your text file (input sentences need punctuation for the paraphraser to work properly)
paraphraser.generate(YOUR_LIST_OF_INPUT_SENTENCES, OUTPUT_FILE)
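
The last step expects a list of sentences, each with punctuation. A minimal sketch of preparing that list from a text file follows. It assumes one sentence per line in a UTF-8 file; `load_sentences` and the file names are hypothetical, and appending a period to unpunctuated lines is an assumption about what the paraphraser needs, not documented behavior.

```python
from pathlib import Path
from typing import List

# Marks treated as sentence-final punctuation (assumption).
TERMINAL_PUNCTUATION = (".", "!", "?")


def load_sentences(path: str) -> List[str]:
    """Read one sentence per line, skip blank lines, and ensure final punctuation."""
    sentences = []
    for line in Path(path).read_text(encoding="utf-8").splitlines():
        sentence = line.strip()
        if not sentence:
            continue
        if not sentence.endswith(TERMINAL_PUNCTUATION):
            # Assumption: a plain period is an acceptable default.
            sentence += "."
        sentences.append(sentence)
    return sentences


# Hypothetical usage with the class from paraphraser.py (file names are placeholders):
#   from paraphraser import Paraphraser
#   paraphraser = Paraphraser()
#   paraphraser.generate(load_sentences("sentences.txt"), "paraphrases.txt")
```

Normalizing the input once, before calling `generate`, keeps the punctuation requirement in a single place instead of relying on every input file to be clean.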
