Skip to content

Latest commit

 

History

History
21 lines (11 loc) · 1.23 KB

File metadata and controls

21 lines (11 loc) · 1.23 KB

Taxonomy of Mathematical Plagiarism

This repository provides the dataset published in the paper "Taxonomy of Mathematical Plagiarism" and experimented's code.

Dataset

We curated a dataset of potentially plagiarised document math content span pairs along with Obfuscation (the way in which content is modified) types. The dataset and information on the accompanying files are available in data/

Experiments

We analyzed the best-performing approaches to detect plagiarism and mathematical content similarity on the newly established taxonomy. Corresponding code is present in code/experiments/.

Paper

A. Satpute, A. Greiner-Petter, N. Giessing, I. Beckenbach, M. Schubotz, O. Teschke, A. Aizawa, and B. Gipp, “Taxonomy of Mathematical Plagiarism,” in 46th European Conference on Information Retrieval (ECIR), Glasgow, Scotland, 2024.

License

CC-BY-SA 4.0. This defines the license for the whole dataset, which contains non-copyrighted bibliographic metadata and reference data derived from I4OSC (CC0).