Explainable ERP Fraud Detection

This repository contains code for the paper 'Towards Explainable Occupational Fraud Detection' published at the 7th Workshop on MIning DAta for financial applicationS (MIDAS) as part of the European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML-PKDD) 2022.

Abstract

Occupational fraud within companies currently causes losses of around 5% of company revenue each year. While enterprise resource planning systems can enable automated detection of occupational fraud through recording large amounts of company data, the use of state-of-the-art machine learning approaches in this domain is limited by their untraceable decision process. In this study, we evaluate whether machine learning combined with explainable artificial intelligence can provide both strong performance and decision traceability in occupational fraud detection. We construct an evaluation setting that assesses the comprehensibility of machine learning-based occupational fraud detection approaches, and evaluate both performance and comprehensibility of multiple approaches with explainable artificial intelligence. Our study finds that high detection performance does not necessarily indicate good explanation quality, but specific approaches provide both satisfactory performance and decision traceability, underlining the suitability of machine learning for practical application in occupational fraud detection and the importance of research evaluating both performance and comprehensibility together.

Contained Materials

The data folder contains the ERP fraud detection data of Tritscher et al. [1]. The full data is available at https://professor-x.de/erp-fraud-data.

Results of the hyperparameter studies from both paper experiments can be found unter outputs/summary.

Additionally, the folder output/explanation contains the generated SHAP explanations from both experiments.

Usage

Hyperparameter studies and training for fraud detection approaches can be conducted through param_search.py.

Stored models can be explained with SHAP [2] using run_xai.py.

Interactive visualization of explanations is possible through visualize_xai.ipynb.

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commits
anomaly_detection		anomaly_detection
data		data
outputs		outputs
plotting		plotting
xai		xai
Readme.md		Readme.md
param_search.py		param_search.py
requirements.txt		requirements.txt
run_detector.py		run_detector.py
run_xai.py		run_xai.py
visualize_xai.ipynb		visualize_xai.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

anomaly_detection

anomaly_detection

data

data

outputs

outputs

plotting

plotting

xai

xai

Readme.md

Readme.md

param_search.py

param_search.py

requirements.txt

requirements.txt

run_detector.py

run_detector.py

run_xai.py

run_xai.py

visualize_xai.ipynb

visualize_xai.ipynb

Repository files navigation

Explainable ERP Fraud Detection

Abstract

Contained Materials

Usage

About

Releases

Packages

Languages

LSX-UniWue/explainable-ERP-fraud-detection

Folders and files

Latest commit

History

Repository files navigation

Explainable ERP Fraud Detection

Abstract

Contained Materials

Usage

About

Resources

Stars

Watchers

Forks

Languages