A Java application that uses Phileas to identify PII (Personally Identifiable Information) in text across a wide variety of file formats. Types of PII are scored by magnitude, density, and confidence. A list of files suggested for redaction testing will be generated.
The goal of Phinder is to provide a comprehensive analysis of PII to help you take the next step to redact it with Philter. Note that Phinder may support more file types than Philter.
Visit http://philterd.github.io/phinder for documentation and more information.
mvn clean installjava -jar target/phinder-1.0.0-SNAPSHOT.jar -i src/test/resources/input.txtTo process a directory recursively:
java -jar target/phinder-1.0.0-SNAPSHOT.jar -i src/test/resources/ -RNote
Processing images requires tesseract-ocr to be installed.
At the completion of the scan, report.json and report.html files will be generated in the current directory.
To store the report history in MongoDB, use the --mongodb CLI option:
java -jar target/phinder-1.0.0-SNAPSHOT.jar -i src/test/resources/input.txt --mongodb "mongodb://localhost:27017/phinder"For more examples and detailed usage, please refer to the documentation.
Copyright 2026 Philterd, LLC.
This project is licensed under the Apache License 2.0.
