This project contains a Python script that converts PDF files to text files
The script scans specified directories for PDF files. For each PDF file, it extracts the text content and saves it as a new text file in a specified output directory.
- Python 3.7 or higher
- pdfminer.six
- Clone the repository to your local machine.
- Install the required Python package:
pip install pdfminer.six
- Open the Python script (
pdf_to_text.py
) in a text editor. - Modify the
directories
list with the paths to the directories containing your PDF files. - Modify the
output_dir
variable with the path to the directory where you want to save the text files. - Run the script:
python pdf_to_text.py
This project is licensed under the MIT License