PDF 2 NER

A web application that converts scanned PDF files into text-based data and applies Named Entity Recognition (NER) to extract entities from Spanish-language text.

Created by: Fer Aguirre
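
The two stages described above (scanned pages to text, then text to named entities) can be sketched as a small pipeline. This is only an illustration under stated assumptions, not the project's actual code: the README does not say which OCR or NER libraries are used, so both stages are passed in as plain functions, and every name here (`Entity`, `pdf_pages_to_entities`, `fake_ocr`, `fake_ner`) is hypothetical.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, List, Tuple

# Hypothetical signatures; the real project may structure this differently.
OcrFn = Callable[[bytes], str]                   # page image bytes -> recognized text
NerFn = Callable[[str], List[Tuple[str, str]]]   # text -> [(entity text, label)]


@dataclass(frozen=True)
class Entity:
    page: int   # 1-based page number where the entity was found
    text: str
    label: str


def pdf_pages_to_entities(pages: Iterable[bytes], ocr: OcrFn, ner: NerFn) -> List[Entity]:
    """Run OCR on each page image, then NER on the recognized text."""
    entities: List[Entity] = []
    for number, image in enumerate(pages, start=1):
        text = ocr(image)
        if not text.strip():
            continue  # nothing recognized on this page, so skip NER
        for ent_text, label in ner(text):
            entities.append(Entity(number, ent_text, label))
    return entities


# Stand-in backends so the sketch runs without any OCR or NLP dependency.
def fake_ocr(image: bytes) -> str:
    # Pretends the "image" bytes are already UTF-8 text.
    return image.decode("utf-8")


def fake_ner(text: str) -> List[Tuple[str, str]]:
    # Toy dictionary lookup standing in for a Spanish NER model.
    known = {"Madrid": "LOC", "Frida Kahlo": "PER"}
    return [(name, label) for name, label in known.items() if name in text]


if __name__ == "__main__":
    pages = ["Frida Kahlo nació en Coyoacán.".encode("utf-8"), b"   ", "Sede: Madrid.".encode("utf-8")]
    for entity in pdf_pages_to_entities(pages, fake_ocr, fake_ner):
        print(entity)
```

Keeping OCR and NER as injected functions means a real OCR engine and a Spanish language model could replace the stand-ins without changing the pipeline function itself.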

Directory Structure

├── app.py
├── assets
│   └── pdfs
├── config.ini
├── config.ini.secret
├── data
│   ├── processed
│   └── raw
├── docs
│   ├── data-dictionary.md
│   ├── explore-data.md
│   ├── references
│   └── reports
├── LICENSE
├── notebooks
│   ├── 0.0-testing-nlp-models.ipynb
│   ├── 1.0-scraping-data.ipynb
│   └── 2.0-analyzing-data.ipynb
├── outputs
│   ├── figures
│   └── tables
├── pdf_2_ner
│   ├── data
│   ├── __init__.py
│   └── utils
├── Pipfile
├── Pipfile.lock
├── README.md
└── setup.py

License

This project is released under the MIT License.

Repository: fer-aguirre/pdf-2-ner