OCR Web

This is a web app which uses the Tesseract API for Optical Image Recognition. This app will be deployed heresoon.

The basic functionality of the application is demonstrated in this video

Usage Instructions

Instructions for contributing can be found -> CONTRIBUTING.md file.

Nodejs (with Express)
Node Tesseract : Used as a wrapper for using the tesseract API's for the Node Platform
Fred's ImageMagicks textcleaner Bash scripts : To enhancing the image and reducing noisefor better read.
Tesseract OCR Engine
Multer : For managing the uploaded image files (Node)

As of now, it is implemented to recognize only English characters.
Though the Tesseract-OCR engine is powerful, there is a limitation to its performance.

Reduce noise in the uploaded image (Clean).
Display the cleaned image in front-end.
Use multipart formdata to upload files through Angular HTTP Request(and add other processing functionalities).
Copy text to be implemented using ngClipboard angular directive.
Retrieve text from multiple images in a single request.

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
.idea		.idea
bin		bin
lib		lib
node_modules		node_modules
public		public
routes		routes
views		views
.gitignore		.gitignore
CONTRIBUTING.md		CONTRIBUTING.md
Procfile		Procfile
README.md		README.md
app.js		app.js
package.json		package.json