Skip to content

Web-App to convert image to text. Fully backed by AWS services start to end.

Notifications You must be signed in to change notification settings

kishore7403/AWS_Textract_TextRecognition

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Text Recognition using AWS Textract

Introduction

The Text Recognition Application is a cloud-based solution that leverages AWS services to offer users the ability to extract text from images using Amazon Textract. This user-friendly application facilitates the extraction of text from various sources, including handwritten notes, scanned documents, printed text, and code snippets.

Project Architecture

Project Links and Demo

Hosted Application: http://52.73.6.62:3000/

Architecture and flow

architecture-flow.and.demo.mp4

Short Demo

short.demo.mp4

Table of Contents

Project Goals and Objectives

The primary goal of this project is to create an efficient and secure application for text recognition using Amazon Textract. Key objectives include:

  • Enabling users to log in securely.
  • Accepting image uploads in multiple formats.
  • Accurate and efficient text recognition.
  • Support for various use cases and input types.
  • High availability, scalability, and cost-effectiveness.

Target Users

This application is designed for individuals and professionals who require accurate text extraction from diverse sources. Target users include students, researchers, programmers, administrators, and anyone dealing with text data from various inputs.

Performance Targets

  • Response Time: The application aims for fast processing with minimal response time per image.
  • Accuracy: The text recognition engine strives for high accuracy to minimize extraction errors.
  • Versatility: The application is designed to handle a wide range of text recognition tasks.
  • Scalability: The system is built to efficiently scale to accommodate user demand and task complexities.

Services Used

The following AWS services are utilized in this project:

  • Compute: Amazon EC2, AWS Lambda
  • Storage: Amazon S3, Amazon DynamoDB
  • Security: AWS Secrets Manager
  • Text Recognition: Amazon Textract
  • Network: Amazon API Gateway

How Cloud Mechanisms Fit Together

Amazon API Gateway serves as the entry point, routing user requests from the frontend hosted on Amazon EC2 to relevant AWS Lambda functions. These functions handle user login, image uploads, and text extraction. Amazon DynamoDB securely stores user data, while Amazon S3 stores uploaded images. The Lambda function responsible for text extraction employs Amazon Textract to process images and extract text. AWS Secrets Manager ensures secure access to DynamoDB and S3. The combination of these cloud mechanisms creates a robust and scalable system, offering users a seamless experience.

Data Storage

  • User information and login credentials are stored in Amazon DynamoDB, a secure NoSQL database.
  • Uploaded images are stored in Amazon S3, providing scalable and durable object storage.

Programming Languages

  • Python: Used for writing AWS Lambda functions.
  • JavaScript: Used for frontend application development.