WebText Extractor, built with Python Flask backend and Next.js frontend, is a web application used to extract text content from any website URL.

imlakshay08/webtext-extractor

WebText Extractor

WebText Extractor is a web application that allows users to extract text content from any given website URL. It comprises a Python Flask backend for text extraction and a Next.js frontend for user interaction.

Features

  • Extract text content from any website URL.
  • Simple and intuitive user interface.
  • Efficient text extraction using Python's BeautifulSoup library.
  • Built-in error handling for robust performance.
  • Easily deployable on various platforms.
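As a rough illustration of the Beautiful Soup extraction mentioned above, a minimal sketch might look like the following. The function name `extract_text` and the sample HTML are made up for this example; the project's actual extraction logic may differ.

```python
from bs4 import BeautifulSoup


def extract_text(html: str) -> str:
    """Return the readable text of an HTML document (illustrative helper)."""
    soup = BeautifulSoup(html, "html.parser")

    # Remove elements whose contents are not meant to be read as page text.
    for tag in soup(["script", "style", "noscript"]):
        tag.decompose()

    # Join the remaining text nodes with newlines, trimming whitespace.
    return soup.get_text(separator="\n", strip=True)


# Hypothetical sample page, used only to exercise the helper.
sample = """
<html><head><title>Demo</title><style>p { color: red; }</style></head>
<body><h1>Hello</h1><p>First paragraph.</p><script>var x = 1;</script></body></html>
"""
print(extract_text(sample))
```

Removing `script` and `style` elements before calling `get_text` keeps JavaScript and CSS source out of the result.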

Getting Started

Follow these steps to get started with WebText Extractor:

Prerequisites

  • Node.js
  • Python 3

Installation

  1. Clone this repository to your local machine.
git clone https://github.com/imlakshay08/webtext-extractor.git
  2. Navigate to the project directory.
cd webtext-extractor
  3. Install frontend dependencies.
npm install

Usage

  1. Start the Flask backend server.
python app.py
  2. Start the Next.js frontend server.
npm run dev
  3. Open your browser and navigate to http://localhost:3000 to access the WebText Extractor application.
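This README does not document the backend's API, so the following is only a sketch of what a Flask extraction endpoint with basic error handling could look like. The `/extract` route, the JSON field names, the status codes, and the `is_http_url` helper are all assumptions for illustration, not the project's confirmed interface. The page is fetched with the standard library's `urllib` to keep the example small; the real backend may use a different HTTP client.

```python
from urllib.parse import urlparse
from urllib.request import Request, urlopen

from bs4 import BeautifulSoup
from flask import Flask, jsonify, request

app = Flask(__name__)


def is_http_url(value) -> bool:
    """Accept only absolute http(s) URLs that include a host (hypothetical helper)."""
    if not isinstance(value, str):
        return False
    parts = urlparse(value)
    return parts.scheme in ("http", "https") and bool(parts.netloc)


@app.route("/extract", methods=["POST"])  # hypothetical route name
def extract():
    # silent=True returns None instead of raising when the body is not JSON.
    payload = request.get_json(silent=True)
    url = payload.get("url") if isinstance(payload, dict) else None
    if not is_http_url(url):
        return jsonify({"error": "Provide an absolute http(s) URL."}), 400

    try:
        # The timeout keeps a slow site from hanging the request.
        req = Request(url, headers={"User-Agent": "webtext-sketch"})
        with urlopen(req, timeout=10) as resp:
            charset = resp.headers.get_content_charset() or "utf-8"
            html = resp.read().decode(charset, errors="replace")
    except (OSError, LookupError) as exc:
        # URLError, HTTPError and socket timeouts derive from OSError;
        # LookupError covers an unknown charset name.
        return jsonify({"error": f"Could not fetch the page: {exc}"}), 502

    soup = BeautifulSoup(html, "html.parser")
    for tag in soup(["script", "style", "noscript"]):
        tag.decompose()
    return jsonify({"text": soup.get_text(separator="\n", strip=True)})
```

Under these assumptions, a frontend would POST a body such as `{"url": "https://example.com"}` and receive either a `text` field or an `error` field. The sketch leaves out the `app.run()` call that `python app.py` would need in order to start the server.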

Contributing

Contributions are welcome! Feel free to fork this repository, make changes, and submit pull requests. Please follow the contribution guidelines outlined in the CONTRIBUTING.md file.

Acknowledgements

  • Flask - Python web framework
  • Next.js - React framework
  • Beautiful Soup - Python library for web scraping
