Skip to content

abhi777/Cloud-based-Web-Scraper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

39 Commits
 
 
 
 
 
 

Repository files navigation

Scrapify v1.0.10

This is a Web Application developed using Python which can scrap Data from the web and save the extracted Data in Google Cloud Storage. It's also very convenient to use, even for a Non-Technical Person as it provides a really nice UI.

To check out the live version of my project, Click Here...

image

First, We need to signup by clicking on the Sign Up section present in the side bar and Login using the newly created credentials.

image image

It needs various inputs like:

  • Name of the GCS Bucket
  • A text file containing a list of links of webpages line by line
  • A json file containing the credentials of the GCS(Google Cloud Storage)
  • Types of Data to be scraped

Executing the script after providing the inputs would start saving the required data into the GCS Bucket.

image

image

image