GimageScrape

A simple tool that scrapes Google Images for each search query you specify

Instructions:

  1. Clone the repository and enter it: git clone https://github.com/gigatesseract/GimageScrape.git && cd GimageScrape
  2. Install virtualenv (recommended): pip install virtualenv
  3. Create a virtual environment: virtualenv env
  4. Activate the virtual environment: source env/bin/activate (run deactivate to leave it)
  5. Install the requirements: pip install -r requirements.txt
  6. Populate to_search.txt with your search queries, one query per line. Queries may contain spaces. See to_search.txt for more details.
  7. cd googleScraper
  8. In the settings.py file, add the following line:
    IMAGES_STORE = "path/to/valid/dir"
    where the value is a path to any valid directory
  9. cd ..
  10. scrapy crawl search
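
Step 6 can also be scripted when the queries come from another program. The sketch below is not part of GimageScrape; write_queries is a hypothetical helper, and it assumes to_search.txt is a plain file with one query per line and no other syntax. It overwrites the target file, so check the notes inside the shipped to_search.txt first.

```python
from pathlib import Path


def write_queries(queries, path="to_search.txt"):
    """Write one search query per line, skipping blanks and duplicates.

    Hypothetical helper: assumes the file format is simply one query per line.
    """
    seen = set()
    cleaned = []
    for query in queries:
        # Collapse stray whitespace and embedded newlines so that each
        # query occupies exactly one line; inner single spaces are kept.
        query = " ".join(query.split())
        if query and query not in seen:
            seen.add(query)
            cleaned.append(query)
    # Overwrites any existing file at `path`.
    Path(path).write_text("".join(q + "\n" for q in cleaned), encoding="utf-8")
    return cleaned
```

For example, calling write_queries(["golden retriever puppy", "red panda"]) from the repository root would produce a two-line to_search.txt.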

Sit back and relax. A folder called images will be created in the directory set as IMAGES_STORE in settings.py. Each search query creates its own folder inside images and fills it with 20 images.
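
To check a finished crawl against the layout described above, a small script can count the files in each query folder. This is a rough sketch, not part of the project: count_images is a hypothetical name, and it assumes the query folders sit directly under IMAGES_STORE/images with the image files directly inside them. How the folders are named is not stated here, so the script just reports whatever names it finds.

```python
from pathlib import Path


def count_images(images_store):
    """Return {folder_name: file_count} for each folder in <images_store>/images.

    Hypothetical helper: assumes one sub-folder per query, with the
    downloaded files stored directly inside it.
    """
    root = Path(images_store) / "images"
    if not root.is_dir():
        # Nothing has been downloaded yet, or IMAGES_STORE points elsewhere.
        return {}
    return {
        folder.name: sum(1 for entry in folder.iterdir() if entry.is_file())
        for folder in sorted(root.iterdir())
        if folder.is_dir()
    }
```

Pass the same path you set as IMAGES_STORE. A folder reporting fewer than 20 files suggests that some downloads for that query did not complete.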

License

MIT
