rayjasson98/image-scraper

Python Image Scraper

💻 Commands

  • To scrape images using FastClass, run
    fcd -c BING -s 0 -o raw_images config/fastclass.csv
  • To scrape images using google-images-download, run
    googleimagesdownload -cf ../config/google_images_download.json
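  • To rename image files to zero-padded sequential numbers, run the following inside the image folder, replacing extension with the actual file extension (for example, jpg)
    ls | cat -n | while read -r n f; do mv "$f" "$(printf "%03d.extension" "$n")"; done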
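
The rename one-liner above can also be sketched in Python, which may be easier to adapt on systems without a POSIX shell. This is a minimal sketch, not part of the repository: the function name `rename_sequentially`, its parameters, and the temporary `__tmp_N__` names are assumptions made for illustration. Like the shell command, it gives every file the same extension; unlike it, it renames in two passes so that a new name such as 002.jpg cannot collide with a file that is still waiting to be renamed.

```python
from pathlib import Path


def rename_sequentially(folder, extension, width=3, start=1):
    """Rename every file in `folder` to a zero-padded number plus `extension`.

    Hypothetical helper for illustration; files are numbered in sorted
    name order, starting at `start`.
    """
    folder = Path(folder)
    files = sorted(p for p in folder.iterdir() if p.is_file())

    # Pass 1: move each file to a temporary name so that a final name
    # (e.g. 002.jpg) can never clash with a file not yet renamed.
    # Assumes no existing file is already called __tmp_N__.
    staged = []
    for index, path in enumerate(files, start=start):
        temp = folder / f"__tmp_{index}__"
        path.rename(temp)
        staged.append((index, temp))

    # Pass 2: move the temporary files to their final zero-padded names.
    renamed = []
    for index, temp in staged:
        target = folder / f"{index:0{width}d}.{extension}"
        temp.rename(target)
        renamed.append(target)
    return renamed
```

For example, `rename_sequentially("raw_images/some_class", "jpg")` would rename the files in that folder to 001.jpg, 002.jpg, and so on; `raw_images` is the output folder from the FastClass command, and `some_class` is a placeholder subfolder name.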

‼️ Things to Note

  • Refer to the google-images-download and FastClass documentation for usage details.
  • The Google crawler in FastClass does not work, so use another crawler such as BING.
  • If you run into issues with google-images-download, see issues #360 and #298 to learn how to install the forked version that contains the bug fix. The original repository no longer appears to be maintained by its owner.
  • You can use TinyPNG and its VS Code extension to resize and compress your scraped images. This requires a TinyPNG API key.
  • This repo includes a devcontainer.json configuration. If you have Docker installed, you can start the container to get all packages and extensions installed in one go.

About

Python commands to scrape images from the web.