GitHub - zzhanghub/crawler_pixabay: Crawling Images on Pixabay by category

Crawling Images on Pixabay by Category

An easy to use crawler for making datasets
⭐ Project Home »

Pixabay is an international, copyleft, and free-to-use website for sharing photos, illustrations, vector graphics, film footage, and music.

You can copy, modify, distribute, and use the images, even for commercial purposes, all without asking for permission or giving credit to the artist.

Here we release an easy-to-use crawler. You can specify the image categories, the resolution, and other options. The crawler downloads each image together with its corresponding license.

Prerequisites

pip install python-pixabay

Usage

1. Configure `pixabay_config.json`

keywords_file: Path to the JSON file that defines your custom categories;
save_root: Directory where the images and licenses are stored;
pixabay_api_key: Your Pixabay API key (sign up on Pixabay to get one);
url_keys: Sets the resolution of the downloaded images. One of:
- previewURL: Low-resolution image with a maximum width or height of 150 px;
- webformatURL: Medium-sized image with a maximum width or height of 640 px;
- largeImageURL: Scaled image with a maximum width or height of 1280 px;
npage: Number of result pages to search. The larger the value, the larger the search scope.

{
    "keywords_file": "./keywords.json",
    "save_root": "./Images",
    "pixabay_api_key": "Your Key",
    "url_keys": "webformatURL",
    "npage": 2
}
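
Because a typo in the configuration only surfaces once the crawl is running, it can help to check the file up front. The following is a minimal sketch, not part of the repository: `validate_config` and `load_config` are hypothetical helper names, and the rule that `npage` must be a positive integer is an assumption based on the field descriptions above.

```python
import json

# Keys described in the configuration section above.
REQUIRED_KEYS = ("keywords_file", "save_root", "pixabay_api_key", "url_keys", "npage")
# Resolution options listed under url_keys.
VALID_URL_KEYS = ("previewURL", "webformatURL", "largeImageURL")


def validate_config(config):
    """Return the config unchanged if it looks usable, else raise ValueError."""
    missing = [key for key in REQUIRED_KEYS if key not in config]
    if missing:
        raise ValueError(f"missing config keys: {missing}")
    if config["url_keys"] not in VALID_URL_KEYS:
        raise ValueError(f"url_keys must be one of {VALID_URL_KEYS}")
    # Assumption: npage counts result pages, so it must be an integer >= 1.
    if not isinstance(config["npage"], int) or config["npage"] < 1:
        raise ValueError("npage must be a positive integer")
    return config


def load_config(path="pixabay_config.json"):
    """Read the JSON config file and validate it."""
    with open(path, encoding="utf-8") as handle:
        return validate_config(json.load(handle))
```

Calling `load_config()` before starting a crawl would then fail fast on a missing key or an unknown `url_keys` value.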

2. Configure `keywords.json`

Each entry has the form "Class name": "Search keyword". The class name labels the category, and the search keyword is the query sent to Pixabay.

{
    "camera" : "camera",
    "earphone" : "AirPods"
}
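
To illustrate how the class names and search keywords could be used, here is a small sketch that pairs each class with its query and a target directory. It is an assumption that images are grouped in one subdirectory of `save_root` per class name; the README does not state the actual layout, and `class_directories` is a hypothetical helper.

```python
import os


def class_directories(keywords, save_root):
    """Map each class name to (search keyword, directory for that class)."""
    # Assumption: one subdirectory per class name under save_root.
    return {
        class_name: (keyword, os.path.join(save_root, class_name))
        for class_name, keyword in keywords.items()
    }


keywords = {"camera": "camera", "earphone": "AirPods"}
for class_name, (keyword, directory) in class_directories(keywords, "./Images").items():
    print(f"{class_name}: search for {keyword!r}, save under {directory}")
```

With this layout, the "earphone" class would collect the results of the "AirPods" query, which is why the class name and the keyword are allowed to differ.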

3. Run by

python pixabay_crawler.py
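
For orientation, a run roughly amounts to one search request per keyword and page, followed by picking the URL named by `url_keys` from each result. The sketch below is not the repository's `pixabay_crawler.py` (which uses the `python-pixabay` package); it only builds request URLs for the public Pixabay search endpoint with the standard library. The helper names and the `per_page` default are illustrative assumptions.

```python
from urllib.parse import urlencode

API_ENDPOINT = "https://pixabay.com/api/"


def build_search_url(api_key, keyword, page, per_page=20):
    """Build a Pixabay search URL for one keyword and one result page."""
    params = {"key": api_key, "q": keyword, "page": page, "per_page": per_page}
    return API_ENDPOINT + "?" + urlencode(params)


def iter_requests(api_key, keywords, npage):
    """Yield (class name, search URL) for every keyword and pages 1..npage."""
    for class_name, keyword in keywords.items():
        for page in range(1, npage + 1):
            yield class_name, build_search_url(api_key, keyword, page)


def pick_image_url(hit, url_key):
    """Select the image URL for the configured resolution from one search hit."""
    return hit[url_key]


for class_name, url in iter_requests("Your Key", {"camera": "camera"}, npage=2):
    print(class_name, url)
```

This also shows why a larger `npage` widens the search scope: each additional page adds another request per keyword.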
