Skip to content

Scrapes images and metadata from ukiyo-e.org to create a woodblock print dataset

Notifications You must be signed in to change notification settings

sqrtNOT/YO12-downloader

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 

Repository files navigation

YO12-downloader

YOOOOOOOOOOOO! pon The purpose of this code is to create a local archive of the images and metadata found on ukiyo-e.org.

Data Model

For a concrete example of the data model consider a print like this one: https://ukiyo-e.org/image/mfa/sc130578
This print would result in a filepath of: Katsushika Hokusai/mfa_sc130578.jpg

The date, artist, and description are embedded in the exif metadata description tag as essentially a stable diffusion style prompt:
Katsushika Hokusai, Surugadai in Edo (Tôto sundai), from the series Thirty-six Views of Mount Fuji (Fugaku sanjûrokkei), 1830-31

Limitations

  • Complete metadata is not available for all images.
  • Some of the metadata has not been fully translated and parts remain in Japanese.
  • There are semi-duplicated data with some popular prints such as the great wave having many different scans of both the same and different print runs.
  • A handful of images on ukiyo-e.org contain errors and won't be downloaded
  • A number of the images are low quality, low resolution, and are non-aligned photographs that include picture frames.

About

Scrapes images and metadata from ukiyo-e.org to create a woodblock print dataset

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages