Skip to content

Cube++ is a novel dataset collected for illumination estimation problem. It has 4890 raw 18-megapixel images, each containing a SpyderCube color target in their scenes, manually labelled categories, and ground truth illumination chromaticities.

Visillect/CubePlusPlus

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

55 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Cube++

Image examples

Cube++ is a novel dataset for the color constancy problem that continues on the Cube+ dataset. It includes 4890 images of different scenes under various conditions. For calculating the ground truth illumination, a calibration object with known surface colors was placed in every scene. The Cube++ dataset was used in ICMV 2020 2nd Illumination Estimation Challenge.

Download

Data is avalilable on zenodo.org and the FTP mirror under CC BY 4.0 license.

  • Cube++ (200GB, 4890 images) with PNG0 can be downloaded with the following commands
pip install zenodo-get
zenodo_get https://zenodo.org/record/4153431 --output-dir=Cube++
cd Cube++/
wget https://storage.yandexcloud.net/cubepng0/PNG0.zip
wget https://zenodo.org/record/4153431/files/SimpleCube++.zip
  • FTP mirror ftp://vis.iitp.ru/Cube++

Description

  • 4890 raw camera images
  • 200GB of raw and preprocessed data
  • Manual annotations and metadata
  • Various scene illuminations
  • 2GB SimpleCube++ subdataset

Cube++

Images were obtained with sensors of same type on Canon 550D and Canon 600D cameras. As a calibration tool, SpyderCube was used due to its ability to identify multiple illumination sources from different angles. The dataset includes:

  • PNG/{img_id}.png – 16-bit PNG images
  • gt.csv – Ground truth chromaticities answers. Ground truth file. The table contains automatically calculated ground truth values. The columns are: image and for each of the 4 triangles (left, right, left bottom, right bottom) it contains three columns r, g, b with the corresponding RGB illumination estimation. The illumination estimation is normalized so that r + g + b = 1
  • properties.csv – Annotation and metadata file. The table contains the most relevant meta information of the dataset images. It includes the average triangle brightness, manually labeled properties, selected EXIF fields
  • JPG/{img_id}.jpg – JPEG images, for visualization purposes only
  • auxiliary/
    • extra/
      • exif/{img_id}.json – All the extracted EXIF data
      • gt_json/{img_id}.json – Calculated gts, all the data is duplicated in gt.csv or properties csv
      • cam_estimation.csv – Selected EXIF estimations made by camera
      • exif_stat.csv – EXIF fields statistics
    • source/ – the dataset is automatically build from this directory
      • CR2/{img_id}.CR2 – Original raw CR2 images
      • JPG.JSON/{img_id}.jpg.json – JSON markup files. Each file contains manually labeled features and cube coordinates
      • full_estimation.csv – extra markup file for full estimation or partial estimation

See also a more detailed description.

SimpleCube++

In addition to the main Cube++ dataset, small and simpler version of it is prepared. Simple dataset contains 4x downscaled images that have less than 1° difference between left and right ground truth illumination estimation. It includes only 2234 images with single source illumination; consequently, there is only one ground truth per image. The simple dataset weights around 2GB.

Small dataset contains PNG and JPG files, gt.csv with ground truth data and properties.csv with manual annotation data:

  • train
    • PNG/{img_id}.png – 16-bit PNG images with cropped cube area
    • gt.csv – Ground truth chromaticities answers. Ground truth file. The table contains automatically calculated ground truth values. The columns are: image (contains image id) and three columns r, g, b with the corresponding RGB illumination estimation. The illumination estimation is normalized so that r + g + b = 1
  • test – The same format
    • PNG/{img_id}.png
    • gt.csv
  • auxiliary
    • {test/train}_properties.csv – Annotation and metadata file. The table contains the most relevant meta information of the dataset images. It includes image column (contains image id), ds_version (0.0 for Cube images; 1.0 for Cube+ extension and IEC2019 test images; 2.0 for Cube++ extension), manually labeled properties (only daytime; place; illumination; is_sharp; shadows)
    • JPG/{test/train}_{img_id}.jpg – JPEG images with cropped cube area, for visualization purposes only

See also a more detailed description.

Papers

If you use the dataset in your research, please refer to the following paper:

@article{ershov2020cube++,
  title={The Cube++ Illumination Estimation Dataset},
  author={Ershov, Egor and Savchik, Alexey and Semenkov, Illya and Bani{\'c}, Nikola and Belokopytov, Alexander and Senshina, Daria and Ko{\v{s}}{\v{c}}evi{\'c}, Karlo and Suba{\v{s}}i{\'c}, Marko and Lon{\v{c}}ari{\'c}, Sven},
  journal={IEEE Access},
  volume={8},
  pages={227511--227527},
  year={2020},
  publisher={IEEE}
}
@article{ershov2020problems,
  title={Problems of dataset creation for light source estimation},
  author={Ershov, Egor and Belokopytov, Alexander and Savchik, Alex},
  journal={arXiv preprint arXiv:2006.02692},
  year={2020}
}

About

Cube++ is a novel dataset collected for illumination estimation problem. It has 4890 raw 18-megapixel images, each containing a SpyderCube color target in their scenes, manually labelled categories, and ground truth illumination chromaticities.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Languages