Image Quality Issues

Describe Quality Issues in Images Taken by People Who Are Blind

A panel of 8 images exemplifying different quality issues. The first image has no flaws and has the caption “A dry golden leaf on a brownish red background.” The second image suffers from blur and has the caption “A gold thermostat or compass attached to a piece of wood.” The third image is too bright and has the caption “A black Acer brand laptop with monitor off.”
The fourth image is too dark and has the caption “A person holding a Chap Stick brand lip balm.” The fifth image suffers from poor framing and has the caption “A hand holding a blue Kraft Foods jar.” The sixth image is partially obscured and has the caption “A coin placed on a left hand or palm.” The seventh image is rotated and has the caption “A green bag of potato chips from the store.” The eighth image is unrecognizable and has the caption “Quality issues are too severe to recognize visual content.”

Overview

Motivated by the aim to tie the assessment of image quality to practical vision tasks, we introduce a new image quality assessment dataset that emerges from a real use case. Our dataset is built around 39,181 images that were taken by people who are blind who were authentically trying to learn about images they took using the VizWiz mobile phone application. Users submitted these images to overcome real visual challenges that they faced in their daily lives. Of these images, 17% were submitted to collect image captions from remote humans. The remaining 83% were submitted with a question to collect answers to their visual questions. For each image, we asked crowdworkers to either supply a caption describing it or clarify that the quality issues are too severe for them to be able to create a caption. We call this task the unrecognizability classification task. We also ask crowdworkers to label each image with perceived quality flaws: blur, overexposure (bright), underexposure (dark), improper framing, obstructions, and rotated views. We call this task the quality flaws classification task. Altogether, we call this dataset VizWiz-QualityIssues. Algorithms for these tasks can be of immediate use to blind photographers.

Dataset

The VizWiz-Image Quality Issues dataset includes:

  • 23,431 training images
  • 7,750 validation images
  • 8,000 test images
  • number of votes, out of five crowd workers, for quality flaws and unrecognizability for each image

The download files are organized as follows:

  • Images: training, validation, and test sets
  • Annotations and API:
    • Images are split into three JSON files: train, validation and test. Quality Issues are publicly shared for the train and validation splits and hidden for the test split.
    • API is provided to demonstrate how to parse the JSON files and evaluate methods against the ground truth.
    • Details about each image are in the following format:
'flaws': {'BLR': 5,
  'BRT': 3,
  'DRK': 0,
  'FRM': 1,
  'NON': 0,
  'OBS': 0,
  'OTH': 0,
  'ROT': 0},
 'image': 'VizWiz_train_00022585.jpg',
 'unrecognizable': 4

Publications

The new dataset is described in the following publication:

Assessing Image Quality Issues for Real-World Problems
Tai-Yin Chiu, Yinan Zhao, and Danna Gurari. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2020.

Code

Code for the baseline models in the paper and evaluation: link

Contact Us

For questions, please send them to Tai-Yin Chiu at chiu.taiyin@utexas.edu.