COMPUTER VISION FOR VISUALLY IMPAIRED PEOPLE : ANALYSIS ON THE VIZWIZ DATASET
Collections
Author
Ma, Yezhou
Wang, Kuan-Wen
Abstract
In this study, we explore analyze the VizWiz dataset in a computer vision perspective, and identify challenges and potential solutions toward building computer vision model for visually impaired people. We conducted analysis on the images and question-answer(QA) pairs on the VizWiz dataset to identify common domains of problems. We also targeted object detection as a first step. By analyzing and mapping the QA pairs to ImageNet labels, we found that building a new set of labels specifically designed for this domain would be crucial. We then inspect and build a vocabulary set for the object detection task.
Description
31 pages
Date Issued
2020-05
Committee Chair
Azenkot, Shiri
Committee Member
Estrin, Deborah
Degree Discipline
Information Science
Degree Name
M.S., Information Science
Degree Level
Master of Science
Type
dissertation or thesis
Link(s) to Catalog Record