eCommons

 

COMPUTER VISION FOR VISUALLY IMPAIRED PEOPLE : ANALYSIS ON THE VIZWIZ DATASET

dc.contributor.author: Ma, Yezhou
dc.contributor.author: Wang, Kuan-Wen
dc.contributor.chair: Azenkot, Shiri
dc.contributor.committeeMember: Estrin, Deborah
dc.date.accessioned: 2020-08-10T19:48:49Z
dc.date.available: 2020-08-10T19:48:49Z
dc.date.issued: 2020-05
dc.description: 31 pages
dc.description.abstract: In this study, we analyze the VizWiz dataset from a computer vision perspective and identify challenges and potential solutions toward building computer vision models for visually impaired people. We analyzed the images and question-answer (QA) pairs in the VizWiz dataset to identify common problem domains. We also targeted object detection as a first step. By analyzing the QA pairs and mapping them to ImageNet labels, we found that building a new set of labels specifically designed for this domain would be crucial. We then inspected and built a vocabulary set for the object detection task.
dc.identifier.doi: https://doi.org/10.7298/5mg1-s592
dc.identifier.other: Ma_cornell_0058O_10832
dc.identifier.other: http://dissertations.umi.com/cornell:10832
dc.identifier.other: Wang_cornell_0058O_10834
dc.identifier.other: http://dissertations.umi.com/cornell:10834
dc.identifier.uri: https://hdl.handle.net/1813/70218
dc.language.iso: en
dc.title: COMPUTER VISION FOR VISUALLY IMPAIRED PEOPLE : ANALYSIS ON THE VIZWIZ DATASET
dc.type: dissertation or thesis
dcterms.license: https://hdl.handle.net/1813/59810
thesis.degree.discipline: Information Science
thesis.degree.grantor: Cornell University
thesis.degree.level: Master of Science
thesis.degree.name: M.S., Information Science

Files

Original bundle
Name: Wang_cornell_0058O_10834.pdf
Size: 16.87 MB
Format: Adobe Portable Document Format