Cornell University
Library

eCommons

COMPUTER VISION FOR VISUALLY IMPAIRED PEOPLE : ANALYSIS ON THE VIZWIZ DATASET

File(s)
Wang_cornell_0058O_10834.pdf (16.87 MB)
Permanent Link(s)
https://doi.org/10.7298/5mg1-s592
https://hdl.handle.net/1813/70218
Collections
Cornell Theses and Dissertations
Author
Ma, Yezhou
Wang, Kuan-Wen
Abstract

In this study, we analyze the VizWiz dataset from a computer vision perspective and identify challenges and potential solutions toward building computer vision models for visually impaired people. We analyzed the images and question-answer (QA) pairs in the VizWiz dataset to identify common problem domains. We also targeted object detection as a first step. By analyzing the QA pairs and mapping them to ImageNet labels, we found that building a new set of labels designed specifically for this domain would be crucial. We then inspected the data and built a vocabulary set for the object detection task.

Description
31 pages
Date Issued
2020-05
Committee Chair
Azenkot, Shiri
Committee Member
Estrin, Deborah
Degree Discipline
Information Science
Degree Name
M.S., Information Science
Degree Level
Master of Science
Type
dissertation or thesis
Link(s) to Catalog Record
https://catalog.library.cornell.edu/catalog/13254541
