COMPUTER VISION FOR VISUALLY IMPAIRED PEOPLE : ANALYSIS ON THE VIZWIZ DATASET
dc.contributor.author | Ma, Yezhou | |
dc.contributor.author | Wang, Kuan-Wen | |
dc.contributor.chair | Azenkot, Shiri | |
dc.contributor.committeeMember | Estrin, Deborah | |
dc.date.accessioned | 2020-08-10T19:48:49Z | |
dc.date.available | 2020-08-10T19:48:49Z | |
dc.date.issued | 2020-05 | |
dc.description | 31 pages | |
dc.description.abstract | In this study, we explore analyze the VizWiz dataset in a computer vision perspective, and identify challenges and potential solutions toward building computer vision model for visually impaired people. We conducted analysis on the images and question-answer(QA) pairs on the VizWiz dataset to identify common domains of problems. We also targeted object detection as a first step. By analyzing and mapping the QA pairs to ImageNet labels, we found that building a new set of labels specifically designed for this domain would be crucial. We then inspect and build a vocabulary set for the object detection task. | |
dc.identifier.doi | https://doi.org/10.7298/5mg1-s592 | |
dc.identifier.other | Ma_cornell_0058O_10832 | |
dc.identifier.other | http://dissertations.umi.com/cornell:10832 | |
dc.identifier.other | Wang_cornell_0058O_10834 | |
dc.identifier.other | http://dissertations.umi.com/cornell:10834 | |
dc.identifier.uri | https://hdl.handle.net/1813/70218 | |
dc.language.iso | en | |
dc.title | COMPUTER VISION FOR VISUALLY IMPAIRED PEOPLE : ANALYSIS ON THE VIZWIZ DATASET | |
dc.type | dissertation or thesis | |
dcterms.license | https://hdl.handle.net/1813/59810 | |
thesis.degree.discipline | Information Science | |
thesis.degree.grantor | Cornell University | |
thesis.degree.level | Master of Science | |
thesis.degree.name | M.S., Information Science |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- Wang_cornell_0058O_10834.pdf
- Size:
- 16.87 MB
- Format:
- Adobe Portable Document Format