Cornell University
Library

eCommons

Mining Visual Knowledge from Pre-trained Models

File(s)
Tang_cornellgrad_0058F_14373.pdf (95.62 MB)
Permanent Link(s)
https://doi.org/10.7298/avvm-v924
https://hdl.handle.net/1813/116596
Collections
Cornell Theses and Dissertations
Author
Tang, Luming
Abstract

Computer vision has made significant progress in the past decade, primarily due to the dominant supervised learning paradigm, which involves training large-scale neural networks on extensive datasets for each task. However, scalable data and annotation collection often prove to be intractable. In contrast, humans can adapt to new vision tasks with very little data or few labels. This thesis aims to bridge this gap by presenting a practical solution: pre-training deep neural networks on accessible large-scale internet images, and then employing various techniques to adapt these pre-trained models to diverse downstream tasks with minimal or no additional data. In the pre-training stage, I introduce two meta-learning methods to achieve better pre-trained image representations that generalize to novel classes with minimal extra annotations. In the adaptation stage, I demonstrate multiple techniques for effectively adapting pre-trained models to data-constrained downstream tasks such as recognition, dense prediction, 3D generation, and reference-based image completion.

Description
304 pages
Date Issued
2024-08
Keywords
Computer Vision
Generative Model
Machine Learning
Representation Learning
Committee Chair
Hariharan, Bharath
Committee Member
Marschner, Stephen
Snavely, Keith
Degree Discipline
Computer Science
Degree Name
Ph. D., Computer Science
Degree Level
Doctor of Philosophy
Rights
Attribution-NonCommercial 4.0 International
Rights URI
https://creativecommons.org/licenses/by-nc/4.0/
Type
dissertation or thesis
Link(s) to Catalog Record
https://newcatalog.library.cornell.edu/catalog/16611759

