Cornell University
Library
Cornell UniversityLibrary

eCommons

Help
Log In(current)
  1. Home
  2. Cornell University Graduate School
  3. Cornell Theses and Dissertations
  4. Learning from Less: Improving and Understanding Model Selection in Penalized Machine Learning Problems

Learning from Less: Improving and Understanding Model Selection in Penalized Machine Learning Problems

File(s)
Seto_cornellgrad_0058F_12148.pdf (8.91 MB)
Permanent Link(s)
https://doi.org/10.7298/fz7r-bz15
https://hdl.handle.net/1813/103002
Collections
Cornell Theses and Dissertations
Author
Seto, Skyler
Abstract

Model selection is the task of selecting a "good" model from a set of candidate models given data. In machine learning, it is important that models fit the training data well, however it is more important for a model to generalize to unseen data. Additionally, it is desirable for a model to be as small as possible allowing for deployment in low-resource settings. In this thesis, we focus on three problems where the structure of the problem benefits from reducing the size of the models. In the first problem setting, we explore word embedding models and propose a framework in which we connect popular word embedding methods to low-rank matrix models and suggest new models for computing word vectors. In the second problem setting, we explore sparsity-inducing penalties for deep neural networks in order to obtain highly sparse networks which perform competitively with their over-parametrized counterparts. Finally, we explore the task of robot planar pushing and propose a novel penalty which adapts the parameters of a neural network to unseen examples allowing for robots to better interact in unseen environments. Our results on simulations and real-world data applications indicate that penalization is effective for learning models which perform well in settings with less computational budget, storage, or labeled data.

Description
142 pages
Date Issued
2020-08
Keywords
Deep Learning
•
Machine Learning
•
Matrix Factorization
•
Model Selection
•
Penalization
•
Regularization
Committee Chair
Wells, Martin Timothy
Committee Member
Wilson, Andrew Gordon
Joachims, Thorsten
Degree Discipline
Statistics
Degree Name
Ph. D., Statistics
Degree Level
Doctor of Philosophy
Rights
Attribution 4.0 International
Rights URI
https://creativecommons.org/licenses/by/4.0/
Type
dissertation or thesis
Link(s) to Catalog Record
https://catalog.library.cornell.edu/catalog/13277855

Site Statistics | Help

About eCommons | Policies | Terms of use | Contact Us

copyright © 2002-2026 Cornell University Library | Privacy | Web Accessibility Assistance