Cornell University
Library
Cornell UniversityLibrary

eCommons

Help
Log In(current)
  1. Home
  2. Cornell University Graduate School
  3. Cornell Theses and Dissertations
  4. Advances in Sparse and Bayesian Optimization for Autonomous Scientific Discovery

Advances in Sparse and Bayesian Optimization for Autonomous Scientific Discovery

File(s)
Ament_cornellgrad_0058F_13285.pdf (17.07 MB)
Permanent Link(s)
https://doi.org/10.7298/rg9z-6p14
https://hdl.handle.net/1813/112894
Collections
Cornell Theses and Dissertations
Author
Ament, Sebastian
Abstract

Scientists are increasingly leveraging modern computational methods for the analysis of experimental data and the design of new experiments in order to enable and accelerate scientific progress. Particularly valuable to scientific research are sparse, interpretable models, uncertainty quantification, and the minimization of the number of experiments that are required to achieve a scientific end. The fields of sparse and Bayesian optimization (BO) constitute a highly suitable basis for tackling these scientific problems and, despite considerable prior work, contain many questions that require further inquiry: Can we design algorithms that can outperform existing ones on key problems? What are the precise conditions under which an algorithm can determine a sparse model from little data? How can machines best design scientific experiments to minimize their cost? This thesis puts forth algorithmic and theoretical advances that aim to answer these questions. Part I provides an overview of the main contributions of this thesis. In Part II, we develop novel theoretical insights on sparsity-promoting algorithms and propose per- formant new algorithms. In Part III, we propose exact methods that reduce the complexity of a critical step in first-order BO from quadratic to linear in the dimensionality of the input. In Part IV, we focus on applications in scientific discovery, a highlight being the Scientific Autonomous Reasoning Agent (SARA), which was deployed at the Cornell High-Energy Synchrotron Source (CHESS) and the Stanford Linear Accelerator Center (SLAC), accelerating the acquisition of relevant scientific data for materials discovery by orders of magnitude. We conclude with future research directions in Part V.

Description
283 pages
Date Issued
2022-12
Keywords
Active Learning
•
Autonomous
•
Bayesian Optimization
•
Gaussian Process
•
Materials Science
•
Sparsity
Committee Chair
Gomes, Carla
Committee Member
Bindel, David
Selman, Bart
Degree Discipline
Computer Science
Degree Name
Ph. D., Computer Science
Degree Level
Doctor of Philosophy
Rights
Attribution 4.0 International
Rights URI
https://creativecommons.org/licenses/by/4.0/
Type
dissertation or thesis
Link(s) to Catalog Record
https://newcatalog.library.cornell.edu/catalog/15644057

Site Statistics | Help

About eCommons | Policies | Terms of use | Contact Us

copyright © 2002-2026 Cornell University Library | Privacy | Web Accessibility Assistance