Combining Reasoning and Learning for Multi-stage Inference in Computational Sustainability and Scientific Discovery

Xue, Yexiang

Combining Reasoning and Learning for Multi-stage Inference in Computational Sustainability and Scientific Discovery

dc.contributor.author	Xue, Yexiang
dc.contributor.chair	Selman, Bart
dc.contributor.committeeMember	Gomes, Carla P.
dc.contributor.committeeMember	Joachims, Thorsten
dc.contributor.committeeMember	Hopcroft, John E.
dc.date.accessioned	2018-10-23T13:23:08Z
dc.date.available	2020-06-04T06:01:59Z
dc.date.issued	2018-05-30
dc.description.abstract	Problems at the intersection of reasoning, optimization, and learning often involve multi-stage inference. For example, making decisions based on machine learning models often leads to multi-stage inference problems where probabilistic models learned from data are embedded as first-stage subproblems within a global second-stage problem for decision-making. Multi-agent reasoning also involves multi-stage inference, since the reasoning of any given agent has to incorporate the goals of the other agents. As a result, the decision processes of the other agents are embedded as first-stage subproblems within the overall decision-making problem of that agent. With formal complexities beyond NP, multi-stage inference problems are often highly intractable. In this thesis, I introduce a novel computational framework, based on embeddings, to tackle multi-stage inference problems. Our embedding technique approximates the intractable sub-problems of a multi-stage inference problem through a series of novel representations. We then embed these representations into the global problem, effectively reducing a multi-stage inference to a single-stage inference. As one example, I present a novel way to encode the reward allocation problem for a two-stage organizer--agent game as a single-stage optimization. The encoding embeds an approximation of the agents' decision-making process into the organizer's problem in the form of linear constraints. We apply this methodology to eBird, a well-established citizen-science program for collecting bird observations, in a game called Avicaching. Our AI-based reward allocation was shown to be highly effective, surpassing the expectations of the eBird organizers and bird conservation experts. As another example, I present a novel constant approximation algorithm to solve stochastic optimization problems which identify the optimal policy that maximizes the expectation of a stochastic objective. To tackle this problem, I propose the embedding of its intractable counting subproblems as queries to NP oracles subject to additional XOR constraints. As a result, the entire problem is encoded as a single NP-equivalent optimization. This approach outperforms state-of-the-art solvers based on variational inference and MCMC sampling, on probabilistic inference benchmarks, deep learning applications, and a novel decision-making application in network design for wildlife conservation. I also apply the embedding technique to automated reasoning and machine learning for dimensionality reduction in scientific discovery. As one example, I propose the use of embeddings based on Fourier analysis as a compact representation of high-dimensional probability distributions. As a second example, I show how human computation, crowdsourcing, and parallel computation can identify key backdoor information, thereby drastically reducing the computation time from days to minutes in a dimensionality reduction application with complex physical constraints. Our novel integration of reasoning and learning has led to the discovery of new solar light absorbers by solving a dimensionality reduction problem to characterize the crystal structure of metal oxide materials using X-ray diffraction data.
dc.identifier.doi	https://doi.org/10.7298/X4V986B4
dc.identifier.other	Xue_cornellgrad_0058F_10840
dc.identifier.other	http://dissertations.umi.com/cornellgrad:10840
dc.identifier.other	bibid: 10489534
dc.identifier.uri	https://hdl.handle.net/1813/59449
dc.language.iso	en_US
dc.rights	Attribution-NonCommercial 4.0 International	*
dc.rights.uri	https://creativecommons.org/licenses/by-nc/4.0/	*
dc.subject	Computer science
dc.title	Combining Reasoning and Learning for Multi-stage Inference in Computational Sustainability and Scientific Discovery
dc.type	dissertation or thesis
dcterms.license	https://hdl.handle.net/1813/59810
thesis.degree.discipline	Computer Science
thesis.degree.grantor	Cornell University
thesis.degree.level	Doctor of Philosophy
thesis.degree.name	Ph. D., Computer Science

Files

Original bundle

Now showing 1 - 1 of 1

Name:: Xue_cornellgrad_0058F_10840.pdf
Size:: 7.2 MB
Format:: Adobe Portable Document Format

Download

Collections

Cornell Theses and Dissertations