Adversarial Machine Learning Methods for Causal Inference under Unmeasured Confounding
dc.contributor.author | Bennett, Andrew | |
dc.contributor.chair | Kallus, Nathan | en_US |
dc.contributor.committeeMember | Gurvich, Itai | en_US |
dc.contributor.committeeMember | Belongie, Serge | en_US |
dc.contributor.committeeMember | Joachims, Thorsten | en_US |
dc.date.accessioned | 2024-01-31T21:18:31Z | |
dc.date.issued | 2023-05 | |
dc.description.abstract | There has recently been incredible progress in developing both theory and practice for applying machine learning (ML) to important problems in causal inference. However, despite this rapidly growing work, settings in which there is unmeasured confounding have remained relatively under-explored, with many remaining challenges and open problems. In this thesis, we consider various causal inference problems in which there is unmeasured confounding, and investigate how to solve them with by utilizing recent advances in adversarial machine learning. We also consider an application of this theory to more efficient policy learning from observational data. We start in Part I by considering the problem of instrumental variable (IV) regression, which is a common framework for causal inference when confounders are not fully observed, but we instead have access to instrumental variables that can only affect the outcome via the treatment. Chapter 2 proposes a novel approach for solving IV regression using smooth game optimization with neural networks, which is motivated by a novel formulation of the optimally-weighted generalized method of moments as a min-max optimization problem. Then, Chapter 3 extends this approach to a more general class of conditional moment problems, as well as to more general ML classes such as kernel spaces, and it analyzes the theoretical properties of such estimators in detail. Next, in Part II we consider various settings of policy evaluation with unobserved confounding. Chapters 4 and 5 both consider approaches to this problem by extending existing work on optimal balancing to such settings, solving for weighted combinations of the observed outcomes that identify the policy value, by minimizing an adversarial formulation of the corresponding risk. Whereas Chapter 4 does this in single treatment settings, Chapter 5 extends this to infinite-horizon RL settings. Then, Chapter 6 considers an even more general partially observed Markov decision process (POMDP) setting, and proposes a policy evaluation method based on a novel sequential extension of existing proximal causal learning methods. In particular, the methods in Chapters 5 and 6 require solving conditional moment problems, for which we propose to use our methodology from Part I. Finally, in Part III we consider an application of our theory to more efficient policy learning given observational data. In Chapter 7, we consider the popular approach of policy learning via surrogate loss reductions, and show that under the assumption that this approach is valid, the optimal policy parameters are defined by a conditional moment restriction. We show how this restriction can be efficiently solved using our methodology from Part I, and that this results in asymptotically optimal regret and superior empirical performance. | en_US |
dc.description.embargo | 2025-06-13 | |
dc.identifier.doi | https://doi.org/10.7298/4een-gt72 | |
dc.identifier.other | Bennett_cornellgrad_0058F_13625 | |
dc.identifier.other | http://dissertations.umi.com/cornellgrad:13625 | |
dc.identifier.uri | https://hdl.handle.net/1813/113989 | |
dc.language.iso | en | |
dc.rights | Attribution 4.0 International | * |
dc.rights.uri | https://creativecommons.org/licenses/by/4.0/ | * |
dc.subject | Adversarial Machine Learning | en_US |
dc.subject | Causal Inference | en_US |
dc.subject | Conditional Moment Problems | en_US |
dc.subject | Instrumental Variables | en_US |
dc.subject | Off Policy Evaluation | en_US |
dc.subject | Policy Learning | en_US |
dc.title | Adversarial Machine Learning Methods for Causal Inference under Unmeasured Confounding | en_US |
dc.type | dissertation or thesis | en_US |
dcterms.license | https://hdl.handle.net/1813/59810.2 | |
thesis.degree.discipline | Computer Science | |
thesis.degree.grantor | Cornell University | |
thesis.degree.level | Doctor of Philosophy | |
thesis.degree.name | Ph. D., Computer Science |
Files
Original bundle
1 - 1 of 1
No Thumbnail Available
- Name:
- Bennett_cornellgrad_0058F_13625.pdf
- Size:
- 5.51 MB
- Format:
- Adobe Portable Document Format