Cornell University
Library
Cornell UniversityLibrary

eCommons

Help
Log In(current)
  1. Home
  2. Cornell University Graduate School
  3. Cornell Theses and Dissertations
  4. Reasoning in the Wild

Reasoning in the Wild

File(s)
Zhao_cornellgrad_0058F_14909.pdf (16.36 MB)
Permanent Link(s)
https://doi.org/10.7298/8wqw-ze23
https://hdl.handle.net/1813/117673
Collections
Cornell Theses and Dissertations
Author
Zhao, Wenting
Abstract

Teaching machines to reason has been a longstanding goal in artificial intelligence. Recently, the rapid advancement of language modeling has advanced this vision, opening up new possibilities for automated reasoning. Although existing benchmarks have demonstrated strong reasoning performance of LMs, it remains unclear how effectively such models reason in real-world scenarios, where queries differ significantly in complexity and style from the standard evaluation datasets. This dissertation identifies two main obstacles that prevent LLMs from reasoning effectively in realistic settings: (1) distributional mismatches between standard training data and the user queries encountered in the wild, and (2) the difficulty and cost associated with collecting expert-annotated training data for complex reasoning tasks. To better assess reasoning performance under realistic conditions, we first introduce data sources and evaluation benchmarks directly collected from real-world use cases, establishing representative real-world reasoning challenges. Analysis of these benchmarks reveals significant limitations in contemporary language models, highlighting areas that require progress. Subsequently, we explore training approaches using alternative supervision that enable reasoning without reliance on manually annotated data. We investigate structural supervision, an approach that incorporates prior knowledge about the underlying structure of reasoning tasks into latent variable models, enabling them to better handle different reasoning scenarios, such as multi-hop inference and abductive reasoning. Additionally, we explore using language agents for complex reasoning tasks. Language agents utilize environmental feedback, where they learn iteratively by interacting with an external environment rather than from explicit annotations.

Description
201 pages
Date Issued
2025-05
Committee Chair
Cardie, Claire
Committee Member
Rush, Alexander
Ellis, Kevin
Kleinberg, Robert
Degree Discipline
Computer Science
Degree Name
Ph. D., Computer Science
Degree Level
Doctor of Philosophy
Rights
Attribution 4.0 International
Rights URI
https://creativecommons.org/licenses/by/4.0/
Type
dissertation or thesis
Link(s) to Catalog Record
https://newcatalog.library.cornell.edu/catalog/16938219

Site Statistics | Help

About eCommons | Policies | Terms of use | Contact Us

copyright © 2002-2026 Cornell University Library | Privacy | Web Accessibility Assistance