Cornell University
Library

eCommons

Understanding and Directing What Models Learn

File(s)
Thompson_cornellgrad_0058F_12369.pdf (25.57 MB)
Permanent Link(s)
https://doi.org/10.7298/5m5j-eg95
https://hdl.handle.net/1813/103379
Collections
Cornell Theses and Dissertations
Author
Thompson, Laure Jean
Abstract

Machine learning and statistical methods, such as unsupervised semantic models, make massive cultural heritage collections more explorable and analyzable. These models capture many underlying patterns of raw textual and visual materials, but neither model creators nor model users fully understand which specific patterns are learned by a given model or under what conditions a particular pattern becomes more learnable. In this dissertation I address two core questions: (i) what do models actually learn? and (ii) how can we direct what they learn? Instead of proposing new models, I focus on expanding the affordances, as well as our understanding, of existing ones that are used by scholars in the humanities and social sciences. In the first part of this dissertation, I study what models learn by way of expanding the ways in which they can be used. In the second part, I investigate how existing models can be directed away from known, uninteresting structures via corpus- and representation-level interventions. Throughout this work, I show how machine learning and statistical methods provide an opportunity to view collections from alien, defamiliarized perspectives that can call into question the boundaries of established categories. Likewise, I show how the uses of computational methods within humanities and social science scholarship can test, challenge, and expand the affordances of these methods. Ultimately, this dissertation highlights some of the many ways in which machine learning and the humanities help one another.

Description
203 pages
Date Issued
2020-12
Keywords
Artificial Intelligence
Digital Humanities
Image Analysis
Magical Gems
Science Fiction
Text Analysis
Committee Chair
Mimno, David
Committee Member
Barrett, Caitlín Eilís
Kozen, Dexter
Degree Discipline
Computer Science
Degree Name
Ph. D., Computer Science
Degree Level
Doctor of Philosophy
Rights
Attribution-NonCommercial-ShareAlike 4.0 International
Rights URI
https://creativecommons.org/licenses/by-nc-sa/4.0/
Type
dissertation or thesis
Link(s) to Catalog Record
https://newcatalog.library.cornell.edu/catalog/13312182
