Cornell University
Library
Cornell UniversityLibrary

eCommons

Help
Log In(current)
  1. Home
  2. Cornell University Graduate School
  3. Cornell Theses and Dissertations
  4. Language Learning as Language Use: Statistically-based Chunking in Development

Language Learning as Language Use: Statistically-based Chunking in Development

File(s)
McCauley_cornellgrad_0058F_10193.pdf (1.65 MB)
Permanent Link(s)
https://doi.org/10.7298/X4PC30CB
https://hdl.handle.net/1813/47905
Collections
Cornell Theses and Dissertations
Author
McCauley, Stewart
Abstract

While usage-based approaches to language development enjoy considerable support from computational studies, there have been few attempts to answer a key computational challenge posed by usage-based theory: the successful modeling of language learning as language use. I present a usage-based computational model of language acquisition which learns in a purely incremental fashion, through on-line processing based on chunking, and which offers broad, cross-linguistic coverage while uniting comprehension and production processes within a single framework. The model's design reflects memory constraints imposed by the real-time nature of language processing, and is inspired by psycholinguistic evidence for children's sensitivity to the distributional properties of multi-word sequences and for shallow language comprehension based on local information. It learns from corpora of child-directed speech, chunking incoming words together to incrementally build an item-based "shallow parse." When the model encounters an utterance made by the target child, it attempts to generate an identical utterance using the same chunks and statistics involved during comprehension. In Chapter 2, I show that the model achieves high performance across over 200 single-child corpora representing 29 languages from the CHILDES database. It also succeeds in capturing findings from children's production of complex sentence types. In Chapter 3, I show that the model captures key developmental psycholinguistic findings on children's language learning and use. Chapter 4 investigates the use of the model for understanding the different outcomes of child first-language learning versus second-language learning in adults, providing evidence that adult learners may rely on more fine-grained linguistic units. Together, the modeling results presented in this dissertation suggest that much of children's early linguistic behavior may be accounted for by item-based learning through on-line processing of simple distributional cues, consistent with the notion that acquisition can be understood as learning to process language.

Date Issued
2017-01-30
Keywords
Psychology
•
chunking
•
computational modelling
•
corpora
•
language learning
•
psycholinguistics
•
statistical learning
•
Language
•
Cognitive psychology
Committee Chair
Christiansen, Morten H.
Committee Member
Goldstein, Michael H.
Finlay, Barbara L.
Edelman, Shimon J.
Degree Discipline
Psychology
Degree Name
Ph. D., Psychology
Degree Level
Doctor of Philosophy
Rights
Attribution 4.0 International
Rights URI
https://creativecommons.org/licenses/by/4.0/
Type
dissertation or thesis

Site Statistics | Help

About eCommons | Policies | Terms of use | Contact Us

copyright © 2002-2026 Cornell University Library | Privacy | Web Accessibility Assistance