Web Harvest of Minimal Intonational Pairs

File(s)
HowellRooth2009WebHarvest.pdf (188.92 KB)
Permanent Link(s)
https://hdl.handle.net/1813/13079
Collections
Linguistics - Monographs, Papers and Research
Author
Howell, Jonathan
Rooth, Mats
Abstract

This paper describes experiments on gathering spoken-language data from the web that bear on issues in the phonetics-phonology and semantics-pragmatics of intonation. The target data are tokens of fixed word strings such as "than I did", whose intonation varies in a way that correlates with grammatical and pragmatic context. In a web harvest procedure, audio files were identified using a search engine based on speech-to-text, downloaded, and cut to the relevant segment under program control. In an application of such a database, an SVM classifier was trained to make a grammatically determined distinction in intonation based on purely acoustic cues. Sources of error in the retrieval are quantified.
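
As an illustration of the "cut to a relevant segment under program control" step, the following is a minimal sketch and not the authors' implementation. It assumes the downloaded audio has already been converted to an uncompressed WAV file and that the search engine supplies approximate start and end times in seconds for the target word string. The function name `cut_segment` and its parameters are hypothetical.

```python
import wave


def cut_segment(src_path, dst_path, start_s, end_s):
    """Copy the audio between start_s and end_s (in seconds) to a new WAV file.

    Hypothetical helper: the time offsets are assumed to come from the
    speech-to-text search hit. Returns the number of frames written.
    """
    if start_s < 0 or end_s <= start_s:
        raise ValueError("need 0 <= start_s < end_s")

    with wave.open(src_path, "rb") as src:
        params = src.getparams()
        total = src.getnframes()
        # Convert seconds to frame indices, clamped to the file length.
        first = min(int(round(start_s * params.framerate)), total)
        last = min(int(round(end_s * params.framerate)), total)
        src.setpos(first)
        frames = src.readframes(last - first)

    with wave.open(dst_path, "wb") as dst:
        # Keep the source channel count, sample width, and sample rate.
        dst.setnchannels(params.nchannels)
        dst.setsampwidth(params.sampwidth)
        dst.setframerate(params.framerate)
        dst.writeframes(frames)

    return last - first
```

A call such as `cut_segment("hit.wav", "than_i_did.wav", 12.4, 13.6)` would write the 1.2-second window to a new file (the file names and times here are made up). Because time offsets from a speech recognizer may be imprecise, a real pipeline would probably pad the window on both sides before any acoustic analysis.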

Description
Preliminary version of paper to be presented at Web as Corpus 5, September 2009. Final version will be substituted on July 17, 2009.
Date Issued
2009-07-02T12:50:31Z
Keywords
intonation • focus • web as corpus • machine learning • prosody • comparatives • speech recognition • linguistics
Type
article
