Show simple item record

dc.contributor.authorHowell, Jonathan
dc.contributor.authorRooth, Mats
dc.date.accessioned2009-07-02T12:50:31Z
dc.date.available2009-07-02T12:50:31Z
dc.date.issued2009-07-02T12:50:31Z
dc.identifier.urihttps://hdl.handle.net/1813/13079
dc.descriptionPreliminary version of paper to be presented at Web as Corpus 5, September 2009. Final version will be substituted on July 17, 2009.en_US
dc.description.abstractThis paper describes experiments on gathering spoken-language data on the web that bears on issues of the phonetics-phonology and semantics-pragmatics of intonation. The target data are tokens of fixed word strings like "than I did", where intonation varies in a way which correlates with grammatical and pragmatic context. In a web harvest procedure, audio files were identified using a search engine based in speech-to-text, downloaded, and cut to a relevant segment under program control. In an application of such a database, an SVM classifier was trained to make a grammatically determined distinction in intonation based on purely acoustic cues. Sources of error in the retrieval are quantified.en_US
dc.language.isoenen_US
dc.subjectintonationen_US
dc.subjectfocusen_US
dc.subjectweb as corpusen_US
dc.subjectmachine learningen_US
dc.subjectprosodyen_US
dc.subjectcomparativesen_US
dc.subjectspeech recognitionen_US
dc.subjectlinguisticsen_US
dc.titleWeb Harvest of Minimal Intonational Pairsen_US
dc.typearticleen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record

Statistics