A corpus search methodology for focus realization

Howell, Jonathan; Rooth, Mats

A corpus search methodology for focus realization

dc.contributor.author	Howell, Jonathan
dc.contributor.author	Rooth, Mats
dc.date.accessioned	2009-07-05T07:44:02Z
dc.date.available	2009-07-05T07:44:02Z
dc.date.issued	2009-07-05T07:44:02Z
dc.description	Poster presentation, 157th Meeting of the Acoustical Society of America. Abstract appears in J. Acoust. Soc. Am. Volume 125, Issue 4, pp. 2573-2573.	en_US
dc.description.abstract	We describe a methodology for investigating the semantic-grammatical conditioning and phonetic realization of contrastive intonation using a web harvest of particular word strings followed by grammatical and acoustic analysis. A commercial audio web search engine using speech recognition retrieved 179 MP3 files purportedly containing a token of the string 'than I did.' In this comparative clause fragment, contrastive focus commonly falls on the subject 'she did more than I_F did' , on 'did', 'I wish I had done more than I did_F', or following 'I said more now than I did before_F' . The 96 true tokens of 'than I did' were classified into the categories 'subject', 'did', and 'following' by grammatical and semantic criteria. For each token, 5 segment intervals were hand-annotated and more than 300 acoustic parameters extracted using a Praat script. SVM machine learning classifiers were trained that identify focus classes by acoustic criteria. On a 10-fold crossvalidation test, the classifier achieves 90.2% accuracy in discriminating the dominant 'subject' and 'following' classes. In a listening task, human subjects achieved comparable accuracy of 90.3 given only the acoustic target 'than I did'. Stepwise logistic regression revealed measures of duration, f0, intensity, formants, and formant bandwidths among the significant factors.	en_US
dc.identifier.uri	https://hdl.handle.net/1813/13093
dc.subject	prosody	en_US
dc.subject	focus	en_US
dc.subject	contrastive intonation	en_US
dc.subject	comparative	en_US
dc.subject	phonetics	en_US
dc.subject	support vector machine	en_US
dc.subject	web harvest	en_US
dc.title	A corpus search methodology for focus realization	en_US
dc.type	presentation	en_US

Files

Original bundle

Now showing 1 - 1 of 1

Name:: HowellRooth_ASA_B.pdf
Size:: 516.94 KB
Format:: Adobe Portable Document Format

Download

Collections

Linguistics - Monographs, Papers and Research