INTERSPEECH 2007 Detecting Pitch Accent Using Pitch-corrected Energy-based Predictors (2008)

Abstract
Previous work has shown that the energy components of frequency subbands with a variety of frequencies and bandwidths predict pitch accent with various degrees of accuracy, and produce correct predictions for distinct subsets of data points. In this paper, we describe a series of experiments exploring techniques to leverage the predictive power of these energy components by including pitch and duration features, other known correlates of pitch accent. We perform these experiments on Standard American English read, spontaneous and broadcast news speech, each corpus containing at least four speakers. Using an approach by which we correct energy-based predictions using pitch and duration information prior to using a majority voting classifier, we were able to detect pitch accent in read, spontaneous and broadcast news speech at 84.0%, 88.3% and 88.5% accuracy, respectively. Human performance at pitch accent detection is generally taken to be between 85% and 90%.
Index Terms: prosodic analysis, spectral emphasis
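
The correct-then-vote idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not say how the correction is done, so the sketch assumes that each energy-based prediction is paired with a corrector that uses pitch and duration features to decide whether that prediction should be trusted, and that an untrusted prediction is inverted. The function names, the feature names (`pitch_range_hz`, `duration_s`) and the thresholds are all hypothetical.

```python
from typing import Callable, Dict, Sequence

Features = Dict[str, float]
# Hypothetical corrector: returns True when the paired energy-based
# prediction should be trusted, given pitch and duration features.
Corrector = Callable[[Features], bool]


def majority_vote(votes: Sequence[bool]) -> bool:
    """Return True only when strictly more than half of the votes are True."""
    return sum(votes) * 2 > len(votes)


def detect_accent(
    energy_votes: Sequence[bool],
    correctors: Sequence[Corrector],
    features: Features,
) -> bool:
    """Correct each energy-based vote, then take a majority vote.

    Assumption (not stated in the abstract): an untrusted vote is inverted.
    """
    if len(energy_votes) != len(correctors):
        raise ValueError("need exactly one corrector per energy-based vote")
    corrected = [
        vote if corrector(features) else not vote
        for vote, corrector in zip(energy_votes, correctors)
    ]
    return majority_vote(corrected)


# Toy example with made-up feature values and thresholds.
features = {"pitch_range_hz": 12.0, "duration_s": 0.31}
trust_if_long = lambda f: f["duration_s"] > 0.2
trust_if_wide_pitch = lambda f: f["pitch_range_hz"] > 20.0

energy_votes = [True, False, False]
correctors = [trust_if_long, trust_if_long, trust_if_wide_pitch]
accented = detect_accent(energy_votes, correctors, features)
```

In this toy case the third vote is not trusted and is inverted, so the corrected votes carry a majority for "accented" even though the raw energy-based votes did not.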

Publication details
Download http://citeseerx.ist.psu.edu/viewdoc/summary?doi=?doi=10.1.1.115.8328
Source http://www.cs.columbia.edu/nlp/papers/2007/rosenberg_hirschberg_07a.pdf
Contributors CiteSeerX
Repository CiteSeerX - Scientific Literature Digital Library and Search Engine (United States)
Type text
Language English
Relation 10.1.1.14.9528, 10.1.1.3.3934, 10.1.1.50.5272, 10.1.1.16.949, 10.1.1.64.1829, 10.1.1.12.4438, 10.1.1.12.5732, 10.1.1.28.4028, 10.1.1.10.4516, 10.1.1.72.28, 10.1.1.6.4225