Agustín Gravano, Stefan Benus, Julia Hirschberg, Shira Mitchell, Ilia Vovsha
We present results of a series of machine learning experiments that address the classification of the discourse function of single affirmative cue words such as alright, okay and mm-hm in a spoken...
The Effect of Contour Type and Epistemic Modality on the Assessment of Speaker Certainty (2009)
Agustín Gravano, Stefan Benus, Julia Hirschberg, Elisa Sneed German, Gregory Ward
In an empirical study participants were asked to rate the perceived degree of certainty of utterances that contained either the modal would or main verb be (e.g. That would be me vs. That’s me),...
Julia Hirschberg, Agustín Gravano, Ani Nenkova, Elisa Sneed, Gregory Ward
Intonational contours are overloaded, conveying different meanings in different contexts. In this paper we examine two potential uses of the downstepped contours in Standard American English, in the...
Fadi Biadsy, Julia Hirschberg, Andrew Rosenberg, Wisam Dakka
Charisma, the ability to lead by virtue of personality alone, is difficult to define but relatively easy to identify. However, cultural factors clearly affect perceptions of charisma. In this paper...
TC-Star: Cross-Language Voice Conversion Revisited (2008)
David Sündermann, Harald Höge, Antonio Bonafonte, Hermann Ney, Julia Hirschberg
In the framework of the European speech-to-speech translation project TC-Star, one of the research tasks is cross-language voice conversion. In the recent second evaluation campaign, five...
AT&T at TREC-7 SDR Track (2008)
Amit Singhal, John Choi, Donald Hindle, Julia Hirschberg, O Pereira, Steve Whittaker
AT&T participated in the Spoken Document Retrieval (SDR) track of TREC-7. Our speech retrieval system uses modern Information Retrieval (IR) methods in conjunction with in-house automatic speech...
INTERSPEECH 2007 Prosody, emotions, and … ‘whatever’ (2008)
Stefan Benus, Agustín Gravano, Julia Hirschberg
We examine the role of prosody in cueing a scale of negative meanings associated with the use of whatever. The analysis of a corpus of elicited examples shows that the more negative the token, the...
Frank Enos, Stefan Benus, Robin L. Cautin, Martin Graciarena, Julia Hirschberg, Elizabeth Shriberg
Previous studies of human performance in deception detection have found that humans generally are quite poor at this task, comparing unfavorably even to the performance of automated procedures....
INTERSPEECH 2007 Detecting Pitch Accent Using Pitch-corrected Energy-based Predictors (2008)
Andrew Rosenberg, Julia Hirschberg
Previous work has shown that the energy components of frequency subbands with a variety of frequencies and bandwidths predict pitch accent with various degrees of accuracy, and produce correct...
on Discourse Structure in Natural Language Understanding and Generation, ed. by (2008)
Julia Hirschberg, Diane Litman, Kathy McCoy, Y Sidner
YAROWSKY, DAVID. 1992. Word sense disambiguation using statistical models of Roget’s categories trained on large corpora. In Proceedings of the Fourteenth International Conference on Computational...
Detecting Deception Using Critical Segments (2008)
Frank Enos, Elizabeth Shriberg, Martin Graciarena, Julia Hirschberg, Andreas Stolcke
We present an investigation of segments that map to GLOBAL LIES, that is, the intent to deceive with respect to salient topics of the discourse. We propose that identifying the truth or falsity of...
High frequency word entrainment in spoken dialogue (2008)
Ani Nenkova, Agustín Gravano, Julia Hirschberg
Cognitive theories of dialogue hold that entrainment, the automatic alignment between dialogue partners at many levels of linguistic representation, is key to facilitating both production and...
Now You Hear It, Now You Don't: Empirical Studies Of Audio Browsing Behavior (2007)
Christine H. Nakatani, Steve Whittaker, Julia Hirschberg
We present several studies that investigate how people use audio documents and uncover new principles for designing audio navigation technology. In particular, we report on an ethnographic study of...
Richard W. Sproat, Joseph P. Olive, Julia Hirschberg, Briony Williams
This is a weighty book in more senses than one. At nearly 600 pages (plus an accompanying CD-ROM), it has the space to range very widely over the field of speech synthesis. The chapters comprise a...
"I just played that a minute ago!": Designing User Interfaces for Audio Navigation (2007)
Julia Hirschberg, John Choi, Christine Nakatani, Steve Whittaker
The current popularity of multimodal information retrieval research critically assumes that consumers will be found for the multimodal information thus retrieved and that interfaces can be designed...
Aaron Rosenberg, Julia Hirschberg, Michiel Bacchiani, S Parthasarathy, Philip Isenhour, Larry Stead
SCANMail is a prototype system developed at AT&T Labs for the purpose of providing useful tools for managing and searching through voicemail messages. Content is extracted from voicemail messages...
Julia Hirschberg, Michiel Bacchiani, Don Hindle, Phil Isenhour, Aaron Rosenberg, Litza Stark, ...
Increasing amounts of public, corporate, and private audio data are available for use, but limited in usefulness by the lack of tools to permit their browsing and search. In this paper, we describe...
Pitch accent placement is a major topic in intonational phonology research and its application to speech synthesis. What factors influence whether or not a word is made...
IMPROVING INTONATIONAL PHRASING WITH SYNTACTIC INFORMATION (2007)
Steven Abney, Julia Hirschberg, Michael Collins
The prediction of intonational phrase boundaries from raw text is an important step for a text-to-speech system: Locating where to place short pauses enables more natural sounding speech, that can be...
AT&T at TREC-7 SDR Track (2007)
Amit Singhal, John Choi, Donald Hindle, Julia Hirschberg, O Pereira, Steve Whittaker
AT&T participated in the Spoken Document Retrieval (SDR) track of TREC-7. Our speech retrieval system uses modern Information Retrieval (IR) methods in conjunction with in-house automatic speech...
The prosody of backchannels in American English (2007)
Stefan Benus, Agustín Gravano, Julia Hirschberg
We examine prosodic and contextual factors characterizing the backchannel function of single affirmative words. Data is drawn from collaborative task-oriented dialogues between speakers of Standard...
The role of context and prosody in the interpretation of okay (2007)
Agustín Gravano, Stefan Benus, Héctor Chávez, Julia Hirschberg, Lauren Wilcox
We examine the effect of contextual and acoustic cues in the disambiguation of three discourse-pragmatic functions of the word okay. Results of a perception study show that contextual cues are...
Andrew Rosenberg, Mehrbod Sharifi, Julia Hirschberg
Story segmentation of news broadcasts has been shown to improve the accuracy of the subsequent processes such as question answering and information retrieval. In previous work, a decision tree...
Detecting pitch accent using pitch-corrected energy-based predictors (2007)
Andrew Rosenberg, Julia Hirschberg
Previous work has shown that the energy components of frequency subbands with a variety of frequencies and bandwidths predict pitch accent with various degrees of accuracy, and produce correct...
Detecting question-bearing turns in spoken tutorial dialogues (2006)
Jackson Liscombe, Jennifer J. Venditti, Julia Hirschberg
Current speech-enabled Intelligent Tutoring Systems do not model student question behavior the way human tutors do, despite evidence indicating the importance of doing so. Our study examined a corpus...
Steve Whittaker, Julia Hirschberg
When users access information from text, they engage in strategic fixation, visually scanning the text to focus on regions of interest. However, because speech is both serial and ephemeral, it does...
Combining prosodic, lexical and cepstral systems for deceptive speech detection (2006)
Martin Graciarena, Elizabeth Shriberg, Andreas Stolcke, Frank Enos, Julia Hirschberg, Sachin Kajarekar
We report on machine learning experiments to distinguish deceptive from nondeceptive speech in the Columbia-SRI-Colorado (CSC) corpus. Specifically, we propose a system combination approach using...
Characterizing and predicting corrections in spoken dialogue systems (2006)
Diane Litman, Marc Swerts, Julia Hirschberg
This article focuses on the analysis and prediction of corrections, defined as turns where a user tries to correct a prior error made by a spoken dialogue system. We describe our labeling procedure...
Pauses in deceptive speech (2006)
Stefan Benus, Frank Enos, Julia Hirschberg, Elizabeth Shriberg
We use a corpus of spontaneous interview speech to investigate the relationship between the distributional and prosodic characteristics of silent and filled pauses and the intent of an interviewee to...
Distinguishing Deceptive from Non-Deceptive Speech (2005)
Julia Hirschberg, Stefan Benus, Jason M. Brenier, Frank Enos, Sarah Friedman, Sarah Gilman, ...
To date, studies of deceptive speech have largely been confined to descriptive studies and observations from subjects, researchers, or practitioners, with few empirical studies of the specific...
Do Summaries Help? A Task-Based Evaluation of Multi-Document Summarization (2005)
Kathleen McKeown, Rebecca J. Passonneau, David K. Elson, Ani Nenkova, Julia Hirschberg
We describe a task-based evaluation to determine whether multi-document summaries measurably improve user performance when using online news browsing systems for directed research. We evaluated the...
Sameer Maskey, Julia Hirschberg
We present results of an empirical study of the usefulness of different types of features in selecting extractive summaries of news broadcasts for our Broadcast News Summarization System. We evaluate...
Detecting Certainness in Spoken Tutorial Dialogues (2005)
Jackson Liscombe, Julia Hirschberg, Jennifer J. Venditti
What role does affect play in spoken tutorial systems and is it automatically detectable? We investigated the classification of student certainness in a corpus collected for ITSPOKE, a speech-enabled...
Prediction of upcoming Swedish prosodic boundaries by Swedish and American listeners (2004)
Rolf Carlson, Julia Hirschberg, Marc Swerts
We describe results of a study of perceptually based predictions of upcoming prosodic breaks in spontaneous Swedish speech materials by native speakers of Swedish and of standard American English....
Identifying Agreement and Disagreement in Conversational Speech: Use of Bayesian... (2004)
Michel Galley, Kathleen McKeown, Julia Hirschberg, Elizabeth Shriberg
We describe a statistical approach for modeling agreements and disagreements in conversational interaction. Our approach first identifies adjacency pairs using maximum entropy ranking based on a set...
Classifying subject ratings of emotional speech using acoustic features (2003)
Jackson Liscombe, Jennifer Venditti, Julia Hirschberg
This paper presents results from a study examining emotional speech using acoustic features and their use in automatic machine learning classification. In addition, we propose a classification scheme...
Look or listen: Discovering effective techniques for accessing speech data (2003)
Steve Whittaker, Julia Hirschberg
Commercial interfaces for accessing digital speech data are based on ‘tape recorder’ metaphors. However, such interfaces make it highly laborious to access complex speech data. The absence of...
Scanmail: a voicemail interface that makes speech browsable, readable and searchable (2002)
Steve Whittaker, Julia Hirschberg, Brian Amento, Litza Stark, Michiel Bacchiani, Philip Isenhour, ...
Increasing amounts of public, corporate, and private speech data are now available on-line. These are limited in their usefulness, however, by the lack of tools to permit their browsing and search....
The Pragmatics of Intonational Meaning (2002)
Starting from Carlos Gussenhoven's proposal (Gussenhoven 2002) that intonational meaning be understood in terms of three biological codes, this paper suggests an augmentation of that proposal to...
SCANMail: a voicemail interface that makes speech browsable, readable and searchable (2002)
Steve Whittaker, Julia Hirschberg, Brian Amento, Litza Stark, Michiel Bacchiani, Philip Isenhour, ...
Increasing amounts of public, corporate, and private speech data are now available on-line. These are limited in their usefulness, however, by the lack of tools to permit their browsing and search....
Predicting user reactions to system error (2001)
Diane Litman, Julia Hirschberg
This paper focuses on the analysis and prediction of so-called aware sites, defined as turns where a user of a spoken dialogue system first becomes aware that the system has made a...
The Character, Value, and Management of Personal Paper Archives (2001)
Steve Whittaker, Julia Hirschberg
We explore general issues concerning personal information management by investigating the characteristics of office workers’ paper-based information. We examine the reasons people collect paper,...
The Character, Value, and Management of Personal Paper Archives (2001)
Steve Whittaker, Julia Hirschberg
We explored general issues concerning personal information management by investigating the characteristics of office workers’ paper-based information, in an industrial research environment. We...
SCANMail: Browsing and searching speech data by content (2001)
Julia Hirschberg, Michiel Bacchiani, Don Hindle, Phil Isenhour, Aaron Rosenberg, Litza Stark, ...
Increasing amounts of public, corporate, and private audio data are available for use, but limited in usefulness by the lack of tools to permit their browsing and search. In this paper, we describe...
SCANMail: Audio navigation in the voicemail domain (2000)
Michiel Bacchiani, Julia Hirschberg, Aaron Rosenberg, Steve Whittaker, Donald Hindle, Phil Isenhour, ...
This paper describes SCANMail, a system that allows users to browse and search their voicemail messages by content through a GUI. Content based navigation is realized by use of automatic speech...
Jotmail: a voicemail interface that enables you to see what was said (2000)
Steve Whittaker, Richard Davis, Julia Hirschberg, Urs Muller
Voicemail is a pervasive, but under-researched tool for workplace communication. Despite potential advantages of voicemail over email, current phone-based...
SCANMail: Audio navigation in the voicemail domain (2000)
Michiel Bacchiani, Julia Hirschberg, Aaron Rosenberg, Steve Whittaker, Donald Hindle, Phil Isenhour, ...
Increasing amounts of public, corporate, and private audio present a major challenge to speech, information retrieval, and human-computer interaction research: how can we help people to take...
Generalizing prosodic prediction of speech recognition errors (2000)
Julia Hirschberg, Diane Litman, Marc Swerts
Since users of spoken dialogue systems have difficulty correcting system misconceptions, it is important for automatic speech recognition (ASR) systems to know when their best hypothesis is...
Jotmail: a voicemail interface that enables you to see what was said (2000)
Steve Whittaker, Richard Davis, Julia Hirschberg, Urs Muller
Voicemail is a pervasive, but under-researched tool for workplace communication. Despite potential advantages of voicemail over email, current phone-based voicemail UIs are highly problematic for...
Corrections in spoken dialogue systems (2000)
Marc Swerts, Diane Litman, Julia Hirschberg
This study analyzes user corrections of system errors in the TOOT spoken dialogue system. We find that corrections differ from noncorrections prosodically, in ways consistent with hyperarticulated...
ASR satisficing: the effects of ASR accuracy on speech retrieval (2000)
Litza Stark, Steve Whittaker, Julia Hirschberg
We examine how differences in the accuracy of Automatic Speech Recognition transcripts affect users' ability to use these in tasks requiring the retrieval of speech...
The Rules Behind Roles: Identifying Speaker Role in Radio Broadcasts (2000)
Regina Barzilay, Michael Collins, Julia Hirschberg, Steve Whittaker
Previous work has shown that providing information about story structure is critical for browsing audio broadcasts. We investigate the hypothesis that Speaker Role is an important cue to story...
Modeling Local Context for Pitch Accent Prediction (2000)
Pitch accent placement is a major topic in intonational phonology research and its application to speech synthesis. What factors influence whether or not a word is made intonationally prominent or...
Predicting Automatic Speech Recognition Performance Using Prosodic Cues (2000)
Julia Hirschberg, Diane Litman, Marc Swerts
In spoken dialogue systems, it is important for a system to know how likely a speech recognition hypothesis is to be correct, so it can reprompt for fresh input, or, in cases where many errors have...
Julia Hirschberg, Marilyn A, Irene Langkilde, Jerry Wright, Allen Gorin, ...
man, Kim, Ashok Kalyanswamy, Julie Silverman, Sara Basson, and Dina Yashchin. 1993. Synthesiser intelligibility in the context of a name-and-address information service. In Proceedings of...
Finding information in audio: A new paradigm for audio browsing/retrieval (1999)
Julia Hirschberg, Steve Whittaker, Don Hindle, O Pereira, Amit Singhal
Information retrieval from audio data is sharply different from information retrieval from text, not simply because speech recognition errors affect retrieval effectiveness, but more fundamentally...
Spoken content-based audio navigation (SCAN) (1999)
John Choi, Donald Hindle, Julia Hirschberg, O Pereira, Amit Singhal, Steve Whittaker
We describe SCAN, a system for retrieving and browsing speech documents from large audio corpora that uses new information retrieval and speech processing techniques to create easily navigable...
Finding Information In Audio: A New Paradigm For Audio Browsing And Retrieval (1999)
Julia Hirschberg, Steve Whittaker, Don Hindle, Fernando Pereira, O Pereira, Amit Singhal
Information retrieval from audio data is sharply different from information retrieval from text, not simply because speech recognition errors affect retrieval effectiveness, but more fundamentally...
SCAN: Designing and evaluating user interfaces to support retrieval from speech archives (1999)
Steve Whittaker, Julia Hirschberg, John Choi, Don Hindle, O Pereira, Amit Singhal
Previous examinations of search in textual archives have assumed that users first retrieve a ranked set of documents relevant to their query, and then visually scan through these documents, to...
Prosodic Cues to Recognition Errors (1999)
Julia Hirschberg, Diane Litman, Marc Swerts
We identify methods of distinguishing between correctly and incorrectly recognized utterances (scored by hand for semantic concept accuracy) for a speech recognition system, using acoustic/prosodic...
SCAN: Designing and evaluating user interfaces to support retrieval from speech archives (1999)
Steve Whittaker, Julia Hirschberg, John Choi, Don Hindle, Fernando Pereira, O Pereira, ...
Previous examinations of search in textual archives have assumed that users first retrieve a ranked set of documents relevant to their query, and then visually scan through these documents, to...
SCAN - speech content based audio navigator: a systems overview (1998)
John Choi, Don Hindle, Julia Hirschberg, Ivan Magrin-Chagnolleau, Christine Nakatani, O Pereira, ...
SCAN (Speech Content based Audio Navigator) is a spoken document retrieval system integrating speaker-independent, large-vocabulary speech recognition with information-retrieval to support...
An overview of the AT&T spoken document retrieval (1998)
John Choi, Don Hindle, Julia Hirschberg, Ivan Magrin-Chagnolleau, Christine Nakatani, O Pereira, ...
We present an overview of a spoken document retrieval system developed at AT&T Labs-Research for the HUB4 Broadcast News corpus. This overview includes a description of the intonational phrase...
All talk and all action: strategies for managing voicemail messages (1998)
Steve Whittaker, Julia Hirschberg, Christine H. Nakatani
Voicemail is a pervasive technology, but we know little about how users manage voice messages in executing everyday work. We analyze server logs, user surveys and interviews to identify three...
Play it again: A study of the factors underlying speech browsing behavior (1998)
Steve Whittaker, Julia Hirschberg, Christine H. Nakatani
Several recent UIs support access to recorded speech archives, but these have not yet been systematically evaluated. We describe a laboratory study of speech archive browsing using a GUI. We evaluate...
SCAN - Speech Content Based Audio Navigator: A Systems Overview (1998)
John Choi, Don Hindle, Julia Hirschberg, Ivan Magrin-Chagnolleau, Christine Nakatani, ...
SCAN (Speech Content based Audio Navigator) is a spoken document retrieval system integrating speaker-independent, large-vocabulary speech recognition with information-retrieval to support...
Steve Whittaker, John Choi, Julia Hirschberg, Christine H. Nakatani
Despite the recent growth and potential utility of speech archives, we currently lack tools for effective archival access. Previous research on search of textual archives has assumed that the system...
All Talk and All Action: Strategies for Managing Voicemail Messages (1998)
Steve Whittaker, Julia Hirschberg, Christine H. Nakatani
Voicemail is a pervasive technology, but we know little about how users manage voice messages in executing everyday work. We analyze server logs, user surveys and interviews to identify three...
Play It Again: A Study of the Factors Underlying Speech Browsing Behavior (1998)
Steve Whittaker, Julia Hirschberg, Christine H. Nakatani
Several recent UIs support access to recorded speech archives, but these have not yet been systematically evaluated. We describe a laboratory study of speech archive browsing using a GUI. We evaluate...
An Overview Of The AT&T Spoken Document Retrieval (1998)
John Choi, Don Hindle, Julia Hirschberg, Ivan Magrin-Chagnolleau, Christine Nakatani, Fernando Pereira, ...
We present an overview of a spoken document retrieval system developed at AT&T Labs-Research for the HUB4 Broadcast News corpus. This overview includes a description of the intonational phrase...
SCAN - Speech Content Based Audio Navigator: A Systems Overview (1998)
John Choi, Don Hindle, Julia Hirschberg, Ivan Magrin-Chagnolleau, Christine Nakatani, Fernando Pereira, ...
SCAN (Speech Content based Audio Navigator) is a spoken document retrieval system integrating speaker-independent, large-vocabulary speech recognition with information-retrieval to support...
The Role Of Prosody In Disambiguating Potentially Ambiguous Utterances In English And Italian (1997)
Julia Hirschberg, Cinzia Avesani
We investigate the role that intonation plays in disambiguating potentially ambiguous utterances in English and Italian, to see a) whether speakers employ intonational means to disambiguate these...
AT&T at TREC-7 SDR Track (1997)
Amit Singhal, John Choi, Donald Hindle, Julia Hirschberg, O Pereira, Steve Whittaker
AT&T participated in the Spoken Document Retrieval (SDR) track of TREC-7. Our speech retrieval system uses modern Information Retrieval (IR) methods in conjunction with in-house automatic speech...
A semantics for λ {} str : a calculus with overloading and late-binding (1996)
Steven Bird, Jerry L. Morgan, Iway Fong, Jennifer Cole, John Coleman, Alan M. Frisch, ...
"We... recommend this book to anyone interested in computational phonology. "--Computational Linguistics Computational phonology is one of the newest areas of computational...
A prosodic analysis of discourse segments in direction-giving monologues (1996)
This paper reports on corpus-based research into the relationship between intonational variation and discourse structure. We examine the effects of speaking style (read versus spontaneous) and of...
Christine H. Nakatani, Barbara J. Grosz, David D. Ahn, Julia Hirschberg, ...
This guide contains instructions for segmenting discourse that are meant to be self-contained. Although based on a particular theory of discourse structure, they do not make reference in any way to...
A current problem for text-to-speech systems is their failure to produce natural-sounding intonational variation. This memo documents the use of escape sequences to control intonational variation in...
Pitch Accent in Context: Predicting Intonational Prominence from Text (1995)
Explaining speakers' choice of which items to emphasize or de-emphasize intonationally has been an important topic in theoretical linguistics, as well as in applications such as speech...
Discourse Structure in Spoken Language: Studies on Speech Corpora (1995)
Christine H. Nakatani, Julia Hirschberg, Barbara J. Grosz
A better understanding of the intonational characteristics of spoken discourse may lead to new empirical techniques for identifying discourse structure from speech, as well as new algorithms for...
Tonal Alignment Patterns. Journal of Phonetics 23, 429--451 (1995)
Pilar Prieto, Jan Van Santen, Julia Hirschberg
this article show that the absolute rise time is different in almost all the conditions investigated. In fact, since the start of the rise For English, see Nakatani & Schaffer (1978) and...
Some Bibliographical References on Intonation and Intonational Meaning (1994)
A by-no-means-complete collection of references for those interested in intonational meaning, with other miscellaneous references on intonation included. Additional references are welcome, and should...
Empirical studies on the disambiguation of cue phrases (1993)
Julia Hirschberg, Diane Litman
Cue phrases are linguistic expressions such as now and well that function as explicit indicators of the structure of a discourse. For example, now may signal the beginning of a subtopic or a return...
Empirical Studies on the Disambiguation of Cue Phrases (1993)
Julia Hirschberg, Diane Litman
Cue phrases are linguistic expressions such as now and well that function as explicit indicators of the structure of a discourse. For example, now may signal the beginning of a subtopic or a return...
Some Intonational Characteristics Of Discourse Structure (1992)
Barbara Grosz, Julia Hirschberg
This paper reports on a study of the relationship between acoustic-prosodic variation and discourse structure, as determined from an independent model of discourse. We present results of two pilot...
Some Intonational Characteristics Of Discourse Structure (1992)
Barbara Grosz, Julia Hirschberg
This paper reports on a study of the relationship between acoustic-prosodic variation and discourse structure, as determined from an independent model of discourse. We present results of two pilot...
A Theory of Focus Interpretation (1992)
Mats Rooth, Angelika Kratzer, Manfred Krifka, Irene Heim, Julia Hirschberg, Sjak De Mey
According to the alternative semantics for focus, the semantic reflex of intonational focus is a second semantic value, which in the case of a sentence is a set of propositions. We examine a range of...
Automatic Classification of Intonational Phrase Boundaries (1992)
Michelle Q. Wang, Julia Hirschberg
The relationship between the intonational characteristics of an utterance and other features inferable from its text represents an important source of information both for speech recognition, to...
Robert Kasper, Julia Hirschberg, AT&T Bell Laboratories
This volume contains papers prepared for the Twenty-Seventh Annual Meeting of the Association for
Assigning Intonational Features in Synthesized Spoken Directions (1988)
James Raymond Davis, Julia Hirschberg
Speakers convey much of the information hearers use to interpret discourse by varying prosodic features such as phrasing, pitch accent placement, tune, and pitch range. The ability to emulate such...
A Corpus-based study of repair cues in spontaneous speech
Christine H. Nakatani, Julia Hirschberg
In this paper, acoustic and prosodic cues to such repairs are identified, based on an analysis of a corpus taken from the ARPA Air Travel Information System database, and methods are proposed for...