Jordi Turmo, Pere Comas, Sophie Rosset, Lori Lamel, Nicolas Moreau, Djamel Mostefa
This paper describes the experience of QAST 2008, the second time a pilot track of CLEF has been held aiming to evaluate the task of Question Answering in Speech Transcripts. Five sites submitted...
Lori Lamel, Jean-luc Gauvain, Gilles Adda, Claude Barras, Eric Bilinski, Olivier Galibert, ...
This paper describes the speech recognizers evaluated in the TC-STAR Second Evaluation Campaign held in January-February 2006. Systems were developed to transcribe parliamentary speeches in English...
The LIMSI SDR System for TREC-9 (2008)
Jean-luc Gauvain, Lori Lamel, Claude Barras, Gilles Adda, Yannick De Kercardio
In this paper we describe the LIMSI Spoken Document Retrieval system used in the TREC-9 evaluation. This system combines an adapted version of the LIMSI 1999 Hub-4E transcription system for speech...
Jerry Goldman, Steve Renals, Steven Bird, Franciska Jong, Marcello Federico, Carl Fleischhauer, ...
The date of receipt and acceptance will be inserted by the editor Abstract. Spoken word audio collections cover many domains, including radio and television broadcasts, oral narratives, governmental...
This paper provides an overview of the state-of-the-art in laboratory speaker-independent, large vocabulary continuous speech recognition (LVCSR) systems with a view towards adapting such technology...
Some Issues in Speech Recognizer Portability (2007)
Speech recognition technology has greatly evolved over the last decade. However, one of the remaining challenges is reducing the development cost. Most recognition systems are tuned to a particular...
Processing Broadcast Audio for Information Access (2007)
Jean-luc Gauvain, Lori Lamel, Gilles Adda, Martine Adda-decker, Claude Barras, Langzhou Chen, ...
This paper addresses recent progress in speaker-independent, large vocabulary, continuous speech recognition, which has opened up a wide range of near and mid-term applications. One rapidly expanding...
USING INFORMATION RETRIEVAL METHODS FOR LANGUAGE MODEL ADAPTATION (2007)
Langzhou Chen, Jean-luc Gauvain, Lori Lamel, Gilles Adda, Martine Adda
In this paper we report experiments on language model adaptation using information retrieval methods, drawing upon recent developments in information extraction and topic tracking. One of the...
Processing Broadcast Audio for Information Access (2007)
Jean-luc Gauvain, Lori Lamel, Gilles Adda, Martine Adda-decker, Claude Barras, Langzhou Chen, ...
This paper addresses recent progress in speaker-independent, large vocabulary, continuous speech recognition, which has opened up a wide range of near and mid-term applications. One rapidly expanding...
The LIMSI SDR System for TREC-9 (2007)
Jean-luc Gauvain, Lori Lamel, Claude Barras, Gilles Adda, Yannick De Kercardio
In this paper we describe the LIMSI Spoken Document Retrieval system used in the TREC-9 evaluation. This system combines an adapted version of the LIMSI 1999 Hub-4E transcription system for speech...
An Overview of Speech Recognition Activities at LIMSI (2007)
Jean-luc Gauvain, Gilles Adda, Martine Adda-decker, Claude Barras, Langzhou Chen, Michele Jardino, ...
This paper provides an overview of recent activities at LIMSI in multilingual speech recognition and its applications. The main goal of speech recognition is to provide a transcription of the speech...
Dragon Systems Resource Management Benchmark Results -February 1991 (2007)
Baker, James, Baker, Janet, Bamberg, Paul, Gillick, Larry, Lamel, Lori, Roth, Robert, ...
In this paper, we present preliminary results obtained at Dragon Systems on the Resource Management benchmark task. The basic conceptual units of our system are Phonemes-in-context (PICs), which are...
Accessing the spoken word (2005)
Goldman, Jerry, Renals, Steve, Bird, Steven, De Jong, Franciska, Federico, Marcello, Fleischhauer, Carl, ...
Spoken word audio collections cover many domains, including radio and television broadcasts, oral narratives, governmental proceedings, lectures, and telephone conversations. The collection, access...
Transforming Access to the Spoken Word (2005)
Goldman, Jerry, Renals, Steve, Bird, Steven, De Jong, Franciska, Federico, Marcello, Fleischhauer, Carl, ...
Spoken word audio collections cover many domains,including radio and television broadcasts, oral narratives,governmental proceedings, lectures, and telephone conversations.The collection, access and...
Accessing the spoken word (2005)
Goldman, Jerry, Renals, Steve, Bird, Steven, De Jong, Franciska, Federico, Marcello, Fleischhauer, Carl, ...
Spoken-word audio collections cover many domains, including radio and television broadcasts, oral narratives, governmental proceedings, lectures, and telephone conversations. The collection, access,...
Accessing the spoken word (2005)
Goldman, Jerry, Renals, Steve, Bird, Steven, De Jong, Franciska, Federico, Marcello, Fleischhauer, Carl, ...
Spoken-word audio collections cover many domains, including radio and television broadcasts, oral narratives, governmental proceedings, lectures, and telephone conversations. The collection, access,...
Accessing the spoken word (2005)
Goldman, Jerry, Renals, Steve, Bird, Steven, De Jong, Franciska, Federico, Marcello, Fleischhauer, Carl, ...
Spoken word audio collections cover many domains, including radio and television broadcasts, oral narratives, governmental proceedings, lectures, and telephone conversations. The collection, access...
Accessing the Spoken Word (2005)
Jerry Goldman, Steve Renals, Steven Bird, Franciska Jong, Mark Kornbluh, ...
Spoken word audio collections cover many domains, including radio and television broadcasts, oral narratives, governmental proceedings, lectures, and telephone conversations. The collection, access...
Automatic detection of dialog acts based on multi-level information (2004)
Recently there has been growing interest in using dialog acts to characterize human-human and human-machine dialogs. This paper reports on our experience in the annotation and the automatic detection...
Structuring Broadcast Audio for Information Access (2003)
One rapidly expanding application area for state-of-the-art speech recognition technology is the automatic processing of broadcast audiovisual data for information access. Since much of the...
Unsupervised language model adaptation for broadcast news (2003)
Langzhou Chen, Jean-luc Gauvain, Lori Lamel, Gilles Adda
Unsupervised language model adaptation for speech recognition is challenging, particularly for complicated tasks such the transcription of broadcast news (BN) data. This paper presents an...
Emotion Detection In Task-Oriented Spoken Dialogs (2003)
Laurence Devillers Lori, Lori Lamel
Detecting emotions in the context of automated call center services can be helpful for following the evolution of the human-computer dialogs, enabling dynamic modification of the dialog strategies...
Structuring Broadcast Audio for Information Access (2003)
One rapidly expanding application area for state-of-the-art speech recognition technology is the automatic processing of broadcast audiovisual data for information access. Since much of the...
Structuring Broadcast Audio for Information Access (2003)
One rapidly expanding application area for state-of-the-art speech recognition technology is the automatic processing of broadcast audiovisual data for information access. Since much of the...
Annotation and Detection of Emotion in a Task-oriented Human-Human Dialog Corpus (2002)
Laurence Devillers, Ioana Vasilescu, Lori Lamel
Automatic emotion detection is potentially important for customer care in the context of call center services. A major difficulty is that the expression of emotion is quite complex in the context of...
Automatic processing of broadcast audio in multiple languages (2002)
This paper addresses recent progress in LVCSR in multiple languages which has enabled the processing of broadcast audio for information access. At LIMSI, broadcast news transcription systems have...
Unsupervised acoustic model training (2002)
Lori Lamel, Jean-luc Gauvain, Gilles Adda
This paper describes some recent experiments using unsupervised techniques for acoustic model training in order to reduce the system development cost. The approach uses a speech recognizer to...
The LIMSI Broadcast News Transcription System (2002)
Jean-luc Gauvain, Lori Lamel, Gilles Adda
language modeling, lexical modeling This paper reports on activites at LIMSI over the last few years directed at the transcription of broadcast news data. We describe our development work in moving...
Transcribing Audio-Video Archives (2002)
Claude Barras, Alexandre Allauzen, Lori Lamel, Jean-luc Gauvain
This paper addresses the automatic transcription of audiovideo archives using a state-of-the-art broadcast news speech transcription system. A 9-hour corpus spanning the latter half of the 20th...
Lightly supervised and unsupervised acoustic model training (2002)
Lori Lamel, Jean-luc Gauvain, Gilles Adda
The last decade has witnessed substantial progress in speech recognition technology, with todays state-of-the-art systems being able to transcribe unrestricted broadcast news audio data with a word...
Investigating Syllabic Structure And Its Variation In Speech (2002)
From French Radio, Martine Adda-decker, Gilles Adda, Lori Lamel
In this paper, we investigate syllabic structure and its variation in a corpus of French radio interview speech. The aim of this study is to relate sequential pronunciation variants, i.e. variants...
Multi-layer Dialogue Annotation for Automated Multilingual (2002)
Customer Service Hilda, Hilda Hardy, Kirk Baker, Laurence Devillers, Lori Lamel, Sophie Rosset, ...
For the AMITIS multilingual human-computer dialogue project [1], we have developed new methods for the manual annotation of spoken dialogue transcriptions from European financial call centers on...
Portability issues for speech recognition technologies (2001)
Lori Lamel, Fabrice Lefevre, Jean-luc Gauvain, Gilles Adda
Although there has been regular improvement in speech recognition technology over the past decade, speech recognition is far from being a solved problem. Most recognition systems are tuned to a...
Genericity and Adaptability Issues for Task-Independent Speech Recognition (2001)
Fabrice Lefevre, Jean-luc Gauvain, Lori Lamel
The last decade has witnessed major advances in core speech recognition technology, with today's systems able to recognize continuous speech from many speakers without the need for an explicit...
Towards task-independent speech recognition (2001)
Fabrice Lefevre, Jean-luc Gauvain, Lori Lamel
Despite the considerable progress made in the last decade, speech recognition is far from a solved problem. For instance, porting a recognition system to a new task (or language) still requires...
Lori Lamel, Jean-luc Gauvain, Gilles Adda
The last decade has witnessed substantial progress in speech recognition technology, with todays state-of-the-art systems being able to transcribe broadcast audio data with a word error of about 20%....
Improving Genericity for Task-Independent Speech Recognition (2001)
Fabrice Lefevre, Jean-luc Gauvain, Lori Lamel
Although there have been regular improvements in speech recognition technology over the past decade, speech recognition is far from being a solved problem. Recognition systems are usually tuned to a...
Language model adaptation for broadcast news transcription (2001)
Langzhou Chen, Jean-luc Gauvain, Lori Lamel, Gilles Adda, Martine Adda
This paper reports on languagemodel adaptation for the broadcast news transcription task. Language model adaptation for this task is challenging in that the subject of any particular show or portion...
Automatic transcription of compressed broadcast audio (2001)
Claude Barras, Lori Lamel, Jean-luc Gauvain
With increasing volumes of audio and video data broadcast over the web, it is of interest to assess the performance of state-of-theart automatic transcription systems on compressed audio data for...
Considerations in the design and evaluation of spoken language dialog systems (2000)
Lori Lamel, Sophie Rosset, Jean-luc Gauvain
In this paper we summarize our experience at LIMSI in the design, development and evaluation of spoken language dialog systems for information retrieval tasks. This work has been for the most part...
Broadcast News Transcription in Mandarin (2000)
Langzhou Chen, Lori Lamel, Gilles Adda, Jean-luc Gauvain
In this paper, our work in developing a Mandarin broadcast news transcription system is described. The main focus of this work is a port of the LIMSI American English broadcast news transcription...
Fast Decoding for Indexation of Broadcast Data (2000)
Processing time is an important factor in making a speech transcription system viable for automatic indexation of radio and television broadcasts. When only concerned by the word error rate, it is...
Martine Adda-decker, Gilles Adda, Lori Lamel
In this paper we describe our ongoing work concerning lexical modeling in the LIMSI broadcast transcription system for German. Lexical decomposition is investigated with a twofold goal: lexical...
Lightly Supervised Acoustic Model Training (2000)
Lori Lamel, Jean-luc Gauvain, Gilles Adda
Although tremendous progress has been made in speech recognition technology, with the capability of todays state-of-the-art systems to transcribe unrestricted continuous speech from broadcast data,...
The LIMSI SDR system for TREC-8 (1999)
Jean-luc Gauvain, Yannick De Kercadio, Lori Lamel, Gilles Adda
The LIMSI SDR system for TREC-8 (1999)
Jean-luc Gauvain, Yannick De Kercadio, Lori Lamel, Gilles Adda
In this paper we report on our TREC-8 SDR system, which combines an adapted version of the LIMSI 1998 Hub-4E transcription system for speech recognition with an IR system based on the Okapi term...
The LIMSI 1998 HUB-4e transcription system (1999)
Jean-luc Gauvain, Lori Lamel, Gilles Adda, Michèle Jardino
In this paper we report on our Nov98 Hub-4E system, which is an extension of our Nov97 system[4]. The LIMSI system for the November 1998 Hub-4E evaluation is a continuous mixture density, tied-state...
Recent Advances in Transcribing Television and Radio Broadcasts (1999)
Jean-luc Gauvain, Lori Lamel, Gilles Adda, Michele Jardino
Transcription of broadcast news shows (radio and television) is a major step in developing automatic tools for indexation and retrieval of the vast amounts of information generated on a daily basis....
The LIMSI 1998 HUB-4e transcription system (1999)
Jean-luc Gauvain, Lori Lamel, Gilles Adda, Michele Jardino
In this paper we report on our Nov98 Hub-4E system, which is an extension of our Nov97 system[4]. The LIMSI system for the November 1998 Hub-4E evaluation is a continuous mixture density, tied-state...
The LIMSI SDR system for TREC-8 (1999)
Jean-luc Gauvain, Yannick De Kercadio, Lori Lamel, Gilles Adda
In this paper we report on our TREC-8 SDR system, which combines an adapted version of the LIMSI 1998 Hub-4E transcription system for speech recognition with an IR system based on the Okapi term...
The LIMSI 1998 Hub-4E Transcription System (1999)
Lori Lamel, Gilles Adda, Michele Jardino
In this paper we report on our Nov98 Hub-4E system, which is an extension of our Nov97 system[4]. The LIMSI system for the November 1998 Hub-4E evaluation is a continuous mixture density, tied-state...
Partitioning and Transcription of Broadcast News Data (1998)
Jean-luc Gauvain, Lori Lamel, Gilles Adda
Radio and television broadcasts consist of a continuous stream of data comprised of segments of different linguistic and acoustic natures, which poses challenges for transcription. In this paper we...
The LIMSI 1997 Hub-4E Transcription System (1998)
Jean-luc Gauvain, Lori Lamel, Gilles Adda
In this paper we report on the LIMSI system used in the Nov'97 Hub-4E benchmark test on transcription of American English broadcast news shows. There are two main differences from the LIMSI...
An Overview of EU Programs Related to Conversational/Interactive Systems (1998)
Since 1984, the European Commission has supported projects related to spoken language processing within various programs (particularly ESPRIT, ESPRIT-BRA and TELEMATICSLANGUAGE ENGINEERING). The...
Spoken Language Dialog System Development and Evaluation at LIMSI (1998)
The development of natural spoken language dialog systems requires expertise in multiple domains, including speech recognition, natural spoken language understanding and generation, dialog managment...
The LIMSI 1997 Hub-4E Transcription System (1998)
Jean-luc Gauvain, Lori Lamel, Gilles Adda
In this paper we report on the LIMSI system used in the Nov'97 Hub-4E benchmark test on transcription of American English broadcast news shows. There are two main differences from the LIMSI...
An Overview of EU Programs Related to Conversational/Interactive Systems (1998)
Since 1984, the European Commission has supported projects related to spoken language processing within various programs (particularly ESPRIT, ESPRIT-BRA and TELEMATICSLANGUAGE ENGINEERING). The...
Pronunciation Variants across Systems, Languages and Speaking Style (1998)
Martine Adda-decker, Lori Lamel
This contribution aims at evaluating the use of pronunciation variants across different system configurations, languages and speaking styles. This study is limited to the use of variants during...
Text normalization and speech recognition in French (1997)
Gilles Adda, Martine Adda-decker, Jean-luc Gauvain, Lori Lamel
In this paper we present a quantitative investigation into the impact of text normalization on lexica and language models for speech recognition in French. The text normalization process defines what...
Transcription Of Broadcast News (1997)
Jean-luc Gauvain, Lori Lamel, Gilles Adda, Martine Adda-decker
In this paper we report on our recent work in transcribing broadcast news shows. Radio and television broadcasts contain signal segments of various linguistic and acoustic natures. The shows contain...
Speaker Recognition With The Switchboard Corpus (1997)
In this paper we present our development work carried out in preparation for the March'96 speaker recognition test on the Switchboard corpus organized by NIST. The speaker verification system...
Speech Recognition of European Languages (1995)
A basic overview is presented of the main ongoing efforts in large vocabulary, continuous speech recognition (LVCSR) for European languages. We address issues in acoustic modeling, lexical...
Recent Activities in Spoken Language Processing at LIMSI (1992)
: This paper summarizes recent activities at LIMSI in multilingual speech recognition and its applications. While the main goal of speech recognition is to provide a transcription of the speech...
The LIMSI SDR System for TREC-9
Lori Lamel, Claude Barras, Gilles Adda, Yannick De Kercardio
In this paper we describe the LIMSI Spoken Document Retrieval system used in the TREC-9 evaluation. This system combines an adapted version of the LIMSI 1999 Hub-4E transcription system for speech...
On Designing Pronunciation Lexicons for Large Vocabulary, Continuous Speech Recognition
Creation of pronunciation lexicons for speech recognition is widely acknowledgedto be an important, but labor-intensive, aspect of system development. Lexicons are often manually created and make use...