RECOGNITION AND UNDERSTANDING OF MEETINGS THE AMI AND AMIDA PROJECTS (2008)
Steve Renals, Thomas Hain, Hervé Bourlard
The AMI and AMIDA projects are concerned with the recognition and interpretation of multiparty meetings. Within these projects we have: developed an infrastructure for recording meetings using...
Audio-Visual Processing in Meetings: Seven Questions and Some AMI Answers (2008)
Marc Al-hames, Thomas Hain, Jan Cernocky, Sascha Schreiber, Mannes Poel, Ronald Müller, ...
Abstract. The project Augmented Multi-party Interaction (AMI) is concerned with the development of meeting browsers and remote meeting assistants for instrumented meeting rooms – and the required...
Audio-Visual Processing in Meetings: Seven Questions and Current AMI Answers (2008)
Marc Al-hames, Thomas Hain, Jan Cernocky, Sascha Schreiber, Mannes Poel, Ronald Müller, ...
Abstract. The project Augmented Multi-party Interaction (AMI) is concerned with the development of meeting browsers and remote meeting assistants for instrumented meeting rooms – and the required...
Juicer: A Weighted Finite-State Transducer speech decoder (2008)
Darren Moore, John Dines, Mathew Magimai Doss, Jithendra Vepa, Octavian Cheng, Thomas Hain
Abstract. A major component in the development of any speech recognition system is the decoder. As task complexities and, consequently, system complexities have continued to increase the decoding...
Audio-Visual Processing in Meetings: Seven Questions and Current AMI Answers (2008)
Marc Al-hames, Thomas Hain, Jan Cernocky, Sascha Schreiber, Mannes Poel, Ronald Müller, ...
Abstract. The project Augmented Multi-party Interaction (AMI) is concerned with the development of meeting browsers and remote meeting assistants for instrumented meeting rooms – and the required...
The AMI System for the Transcription of Speech in Meetings (2007)
Hain, Thomas, Burget, Lukas, Dines, John, Garau, Giulia, Wan, Vincent, Karafiat, Martin, ...
This paper describes the AMI transcription system for speech in meetings developed in collaboration by five research groups. The system includes generic techniques such as discriminative and speaker...
Recognition and interpretation of meetings: The AMI and AMIDA projects (2007)
Renals, Steve, Hain, Thomas, Bourlard, Herve
The AMI and AMIDA projects are concerned with the recognition and interpretation of multiparty meetings. Within these projects we have: developed an infrastructure for recording meetings using...
The AMI system for the transcription of speech in meetings (2007)
Thomas Hain, Lukas Burget, John Dines, Giulia Garau, Vincent Wan, Martin Karafiat, ...
This paper describes the AMI transcription system for speech in meetings developed in collaboration by five research groups. The system includes generic techniques such as discriminative and speaker...
Recognition and Understanding of Meetings The AMI and AMIDA Projects (2007)
Steve Renals, Thomas Hain, Hervé Bourlard, Steve Renals, Thomas Hain, Hervé Bourlard
to appear in Proc. of the IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU’07
Thomas Hain, Giulia Garau, Mike Lincoln, John Dines, Darren Moore, ...
• Posterior based features
Juicer: A Weighted Finite-State Transducer speech decoder (2006)
Moore, Darren, Dines, John, Doss, Mathew Magimai, Vepa, Jithendra, Cheng, Octavian, Hain, Thomas
A major component in the development of any speech recognition system is the decoder. As task complexities and, consequently, system complexities have continued to increase the decoding problem has...
Juicer: A Weighted Finite-State Transducer speech decoder (2006)
Moore, Darren, Dines, John, Doss, Mathew Magimai, Vepa, Jithendra, Cheng, Octavian, Hain, Thomas
A major component in the development of any speech recognition system is the decoder. As task complexities and, consequently, system complexities have continued to increase the decoding problem has...
The segmentation of multi-channel meeting recordings for automatic speech recognition (2006)
Dines, John, Vepa, Jithendra, Hain, Thomas
One major research challenge in the domain of the analysis of meeting room data is the automatic transcription of what is spoken during meetings, a task which has gained considerable attention within...
Applying Vocal Tract Length Normalization to Meeting Recordings (2005)
Garau, Giulia, Renals, Steve, Hain, Thomas
Vocal Tract Length Normalisation (VTLN) is a commonly used technique to normalise for inter-speaker variability. It is based on the speaker-specific warping of the frequency axis, parameterised by a...
Transcription of conference room meetings: an investigation (2005)
Hain, Thomas, Dines, John, Garau, Giulia, Karafiat, Martin, Moore, Darren, Wan, Vincent, ...
The automatic processing of speech collected in conference style meetings has attracted considerable interest with several large scale projects devoted to this area. In this paper we explore the use...
Applying Vocal Tract Length Normalization to Meeting Recordings (2005)
Garau, Giulia, Renals, Steve, Hain, Thomas
Vocal Tract Length Normalisation (VTLN) is a commonly used technique to normalise for inter-speaker variability. It is based on the speaker-specific warping of the frequency axis, parameterised by a...
Transcription of conference room meetings: an investigation (2005)
Hain, Thomas, Dines, John, Garau, Giulia, Karafiat, Martin, Moore, Darren, Wan, Vincent, ...
The automatic processing of speech collected in conference style meetings has attracted considerable interest with several large scale projects devoted to this area. In this paper we explore the use...
The AMI Meeting Corpus: a Pre-Announcement (2005)
Carletta, Jean, Ashby, Simone, Bourban, Sebastien, Flynn, Mike, Guillemot, Mael, Hain, Thomas, ...
The AMI Meeting Corpus is a multi-modal data set consisting of 100 hours of meeting recordings. It is being created in the context of a project that is developing meeting browsing technology and will...
The ami meeting corpus: a pre-announcement (2005)
Carletta, Jean, Ashby, Simone, Bourban, Sebastien, Flynn, Mike, Guillemot, Mael, Hain, Thomas, ...
The AMI Meeting Corpus is a multi-modal data set consisting of 100 hours of meeting recordings. It is being created in the context of a project that is developing meeting browsing technology and will...
The AMI meeting corpus: A pre-announcement (2005)
Jean Carletta, Simone Ashby, Sebastien Bourban, Mike Flynn, Thomas Hain, Jaroslav Kadlec, ...
Abstract. The AMI Meeting Corpus is a multi-modal data set consisting of 100 hours of meeting recordings. It is being created in the context of a project that is developing meeting browsing...
The 2005 AMI system for the transcription of speech in meetings (2005)
Thomas Hain, Lukas Burget, John Dines, Giulia Garau, Martin Karafiat, Mike Lincoln, ...
Abstract. In this paper we describe the 2005 AMI system for the transcription of speech in meetings used in the 2005 NIST RT evaluations. The system was designed for participation in the speech to...
The 2005 AMI system for the transcription of speech (2005)
Thomas Hain, Lukas Burget, John Dines, Iain Mccowan, Mike Lincoln, Darren Moore, ...
Abstract. The automatic processing of speech collected in conference style meetings has attracted considerable interest with several large scale projects devoted to this area. This paper describes...
The 2005 AMI system for the transcription of speech in meetings (2005)
Thomas Hain, Lukas Burget, John Dines, Giulia Garau, Martin Karafiat, Mike Lincoln, ...
Abstract. In this paper we describe the 2005 AMI system for the transcription of speech in meetings used for participation in the 2005 NIST RT evaluations. The system was designed for participation...
Automatic Transcription of Conversational Telephone Speech (2003)
Thomas Hain, Phil Woodl, Gunnar Evermann, Mark Gales, Andrew Liu, Gareth Moore, ...
This paper discusses the Cambridge University HTK (CU-HTK) system for the automatic transcription of conversational telephone speech. A detailed discussion of the most important techniques in...
Thomas Hain, Phil Woodland, Gunnar Evermann, Mark Gales, Andrew Liu, Gareth Moore, ...
This paper discusses the Cambridge University HTK (CU-HTK) system for the automatic transcription of conversational telephone speech. A detailed discussion of the most important techniques in...
Resegmentation Gender detection VTLN,CMN, CVN (2002)
Phil Woodl, Gunnar Evermann, Mark Gales, Thomas Hain, Andrew Liu, Gareth Moore, ...
– 12 MF-PLP cepstral parameters + C0 and 1st/2nd derivatives – Side-based cepstral mean and variance normalisation – Vocal tract length normalisation in training and test • Decision tree...
Hidden Model Sequence Models for Automatic Speech Recognition (2001)
Most modern automatic speech recognition systems make use of acoustic models based on hidden Markov models. To obtain reasonable recognition performance within a large vocabulary framework, the...
The 1998 HTK System For Transcription of Conversational Telephone Speech (1998)
Thomas Hain, Phil Woodland, P. C. Woodl, Thomas Niesler, Edward Whittacker
This paper describes the 1998 HTK large vocabulary speech recognition system for conversational telephone speech as used in the NIST 1998 Hub5E evaluation. Front-end and language modelling...
Zugl.: München, Univ., Diss., 1995.
On the Convergence of Fractal Transforms (1994)
This paper reports on investigations concerning the convergence of fractal transforms for signal modelling. Convergence is essential for the functionality of fractal based coding schemes. The coding...