Roger Hsiao

Inductive Programming by Expectation Maximization Algorithm (2008)

Roger Hsiao

This paper proposes an algorithm which can write programs automatically to solve problems. We model the sequence of instructions as a n-gram language model and the sequence is represented by some...

June 19–21, 2006 • Barcelona, Spain TC-STAR Workshop on Speech-to-Speech Translation The ISL TC-STAR Spring 2006 ASR Evaluation Systems (2008)

Sebastian Stüker, Christian Fügen, Roger Hsiao, Shajith Ikbal, Qin Jin, Florian Kraft, ...

The project Technology and Corpora for Speech to Speech Translation (TC-STAR) aims at making a break-through in speech-to-speech translation research, significantly reducing the gap between the...

A comparison of various adaptation methods for speaker verification with limited enrollment data (2008)

Man-wai Mak, Roger Hsiao, Brian Mak

One key factor that hinders the widespread deployment of speaker verification technologies is the requirement of long enrollment utterances to guarantee low error rate during verification. To gain...

Feature Decision extractionSpeaker Modeling (2008)

Man-wai Mak, Roger Hsiao, Brian Mak

◮ To gain user acceptance of speaker verification technologies, adaptation algorithms that can enroll speakers with short utterances are highly essential. ◮ This paper compares four...

DISCRIMINATIVE TRAINING OF AUDITORY FILTERS OF DIFFERENT SHAPES FOR ROBUST SPEECH RECOGNITION ABSTRACT (2008)

Brian Mak, Yik-cheung Tamf, Roger Hsiao

The bank-of-filters spectrum analysis model is commonly used in the extraction of acoustic features for automatic speech recogni-tion. The most critical component in the analysis model is a bank of...

The CMU TransTac 2007 Eyes-free and Hands-free Two-way Speech-to-Speech Translation System (2008)

Nguyen Bach, Matthias Eck, Paisarn Charoenpornsawat, Thilo Köhler, Sebastian Stüker, Thuylinh Nguyen, ...

The paper describes our portable two-way speech-tospeech translation system using a completely eyesfree/hands-free user interface. This system translates between the language pair English and Iraqi...

Optimizing Components for Handheld Two-way Speech Translation for an (2008)

Roger Hsiao, Ashish Venugopal, Thilo Köhler, Ying Zhang, Paisarn Charoenpornsawat, ...

This paper described our handheld two-way speech translation system for English and Iraqi. The focus is on developing a field usable handheld device for speech-to-speech translation. The computation...

Kernel eigenspace-based MLLR adaptation (2007)

Mak, Brian Kan-Wing, Hsiao, Roger

Recently, we have been investigating the application of kernel methods for fast speaker adaptation by exploiting possible non-linearity in the input speaker space. In this paper, we propose another...

Robustness of several kernel-based fast adaptation methods on noisy LVCSR (2007)

Mak, Brian Kan-Wing, Hsiao, Roger

We have been investigating the use of kernel methods to improve conventional linear adaptation algorithms for fast adaptation, when there are less than 10s of adaptation speech. On clean speech, we...

Embedded kernel eigenvoice speaker adaptation and its implication to reference speaker weighting (2006)

Mak, Brian Kan-Wing, Hsiao, Roger, Ho, Simon, Kwok, James Tin-Yau

Recently, we proposed an improvement to theconventional eigenvoice (EV) speaker adaptation using kernel methods. In our novel kernel eigenvoice (KEV) speaker adaptation [1], speaker supervectors are...

Embedded kernel eigenvoice speaker adaptation and its implication to reference speaker weighting (2006)

Mak, Brian Kan-Wing, Hsiao, Roger, Ho, Simon, Kwok, James Tin-Yau

Recently, we proposed an improvement to theconventional eigenvoice (EV) speaker adaptation using kernel methods. In our novel kernel eigenvoice (KEV) speaker adaptation [1], speaker supervectors are...

Optimizing Components for Handheld Two-way Speech Translation for an English-Iraqi Arabic System (2006)

Roger Hsiao, Ashish Venugopal, Thilo Köhler, Ying Zhang, Paisarn Charoenpornsawat, Andreas Zollmann, ...

This paper described our handheld two-way speech translation system for English and Iraqi. The focus is on developing a field usable handheld device for speech-to-speech translation. The computation...

Optimizing Components for Handheld Two-way Speech Translation for an (2006)

Roger Hsiao, Ashish Venugopal, Thilo Köhler, Ying Zhang, Paisarn Charoenpornsawat, ...

This paper described our handheld two-way speech translation system for English and Iraqi. The focus is on developing a field usable handheld device for speech-to-speech translation. The computation...

A comparative study of two kernel eigenspace-based speaker adaptation methods on large vocabulary continuous speech recognition (2005)

Hsiao, Roger, Mak, Brian Kan-Wing

Eigenvoice (EV) speaker adaptation has been shown effective for fast speaker adaptation when the amount of adaptation data is scarce. In the past two years, we have been investigating the application...

Kernel eigenspace-based MLLR adaptation using multiple regression classes (2005)

Hsiao, Roger, Mak, Brian Kan-Wing

Recently, we have been investigating the application of kernel methods to improve the performance of eigenvoice-based adaptation methods by exploiting possible nonlinearity in their original working...

Kernel Eigenspace-Based Mllr Adaptation Using Multiple Regression (2005)

Classes Roger Hsiao, Roger Hsiao, Brian Mak

Recently, we have been investigating the application of kernel methods to improve the performance of eigenvoice-based adaptation methods by exploiting possible nonlinearity in their original working...

Improving eigenspace-based MLLR adaptation by kernel PCA (2004)

Mak, Brian Kan-Wing, Hsiao, Roger

Eigenspace-based MLLR (EMLLR) adaptation has been shown effective for fast speaker adaptation. It applies the basic idea of eigenvoice adaptation, and derives a small set of eigenmatrices using...

Improving Eigenspace-based MLLR Adaptation by Kernel PCA (2004)

Brian Mak And, Brian Mak, Roger Hsiao

Eigenspace-based MLLR (EMLLR) adaptation has been shown effective for fast speaker adaptation. It applies the basic idea of eigenvoice adaptation, and derives a small set of eigenmatrices using...