Amarnag Subramanya, Zhengyou Zhang, Arun C. Surendran, Patrick Nguyen, Mukund Narasimhan, Alex Acero
Speaker Verification can be treated as a statistical hypothesis testing problem. The most commonly used approach is the likelihood ratio test (LRT), which can be shown to be optimal using the...
Amarnag Subramanya, Zhengyou Zhang, Zicheng Liu, Alex Acero
A good speech model is essential for speech enhancement, but it is very difficult to build because of huge intra- and extra-speaker variation. We present a new speech model for speech enhancement,...
Speech, Signal and Language Interpretation Lab, University of Washington, (2008)
Amarnag Subramanya, Jeff Bilmes
We propose a new set of features based on the temporal statistics of the spectral entropy of speech. We show why these features make good inputs for a speech detector. Moreover, we propose a back-end...
UNCERTAINTY IN TRAINING LARGE VOCABULARY SPEECH RECOGNIZERS (2008)
Amarnag Subramanya, Chris Bartels, Jeff Bilmes, Patrick Nguyen
We propose a technique for annotating data used to train a speech recognizer. The proposed scheme is based on labeling only a single frame for every word in the training set. We make use of the...
Hierarchical Models for Activity Recognition (2008)
Abstract — In this paper we propose a hierarchical dynamic Bayesian network to jointly recognize the activity and environment of a person. The hierarchical nature of the model allows us to...
We describe a novel technique for multi-sensory speech processing for enhancing noisy speech and for improved noiserobust speech recognition. Both air- and bone-conductive microphones are used to...
Virtual evidence for training speech recognizers using partially labeled data (2007)
Collecting supervised training data for automatic speech recognition (ASR) systems is both time consuming and expensive. In this paper we use the notion of virtual evidence in a graphical-model based...
Alvin Raj, Amarnag Subramanya, Dieter Fox, Jeff Bilmes
Recent advances in wearable sensing and computing devices and in fast probabilistic inference techniques make possible the fine-grained estimation of a person’s activities over extended periods of...
and et.al., “The Vocal Joystick (2006)
Jeff A. Bilmes, Jonathan Malkin, Xiao Li, Susumu Harada, Kelley Kilanski, Katrin Kirchhoff, ...
The Vocal Joystick is a novel human-computer interface mechanism designed to enable individuals with motor impairments to make use of vocal parameters to control objects on a computer screen...
Recognizing activities and spatial context using wearable sensors (2006)
We introduce a new dynamic model with the capability of recognizing both activities that an individual is performing as well as where that individual is located. Our approach is novel in that it...
Recognizing activities and spatial context using wearable sensors (2006)
We introduce a new dynamic model with the capability of recognizing both activities that an individual is performing as well as where that individual is located. Our approach is novel in that it...
Alvin Raj, Amarnag Subramanya, Dieter Fox, Jeff Bilmes
Recent advances in wearable sensing and computing devices and in fast probabilistic inference techniques make possible the fine-grained estimation of a person’s activities over extended periods of...
Jeff A. Bilmes, Xiao Li, Jonathan Malkin, Kelley Kilanski, Richard Wright, Katrin Kirchhoff, ...
We present a novel voice-based humancomputer interface designed to enable individuals with motor impairments to use vocal parameters for continuous control tasks. Since discrete spoken commands are...
The Vocal Joystick Demo at UIST05: A Voice-Based Human-Computer Interface (2005)
Jeff A. Bilmes, Xiao Li, Jonathan Malkin, Kelley Kilanski, Richard Wright, Amarnag Subramanya, ...
We will demonstrate a novel voice-based human-computer interface we call the Vocal Joystick (VJ), designed to enable individuals with motor impairments to use vocal parameters for both discrete and...
Jeff A. Bilmes, Xiao Li, Jonathan Malkin, Kelley Kilanski, Richard Wright, Katrin Kirchhoff, ...
We present a novel voice-based humancomputer interface designed to enable individuals with motor impairments to use vocal parameters for continuous control tasks. Since discrete spoken commands are...
Jeff A. Bilmes, Xiao Li, Jonathan Malkin, Kelley Kilanski, Richard Wright, Katrin Kirchhoff, ...
We present a novel voice-based humancomputer interface designed to enable individuals with motor impairments to use vocal parameters for continuous control tasks. Since discrete spoken commands are...
Microphone array position calibration by basis-point classical multidimensional scaling (2005)
Stanley T. Birchfield, Amarnag Subramanya
Abstract—Classical multidimensional scaling (MDS) is a global, noniterative technique for finding coordinates of points given their interpoint distances. We describe the algorithm and show how it...
Jeff A. Bilmes, Xiao Li, Jonathan Malkin, Kelley Kilanski, Richard Wright, Amarnag Subramanya, ...
We describe the technical details behind a novel voicebased human-computer interface designed to enable individuals with motor impairments to use vocal parameters for both discrete and continuous...
Dynamic Bayesian networks for audio-visual speech recognition / (2004)
Thesis (M.S.)--Clemson University, 2004.