Amarnag Subramanya

Publication List Details

Period

2004 - 2008

Number

19

Co-Authors

A GENERATIVE-DISCRIMINATIVE FRAMEWORK USING ENSEMBLE METHODS FOR TEXT-DEPENDENT SPEAKER VERIFICATION (2008)

Amarnag Subramanya, Zhengyou Zhang, Arun C. Surendran, Patrick Nguyen, Mukund Narasimhan, Alex Acero

Speaker Verification can be treated as a statistical hypothesis testing problem. The most commonly used approach is the likelihood ratio test (LRT), which can be shown to be optimal using the...

SPEECH MODELING WITH MAGNITUDE-NORMALIZED COMPLEX SPECTRA AND ITS APPLICATION TO MULTISENSORY SPEECH ENHANCEMENT (2008)

Amarnag Subramanya, Zhengyou Zhang, Zicheng Liu, Alex Acero

A good speech model is essential for speech enhancement, but it is very difficult to build because of huge intra- and extra-speaker variation. We present a new speech model for speech enhancement,...

Speech, Signal and Language Interpretation Lab, University of Washington, (2008)

Amarnag Subramanya, Jeff Bilmes

We propose a new set of features based on the temporal statistics of the spectral entropy of speech. We show why these features make good inputs for a speech detector. Moreover, we propose a back-end...

UNCERTAINTY IN TRAINING LARGE VOCABULARY SPEECH RECOGNIZERS (2008)

Amarnag Subramanya, Chris Bartels, Jeff Bilmes, Patrick Nguyen

We propose a technique for annotating data used to train a speech recognizer. The proposed scheme is based on labeling only a single frame for every word in the training set. We make use of the...

Hierarchical Models for Activity Recognition (2008)

Amarnag Subramanya, Alvin Raj

Abstract — In this paper we propose a hierarchical dynamic Bayesian network to jointly recognize the activity and environment of a person. The hierarchical nature of the model allows us to...

MULTI-SENSORY SPEECH PROCESSING: INCORPORATING AUTOMATICALLY EXTRACTED HIDDEN DYNAMIC INFORMATION (2008)

Amarnag Subramanya

We describe a novel technique for multi-sensory speech processing for enhancing noisy speech and for improved noiserobust speech recognition. Both air- and bone-conductive microphones are used to...

Virtual evidence for training speech recognizers using partially labeled data (2007)

Amarnag Subramanya

Collecting supervised training data for automatic speech recognition (ASR) systems is both time consuming and expensive. In this paper we use the notion of virtual evidence in a graphical-model based...

Rao-blackwellized particle filters for recognizing activities and spatial context from wearable sensors (2006)

Alvin Raj, Amarnag Subramanya, Dieter Fox, Jeff Bilmes

Recent advances in wearable sensing and computing devices and in fast probabilistic inference techniques make possible the fine-grained estimation of a person’s activities over extended periods of...

and et.al., “The Vocal Joystick (2006)

Jeff A. Bilmes, Jonathan Malkin, Xiao Li, Susumu Harada, Kelley Kilanski, Katrin Kirchhoff, ...

The Vocal Joystick is a novel human-computer interface mechanism designed to enable individuals with motor impairments to make use of vocal parameters to control objects on a computer screen...

Recognizing activities and spatial context using wearable sensors (2006)

Amarnag Subramanya, Alvin Raj

We introduce a new dynamic model with the capability of recognizing both activities that an individual is performing as well as where that individual is located. Our approach is novel in that it...

Recognizing activities and spatial context using wearable sensors (2006)

Amarnag Subramanya, Alvin Raj

We introduce a new dynamic model with the capability of recognizing both activities that an individual is performing as well as where that individual is located. Our approach is novel in that it...

Rao-blackwellized particle filters for recognizing activities and spatial context from wearable sensors (2006)

Alvin Raj, Amarnag Subramanya, Dieter Fox, Jeff Bilmes

Recent advances in wearable sensing and computing devices and in fast probabilistic inference techniques make possible the fine-grained estimation of a person’s activities over extended periods of...

The Vocal Joystick: A Voice-Based Human-Computer Interface for Individuals with Motor Impairments (2005)

Jeff A. Bilmes, Xiao Li, Jonathan Malkin, Kelley Kilanski, Richard Wright, Katrin Kirchhoff, ...

We present a novel voice-based humancomputer interface designed to enable individuals with motor impairments to use vocal parameters for continuous control tasks. Since discrete spoken commands are...

The Vocal Joystick Demo at UIST05: A Voice-Based Human-Computer Interface (2005)

Jeff A. Bilmes, Xiao Li, Jonathan Malkin, Kelley Kilanski, Richard Wright, Amarnag Subramanya, ...

We will demonstrate a novel voice-based human-computer interface we call the Vocal Joystick (VJ), designed to enable individuals with motor impairments to use vocal parameters for both discrete and...

The Vocal Joystick: A Voice-Based Human-Computer Interface for Individuals with Motor Impairments (2005)

Jeff A. Bilmes, Xiao Li, Jonathan Malkin, Kelley Kilanski, Richard Wright, Katrin Kirchhoff, ...

We present a novel voice-based humancomputer interface designed to enable individuals with motor impairments to use vocal parameters for continuous control tasks. Since discrete spoken commands are...

The Vocal Joystick: A Voice-Based Human-Computer Interface for Individuals with Motor Impairments (2005)

Jeff A. Bilmes, Xiao Li, Jonathan Malkin, Kelley Kilanski, Richard Wright, Katrin Kirchhoff, ...

We present a novel voice-based humancomputer interface designed to enable individuals with motor impairments to use vocal parameters for continuous control tasks. Since discrete spoken commands are...

Microphone array position calibration by basis-point classical multidimensional scaling (2005)

Stanley T. Birchfield, Amarnag Subramanya

Abstract—Classical multidimensional scaling (MDS) is a global, noniterative technique for finding coordinates of points given their interpoint distances. We describe the algorithm and show how it...

The Vocal Joystick: A Voice-Based Human-Computer Interface for Individuals with Motor Impairments (2005)

Jeff A. Bilmes, Xiao Li, Jonathan Malkin, Kelley Kilanski, Richard Wright, Amarnag Subramanya, ...

We describe the technical details behind a novel voicebased human-computer interface designed to enable individuals with motor impairments to use vocal parameters for both discrete and continuous...