Publication View

1 (2008)

Abstract
This chapter was written in 1994. Further advances have been made such as: context- dependent phone modelling; forward-backward training and adaptation using linear input transformations. This chapter describes a use of recurrent neural networks (i.e., feedback is incorpo- rated in the computation) as an acoustic model for continuous speech recognition. The form of the recurrent neural network is described along with an appropriate pa- rameter estimation procedure. For each frame of acoustic data, the recurrent network generates an estimate of the posterior probability of of the possible phones given the observed acoustic signal. The posteriors are then converted into scaled likelihoods and used as the observation probabilities within a conventional decoding paradigm (e.g., Viterbi decoding). The advantages of using recurrent networks are that they require a small number of parameters and provide a fast decoding capability (relative to conventional, large-vocabulary, HMM systems). Most- if not all- automatic speech recognition systems explicitly or implicitly compute a (equivalently, etc.) indicating how well an input acoustic signal matches a speech model of the hypothesised utterance. A fundamental problem in speech recognition is how this score may be computed,

Publication details
Download http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.65.749
Source http://www.cstr.ed.ac.uk/downloads/publications/1996/rnn4csr96.ps.gz
Contributors CiteSeerX
Repository CiteSeerX - Scientific Literature Digital Library and Search Engine (United States)
Keywords NEURAL NETWORKS IN
Type text
Language English
Relation 10.1.1.133.4884, 10.1.1.137.4028, 10.1.1.49.7119, 10.1.1.55.3690