|
Edinburgh Research Archive >
Centre for Speech Technology Research >
CSTR publications >
Please use this identifier to cite or link to this item:
http://hdl.handle.net/1842/1130
|
| Title: | CDNN: a context dependent neural network for continuous speech recognition |
| Authors: | Bourlard, Herve Morgan, Nelson Wooters, Chuck Renals, Steve |
| Issue Date: | Mar-1992 |
| Citation: | Acoustics, Speech, and Signal Processing, 1992. ICASSP-92., 1992 IEEE International Conference on, Volume 2, 23-26 March 1992 Page(s):349 - 352. |
| Publisher: | IEEE |
| Abstract: | A series of theoretical and experimental results have suggested that multilayer perceptrons (MLPs) are an effective family of algorithms for the smooth estimate of highly dimensioned probability density functions that are useful in continuous speech recognition. All of these systems have exclusively used context-independent phonetic models, in the sense that the probabilities or costs are estimated for simple speech units such as phonemes or words, rather than biphones or triphones. Numerous conventional systems based on hidden Markov models (HMMs) have been reported that use triphone or triphone like context-dependent models. In one case the outputs of many context-dependent MLPs (one per context class) were used to help choose the best sentence from the N best sentences as determined by a context-dependent HMM system. It is shown how, without any simplifying assumptions, one can estimate likelihoods for context-dependent phonetic models with nets that are not substantially larger than context-independent MLPs. |
| URI: | Digital Object Identifier 10.1109/ICASSP.1992.226048 http://ieeexplore.ieee.org/ http://hdl.handle.net/1842/1130 |
| ISSN: | 1520-6149 |
| Appears in Collections: | CSTR publications
|
Items in ERA are protected by copyright, with all rights reserved, unless otherwise indicated.
|