|
Edinburgh Research Archive >
Centre for Speech Technology Research >
CSTR publications >
Please use this identifier to cite or link to this item:
http://hdl.handle.net/1842/1024
|
| Title: | Suprasegmental Duration Modelling with Elastic Constraints in Automatic Speech Recognition |
| Authors: | Molloy, Laurence Isard, Stephen |
| Issue Date: | Dec-1998 |
| Citation: | In ICSLP-1998, paper 1103. |
| Publisher: | International Speech Communication Association |
| Abstract: | In this paper a method of integrating a model of suprasegmental duration with a HMM-based recogniser at the post-processing level is presented. The N-Best utterance output is rescored using a suitable linear combination of acoustic log-likelihood (provided by a set of tied-state triphone HMMs) and duration log-likelihood (provided by a set of durational models). The durational model used in the post-processing imposes syllable-level elastic constraints on the durational behaviour of speech segments. Results are presented for word accuracy on the Resource Management database after rescoring, using two different syllable-like constraint units, a fixed-size N-phone window and simple (no constraint) phone duration probability scoring. |
| URI: | http://www.isca-speech.org/archive/icslp_1998/index.html http://hdl.handle.net/1842/1024 |
| Appears in Collections: | CSTR publications
|
Items in ERA are protected by copyright, with all rights reserved, unless otherwise indicated.
|