Information Services banner Edinburgh Research Archive The University of Edinburgh crest

Edinburgh Research Archive >
Centre for Speech Technology Research >
CSTR publications >

Please use this identifier to cite or link to this item: http://hdl.handle.net/1842/1024

This item has been viewed 11 times in the last year. View Statistics

Files in This Item:

File Description SizeFormat
Molloy_1998_a.pdf54.66 kBAdobe PDFView/Open
Title: Suprasegmental Duration Modelling with Elastic Constraints in Automatic Speech Recognition
Authors: Molloy, Laurence
Isard, Stephen
Issue Date: Dec-1998
Citation: In ICSLP-1998, paper 1103.
Publisher: International Speech Communication Association
Abstract: In this paper a method of integrating a model of suprasegmental duration with a HMM-based recogniser at the post-processing level is presented. The N-Best utterance output is rescored using a suitable linear combination of acoustic log-likelihood (provided by a set of tied-state triphone HMMs) and duration log-likelihood (provided by a set of durational models). The durational model used in the post-processing imposes syllable-level elastic constraints on the durational behaviour of speech segments. Results are presented for word accuracy on the Resource Management database after rescoring, using two different syllable-like constraint units, a fixed-size N-phone window and simple (no constraint) phone duration probability scoring.
URI: http://www.isca-speech.org/archive/icslp_1998/index.html
http://hdl.handle.net/1842/1024
Appears in Collections:CSTR publications

Items in ERA are protected by copyright, with all rights reserved, unless otherwise indicated.

 

Valid XHTML 1.0! DSpace Software Copyright © 2002-2010  Duraspace - Feedback