Information Services banner Edinburgh Research Archive The University of Edinburgh crest

Edinburgh Research Archive >
Centre for Speech Technology Research >
CSTR publications >

Please use this identifier to cite or link to this item: http://hdl.handle.net/1842/987

This item has been viewed 18 times in the last year. View Statistics

Files in This Item:

File Description SizeFormat
wester.2000.2 ICSLP.pdf55.44 kBAdobe PDFView/Open
Title: A Comparison of Data-Derived and Knowledge-Based Modeling of Pronunciation Variation
Authors: Wester, Mirjam
Fosler-Lussier, Eric
Issue Date: Oct-2000
Citation: In ICSLP-2000, vol.1, 270-273.
Publisher: International Speech Communication Association
Abstract: This paper focuses on modeling pronunciation variation in two different ways: data-derived and knowledge-based. The knowledge-based approach consists of using phonological rules to generate variants. The data-derived approach consists of performing phone recognition, followed by various pruning and smoothing methods to alleviate some of the errors in the phone recognition. Using phonological rules led to a small improvement in WER; whereas, using a data-derived approach in which the phone recognition was smoothed using simple decision trees (d-trees) prior to lexicon generation led to a significant improvement compared to the baseline. Furthermore, we found that 10% of variants generated by the phonological rules were also found using phone recognition, and this increased to 23% when the phone recognition output was smoothed by using d-trees. In addition, we propose a metric to measure confusability in the lexicon and we found that employing this confusion metric to prune variants results in roughly the same improvement as using the d-tree method.
URI: http://www.isca-speech.org/archive/icslp_2000
http://hdl.handle.net/1842/987
Appears in Collections:CSTR publications

Items in ERA are protected by copyright, with all rights reserved, unless otherwise indicated.

 

Valid XHTML 1.0! DSpace Software Copyright © 2002-2010  Duraspace - Feedback