Information Services banner Edinburgh Research Archive The University of Edinburgh crest

Edinburgh Research Archive >
Centre for Speech Technology Research >
CSTR publications >

Please use this identifier to cite or link to this item: http://hdl.handle.net/1842/923

This item has been viewed 18 times in the last year. View Statistics

Files in This Item:

File Description SizeFormat
Wester_et_al_IEICE.pdf128.71 kBAdobe PDFView/Open
Title: Asynchronous Articulatory Feature Recognition Using Dynamic Bayesian networks
Authors: Wester, Mirjam
Frankel, Joe
King, Simon
Issue Date: Dec-2004
Citation: Proc. IEICI Beyond HMM Workshop, Kyoto
Abstract: This paper builds on previous work where dynamic Bayesian networks (DBN) were proposed as a model for articulatory feature recognition. Using DBNs makes it possible to model the dependencies between features, an addition to previous approaches which was found to improve feature recognition performance. The DBN results were promising, giving close to the accuracy of artificial neural nets (ANNs). However, the system was trained on canonical labels, leading to an overly strong set of constraints on feature co-occurrence. In this study, we describe an embedded training scheme which learns a set of data-driven asynchronous feature changes where supported in the data. Using a subset of the OGI Numbers corpus, we describe articulatory feature recognition experiments using both canonically-trained and asynchronous-feature DBNs. Performance using DBNs is found to exceed that of ANNs trained on an identical task, giving a higher recognition accuracy. Furthermore, inter-feature dependencies result in a more structured model, giving rise to fewer feature combinations in the recognition output. In addition to an empirical evaluation of this modeling approach, we give a qualitative analysis, investigating the asynchrony found through our data-driven method and interpreting it using linguistic knowledge.
Keywords: Articulatory feature recognition
dynamic Bayesian networks
URI: http://hdl.handle.net/1842/923
Appears in Collections:CSTR publications
Linguistics and English Language publications

Items in ERA are protected by copyright, with all rights reserved, unless otherwise indicated.

 

Valid XHTML 1.0! DSpace Software Copyright © 2002-2010  Duraspace - Feedback