|
Edinburgh Research Archive >
Centre for Speech Technology Research >
CSTR publications >
Please use this identifier to cite or link to this item:
http://hdl.handle.net/1842/923
|
| Title: | Asynchronous Articulatory Feature Recognition Using Dynamic Bayesian networks |
| Authors: | Wester, Mirjam Frankel, Joe King, Simon |
| Issue Date: | Dec-2004 |
| Citation: | Proc. IEICI Beyond HMM Workshop, Kyoto |
| Abstract: | This paper builds on previous work where dynamic Bayesian networks (DBN) were proposed as a model
for articulatory feature recognition. Using DBNs makes it possible to model the dependencies between features, an addition to previous approaches which was found to improve feature recognition performance. The DBN results were promising, giving close to the accuracy of artificial neural nets (ANNs). However, the system was trained on canonical labels, leading to an overly strong set of constraints on feature co-occurrence. In this study, we describe
an embedded training scheme which learns a set of data-driven asynchronous feature changes where supported in the data. Using a subset of the OGI Numbers corpus, we describe articulatory feature recognition experiments using both canonically-trained and asynchronous-feature DBNs. Performance using DBNs is found to exceed that of ANNs trained on an identical task, giving a higher recognition accuracy. Furthermore, inter-feature dependencies
result in a more structured model, giving rise to fewer feature combinations in the recognition output. In addition to an empirical evaluation of this modeling approach, we give a qualitative analysis, investigating the asynchrony
found through our data-driven method and interpreting it using linguistic knowledge. |
| Keywords: | Articulatory feature recognition dynamic Bayesian networks |
| URI: | http://hdl.handle.net/1842/923 |
| Appears in Collections: | CSTR publications Linguistics and English Language publications
|
Items in ERA are protected by copyright, with all rights reserved, unless otherwise indicated.
|