|
Edinburgh Research Archive >
Centre for Speech Technology Research >
CSTR publications >
Please use this identifier to cite or link to this item:
http://hdl.handle.net/1842/1116
|
| Title: | Modelling the uncertainty in recovering articulation from acoustics |
| Authors: | Richmond, Korin King, Simon Taylor, Paul |
| Issue Date: | 2003 |
| Citation: | Computer Speech and Language, 17:153-172, 2003. |
| Publisher: | Elsevier |
| Abstract: | This paper presents an experimental comparison of the performance of the multilayer perceptron (MLP) with that of the mixture density network (MDN) for an acoustic-to-articulatory mapping task. A corpus of acoustic-articulatory data recorded by electromagnetic articulography (EMA) for a single speaker was used as training and test data for this purpose. In theory, the MDN is able to provide a richer, more flexible description of the target variables in response to a given input vector than the least-squares trained MLP. Our results show that the mean likelihoods of the target articulatory parameters for an unseen test set were indeed consistently higher with the MDN than with the MLP. The increase ranged from approximately 3% to 22%, depending on the articulatory channel in question. On the basis of these results, we argue that using a more flexible description of the target domain, such as that offered by the MDN, can prove beneficial when modelling the acoustic-to-articulatory mapping. |
| Keywords: | electromagnetic articulography speech multilayer perceptron |
| URI: | doi:10.1016/S0885-2308(03)00005-6 http://hdl.handle.net/1842/1116 |
| Appears in Collections: | CSTR publications Linguistics and English Language publications
|
Items in ERA are protected by copyright, with all rights reserved, unless otherwise indicated.
|