Information Services banner Edinburgh Research Archive The University of Edinburgh crest

Edinburgh Research Archive >
Philosophy, Psychology and Language Sciences, School of >
Linguistics and English Language >
Linguistics and English Language Masters thesis collection >

Please use this identifier to cite or link to this item: http://hdl.handle.net/1842/6168

This item has been viewed 68 times in the last year. View Statistics

Files in This Item:

File Description SizeFormat
Simms 2011 Msc.pdfMasters thesis2.99 MBAdobe PDFView/Open
Title: Investigating Non-Uniqueness in the Acoustic-Articulatory Inversion Mapping
Authors: Simms, Terence
Supervisor(s): Richmond, Korin
Renals, Steve
Issue Date: 31-Aug-2011
Publisher: The University of Edinburgh
Abstract: The task of inferring articulatory configurations from a given acoustic signal is a problem for which a reliable and accurate solution has been lacking for a number of decades. The changing shape of the vocal-tract is responsible for altering the parameters of sound. Each different configuration of articulators will regularly lead to a single distinct sound being produced (a unique mapping from the articulator to the acoustics). Therefore, it should be possible to take an acoustic signal and invert the process, giving the exact vocal-tract shape for a given sound. This would have wide-reaching applications in the field of speech and language technology, such as in improving facial animation and speech recognition systems. Using vocal-tract information inferred from the acoustic signal can facilitate a richer understanding of the actual constraints in articulator movement. However, research concerned with the inversion mapping has revealed that there is often a multi-valued mapping from the acoustic domain to the articulatory domain. Work in identifying and resolving this non-uniqueness thus far has been somewhat successful, with Mixture-Density Networks (MDN) and articulator trajectory systems presenting probabilistic methods of finding the most likely articulatory configuration for a given signal. Using an subset of an EMA corpus, along with a combination of an instantaneous inversion mapping and a non-parametric clustering algorithm, I aim to quantify the extent to which acoustically similar vectors to a given phone can exhibit qualitatively different vocal-tract shapes. Categorical identification of acoustically similar sounds that can have shown a multi-valued mapping in the articulatory domain, as well as identifying which articulators this occurs for, could be key to resolving issues in the reliability and quality of the inversion mapping.
Keywords: Inversion Mapping
Acoustic-Articulatory
Non-uniqueness
Mean-shift clustering
URI: http://hdl.handle.net/1842/6168
Appears in Collections:Linguistics and English Language Masters thesis collection

This item is licensed under a Creative Commons License
Creative Commons

Items in ERA are protected by copyright, with all rights reserved, unless otherwise indicated.

 

Valid XHTML 1.0! DSpace Software Copyright © 2002-2010  Duraspace - Feedback