|
Edinburgh Research Archive >
Centre for Speech Technology Research >
CSTR publications >
Please use this identifier to cite or link to this item:
http://hdl.handle.net/1842/3912
|
| Title: | Integrating Articulatory Features into HMM-based Parametric Speech Synthesis |
| Authors: | Ling, Zhenhua Richmond, Korin Yamagishi, Junichi Wang, Ren-Hua |
| Issue Date: | 2009 |
| Journal Title: | IEEE Transactions on Audio, Speech and Language Processing |
| Volume: | 17 |
| Issue: | 6 |
| Page Numbers: | 1171--1 |
| Abstract: | This paper presents an investigation of ways to integrate articulatory features into Hidden Markov Model (HMM)-based parametric speech synthesis, primarily with the aim of improving the performance of acoustic parameter generation. The joint distribution of acoustic and articulatory features is estimated during training and is then used for parameter generation at synthesis time in conjunction with a maximum-likelihood criterion. Different model structures are explored to allow the articulatory features to influence acoustic modeling: model clustering, state synchrony and cross-stream feature dependency. The results of objective evaluation show that the accuracy of acoustic parameter prediction can be improved when shared clustering and asynchronous-state model structures are adopted for combined acoustic and articulatory features. More significantly, our experiments demonstrate that modeling the dependency between these two feature streams can make speech synthesis more flexible. The characteristics of synthetic speech can be easily controlled by modifying generated articulatory features as part of the process of acoustic parameter generation. |
| URI: | http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=5109768 http://hdl.handle.net/1842/3912 |
| Appears in Collections: | CSTR publications
|
Items in ERA are protected by copyright, with all rights reserved, unless otherwise indicated.
|