|
Edinburgh Research Archive >
Centre for Speech Technology Research >
CSTR publications >
Please use this identifier to cite or link to this item:
http://hdl.handle.net/1842/1089
|
| Title: | Bayesian modelling of vowel segment duration for text-to-speech synthesis using distinctive features |
| Authors: | Goubanova, Olga V |
| Issue Date: | 2003 |
| Citation: | In Proc. ICPhS 2003, volume 3, page 2349, Barcelona, Spain, 2003. |
| Publisher: | International Congress of Phonetic Sciences |
| Abstract: | We report the results of applying the Bayesian Belief Network (BN) approach to predicting vowel duration. A Bayesian inference of the vowel duration is performed on a hybrid Bayesian network consisting of discrete and continuous nodes, with the nodes in the network representing the linguistic factors that affect segment duration. New to the present research, we model segment identity factor as a set of distinctive features. The features chosen were height, frontness, length, and roundness. We also experimented with a word class feature that implicitly represents word frequency information. We contrasted the results of the belief network model with those of the sums of products (SoP) model and classification and regression tree (CART) model. We trained and tested all three models on the same data. In terms of the RMS error and correlation coefficient, our BN model performs no worse than SoP model, and it significantly outperforms CART model. |
| Keywords: | speech Bayesian Belief Network |
| URI: | http://hdl.handle.net/1842/1089 |
| Appears in Collections: | CSTR publications
|
Items in ERA are protected by copyright, with all rights reserved, unless otherwise indicated.
|