Please use this identifier to cite or link to this item:
http://hdl.handle.net/1842/957
| Title: | From text to prosody without ToBI |
| Authors: | Strom, Volker |
| Issue Date: | Sep-2002 |
| Citation: | In Proceedings of the 7th International Conference on Spoken Language Processing (ICSLP 2002 - INTERSPEECH 2002), Denver, Colorado, USA, September 16-20, 2002 |
| Publisher: | International Speech Communication Association |
| Abstract: | A new method for predicting prosodic parameters, i.e. phone durations and F0 targets, from preprocessed text is presented. The prosody model comprises a set of CARTs, which are learned from a large database of labeled speech. This database need not be annotated with Tone and Break Indices (ToBI labels). Instead, a simpler symbolic prosodic description is created by a bootstrapping method. The method has been applied to one Spanish and two German speakers. For the German voices, two listening tests showed a significant preference for the new method over a more traditional approach to prosody prediction based on hand-crafted rules. |
| URI: | http://www.isca-speech.org/archive/icslp02 http://hdl.handle.net/1842/957 |
| Appears in Collections: | CSTR publications |