|
Edinburgh Research Archive >
Centre for Speech Technology Research >
CSTR publications >
Please use this identifier to cite or link to this item:
http://hdl.handle.net/1842/2132
|
| Title: | Improved Average-Voice-based Speech Synthesis Using Gender-Mixed Modeling and a Parameter Generation Algorithm Considering GV |
| Authors: | Yamagishi, Junichi Kobayashi, Takao Renals, Steve King, Simon Zen, Heiga Toda, Tomoki Tokuda, Keiichi |
| Issue Date: | Aug-2007 |
| Citation: | Junichi Yamagishi, Takao Kobayashi, Steve Renals, Simon King, Heiga Zen, Tomoki Toda, and Keiichi Tokuda. Improved average-voice-based speech synthesis using gender-mixed modeling and a parameter generation algorithm considering GV. In Proc. 6th ISCA Workshop on Speech Synthesis (SSW-6), August 2007. |
| Abstract: | For constructing a speech synthesis system which can achieve
diverse voices, we have been developing a speaker independent
approach of HMM-based speech synthesis in which statistical
average voice models are adapted to a target speaker using a
small amount of speech data. In this paper, we incorporate a
high-quality speech vocoding method STRAIGHT and a parameter
generation algorithm with global variance into the system
for improving quality of synthetic speech. Furthermore, we
introduce a feature-space speaker adaptive training algorithm
and a gender mixed modeling technique for conducting further
normalization of the average voice model. We build an English
text-to-speech system using these techniques and show the performance
of the system. |
| Keywords: | speech technology |
| URI: | http://hdl.handle.net/1842/2132 |
| Appears in Collections: | CSTR publications Linguistics and English Language publications
|
Items in ERA are protected by copyright, with all rights reserved, unless otherwise indicated.
|