Please use this identifier to cite or link to this item:
http://hdl.handle.net/1842/4865
| Title: | Roles of the Average Voice in Speaker-adaptive HMM-based Speech Synthesis |
| Authors: | Yamagishi, Junichi; Watts, Oliver; King, Simon; Usabaev, Bela |
| Issue Date: | 2010 |
| Citation: | Proc. Interspeech 2010 (Tokyo, Japan). |
| Publisher: | ISCA |
| Abstract: | In speaker-adaptive HMM-based speech synthesis, there are a few speakers whose synthetic speech sounds worse than that of other speakers, despite having the same amount of adaptation data from within the same corpus. This paper investigates these fluctuations in quality and finds that as the mel-cepstral distance from the average voice becomes larger, the MOS generally becomes worse. Although the negative correlation obtained is not strong, it helps us improve the training and adaptation strategies for average voice models. Furthermore, we remark that this correlation is strongly linked to “vocal attractiveness.” |
| Sponsor(s): | The European Community’s Seventh Framework Programme (FP7/2007-2013) under Grant agreement 213845 (the EMIME project) |
| Keywords: | Speech Synthesis; HMM; average voice; speaker adaptation |
| URI: | http://hdl.handle.net/1842/4865 |
| Appears in Collections: | Informatics Publications |