Edinburgh Research Archive, The University of Edinburgh


Please use this identifier to cite or link to this item: http://hdl.handle.net/1842/4865

Files in This Item:

File: p50227.pdf
Size: 405.16 kB
Format: Adobe PDF
Title: Roles of the Average Voice in Speaker-adaptive HMM-based Speech Synthesis
Authors: Yamagishi, Junichi
Watts, Oliver
King, Simon
Usabaev, Bela
Issue Date: 2010
Citation: Proc. Interspeech 2010 (Tokyo, Japan).
Publisher: ISCA
Abstract: In speaker-adaptive HMM-based speech synthesis, the synthetic speech of a few speakers sounds worse than that of other speakers, even though each has the same amount of adaptation data from within the same corpus. This paper investigates these fluctuations in quality and finds that, as the mel-cepstral distance from the average voice becomes larger, MOS scores generally become worse. Although the negative correlation obtained is not strong, it helps us improve the training and adaptation strategies for average voice models. Furthermore, we remark that this correlation is strongly linked to “vocal attractiveness.”
Sponsor(s): The European Community’s Seventh Framework Programme (FP7/2007-2013) under Grant agreement 213845 (the EMIME project)
Keywords: Speech Synthesis
HMM
average voice
speaker adaptation
URI: http://hdl.handle.net/1842/4865
Appears in Collections: Informatics Publications

Items in ERA are protected by copyright, with all rights reserved, unless otherwise indicated.

 
