Information Services banner Edinburgh Research Archive The University of Edinburgh crest

Edinburgh Research Archive >
Centre for Speech Technology Research >
CSTR publications >

Please use this identifier to cite or link to this item: http://hdl.handle.net/1842/4657

This item has been viewed 17 times in the last year. View Statistics

Files in This Item:

File Description SizeFormat
oura_icassp2010.pdf508.89 kBAdobe PDFView/Open
Title: Unsupervised Cross-lingual Speaker Adaptation for HMM-based Speech Synthesis
Authors: Oura, Keiichiro
Tokuda, Keiichi
Yamagishi, Junichi
Wester, Mirjam
King, Simon
Issue Date: 2010
Journal Title: Proc. of ICASSP
Abstract: In the EMIME project, we are developing a mobile device that performs personalized speech-to-speech translation such that a user's spoken input in one language is used to produce spoken output in another language, while continuing to sound like the user's voice. We integrate two techniques, unsupervised adaptation for HMM-based TTS using a word-based large-vocabulary continuous speech recognizer and cross-lingual speaker adaptation for HMM-based TTS, into a single architecture. Thus, an unsupervised cross-lingual speaker adaptation system can be developed. Listening tests show very promising results, demonstrating that adapted voices sound similar to the target speaker and that differences between supervised and unsupervised cross-lingual speaker adaptation are small.
URI: http://hdl.handle.net/1842/4657
Appears in Collections:CSTR publications

Items in ERA are protected by copyright, with all rights reserved, unless otherwise indicated.

 

Valid XHTML 1.0! DSpace Software Copyright © 2002-2010  Duraspace - Feedback