|
Edinburgh Research Archive >
Informatics, School of >
Informatics Publications >
Please use this identifier to cite or link to this item:
http://hdl.handle.net/1842/3680
| Title: | Speaker adaptation and the evaluation of speaker similarity in the EMIME speech-to-speech translation project |
| Authors: | Wester, Mirjam Dines, John Gibson, Matthew Liang, Hui Wu, Yi-Jian Saheer, Lakshmi King, Simon Oura, Keiichiro Garner, Philip N. Byrne, William Guan, Yong Hirsimäki, Teemu Karhila, Reima Kurimo, Mikko Shannon, Matt Shiota, Sayaka Tian, Jilei Tokuda, Keiichi Yamagishi, Junichi |
| Issue Date: | Sep-2010 |
| Publisher: | 7th ISCA Speech Synthesis Workshop |
| Abstract: | This paper provides an overview of speaker adaptation research carried out in the EMIME speech-to-speech translation (S2ST) project. We focus on how speaker adaptation transforms can be learned from speech in one language and applied to the acoustic models of another language. The adaptation is transferred across languages and/or from recognition models to synthesis models. The various approaches investigated can all be viewed as a process in which a mapping is defined in terms of either acoustic model states or linguistic units. The mapping is used to transfer either speech data or adaptation transforms between the two models. Because the success of speaker adaptation in text-to-speech synthesis is measured by judging speaker similarity, we also discuss issues concerning evaluation of speaker similarity in an S2ST scenario. |
| Sponsorship: | European Community's Seventh Framework Programme (FP7/2007-2013) grant agreement 213845 (the EMIME project) |
| Keywords: | speech synthesis speaker adaptation |
| URI: | http://hdl.handle.net/1842/3680 |
| Appears in Collections: | Informatics Publications
|
Items in ERA are protected by copyright, with all rights reserved, unless otherwise indicated.
|