Now showing items 1-5 of 5
Vocal Attractiveness Of Statistical Speech Synthesisers
Our previous analysis of speaker-adaptive HMM-based speech synthesis methods suggested that there are two possible reasons why average voices can obtain higher subjective scores than any individual adapted voice: 1) model ...
The CSTR/EMIME HTS system for Blizzard Challenge 2010
In the 2010 Blizzard Challenge, we focused on improving steps relating to feature extraction and labeling in the procedures for training HMM-based speech synthesis systems. New auditory scales were used for spectral ...
Roles of the Average Voice in Speaker-adaptive HMM-based Speech Synthesis
In speaker-adaptive HMM-based speech synthesis, there are a few speakers whose synthetic speech sounds worse than that of other speakers, despite having the same amount of adaptation data from within the same corpus. This ...
Speaker adaptation and the evaluation of speaker similarity in the EMIME speech-to-speech translation project
(7th ISCA Speech Synthesis Workshop, 2010-09)
This paper provides an overview of speaker adaptation research carried out in the EMIME speech-to-speech translation (S2ST) project. We focus on how speaker adaptation transforms can be learned from speech in one language ...
The Romanian Speech Synthesis (RSS) corpus: building a high quality HMM-based speech synthesis system using a high sampling rate
This paper first introduces a newly-recorded high quality Romanian speech corpus designed for speech synthesis, called “RSS”, along with Romanian front-end text processing modules and HMM-based synthetic voices built from ...