Please use this identifier to cite or link to this item: http://hdl.handle.net/1842/4656

Files in This Item:

File: 100116.pdf (161.19 kB, Adobe PDF)
Title: Prediction and Realisation of Conversational Characteristics by Utilising Spontaneous Speech for Unit Selection
Authors: Andersson, Sebastian; Georgila, Kallirroi; Traum, David; Aylett, Matthew; Clark, Robert A J
Issue Date: 2010
Journal Title: Speech Prosody 2010
Abstract: Unit selection speech synthesis has reached high levels of naturalness and intelligibility for neutral read-aloud speech. However, synthetic speech generated using neutral read-aloud data lacks the attitude, intention and spontaneity associated with everyday conversations. Unit selection is heavily data-dependent, and thus in order to simulate human conversational speech, or create synthetic voices for believable virtual characters, we need to utilise speech data with examples of how people talk rather than how people read. In this paper we included carefully selected utterances from spontaneous conversational speech in a unit selection voice. Using this voice, and by automatically predicting the type and placement of lexical fillers and filled pauses, we can synthesise utterances with conversational characteristics. A perceptual listening test showed that it is possible to make synthetic speech sound more conversational without degrading naturalness.
URI: http://hdl.handle.net/1842/4656
Appears in Collections: CSTR publications

Items in ERA are protected by copyright, with all rights reserved, unless otherwise indicated.