Edinburgh Research Archive, The University of Edinburgh


Please use this identifier to cite or link to this item: http://hdl.handle.net/1842/2847


Files in This Item:

File: MSc_Dissertation_Steve_Woodcock.doc
Size: 1.45 MB
Format: Microsoft Word

Title: Optimising Join Cost Weights For Unit Selection Speech Synthesis
Authors: Woodcock, Steve
Supervisor(s): Clark, Robert
Issue Date: 4-Dec-2008
Abstract: Unit selection synthesis predominates today, but is not yet of a quality to rival natural speech. Unit selection can be inconsistent in quality, and one of the causes is the joins. Earlier research suggested that joins are perceived differently according to category. We investigated whether synthesis was perceived as more natural if join costs were calculated with reference to phonetic category. The join cost in the Festival multisyn synthesis system was extended beyond purely acoustic measures to categorise joins phonetically. Two methods were used to optimise the join subcosts for each category: a hand-tuned heuristic and an automated data-centric approach. For this task the data-centric approach ultimately proved more suitable. Default synthesis was compared to the ‘optimised’ synthesis in a perceptual experiment. Results were mixed: some syntheses were perceived as better, some as worse, and for others participants expressed no preference. There was no significant overall preference for the optimised synthesis. The results indicated that our optimised join cost was not yet a good model. No attempt to optimise the Festival multisyn join cost had been made prior to this investigation. This suggests that further studies, varying the model and/or using more sophisticated optimisation methods, may yet produce synthesis that is perceived as more natural for any input text.
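Illustration: the idea of calculating join costs with reference to phonetic category can be sketched roughly as below. This is a minimal sketch, not the thesis's implementation or Festival's API. The subcost names (`spectral`, `f0`, `energy`), the category labels and all weight values are hypothetical placeholders chosen for illustration; the abstract does not list the actual subcosts, categories or tuned weights.

```python
# Hypothetical subcost names and weights; none of these values come from the thesis.
# Default behaviour: every subcost contributes equally to the join cost.
DEFAULT_WEIGHTS = {"spectral": 1.0, "f0": 1.0, "energy": 1.0}

# Assumed per-category weight sets, as might result from hand tuning
# or from an automated, data-driven optimisation.
CATEGORY_WEIGHTS = {
    "vowel": {"spectral": 2.0, "f0": 1.0, "energy": 1.0},
    "fricative": {"spectral": 0.5, "f0": 0.25, "energy": 1.0},
}


def join_cost(subcosts, category=None):
    """Return the weighted mean of the subcosts for one candidate join.

    `subcosts` maps each subcost name to a distance measured across the join.
    `category` is the phonetic category of the join; unknown or missing
    categories fall back to the equal default weights.
    """
    weights = CATEGORY_WEIGHTS.get(category, DEFAULT_WEIGHTS)
    total_weight = sum(weights.values())
    weighted_sum = sum(weights[name] * subcosts[name] for name in weights)
    return weighted_sum / total_weight


if __name__ == "__main__":
    example = {"spectral": 0.8, "f0": 0.2, "energy": 0.2}
    for cat in (None, "vowel", "fricative"):
        print(cat, round(join_cost(example, cat), 4))
```

Normalising by the total weight keeps costs comparable across categories, which matters if the join cost is later combined with a target cost; whether the thesis normalises in this way is not stated in the abstract.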
Keywords: synthesis; join cost; optimise
URI: http://hdl.handle.net/1842/2847
Appears in Collections:Linguistics and English Language Masters thesis collection

This item is licensed under a Creative Commons License

Items in ERA are protected by copyright, with all rights reserved, unless otherwise indicated.

 
