
dc.contributor.author: Shiga, Yoshinori
dc.contributor.author: Matsuura, Hiroshi
dc.contributor.author: Nitta, Tsuneo
dc.coverage.spatial: 4 [en]
dc.date.accessioned: 2006-05-11T16:51:38Z
dc.date.available: 2006-05-11T16:51:38Z
dc.date.issued: 1998-12
dc.identifier.citation: In ICSLP-1998, paper 0518 [en]
dc.identifier.uri: http://www.isca-speech.org/archive/icslp_1998/index.html
dc.identifier.uri: http://hdl.handle.net/1842/1006
dc.description.abstract: This paper proposes a new method that determines segmental duration for text-to-speech conversion based on the movement of articulatory organs which compose an articulatory model. The articulatory model comprises four time-variable articulatory parameters representing the conditions of articulatory organs whose physical restriction seems to significantly influence the segmental duration. The parameters are controlled according to an input sequence of phonetic symbols, following which segmental duration is determined based on the variation of the articulatory parameters. The proposed method is evaluated through an experiment using a Japanese speech database that consists of 150 phonetically balanced sentences. The results indicate that the mean square error of predicted segmental duration is approximately 15[ms] for the closed set and 15-17[ms] for the open set. The error is within 20[ms], the level of acceptability for distortion of segmental duration without loss of naturalness, and hence the method is proved to effectively predict segmental duration. [en]
dc.format.extent: 136850 bytes
dc.format.mimetype: application/pdf
dc.language.iso: en
dc.publisher: International Speech Communication Association [en]
dc.title: Segmental Duration Control Based on an Articulatory Model [en]
dc.type: Conference Paper [en]

