Show simple item record

dc.contributor.authorCampbell, Nick
dc.contributor.authorIsard, Stephen
dc.contributor.authorMonaghan, Alex
dc.contributor.authorVerhoeven, J.
dc.coverage.spatial4en
dc.date.accessioned2006-06-06T10:31:22Z
dc.date.available2006-06-06T10:31:22Z
dc.date.issued1990-11
dc.identifier.citation[ICSLP-1990] First International Conference on Spoken Language Processing (ICSLP 90), Kobe, Japan, November 18-22, 1990. pp.825-828.en
dc.identifier.urihttp://www.isca-speech.org/archive/icslp_1990
dc.identifier.urihttp://hdl.handle.net/1842/1208
dc.description.abstractThis paper describes the prosodic processing and wave-form generation components of the text-to-speech system being developed at Edinburgh University's Centre for Speech Technology Research. Intonation is specified as a sequence of minimal descriptors whose locations are given in terms of syntactically-determined prosodic domains. A pitch contour is computed by converting the descriptors into a sequence of abstract targets whose absolute values depend on a specific speaker model. Duration is determined first at the level of the syllable by a neural network, then accommodated at the segment level according to the distributions observed in a phonetically balanced database. The output waveform is generated by LPC resynthesis of diphone units. Three methods of diphone segmentation are discussed.en
dc.format.extent351312 bytes
dc.format.mimetypeapplication/pdf
dc.language.isoen
dc.publisherInternational Speech Communication Associationen
dc.titleDuration, Pitch and Diphones in the CSTR TTS Systemen
dc.typeConference Paperen


Files in this item

This item appears in the following Collection(s)

Show simple item record