Information Services banner Edinburgh Research Archive The University of Edinburgh crest

Edinburgh Research Archive >
Centre for Speech Technology Research >
CSTR publications >

Please use this identifier to cite or link to this item: http://hdl.handle.net/1842/1258

This item has been viewed 75 times in the last year. View Statistics

Files in This Item:

File Description SizeFormat
Taylor 99.pdf272.1 kBAdobe PDFView/Open
Title: Assigning phrase breaks from part-of-speech sequences
Authors: Taylor, Paul
Black, Alan W
Issue Date: Apr-1998
Citation: Computer Speech and Language (1998) 12, 99-117.
Publisher: Academic Press
Abstract: This paper presents an algorithm for automatically assigning phrase breaks to unrestricted text for use in a text-to-speech synthesizer. Text is first converted into a sequence of part-of-speech tags. Next a Markov model is used to give the most likely sequence of phrase breaks for the input part-of-speech tags. In the Markov model, states represent types of phrase break and the transitions between states represent the likelihoods of sequences of phrase types occurring. The paper reports a variety of experiments investigating part-of-speech tag-sets, Markov model structure and smoothing. The best setup correctly identifies 79% of breaks in the test corpus.
URI: http://dx.doi.org/10.1006/csla.1998.0041
http://hdl.handle.net/1842/1258
ISSN: 0885-2308
Appears in Collections:CSTR publications

Items in ERA are protected by copyright, with all rights reserved, unless otherwise indicated.

 

Valid XHTML 1.0! DSpace Software Copyright © 2002-2010  Duraspace - Feedback