|
Edinburgh Research Archive >
Centre for Speech Technology Research >
CSTR publications >
Please use this identifier to cite or link to this item:
http://hdl.handle.net/1842/1110
|
| Title: | Accent Phrase Segmentation by Finding N-Best Sequences of Pitch Pattern Templates |
| Authors: | Nakai, Mitsuru Shimodaira, Hiroshi |
| Issue Date: | Sep-1994 |
| Citation: | Third International Conference on Spoken Language Processing (ICSLP 94), Yokohama, Japan, September 18-22, 1994. pp.347-350. |
| Publisher: | International Speech Communication Association |
| Abstract: | This paper describes a prosodic method for segmenting continuous speech into accent phrases. Optimum sequences are obtained on the basis of least squared error criterion by using dynamic time warping between F0 contours of input speech and reference accent patterns called 'pitch pattern templates'. But the optimum sequence does not always give good agreement with phrase boundaries labeled by hand, while the second or the third optimum candidate sequence does well. Therefore, we expand our system to be able to find out multiple candidates by using N-best algorithm. Evaluation tests were carried out using the ATR continuous speech database of 10 speakers. The results showed about 97% of phrase boundaries were correctly detected when we took 30-best candidates, and this accuracy is 7.5% higher than the conventional method without using N-best search algorithm. |
| URI: | http://www.isca-speech.org/archive/icslp_1994 http://hdl.handle.net/1842/1110 |
| Appears in Collections: | CSTR publications
|
Items in ERA are protected by copyright, with all rights reserved, unless otherwise indicated.
|