Please use this identifier to cite or link to this item:
http://hdl.handle.net/1842/948
|
| Title: | Maximum entropy segmentation of broadcast news |
| Authors: | Christensen, Heidi; Kolluru, BalaKrishna; Gotoh, Yoshihiko; Renals, Steve |
| Issue Date: | 2005 |
| Citation: | In Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-05), Philadelphia, PA, USA, March 2005. |
| Publisher: | IEEE Signal Processing Society Press. |
| Abstract: | This paper presents an automatic system for structuring and preparing a news broadcast for applications such as speech summarization, browsing, archiving and information retrieval. This process comprises transcribing the audio using an automatic speech recognizer and subsequently segmenting the text into utterances and topics. A maximum entropy approach is used to build statistical models for both utterance and topic segmentation. The experimental work examines how three factors affect the performance of the topic boundary detector: the information sources used, the quality of the ASR transcripts, and the quality of the utterance boundary detector. The results show that topic segmentation is not severely affected by transcript errors, whereas errors in the utterance segmentation are more damaging. |
| URI: | http://hdl.handle.net/1842/948 |
| Appears in Collections: | CSTR publications |
Items in ERA are protected by copyright, with all rights reserved, unless otherwise indicated.