Please use this identifier to cite or link to this item:
http://hdl.handle.net/1842/1045
| Title: | Dialog act modelling for conversational speech |
| Authors: | Stolcke, Andreas; Shriberg, Elizabeth; Bates, Rebecca; Coccaro, Noah; Jurafsky, Daniel; Martin, Rachel; Meteer, Marie; Ries, Klaus; Taylor, Paul; Van Ess-Dykema, Carol |
| Issue Date: | 1998 |
| Citation: | In Applying Machine Learning to Discourse Processing: Papers from the 1998 Spring Symposium, ed. Jennifer Chu-Carroll and Nancy Green, 96-105. Technical Report SS-98-01. American Association for Artificial Intelligence, Menlo Park, California. |
| Publisher: | AAAI Press |
| Abstract: | We describe an integrated approach for statistical modeling of discourse structure for natural conversational speech. Our model is based on 42 'dialog acts' (e.g., Statement, Question, Backchannel, Agreement, Disagreement, Apology), which were hand-labeled in 1155 conversations from the Switchboard corpus of spontaneous human-to-human telephone speech. We developed several models and algorithms to automatically detect dialog acts from transcribed or automatically recognized words and from prosodic properties of the speech signal, and by using a statistical discourse grammar. All of these components were probabilistic in nature and estimated from data, employing a variety of techniques (hidden Markov models, N-gram language models, maximum entropy estimation, decision tree classifiers, and neural networks). In preliminary studies, we achieved a dialog act labeling accuracy of 65% based on recognized words and prosody, and an accuracy of 72% based on word transcripts. Since humans achieve 84% on this task (with chance performance at 35%), we find these results encouraging. |
| URI: | http://aaaipress.org/Library/Symposia/Spring/ss98-01.php http://hdl.handle.net/1842/1045 |
| ISBN: | 978-1-57735-046-0 |
| Appears in Collections: | CSTR publications |
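
The abstract mentions hidden Markov models and a statistical discourse grammar for labeling dialog acts. As a rough illustration only, and not the authors' implementation, the sketch below shows one common way such pieces can be combined: Viterbi decoding over dialog act labels, where a bigram "discourse grammar" supplies transition probabilities and each utterance contributes a per-act likelihood (which in a real system might come from words or prosody). The function name `viterbi`, the two-label inventory, and every probability are invented for this example.

```python
import math


def viterbi(acts, log_init, log_trans, log_obs):
    """Return the most likely dialog act sequence for a conversation.

    acts      -- list of dialog act labels
    log_init  -- log_init[a] = log P(a) for the first utterance
    log_trans -- log_trans[p][a] = log P(a | p), a bigram discourse grammar
    log_obs   -- log_obs[t][a] = log P(evidence for utterance t | a)
    """
    if not log_obs:
        return []
    # best[t][a]: score of the best label sequence ending in act a at time t
    best = [{a: log_init[a] + log_obs[0][a] for a in acts}]
    back = []  # back[t-1][a]: best predecessor of act a at time t
    for t in range(1, len(log_obs)):
        scores, ptr = {}, {}
        for a in acts:
            prev = max(acts, key=lambda p: best[-1][p] + log_trans[p][a])
            scores[a] = best[-1][prev] + log_trans[prev][a] + log_obs[t][a]
            ptr[a] = prev
        best.append(scores)
        back.append(ptr)
    # Trace back from the best final act.
    path = [max(acts, key=lambda a: best[-1][a])]
    for ptr in reversed(back):
        path.append(ptr[path[-1]])
    path.reverse()
    return path


def logs(table):
    """Convert a (possibly nested) dict of probabilities to log probabilities."""
    return {k: logs(v) if isinstance(v, dict) else math.log(v)
            for k, v in table.items()}


# Toy numbers, made up for illustration; they are not taken from the paper.
acts = ["Statement", "Question"]
log_init = logs({"Statement": 0.6, "Question": 0.4})
log_trans = logs({
    "Statement": {"Statement": 0.7, "Question": 0.3},
    "Question": {"Statement": 0.8, "Question": 0.2},
})
log_obs = [
    logs({"Statement": 0.2, "Question": 0.8}),
    logs({"Statement": 0.5, "Question": 0.5}),
    logs({"Statement": 0.9, "Question": 0.1}),
]

decoded = viterbi(acts, log_init, log_trans, log_obs)
print(decoded)
```

In this toy run the second utterance is ambiguous on its own evidence, so the transition probabilities decide its label; that is the role a discourse grammar would play in a setup of this kind.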