|
|
Edinburgh Research Archive >
Centre for Speech Technology Research >
CSTR publications >
Please use this identifier to cite or link to this item:
http://hdl.handle.net/1842/2005
|
| Title: | The AMI System for the Transcription of Speech in Meetings |
| Authors: | Hain, Thomas Burget, Lukas Dines, John Garau, Giulia Wan, Vincent Karafiat, Martin Vepa, Jithendra Lincoln, Michael |
| Issue Date: | 2007 |
| Citation: | T. Hain, L. Burget, J. Dines, G. Garau, M. Karafiat, M. Lincoln, J. Vepa, and V. Wan. The AMI System for the Transcription of Speech in Meetings. In Proc. ICASSP, 2007 |
| Abstract: | This paper describes the AMI transcription system for speech in
meetings developed in collaboration by five research groups. The
system includes generic techniques such as discriminative and speaker
adaptive training, vocal tract length normalisation, heteroscedastic
linear discriminant analysis, maximum likelihood linear regression,
and phone posterior based features, as well as techniques specifically
designed for meeting data. These include segmentation and
cross-talk suppression, beam-forming, domain adaptation, web-data
collection, and channel adaptive training. The system was improved
by more than 20% relative in word error rate compared to our previous
system and was used in the NIST RT’06 evaluations where it was
found to yield competitive performance. |
| Keywords: | speech technology speech recognition |
| URI: | http://hdl.handle.net/1842/2005 |
| Appears in Collections: | CSTR publications
|
Items in ERA are protected by copyright, with all rights reserved, unless otherwise indicated.
|