Information Services banner Edinburgh Research Archive The University of Edinburgh crest

Edinburgh Research Archive >
Centre for Speech Technology Research >
CSTR publications >

Please use this identifier to cite or link to this item: http://hdl.handle.net/1842/1070

This item has been viewed 14 times in the last year. View Statistics

Files in This Item:

File Description SizeFormat
trec6.pdf47.65 kBAdobe PDFView/Open
Title: The THISL Spoken Document Retrieval System
Authors: Abberley, Dave
Renals, Steve
Cook, Gary
Robinson, Tony
Issue Date: 1998
Citation: NIST Special Publication 500-240: The Sixth Text REtrieval Conference (TREC-6). pp.747-752.
Publisher: Department of Commerce, National Institute of Standards and Technology
Abstract: The THISL spoken document retrieval system is based on the Abbot Large Vocabulary Continuous Speech Recognition (LVCSR) system developed by Cambridge University, Sheffield University and SoftSound, and uses PRISE (NIST) for indexing and retrieval. We participated in full SDR mode. Our approach was to transcribe the spoken documents at the word level using Abbot, indexing the resulting text transcriptions using PRISE. The LVCSR system uses a recurrent network-based acoustic model (with no adaptation to different conditions) trained on the 50 hour Broadcast News training set, a 65,000 word vocabulary and a trigram language model derived from Broadcast News text. Words in queries which were out-of-vocabulary (OOV) were word spotted at query time (utilizing the posterior phone probabilities output by the acoustic model), added to the transcriptions of the relevant documents and the collection was then re-indexed. We generated pronunciations at run-time for OOV words using the Festival TTS system (University of Edinburgh).
URI: http://trec.nist.gov/pubs/trec6/t6_proceedings.html
http://hdl.handle.net/1842/1070
Appears in Collections:CSTR publications

Items in ERA are protected by copyright, with all rights reserved, unless otherwise indicated.

 

Valid XHTML 1.0! Unless explicitly stated otherwise, all material is copyright © The University of Edinburgh 2013, and/or the original authors. Privacy and Cookies Policy