Information Services banner Edinburgh Research Archive The University of Edinburgh crest

Edinburgh Research Archive >
Centre for Speech Technology Research >
CSTR publications >

Please use this identifier to cite or link to this item: http://hdl.handle.net/1842/951

This item has been viewed 4 times in the last year. View Statistics

Files in This Item:

File Description SizeFormat
shimodaira ICASSP.pdf336.7 kBAdobe PDFView/Open
Title: Jacobian Joint Adaptation to Noise, Channel and Vocal Tract Length
Authors: Shimodaira, Hiroshi
Sakai, Nobuyoshi
Nakai, Mitsuru
Sagayama, Shigeki
Issue Date: May-2002
Citation: Proc. IEEE ICASSP2002, pp.197-200
Publisher: IEEE
Abstract: A new Jacobian approach that linearly decomposes the composite of additive noise, multiplicative noise (channel transfer function) and speaker's vocal tract length, and adapts the acoustic model parameters simultaneously to these factors is proposed in this paper. Due to the fact that these factors non-linearly degrade the observed features for speech recognition, existing approaches fail to adapt the acoustic models adequately. Approximating the nonlinear operation by a linear model enables to employ the least square error estimation of the factors and adapt the acoustic model parameters with small amount of speech samples. Speech recognition experiments on ATR isolated word database demonstrate significant reduction of error rates, which supports the effectiveness of the proposed scheme.
URI: http://hdl.handle.net/1842/951
Appears in Collections:CSTR publications

Items in ERA are protected by copyright, with all rights reserved, unless otherwise indicated.

 

Valid XHTML 1.0! DSpace Software Copyright © 2002-2010  Duraspace - Feedback