|
Edinburgh Research Archive >
Centre for Speech Technology Research >
CSTR publications >
Please use this identifier to cite or link to this item:
http://hdl.handle.net/1842/951
|
| Title: | Jacobian Joint Adaptation to Noise, Channel and Vocal Tract Length |
| Authors: | Shimodaira, Hiroshi Sakai, Nobuyoshi Nakai, Mitsuru Sagayama, Shigeki |
| Issue Date: | May-2002 |
| Citation: | Proc. IEEE ICASSP2002, pp.197-200 |
| Publisher: | IEEE |
| Abstract: | A new Jacobian approach that linearly decomposes the composite of additive noise, multiplicative noise (channel transfer function) and speaker's vocal tract length, and adapts the acoustic model parameters simultaneously to these factors is proposed in this paper. Due to the fact that these factors non-linearly degrade the observed features for speech recognition, existing approaches fail to adapt the acoustic models adequately. Approximating the nonlinear operation by a linear model enables to employ the least square error estimation of the factors and adapt the acoustic model parameters with small amount of speech samples. Speech recognition experiments on ATR isolated word database demonstrate significant reduction of error rates, which supports the effectiveness of the proposed scheme. |
| URI: | http://hdl.handle.net/1842/951 |
| Appears in Collections: | CSTR publications
|
Items in ERA are protected by copyright, with all rights reserved, unless otherwise indicated.
|