ISCA Archive ICSLP 2000
ISCA Archive ICSLP 2000

Forward masking on a generalized logarithmic scale for robust speech recognition

Yoshihiro Ito, Hiroshi Matsumoto, Kazumasa Yamamoto

This paper examines the forward masking on the generalized logarithmic scale for robust speech recognition to both additive and convolutional noise. The forward masking in the dynamic cepstral (DyC) representation is based upon subtraction of a masking pattern from a current spectrum on a logarithmic spectral domain, whereas the proposed method intends to make a compromise between the logarithmic and linear spectral domains by choosing an appropriate value of the power. This technique is incorporated into a modified MFCC-based frontend. The connected- digit recognition tests showed that in noisy conditions this technique outperforms the conventional techniques such as the DyC, the continuous spectral subtraction method, the cepstral mean subtraction while maintaining the robustness to the convolutional noise.


Cite as: Ito, Y., Matsumoto, H., Yamamoto, K. (2000) Forward masking on a generalized logarithmic scale for robust speech recognition. Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000), vol. 3, 530-533

@inproceedings{ito00b_icslp,
  author={Yoshihiro Ito and Hiroshi Matsumoto and Kazumasa Yamamoto},
  title={{Forward masking on a generalized logarithmic scale for robust speech recognition}},
  year=2000,
  booktitle={Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000)},
  pages={vol. 3, 530-533}
}