ISCA Archive ICSLP 2000
ISCA Archive ICSLP 2000

A novel feature extraction using multiple acoustic feature planes for HMM-based speech recognition

Tsuneo Nitta, Masashi Takigawa, Takashi Fukuda

This paper describes an attempt to extract multiple peripheral features of a point x(ti,fj) on a timespectrum (TS) pattern by observing n´n neighborhoods of the point, and to incorporate these peripheral features (MPFPs: multiple peripheral feature planes) into the feature extractor of a speech recognition system together with MFCC parameters. Two types of peripheral feature extractor, MPFP-KL and MPFP-LR, are proposed. MPFP-KL adopts the orthogonal bases extracted directly from speech data by using KLT of 7x7 blocks on TS patterns. In MPFP-LR, the upper two primal bases are selected and simplified in the form of Δt-operator and Δfoperator obtained by linear regression calculation. MPFP-KL and MPFP-LR show significant improvements in comparison with the standard MFCC feature extractor in experiments with the HMM-based ASR system.


Cite as: Nitta, T., Takigawa, M., Fukuda, T. (2000) A novel feature extraction using multiple acoustic feature planes for HMM-based speech recognition. Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000), vol. 1, 385-388

@inproceedings{nitta00_icslp,
  author={Tsuneo Nitta and Masashi Takigawa and Takashi Fukuda},
  title={{A novel feature extraction using multiple acoustic feature planes for HMM-based speech recognition}},
  year=2000,
  booktitle={Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000)},
  pages={vol. 1, 385-388}
}