In this paper, a fast computational method is proposed to approximate the Karhunen-Loève transform (KLT) in signal-subspace-based speech enhancement algorithms. The discrete cosine transform (DCT) is shown to be a good approximation of the KLT for the covariance matrix of an autoregressive process of order p (AR(p)). A fast algorithm is developed that reduces the cost of computing the eigenvalues of an N × N symmetric Toeplitz matrix from O(N^3) with the KLT to O(N^2). Experimental results demonstrate that, for robust speech recognition in a car environment, the fast algorithm performs very close to the KLT-based method while significantly reducing computation time. An acoustic normalization scheme is also found to be useful for compensating the mismatch between training and test conditions, which further improves recognition performance.
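The claim that the DCT approximates the KLT for an AR covariance matrix can be checked numerically. The following is a minimal sketch, not the paper's fast algorithm: it assumes the simplest case, a unit-variance AR(1) process with coefficient `rho`, and compares the exact eigenvalues of its Toeplitz covariance matrix with the diagonal of that matrix in the DCT basis. The function names, `n = 32`, and `rho = 0.9` are illustrative choices, and the naive matrix products used here do not achieve the O(N^2) cost reported in the abstract.

```python
import numpy as np


def ar1_covariance(n, rho):
    """Symmetric Toeplitz covariance of a unit-variance AR(1) process.

    Assumption for illustration: R[i, j] = rho ** |i - j|.
    """
    idx = np.arange(n)
    return rho ** np.abs(idx[:, None] - idx[None, :])


def dct2_matrix(n):
    """Orthonormal DCT-II matrix; row k holds the k-th cosine basis vector."""
    k = np.arange(n)[:, None]
    m = np.arange(n)[None, :]
    c = np.sqrt(2.0 / n) * np.cos(np.pi * (2 * m + 1) * k / (2 * n))
    c[0, :] /= np.sqrt(2.0)  # rescale the DC row so that C @ C.T is the identity
    return c


def dct_eigenvalue_estimates(r):
    """Diagonal of C R C^T, used as stand-ins for the eigenvalues of R."""
    c = dct2_matrix(r.shape[0])
    return np.einsum("ki,ij,kj->k", c, r, c)


n, rho = 32, 0.9  # hypothetical frame length and AR(1) coefficient
r = ar1_covariance(n, rho)

# Exact KLT eigenvalues (full eigendecomposition) versus DCT-based estimates.
exact = np.sort(np.linalg.eigvalsh(r))[::-1]
approx = np.sort(dct_eigenvalue_estimates(r))[::-1]

rel_err = np.linalg.norm(exact - approx) / np.linalg.norm(exact)
print(f"relative eigenvalue error: {rel_err:.3e}")
```

Because the DCT matrix is orthonormal, the estimates always sum to the trace of the covariance matrix and lie between its smallest and largest eigenvalues. How closely they track the individual eigenvalues depends on the process parameters.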
Cite as: Huang, J., Zhao, Y., Levinson, S. (1999) A DCT-based fast enhancement technique for robust speech recognition in automobile usage. Proc. 6th European Conference on Speech Communication and Technology (Eurospeech 1999), 1947-1950, doi: 10.21437/Eurospeech.1999-428
@inproceedings{huang99d_eurospeech, author={Jun Huang and Yunxin Zhao and Stephen Levinson}, title={{A DCT-based fast enhancement technique for robust speech recognition in automobile usage}}, year=1999, booktitle={Proc. 6th European Conference on Speech Communication and Technology (Eurospeech 1999)}, pages={1947--1950}, doi={10.21437/Eurospeech.1999-428} }