Earlier work in parametric modeling of distortions for robust speech recognition has focussed on estimating the distortion parameter using maximum likelihood and other techniques as a point in the parameter space, and treating this estimate as if it is the true value in a plug-in maximum a posteriori(MAP) decoder. This approach is deficient in most real environments where, due to many reasons, the value of the distortion parameter varies significantly. In this paper we introduce an approach which combines the power of parametric transformation and Bayesian prediction to solve this problem. Instead of approximating the distortion parameter with a point estimate, we average over its variation, thus taking into consideration the distribution of the parameter as well. This approach provides more robust performance than the conventional maximum-likelihood approach. It also provides the solution that minimizes the overall error given the distribution of the parameter. We present results to demonstrate the robustness and effectiveness of the predictive approach.
Cite as: Surendran, A.C., Lee, C.-H. (1998) Predictive adaptation and compensation for robust speech recognition. Proc. 5th International Conference on Spoken Language Processing (ICSLP 1998), paper 0859, doi: 10.21437/ICSLP.1998-306
@inproceedings{surendran98_icslp, author={Arun C. Surendran and Chin-Hui Lee}, title={{Predictive adaptation and compensation for robust speech recognition}}, year=1998, booktitle={Proc. 5th International Conference on Spoken Language Processing (ICSLP 1998)}, pages={paper 0859}, doi={10.21437/ICSLP.1998-306} }