ISCA Archive Interspeech 2009

Accounting for the uncertainty of speech estimates in the complex domain for minimum mean square error speech enhancement

Ramón Fernandez Astudillo, Dorothea Kolossa, Reinhold Orglmeister

Uncertainty decoding and uncertainty propagation (or error propagation) techniques have emerged as powerful tools for increasing the accuracy of automatic speech recognition systems by employing an uncertain, or probabilistic, description of the speech features rather than the usual point estimate. In this paper, we analyze the uncertainty generated in the complex Fourier domain when performing speech enhancement with the Wiener or Ephraim-Malah filters. We derive closed-form solutions for computing the estimation error and show that this gives better insight into the origin of estimation uncertainty. We also show that combining such an error estimate with uncertainty propagation and either uncertainty decoding or modified imputation yields superior recognition robustness compared to conventional MMSE estimators, with little increase in computational cost.
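
As an illustration of what an uncertain description of a Fourier coefficient can look like, the sketch below computes the Wiener estimate of one complex coefficient together with its posterior variance. It is not taken from the paper. It assumes the textbook model in which speech and noise are independent, zero-mean, circular complex Gaussians with known variances, so that the posterior of the clean coefficient is complex Gaussian with mean G·Y and variance G·λ_d, where G = λ_x / (λ_x + λ_d). The function name `wiener_posterior` and its parameters are hypothetical, and the Ephraim-Malah case is not covered.

```python
def wiener_posterior(y, speech_var, noise_var):
    """Posterior mean and variance of a clean coefficient X given Y = X + D.

    Assumption (not from the paper): X and D are independent, zero-mean,
    circular complex Gaussians with variances speech_var and noise_var.
    """
    if speech_var < 0 or noise_var <= 0:
        raise ValueError("speech_var must be >= 0 and noise_var must be > 0")
    # Wiener gain: fraction of the observed power attributed to speech.
    gain = speech_var / (speech_var + noise_var)
    # Point estimate (the usual MMSE output of the Wiener filter).
    mean = gain * y
    # Residual uncertainty of that estimate: the posterior variance.
    variance = gain * noise_var
    return mean, variance


# Hypothetical values for one time-frequency bin.
mean, variance = wiener_posterior(complex(1.0, -2.0), speech_var=3.0, noise_var=1.0)
print(mean, variance)
```

Under these assumptions the variance depends only on the two variances, not on the observation itself. It shrinks toward zero when either the speech or the noise variance is small relative to the other, and it is largest when the two are comparable.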


doi: 10.21437/Interspeech.2009-371

Cite as: Astudillo, R.F., Kolossa, D., Orglmeister, R. (2009) Accounting for the uncertainty of speech estimates in the complex domain for minimum mean square error speech enhancement. Proc. Interspeech 2009, 2491-2494, doi: 10.21437/Interspeech.2009-371

@inproceedings{astudillo09_interspeech,
  author={Ramón Fernandez Astudillo and Dorothea Kolossa and Reinhold Orglmeister},
  title={{Accounting for the uncertainty of speech estimates in the complex domain for minimum mean square error speech enhancement}},
  year=2009,
  booktitle={Proc. Interspeech 2009},
  pages={2491--2494},
  doi={10.21437/Interspeech.2009-371}
}