ISCA Archive Interspeech 2005
ISCA Archive Interspeech 2005

Environment-independent mask estimation for missing-feature reconstruction

Wooil Kim, Richard M. Stern, Hanseok Ko

In this paper, we propose an effective mask-estimation method for missing-feature reconstruction in order to achieve robust speech recognition in unknown noise environments. In previous work, it was found that training a model for mask estimation on speech corrupted by white noise did not provide environment-independent recognition accuracy. In this paper we describe a training method based on bands of colored noise that is more effective in reflecting spectral variations across neighboring frames and subbands. We also achieved further improvement in recognition accuracy by reconsidering frames that appeared to be unvoiced in the initial pitch analysis. Performance is evaluated using the Aurora 2.0 database in the presence of various types of noise maskers. Experimental results indicate that the proposed methods are effective in estimating masks for missing-feature reconstruction while remaining more independent of the noise conditions.


doi: 10.21437/Interspeech.2005-248

Cite as: Kim, W., Stern, R.M., Ko, H. (2005) Environment-independent mask estimation for missing-feature reconstruction. Proc. Interspeech 2005, 2637-2640, doi: 10.21437/Interspeech.2005-248

@inproceedings{kim05b_interspeech,
  author={Wooil Kim and Richard M. Stern and Hanseok Ko},
  title={{Environment-independent mask estimation for missing-feature reconstruction}},
  year=2005,
  booktitle={Proc. Interspeech 2005},
  pages={2637--2640},
  doi={10.21437/Interspeech.2005-248}
}