ISCA Archive Interspeech 2005

Evaluation and optimization of noise robust front-end technologies for the automatic recognition of Hungarian telephone speech

Péter Mihajlik, Zoltán Tobler, Zoltán Tüske, Géza Gordos

In this paper, a variety of front-end configurations are evaluated on Hungarian telephone speech databases. Our aim was to measure the efficiency of the front-ends directly on real noisy and normal speech data. The ETSI ADSR standard front-end is used as the baseline. Some simplification of the standard is introduced, resulting in better performance on our databases than the original front-end in terms of both speed and recognition rate. In addition, another recently proposed feature extraction approach is investigated. Finally, the effect of a novel voice activity detection approach is evaluated. The best front-end configuration, augmented with this voice activity detector, significantly outperformed the baseline in every recognition test, by 24.7% relative on average.


doi: 10.21437/Interspeech.2005-258

Cite as: Mihajlik, P., Tobler, Z., Tüske, Z., Gordos, G. (2005) Evaluation and optimization of noise robust front-end technologies for the automatic recognition of Hungarian telephone speech. Proc. Interspeech 2005, 2677-2680, doi: 10.21437/Interspeech.2005-258

@inproceedings{mihajlik05_interspeech,
  author={Péter Mihajlik and Zoltán Tobler and Zoltán Tüske and Géza Gordos},
  title={{Evaluation and optimization of noise robust front-end technologies for the automatic recognition of Hungarian telephone speech}},
  year=2005,
  booktitle={Proc. Interspeech 2005},
  pages={2677--2680},
  doi={10.21437/Interspeech.2005-258}
}