In this paper, the implementation of a robust front-end to be used for a large-vocabulary Continuous Speech Recognition (CSR) system based on a Voiced-Unvoiced (V-U) decision has been addressed. Our approach is based on the separation of the speech signal into voiced and unvoiced components. Consequently, speech enhancement can be achieved through processing of the voiced and the unvoiced components separately. Enhancement of the voiced component is performed using an adaptive comb filtering, whereas the unvoiced component is enhanced using the modified spectral subtraction approach. We proved via experiments that the proposed CSR system is robust in additive noisy environments (SNR down to 0 dB).
Cite as: Tolba, H., O'Shaughnessy, D. (1998) Robust automatic continuous-speech recognition based on a voiced-unvoiced decision. Proc. 5th International Conference on Spoken Language Processing (ICSLP 1998), paper 0342, doi: 10.21437/ICSLP.1998-669
@inproceedings{tolba98c_icslp, author={Hesham Tolba and Douglas O'Shaughnessy}, title={{Robust automatic continuous-speech recognition based on a voiced-unvoiced decision}}, year=1998, booktitle={Proc. 5th International Conference on Spoken Language Processing (ICSLP 1998)}, pages={paper 0342}, doi={10.21437/ICSLP.1998-669} }