Third International Conference on Spoken Language Processing (ICSLP 94)

Yokohama, Japan
September 18-22, 1994

Robust Signal Preprocessing for HMM Speech Recognition in Adverse Conditions

Jean-Baptiste Puel, Regine André-Obrecht

IRIT, URA CNRS 1399, Université Paul Sabatier, Toulouse, France

The detection of speech endpoints is a strategic process for speech recognition systems in adverse conditions, but it remains a rather delicate problem. We introduce two signal processing methods that offer a good robustness without requiring high level informations about the signal. The first approach uses temporal parameters, the other frequential ones. We discuss and compare their performances using the ARS ESPRIT database (isolated words pronounced in a car). We show that these methods coupled with a statistical segmentation offer very good discrimination between noisy segments and speech segments, and a better precision for locating the speech boundaries. The preprocessing is introduced in a HMM speech recognition system.

Full Paper

Bibliographic reference.  Puel, Jean-Baptiste / André-Obrecht, Regine (1994): "Robust signal preprocessing for HMM speech recognition in adverse conditions", In ICSLP-1994, 259-262.