Robust Speech Recognition for Unknown Communication Channels

Pont-à-Mousson, France
April 17-18, 1997

Automatic Segmentation and Labeling of Speech Recorded in Unknown Noisy Channel Environments

Bryan L. Pellom, John H. L. Hansen

Robust Speech Processing Laboratory, Department of Electrical and Computer Engineering, Duke University, Durham, NC, USA

This paper investigates the problem of automatic segmentation of speech recorded in noisy channel corrupted environments. Several speech enhancement and channel compensation techniques previously proposed for robust speech recognition are evaluated and compared for improved segmentation in colored noise. Speech enhancement algorithms considered include: Generalized Spectral Subtraction, Nonlinear Spectral Subtraction, Ephraim-Malah MMSE enhancement, and AutoLSP Constrained Iterative Wiener filtering. In addition, the coupling of front-end processing such as cepstral mean subtraction in tandem with speech enhancement methods is considered for noisy channel compensation. Compensation performance is assessed for each method using automatic segmentation of the telephone transmitted NTIMIT and cellular telephone transmitted CTIMIT databases.

Full Paper

Bibliographic reference.  Pellom, Bryan L. / Hansen, John H. L. (1997): "Automatic segmentation and labeling of speech recorded in unknown noisy channel environments", In RSR-1997, 167-170.