13th Annual Conference of the International Speech Communication Association

Portland, OR, USA
September 9-13, 2012

A Fast-Converging Adaptive Frequency-Domain MVDR Beamformer for Speech Enhancement

Shengkui Zhao (1), Douglas L. Jones (2)

(1) Advanced Digital Sciences Center, Illinois at Singapore, Singapore
(2) Department of Electrical and Computer Engineering, University of Illinois at Urbana-Champaign, Illinois, USA

In this paper, we present a fast-converging adaptive frequency-domain minimum-variance-distortionless-response (MVDR) beamformer (FMV) for speech enhancement. The well-known FMV solution is optimum in the microphone array processing. However, the direct computation of the optimum FMV solution is often undesirable due to the the inversion of the spatio-spectral correlation matrix which is often unstable and is expensive for large arrays. To avoid the matrix inversion, we develop a fast-converging conjugate gradient (CG) algorithm for iteratively computing the FMV solution. Compared to the existing steepest descent (SD) algorithm, the CG algorithm can dramatically improve the convergence speed for the case of multiple interfering signals in speech enhancement. Therefore, the computational load and processing time can be significantly reduced. The speech enhancement experiments using a four-channel acoustic-vector-sensor (AVS) microphone array are demonstrated for the target speech signal corrupted by two and five interfering speech signals and superior performance are achieved.

Index Terms: speech enhancement, microphone arrays, correlation, convergence, adaptive signal processing

Full Paper

Bibliographic reference.  Zhao, Shengkui / Jones, Douglas L. (2012): "A fast-converging adaptive frequency-domain MVDR beamformer for speech enhancement", In INTERSPEECH-2012, 1930-1933.