INTERSPEECH 2009
10th Annual Conference of the International Speech Communication Association

Brighton, United Kingdom
September 6-10, 2009

Performance Comparison of HMM and VQ Based Single Channel Speech Separation

M. H. Radfar (1), W. -Y. Chan (2), R. M. Dansereau (3), W. Wong (1)

(1) University of Toronto, Canada
(2) Queen's University, Canada
(3) Carleton University, Canada

In this paper, single channel speech separation (SCSS) techniques based on hidden Markov models (HMM) and vector quantization (VQ) are described and compared in terms of (a) signal-to-noise ratio (SNR) between separated and original speech signals, (b) preference of listeners, and (c) computational complexity. The SNR results show that the HMM-based technique marginally outperforms the VQ-based technique by 0.85 dB in experiments conducted on mixtures of female-female, male-male, and male-female speakers. Subjective tests show that listeners prefer HMM over VQ for 86.70% of test speech files. This improvement, however, is at the expense of a drastic increase in computational complexity when compared with the VQ-based technique.

Full Paper

Bibliographic reference.  Radfar, M. H. / Chan, W. -Y. / Dansereau, R. M. / Wong, W. (2009): "Performance comparison of HMM and VQ based single channel speech separation", In INTERSPEECH-2009, 1951-1954.