Interspeech'2005 - Eurospeech

Lisbon, Portugal
September 4-8, 2005

A Graphical Model for Multi-Sensory Speech Processing in Air-and-Bone Conductive Microphones

Amarnag Subramanya (1), Zhengyou Zhang (2), Zicheng Liu (2), Jasha Droppo (2), Alex Acero (2)

(1) University of Washington, USA; (2) Microsoft Research, Redmond, WA, USA

In continuation of our previous work on using an air-and-boneconductive microphone for speech enhancement, in this paper we propose a graphical model based approach to estimating the clean speech signal given the noisy observations in the air sensor. We also show how the same model can be used as a speech/nonspeech classifier. With the aid of MOS (mean opinion score) tests we show, that the performance of the proposed model is better in comparison to our previously proposed direct filtering algorithm.

Full Paper

Bibliographic reference.  Subramanya, Amarnag / Zhang, Zhengyou / Liu, Zicheng / Droppo, Jasha / Acero, Alex (2005): "A graphical model for multi-sensory speech processing in air-and-bone conductive microphones", In INTERSPEECH-2005, 2361-2364.