8th Annual Conference of the International Speech Communication Association

Antwerp, Belgium
August 27-31, 2007

Audio-Based Approaches to Head Orientation Estimation in a Smart-Room

Alberto Abad, Carlos Segura, Climent Nadeu, Javier Hernando

Universitat Politècnica de Catalunya, Spain

The head orientation of a speaker in a smart-room affects the quality of the signals recorded by far-field microphones and, consequently, the performance of the technologies that rely on those signals. Knowing the orientation is also useful for developing advanced multimodal services, for instance in microphone network management. As a result, head orientation estimation has recently become a topic of growing research interest. In this paper, we propose two approaches to head orientation estimation from multi-microphone recordings: first, an approach based on a generalization of the well-known SRP-PHAT speaker localization algorithm and, second, a new approach based on measurements of the ratio between the high-band and low-band speech energies. Promising results are obtained in both cases, with the algorithms based on speaker localization methods generally performing better.
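
The second idea, comparing high-band and low-band speech energy across microphones, can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes that speech radiates more directionally at high frequencies, so that the microphone the speaker faces shows the largest high/low ratio. The band edges (`min_hz`, `split_hz`, `max_hz`), the function names, and the pick-the-maximum decision rule are all hypothetical choices made for illustration.

```python
import cmath
import math


def band_energy(signal, sample_rate, f_lo, f_hi):
    """Energy of `signal` in [f_lo, f_hi) Hz, using a naive DFT.

    The naive DFT is O(n^2); it is used here only to keep the sketch
    dependency-free and is adequate for short frames.
    """
    n = len(signal)
    energy = 0.0
    for k in range(n // 2 + 1):
        freq = k * sample_rate / n
        if f_lo <= freq < f_hi:
            coeff = sum(
                x * cmath.exp(-2j * math.pi * k * t / n)
                for t, x in enumerate(signal)
            )
            energy += abs(coeff) ** 2
    return energy


def high_low_ratio(signal, sample_rate,
                   min_hz=200.0, split_hz=1000.0, max_hz=4000.0):
    """Ratio of high-band to low-band energy (band edges are assumptions)."""
    low = band_energy(signal, sample_rate, min_hz, split_hz)
    high = band_energy(signal, sample_rate, split_hz, max_hz)
    return high / low if low > 0 else float("inf")


def estimate_orientation(frames, mic_angles_deg, sample_rate):
    """Return the angle of the microphone with the largest ratio.

    `frames` holds one time-aligned frame per microphone, and
    `mic_angles_deg` gives each microphone's bearing as seen from the
    speaker position (assumed known here).
    """
    ratios = [high_low_ratio(f, sample_rate) for f in frames]
    best = max(range(len(ratios)), key=ratios.__getitem__)
    return mic_angles_deg[best], ratios
```

Calling `estimate_orientation(frames, [0.0, 90.0, 180.0, 270.0], 16000)` with four frames would return the bearing of the microphone whose frame has the highest ratio, together with all four ratios. A practical system would likely average over many frames and interpolate between microphones instead of picking a single one.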

Bibliographic reference. Abad, Alberto / Segura, Carlos / Nadeu, Climent / Hernando, Javier (2007): "Audio-based approaches to head orientation estimation in a smart-room", in INTERSPEECH-2007, 590-593.