ISCA Archive Interspeech 2009
ISCA Archive Interspeech 2009

Estimating the position and orientation of an acoustic source with a microphone array network

Alberto Yoshihiro Nakano, Seiichi Nakagawa, Kazumasa Yamamoto

We propose a method that finds the position and orientation of an acoustic source in an enclosed environment. For each of eight T-shaped arrays forming a microphone array network, the time delay of arrival (TDOA) of signals from microphone pairs, a source position candidate, and energy related features are estimated. These form the input for artificial neural networks (ANNs), the purpose of which is to provide indirectly a more precise position of the source and, additionally, to estimate the source’s orientation using various combinations of the estimated parameters. The best combination of parameters (TDOAs and microphone positions) yields a 21.8% reduction in the mean average position error compared to baselines, and a correct orientation ratio higher than 99.0%. The position estimation baselines include two estimation methods: a TDOA-based method that finds the source position geometrically, and the SRP-PHAT that finds the most likely source position by spatial exploration.


doi: 10.21437/Interspeech.2009-329

Cite as: Nakano, A.Y., Nakagawa, S., Yamamoto, K. (2009) Estimating the position and orientation of an acoustic source with a microphone array network. Proc. Interspeech 2009, 1127-1130, doi: 10.21437/Interspeech.2009-329

@inproceedings{nakano09_interspeech,
  author={Alberto Yoshihiro Nakano and Seiichi Nakagawa and Kazumasa Yamamoto},
  title={{Estimating the position and orientation of an acoustic source with a microphone array network}},
  year=2009,
  booktitle={Proc. Interspeech 2009},
  pages={1127--1130},
  doi={10.21437/Interspeech.2009-329}
}