Multiple Sound Source Localization with SVD-PHAT

Fran├žois Grondin, James Glass


This paper introduces a modification of phase transform on singular value decomposition (SVD-PHAT) to localize multiple sound sources. This work aims to improve localization accuracy and keeps the algorithm complexity low for real-time applications. This method relies on multiple scans of the search space, with projection of each low-dimensional observation onto orthogonal subspaces. We show that this method localizes multiple sound sources more accurately than discrete SRP-PHAT, with a reduction in the Root Mean Square Error up to 0.0395 radians.


 DOI: 10.21437/Interspeech.2019-2653

Cite as: Grondin, F., Glass, J. (2019) Multiple Sound Source Localization with SVD-PHAT. Proc. Interspeech 2019, 2698-2702, DOI: 10.21437/Interspeech.2019-2653.


@inproceedings{Grondin2019,
  author={Fran├žois Grondin and James Glass},
  title={{Multiple Sound Source Localization with SVD-PHAT}},
  year=2019,
  booktitle={Proc. Interspeech 2019},
  pages={2698--2702},
  doi={10.21437/Interspeech.2019-2653},
  url={http://dx.doi.org/10.21437/Interspeech.2019-2653}
}