Sparseness-Aware DOA Estimation with Majorization Minimization

Masahito Togami, Robin Scheibler


We propose a direction-of-arrival (DOA) estimation technique which assumes that speech sources are sufficiently sparse and there is only one active speech source at each time-frequency (T-F) point. The proposed method estimates the DOA of the active speech source at each T-F point. A typical way for DOA estimation is based on grid-searching for all possible directions. However, computational cost of grid-searching is proportional to the resolution of search area. Instead of accurate grid-searching, the proposed method adopts rough grid-searching followed by an iterative parameter optimization based on Majorization-Minimization (MM) algorithm. We propose a parameter optimization method which guarantees a monotonical increase of the objective function. Experimental results show that the proposed method estimates DOAs of speech sources more accurately than conventional DOA estimation methods when computational cost of each method is almost the same.


 DOI: 10.21437/Interspeech.2020-1168

Cite as: Togami, M., Scheibler, R. (2020) Sparseness-Aware DOA Estimation with Majorization Minimization. Proc. Interspeech 2020, 5046-5050, DOI: 10.21437/Interspeech.2020-1168.


@inproceedings{Togami2020,
  author={Masahito Togami and Robin Scheibler},
  title={{Sparseness-Aware DOA Estimation with Majorization Minimization}},
  year=2020,
  booktitle={Proc. Interspeech 2020},
  pages={5046--5050},
  doi={10.21437/Interspeech.2020-1168},
  url={http://dx.doi.org/10.21437/Interspeech.2020-1168}
}