A Sparse Spherical Harmonic-Based Model in Subbands for Head-Related Transfer Functions

Xiaoke Qi, Jianhua Tao


Several functional models for head-related transfer function (HRTF) have been proposed based on spherical harmonic (SH) orthogonal functions, which yield an encouraging performance level in terms of log-spectral distortion (LSD). However, since the properties of subbands are quite different and highly subject-dependent, the degree of SH expansion should be adapted to the subband and the subject, which is quite challenging. In this paper, a sparse spherical harmonic-based model termed SSHM is proposed in order to achieve an intelligent frequency truncation. Different from SH-based model (SHM) which assigns the degree for each subband, SSHM constrains the number of SH coefficients by using an l1 penalty, and automatically preserves the significant coefficients in each subband. As a result, SSHM requires less coefficients at the same SD level than other truncation methods to reconstruct HRTFs . Furthermore, when used for interpolation, SSHM gives a better fitting precision since it naturally reduces the influence of the fluctuation caused by the movement of the subject and the processing error. The experiments show that even using about 40% less coefficients, SSHM has a slightly lower LSD than SHM. Therefore, SSHM can achieve a better tradeoff between efficiency and accuracy.


DOI: 10.21437/Interspeech.2016-987

Cite as

Qi, X., Tao, J. (2016) A Sparse Spherical Harmonic-Based Model in Subbands for Head-Related Transfer Functions. Proc. Interspeech 2016, 540-544.

Bibtex
@inproceedings{Qi+2016,
author={Xiaoke Qi and Jianhua Tao},
title={A Sparse Spherical Harmonic-Based Model in Subbands for Head-Related Transfer Functions},
year=2016,
booktitle={Interspeech 2016},
doi={10.21437/Interspeech.2016-987},
url={http://dx.doi.org/10.21437/Interspeech.2016-987},
pages={540--544}
}