ISCA Archive Interspeech 2017
ISCA Archive Interspeech 2017

A Domain Knowledge-Assisted Nonlinear Model for Head-Related Transfer Functions Based on Bottleneck Deep Neural Network

Xiaoke Qi, Jianhua Tao

Many methods have been proposed for modeling head-related transfer functions (HRTFs) and yield a good performance level in terms of log-spectral distortion (LSD). However, most of them utilize linear weighting to reconstruct or interpolate HRTFs, but not consider the inherent nonlinearity relationship between the basis function and HRTFs. Motivated by this, a domain knowledge-assisted nonlinear modeling method is proposed based on bottleneck features. Domain knowledge is used in two aspects. One is to generate the input features derived from the solution to sound wave propagation equation at the physical level, and the other is to design the loss function for model training based on the knowledge of objective evaluation criterion, i.e., LSD. Furthermore, with utilizing the strong representation ability of the bottleneck features, the nonlinear model has the potential to achieve a more accurate mapping. The objective and subjective experimental results show that the proposed method gains less LSD when compared with linear model, and the interpolated HRTFs can generate a similar perception to those of the database.


doi: 10.21437/Interspeech.2017-222

Cite as: Qi, X., Tao, J. (2017) A Domain Knowledge-Assisted Nonlinear Model for Head-Related Transfer Functions Based on Bottleneck Deep Neural Network. Proc. Interspeech 2017, 3058-3062, doi: 10.21437/Interspeech.2017-222

@inproceedings{qi17_interspeech,
  author={Xiaoke Qi and Jianhua Tao},
  title={{A Domain Knowledge-Assisted Nonlinear Model for Head-Related Transfer Functions Based on Bottleneck Deep Neural Network}},
  year=2017,
  booktitle={Proc. Interspeech 2017},
  pages={3058--3062},
  doi={10.21437/Interspeech.2017-222}
}