Double Joint Bayesian Modeling of DNN Local I-Vector for Text Dependent Speaker Verification with Random Digit Strings

Ziqiang Shi, Huibin Lin, Liu Liu, Rujie Liu


Double joint Bayesian is a recently introduced analysis method that models and explores multiple information explicitly from the samples to improve the verification performance. It was recently applied to voice pass phrase verification, result in better results on text dependent speaker verification task. However little is known about its effectiveness in other challenging situations such as speaker verification for short, text-constrained test utterances, e.g. random digit strings. Contrary to conventional joint Bayesian method that cannot make full use of multi-view information, double joint Bayesian can incorporate both intra-speaker/digit and inter-speaker/digit variation and calculated the likelihood to describe whether the features having all labels consistent or not. We show that double joint Bayesian outperforms conventional method on modeling DNN local (digit-dependent) i-vectors for speaker verification with random prompted digit strings. Since the strength of both double joint Bayesian and conventional DNN local i-vector appear complementary, the combination significantly outperforms either of its components.


 DOI: 10.21437/Interspeech.2018-1103

Cite as: Shi, Z., Lin, H., Liu, L., Liu, R. (2018) Double Joint Bayesian Modeling of DNN Local I-Vector for Text Dependent Speaker Verification with Random Digit Strings. Proc. Interspeech 2018, 67-71, DOI: 10.21437/Interspeech.2018-1103.


@inproceedings{Shi2018,
  author={Ziqiang Shi and Huibin Lin and Liu Liu and Rujie Liu},
  title={Double Joint Bayesian Modeling of DNN Local I-Vector for Text Dependent Speaker Verification with Random Digit Strings},
  year=2018,
  booktitle={Proc. Interspeech 2018},
  pages={67--71},
  doi={10.21437/Interspeech.2018-1103},
  url={http://dx.doi.org/10.21437/Interspeech.2018-1103}
}