Analysis of the Effect of Speech-Laugh on Speaker Recognition System

Sri Harsha Dumpala, Ashish Panda, Sunil Kumar Kopparapu


A robust speaker recognition system should recognize a speaker despite the many possible variations in that speaker's speech. One common variation of neutral speech is speech-laugh, which occurs when a person speaks and laughs simultaneously. In this paper, we show that speech-laugh significantly degrades the performance of an i-vector based speaker recognition system. Further, we show that laughter and neutral speech contain complementary speaker information, which can be combined to improve the performance of the speaker recognition system in speech-laugh scenarios. Using the AMI meeting corpus, we show that including both neutral speech and laughter in the enrollment phase yields a 36% relative improvement in equal error rate (EER) in speech-laugh scenarios.
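The abstract reports its result as a relative improvement in EER. As a reading aid, the following is a minimal sketch of how an EER and a relative improvement are commonly computed from verification scores. It is not the authors' implementation: the function names `equal_error_rate` and `relative_improvement`, the simple threshold sweep, and all score and EER values are assumptions made for illustration and are not taken from the paper.

```python
import numpy as np


def equal_error_rate(target_scores, nontarget_scores):
    """Approximate the EER by sweeping a threshold over all observed scores.

    Hypothetical helper for illustration; a trial is accepted when its
    score is greater than or equal to the threshold.
    """
    target = np.asarray(target_scores, dtype=float)
    nontarget = np.asarray(nontarget_scores, dtype=float)
    thresholds = np.sort(np.concatenate([target, nontarget]))

    # False rejection rate: fraction of target trials below the threshold.
    frr = np.array([(target < t).mean() for t in thresholds])
    # False acceptance rate: fraction of non-target trials at or above it.
    far = np.array([(nontarget >= t).mean() for t in thresholds])

    # The EER is taken where the two error rates are closest to each other.
    idx = np.argmin(np.abs(far - frr))
    return (far[idx] + frr[idx]) / 2.0


def relative_improvement(baseline_eer, improved_eer):
    """Relative EER reduction in percent, measured against the baseline."""
    return 100.0 * (baseline_eer - improved_eer) / baseline_eer


# Synthetic scores (illustrative only, not from the AMI corpus).
targets = [0.9, 0.8, 0.3]
nontargets = [0.1, 0.2, 0.85]
eer = equal_error_rate(targets, nontargets)

# Hypothetical baseline and improved EERs, chosen only to show the arithmetic.
gain = relative_improvement(10.0, 6.4)
```

Under this definition, a relative improvement of 36% means the EER falls to 64% of its baseline value; it is not a drop of 36 absolute percentage points.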


DOI: 10.21437/Interspeech.2018-2090

Cite as: Dumpala, S.H., Panda, A., Kopparapu, S.K. (2018) Analysis of the Effect of Speech-Laugh on Speaker Recognition System. Proc. Interspeech 2018, 1751-1755, DOI: 10.21437/Interspeech.2018-2090.


@inproceedings{Dumpala2018,
  author={Sri Harsha Dumpala and Ashish Panda and Sunil Kumar Kopparapu},
  title={Analysis of the Effect of Speech-Laugh on Speaker Recognition System},
  year=2018,
  booktitle={Proc. Interspeech 2018},
  pages={1751--1755},
  doi={10.21437/Interspeech.2018-2090},
  url={http://dx.doi.org/10.21437/Interspeech.2018-2090}
}