ISCA Archive Interspeech 2013
ISCA Archive Interspeech 2013

Merging human and automatic system decisions to improve speaker recognition performance

Rosa González Hautamäki, Ville Hautamäki, Padmanabhan Rajan, Tomi Kinnunen

Human judgment is the final authority in forensic speaker recognition, but the use of modern speaker verification systems with accurate algorithms to perform the task under various circumstances has a huge potential to help the expert. The ultimate goal is to improve the accuracy of automatic systems when challenging data is provided and find a methodology for human-aided speaker recognition systems. This work presents an evaluation of speaker recognition carried out by human listeners and a gender dependent i-vector recognizer with a strategy for fusion of the decision process. Our experiments with HASR 2010 and HASR 2012 data indicate complementarity in the performance of the automatic system and the naive listeners decisions.


doi: 10.21437/Interspeech.2013-422

Cite as: Hautamäki, R.G., Hautamäki, V., Rajan, P., Kinnunen, T. (2013) Merging human and automatic system decisions to improve speaker recognition performance. Proc. Interspeech 2013, 2519-2523, doi: 10.21437/Interspeech.2013-422

@inproceedings{hautamaki13c_interspeech,
  author={Rosa González Hautamäki and Ville Hautamäki and Padmanabhan Rajan and Tomi Kinnunen},
  title={{Merging human and automatic system decisions to improve speaker recognition performance}},
  year=2013,
  booktitle={Proc. Interspeech 2013},
  pages={2519--2523},
  doi={10.21437/Interspeech.2013-422}
}