14thAnnual Conference of the International Speech Communication Association

Lyon, France
August 25-29, 2013

I-Vectors Meet Imitators: On Vulnerability of Speaker Verification Systems Against Voice Mimicry

Rosa González Hautamäki (1), Tomi Kinnunen (1), Ville Hautamäki (1), Timo Leino (2), Anne-Maria Laukkanen (2)

(1) University of Eastern Finland, Finland
(2) University of Tampere, Finland

Voice imitation is mimicry of another speaker's voice characteristics and speech behavior. Professional voice mimicry can create entertaining, yet realistic sounding target speaker renditions. As mimicry tends to exaggerate prosodic, idiosyncratic and lexical behavior, it is unclear how modern spectral-feature automatic speaker verification systems respond to mimicry "attacks". We study the vulnerability of two well-known speaker recognition systems, traditional Gaussian mixture model . universal background model (GMM-UBM) and a state-of-the-art i-vector classifier with cosine scoring. The material consists of one professional Finnish imitator impersonating five well-known Finnish public figures. In a carefully controlled setting, mimicry attack does slightly increase the false acceptance rate for the i-vector system, but generally this is not alarmingly large in comparison to voice conversion or playback attacks.

Full Paper

Bibliographic reference.  Hautamäki, Rosa González / Kinnunen, Tomi / Hautamäki, Ville / Leino, Timo / Laukkanen, Anne-Maria (2013): "I-vectors meet imitators: on vulnerability of speaker verification systems against voice mimicry", In INTERSPEECH-2013, 930-934.