14thAnnual Conference of the International Speech Communication Association

Lyon, France
August 25-29, 2013

Standoff Speaker Recognition: Effects of Recording Distance Mismatch on Speaker Recognition System Performance

Mike Fowler (1), Mark McCurry (2), Jonathan Bramsen (1), Kehinde Dunsin (1), Jeremiah Remus (1)

(1) Clarkson University, USA
(2) Georgia Institute of Technology, USA

Speech can potentially be used to identify individuals from a distance and contribute to the growing effort to develop methods for standoff, multimodal biometric identification. However, mismatched recording distances for the enrollment and verification speech samples can potentially introduce new challenges for speaker recognition systems. This paper describes a data collection, referred to as the Standoff Multi-Microphone Speech Corpus, which allows investigation of the impact of recording distance mismatch on the performance of speaker recognition systems. Additionally, a supervised method for linear subspace decomposition was evaluated in an effort to mitigate the effects of recording distance mismatch. The results of this study indicate that mismatched recording distances have a consistent negative impact on performance of a standoff speaker recognition system; however, subspace decomposition techniques may be able to reduce the penalty observed with mismatched recording distances.

Full Paper

Bibliographic reference.  Fowler, Mike / McCurry, Mark / Bramsen, Jonathan / Dunsin, Kehinde / Remus, Jeremiah (2013): "Standoff speaker recognition: effects of recording distance mismatch on speaker recognition system performance", In INTERSPEECH-2013, 3713-3716.