In conventional speaker recognition method based on MFCC, the phase information has been ignored. In this paper, we proposed a method that integrates the phase information on a speaker recognition method. The speaker identification experiments were performed using NTT database which consists of sentences uttered at normal speed mode by 35 Japanese speakers (22 males and 13 females) on five sessions over ten months. Each speaker uttered only 5 training utterances (about 20 seconds in total). Using the phase information, the speaker recognition error rate was reduced by about 44%.
Bibliographic reference. Nakagawa, Seiichi / Asakawa, Kouhei / Wang, Longbiao (2007): "Speaker recognition by combining MFCC and phase information", In INTERSPEECH-2007, 2005-2008.