Sixth International Conference on Spoken Language Processing
(ICSLP 2000)

Beijing, China
October 16-20, 2000

Combination of Temporal Trajectory Filtering and Projection Measure for Robust Speaker Identification

Kuo-Hwei Yuo (1), Tai-Hwei Hwang (2), Hsiao-Chuan Wang (1)

(1) Department of Electrical Engineering, National Tsing Hua University, Hsinchu, Taiwan
(2) E000/Computer & Communication Labs, Industrial Technology Research Institute, Chutong, Hsinchu, Taiwan

This paper presents a method that combines temporal trajectory filtering with the projection measure for robust speaker identification. The proposed robust feature, called Relative Autocorrelation Sequence Mel-scale Frequency Cepstral Coefficients (RAS-MFCC), is derived by filtering the temporal trajectories of short-time one-sided autocorrelation sequences. This filtering process can minimize the effect of additive noise in noisy speech. Since the norm of RAS-MFCC shrinks under noise corruption, the projection measure (PM) technique, which is effective in dealing with the norm shrinkage of the cepstrum, can be applied as the distance measure for RAS-MFCCs. The combination of these two techniques is then applied to a speaker identification task with 100 speakers. Our experiment shows that the RAS-MFCC feature achieves a significant improvement in identification rate compared with MFCC. Combining the RAS-MFCC feature with the PM technique further improves the recognition accuracy.
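
The two ideas in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not specify the temporal filter or the exact form of the projection measure, so the delta-style filter, its half width, and the distance formula below are assumptions, and all function names are hypothetical. The mel-scale cepstral analysis that would turn the filtered sequences into RAS-MFCC is omitted.

```python
def one_sided_autocorr(frame, max_lag):
    """One-sided autocorrelation r[k], k = 0..max_lag, of one speech frame."""
    n = len(frame)
    return [sum(frame[i] * frame[i + k] for i in range(n - k))
            for k in range(max_lag + 1)]


def filter_trajectories(seqs, half_width=2):
    """Filter each lag's trajectory along the frame axis.

    Assumption: a delta-style (regression) FIR filter with edge padding.
    Components that stay constant from frame to frame, such as the
    autocorrelation of stationary additive noise, are mapped to zero.
    """
    num_frames = len(seqs)
    num_lags = len(seqs[0])
    norm = sum(tau * tau for tau in range(-half_width, half_width + 1))
    out = []
    for t in range(num_frames):
        row = []
        for k in range(num_lags):
            acc = 0.0
            for tau in range(-half_width, half_width + 1):
                idx = min(max(t + tau, 0), num_frames - 1)  # repeat edge frames
                acc += tau * seqs[idx][k]
            row.append(acc / norm)
        out.append(row)
    return out


def projection_distance(test, ref):
    """Distance that ignores norm shrinkage of the test vector.

    Assumption: one common form of the projection measure, the minimum
    over a scale factor s of ||test - s * ref||^2, which equals
    ||test||^2 - (test . ref)^2 / ||ref||^2.
    """
    dot = sum(a * b for a, b in zip(test, ref))
    ref_norm2 = sum(b * b for b in ref)
    test_norm2 = sum(a * a for a in test)
    if ref_norm2 == 0.0:
        return test_norm2
    return test_norm2 - dot * dot / ref_norm2
```

For example, `projection_distance([1.0, 2.0], [2.0, 4.0])` is zero because the test vector is only a shrunken copy of the reference, whereas the squared Euclidean distance between the same two vectors is 5.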



Bibliographic reference.  Yuo, Kuo-Hwei / Hwang, Tai-Hwei / Wang, Hsiao-Chuan (2000): "Combination of temporal trajectory filtering and projection measure for robust speaker identification", In ICSLP-2000, vol.2, 791-794.