Third International Conference on Spoken Language Processing (ICSLP 94)

Yokohama, Japan
September 18-22, 1994

A Speaker Verification System Using Prosodic Features

B. Yegnanarayana (1), S. P. Wagh (2), S. Rajendran (1)

(1) Dept. of Computer Science and Engineering, Indian Institute of Technology, Madras, India
(2) Dept. of Electrical Engineering, Indian Institute of Technology, Madras, India

This paper describes a technique for automatic speaker verification based on prosodic knowledge in Hindi using neural networks. Properties of intonation patterns (changes in F0 as a function of time) and duration were exploited to extract speaker specific information from natural speech utterances, which were used for fixed text speaker verification task. A set of twenty-three features (fifteen pitch features and eight durational features) were extracted from a fixed natural utterance, using a word boundary hypothesization algorithm. A neural network model based on adaptive resonance theory (ART2) was used to verify speaker from the input feature set. The system was trained for twenty five speakers and tested with twenty seven impostors. The results show that the overall percentage of correct acceptance and correct rejection was found to be about 98%.

Full Paper

Bibliographic reference.  Yegnanarayana, B. / Wagh, S. P. / Rajendran, S. (1994): "A speaker verification system using prosodic features", In ICSLP-1994, 1867-1870.