4th International Conference on Spoken Language Processing

Philadelphia, PA, USA
October 3-6, 1996

Application of Speaker Modification Techniques to Phonetic Vocoding

Carlos M. Ribeiro, Isabel M. Trancoso

INESC - Instituto de Engenharia e Sistemas de Computadores, Lisbon, Portugal

The goal of the work described in this paper is to develop a very low bit rate vocoding scheme. The vocoder is a typical LPC vocoder, whose parameters are post-processed on a phone-by-phone basis, resulting in a variable bit rate segment vocoder. Given the well known speaker recognizability problems presented by vocoders at such low bit rates, we have attempted to integrate a speaker modification method based on altering the formant frequencies and bandwidths of vowel segments. This is done by transmitting the mean value and standard deviation of the radius and angle of the poles corresponding to formant frequencies for each phone. In the decoder stage, the phone index is used to retrieve a set of normalized values from a codebook of ‘typical’ phones. This set is speaker adapted to preserve the static characteristics (average and standard deviation) but relies in the typical phone to represent the dynamic characteristics such as formant trajectories.

Full Paper

Bibliographic reference.  Ribeiro, Carlos M. / Trancoso, Isabel M. (1996): "Application of speaker modification techniques to phonetic vocoding", In ICSLP-1996, 306-309.