ISCA Archive Eurospeech 1999
ISCA Archive Eurospeech 1999

Tailoring kalman filtering towards speaker characterisation

John McKenna, Stephen Isard

This paper describes a method for obtaining smoothed vocal tract parameters from analysis during the closed phase of the glottis. The method is based upon Expectation Maximisation (EM) and uses Kalman-Rauch forward-backward iterations through a voiced segment, in which the speech data during excitation and open phases are excluded by treating them as ‘missing data’. This approach exploits the non-independence of neighbouring spectra and compensates for small numbers of available points, while preserving speaker-characteristic information and tracking variations in it. The vocal tract filter parameters are then used for inverse filtering the speech, thus obtaining estimates of the source excitation. The extracted excitation signal can be used to excite other sets of parameters to produce natural sounding speech.

doi: 10.21437/Eurospeech.1999-616

Cite as: McKenna, J., Isard, S. (1999) Tailoring kalman filtering towards speaker characterisation. Proc. 6th European Conference on Speech Communication and Technology (Eurospeech 1999), 2793-2796, doi: 10.21437/Eurospeech.1999-616

  author={John McKenna and Stephen Isard},
  title={{Tailoring kalman filtering towards speaker characterisation}},
  booktitle={Proc. 6th European Conference on Speech Communication and Technology (Eurospeech 1999)},