ISCA Archive ICSLP 2000
ISCA Archive ICSLP 2000

Total least squares based subband modelling for scalable speech representations with damped sinusoids

Kris Hermus, Werner Verhelst, Patrick Wambacq, Philippe Lemmerling

We describe how Total Least Squares (TLS) algorithms can be applied as a powerful and eƆcient modelling tool for wideband speech. A detailed description in both time domain and fre- quency domain illustrates how the modelling functions { damped sinusoids { naturally synthesise non-stationary signals. Straightforward implementations of TLS applied to fullband speech are known to be computationally hard and they can suffer from numerical sensitivity.

In this paper we introduce a subband approach, which leads to a significant reduction of the computational load with an enhanced numerical stability. Moreover, it enables to control the distribution of the TLS components over the spectral range of the input signal such that perceptual criteria can be incorporated in the modelling scheme.

We also address the scalability of our design from smallband speech to high quality audio, and provide evidence for the existence of coupled components in TLS modelled segments.


Cite as: Hermus, K., Verhelst, W., Wambacq, P., Lemmerling, P. (2000) Total least squares based subband modelling for scalable speech representations with damped sinusoids. Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000), vol. 3, 1129-1132

@inproceedings{hermus00b_icslp,
  author={Kris Hermus and Werner Verhelst and Patrick Wambacq and Philippe Lemmerling},
  title={{Total least squares based subband modelling for scalable speech representations with damped sinusoids}},
  year=2000,
  booktitle={Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000)},
  pages={vol. 3, 1129-1132}
}