ISCA Archive NOLISP 2007
ISCA Archive NOLISP 2007

Phase-based methods for voice source analysis

Christophe d'Alessandro

Voice source analysis is an important but difficult issue for speech processing. In this talk, three aspects of voice source analysis recently developed at LIMSI (Orsay) and FPMs (Mons) are discussed. In a first part, time domain and spectral domain modelling of glottal flow signals are presented (Doval & al., 2006). It is shown that the glottal flow can be modelled as an anticausal filter (maximum phase) before the glottal closure, and by as a causal filter (minimum phase) after the glottal closure. In a second part, taking advantage of this phase structure, causal and anticausal components of the speech signal are separated according to the location in the Z-plane of the zeros of the Z-Transform (ZZT) of the windowed signal (Bozkurt & al. 2005). Results of a comparative evaluation of the ZZT and linear prediction for source/tract separation are reported. This method is also useful for voice source parameters analysis. Voice open quotient and glottal flow asymmetry analyses using the ZZT and ElectroGlottoGraphy (EGG) are compared (Sturmel & al. 2006). In a third part, glottal closure instant detection using the phase of the wavelet transform is discussed. A method based on the lines of maximum phase in the time-scale plane is proposed. This method is compared to EGG for robust glottal closure instant analysis (Vu Ngoc Tuan & al, 2000).

s Doval, B. / D'Alessandro, C. / Henrich, N., "The spectrum of glottal flow model", Acta Acustica United with Acustica, p. 21p, vol. 92, N°6, November-December 2006. Bozkurt, B. / Doval, B. / D'Alessandro, C. / Dutoit, T., "Zeros of Z-transform representation with application to source-filter separation in speech", IEEE Signal Processing Letters, p. 344-347, vol. 12, N °4, April 2005. Sturmel, Nicolas / D'Alessandro, C. / Doval, B., "A spectral method for estimation of the voice speed quotient and evaluation using electroglottography", 7th Conference on Advances in Quantitative Laryngology, p. 6p, Groningen, The Netherlands, October 6-7, 2006. Vu Ngoc Tuan / d'Alessandro, C., "Glottal Closure Detection using EGG and the Wavelet Transform", 4th workshop "Advances in Objective Laryngoscopy, Voice and Speech Research, Jena, Germany, April 7-8, 2000.

Cite as: d'Alessandro, C. (2007) Phase-based methods for voice source analysis. Proc. ITRW on Nonlinear Speech Processing (NOLISP 2007)

  author={Christophe d'Alessandro},
  title={{Phase-based methods for voice source analysis}},
  booktitle={Proc. ITRW on Nonlinear Speech Processing (NOLISP 2007)}