INTERSPEECH 2011
12th Annual Conference of the International Speech Communication Association

Florence, Italy
August 27-31. 2011

Sinewave Representations of Nonmodality

Nicolas Malyska, Thomas F. Quatieri, Robert Dunn

MIT Lincoln Laboratory, USA

Regions of nonmodal phonation, exhibiting deviations from uniform glottal-pulse periods and amplitudes, occur often and convey information about speaker- and linguistic-dependent factors. Such waveforms pose challenges for speech modeling, analysis/synthesis, and processing. In this paper, we investigate the representation of nonmodal pulse trains as a sum of harmonically-related sinewaves with time-varying amplitudes, phases, and frequencies. We show that a sinewave representation of any impulsive signal is not unique and also the converse, i.e., frame-based measurements of the underlying sinewave representation can yield different impulse trains. Finally, we argue how this ambiguity may explain addition, deletion, and movement of pulses in sinewave synthesis and a specific illustrative example of time-scale modification of a nonmodal case of diplophonia.

Full Paper

Bibliographic reference.  Malyska, Nicolas / Quatieri, Thomas F. / Dunn, Robert (2011): "Sinewave representations of nonmodality", In INTERSPEECH-2011, 69-72.