10th Annual Conference of the International Speech Communication Association

Brighton, United Kingdom
September 6-10, 2009

Time-Varying Autoregressive Tests for Multiscale Speech Analysis

Daniel Rudoy (1), Thomas F. Quatieri (2), Patrick J. Wolfe (1)

(1) Harvard University, USA
(2) MIT, USA

In this paper we develop hypothesis tests for speech waveform nonstationarity based on time-varying autoregressive models, and demonstrate their efficacy in speech analysis tasks at both segmental and sub-segmental scales. Central to our approach is a generalized likelihood ratio testing framework tailored to autoregressive coefficient evolutions suitable for speech. After evaluating our framework on speech-like synthetic signals, we present preliminary results for two distinct analysis tasks using speech waveform data. At the segmental level, we develop an adaptive short-time segmentation scheme and evaluate it on whispered speech recordings; at the sub-segmental level, we address the problem of detecting the glottal flow closed phase. Results show that our hypothesis testing framework can reliably detect changes in the vocal tract parameters across multiple scales, underscoring its broad applicability to speech analysis.
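
As a rough illustration of the kind of test the abstract describes, the following is a minimal sketch of a generalized likelihood ratio statistic comparing a constant-coefficient AR(p) model (the stationary null hypothesis) against a time-varying AR(p) model. It is not the authors' implementation. The polynomial basis for the coefficient trajectories, the Gaussian innovation assumption, the least-squares fitting, and the names `simulate_tvar2` and `tvar_glrt` are all assumptions made here for illustration; the abstract does not specify them.

```python
import numpy as np


def simulate_tvar2(a1_path, a2=-0.3, seed=0):
    """Simulate x[t] = a1[t]*x[t-1] + a2*x[t-2] + e[t], with e ~ N(0, 1).

    Hypothetical helper for generating a speech-like synthetic test signal.
    """
    rng = np.random.default_rng(seed)
    n = len(a1_path)
    e = rng.standard_normal(n)
    x = np.zeros(n)
    for t in range(2, n):
        x[t] = a1_path[t] * x[t - 1] + a2 * x[t - 2] + e[t]
    return x


def tvar_glrt(x, p=2, q=2):
    """GLRT-style statistic for H0: AR(p) vs H1: TVAR(p).

    Under H1 each AR coefficient is assumed to follow a polynomial of
    degree q in normalized time (an illustrative choice of basis).
    """
    x = np.asarray(x, dtype=float)
    n = len(x)
    t = np.linspace(-1.0, 1.0, n)
    # Basis functions t^0, ..., t^q; the constant term makes H0 nested in H1.
    basis = np.vstack([t**j for j in range(q + 1)])
    y = x[p:]
    # Column i holds the lagged samples x[t - i - 1] for t = p, ..., n - 1.
    lags = np.column_stack([x[p - i - 1 : n - i - 1] for i in range(p)])

    def rss(design):
        # Residual sum of squares of the least-squares fit.
        coef, *_ = np.linalg.lstsq(design, y, rcond=None)
        resid = y - design @ coef
        return float(resid @ resid)

    rss0 = rss(lags)
    # H1 regressors: every lag multiplied by every basis function.
    design1 = np.hstack([lags * basis[j, p:, None] for j in range(q + 1)])
    rss1 = rss(design1)
    # Twice the log likelihood ratio for Gaussian innovations, unknown variance.
    return (n - p) * np.log(rss0 / rss1)
```

Under these assumptions, a large value of the statistic is evidence against stationarity within the analysis window. For example, comparing `tvar_glrt(simulate_tvar2(np.full(2000, 0.5)))` with `tvar_glrt(simulate_tvar2(np.linspace(-0.8, 0.8, 2000)))` should yield a much larger value for the second signal. Under standard regularity conditions one would expect the null distribution to be approximately chi-squared with p*q degrees of freedom, which suggests one way to set a threshold.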

Bibliographic reference. Rudoy, Daniel / Quatieri, Thomas F. / Wolfe, Patrick J. (2009): "Time-varying autoregressive tests for multiscale speech analysis", in INTERSPEECH-2009, 2839-2842.