INTERSPEECH 2014

We present a method to derive a small number of speech motor control “primitives” that can produce linguistically-interpretable articulatory movements. We envision that such a dictionary of primitives can be useful for speech motor control, particularly in finding a low-dimensional subspace for such control. First, we use the iterative Linear Quadratic Gaussian with Learned Dynamics (iLQG-LD) algorithm to derive (for a set of utterances) a set of stochastically optimal control inputs to a learned dynamical systems model of the vocal tract that produces desired movement sequences. Second, we use a convolutive Nonnegative Matrix Factorization with sparseness constraints (cNMFsc) algorithm to find a small dictionary of control input primitives that can be used to reproduce the aforementioned optimal control inputs that produce the observed articulatory movements. The method performs favorably on both qualitative and quantitative evaluations conducted on synthetic data produced by an articulatory synthesizer. Such a primitives-based framework could help inform theories of speech motor control and coordination.
Bibliographic reference. Ramanarayanan, Vikram / Goldstein, Louis / Narayanan, Shrikanth S. (2014): "Motor control primitives arising from a learned dynamical systems model of speech articulation", In INTERSPEECH-2014, 150-154.
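
The second step described in the abstract, factoring a matrix of control inputs into a small dictionary of time-extended primitives and their activations, can be illustrated with a minimal convolutive NMF sketch. This is not the authors' cNMFsc implementation: it uses plain multiplicative updates for a Euclidean cost, and the optional `l1` penalty is a simple stand-in for the sparseness constraints named in the abstract. The function names (`shift`, `reconstruct`, `cnmf`), the parameters, and the synthetic data are all hypothetical, and the sketch assumes the input matrix has already been made nonnegative.

```python
import numpy as np

def shift(X, t):
    """Shift columns of X right by t (t > 0) or left by -t (t < 0), zero-filling."""
    out = np.zeros_like(X)
    if t == 0:
        out[:] = X
    elif t > 0:
        out[:, t:] = X[:, :-t]
    else:
        out[:, :t] = X[:, -t:]
    return out

def reconstruct(W, H):
    """Convolutive model: V ~ sum_t W[t] @ shift(H, t).

    W has shape (T, M, K): K primitives, each spanning T frames of M channels.
    H has shape (K, N): activation of each primitive over N frames.
    """
    return sum(W[t] @ shift(H, t) for t in range(W.shape[0]))

def cnmf(V, K, T, n_iter=200, l1=0.0, seed=0, eps=1e-9):
    """Fit a convolutive NMF to nonnegative V (M x N) with multiplicative updates.

    l1 is an assumed L1 penalty on H, not the sparseness constraint of cNMFsc.
    Returns W, H, and the reconstruction error after each iteration.
    """
    rng = np.random.default_rng(seed)
    M, N = V.shape
    W = rng.random((T, M, K)) + eps
    H = rng.random((K, N)) + eps
    errors = []
    for _ in range(n_iter):
        # Update every time slice of the dictionary against the same reconstruction.
        R = reconstruct(W, H)
        for t in range(T):
            Ht = shift(H, t)
            W[t] *= (V @ Ht.T) / (R @ Ht.T + eps)
        # Update activations, pooling evidence from all time slices.
        R = reconstruct(W, H)
        num = sum(W[t].T @ shift(V, -t) for t in range(T))
        den = sum(W[t].T @ shift(R, -t) for t in range(T))
        H *= num / (den + l1 + eps)
        errors.append(np.linalg.norm(V - reconstruct(W, H)))
    return W, H, errors

# Synthetic demo: build data from known primitives, then try to recover a fit.
demo_rng = np.random.default_rng(42)
W_demo = demo_rng.random((4, 8, 3))
H_demo = demo_rng.random((3, 60)) * (demo_rng.random((3, 60)) < 0.15)
V_demo = reconstruct(W_demo, H_demo)
W_fit, H_fit, errs = cnmf(V_demo, K=3, T=4, n_iter=100)
```

In this sketch each slice `W_fit[:, :, k]` plays the role of one primitive (a short multichannel trajectory), and row `k` of `H_fit` indicates when that primitive is active. Real control inputs can be signed, so some preprocessing to satisfy the nonnegativity assumption would be needed before applying it.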