7th International Conference on Spoken Language Processing

September 16-20, 2002
Denver, Colorado, USA

Acoustic-to-Articulatory Inverse Mapping Using an HMM-Based Speech Production Model

Sadao Hiroya, Masaaki Honda

NTT Corporation, Japan

We present a method that determines articulatory movements from speech acoustics using an HMM (Hidden Markov Model)-based speech production model. The model statistically generates speech acoustics and articulatory movements from a given phonemic string. It consists of HMMs of articulatory movements for each phoneme and an articulatory-to-acoustic mapping for each HMM state. For a given speech acoustics, the maximum a posteriori probability estimate of the articulatory parameters of the statistical model is presented. The methodís performance on sentences was evaluated by comparing the estimated articulatory parameters with observed parameters. The average rms error of the estimated articulatory parameters was 1.79 mm with phonemic information and 2.16 mm without phonemic information in an utterance.

Full Paper

Bibliographic reference.  Hiroya, Sadao / Honda, Masaaki (2002): "Acoustic-to-articulatory inverse mapping using an HMM-based speech production model", In ICSLP-2002, 2305-2308.