8th International Conference on Spoken Language Processing

Jeju Island, Korea
October 4-8, 2004

Design of Compact Acoustic Models through Clustering of Tied-Covariance Gaussians

Mark Mao (1), Vincent Vanhoucke (2)

(1) Stanford University, USA
(2) Nuance Communications, USA

We propose a new approach for designing compact acoustic models particularly suited to large systems that combine multiple model sets to represent distinct acoustic conditions or languages. We show that Gaussians based on mixtures of inverse covariances (MIC) with shared parameters can be clustered using an efficient Lloyd algorithm. As a result, more compact acoustic models can be built by clustering Gaussians across tied mixtures. In addition, we show that the tied parameters of MIC models can be shared across acoustic models and languages, making it possible to build more efficient multi-model systems which take advantage of a common pool of clustered Gaussians.

Full Paper

Bibliographic reference.  Mao, Mark / Vanhoucke, Vincent (2004): "Design of compact acoustic models through clustering of tied-covariance Gaussians", In INTERSPEECH-2004, 2805-2808.