10th Annual Conference of the International Speech Communication Association

Brighton, United Kingdom
September 6-10, 2009

Compression Techniques Applied to Multiple Speech Recognition Systems

Catherine Breslin, Matt Stuttle, Kate Knill

Toshiba Research Europe Ltd., UK

Speech recognition systems typically contain many Gaussian distributions, and hence a large number of parameters. This makes them both slow to decode speech, and large to store. Techniques have been proposed to decrease the number of parameters. One approach is to share parameters between multiple Gaussians, thus reducing the total number of parameters and allowing for shared likelihood calculation. Gaussian tying and subspace clustering are two related techniques which take this approach to system compression. These techniques can decrease the number of parameters with no noticeable drop in performance for single systems. However, multiple acoustic models are often used in real speech recognition systems. This paper considers the application of Gaussian tying and subspace compression to multiple systems. Results show that two speech recognition systems can be modelled using the same number of Gaussians as just one system, with little effect on individual system performance.

Full Paper

Bibliographic reference.  Breslin, Catherine / Stuttle, Matt / Knill, Kate (2009): "Compression techniques applied to multiple speech recognition systems", In INTERSPEECH-2009, 1407-1410.