ISCA Archive Interspeech 2009
ISCA Archive Interspeech 2009

Compression techniques applied to multiple speech recognition systems

Catherine Breslin, Matt Stuttle, Kate Knill

Speech recognition systems typically contain many Gaussian distributions, and hence a large number of parameters. This makes them both slow to decode speech, and large to store. Techniques have been proposed to decrease the number of parameters. One approach is to share parameters between multiple Gaussians, thus reducing the total number of parameters and allowing for shared likelihood calculation. Gaussian tying and subspace clustering are two related techniques which take this approach to system compression. These techniques can decrease the number of parameters with no noticeable drop in performance for single systems. However, multiple acoustic models are often used in real speech recognition systems. This paper considers the application of Gaussian tying and subspace compression to multiple systems. Results show that two speech recognition systems can be modelled using the same number of Gaussians as just one system, with little effect on individual system performance.

doi: 10.21437/Interspeech.2009-433

Cite as: Breslin, C., Stuttle, M., Knill, K. (2009) Compression techniques applied to multiple speech recognition systems. Proc. Interspeech 2009, 1407-1410, doi: 10.21437/Interspeech.2009-433

  author={Catherine Breslin and Matt Stuttle and Kate Knill},
  title={{Compression techniques applied to multiple speech recognition systems}},
  booktitle={Proc. Interspeech 2009},