ISCA Archive Interspeech 2008

Improving the multigram algorithm by using lattices as input

Joris Driesen, Hugo Van hamme

The multigram algorithm is a statistical technique for extracting recurring patterns from sequential input. When provided with a symbol sequence representing a speech signal, it can extract word-like patterns from it, despite the large number of subsequences that can represent a single word. To do so, it uses statistical information derived from the entire input. However, because speech is abstracted to symbols, much of the information originally present in the signal is no longer available to the algorithm.

In this paper, we propose a way of using a richer abstraction of the signal in the form of a lattice. Furthermore, we present a way of grounding recurring patterns to concepts in other modalities. Finally, the information learned by the algorithm from both kinds of input is tested in a recognition experiment. The results show that the use of lattices leads to a significant improvement in recognition rate.
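As a rough illustration of the kind of segmentation the multigram algorithm performs on a plain symbol sequence, the sketch below finds the most likely split of a string into variable-length units by dynamic programming. It is only a minimal sketch under stated assumptions: the unit probabilities are taken as given (the full algorithm re-estimates them iteratively from the data), the function name `viterbi_segment`, the `max_len` parameter, and the toy probabilities are hypothetical, and the lattice input proposed in the paper is not covered.

```python
import math


def viterbi_segment(symbols, unit_probs, max_len=3):
    """Return the most likely segmentation of `symbols` into units.

    `unit_probs` maps a unit (a substring of at most `max_len` symbols)
    to its probability. Units are assumed independent, so the score of a
    segmentation is the product of its unit probabilities.
    Returns None if no segmentation with non-zero probability exists.
    """
    n = len(symbols)
    # best[i]: log-probability of the best segmentation of symbols[:i]
    best = [-math.inf] * (n + 1)
    # back[i]: start index of the last unit in that best segmentation
    back = [0] * (n + 1)
    best[0] = 0.0

    for end in range(1, n + 1):
        for start in range(max(0, end - max_len), end):
            p = unit_probs.get(symbols[start:end], 0.0)
            if p > 0.0 and best[start] > -math.inf:
                score = best[start] + math.log(p)
                if score > best[end]:
                    best[end] = score
                    back[end] = start

    if best[n] == -math.inf:
        return None

    # Follow the back-pointers from the end to recover the units.
    units = []
    i = n
    while i > 0:
        units.append(symbols[back[i]:i])
        i = back[i]
    units.reverse()
    return units


# Hypothetical toy model, chosen only for illustration.
toy_probs = {"a": 0.1, "b": 0.1, "c": 0.1, "d": 0.1, "ab": 0.4, "abc": 0.2}
print(viterbi_segment("abcabd", toy_probs))
```

With these toy probabilities, the split into "abc", "ab", "d" scores 0.2 × 0.4 × 0.1 = 0.008, which is higher than any split that breaks "abc" into smaller units, so longer recurring units are preferred when their probability is high enough.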

doi: 10.21437/Interspeech.2008-541

Cite as: Driesen, J., Van hamme, H. (2008) Improving the multigram algorithm by using lattices as input. Proc. Interspeech 2008, 2086-2089, doi: 10.21437/Interspeech.2008-541

  author={Joris Driesen and Hugo {Van hamme}},
  title={{Improving the multigram algorithm by using lattices as input}},
  booktitle={Proc. Interspeech 2008},