ISCA Archive Interspeech 2007

Speech recognition with state-based nearest neighbour classifiers

Thomas Deselaers, Georg Heigold, Hermann Ney

We present a system that uses nearest neighbour classification on the state level of the hidden Markov model. Common speech recognition systems nowadays use Gaussian mixtures with a very high number of densities. We propose to carry this idea to the extreme, such that each observation is a prototype of its own. This approach is well-known and widely used in other areas of pattern recognition and has some immediate advantages over other classification approaches, but has never been applied to speech recognition. We evaluate the proposed method on the SieTill corpus of continuous digit strings and on the large vocabulary EPPS English task. It is shown that nearest neighbour outperforms conventional systems when training data is sparse.
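As a rough illustration of the idea of treating every training observation as its own prototype, the following minimal sketch scores a feature vector against each HMM state by its nearest stored prototype. It is not the authors' implementation: the squared Euclidean distance, the use of the negated distance as an emission score, the function names, and the toy two-dimensional data are all assumptions made for this example.

```python
# Hypothetical sketch of state-level nearest neighbour scoring.
# Assumptions (not taken from the paper): squared Euclidean distance,
# and the negated nearest-prototype distance used as the emission score.

def squared_distance(a, b):
    """Squared Euclidean distance between two equal-length feature vectors."""
    return sum((ai - bi) ** 2 for ai, bi in zip(a, b))


def nn_state_scores(x, prototypes):
    """Score observation x against every state.

    prototypes maps a state label to the list of training vectors that were
    aligned to that state; each vector acts as a prototype of its own.
    Returns a dict mapping state label to the negated distance to the
    nearest prototype (higher means a better match).
    """
    return {
        state: -min(squared_distance(x, p) for p in protos)
        for state, protos in prototypes.items()
    }


# Toy data: two states with two stored observations each (made up).
prototypes = {
    "s0": [[0.0, 0.0], [1.0, 0.0]],
    "s1": [[5.0, 5.0], [6.0, 5.0]],
}
x = [0.9, 0.1]

scores = nn_state_scores(x, prototypes)
best_state = max(scores, key=scores.get)
print(best_state, scores)
```

In a full recogniser such per-state scores would stand in for the Gaussian mixture emission scores during HMM decoding; how the distances are turned into scores in the actual system is not stated in the abstract.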


doi: 10.21437/Interspeech.2007-566

Cite as: Deselaers, T., Heigold, G., Ney, H. (2007) Speech recognition with state-based nearest neighbour classifiers. Proc. Interspeech 2007, 2093-2096, doi: 10.21437/Interspeech.2007-566

@inproceedings{deselaers07_interspeech,
  author={Thomas Deselaers and Georg Heigold and Hermann Ney},
  title={{Speech recognition with state-based nearest neighbour classifiers}},
  year=2007,
  booktitle={Proc. Interspeech 2007},
  pages={2093--2096},
  doi={10.21437/Interspeech.2007-566}
}