ISCA Archive Interspeech 2009
ISCA Archive Interspeech 2009

Fast speech recognition for voice destination entry in a car navigation system

Hoon Chung, JeonGue Park, HyeonBae Jeon, YunKeun Lee

In this paper, we introduce a multi-stage decoding algorithm optimized to recognize very large number of entry names on a resource-limited embedded device. The multi-stage decoding algorithm is composed of a two-stage HMM-based coarse search and a detailed search. The two-stage HMM-based coarse search generates a small set of candidates that are assumed to contain a correct hypothesis with high probability, and the detailed search re-ranks the candidates by rescoring them with sophisticate acoustic models. In this paper, we take experiments with 1-millions of point-of-interest (POI) names on an in-car navigation device with a fixed-point processor running at 620MHz. The experimental result shows that the multi-stage decoding algorithm runs about 2.23 times real-time on the device without serious degradation of recognition performance.


doi: 10.21437/Interspeech.2009-290

Cite as: Chung, H., Park, J., Jeon, H., Lee, Y. (2009) Fast speech recognition for voice destination entry in a car navigation system. Proc. Interspeech 2009, 975-978, doi: 10.21437/Interspeech.2009-290

@inproceedings{chung09_interspeech,
  author={Hoon Chung and JeonGue Park and HyeonBae Jeon and YunKeun Lee},
  title={{Fast speech recognition for voice destination entry in a car navigation system}},
  year=2009,
  booktitle={Proc. Interspeech 2009},
  pages={975--978},
  doi={10.21437/Interspeech.2009-290}
}