4th International Conference on Spoken Language Processing

Philadelphia, PA, USA
October 3-6, 1996

LR-Parser-driven Viterbi Search with Hypotheses Merging Mechanism Using Context-Dependent Phone Models

Tomokazu Yamada, Shigeki Sagayama

NTT Human Interface Laboratories, Kanagawa, Japan

This paper describes a Viterbi search algorithm for continuous speech recognition using context-dependent phone models under the constraint defined by a context-free grammar (CFG). It is based on a frame synchronous LR parser which dynamically generates a finite state network (FSN) from the CFG with an efficient path merging mechanism. Full context-dependency (intra- and interword context) is taken into account in the likelihood calculation process. This paper first describes the algorithm and the processing mechanism, then compares the experimental results of our algorithm and the conventional tree-based HMM-LR speech recognition algorithm which uses HMMs and an LR parser in phone-synchronous processing. The experiments show that our algorithm runs faster than the conventional HMM-LR algorithm with an equivalent recognition accuracy.

Full Paper

Bibliographic reference.  Yamada, Tomokazu / Sagayama, Shigeki (1996): "LR-parser-driven viterbi search with hypotheses merging mechanism using context-dependent phone models", In ICSLP-1996, 2103-2106.