EUROSPEECH 2001 Scandinavia
7th European Conference on Speech Communication and Technology
2nd INTERSPEECH Event

Aalborg, Denmark
September 3-7, 2001

                 

Improved Phoneme-History-Dependent Search for Large-Vocabulary Continuous-Speech Recognition

Takaaki Hori (1), Yoshiaki Noda (2), Shoichi Matsunaga (1)

(1) NTT Cyber Space Laboratories, Japan
(2) NTT Communications Corporation, Japan

This paper describes an improved phoneme-history-dependent (PHD) search algorithm. This method is an optimum algorithm under the assumption that the starting time of a word depends on only a few preceding phonemes (phoneme history). The computational cost and number of recognition errors made by a multi-pass-based recognizer can be reduced if the PHD search of the first decoding pass uses re-selection of the preceding word and the optimum length of phoneme histories. These improvements increase the speed of the first decoding pass and help that the word lattice has the correct word sequence. Consequently search errors can be reduced in the second decoding pass. In 65k-word domain-independent Japanese read-speech dictation task and 1000-word spontaneous-speech airplane-reservation task, the improved PHD search was 1.2-2.0 times faster than a traditional word-dependent search under the condition of equal word accuracy.

Full Paper

Bibliographic reference.  Hori, Takaaki / Noda, Yoshiaki / Matsunaga, Shoichi (2001): "Improved phoneme-history-dependent search for large-vocabulary continuous-speech recognition", In EUROSPEECH-2001, 1809-1813.