ISCA Archive ICSLP 2000

A three-stage solution for flexible vocabulary speech understanding

Grace Chung

This paper discusses our three-stage approach to a flexible vocabulary speech understanding system, which can detect out-of-vocabulary (OOV) words, and hypothesize their phonetic and orthographic transcriptions. In the first stage, we introduce the column-bigram finite-state transducer (FST) which, while embedding ANGIE sublexical models, also supports previously unseen data from unknown words. Secondly, the ANGIE models utilize grapheme information, providing tighter linguistic constraint as well as instantaneous sound-to-letter capability during recognition. Thirdly, the syllable-level lexical units of the first stage are automatically derived via an iterative procedure to optimize performance. The second-stage recognizer employs ANGIE to output a word network which is parsed by TINA, our natural language (NL) processor, in stage three. Experiments with a JUPITER implementation of this system are described in [1].

[1] G. Chung, "Automatically Incorporating Unknown Words in JUPITER," in these Proceedings.


Cite as: Chung, G. (2000) A three-stage solution for flexible vocabulary speech understanding. Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000), vol. 4, 266-269

@inproceedings{chung00_icslp,
  author={Grace Chung},
  title={{A three-stage solution for flexible vocabulary speech understanding}},
  year=2000,
  booktitle={Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000)},
  volume={4},
  pages={266--269}
}