September 22-25, 1997
We motivate the integration of a probabilistic pronunciation model into a system for recognizing spontaneous speech and propose a possible architecture of such a model. In order to develop an environment for experiments, a simplified version employing constrained phone recognition and discrete syllable-size HMM subword units was implemented and evaluated. Although the results are still significantly worse than those achieved by our "conventional" word recognizer, they are encouraging given that the experimental system is only a coarse approximation of the proposed approach.
Bibliographic reference. Jost, Uwe / Heine, Henrik / Evermann, Gunnar (1997): "What is wrong with the lexicon - an attempt to model pronunciations probabilistically", In EUROSPEECH-1997, 2475-2478.