8th International Conference on Spoken Language Processing

Jeju Island, Korea
October 4-8, 2004

Creating Speech Recognition Grammars from Regular Expressions for Alphanumeric Concepts

Ye-Yi Wang, Yun-Cheng Ju

Microsoft Corporation, USA

We propose a novel approach that enables the developers with little grammar authoring experience to construct high performance speech grammars for alphanumeric concepts, which are often needed in the more commonly used directed dialog systems in practice. A developer can simply write down a regular expression for the concept and the algorithm automatically constructs a W3C grammar with appropriate semantic interpretation tags. While the quality of the grammar is ultimately determined by the way in which the regular expression is written, the algorithm relieves the developers from the difficult tasks of optimizing grammar structures and assigning appropriate semantic interpretation tags, thus it greatly speeds up grammar development and reduces the requirement of expertise. Preliminary experimental results have shown that the grammar created with this approach consistently out-performed the general alphanumeric rules in the grammar library. In some cases the semantic error rates were cut by more than 50%.

Full Paper

Bibliographic reference.  Wang, Ye-Yi / Ju, Yun-Cheng (2004): "Creating speech recognition grammars from regular expressions for alphanumeric concepts", In INTERSPEECH-2004, 2161-2164.