7th International Conference on Spoken Language Processing

September 16-20, 2002
Denver, Colorado, USA

Integrating Speech with Keypad Input for Automatic Entry of Spelling and Pronunciation of New Words

Grace Chung (1), Stephanie Seneff (2)

(1) Corporation for National Research Initiatives, USA; (2) MIT Laboratory for Computer Science, USA

This paper describes research whose ultimate aim is to support automatic entry of new words into a spoken dialogue system through interaction with a user. This research demonstrates an important step towards this goal, through a procedure which integrates information made available via the telephone keypad with a spoken instance of the target word, to produce a candidate spelling and pronunciation for the word. Through the use of a parsing mechanism applied to a 73,000 word proper name lexicon [4], we have been able to create a finite-state transducer (FST) that maps phonetics to graphemics, which can be composed with an FST derived from the keypad input to greatly reduce the search space. Experiments conducted on both the OGI name corpus [2] and a set of enrollment data obtained from our Mercury system [5] validate the procedure.


Full Paper

Bibliographic reference.  Chung, Grace / Seneff, Stephanie (2002): "Integrating speech with keypad input for automatic entry of spelling and pronunciation of new words", In ICSLP-2002, 2053-2056.