5th International Conference on Spoken Language Processing
Phonetic baseforms are the basic recognition units in most speech recognition systems. These baseforms are usually determined by linguists once a vocabulary is chosen and not modified thereafter. However, several applications, such as name dialing, require the user be able to add new words to the vocabulary. These new words are often names, or task-specific jargon, that have user-specific pronunciations. This paper describes a novel method for generating phonetic transcriptions (baseforms) of words based on acoustic evidence alone. It does not require any prior acoustic representation of the new word, is vocabulary independent, and uses phonological rules in a post processing stage to enhance the quality of the baseforms thus produced. Our experiments demonstrate the high decoding accuracies obtained when baseforms deduced using this approach are incorporated into our speech recognizer. Our experiments also compare the use of acoustic models that are trained on task-specific data with models trained for a general purpose ( to do digit, names, large vocabulary recognition, etc.), for generating phonetic transcriptions.
Bibliographic reference. Ramabhadran, Bhuvana / Ittycheriah, Abraham (1998): "Phonological rules for enhancing acoustic enrollment of unknown words", In ICSLP-1998, paper 0534.