ISCA Archive ICSLP 1998
ISCA Archive ICSLP 1998

Phonological rules for enhancing acoustic enrollment of unknown words

Bhuvana Ramabhadran, Abraham Ittycheriah

Phonetic baseforms are the basic recognition units in most speech recognition systems. These baseforms are usually determined by linguists once a vocabulary is chosen and not modified thereafter. However, several applications, such as name dialing, require the user be able to add new words to the vocabulary. These new words are often names, or task-specific jargon, that have user-specific pronunciations. This paper describes a novel method for generating phonetic transcriptions (baseforms) of words based on acoustic evidence alone. It does not require any prior acoustic representation of the new word, is vocabulary independent, and uses phonological rules in a post processing stage to enhance the quality of the baseforms thus produced. Our experiments demonstrate the high decoding accuracies obtained when baseforms deduced using this approach are incorporated into our speech recognizer. Our experiments also compare the use of acoustic models that are trained on task-specific data with models trained for a general purpose ( to do digit, names, large vocabulary recognition, etc.), for generating phonetic transcriptions.


doi: 10.21437/ICSLP.1998-431

Cite as: Ramabhadran, B., Ittycheriah, A. (1998) Phonological rules for enhancing acoustic enrollment of unknown words. Proc. 5th International Conference on Spoken Language Processing (ICSLP 1998), paper 0534, doi: 10.21437/ICSLP.1998-431

@inproceedings{ramabhadran98_icslp,
  author={Bhuvana Ramabhadran and Abraham Ittycheriah},
  title={{Phonological rules for enhancing acoustic enrollment of unknown words}},
  year=1998,
  booktitle={Proc. 5th International Conference on Spoken Language Processing (ICSLP 1998)},
  pages={paper 0534},
  doi={10.21437/ICSLP.1998-431}
}