Sixth International Conference on Spoken Language Processing
(ICSLP 2000)

Beijing, China
October 16-20, 2000

Unified Acoustic Modeling for Continuous Speech Recognition

Rathinavelu Chengalvarayan

Speech Processing Group, Lucent Speech Solutions Department, Lucent Technologies, Naperville, IL, USA

Usually the speech and the silence models are trained together depending upon the type of recognition task. For example, if the recognition task is only on con- nected-digits then the corresponding digit models are built using only the connected-digit training corpus. Similarly for large-vocabulary recognition tasks, the subword or the phoneme models are generated using only the subword training set. Further the alphabet models are separately trained using the alphabet training data for letter recognition. In certain applications the developer needs to perform mixed-mode operations like alphabet followed by digits, digits succeeded by keywords, letters preceded by keywords etc. So there is a need to robustly design a speech recognizer for such kind of specific applications. In that context, we propose several acoustic modeling techniques to improve the unified model performance for applications that require mixed-mode operations.

Full Paper

Bibliographic reference.  Chengalvarayan, Rathinavelu (2000): "Unified acoustic modeling for continuous speech recognition", In ICSLP-2000, vol.4, 552-555.