ISCA Archive Interspeech 2007
ISCA Archive Interspeech 2007

Landmark-based approach to speech recognition: an alternative to HMMs

Carol Y. Espy-Wilson, Tarun Pruthi, Amit Juneja, Om Deshmukh

In this paper, we compare a Probabilistic Landmark-Based speech recognition System (LBS) which uses Knowledge-based Acoustic Parameters (APs) as the front-end with an HMM-based recognition system that uses the Mel-Frequency Cepstral Coefficients as its front end. The advantages of LBS based on APs are (1) the APs are normalized for extra-linguistic information, (2) acoustic analysis at different landmarks may be performed with different resolutions and with different APs, (3) LBS outputs multiple acoustic landmark sequences that signal perceptually significant regions in the speech signal, (4) it may be easier to port this system to another language since the phonetic features captured by the APs are universal, and (5) LBS can be used as a tool for uncovering and subsequently understanding variability. LBS also has a probabilistic framework that can be combined with pronunciation and language models in order to make it more scalable to large vocabulary recognition tasks.

doi: 10.21437/Interspeech.2007-324

Cite as: Espy-Wilson, C.Y., Pruthi, T., Juneja, A., Deshmukh, O. (2007) Landmark-based approach to speech recognition: an alternative to HMMs. Proc. Interspeech 2007, 886-889, doi: 10.21437/Interspeech.2007-324

  author={Carol Y. Espy-Wilson and Tarun Pruthi and Amit Juneja and Om Deshmukh},
  title={{Landmark-based approach to speech recognition: an alternative to HMMs}},
  booktitle={Proc. Interspeech 2007},