ISCA Archive SPKD 2008
ISCA Archive SPKD 2008

Incorporating suprasegmental knowledge for phone recognition with conditional random fields

Prateeti Mohapatra, Eric Fosler-Lussier

In this paper, we investigate the integration of lexical stress and syllabic position of segments in a Multi-Layer Perceptron (MLP) classification system that is part of our Conditional Random Fields (CRF) phone recognizer. CRFs are used to integrate MLP posterior estimates, particularly of phonological features or phonetic classes, which stand in as representations of the acoustics; we show that incorporating suprasegmental information as part of the MLP classification system augments the acoustic space in a beneficial way for phonological feature based CRF models. TIMIT phone recognition experiments show a small but statistically significant improvement.


Cite as: Mohapatra, P., Fosler-Lussier, E. (2008) Incorporating suprasegmental knowledge for phone recognition with conditional random fields. Proc. ISCA ITRW on Speech Analysis and Processing for Knowledge Discovery, paper 054

@inproceedings{mohapatra08_spkd,
  author={Prateeti Mohapatra and Eric Fosler-Lussier},
  title={{Incorporating suprasegmental knowledge for phone recognition with conditional random fields}},
  year=2008,
  booktitle={Proc. ISCA ITRW on Speech Analysis and Processing for Knowledge Discovery},
  pages={paper 054}
}