ISCA Archive Interspeech 2009

Stream-based context-sensitive phone mapping for cross-lingual speech recognition

Khe Chai Sim, Haizhou Li

Recently, a Probabilistic Phone Mapping (PPM) model was proposed to facilitate cross-lingual automatic speech recognition using a foreign phonetic system. Under this framework, discrete hidden Markov models (HMMs) are used to map a foreign phone sequence to a target phone sequence. Context-sensitive mapping is made possible by expanding the discrete observation symbols to include the contexts of the foreign phones in which they appear in the sequence. Unfortunately, modelling the context dependencies jointly results in a dramatic increase in the number of model parameters as wider contexts are used. In this paper, the probability of observing a context-dependent symbol is decomposed into the product of the probabilities of observing the symbol and its contexts. This allows wider contexts to be modelled without greatly increasing the model complexity. The decomposition can be implemented conveniently using a multiple-stream discrete HMM system in which the contexts are treated as independent streams. Experimental results are reported on the TIMIT English phone recognition task using Czech, Hungarian and Russian foreign phone recognisers.
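
The decomposition described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names, the phone-set size of 40, the toy two-phone distributions, and the use of unweighted streams are all assumptions made for illustration. It compares how many discrete observation probabilities one HMM state needs under joint context modelling and under the stream-based factorisation, and scores one observation as a product of per-stream probabilities.

```python
import math

def joint_symbol_count(num_phones: int, num_contexts: int) -> int:
    """Observation symbols per state when a phone and its contexts
    are merged into one joint symbol (grows exponentially)."""
    return num_phones ** (1 + num_contexts)

def stream_symbol_count(num_phones: int, num_contexts: int) -> int:
    """Observation probabilities per state when the phone and each
    context are separate, independent streams (grows linearly)."""
    return num_phones * (1 + num_contexts)

def stream_log_prob(streams, observation) -> float:
    """Log-probability of one context-dependent observation for one state.

    streams:     one discrete distribution (dict) per stream
    observation: one symbol per stream, in the same order
    Assumes unweighted, independent streams, so the probability is the
    product of the per-stream probabilities (a sum in the log domain).
    """
    return sum(math.log(dist[sym]) for dist, sym in zip(streams, observation))

# Hypothetical phone-set size; one left and one right context.
NUM_PHONES = 40
joint = joint_symbol_count(NUM_PHONES, 2)
factored = stream_symbol_count(NUM_PHONES, 2)

# Toy state with three streams: centre phone, left context, right context.
toy_streams = [
    {"a": 0.5, "b": 0.5},    # centre phone
    {"a": 0.25, "b": 0.75},  # left context
    {"a": 0.5, "b": 0.5},    # right context
]
toy_prob = math.exp(stream_log_prob(toy_streams, ("a", "b", "a")))
```

Under these assumptions, widening the context by one phone multiplies the joint symbol count by the phone-set size but only adds one more distribution of that size to the stream-based model.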

doi: 10.21437/Interspeech.2009-764

Cite as: Sim, K.C., Li, H. (2009) Stream-based context-sensitive phone mapping for cross-lingual speech recognition. Proc. Interspeech 2009, 3019-3022, doi: 10.21437/Interspeech.2009-764

  author={Khe Chai Sim and Haizhou Li},
  title={{Stream-based context-sensitive phone mapping for cross-lingual speech recognition}},
  booktitle={Proc. Interspeech 2009},