ISCA Archive Interspeech 2008
ISCA Archive Interspeech 2008

A non-acoustic approach to crosslingual speech recognition performance prediction

Chen Liu, Lynette Melnar

Crosslingual acoustic modeling is an effective technique for building acoustic models in the absence of native training data. A small amount of native speech data is still needed for verifying the crosslingual models by running an actual recognition test. In some very stringent yet realistic situations, however, even the test data may not be available. We introduce an algorithm that objectively predicts the recognition performance of crosslingual acoustic models. This approach does not require conducting of actual speech recognition tests with target-language speech data; nor does it depend on any acoustic measurement techniques. The algorithm is based on a series of linguistic metrics characterizing the articulatory phonetic and phonological information of phonemes from both the target and source languages. It is useful both for validating crosslingual models for speech recognition applications, and for making database acquisition decisions that could prove very cost-beneficial.


doi: 10.21437/Interspeech.2008-674

Cite as: Liu, C., Melnar, L. (2008) A non-acoustic approach to crosslingual speech recognition performance prediction. Proc. Interspeech 2008, 2719-2722, doi: 10.21437/Interspeech.2008-674

@inproceedings{liu08f_interspeech,
  author={Chen Liu and Lynette Melnar},
  title={{A non-acoustic approach to crosslingual speech recognition performance prediction}},
  year=2008,
  booktitle={Proc. Interspeech 2008},
  pages={2719--2722},
  doi={10.21437/Interspeech.2008-674}
}