ISCA Archive ICSLP 2000
ISCA Archive ICSLP 2000

Cross-language use of acoustic information for automatic speech recognition

C. Nieuwoudt, Elizabeth C. Botha

Techniques are investigated that use acoustic information from existing source language databases to implement automatic speech recognition (ASR) systems for new target languages for which little data are available. Strategies for cross-language use of acoustic information are proposed and are implemented via maximum a posteriori probability (MAP) and transformation-based techniques, as well as via discriminative learning techniques. The discriminative learning technique used is based on a cost-based extension of the minimum classification error (MCE) approach. Experiments are performed using relatively large amounts of English speech data from either a separate database or from the same database as smaller amounts of Afrikaans speech data to improve the performance of an Afrikaans speech recogniser. Results indicate that a significant reduction in word error rate is achievable (between 14% and 48% for experiments), depending on the method used and the amount of target language data available.


Cite as: Nieuwoudt, C., Botha, E.C. (2000) Cross-language use of acoustic information for automatic speech recognition. Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000), vol. 3, 722-725

@inproceedings{nieuwoudt00_icslp,
  author={C. Nieuwoudt and Elizabeth C. Botha},
  title={{Cross-language use of acoustic information for automatic speech recognition}},
  year=2000,
  booktitle={Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000)},
  pages={vol. 3, 722-725}
}