Sixth International Conference on Spoken Language Processing
October 16-20, 2000
Towards a Common Phone Alphabet for Multilingual Speech Recognition
Francisco Palou, P. Bravetti, O. Emam, V. Fischer, E. Janke
IBM Voice Systems, European Speech Research,
New automatic speech recognition applications, mainly for
small and medium vocabulary sizes, demand the capability
of recognizing speech in several languages simultaneously.
We have started exploring the possibility of building
acoustic models that integrate multiple languages (up
to seven in the initial stage), using speech transcriptions
based on a common phoneme alphabet across all the
languages. To reach a common alphabet, we start from the
previously existing alphabets for each one of the seven
languages. We first proceed to simplify some of them,
partially following SAMPA transcription guidelines, and then
to merge phones present in several languages that
correspond to the same IPA symbol.
We study and compare
two variants of the common phoneme alphabet. The first
of these two alphabets is closer to the starting ones, and
includes the use of diphthong phones for English and
German, and long-vowel phones for Arabic, English, and
German. The second one avoids the long-vowel and diphthong
phone models, and also the stressed vowel models. We
present and discuss the results of decoding large
vocabulary dictation tests, comparing the two alphabet variants,
and also the multilingual decoding results with the
corresponding monolingual acoustic models.
Palou, Francisco / Bravetti, P. / Emam, O. / Fischer, V. / Janke, E. (2000):
"Towards a common phone alphabet for multilingual speech recognition",
In ICSLP-2000, vol.1, 501-504.