8th Annual Conference of the International Speech Communication Association

Antwerp, Belgium
August 27-31, 2007

Discriminative Optimization of Language Adapted HMMs for a Language Identification System Based on Parallel Phoneme Recognizers

Josef G. Bauer, Bernt Andrassy, Ekaterina Timoshenko

Siemens AG, Germany

Recently an unsupervised learning scheme for Hidden Markov Models (HMMs) used in acoustical Language Identification (LID) based on Parallel Phoneme Recognizers (PPR) was proposed. This avoids the high costs for orthographically transcribed speech data and phonetic lexica but was found to introduce a considerable increase of classification errors. Also very recently discriminative Minimum Language Identification Error (MLIDE) optimization of HMMs for PPR based LID was introduced that again only requires language tagged speech data and an initial HMM. The described work shows how to combine both approaches to an unsupervised and discriminative learning scheme. Experimental results on large telephone speech databases show that using MLIDE the relative increase in error rate introduced by unsupervised learning can be reduced from 61% to 26%. The absolute difference in LID error rate due to the supervised learning step is reduced from 4.1% to 0.8%.

Full Paper

Bibliographic reference.  Bauer, Josef G. / Andrassy, Bernt / Timoshenko, Ekaterina (2007): "Discriminative optimization of language adapted HMMs for a language identification system based on parallel phoneme recognizers", In INTERSPEECH-2007, 166-169.