Current language identification systems vary significantly in complexity. Systems that use higher-level linguistic information perform best, but that information is hard to collect for each new language. The system presented in this paper is easily extended to new languages because it uses very little linguistic information: it needs only one language-specific phone recogniser (in our case the Portuguese one) and is trained with speech from each of the other languages. On the SpeechDat-M corpus, with 6 European languages (English, French, German, Italian, Portuguese and Spanish), our system achieved an identification rate of 83.4% on 5-second utterances. This result is an improvement of 5% over our previous version, mainly through the use of a neural network classifier. Both the baseline and the full system were implemented in real time.
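To make the single-recogniser idea concrete, the following minimal sketch shows one common way such a system can be organised: one phone recogniser decodes all speech, and a separate phone-sequence model is trained per language on that recogniser's output. The abstract does not state which models are used, so the bigram models, the function names (`train_bigram_model`, `score`, `identify`), the toy phone strings and the language labels are all assumptions for illustration. The final decision here is a plain argmax over scores, where the paper reports a neural network classifier.

```python
import math
from collections import Counter


def train_bigram_model(phone_sequences, vocab, alpha=1.0):
    """Estimate add-alpha smoothed bigram log-probabilities from phone strings."""
    pair_counts = Counter()
    context_counts = Counter()
    for seq in phone_sequences:
        for prev, cur in zip(seq, seq[1:]):
            pair_counts[(prev, cur)] += 1
            context_counts[prev] += 1
    v = len(vocab)
    return {
        (p, c): math.log((pair_counts[(p, c)] + alpha) / (context_counts[p] + alpha * v))
        for p in vocab
        for c in vocab
    }


def score(model, seq):
    """Average bigram log-likelihood of a decoded phone sequence."""
    pairs = list(zip(seq, seq[1:]))
    if not pairs:
        return float("-inf")
    return sum(model[pair] for pair in pairs) / len(pairs)


def identify(models, seq):
    """Pick the language whose model scores the phone sequence highest.

    Assumption: a simple argmax stands in for the neural network
    classifier mentioned in the abstract.
    """
    scores = {lang: score(model, seq) for lang, model in models.items()}
    return max(scores, key=scores.get), scores


# Hypothetical toy data: phone strings as they might come out of the single
# phone recogniser, grouped by the language of the training speech.
vocab = ["a", "e", "s", "t"]
training = {
    "lang_A": [list("sasasata"), list("tasasa")],
    "lang_B": [list("etetese"), list("tetete")],
}
models = {lang: train_bigram_model(seqs, vocab) for lang, seqs in training.items()}

best, scores = identify(models, list("sasata"))
print(best, scores)
```

Under this assumed design, adding a new language requires only speech in that language, decoded by the existing recogniser, to train one more phone-sequence model; no new language-specific recogniser is needed.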
Cite as: Caseiro, D., Trancoso, I.M. (1998) Spoken language identification using the speechdat corpus. Proc. 5th International Conference on Spoken Language Processing (ICSLP 1998), paper 1093, doi: 10.21437/ICSLP.1998-256
@inproceedings{caseiro98_icslp,
  author={Diamantino Caseiro and Isabel M. Trancoso},
  title={{Spoken language identification using the speechdat corpus}},
  year=1998,
  booktitle={Proc. 5th International Conference on Spoken Language Processing (ICSLP 1998)},
  pages={paper 1093},
  doi={10.21437/ICSLP.1998-256}
}