ISCA Archive ICSLP 1998
ISCA Archive ICSLP 1998

Recurrent substrings and data fusion for language recognition

Harvey Lloyd-Thomas, Eluned S. Parris, Jeremy H. Wright

Recurrent phone substrings that are characteristic of a language are a promising technique for language recognition. In previous work on language recognition, building anti-models to normalise the scores from acoustic phone models for target languages, has been shown to reduce the Equal Error Rate (EER) by a third. Recurrent substrings and anti-models have now been applied alongside three other techniques (bigrams, usefulness and frequency histograms) to the NIST 1996 Language Recognition Evaluation, using data from the CALLFRIEND and OGI databases for training. By fusing scores from the different techniques using a multi-layer perceptron the ERR on the NIST data can be reduced further.


doi: 10.21437/ICSLP.1998-222

Cite as: Lloyd-Thomas, H., Parris, E.S., Wright, J.H. (1998) Recurrent substrings and data fusion for language recognition. Proc. 5th International Conference on Spoken Language Processing (ICSLP 1998), paper 1061, doi: 10.21437/ICSLP.1998-222

@inproceedings{lloydthomas98_icslp,
  author={Harvey Lloyd-Thomas and Eluned S. Parris and Jeremy H. Wright},
  title={{Recurrent substrings and data fusion for language recognition}},
  year=1998,
  booktitle={Proc. 5th International Conference on Spoken Language Processing (ICSLP 1998)},
  pages={paper 1061},
  doi={10.21437/ICSLP.1998-222}
}