ISCA Archive Eurospeech 1999
ISCA Archive Eurospeech 1999

The use of 'rare' segments for language identification

Jean-Marie Hombert, Ian Maddieson

Knowledge of the distribution of rare segments across the languages of the world might be used in identifying languages within an open set. Segments which are both discriminatory (i.e. rare) and robust (i.e. easy to identify) are the best targets for efficient language identification. Considering several properties at the same time allows to use more common segments and/or features in a still very discriminatory way.


doi: 10.21437/Eurospeech.1999-98

Cite as: Hombert, J.-M., Maddieson, I. (1999) The use of 'rare' segments for language identification. Proc. 6th European Conference on Speech Communication and Technology (Eurospeech 1999), 379-382, doi: 10.21437/Eurospeech.1999-98

@inproceedings{hombert99_eurospeech,
  author={Jean-Marie Hombert and Ian Maddieson},
  title={{The use of 'rare' segments for language identification}},
  year=1999,
  booktitle={Proc. 6th European Conference on Speech Communication and Technology (Eurospeech 1999)},
  pages={379--382},
  doi={10.21437/Eurospeech.1999-98}
}