A variety of approaches to language identification, based on (a) acoustic features, (b) broad-category segmentation, and (c) fine phonetic classification, are introduced. These approaches are evaluated in terms of their ability to distinguish between English and Japanese utterances spoken over a telephone channel. It is found that the best performance (86.3 % accurate classification of utterances with a mean length of 13.4 sec) is obtained when fine phonetic features are employed. In addition, the results show the importance of discriminatory training rather than likelihood estimation.
Bibliographic reference. Muthusamy, Yeshwant / Berkling, Kay / Arai, Takayuki / Cole, Ronald / Barnard, Etienne (1993): "A comparison of approaches to automatic language identification using telephone speech", In EUROSPEECH'93, 1307-1310.