11th Annual Conference of the International Speech Communication Association

Makuhari, Chiba, Japan
September 26-30. 2010

A Study of Term Weighting in Phonotactic Approach to Spoken Language Recognition

Sirinoot Boonsuk (1), Donglai Zhu (2), Bin Ma (2), Atiwong Suchato (1), Proadpran Punyabukkana (1), Nattanun Thatphithakkul (3), Chai Wutiwiwatchai (3)

(1) Chulalongkorn University, Thailand
(2) A*STAR, Singapore
(3) NECTEC, Thailand

In the spoken language recognition approach of modeling phonetic lattice with the Support Vector Machine (SVM), term weighting on the supervector of N-gram probabilities is critical to the recognition performance because the weighting prevents the SVM kernel from being dominated by a few large probabilities. We investigate several term weighting functions that are used in text retrieval, which can incorporate the long-term semantic modeling in the short-term N-gram modeling. The functions are evaluated on the NIST 2007 Language Recognition Evaluation (LRE) task. Results suggest the term weighting with redundancy of term frequency (rd) which eliminates the redundancy of unit frequency co-occurrence across languages, and the combination of rd and logtf which demonstrates the effectiveness of combining the local and global weighting functions.

Full Paper

Bibliographic reference.  Boonsuk, Sirinoot / Zhu, Donglai / Ma, Bin / Suchato, Atiwong / Punyabukkana, Proadpran / Thatphithakkul, Nattanun / Wutiwiwatchai, Chai (2010): "A study of term weighting in phonotactic approach to spoken language recognition", In INTERSPEECH-2010, 2714-2717.