12th Annual Conference of the International Speech Communication Association

Florence, Italy
August 27-31, 2011

A New Phonetic Candidate Generator for Improving Search Query Efficiency

Bo Peng (1), Yao Qian (1), Frank K. Soong (1), Bo Zhang (2)

(1) Microsoft Research Asia, China
(2) Nankai University, China

Misspelled queries caused by homophones or mispronunciation are difficult to correct with conventional spelling correction methods. A phonetic candidate generator produces candidates that are phonetically similar to a given query. In this paper, we present a new phonetic candidate generator for improving the search efficiency of a query. The proposed generator consists of three modules: letter-to-sound (LTS) conversion, a phonetic "trie", and a phonetic similarity estimator based on the Levenshtein distance and the Kullback-Leibler divergence (KLD) between phones. The generator yields a significant improvement over Double Metaphone in terms of candidate accuracy and effective candidate set size.
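As a rough illustration of how the three modules might fit together, the following Python sketch stores phone sequences in a trie and searches it with a weighted Levenshtein distance. It is not the authors' implementation. The toy lexicon (TOY_LEXICON) stands in for a real LTS module, the substitution table (SUB_COST) is a hand-picked placeholder for KLD-derived phone distances, and all names, phone symbols, and cost values are assumptions made for this example.

```python
from typing import Dict, List, Tuple

# Hypothetical hand-written pronunciations standing in for an LTS module.
TOY_LEXICON: Dict[str, Tuple[str, ...]] = {
    "night": ("n", "ay", "t"),
    "knight": ("n", "ay", "t"),
    "neat": ("n", "iy", "t"),
    "note": ("n", "ow", "t"),
    "kite": ("k", "ay", "t"),
}

# Hypothetical substitution costs; placeholders for KLD-based phone distances.
SUB_COST: Dict[frozenset, float] = {
    frozenset(("ay", "iy")): 0.4,
    frozenset(("ay", "ow")): 0.6,
    frozenset(("iy", "ow")): 0.7,
}
INS_DEL_COST = 1.0  # assumed flat cost for inserting or deleting a phone


def sub_cost(a: str, b: str) -> float:
    """Cost of substituting phone a for phone b (1.0 if the pair is unlisted)."""
    if a == b:
        return 0.0
    return SUB_COST.get(frozenset((a, b)), 1.0)


class TrieNode:
    def __init__(self) -> None:
        self.children: Dict[str, "TrieNode"] = {}
        self.words: List[str] = []  # words whose pronunciation ends here


def build_trie(lexicon: Dict[str, Tuple[str, ...]]) -> TrieNode:
    """Insert every pronunciation into a trie keyed by phone."""
    root = TrieNode()
    for word, phones in lexicon.items():
        node = root
        for phone in phones:
            node = node.children.setdefault(phone, TrieNode())
        node.words.append(word)
    return root


def candidates(root: TrieNode, query: Tuple[str, ...],
               max_cost: float) -> List[Tuple[str, float]]:
    """Return (word, cost) pairs whose weighted edit distance is <= max_cost."""
    results: Dict[str, float] = {}
    first_row = [j * INS_DEL_COST for j in range(len(query) + 1)]

    def walk(node: TrieNode, phone: str, prev_row: List[float]) -> None:
        # Compute one row of the edit-distance table for this trie edge.
        row = [prev_row[0] + INS_DEL_COST]
        for j in range(1, len(query) + 1):
            row.append(min(
                row[j - 1] + INS_DEL_COST,
                prev_row[j] + INS_DEL_COST,
                prev_row[j - 1] + sub_cost(phone, query[j - 1]),
            ))
        if node.words and row[-1] <= max_cost:
            for word in node.words:
                results[word] = row[-1]
        # Prune: costs never decrease, so stop if every cell exceeds the budget.
        if min(row) <= max_cost:
            for next_phone, child in node.children.items():
                walk(child, next_phone, row)

    for phone, child in root.children.items():
        walk(child, phone, first_row)
    return sorted(results.items(), key=lambda kv: (kv[1], kv[0]))
```

With these toy values, calling candidates(build_trie(TOY_LEXICON), ("n", "ay", "t"), 0.5) returns "knight" and "night" at cost 0.0 and "neat" at cost 0.4, while "note" and "kite" fall outside the budget. Sharing prefixes in the trie lets one row of the distance table serve every word below a node, and the pruning step is what keeps the candidate set small.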


Bibliographic reference. Peng, Bo / Qian, Yao / Soong, Frank K. / Zhang, Bo (2011): "A new phonetic candidate generator for improving search query efficiency", In INTERSPEECH-2011, 1117-1120.