INTERSPEECH 2011
12th Annual Conference of the International Speech Communication Association

Florence, Italy
August 27-31, 2011

Learning Weighted Entity Lists from Web Click Logs for Spoken Language Understanding

Dustin Hillard, Asli Celikyilmaz, Dilek Hakkani-Tür, Gokhan Tur

Microsoft Speech Labs, USA

Named entity lists provide important features for language understanding, but typical lists can contain many ambiguous or incorrect phrases. We present an approach for automatically learning weighted entity lists by mining user clicks from web search logs. The approach significantly outperforms multiple baseline approaches, and the weighted lists improve spoken language understanding tasks such as domain detection and slot filling. Our methods are general and can easily be applied to large quantities of entities across any number of lists.

Bibliographic reference. Hillard, Dustin / Celikyilmaz, Asli / Hakkani-Tür, Dilek / Tur, Gokhan (2011): "Learning weighted entity lists from web click logs for spoken language understanding", In INTERSPEECH-2011, 705-708.