INTERSPEECH 2004 - ICSLP
8th International Conference on Spoken Language Processing

Jeju Island, Korea
October 4-8, 2004

Phonetic Confusion Based Document Expansion for Spoken Document Retrieval

Nicolas Moreau, Hyoung-Gook Kim, Thomas Sikora

Technical University Berlin, Germany

This paper presents a phone-based approach of spoken document retrieval (SDR), developed in the framework of the emerging MPEG-7 standard. We describe an indexing and retrieval system that uses phonetic information only. The retrieval method is based on the vector space IR model, using phone N-grams as indexing terms. We propose a technique to expand the representation of documents by means of phone confusion probabilities in order to improve the retrieval performance. This method is tested on a collection of short German spoken documents, using 10 city names as queries.

Full Paper

Bibliographic reference.  Moreau, Nicolas / Kim, Hyoung-Gook / Sikora, Thomas (2004): "Phonetic confusion based document expansion for spoken document retrieval", In INTERSPEECH-2004, 1593-1596.