EUROSPEECH '97
5th European Conference on Speech Communication and Technology

Rhodes, Greece
September 22-25, 1997


Subword Unit Representations for Spoken Document Retrieval

Kenney Ng, Victor W. Zue

Spoken Language Systems Group MIT Laboratory for Computer Science, Cambridge, MA, USA

This paper investigates the feasibility of using subword unit representations for spoken document retrieval as an alternative to using words generated by either keyword spotting or word recognition. Our investigation is motivated by the observation that word-based retrieval approaches face the problem of either having to know the keywords to search for a priori, or requiring a very large recognition vocabulary in order to cover the contents of growing and diverse message collections. In this study, we examine a range of subword units of varying complexity derived from phonetic transcriptions. The basic underlying unit is the phone; more and less complex units are derived by varying the level of detail and the length of sequences of the phonetic units. We measure the ability of the different subword units to effectively index and retrieve a large collection of recorded speech messages. We also compare their performance when the underlying phonetic transcriptions are perfect and when they contain phonetic recognition errors.

Full Paper

Bibliographic reference.  Ng, Kenney / Zue, Victor W. (1997): "Subword unit representations for spoken document retrieval", In EUROSPEECH-1997, 1607-1610.