12th Annual Conference of the International Speech Communication Association

Florence, Italy
August 27-31, 2011

Evaluation of Fast Spoken Term Detection Using a Suffix Array

Kouichi Katsurada, Shinta Sawada, Shigeki Teshima, Yurie Iribe, Tsuneo Nitta

Toyohashi University of Technology, Japan

We previously proposed [1] a fast spoken term detection method that uses a suffix array as the data structure for searching large-scale speech documents. In this method, a keyword is divided into sub-keywords, and the phoneme sequences that contain two or more sub-keywords are output as results. Although the search runs very quickly on a 10,000-hour speech database, in [1] we only proposed a variety of matching procedures without comparing them. In this paper, we compare matching procedures that differ in the number of phonemes per sub-keyword and in the number of sub-keywords a search result is required to contain. We also compare the performance and processing time of our method with those of a typical spoken term detection method that uses an inverted index.


  1. Katsurada, K., Teshima, S. and Nitta, T., “Fast Keyword Detection Using Suffix Array”, INTERSPEECH 2009, pp. 2147-2150, 2009.
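
The sub-keyword search described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: it assumes exact matching of each sub-keyword, a naive suffix array build, and a simple alignment vote in which a candidate is reported when at least `min_hits` sub-keywords agree on the same keyword start position. All function names, the parameters `n` (phonemes per sub-keyword) and `min_hits` (required number of sub-keywords), and the toy phoneme sequence are hypothetical.

```python
from collections import defaultdict


def build_suffix_array(phonemes):
    """Return suffix start positions sorted by suffix (naive build, for illustration)."""
    return sorted(range(len(phonemes)), key=lambda i: phonemes[i:])


def find_occurrences(phonemes, suffix_array, pattern):
    """Return sorted start positions of `pattern`, found by binary search."""
    m = len(pattern)
    # Lower bound: first suffix whose m-phoneme prefix is >= pattern.
    lo, hi = 0, len(suffix_array)
    while lo < hi:
        mid = (lo + hi) // 2
        if phonemes[suffix_array[mid]:suffix_array[mid] + m] < pattern:
            lo = mid + 1
        else:
            hi = mid
    start = lo
    # Upper bound: first suffix whose m-phoneme prefix is > pattern.
    hi = len(suffix_array)
    while lo < hi:
        mid = (lo + hi) // 2
        if phonemes[suffix_array[mid]:suffix_array[mid] + m] <= pattern:
            lo = mid + 1
        else:
            hi = mid
    return sorted(suffix_array[start:lo])


def split_keyword(keyword, n):
    """Split a keyword into (offset, sub-keyword) pairs of n phonemes each.

    The last sub-keyword may be shorter than n.
    """
    return [(off, keyword[off:off + n]) for off in range(0, len(keyword), n)]


def detect(phonemes, suffix_array, keyword, n=3, min_hits=2):
    """Return candidate keyword start positions supported by >= min_hits sub-keywords.

    Assumption: sub-keywords must line up at their original offsets; this is
    a simplification chosen for the sketch, not taken from the paper.
    """
    votes = defaultdict(int)
    for offset, sub in split_keyword(keyword, n):
        for pos in find_occurrences(phonemes, suffix_array, sub):
            votes[pos - offset] += 1
    return sorted(start for start, count in votes.items() if count >= min_hits)


# Toy phoneme sequence (hypothetical data).
doc = "k o N n i ch i w a k y o o w a t e N k i g a i i".split()
sa = build_suffix_array(doc)
print(detect(doc, sa, "t e N k i g a".split()))  # [15]
```

Varying `n` and `min_hits` in this sketch corresponds to the two quantities the paper compares: the number of phonemes in a sub-keyword and the number of sub-keywords a result must contain.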


Bibliographic reference.  Katsurada, Kouichi / Sawada, Shinta / Teshima, Shigeki / Iribe, Yurie / Nitta, Tsuneo (2011): "Evaluation of fast spoken term detection using a suffix array", In INTERSPEECH-2011, 909-912.