8th Annual Conference of the International Speech Communication Association

Antwerp, Belgium
August 27-31, 2007

The SRI/OGI 2006 Spoken Term Detection System

Dimitra Vergyri (1), Izhak Shafran (2), Andreas Stolcke (1), Ramana R. Gadde (1), Murat Akbacak (1), Brian Roark (2), Wen Wang (1)

(1) SRI International, USA
(2) Oregon Health & Science University, USA

This paper describes the system developed jointly at SRI and OGI for participation in the 2006 NIST Spoken Term Detection (STD) evaluation. We participated in the three genres of the English track: Broadcast News (BN), Conversational Telephone Speech (CTS), and Conference Meetings (MTG). The system consists of two phases. First, audio indexing, an offline phase, converts the input speech waveform into a searchable index. Second, term retrieval, possibly an online phase, returns a ranked list of occurrences for each search term. We used a word-based indexing approach, obtained with SRI's large vocabulary Speech-to-Text (STT) system.

Apart from describing the submitted system and its performance on the NIST evaluation metric, we study the tradeoffs between performance and system design. We examine performance versus indexing speed, effectiveness of different index ranking schemes on the NIST score, and the utility of approaches to deal with out-of-vocabulary (OOV) terms.

Full Paper

Bibliographic reference.  Vergyri, Dimitra / Shafran, Izhak / Stolcke, Andreas / Gadde, Ramana R. / Akbacak, Murat / Roark, Brian / Wang, Wen (2007): "The SRI/OGI 2006 spoken term detection system", In INTERSPEECH-2007, 2393-2396.