This paper proposes a spoken term detection using syllable transition network (STN) derived from multiple speech recognizers. An STN is similar to a sub-word based confusion network, which is derived from the output of a speech recognizer. The one we proposed is derived from the outputs of multiple speech recognition systems, which is well known to be robust to certain recognition errors and the out-of-vocabulary problem. Therefore, the STN should also be robust to recognition errors on the STD. This experiment showed that the STN was very effective at detecting out-of-vocabulary terms, improving detection rate to 83%, which was as high as the in-vocabulary term detection performance.
Bibliographic reference. Natori, Satoshi / Nishizaki, Hiromitsu / Sekiguchi, Yoshihiro (2010): "Japanese spoken term detection using syllable transition network derived from multiple speech recognizers' outputs", In INTERSPEECH-2010, 681-684.