4th International Conference on Spoken Language Processing
Philadelphia, PA, USA
We propose a new approach for detecting topic boundaries and keywords in arbitrary speech, with neither recognition nor prosodic processing, aiming at quick access to the content of recorded raw speech. This approach is based on the general tendency that frequently-repeated phrases/words in speech are characteristic of topics in discourse, so it uses pairs of phonetically similar segments (PPSSs) of speech to represent topics in speech. This approach has the advantage of being domain and language-independent and robust against variations in the speaker and background noise, as it needs neither a language nor acoustic model in advance. Experiments using simulated dialogues confirmed the good performance of this approach. We also propose Incremental Reference Interval-free Continuous Dynamic Programming (IRIFCDP) as an algorithm for detecting PPSSs in speech for the above method. IRIFCDP can detect PPSSs efficiently in synchronization with the speech, so it is suitable for handling long speech samples.
Bibliographic reference. Kiyama, Jiro / Itoh, Yoshiaki / Oka, Ryuichi (1996): "Automatic detection of topic boundaries and keywords in arbitrary speech using incremental reference interval-free continuous DP", In ICSLP-1996, 1946-1949.