8th Annual Conference of the International Speech Communication Association

Antwerp, Belgium
August 27-31, 2007

Event Detection of Speech Signals Based on Auditory Processing with a Dynamic Compressive Gammachirp Filterbank

Satomi Tanaka (1), Minoru Tsuzaki (1), Hiroaki Kato (2), Yoshinori Sagisaka (3)

(1) Kyoto City University of Arts, Japan
(2) ATR-CIS, Japan
(3) Waseda University, Japan

To simulate the perceptual extraction of temporal structures of speech, the authors have been proposing an event-plausibility model that detects the occurrence of subevents in continuous speech signals based on a auditory processing. One of its core components is the filterbank module that simulates the mechanical frequency analysis of the basilar membrane in the cochlea. In this paper, output by the new model using a dynamic compressive gammachirp (dcGC) auditory filterbank was compared with the previous model using a gammatone auditory filterbank. The most important difference between these filters was the nonlinear dynamic level-dependence of the new filter; the previous filterbank was linear. Simulation results revealed that no significant advantage for the new filter (dcGC) was observed for event detection by the event-plausibility model, which suggests that the algorithm for the event-plausibility model has robustness against differences in peripheral auditory processing.

Full Paper

Bibliographic reference.  Tanaka, Satomi / Tsuzaki, Minoru / Kato, Hiroaki / Sagisaka, Yoshinori (2007): "Event detection of speech signals based on auditory processing with a dynamic compressive gammachirp filterbank", In INTERSPEECH-2007, 1949-1952.