16th Annual Conference of the International Speech Communication Association

Dresden, Germany
September 6-10, 2015

DNN Derived Filters for Processing of Modulation Spectrum of Speech

Jan Pešán, Lukáš Burget, Hynek Hermansky, Karel Veselý

Brno University of Technology, Czech Republic

We propose a novel approach that uses a deep neural network (DNN) to design modulation frequency filters for the first-stage processing of the critical-band spectrum of speech. These filters replace the conventional modulation frequency filters currently used in the state-of-the-art BUT speech recognition system and yield about a 10% relative improvement in phoneme recognition accuracy. The resulting filters are consistent with some known temporal properties of higher levels of mammalian auditory processing and suggest a more efficient scheme for pre-processing speech for automatic speech recognition (ASR).
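To make the term concrete, the sketch below illustrates what applying a modulation frequency filter to a critical-band spectrum can look like: each band's energy trajectory over time is replaced by a weighted sum over neighboring frames. This is only an illustration under stated assumptions, not the method of the paper. The function name `apply_modulation_filter`, the input layout, and the three-tap filter are hypothetical; in the paper the filter shapes are derived with a DNN rather than chosen by hand.

```python
def apply_modulation_filter(band_energies, taps):
    """Filter each critical band's time trajectory with one FIR filter.

    band_energies: list of frames, each a list of per-band (log) energies.
    taps: filter weights, odd length, centered on the current frame.
    Returns a list of frames with the same shape as the input.
    """
    if len(taps) % 2 == 0:
        raise ValueError("taps must have odd length")
    half = len(taps) // 2
    n_frames = len(band_energies)
    n_bands = len(band_energies[0]) if n_frames else 0
    out = [[0.0] * n_bands for _ in range(n_frames)]
    for t in range(n_frames):
        for k, w in enumerate(taps):
            src = t + k - half  # index of the frame under tap k
            if 0 <= src < n_frames:  # frames outside the utterance count as zero
                for b in range(n_bands):
                    out[t][b] += w * band_energies[src][b]
    return out


# Hypothetical hand-picked filter (a central difference). It removes the
# constant (zero modulation frequency) component of every band trajectory.
taps = [-0.5, 0.0, 0.5]

# Toy input: 4 frames x 2 bands. Band 0 is constant, band 1 rises steadily.
frames = [[1.0, 0.0], [1.0, 1.0], [1.0, 2.0], [1.0, 3.0]]

filtered = apply_modulation_filter(frames, taps)
```

In this toy input, the interior frames of the constant band are filtered to zero while the steadily rising band yields a constant slope, which is the sense in which such a filter selects certain rates of temporal change. The first and last frames are affected by the zero padding at the utterance edges.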

Bibliographic reference. Pešán, Jan / Burget, Lukáš / Hermansky, Hynek / Veselý, Karel (2015): "DNN derived filters for processing of modulation spectrum of speech", in INTERSPEECH-2015, 1908-1911.