ISCA Archive Interspeech 2009
ISCA Archive Interspeech 2009

Dynamic features in the linear domain for robust automatic speech recognition in a reverberant environment

Osamu Ichikawa, Takashi Fukuda, Ryuki Tachibana, Masafumi Nishimura

Since the MFCC are calculated from logarithmic spectra, the delta and delta-delta are considered as difference operations in a logarithmic domain. In a reverberant environment, speech signals have trailing reverberations, whose power is plotted as a long-term exponential decay. This means the logarithmic delta value tends to remain large for a long time. This paper proposes a delta feature calculated in the linear domain, due to the rapid decay in reverberant environments. In an experiment using an evaluation framework (CENSREC-4), significant improvements were found in reverberant situations by simply replacing the MFCC dynamic features with the proposed dynamic features.


doi: 10.21437/Interspeech.2009-9

Cite as: Ichikawa, O., Fukuda, T., Tachibana, R., Nishimura, M. (2009) Dynamic features in the linear domain for robust automatic speech recognition in a reverberant environment. Proc. Interspeech 2009, 44-47, doi: 10.21437/Interspeech.2009-9

@inproceedings{ichikawa09_interspeech,
  author={Osamu Ichikawa and Takashi Fukuda and Ryuki Tachibana and Masafumi Nishimura},
  title={{Dynamic features in the linear domain for robust automatic speech recognition in a reverberant environment}},
  year=2009,
  booktitle={Proc. Interspeech 2009},
  pages={44--47},
  doi={10.21437/Interspeech.2009-9}
}