11th Annual Conference of the International Speech Communication Association

Makuhari, Chiba, Japan
September 26-30. 2010

Jointly Optimized Discriminative Features for Speech Recognition

Tim Ng, Bing Zhang, Long Nguyen

Raytheon BBN Technologies, USA

In the past decade, methods to extract long-term acoustic features for speech recognition using Multi-Layer Perceptrons have been proposed. These features have been proved to be good complementary features in some feature augmentations and/or through system combination. Usually, conventional linear dimension reduction algorithms, e.g. Linear Discriminative Analysis, are not applied on the combined features. In this paper, Region Dependent Transform is applied to jointly optimize the feature combination under a discriminative training criterion. When compared to a conventional augmentation, 3%to 6%relative character error rate reduction forMandarin speech recognition has been achieved using Region Dependent Transform.

Full Paper

Bibliographic reference.  Ng, Tim / Zhang, Bing / Nguyen, Long (2010): "Jointly optimized discriminative features for speech recognition", In INTERSPEECH-2010, 2618-2621.