In the past decade, methods to extract long-term acoustic features for speech recognition using Multi-Layer Perceptrons have been proposed. These features have been proved to be good complementary features in some feature augmentations and/or through system combination. Usually, conventional linear dimension reduction algorithms, e.g. Linear Discriminative Analysis, are not applied on the combined features. In this paper, Region Dependent Transform is applied to jointly optimize the feature combination under a discriminative training criterion. When compared to a conventional augmentation, 3%to 6%relative character error rate reduction forMandarin speech recognition has been achieved using Region Dependent Transform.
Bibliographic reference. Ng, Tim / Zhang, Bing / Nguyen, Long (2010): "Jointly optimized discriminative features for speech recognition", In INTERSPEECH-2010, 2618-2621.