ISCA Archive Interspeech 2009

Discriminative feature transformation using output coding for speech recognition

Omid Dehzangi, Bin Ma, Eng Siong Chng, Haizhou Li

In this paper, we present a new mechanism for extracting discriminative acoustic features for speech recognition, using feature transformation based on continuous output coding (COC). The proposed method first expands the short-time spectral features into a higher-dimensional feature space by polynomial expansion, to improve their discriminative capability. The high-dimensional features are then projected into a lower-dimensional space using a continuous output coding technique implemented by a set of linear SVMs. The resulting feature vectors are designed to encode the differences between phones. The generated features are shown to be more discriminative than MFCCs, and experimental results on both the TIMIT and NTIMIT corpora show better phone recognition accuracy with the proposed features.
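The two stages described above (polynomial expansion, then projection through a bank of linear SVMs) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the second-order expansion, the one-vs-rest code matrix, the subgradient-descent SVM trainer, the reading of "continuous" as real-valued SVM decision values, the synthetic two-dimensional data, and all function names (`poly_expand`, `train_linear_svm`, `train_code_svms`, `coc_transform`).

```python
import numpy as np
from itertools import combinations_with_replacement


def poly_expand(X, degree=2):
    """Return all monomials of the input features from order 1 up to `degree`."""
    n_features = X.shape[1]
    columns = []
    for order in range(1, degree + 1):
        for idx in combinations_with_replacement(range(n_features), order):
            columns.append(np.prod(X[:, list(idx)], axis=1))
    return np.stack(columns, axis=1)


def train_linear_svm(X, y, lam=0.01, lr=0.1, epochs=200):
    """Full-batch subgradient descent on the L2-regularised hinge loss.

    y must contain labels in {-1, +1}. This is an illustrative trainer,
    not the solver used by the authors.
    """
    n_samples, n_features = X.shape
    w = np.zeros(n_features)
    b = 0.0
    for _ in range(epochs):
        margins = y * (X @ w + b)
        active = margins < 1  # samples that violate the margin
        grad_w = lam * w - (y[active][:, None] * X[active]).sum(axis=0) / n_samples
        grad_b = -y[active].sum() / n_samples
        w -= lr * grad_w
        b -= lr * grad_b
    return w, b


def train_code_svms(X, labels, code_matrix):
    """Train one linear SVM per column (bit) of the +/-1 code matrix."""
    weights, biases = [], []
    for bit in range(code_matrix.shape[1]):
        y = code_matrix[labels, bit]  # binary target for this bit
        w_k, b_k = train_linear_svm(X, y)
        weights.append(w_k)
        biases.append(b_k)
    return np.array(weights), np.array(biases)


def coc_transform(X, W, b):
    """Project expanded features onto the real-valued SVM decision values."""
    return X @ W.T + b


# Synthetic stand-in for spectral frames: 3 "phone" classes in 2 dimensions.
rng = np.random.default_rng(0)
means = np.array([[0.0, 0.0], [2.0, 2.0], [-2.0, 2.0]])
labels = np.repeat(np.arange(3), 50)
frames = means[labels] + 0.5 * rng.standard_normal((150, 2))

expanded = poly_expand(frames, degree=2)   # shape (150, 5)
code_matrix = 2 * np.eye(3) - 1            # assumed one-vs-rest codes
W, b = train_code_svms(expanded, labels, code_matrix)
features = coc_transform(expanded, W, b)   # shape (150, 3)
print(expanded.shape, features.shape)
```

In this sketch the output dimension equals the number of columns in the code matrix, so a different code design would change the size of the transformed feature vector.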


doi: 10.21437/Interspeech.2009-754

Cite as: Dehzangi, O., Ma, B., Chng, E.S., Li, H. (2009) Discriminative feature transformation using output coding for speech recognition. Proc. Interspeech 2009, 2979-2982, doi: 10.21437/Interspeech.2009-754

@inproceedings{dehzangi09_interspeech,
  author={Omid Dehzangi and Bin Ma and Eng Siong Chng and Haizhou Li},
  title={{Discriminative feature transformation using output coding for speech recognition}},
  year=2009,
  booktitle={Proc. Interspeech 2009},
  pages={2979--2982},
  doi={10.21437/Interspeech.2009-754}
}