ISCA Archive Interspeech 2008
ISCA Archive Interspeech 2008

Adaptive-order fractional Fourier transform features for speech recognition

Hui Yin, Xiang Xie, Jingming Kuang

We propose an acoustic feature for speech recognition based on the combination of MFCC and fractional Fourier transform (FrFT). Since the transform order is critical for the performance of FrFT, we use the ambiguity function to adaptively determine the optimal orders of FrFT for each frame. The performance of the proposed feature is compared with traditional MFCCs on recognizing speech of isolated and connected digits under both clean and noisy backgrounds. The recognition results and detailed confusion matrices are given and analyzed, which implies that the proposed feature is promising in certain speech processing fields.


doi: 10.21437/Interspeech.2008-205

Cite as: Yin, H., Xie, X., Kuang, J. (2008) Adaptive-order fractional Fourier transform features for speech recognition. Proc. Interspeech 2008, 654-657, doi: 10.21437/Interspeech.2008-205

@inproceedings{yin08_interspeech,
  author={Hui Yin and Xiang Xie and Jingming Kuang},
  title={{Adaptive-order fractional Fourier transform features for speech recognition}},
  year=2008,
  booktitle={Proc. Interspeech 2008},
  pages={654--657},
  doi={10.21437/Interspeech.2008-205}
}