ISCA Archive Interspeech 2008
ISCA Archive Interspeech 2008

Speech/non-speech segments detection based on chaotic and prosodic features

Soheil Shafiee, Farshad Almasganj, Ayyoob Jafari

Every speech recognition system contains a speech/non-speech detection stage. Detected speech sequences are only passed through the speech recognition stage later on. In a noisy environment, non-speech segments can be an important source of error. In this work, we introduce a new speech/non-speech detection system based on fractal dimension and prosodic features plus the common used MFCC features. We evaluated our system performance using neural network and SVM classifiers on TIMIT speech database with a HMM based speech recognizer. Experimental results show very good performance in speech/non-speech detection.


doi: 10.21437/Interspeech.2008-25

Cite as: Shafiee, S., Almasganj, F., Jafari, A. (2008) Speech/non-speech segments detection based on chaotic and prosodic features. Proc. Interspeech 2008, 111-114, doi: 10.21437/Interspeech.2008-25

@inproceedings{shafiee08_interspeech,
  author={Soheil Shafiee and Farshad Almasganj and Ayyoob Jafari},
  title={{Speech/non-speech segments detection based on chaotic and prosodic features}},
  year=2008,
  booktitle={Proc. Interspeech 2008},
  pages={111--114},
  doi={10.21437/Interspeech.2008-25}
}