ISCA Archive VOQUAL 2003
ISCA Archive VOQUAL 2003

Entropy and dynamism criteria for voice quality classification applications

Peter D. Kukharchik, Igor E. Kheidorov, Hanna M. Lukashevich, Denis L. Mitrofanov

We describe the voice quality classification system that uses entropy and dynamism criteria as discrimination features. The main idea of this approach is that the input neural net is considered as an informational channel. Channel tuned to the certain type of information transmits it best of all according to the informational criterion. In our case a multilayer perceptron (MLP) emitted posterior probabilities for speech recognition was used as such information channel. Then two features entropy and dynamism were computed using these posterior probabilities. And finally HMM was used as a classifier. Different experiments demonstrated efficient usage possibilities of entropy and dynamism criteria not only in audio classification tasks but also in the voice quality classification applications.


Cite as: Kukharchik, P.D., Kheidorov, I.E., Lukashevich, H.M., Mitrofanov, D.L. (2003) Entropy and dynamism criteria for voice quality classification applications. Proc. Voice Quality: Functions, Analysis and Synthesis (VOQUAL 2003), 91-96

@inproceedings{kukharchik03_voqual,
  author={Peter D. Kukharchik and Igor E. Kheidorov and Hanna M. Lukashevich and Denis L. Mitrofanov},
  title={{Entropy and dynamism criteria for voice quality classification applications}},
  year=2003,
  booktitle={Proc. Voice Quality: Functions, Analysis and Synthesis (VOQUAL 2003)},
  pages={91--96}
}