Voice Quality: Functions, Analysis and Synthesis
August 27-29, 2003
We describe the voice quality classification system that uses entropy and dynamism criteria as discrimination features. The main idea of this approach is that the input neural net is considered as an informational channel. Channel tuned to the certain type of information transmits it best of all according to the informational criterion. In our case a multilayer perceptron (MLP) emitted posterior probabilities for speech recognition was used as such information channel. Then two features entropy and dynamism were computed using these posterior probabilities. And finally HMM was used as a classifier. Different experiments demonstrated efficient usage possibilities of entropy and dynamism criteria not only in audio classification tasks but also in the voice quality classification applications.
Full Paper Presentation (PDF; 1201 KB) Presentation (Powerpoint)
Bibliographic reference. Kukharchik, Peter D. / Kheidorov, Igor E. / Lukashevich, Hanna M. / Mitrofanov, Denis L. (2003): "Entropy and dynamism criteria for voice quality classification applications", In VOQUAL'03, 91-96.