Sixth International Conference on Spoken Language Processing
(ICSLP 2000)

Beijing, China
October 16-20, 2000

Temporal Patterns of Critical-Band Spectrum for Text-to-Speech

Pratibha Jain (1), Hynek Hermansky (1,2)

(1) Oregon Graduate Institute of Science and Technology, Portland, OR, USA
(2) International Computer Science Institute, Berkeley, CA, USA

The means of long temporal trajectories of logarithmic critical-band energies in the vicinity of individual phonemes show distinct patterns (TRAPs, Fig. 1) in each critical band for different phonemes. These temporal patterns have been used successfully in automatic speech recognition [1]. Because they capture not only spectral evolution but also the average coarticulation of the phonemes, we examine to what extent they carry information about sound units by synthesizing speech from them.
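As a rough illustration of how such mean patterns might be computed, the sketch below averages fixed-length trajectories of log critical-band energies centered on labeled phonemes, separately for each phoneme and each band. It is not the authors' implementation: the function name `mean_traps`, the input layout (frames by bands), the use of phoneme center frames, and the `half_width` parameter are all assumptions made for this example, and the abstract does not state the trajectory length.

```python
import numpy as np

def mean_traps(log_energies, centers, labels, half_width):
    """Average temporal trajectories per phoneme label and critical band.

    log_energies : array of shape (n_frames, n_bands), log critical-band energies
    centers      : frame index of each phoneme token's center (assumed given)
    labels       : phoneme label for each token, same length as centers
    half_width   : frames kept on each side of the center (illustrative choice)

    Returns a dict mapping each label to an array of shape
    (2 * half_width + 1, n_bands): one mean trajectory per band.
    """
    n_frames = log_energies.shape[0]
    sums, counts = {}, {}
    for center, label in zip(centers, labels):
        lo, hi = center - half_width, center + half_width + 1
        if lo < 0 or hi > n_frames:
            continue  # skip tokens whose window runs past the signal edges
        segment = log_energies[lo:hi]
        sums[label] = sums.get(label, 0.0) + segment
        counts[label] = counts.get(label, 0) + 1
    # Divide the accumulated trajectories by the token count for each label.
    return {label: sums[label] / counts[label] for label in sums}
```

Each column of a returned array is the mean trajectory for one critical band, which is the quantity the abstract describes as differing across phonemes. Tokens too close to the start or end of the signal are simply dropped here; padding would be another reasonable choice.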


Bibliographic reference.  Jain, Pratibha / Hermansky, Hynek (2000): "Temporal patterns of critical-band spectrum for text-to-speech", In ICSLP-2000, vol.2, 439-441.