Interspeech'2005 - Eurospeech
We investigated the intelligibility of four-mora words degraded by a system based on the high-quality vocoder STRAIGHT and a warped DCT. This system lets us independently manipulate the essential speech parameters for vocal tract filtering and glottal excitation. We report the perceptual effects of: 1) ‘temporal smearing’, or reduced temporal modulation; 2) ‘time-frequency smearing’, or reduced resolution in both temporal modulation and spectral peaks; and 3) ‘source smearing’, or reduced resolution of glottal pulses. By analyzing intelligibility scores from the various experiments, we quantitatively confirmed that there are linguistic dependencies of phonemes and morae within words.
Bibliographic reference. Irino, Toshio / Satou, Satoru / Nomura, Shunsuke / Banno, Hideki / Kawahara, Hideki (2005): "Speech intelligibility derived from time-frequency and source smearing", In INTERSPEECH-2005, 1737-1740.