8th Annual Conference of the International Speech Communication Association

Antwerp, Belgium
August 27-31, 2007

Hybridizing Conversational and Clear Speech

Akiko Kusumoto, Alexander B. Kain, John-Paul Hosom, Jan P. H. van Santen

Oregon Health & Science University, USA

"Clear" (clr) speech is a speaking style that speakers adopt to be understood correctly in a difficult communication environment. Studies have shown that clr speech, as opposed to "conversational" (cnv) speech, has significantly higher intelligibility in various conditions. While many differences in acoustic features have been identified, it is not known which individual feature or combination of features causes the higher intelligibility of clr speech. The objectives of the current study are to examine whether it is possible to improve speech intelligibility by approximating clr speech features and to determine which acoustic features contribute to intelligibility. Our approach uses a hybridization algorithm to create speech samples that combine acoustic features of cnv and clr speech. Results with normal-hearing listeners showed significant sentence-level intelligibility improvements of 11-23% over cnv speech when replacing certain acoustic features with those from clr speech.
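
As an illustration of the hybridization idea only, the following minimal sketch builds a hybrid feature set by taking selected features from a clr recording and the rest from the cnv recording of the same sentence. The feature names, the `hybridize` function, and the dictionary representation are assumptions made for this example; the abstract does not specify which features were used or how the algorithm is implemented. A real system would presumably also have to time-align the two recordings before swapping features, which is omitted here.

```python
# Hypothetical feature names; the abstract does not list the features used.
FEATURES = ("spectrum", "f0", "energy", "duration")


def hybridize(cnv: dict, clr: dict, from_clr: set) -> dict:
    """Take the features named in `from_clr` from the clr recording
    and all remaining features from the cnv recording."""
    unknown = set(from_clr) - set(FEATURES)
    if unknown:
        raise ValueError(f"unknown features: {sorted(unknown)}")
    return {
        name: (clr[name] if name in from_clr else cnv[name])
        for name in FEATURES
    }


# Placeholder values stand in for real feature tracks (assumed, not from the paper).
cnv_features = {"spectrum": "cnv-S", "f0": "cnv-F", "energy": "cnv-E", "duration": "cnv-D"}
clr_features = {"spectrum": "clr-S", "f0": "clr-F", "energy": "clr-E", "duration": "clr-D"}

hybrid = hybridize(cnv_features, clr_features, {"spectrum", "duration"})
```

Enumerating different `from_clr` subsets and measuring listener intelligibility for each resulting hybrid is one way such a design could isolate the contribution of individual features.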

Bibliographic reference. Kusumoto, Akiko / Kain, Alexander B. / Hosom, John-Paul / Santen, Jan P. H. van (2007): "Hybridizing conversational and clear speech", in INTERSPEECH-2007, 370-373.