EUROSPEECH 2003 - INTERSPEECH 2003
This paper presents a new voice conversion system that modifies misarticulations and prosodic deviations of the hearing-impaired Mandarin speech. The basic strategy is the detection and exploitation of characteristic features that distinguish the impaired speech from the normal speech at segmental and prosodic levels. For spectral conversion, cepstral coefficients were characterized under the form of a Gaussian mixture model with parameters converted using a mapping function that minimizes the spectral distortion between the impaired and normal speech. We also proposed a VQ-based approach to prosodic conversion that involves modifying the features extracted from the pitch contour by orthogonal polynomial transform. Experimental results indicate that the proposed system appears useful in enhancing the hearing-impaired Mandarin speech.
Bibliographic reference. Lee, Chen-Long / Yang, Ya-Ru / Chang, Wen-Whei / Chiang, Yuan-Chuan (2003): "Enhancement of hearing-impaired Mandarin speech", In EUROSPEECH-2003, 1369-1372.