7th International Conference on Spoken Language Processing

September 16-20, 2002
Denver, Colorado, USA

Computationally Efficient Time-Scale Modification of Speech Using 3 Level Clipping

Sung-Joo Lee (1), Hyung Soon Kim (2)

(1) Electronics and Telecommunications Research Institute, Korea; (2) Pusan National University, Korea

Among the conventional time-scale modification methods [1]- [6], the synchronized overlap and add (SOLA) method [4] is used widely because of its good performance with relatively low computational complexity. But the SOLA method still requires much computation in evaluating the normalized cross-correlation function for synchronization procedure [9]. In this paper, we employ 3 level center clipping method in order to reduce the computational complexity of SOLA method. The result of subjective preference test indicates that the proposed method can reduce computational complexity by over 80% comparing with the conventional SOLA method without considerable performance degradation. We also apply the variable time-scale modification method using transient information [7] to the proposed algorithm. By doing so, we can maintain the intelligibility of time-scale modified speech in the case of very fast playback.


Full Paper

Bibliographic reference.  Lee, Sung-Joo / Kim, Hyung Soon (2002): "Computationally efficient time-scale modification of speech using 3 level clipping", In ICSLP-2002, 2385-2388.