8th Annual Conference of the International Speech Communication Association

Antwerp, Belgium
August 27-31, 2007

Features Interpolation Domain for Distributed Speech Recognition and Performance for ITU-T G.723.1 CODEC

Vladimir Fabregas Surigué de Alencar, Abraham Alcaim

PUC-RIO, Brazil

In this paper, we examine the best domain to perform features interpolation in Distributed Speech Recognition (DSR) systems. We show that the only one domain where a performance gain can be achieved from the linear interpolation procedure is in the Line Spectral Frequencies (LSF) domain. A DSR scenario where the ITU-T G.723.1 codec is employed is also investigated. The recognition feature generated from the reconstructed speech is highly sensitive to the encoding noise. We have also shown that the LSF quantization scheme used by the G.723.1 codec decreases the recognition performance by approximately 2%.

Full Paper

Bibliographic reference.  Alencar, Vladimir Fabregas Surigué de / Alcaim, Abraham (2007): "Features interpolation domain for distributed speech recognition and performance for ITU-t g.723.1 CODEC", In INTERSPEECH-2007, 1142-1145.