In this paper, we examine the best domain in which to perform feature interpolation in Distributed Speech Recognition (DSR) systems. We show that the Line Spectral Frequencies (LSF) domain is the only domain in which the linear interpolation procedure yields a performance gain. We also investigate a DSR scenario that employs the ITU-T G.723.1 codec. The recognition features generated from the reconstructed speech are highly sensitive to the encoding noise. We also show that the LSF quantization scheme used by the G.723.1 codec decreases recognition performance by approximately 2%.
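As a rough illustration of linear interpolation in a feature domain, the following Python sketch fills in feature vectors between two received frames. It is not the procedure from the paper, whose details are not given in this abstract: the function name `interpolate_frames`, the frame layout (one list of LSF values per frame), and the assumption that a fixed number of intermediate frames must be reconstructed are all hypothetical choices made for the example.

```python
from typing import List, Sequence


def interpolate_frames(
    left: Sequence[float], right: Sequence[float], n_missing: int
) -> List[List[float]]:
    """Linearly interpolate n_missing feature vectors between two frames.

    Hypothetical helper: `left` and `right` are the feature vectors
    (for example, LSF values) of the frames on either side of the gap.
    """
    if len(left) != len(right):
        raise ValueError("frames must have the same dimension")
    if n_missing < 0:
        raise ValueError("n_missing must be non-negative")

    frames = []
    for k in range(1, n_missing + 1):
        # Weight of the right-hand frame grows evenly from left to right.
        w = k / (n_missing + 1)
        frames.append([(1.0 - w) * a + w * b for a, b in zip(left, right)])
    return frames


if __name__ == "__main__":
    # Two illustrative (made-up) 4-dimensional LSF vectors in radians.
    frame_a = [0.30, 0.70, 1.40, 2.20]
    frame_b = [0.40, 0.90, 1.50, 2.60]
    for frame in interpolate_frames(frame_a, frame_b, 2):
        print(frame)
```

One property worth noting in this sketch: each interpolated vector is a convex combination of its two neighbours, so if both input vectors are strictly increasing, as LSF vectors are, every interpolated vector is strictly increasing as well.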
Bibliographic reference. Alencar, Vladimir Fabregas Surigué de / Alcaim, Abraham (2007): "Features interpolation domain for distributed speech recognition and performance for ITU-T G.723.1 codec", In INTERSPEECH-2007, 1142-1145.