EUROSPEECH 2003 - INTERSPEECH 2003
In this paper we present a review and evaluation of some of the main methods used in variable frame rate (VFR) analysis, applied to speech recognition systems. The work found in the literature in this area usually deals with restricted conditions and scenarios. We have revisited the main algorithmic alternatives and evaluated them under the same experimental framework, which allows us to draw objective conclusions about each of them and to select the most adequate strategy. We also show to what extent VFR analysis is useful in its three main application scenarios, namely "reducing computational load", "improving acoustic modelling" and "handling additive noise conditions in the time domain". From our evaluation on a difficult large-vocabulary telephone task, we establish that VFR analysis does not significantly improve the results obtained using traditional fixed frame rate (FFR) analysis, except when additive noise is present in the database, especially at low SNRs.
Bibliographic reference. Macias-Guarasa, J. / Ordonez, J. / Montero, J.M. / Ferreiros, J. / Cordoba, R. / D'Haro, L.F. (2003): "Revisiting scenarios and methods for variable frame rate analysis in automatic speech recognition", In EUROSPEECH-2003, 1809-1812.