11th Annual Conference of the International Speech Communication Association

Makuhari, Chiba, Japan
September 26-30, 2010

Using Harmonic Phase Information to Improve ASR Rate

Ibon Saratxaga, Inma Hernáez, Igor Odriozola, Eva Navas, Iker Luengo, Daniel Erro

Aholab Signal Processing Laboratory, University of the Basque Country UPV/EHU, Spain

Spectral phase information is usually discarded in automatic speech recognition (ASR). The Relative Phase Shift (RPS), a novel representation of the phase information of speech, has properties that appear well suited to improving the recognition rate. In this paper we describe the RPS representation, discuss different ways of parameterizing this information so that it is suitable for HMM modelling, and present the results of the evaluation experiments. WER improvements ranging from 12% to 22% open promising perspectives for using this information jointly with the classical MFCC parameterization.

Index Terms: ASR, phase spectrum, harmonic analysis

Bibliographic reference. Saratxaga, Ibon / Hernáez, Inma / Odriozola, Igor / Navas, Eva / Luengo, Iker / Erro, Daniel (2010): "Using harmonic phase information to improve ASR rate", in INTERSPEECH-2010, 1185-1188.