INTERSPEECH 2009

In this paper we present an analytic derivation of the moments of the phase factor between clean speech and noise cepstral or logmelspectral feature vectors. The development shows, among others, that the probability density of the phase factor is of subGaussian nature and that it is independent of the noise type and the signaltonoise ratio, however dependent on the mel filter bank index. Further we show how to compute the contribution of the phase factor to both the mean and the variance of the noisy speech observation likelihood, which relates the speech and noise feature vectors to those of noisy speech. The resulting phasesensitive observation model is then used in modelbased speech feature enhancement, leading to significant improvements in word accuracy on the AURORA2 database.
Bibliographic reference. Leutnant, Volker / HaebUmbach, Reinhold (2009): "An analytic derivation of a phasesensitive observation model for noise robust speech recognition", In INTERSPEECH2009, 23952398.