INTERSPEECH 2013

In this paper, recently proposed Temporally Varying Weight Regression (TVWR) is investigated in two ways for noise robust speech recognition. Firstly, since typical model compensation approaches assume that the noise feature is independent and identically distributed, nonstationary noise environment can be poorly compensated using conventional model compensation approaches in the standard Hidden Markov Model (HMM) framework. TVWR, however, maintains both the basic HMM structure and additional timevarying property, therefore, model compensation for TVWR is proposed such that i.i.d. noise assumption can be relaxed. Secondly, although Noise Adaptive Training NAT has been proposed to optimize the "pseudoclean" HMM model for a better performance by maximizing the likelihood of multicondition data, NAT heavily depends on the simplicity of Vector Taylor Series (VTS) formulation. Hence, other advanced compensation approaches, such as Trajectorybased Parallel Model Combination (TPMC), have difficulties benefiting from this powerful training schema. This paper exploits the timevarying attribute of TVWR to approximate NAT such that any compensation technique can be applied during noise adaptive training. Experiments on the Aurora 4 corpus show that significant improvements over the standard HMM or NAT system can be obtained by compensating TVWR either trained using clean data or adaptively trained using multicondition data.
Bibliographic reference. Liu, Shilin / Sim, Khe Chai (2013): "An investigation of temporally varying weight regression for noise robust speech recognition", In INTERSPEECH2013, 29632967.