Interspeech'2005 - Eurospeech
In environment with considerably long reverberation time, each frame of speech is affected by reflected energy components from the preceding frames. Therefore to adapt model parameters of a state, it becomes necessary to consider these frames, and compute their contributions to current state. However, these clean speech frames preceding to a state of HMM are not known during adaptation of the models. This paper describes a method to estimate the preceding frames for a state in HMM, by splitting the state into a number of substates. The estimated sequence of frames can then be used to find reflected energy component for the state and compensate its parameters. The effectiveness of the method was confirmed by the experimental results on an isolated-word recognition task.
Bibliographic reference. Raut, Chandra Kant / Nishimoto, Takuya / Sagayama, Shigeki (2005): "Model adaptation by state splitting of HMM for long reverberation", In INTERSPEECH-2005, 277-280.