Sixth International Conference on Spoken Language Processing
(ICSLP 2000)

Beijing, China
October 16-20, 2000

Incorporating HMM-State Sequence Confusion for Rapid MLLR Adaptation to New Speakers

Bing Zhao, Bo Xu

National Lab of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing, China

In this paper, we introduce HMM-state sequence confusion characteristics as prior knowledge into the MLLR framework, in order to relax the transformation and reduce the risk of over-training when the amount of adaptation data is small. Two issues must be addressed: first, how to estimate such confusion information reliably; second, how to use that information to refine the MLLR estimate. We use pronunciation modeling techniques to build a state sequence confusion table, and then calculate the correlation between states from that table. The proposed algorithm then relaxes the MLLR adaptation process when the adaptation data is very small. Our experiment on a toneless, state-tied triphone Mandarin LVCSR system showed an error rate reduction of 9.5% over standard MLLR with about 10 utterances (less than 30 seconds) of adaptation data.
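
The abstract does not give the formulas, so the following is only a rough sketch of the general idea of turning a confusion table into state correlations and using them to share sparse adaptation statistics. Everything in it is an assumption made for illustration: the function names normalize_confusions and relax_statistics, the row normalization, the toy counts, and the use of a single occupancy value per state. It is not the authors' algorithm and it does not implement MLLR itself.

```python
def normalize_confusions(counts):
    """Row-normalize a square confusion-count table so each row sums to 1.

    counts[i][j] is a hypothetical count of how often state i was
    confused with state j. The normalized row is treated here as a
    crude "correlation" of state i with every other state.
    """
    weights = []
    for i, row in enumerate(counts):
        total = sum(row)
        if total == 0:
            # No confusion evidence for this state: share nothing.
            weights.append([1.0 if j == i else 0.0 for j in range(len(row))])
        else:
            weights.append([c / total for c in row])
    return weights


def relax_statistics(stats, weights):
    """Replace each state's statistic with a confusion-weighted mix over all states."""
    n = len(stats)
    return [sum(weights[i][j] * stats[j] for j in range(n)) for i in range(n)]


# Toy example with three states (made-up numbers).
confusion_counts = [
    [8, 2, 0],
    [2, 6, 2],
    [0, 0, 0],
]
# State 1 was never observed in the adaptation data.
occupancy = [10.0, 0.0, 5.0]

weights = normalize_confusions(confusion_counts)
relaxed = relax_statistics(occupancy, weights)
```

In this toy setup, the unobserved state 1 borrows evidence from the states it is most often confused with, which is the kind of relaxation that matters when only a few utterances are available.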


Bibliographic reference.  Zhao, Bing / Xu, Bo (2000): "Incorporating HMM-state sequence confusion for rapid MLLR adaptation to new speakers", In ICSLP-2000, vol.3, 690-693.