11th Annual Conference of the International Speech Communication Association

Makuhari, Chiba, Japan
September 26-30, 2010

An Analysis of Language Mismatch in HMM State Mapping-Based Cross-Lingual Speaker Adaptation

Hui Liang, John Dines

Idiap Research Institute, Switzerland

This paper provides an in-depth analysis of the impact of language mismatch on the performance of cross-lingual speaker adaptation. Our work confirms the influence of language mismatch between the average voice distributions used for synthesis and those used for transform estimation, and the necessity of eliminating this mismatch in order to use multiple transforms effectively for cross-lingual speaker adaptation. Specifically, we show that language mismatch introduces unwanted language-specific information when multiple transforms are estimated, making these transforms detrimental to adaptation performance. Our analysis demonstrates that speaker characteristics should be separated from language characteristics in order to improve cross-lingual adaptation performance.

Bibliographic reference.  Liang, Hui / Dines, John (2010): "An analysis of language mismatch in HMM state mapping-based cross-lingual speaker adaptation", In INTERSPEECH-2010, 622-625.