7th International Conference on Spoken Language Processing

September 16-20, 2002
Denver, Colorado, USA

Unsupervised Acoustic Model Adaptation Based on Phoneme Error Minimization

Jun Ogata, Yasuo Ariki

Ryukoku University, Japan

In this paper, a new decoding method for unsupervised acoustic model adaptation is presented. In unsupervised adaptation framework, the effectiveness of adaptation process is greatly affected by the mis-recognized labels. Therefore, selection of the adaptation data guided by the confidence measures is effective in unsupervised adaptation. We propose phoneme error minimization framework for exact phoneme labels and use of phoneme-level confidence measures for improved unsupervised adaptation. Experimental results showed that the proposed method could reduce the mis-recognized labels in the adaptation process, and consequently improved the adaptation accuracy. Furthermore, it was confirmed that the proposed method is effective in an iterative unsupervised adaptation framework.

Full Paper

Bibliographic reference.  Ogata, Jun / Ariki, Yasuo (2002): "Unsupervised acoustic model adaptation based on phoneme error minimization", In ICSLP-2002, 1429-1432.