ISCA Archive Interspeech 2009
ISCA Archive Interspeech 2009

Speaking style adaptation for spontaneous speech recognition using multiple-regression HMM

Yusuke Ijima, Takeshi Matsubara, Takashi Nose, Takao Kobayashi

This paper describes a rapid model adaptation technique for spontaneous speech recognition. The proposed technique utilizes a multiple-regression hidden Markov model (MRHMM) and is based on a style estimation technique of speech. In the MRHMM, the mean vector of probability density function (pdf) is given by a function of a low-dimensional vector, called style vector, which corresponds to the intensity of expressivity of speaking style variation. The value of the style vector is estimated for every utterance of the input speech and the model adaptation is conducted by calculating new mean vectors of the pdf using the estimated style vector. The performance evaluation results using “Corpus of spontaneous Japanese (CSJ)” are shown under a condition in which the amount of model training and adaptation data is very small.


doi: 10.21437/Interspeech.2009-198

Cite as: Ijima, Y., Matsubara, T., Nose, T., Kobayashi, T. (2009) Speaking style adaptation for spontaneous speech recognition using multiple-regression HMM. Proc. Interspeech 2009, 552-555, doi: 10.21437/Interspeech.2009-198

@inproceedings{ijima09_interspeech,
  author={Yusuke Ijima and Takeshi Matsubara and Takashi Nose and Takao Kobayashi},
  title={{Speaking style adaptation for spontaneous speech recognition using multiple-regression HMM}},
  year=2009,
  booktitle={Proc. Interspeech 2009},
  pages={552--555},
  doi={10.21437/Interspeech.2009-198}
}