ISCA Archive Interspeech 2008

Improvement of eigenvoice-based speaker adaptation by parameter space clustering

Shutaro Tanji, Koichi Shinoda, Sadaoki Furui, Antonio Ortega

The segmental eigenvoice method was proposed to provide rapid speaker adaptation when only a limited amount of adaptation data is available. In this method, the speaker-vector space is clustered into several subspaces, and principal component analysis (PCA) is applied to each of the resulting subspaces. In this paper, we propose two new techniques to improve the performance of the segmental eigenvoice approach. First, we propose a soft-clustering method in which each element of a speaker vector can be assigned to more than one cluster. Second, elements that lie far from every cluster are removed. Our experiments using the JNAS and S-JNAS databases show that the proposed method outperforms both the original eigenvoice method and the segmental eigenvoice method, for example, by a 3.3% average improvement when only 10 utterances are used for adaptation.
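The abstract does not specify the clustering criterion, the distance measure, or any thresholds, so the following is only a minimal NumPy sketch of the general idea under stated assumptions, not the authors' implementation. It assumes each speaker-vector element is described by its values across the training speakers, that cluster centres are already given, that an element joins every cluster whose distance is within a hypothetical factor `soft_ratio` of its nearest centre (soft clustering), and that an element is dropped when even its nearest centre is farther than a hypothetical `max_dist`. PCA is then run separately on the elements of each cluster. All function and parameter names are illustrative.

```python
import numpy as np


def soft_cluster_elements(X, centroids, soft_ratio=1.2, max_dist=None):
    """Assign each speaker-vector element (a column of X) to one or more clusters.

    X         : (n_speakers, n_elements) training speaker vectors
    centroids : (n_clusters, n_speakers) assumed cluster centres
    Returns one index array per cluster; an element may appear in several
    clusters (soft assignment) or in none (removed as too far away).
    """
    profiles = X.T  # one row per element: its values across all speakers
    diff = profiles[:, None, :] - centroids[None, :, :]
    dist = np.linalg.norm(diff, axis=2)          # (n_elements, n_clusters)
    nearest = dist.min(axis=1, keepdims=True)    # distance to closest centre
    member = dist <= soft_ratio * nearest        # soft membership (assumed rule)
    if max_dist is not None:
        member &= nearest <= max_dist            # drop far-away elements
    return [np.flatnonzero(member[:, k]) for k in range(centroids.shape[0])]


def cluster_eigenvoices(X, clusters, n_components):
    """Run PCA on the sub-vector of each cluster; return (indices, mean, basis)."""
    bases = []
    for idx in clusters:
        if idx.size == 0:
            continue  # nothing to model for an empty cluster
        sub = X[:, idx]
        mean = sub.mean(axis=0)
        # Rows of vt are the principal directions of the centred sub-vectors.
        _, _, vt = np.linalg.svd(sub - mean, full_matrices=False)
        bases.append((idx, mean, vt[:n_components]))
    return bases


# Toy data: 4 training speakers (rows), 4 speaker-vector elements (columns).
X = np.array([
    [0.0, 10.0, 5.0, 100.0],
    [1.0,  9.0, 5.0, 100.0],
    [0.0, 10.0, 5.0, 100.0],
    [1.0,  9.0, 5.0, 100.0],
])
centroids = np.array([
    [0.0, 0.0, 0.0, 0.0],
    [10.0, 10.0, 10.0, 10.0],
])

clusters = soft_cluster_elements(X, centroids, soft_ratio=1.2, max_dist=50.0)
bases = cluster_eigenvoices(X, clusters, n_components=1)
```

In this toy setup, element 2 is equally close to both centres and so is shared by both clusters, while element 3 is far from both and is left out of every cluster. A real system would derive the centres from data and tune both thresholds on held-out speakers.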


doi: 10.21437/Interspeech.2008-373

Cite as: Tanji, S., Shinoda, K., Furui, S., Ortega, A. (2008) Improvement of eigenvoice-based speaker adaptation by parameter space clustering. Proc. Interspeech 2008, 1229-1232, doi: 10.21437/Interspeech.2008-373

@inproceedings{tanji08_interspeech,
  author={Shutaro Tanji and Koichi Shinoda and Sadaoki Furui and Antonio Ortega},
  title={{Improvement of eigenvoice-based speaker adaptation by parameter space clustering}},
  year=2008,
  booktitle={Proc. Interspeech 2008},
  pages={1229--1232},
  doi={10.21437/Interspeech.2008-373}
}