ISCA Archive Interspeech 2005
ISCA Archive Interspeech 2005

Unit selection for speech synthesis based on a new acoustic target cost

Soufiane Rouibia, Olivier Rosec

This paper presents a new approach to unit selection for corpusbased speech synthesis, in which the units are selected according to acoustic criteria. In a learning stage, an acoustic clustering is carried out using context dependent HMM. During synthesis, an acoustic target is generated and segmented in the required diphone sequence. For each diphone to be synthesized, a pre-selection module determines the N-best instances that match this acoustic target. From these candidates, the optimal unit sequence is then obtained by minimizing a concatenation cost through dynamic programming. Objective as well as subjective tests are carried out which shows the relevance of the proposed method.


doi: 10.21437/Interspeech.2005-796

Cite as: Rouibia, S., Rosec, O. (2005) Unit selection for speech synthesis based on a new acoustic target cost. Proc. Interspeech 2005, 2565-2568, doi: 10.21437/Interspeech.2005-796

@inproceedings{rouibia05_interspeech,
  author={Soufiane Rouibia and Olivier Rosec},
  title={{Unit selection for speech synthesis based on a new acoustic target cost}},
  year=2005,
  booktitle={Proc. Interspeech 2005},
  pages={2565--2568},
  doi={10.21437/Interspeech.2005-796}
}