ISCA Archive Eurospeech 2001
ISCA Archive Eurospeech 2001

Reducing spectral mismatches in concatenative speech synthesis via systematic database enrichment

Maria Founda, George Tambouratzis, Aimilios Chalamandaris, George Carayannis

This paper presents work performed for the Time-Domain TTS system, which is being developed at the ILSP for the Greek language. It focuses on the enhancement of the synthetic speech quality, by reducing the spectral mismatches between concatenated segments. To that end, a study has been performed to determine the distance that can best predict when a spectral mismatch is audible. Experimentation with different spectral distances has taken place and the distance with the best performance has been used in order to systematically enrich the segment database, which initially contained only one instance per segment. Results of this procedure indicate a substantial improvement in the synthetic speech quality.


doi: 10.21437/Eurospeech.2001-257

Cite as: Founda, M., Tambouratzis, G., Chalamandaris, A., Carayannis, G. (2001) Reducing spectral mismatches in concatenative speech synthesis via systematic database enrichment. Proc. 7th European Conference on Speech Communication and Technology (Eurospeech 2001), 837-840, doi: 10.21437/Eurospeech.2001-257

@inproceedings{founda01_eurospeech,
  author={Maria Founda and George Tambouratzis and Aimilios Chalamandaris and George Carayannis},
  title={{Reducing spectral mismatches in concatenative speech synthesis via systematic database enrichment}},
  year=2001,
  booktitle={Proc. 7th European Conference on Speech Communication and Technology (Eurospeech 2001)},
  pages={837--840},
  doi={10.21437/Eurospeech.2001-257}
}