Optimal Unit Stitching in a Unit Selection Singing Synthesis System

Marius Cotescu


Unit Selection based speech synthesis systems are currently the best performing, producing natural sounding speech with minimal CPU load. One of the important reasons behind their success is the amount of recordings that are now commonly used in synthesis applications. However, in the case of singing applications, it is quite hard for a database to cover a large phonetic space due to the relative inefficiency of the recording process. Thus, due to the reduced catalogue of units, singing unit selection systems are more likely to produce spectral discontinuity artefacts. Taking advantage of the quasi stable nature of articulation during singing, we propose a novel unit stitching method. The method was implemented into the system that was used for the “Fill-In the Gap” Singing Synthesis Challenge.


DOI: 10.21437/Interspeech.2016-1390

Cite as

Cotescu, M. (2016) Optimal Unit Stitching in a Unit Selection Singing Synthesis System. Proc. Interspeech 2016, 1255-1259.

Bibtex
@inproceedings{Cotescu2016,
author={Marius Cotescu},
title={Optimal Unit Stitching in a Unit Selection Singing Synthesis System},
year=2016,
booktitle={Interspeech 2016},
doi={10.21437/Interspeech.2016-1390},
url={http://dx.doi.org/10.21437/Interspeech.2016-1390},
pages={1255--1259}
}