Interspeech'2005 - Eurospeech
In the context of the Neologos speech database creation project, we have studied several methods for the selection of representative speaker recordings. These methods operate a selection by optimizing a quality criterion defined in various speaker similarity modeling frameworks. The obtained selections can be cross-validated in the modeling frameworks which were not used for the optimization. The compared methods include K-Medians clustering, Hierarchical clustering, and a new method called the selection of Focal Speakers. Among these, only the new method is able to solve the joint optimization, across all the modeling frameworks, of the selection of representative speakers.
Bibliographic reference. Krstulovic, Sacha / Bimbot, Frédéric / Charlet, Delphine / Boëffard, Olivier (2005): "Focal speakers: a speaker selection method able to deal with heterogeneous similarity criteria", In INTERSPEECH-2005, 3057-3060.