15th Annual Conference of the International Speech Communication Association

September 14-18, 2014

Detecting and Labeling Speakers on Overlapping Speech Using Vector Taylor Series

Pranay Dighe, Marc Ferràs, Hervé Bourlard

Idiap Research Institute, Switzerland

Successfully modeling overlapping speech is a crucial step towards improving the performance of current speaker diarization systems. To this end, we present ongoing work on a novel Multi-Class Vector Taylor Series (MC-VTS) approach that models overlapping speech from knowledge of the individual speaker models and the feature extraction process. We explore several variants of the MC-VTS technique that aim at modeling overlapping speech more precisely. Bootstrapping the algorithm with both oracle and diarization-output segmentations, we show the potential of this approach in terms of overlapping speech detection and speaker labeling performance through a set of experiments on far-field microphone meeting data.

Bibliographic reference. Dighe, Pranay / Ferràs, Marc / Bourlard, Hervé (2014): "Detecting and Labeling Speakers on Overlapping Speech Using Vector Taylor Series", in INTERSPEECH-2014, 592-596.