ISCA Archive Eurospeech 1999
ISCA Archive Eurospeech 1999

Speaker tracking and detection with multiple speakers

Kemal Sönmez, Larry Heck, Mitchel Weintraub

We describe a speaker tracking and detection system, for Switchboard conversations, that uses a two­speaker and silence hidden Markov model (HMM)with a minimumstate duration constraint and Gaussian mixture model (GMM) state distributionsadapted from a single gender- and hand­set­independent imposter model distribution. Speaker tracking is used to segment speakers for detection, which is carried out by averaging frame scores of the Viterbi path and HNORM’ing via a novel parameter interpolation extension of HNORM for use with files of arbitrary lengths. Use of duration statistics augmenting the acoustic scores is also introduced via a nonlinear combination function. Results are reported on the NIST 1998 Multispeaker development evaluation dataset.


doi: 10.21437/Eurospeech.1999-492

Cite as: Sönmez, K., Heck, L., Weintraub, M. (1999) Speaker tracking and detection with multiple speakers. Proc. 6th European Conference on Speech Communication and Technology (Eurospeech 1999), 2219-2222, doi: 10.21437/Eurospeech.1999-492

@inproceedings{sonmez99_eurospeech,
  author={Kemal Sönmez and Larry Heck and Mitchel Weintraub},
  title={{Speaker tracking and detection with multiple speakers}},
  year=1999,
  booktitle={Proc. 6th European Conference on Speech Communication and Technology (Eurospeech 1999)},
  pages={2219--2222},
  doi={10.21437/Eurospeech.1999-492}
}