ISCA Archive Interspeech 2013

Superposed speech localisation using frequency tracking

Maxime Le Coz, Julien Pinquier, Régine André-Obrecht

In this paper we present a new approach for the localisation of superposed speech areas. The system is based on frequency tracking within speech segments: it follows the evolution of the main amplitude frequencies over time and requires no training of acoustic or prosodic models. The resulting frequency tracks are then grouped using a distance based on harmonicity, each group being taken as the production of a single speaker. The co-occurrence of different harmonic groups is then interpreted as evidence of the presence of multiple speakers. Our method has been evaluated on the data of the French ANR evaluation campaign ETAPE, showing the usability of this approach.
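
The pipeline described above (tracks, harmonic grouping, co-occurrence of groups) can be illustrated with a minimal sketch. The abstract does not specify the harmonicity distance or the grouping procedure, so everything below is an assumption made for illustration: the names `Track`, `is_harmonic_of`, `group_tracks` and `superposed_frames`, the 3% tolerance, the greedy grouping, and the assumption that the lowest track of each speaker is the fundamental. It is not the authors' implementation.

```python
from dataclasses import dataclass


@dataclass
class Track:
    start: int    # first frame index (inclusive)
    end: int      # last frame index (inclusive)
    freq: float   # mean frequency of the track, in Hz


def overlaps(a: Track, b: Track) -> bool:
    """True if the two tracks share at least one frame."""
    return a.start <= b.end and b.start <= a.end


def is_harmonic_of(freq: float, f0: float, tol: float = 0.03) -> bool:
    """Assumed harmonicity test: freq lies within a relative tolerance
    of an integer multiple of the candidate fundamental f0."""
    n = round(freq / f0)
    return n >= 1 and abs(freq - n * f0) <= tol * freq


def group_tracks(tracks, tol: float = 0.03):
    """Greedy grouping (an assumption, not the paper's method): the lowest
    ungrouped track seeds a group and is treated as its fundamental."""
    groups = []
    for t in sorted(tracks, key=lambda t: t.freq):
        for g in groups:
            if overlaps(t, g[0]) and is_harmonic_of(t.freq, g[0].freq, tol):
                g.append(t)
                break
        else:
            groups.append([t])
    return groups


def superposed_frames(groups, n_frames: int, min_tracks: int = 2):
    """Return the frames in which at least two harmonic groups are active.
    Groups with fewer than min_tracks tracks are ignored as unreliable."""
    active = [0] * n_frames
    for g in groups:
        if len(g) < min_tracks:
            continue
        frames = set()
        for t in g:
            frames.update(range(t.start, min(t.end, n_frames - 1) + 1))
        for f in frames:
            active[f] += 1
    return [i for i, count in enumerate(active) if count >= 2]


# Synthetic example: two hypothetical speakers with fundamentals of
# 100 Hz (frames 0-9) and 140 Hz (frames 5-14), three harmonics each.
tracks = [Track(0, 9, f) for f in (100.0, 200.0, 300.0)]
tracks += [Track(5, 14, f) for f in (140.0, 280.0, 420.0)]
groups = group_tracks(tracks)
print(superposed_frames(groups, n_frames=15))  # [5, 6, 7, 8, 9]
```

On this synthetic input the two groups overlap only in frames 5 to 9, which is where the sketch reports superposed speech. Real tracks would come from a spectral peak tracker, which is outside the scope of this illustration.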


doi: 10.21437/Interspeech.2013-200

Cite as: Le Coz, M., Pinquier, J., André-Obrecht, R. (2013) Superposed speech localisation using frequency tracking. Proc. Interspeech 2013, 714-717, doi: 10.21437/Interspeech.2013-200

@inproceedings{coz13_interspeech,
  author={Maxime Le Coz and Julien Pinquier and Régine André-Obrecht},
  title={{Superposed speech localisation using frequency tracking}},
  year=2013,
  booktitle={Proc. Interspeech 2013},
  pages={714--717},
  doi={10.21437/Interspeech.2013-200}
}