16th Annual Conference of the International Speech Communication Association

Dresden, Germany
September 6-10, 2015

The Zero Resource Speech Challenge 2015

Maarten Versteegh (1), Roland Thiollière (1), Thomas Schatz (1), Xuan Nga Cao (1), Xavier Anguera (2), Aren Jansen (3), Emmanuel Dupoux (1)

(1) ENS, France
(2) Telefónica I+D, Spain
(3) Johns Hopkins University, USA

The Interspeech 2015 Zero Resource Speech Challenge aims to discover subword and word units from raw speech. The challenge provides the first unified, open-source suite of evaluation metrics and data sets for comparing and analysing the results of unsupervised linguistic unit discovery algorithms. It consists of two tracks. In the first, a psychophysically inspired evaluation task (minimal-pair ABX discrimination) assesses how well speech feature representations discriminate between contrastive subword units. In the second, several metrics gauge the quality of discovered word-like patterns. Two data sets are provided, one for English and one for Xitsonga, both without any annotation except for voice activity and talker identity. This paper introduces the evaluation metrics, presents the results of baseline systems, and discusses some of the key issues in unsupervised unit discovery.
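The idea behind the ABX discrimination task in the first track can be illustrated with a minimal sketch. In each trial, tokens A and X belong to one category and token B belongs to a contrasting category. The representation makes an error when X is farther from A than from B. The sketch below is illustrative only and is not the challenge's evaluation code. The function names (`frame_distance`, `dtw_distance`, `abx_error_rate`) are hypothetical. The Euclidean frame distance, the length-normalised dynamic time warping, and the half-error scoring of ties are all assumptions, and aggregation over talkers and phonetic contexts is left out.

```python
import math


def frame_distance(u, v):
    """Euclidean distance between two feature frames (an assumed choice)."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def dtw_distance(x, y):
    """Length-normalised dynamic time warping cost between two frame sequences."""
    n, m = len(x), len(y)
    inf = float("inf")
    # cost[i][j] holds the best cumulative cost of aligning x[:i] with y[:j].
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = frame_distance(x[i - 1], y[j - 1])
            cost[i][j] = d + min(cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1])
    return cost[n][m] / (n + m)


def abx_error_rate(cat_a, cat_b):
    """Fraction of (A, B, X) trials in which X is not closer to A than to B.

    A and X are distinct tokens from cat_a and B is a token from cat_b.
    Ties count as half an error (an assumption of this sketch).
    """
    errors = 0.0
    trials = 0
    for i, a in enumerate(cat_a):
        for k, x in enumerate(cat_a):
            if i == k:
                continue  # A and X must be different tokens
            d_ax = dtw_distance(a, x)
            for b in cat_b:
                d_bx = dtw_distance(b, x)
                if d_ax > d_bx:
                    errors += 1.0
                elif d_ax == d_bx:
                    errors += 0.5
                trials += 1
    if trials == 0:
        raise ValueError("need at least two tokens in cat_a and one in cat_b")
    return errors / trials


# Toy one-dimensional "features": two tokens of one category, one of another.
tokens_a = [[(0.0,), (0.1,)], [(0.05,), (0.1,), (0.1,)]]
tokens_b = [[(1.0,), (1.1,)]]
print(abx_error_rate(tokens_a, tokens_b))
```

A lower error rate means the representation separates the two categories better. A representation that carries no information about the contrast would be expected to score near 0.5 under this scoring.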

Bibliographic reference. Versteegh, Maarten / Thiollière, Roland / Schatz, Thomas / Cao, Xuan Nga / Anguera, Xavier / Jansen, Aren / Dupoux, Emmanuel (2015): "The zero resource speech challenge 2015", in INTERSPEECH-2015, 3169-3173.