INTERSPEECH 2012
13th Annual Conference of the International Speech Communication Association

Portland, OR, USA
September 9-13, 2012

Discrimination of Linguistic and Non-Linguistic Vocalizations in Spontaneous Speech: Intra- and Inter-Corpus Perspectives

Felix Weninger, Björn Schuller

Institute for Human-Machine Communication, Technische Universität München, Germany

We present a large-scale study on classification of linguistic and non-linguistic vocalizations including laughter, vocal noise, hesitation and consent on four corpora amounting to 46 hours of spontaneous conversational speech. We consider training and testing on speaker-independent subsets of single corpora (intra-corpus) as well as inter-corpus experiments where models built on one or more corpora are evaluated on a disjoint corpus. Our results reveal that while inter-corpus performance is considerably lower than comparable intra-corpus results, this effect can be countered by data agglomeration; furthermore, we observe that inter-corpus classification accuracies indicate suitability of corpora for building generalizing models.

Full Paper

Bibliographic reference.  Weninger, Felix / Schuller, Björn (2012): "Discrimination of linguistic and non-linguistic vocalizations in spontaneous speech: intra- and inter-corpus perspectives", In INTERSPEECH-2012, 102-105.