Interspeech'2005 - Eurospeech

Lisbon, Portugal
September 4-8, 2005

Segmentation of Recordings Based on Partial Transcriptions

Patrick Cardinal, Gilles Boulianne, Michel Comeau

Centre de Recherche Informatique de Montréal, Canada

In this paper, we present the approach we used to produce a training database from a set of recorded newscasts for which we had inaccurate transcriptions. The transcriptions correspond to a set of prepared anchor texts and journalist stories, not necessarily in the chronological order in which they were actually presented. No segment time boundary information is provided. Our main concern is thus to establish time marks that delimit the audio segments corresponding to each text. To solve this problem, we have developed a time marking procedure using our speech recognition engine. We obtain a segmentation accuracy of 80%.

Bibliographic reference. Cardinal, Patrick / Boulianne, Gilles / Comeau, Michel (2005): "Segmentation of recordings based on partial transcriptions", in INTERSPEECH-2005, 3345-3348.