7th International Conference on Spoken Language Processing

September 16-20, 2002
Denver, Colorado, USA

German Broadcast News Transcription

Robert Hecht, Jürgen Riedler, Gerhard Backfried

Speech Artificial Intelligence and Language Laboratories, Austria

We describe a newly created broadcast news (BN) corpus drawn from programs of seven German and Austrian TV stations, and the development of a German BN transcription system built on this corpus. We report on a series of experiments addressing the fact that German is less suited than English to word-based trigram language models. Furthermore, we investigate various phoneme sets and examine how the difference between a transregional standard (the Bavarian dialect spoken in southern Germany and Austria) and standard German (Hochdeutsch) affects the word error rate.


Bibliographic reference. Hecht, Robert / Riedler, Jürgen / Backfried, Gerhard (2002): "German broadcast news transcription", in ICSLP-2002, 1753-1756.