7th International Conference on Spoken Language Processing
September 16-20, 2002
We describe a newly created broadcast news (BN) corpus based on programs from seven German and Austrian TV stations, and the development of a German BN transcription system built on this corpus. We report on a series of experiments addressing the fact that German is less suited than English to word-based trigram language models. Furthermore, we investigate various phoneme sets and examine how the difference between a transregional standard (the Bavarian dialect spoken in southern Germany and Austria) and standard German (Hochdeutsch) affects the word error rate.
Bibliographic reference. Hecht, Robert / Riedler, Jürgen / Backfried, Gerhard (2002): "German broadcast news transcription", In ICSLP-2002, 1753-1756.