ISCA Archive ICSLP 2000
ISCA Archive ICSLP 2000

Investigating text normalization and pronunciation variants for German broadcast transcription

Martine Adda-Decker, Gilles Adda, Lori Lamel

In this paper we describe our ongoing work concerning lexical modeling in the LIMSI broadcast transcription system for German. Lexical decomposition is investigated with a twofold goal: lexical coverage optimization and improved letter-to-sound conversion. A set of about 450 decompounding rules, developed using statistics from a 300M word corpus, reduces the OOV rate from 4.5% to 4.0% on a 30k development text set. Adding partial inflection stripping, the OOV rate drops to 2.9%. For letterto- sound conversion, decompounding reduces cross-lexeme ambiguities and thus contributes to more consistent pronunciation dictionaries. Another point of interest concerns reduced pronunciation modeling. Word error rates, measured on 1.3 hours of ARTE TV broadcast, vary between 18 and 24% depending on the show and the system configuration. Our experiments indicate that using reduced pronunciations slightly decreases word error rates.


Cite as: Adda-Decker, M., Adda, G., Lamel, L. (2000) Investigating text normalization and pronunciation variants for German broadcast transcription. Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000), vol. 1, 266-269

@inproceedings{addadecker00_icslp,
  author={Martine Adda-Decker and Gilles Adda and Lori Lamel},
  title={{Investigating text normalization and pronunciation variants for German broadcast transcription}},
  year=2000,
  booktitle={Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000)},
  pages={vol. 1, 266-269}
}