We use n-gram language models to investigate how closely language approximates an optimal code for human communication in the sense of information theory [1], and how learner proficiency levels differ in this respect. Although the language of lower-level learners is simpler, it is less optimal in information-theoretic terms and, as a consequence, more difficult to process.
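As a rough illustration of the kind of measurement an n-gram model makes possible, the sketch below estimates per-word surprisal (in bits) with a bigram model. It is not the authors' implementation. The abstract does not specify the n-gram order, the smoothing method, or the exact efficiency measure, so the bigram order, add-one smoothing, the toy sentences, and the function names `train_bigram`, `surprisals`, and `summarize` are all assumptions made for this example.

```python
import math
import statistics
from collections import Counter


def train_bigram(sentences):
    """Count bigrams and their left contexts over tokenized sentences."""
    context_counts = Counter()
    bigram_counts = Counter()
    vocab = set()
    for sentence in sentences:
        tokens = ["<s>"] + sentence + ["</s>"]
        vocab.update(tokens)
        for prev, cur in zip(tokens, tokens[1:]):
            context_counts[prev] += 1
            bigram_counts[(prev, cur)] += 1
    return context_counts, bigram_counts, vocab


def surprisals(sentence, model):
    """Return -log2 P(word | previous word) for each position.

    Add-one smoothing is an assumption of this sketch; one extra
    vocabulary slot is reserved for unseen words.
    """
    context_counts, bigram_counts, vocab = model
    v = len(vocab) + 1
    tokens = ["<s>"] + sentence + ["</s>"]
    values = []
    for prev, cur in zip(tokens, tokens[1:]):
        p = (bigram_counts[(prev, cur)] + 1) / (context_counts[prev] + v)
        values.append(-math.log2(p))
    return values


def summarize(values):
    """Mean surprisal (bits per word) and its variance across positions."""
    return statistics.mean(values), statistics.pvariance(values)


# Toy training data, invented for illustration only.
model = train_bigram([
    ["the", "cat", "sleeps"],
    ["the", "dog", "sleeps"],
])
seen = surprisals(["the", "cat", "sleeps"], model)
unseen = surprisals(["sleeps", "cat", "the"], model)
print(summarize(seen))
print(summarize(unseen))
```

Under these assumptions, a lower mean surprisal indicates text that the model finds more predictable, and a lower variance indicates information spread more evenly across words. Whether either quantity corresponds to the efficiency measure used in the paper is not stated in the abstract.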
Cite as: Grigonytė, G., Schneider, G. (2017) Measuring Encoding Efficiency in Swedish and English Language Learner Speech Production. Proc. Interspeech 2017, 1779-1783, doi: 10.21437/Interspeech.2017-337
@inproceedings{grigonyte17_interspeech,
  author={Gintarė Grigonytė and Gerold Schneider},
  title={{Measuring Encoding Efficiency in Swedish and English Language Learner Speech Production}},
  year=2017,
  booktitle={Proc. Interspeech 2017},
  pages={1779--1783},
  doi={10.21437/Interspeech.2017-337}
}