ISCA Archive SLaTE 2007

Text simplification for language learners: a corpus analysis

Sarah E. Petersen, Mari Ostendorf

Simplified texts are commonly used by teachers and students in bilingual education and other language-learning contexts. These texts are usually manually adapted, and teachers say this is a time-consuming and sometimes challenging task. Our goal is the development of tools to aid teachers by automatically proposing ways to simplify texts. As a first step, this paper presents a detailed analysis of a corpus of news articles and abridged versions written by a literacy organization in order to learn what kinds of changes people make when simplifying texts for language learners.


Cite as: Petersen, S.E., Ostendorf, M. (2007) Text simplification for language learners: a corpus analysis. Proc. Speech and Language Technology in Education (SLaTE 2007), 69-72

@inproceedings{petersen07_slate,
  author={Sarah E. Petersen and Mari Ostendorf},
  title={{Text simplification for language learners: a corpus analysis}},
  year=2007,
  booktitle={Proc. Speech and Language Technology in Education (SLaTE 2007)},
  pages={69--72}
}