ISCA Archive Interspeech 2008
ISCA Archive Interspeech 2008

Recognizing named entities in spoken Chinese dialogues with a character-level maximum entropy tagger

Changchun Bao, Weiqun Xu, Yonghong Yan

Named Entity Recognition (NER) is an important task in information extraction, where major attention has been paid to written texts of a news or academic paper (esp. biomedical) style. In this paper we report the first piece of work on NER in spoken Chinese dialogues, as a preliminary step for spoken language understanding. The NER task is taken as a sequential classification problem and solved with a character-level maximum entropy (maxent) tagger. Despite that spoken data seems noisier than written data, with a set of carefully selected features, the maxent tagger achieves an overall F1 score of 91.87 on our dialogue data.


doi: 10.21437/Interspeech.2008-348

Cite as: Bao, C., Xu, W., Yan, Y. (2008) Recognizing named entities in spoken Chinese dialogues with a character-level maximum entropy tagger. Proc. Interspeech 2008, 1145-1148, doi: 10.21437/Interspeech.2008-348

@inproceedings{bao08b_interspeech,
  author={Changchun Bao and Weiqun Xu and Yonghong Yan},
  title={{Recognizing named entities in spoken Chinese dialogues with a character-level maximum entropy tagger}},
  year=2008,
  booktitle={Proc. Interspeech 2008},
  pages={1145--1148},
  doi={10.21437/Interspeech.2008-348}
}