ISCA Archive ICSLP 2000
ISCA Archive ICSLP 2000

Rules, but what for? - rule description as efficient and robust abstraction of corpora and optimal fitting to applications -

Yoshinori Sagisaka, Hirofumi Yamamoto, Minoru Tsuzaki, Hiroaki Kato

Two recent studies are introduced in speech recognition and speech synthesis to reconsider what rules should be looked for spoken language science and technology. To abstract the neighboring characteristics expressed by Ngrams, multi-class composite N-grams have been proposed to model POS characteristics and inflectional forms separately. It is shown that statistical clustering can provide more compact and robust description of word neighboring characteristics than conventional N-grams. For speech synthesis, segmental duration modeling has been examined from the viewpoint of perceptual characteristics of duration changes. A series of perceptual experiments have shown the context dependency of sensitivity to duration change. These two examples respectively illustrate how current rules are interpreted to build scientifically acceptable engineering models and remind us of the deeper scientific meaning and limitation of generalization as a rule.


doi: 10.21437/ICSLP.2000-568

Cite as: Sagisaka, Y., Yamamoto, H., Tsuzaki, M., Kato, H. (2000) Rules, but what for? - rule description as efficient and robust abstraction of corpora and optimal fitting to applications -. Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000), vol. 3, 448-451, doi: 10.21437/ICSLP.2000-568

@inproceedings{sagisaka00_icslp,
  author={Yoshinori Sagisaka and Hirofumi Yamamoto and Minoru Tsuzaki and Hiroaki Kato},
  title={{Rules, but what for? - rule description as efficient and robust abstraction of corpora and optimal fitting to applications -}},
  year=2000,
  booktitle={Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000)},
  pages={vol. 3, 448-451},
  doi={10.21437/ICSLP.2000-568}
}