ISCA Archive ICSLP 1998
ISCA Archive ICSLP 1998

Neural network based pronunciation modeling with applications to speech recognition

Toshiaki Fukada, Takayoshi Yoshimura, Yoshinori Sagisaka

We propose a method for automatically generating a pronunciation dictionary based on a pronunciation neural network that can predict plausible pronunciations (realized pronunciations) from canonical pronunciations. This method can generate multiple forms of realized pronunciations using the pronunciation network. Experimental results on spontaneous speech show that the automatically-derived pronunciation dictionary gives consistently higher recognition rates than a conventional dictionary.


doi: 10.21437/ICSLP.1998-399

Cite as: Fukada, T., Yoshimura, T., Sagisaka, Y. (1998) Neural network based pronunciation modeling with applications to speech recognition. Proc. 5th International Conference on Spoken Language Processing (ICSLP 1998), paper 0658, doi: 10.21437/ICSLP.1998-399

@inproceedings{fukada98_icslp,
  author={Toshiaki Fukada and Takayoshi Yoshimura and Yoshinori Sagisaka},
  title={{Neural network based pronunciation modeling with applications to speech recognition}},
  year=1998,
  booktitle={Proc. 5th International Conference on Spoken Language Processing (ICSLP 1998)},
  pages={paper 0658},
  doi={10.21437/ICSLP.1998-399}
}