ISCA Archive ICSLP 1992
ISCA Archive ICSLP 1992

Continuous speech recognition for medical diagnoses using a character trigram model

Sho-ichi Matsunaga, Toshiaki Tsuboi, Tomokazu Yamada, Kiyohiro Shikano

This paper describes a continuous speech recognition system for medical diagnoses that uses a trigram model based on sequences of Japanese characters. Dictation of medical diagnoses is one of the most promising applications of speech recognition. We devised a prototype based on consonant-vowel spotting using the DP matching technique, as demonstrated at ICSLP'90, and have since improved this system by using a Japanese character trigram model , a phrase syntax and phoneme-based hidden Markov models. Speaker-dependent recognition tests have been done on 543 phrases about X-ray CT scanning. The word lexicon has about 3600 entries. The trigram model reduced the character perplexity from 11.8 to 3.6. Using the new system, 95.8% of the input phrases were correctly transcribed, compared with the 61.5% reported at ICSLP'90. These results show the effectiveness of the character trigram model for continuous speech recognition.


doi: 10.21437/ICSLP.1992-244

Cite as: Matsunaga, S.-i., Tsuboi, T., Yamada, T., Shikano, K. (1992) Continuous speech recognition for medical diagnoses using a character trigram model. Proc. 2nd International Conference on Spoken Language Processing (ICSLP 1992), 727-730, doi: 10.21437/ICSLP.1992-244

@inproceedings{matsunaga92_icslp,
  author={Sho-ichi Matsunaga and Toshiaki Tsuboi and Tomokazu Yamada and Kiyohiro Shikano},
  title={{Continuous speech recognition for medical diagnoses using a character trigram model}},
  year=1992,
  booktitle={Proc. 2nd International Conference on Spoken Language Processing (ICSLP 1992)},
  pages={727--730},
  doi={10.21437/ICSLP.1992-244}
}