EasyTalk is a large-vocabulary, speaker-independent, continuous Chinese speech recognition system, i.e. a Chinese dictation machine (CDM), running in the WINTEL environment. This paper addresses a number of novel techniques adopted in the CDM engine on which EasyTalk is based. In the acoustic processing stage, these are the merging-based syllable detection automaton (MBSDA) and the statistical knowledge based frame synchronous search (SKB-FSS) algorithms. For the acceptance and rejection decision, they are the percentage in critical area (CAP) and recognition score gap (RSG) methods. In the language processing stage, they are the word search tree (WST), the N-Gram, and the syllable synchronous network search (SSNS) algorithm. For robustness, they are the embedded multiple model (EMM) scheme and the fuzzy syllable set (FSS).
Cite as: Zheng, F., Song, Z., Xu, M., Wu, J., Huang, Y., Wu, W., Bi, C. (1999) Easytalk: a large-vocabulary speaker-independent Chinese dictation machine. Proc. 6th European Conference on Speech Communication and Technology (Eurospeech 1999), 819-822, doi: 10.21437/Eurospeech.1999-199
@inproceedings{zheng99_eurospeech,
  author={Fang Zheng and Zhanjiang Song and Mingxing Xu and Jian Wu and Yinfei Huang and Wenhu Wu and Cheng Bi},
  title={{Easytalk: a large-vocabulary speaker-independent Chinese dictation machine}},
  year=1999,
  booktitle={Proc. 6th European Conference on Speech Communication and Technology (Eurospeech 1999)},
  pages={819--822},
  doi={10.21437/Eurospeech.1999-199}
}