A system for recognising continuously spoken sentences is presented. The system has a vocabulary of approx. 35 words and a grammar specifying a few thousand sentences. The system operates in three stages. In the first stage, cepstrum vectors are computed in real time and used as input to a self organised neural network. The output of the network is mapped to a continuous valued acoustic phonetic distinctive feature vector for each frame of the speech signal. These vectors are in the second stage processed by a multi layer perceptron which is trained to estimate segment boundaries. The output from this stage is a discrete valued acoustic phonetic distinctive feature for each segment of the speech signal (allophones). The third stage contains an expert system, which processes the allophones using a lexicon and a parsing system.
Cite as: Baekgaard, A., Dalsgaard, P. (1989) Recognition of continuous speech using neural nets and expert system. Proc. First European Conference on Speech Communication and Technology (Eurospeech 1989), 2545-2548, doi: 10.21437/Eurospeech.1989-93
@inproceedings{baekgaard89_eurospeech, author={Anders Baekgaard and Paul Dalsgaard}, title={{Recognition of continuous speech using neural nets and expert system}}, year=1989, booktitle={Proc. First European Conference on Speech Communication and Technology (Eurospeech 1989)}, pages={2545--2548}, doi={10.21437/Eurospeech.1989-93} }