4th International Conference on Spoken Language Processing
Philadelphia, PA, USA
A subword-based neural network model for continuous speech recognition is proposed. The system consists of three modules, each composed of simple neural networks. In the first module, a network segments the speech input into non-uniform units. Non-uniform units can model phoneme variations that spread across several phonemes and across word boundaries. The second module recognizes the segmented units. Each unit has a stationary part and a transition part, and the recognition network is divided accordingly. The last module spots words by modeling the temporal representation. Results of speaker-independent word spotting of 520 words are reported.
Bibliographic reference. Yu, Ha-Jin / Oh, Yung-Hwan (1996): "A neural network using acoustic sub-word units for continuous speech recognition", In ICSLP-1996, 506-509.