ISCA Archive ICSLP 2000

A neural network speech recognizer based on the both acoustic steady portions and transitions

Seyyed Ali Seyyed Salehi

Previous work on speech recognition with neural networks has often relied either on recognition through segmentation or on mapping the representation trajectories to the phoneme space. In these approaches, information can be lost because of the way borders are labeled. Recent work has indicated, first, that phonetic borders and transitions have good potential to be recognized as acoustic units and, second, that recognizing fast transitions with neural networks, as fixed cues in time, yields high-performance detection and recognition of those events. This approach was demonstrated by recognizing basic units formed from the VC and CV borders in spoken Farsi (Persian). Analysis of the resulting errors indicated certain discrepancies between the theoretical linguistic view and the implementation outcome: the CV, CVC and CVCC linguistic models for Farsi syllables do not always match the reality of the acoustic space in the speech signal.


Cite as: Seyyed Salehi, S.A. (2000) A neural network speech recognizer based on the both acoustic steady portions and transitions. Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000), vol. 2, 871-874

@inproceedings{seyyedsalehi00_icslp,
  author={Seyyed Ali {Seyyed Salehi}},
  title={{A neural network speech recognizer based on the both acoustic steady portions and transitions}},
  year=2000,
  booktitle={Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000)},
  volume={2},
  pages={871--874}
}