ISCA Archive ICSLP 1998
ISCA Archive ICSLP 1998

Combining connectionist multi-band and full-band probability streams for speech recognition of natural numbers

Nikki Mirghafori, Nelson Morgan

Multi-band automatic speech recognition is a new and exploratory area of speech recognition which has been getting much attention in the research community. It has been shown that multi-band ASR reduces word error in noisy conditions, particularly in the case of narrow band noise. In this work we show that multi-band ASR could be used to improve the speech recognition accuracy of natural numbers for clean speech when the multi-band (MB) information stream is used in addition to the full-band (FB) one. We also observe that a similar combination method significantly reduces the error rate on reverberant speech. Finally, we analyze the error patterns of the full-band and multi-band paradigms to understand why the combination of the two streams is effective.


doi: 10.21437/ICSLP.1998-404

Cite as: Mirghafori, N., Morgan, N. (1998) Combining connectionist multi-band and full-band probability streams for speech recognition of natural numbers. Proc. 5th International Conference on Spoken Language Processing (ICSLP 1998), paper 1150, doi: 10.21437/ICSLP.1998-404

@inproceedings{mirghafori98_icslp,
  author={Nikki Mirghafori and Nelson Morgan},
  title={{Combining connectionist multi-band and full-band probability streams for speech recognition of natural numbers}},
  year=1998,
  booktitle={Proc. 5th International Conference on Spoken Language Processing (ICSLP 1998)},
  pages={paper 1150},
  doi={10.21437/ICSLP.1998-404}
}