To integrate microphone array processing into speech recognition, we previously proposed a speech recognition algorithm based on 3-D Viterbi search, which localizes the target talker using the likelihoods of HMMs (hidden Markov models) while performing speech recognition. The performance of the 3-D Viterbi search method depends on how much the beamforming technique improves the SNR (signal-to-noise ratio). This paper proposes a novel method that uses an adaptive beamforming technique in place of the delay-and-sum beamformer used in our previous study. Speaker-dependent isolated-word recognition experiments were carried out on real-environment data to evaluate the effect of the adaptive beamformer. The results show that the adaptive beamformer drastically improves recognition performance for both a fixed-position talker and a moving talker.
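For readers unfamiliar with the baseline mentioned above, the following is a minimal sketch of a delay-and-sum beamformer, not the adaptive beamformer this paper proposes and not the authors' implementation. It assumes integer sample delays and a noise-free synthetic source. The names `delay_and_sum`, `simulate_array`, and `power` are hypothetical, and the signal, array size, and delay values are chosen only for illustration.

```python
import math


def simulate_array(source, delays):
    """Hypothetical helper: each microphone hears the source `d` samples late."""
    return [[0.0] * d + list(source) for d in delays]


def delay_and_sum(channels, delays):
    """Advance each channel by its steering delay (in samples) and average."""
    n = min(len(ch) - d for ch, d in zip(channels, delays))
    m = len(channels)
    return [sum(ch[t + d] for ch, d in zip(channels, delays)) / m
            for t in range(n)]


def power(signal):
    """Mean squared amplitude of a signal."""
    return sum(v * v for v in signal) / len(signal)


# Assumed test signal: five periods of a sine over 64 samples.
source = [math.sin(2 * math.pi * 5 * t / 64) for t in range(64)]

# Assumed arrival delays (in samples) at four microphones.
true_delays = [0, 2, 4, 6]
channels = simulate_array(source, true_delays)

# Steering at the true delays aligns the channels and recovers the source.
aligned = delay_and_sum(channels, true_delays)

# Steering with the wrong delays sums the channels out of phase.
misaligned = delay_and_sum(channels, [0, 0, 0, 0])

print(power(aligned) > power(misaligned))
```

When the steering delays match the talker's actual position, the channels add coherently. When they do not, the target is attenuated. This is one reason a localization error, or a beamformer with limited noise suppression, can reduce the SNR seen by the recognizer.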
Cite as: Yamada, T., Nakamura, S., Shikano, K. (1998) An effect of adaptive beamforming on hands-free speech recognition based on 3-d viterbi search. Proc. 5th International Conference on Spoken Language Processing (ICSLP 1998), paper 0484, doi: 10.21437/ICSLP.1998-302
@inproceedings{yamada98_icslp,
  author={Takeshi Yamada and Satoshi Nakamura and Kiyohiro Shikano},
  title={{An effect of adaptive beamforming on hands-free speech recognition based on 3-d viterbi search}},
  year=1998,
  booktitle={Proc. 5th International Conference on Spoken Language Processing (ICSLP 1998)},
  pages={paper 0484},
  doi={10.21437/ICSLP.1998-302}
}