12th Annual Conference of the International Speech Communication Association

Florence, Italy
August 27-31. 2011

Evaluation of Tree-Trellis Based Decoding in Over-Million LVCSR

Naoaki Ito, Yoshihiko Nankaku, Akinobu Lee, Keiichi Tokuda

Nagoya Institute of Technology, Japan

Very large vocabulary continuous speech recognition (CSR) that can recognize every sentence is one of important goals in speech recognition. Several attempts have been made to achieve very large vocabulary CSR. However, very large vocabulary CSR using a tree-trellis based decoder has not been reported. We report the performance evaluation and improvement of the "Julius" treetrellis based decoder in large vocabulary CSR (LVCSR) involving more than one million vocabulary, referred to here as over-million LVCSR. Experiments indicated that Julius achieved a word accuracy of about 91% and a real time factor of about 2 in over-million LVCSR for Japanese newspaper speech transcription.

Full Paper

Bibliographic reference.  Ito, Naoaki / Nankaku, Yoshihiko / Lee, Akinobu / Tokuda, Keiichi (2011): "Evaluation of tree-trellis based decoding in over-million LVCSR", In INTERSPEECH-2011, 1937-1940.