SLTU-2008 - First International Workshop on Spoken Languages Technologies for Under-Resourced Languages

Hanoi, Vietnam
May 5-7, 2008

Recent Advances in Automatic Speech Recognition for Vietnamese

Viet-Bac Le (1), Laurent Besacier (1), Sopheap Seng (1,2), Brigitte Bigi (1), Thi-Ngoc-Diep Do (1,2)

(1) LIG Laboratory, UMR 5217, Grenoble, France
(2) International Research Center MICA, CNRS/UMI-2954, Hanoi, Vietnam

This paper presents our recent work on automatic speech recognition for Vietnamese. First, we describe our methods and tools for text data collection and processing. For language modeling, we investigate word, sub-word, and hybrid word/sub-word models. For acoustic modeling, when only limited Vietnamese speech data are available, we propose several cross-lingual acoustic modeling techniques. Furthermore, since sub-word units can reduce the high out-of-vocabulary rate and mitigate the lack of text resources in statistical language modeling, we propose several methods to decompose, normalize, and combine word and sub-word lattices generated by different ASR systems. Experimental results on the VnSpeechCorpus demonstrate the feasibility of our methods.
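The claim that sub-word units reduce the out-of-vocabulary (OOV) rate can be illustrated with a minimal sketch. It is not the authors' tooling: the function names, the toy data, and the convention of joining the syllables of a multi-syllable word with an underscore are all assumptions made for illustration, and diacritics are omitted for brevity.

```python
def oov_rate(train_tokens, test_tokens):
    """Return the fraction of test tokens absent from the training vocabulary."""
    vocab = set(train_tokens)
    unseen = sum(1 for tok in test_tokens if tok not in vocab)
    return unseen / len(test_tokens)


def to_syllables(words):
    """Split underscore-joined words into syllables (assumed convention)."""
    return [syl for word in words for syl in word.split("_")]


# Hypothetical toy data, not taken from the VnSpeechCorpus.
train_words = ["sinh_vien", "hoc", "tieng", "viet", "hoc_sinh", "di", "hoc"]
test_words = ["hoc_vien", "di", "hoc", "tieng", "viet"]

# Word level: "hoc_vien" never occurs in training, so it is OOV.
word_oov = oov_rate(train_words, test_words)

# Syllable level: "hoc" and "vien" were both seen inside other words.
syl_oov = oov_rate(to_syllables(train_words), to_syllables(test_words))

print(f"word-level OOV rate:     {word_oov:.2f}")
print(f"syllable-level OOV rate: {syl_oov:.2f}")
```

On this toy data the unseen word is fully covered once it is decomposed into syllables that occurred inside other training words, which is the effect the abstract attributes to sub-word units.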

Index Terms – ASR, Vietnamese, word, sub-word unit, acoustic modeling, language modeling.

Bibliographic reference. Le, Viet-Bac / Besacier, Laurent / Seng, Sopheap / Bigi, Brigitte / Do, Thi-Ngoc-Diep (2008): "Recent advances in automatic speech recognition for Vietnamese", In SLTU-2008, 47-52.