SLTU-2008 - First International Workshop on Spoken Languages Technologies for Under-Resourced Languages
This paper presents our recent work on automatic speech recognition (ASR) for Vietnamese. First, we describe our methods and tools for collecting and processing text data. For language modeling, we investigate word, sub-word, and hybrid word/sub-word models. For acoustic modeling, where only limited speech data are available for Vietnamese, we propose several crosslingual acoustic modeling techniques. Furthermore, since sub-word units can reduce the high out-of-vocabulary rate and mitigate the lack of text resources for statistical language modeling, we propose several methods to decompose, normalize, and combine word and sub-word lattices generated by different ASR systems. Experimental results on the VnSpeechCorpus demonstrate the feasibility of our methods.
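To illustrate why sub-word units can lower the out-of-vocabulary (OOV) rate, the following is a minimal Python sketch on invented toy data, not the paper's corpus or tooling. It assumes that multi-syllable words are written with underscores between syllables (a common convention for segmented Vietnamese text, assumed here rather than taken from the paper); the function names `oov_rate` and `to_syllables` are hypothetical.

```python
def oov_rate(train_tokens, test_tokens):
    """Fraction of test tokens that never occur in the training tokens."""
    vocab = set(train_tokens)
    if not test_tokens:
        return 0.0
    unseen = sum(1 for tok in test_tokens if tok not in vocab)
    return unseen / len(test_tokens)


def to_syllables(words):
    """Split each word into syllables (assumption: syllables joined by '_')."""
    return [syl for word in words for syl in word.split("_")]


# Toy data, invented for illustration only.
train_words = ["sinh_viên", "đại_học", "học", "tốt"]
test_words = ["học_sinh", "đại_học", "học", "tốt"]

# Word level: "học_sinh" was never seen as a whole word.
word_oov = oov_rate(train_words, test_words)

# Syllable level: "học" and "sinh" were both seen inside other words.
syllable_oov = oov_rate(to_syllables(train_words), to_syllables(test_words))

print(f"word-level OOV rate:     {word_oov:.2f}")
print(f"syllable-level OOV rate: {syllable_oov:.2f}")
```

On this toy data the word-level OOV rate is 0.25 and the syllable-level rate is 0.0, because the unseen word is composed entirely of syllables that appear in the training text. Real corpora will show smaller, data-dependent differences.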
Index Terms: ASR, Vietnamese, word, sub-word unit, acoustic modeling, language modeling.
Bibliographic reference. Le, Viet-Bac / Besacier, Laurent / Seng, Sopheap / Bigi, Brigitte / Do, Thi-Ngoc-Diep (2008): "Recent advances in automatic speech recognition for Vietnamese", In SLTU-2008, 47-52.