Interspeech'2005 - Eurospeech
This paper presents an early study on building Vietnamese large vocabulary continuous speech recognition with concentration on choosing type of units and feature set. Our experiments were done using the HTK Toolkit and VOV broadcast corpus. The results show that the recognizer with mixture units achieved better performance than recognizers with initial-final units and phoneme units. Among feature sets are applied, MFCC has performance somewhat better than PLP, and the combination of MFCC and F0 features increases the accuracy of the Vietnamese recognition system.
Bibliographic reference. Vu, Thang Tat / Nguyen, Dung Tien / Luong, Mai Chi / Hosom, John-Paul (2005): "Vietnamese large vocabulary continuous speech recognition", In INTERSPEECH-2005, 1689-1692.