Third Workshop on Spoken Language Technologies for Under-resourced Languages

Cape Town, South Africa
May 7-9, 2012

Incorporating MLP Features in the Unsupervised Training Process

Thiago Fraga-Silva (1), Viet-Bac Le (2), Lori Lamel (1), Jean-Luc Gauvain (1)

(1) Spoken Language Processing Group, LIMSI-CNRS, Orsay, France; (2) Vocapia Research, Orsay, France

The combined use of multi layer perceptron (MLP) and perceptual linear prediction (PLP) features has been reported to improve the performance of automatic speech recognition systems for many different languages and domains. However, MLP features have not yet been used on unsupervised acoustic model training. This approach is introduced in this paper with encouraging results. In addition, unsupervised language model training was also investigated for a Portuguese broadcast speech recognition task, leading to a slight improvement of performance. The joint use of the unsupervised techniques presented here leads to an absolute WER reduction up to 3.2% over a baseline unsupervised system.

Index Terms: Unsupervised Training, MLP features, Acoustic Modeling, Language Modeling

Full Paper

Bibliographic reference.  Fraga-Silva, Thiago / Le, Viet-Bac / Lamel, Lori / Gauvain, Jean-Luc (2012): "Incorporating MLP features in the unsupervised training process", In SLTU-2012, 24-28.