16th Annual Conference of the International Speech Communication Association

Dresden, Germany
September 6-10, 2015

Multilingual Features Based Keyword Search for Very Low-Resource Languages

Pavel Golik, Zoltán Tüske, Ralf Schlüter, Hermann Ney

RWTH Aachen University, Germany

In this paper we describe RWTH Aachen's system for keyword search (KWS) with very limited amount of transcribed audio data available in the target language. This setting has become this year's primary condition within the Babel project [1], seeking to minimize the amount of human effort while retaining a reasonable KWS performance. Thus the highlights presented in this paper include graphemic acoustic modeling; multilingual features trained on language data from the previous project periods; comparison of tandem and hybrid DNN-HMM acoustic models; processing of large amounts of text data available on the web and the morphological KWS based on automatically derived word fragments.
    The evaluation is performed using two training sets for each of the six current project period's languages — full language pack (FLP), consisting of 30 hours and very limited language pack (VLLP), comprising less than 3 hours of transcribed audio data. We put our focus on the latter of the two, which is clearly more challenging. The methods described in this work allowed us to exceed 0.3 MTWV on five out of six languages using development queries.

Full Paper

Bibliographic reference.  Golik, Pavel / Tüske, Zoltán / Schlüter, Ralf / Ney, Hermann (2015): "Multilingual features based keyword search for very low-resource languages", In INTERSPEECH-2015, 1260-1264.