Interspeech'2005 - Eurospeech
Acoustic differences between native accents may be too subtle for straightforward brute-force techniques, such as blindly clustered Gaussian mixture model (GMM) classifiers, to discriminate satisfactorily, even though these methods work well for more pronounced differences such as language, gender, or channel. This paper shows that such coarse classifiers detect small channel differences more easily than native accent differences. Native accent classification improves considerably when knowledge of the underlying phoneme sequence is incorporated and phoneme-specific GMMs are used. Further improvements are obtained when optimal feature selection is combined with the phoneme-dependent GMMs, which results in using less than 10% of the original features. Together, the presented methods yield a relative error rate reduction of more than 40% in a 5-class classification task.
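The idea of phoneme-specific GMM scoring can be illustrated with a minimal sketch. It is not the authors' implementation: the function names (`diag_gmm_loglik`, `classify_accent`), the diagonal-covariance assumption, the optional `selected` feature mask, and all toy parameters and labels are hypothetical and chosen only for illustration. Each frame is scored by the GMM belonging to its phoneme label, log-likelihoods are summed per accent, and the highest-scoring accent is returned.

```python
import math


def diag_gmm_loglik(x, gmm):
    """Log-likelihood of one feature vector under a diagonal-covariance GMM.

    gmm is a list of (weight, means, variances) tuples (assumed layout).
    """
    comp_logs = []
    for weight, means, variances in gmm:
        ll = math.log(weight)
        for xi, m, v in zip(x, means, variances):
            ll += -0.5 * (math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v)
        comp_logs.append(ll)
    # Log-sum-exp over mixture components for numerical stability.
    top = max(comp_logs)
    return top + math.log(sum(math.exp(c - top) for c in comp_logs))


def classify_accent(frames, phonemes, models, selected=None):
    """Return the accent whose phoneme-specific GMMs best explain the frames.

    models[accent][phoneme] is a GMM. selected[phoneme], if given, lists the
    feature indices kept for that phoneme; the GMMs must then have been
    trained on exactly those dimensions.
    """
    scores = {}
    for accent, per_phoneme in models.items():
        total = 0.0
        for x, ph in zip(frames, phonemes):
            if selected is not None:
                x = [x[i] for i in selected[ph]]
            total += diag_gmm_loglik(x, per_phoneme[ph])
        scores[accent] = total
    return max(scores, key=scores.get), scores


# Toy example with made-up parameters: two accents, two phonemes, 2-D features.
models = {
    "A": {
        "a": [(1.0, [0.0, 0.0], [1.0, 1.0])],
        "i": [(0.5, [2.0, 2.0], [1.0, 1.0]), (0.5, [3.0, 3.0], [1.0, 1.0])],
    },
    "B": {
        "a": [(1.0, [1.0, 1.0], [1.0, 1.0])],
        "i": [(1.0, [-2.0, -2.0], [1.0, 1.0])],
    },
}
frames = [[0.1, -0.1], [2.4, 2.6]]
phonemes = ["a", "i"]  # phoneme label for each frame, e.g. from an alignment
label, scores = classify_accent(frames, phonemes, models)
```

In this sketch the phoneme labels are taken as given; in practice they would come from a transcription or a forced alignment, and the GMM parameters would be estimated from training data for each accent and phoneme.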
Bibliographic reference. Wu, Tingyao / Van Compernolle, Dirk / Duchateau, Jacques / Yang, Qian / Martens, Jean-Pierre (2005): "Improving the discrimination between native accents when recorded over different channels", In INTERSPEECH-2005, 2821-2824.