We attempted to estimate subjective scores of the Japanese Diagnostic Rhyme Test (DRT), a two-to-one forced selection speech intelligibility test, using automatic speech recognizers with language models that force one of the words in the word-pair. The acoustic models were adapted to the speaker, and then adapted to noise at a specified SNR. The match between subjective and recognition scores improved significantly when the adapted noise level and the tested level match. However, when SNR conditions do not match, the recognition scores degraded especially when test SNR conditions were higher than the adapted level.
Bibliographic reference. Kondo, Kazuhiro / Takano, Yusuke (2010): "Estimation of two-to-one forced selection intelligibility scores by speech recognizers using noise-adapted models", In INTERSPEECH-2010, 302-305.