EUROSPEECH 2003 - INTERSPEECH 2003
8th European Conference on Speech Communication and Technology

Geneva, Switzerland
September 1-4, 2003

        

Joint Estimation of Thresholds in a Bi-Threshold Verification Problem

Simon Ho, Brian Mak

Hong Kong University of Science & Technology, China

Verification problems are usually posted as a 2-class problem and the objective is to verify if an observation belongs to a class, say, A or its complement A'. However, we find that in a computer-assisted language learning application, because of the relatively low reliability of phoneme verification - with an equal-error-rate of more than 30% - a system built on conventional phoneme verification algorithm needs to be improved. In this paper, we propose to cast the problem as a 3-class verification problem with the addition of an "in-between" class besides A and A'. As a result, there are two thresholds to be designed in such a system. Although one may determine the two thresholds independently, better performance can be obtained by a joint estimation of these thresholds by allowing small deviation from the specified false acceptance and false rejection rates. This paper describes a cost-based approach to do that. Furthermore, issues such as per-phoneme thresholds vs. phoneme-class thresholds, and the use of bagging technique to improve the stability of thresholds are investigated. Experimental results on a kids' corpus show that cost-based thresholds and bagging improve verification performance.

Full Paper

Bibliographic reference.  Ho, Simon / Mak, Brian (2003): "Joint estimation of thresholds in a bi-threshold verification problem", In EUROSPEECH-2003, 893-896.