This paper presents a new approach to measuring how confidently a word has been correctly recognized, or confidence measure. The approach consists of three major steps: (1) standard decoding; (2) forced Viterbi alignment (needed for stack decoders); and (3) rank-ordering of the subphone scores. More specifically, from the aligned sentence the third step computes the likelihood scores of the hypothesized subphone and all other competing subphones. A list of the subphones is generated in the descending order of the scores and a rank is assigned to the hypothesized subphone according to its positioning. Additional processing of selective weighting and upper-bound limiting is applied to minimize contamination of rank computations by bad segments or by highly-variable phones. The obtained rank is then used as the confidence measure. Results of word rejection experiments show that the new approach outperforms other measures such as whole-word scores by reducing the equal error rate from 32% to 20%.
Cite as: Lin, Q., Das, S., Lubensky, D., Picheny, M. (1998) A new confidence measure based on rank-ordering subphone scores. Proc. 5th International Conference on Spoken Language Processing (ICSLP 1998), paper 0806, doi: 10.21437/ICSLP.1998-818
@inproceedings{lin98c_icslp, author={Qiguang Lin and Subrata Das and David Lubensky and Michael Picheny}, title={{A new confidence measure based on rank-ordering subphone scores}}, year=1998, booktitle={Proc. 5th International Conference on Spoken Language Processing (ICSLP 1998)}, pages={paper 0806}, doi={10.21437/ICSLP.1998-818} }