5th International Conference on Spoken Language Processing
In this paper we define an acoustic confidence measure based on the estimates of local posterior probabilities produced by an HMM/ANN large vocabulary continuous speech recognition system. We use this measure to segment continuous audio into regions that are worth the recognition effort and regions that are not. The segmentation is computationally inexpensive and reduces both overall word error rate and decoding time. The technique is evaluated using material from the Broadcast News corpus.
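The general idea of confidence-based segmentation can be illustrated with a minimal sketch. The abstract does not give the definition of the measure, so everything specific here is an assumption made for illustration: the per-frame confidence is taken to be the log of the largest class posterior, it is smoothed with a moving average, and a fixed threshold splits the frames into regions to decode and regions to skip. The names `frame_confidence`, `smooth`, `segment`, `threshold` and `window` are hypothetical and do not come from the paper.

```python
import numpy as np


def frame_confidence(posteriors):
    """Assumed confidence: log of the largest class posterior in each frame."""
    posteriors = np.asarray(posteriors, dtype=float)
    return np.log(np.clip(posteriors.max(axis=1), 1e-12, 1.0))


def smooth(values, window):
    """Moving average, renormalised so edge frames average only real frames.

    Assumes window <= len(values).
    """
    kernel = np.ones(window) / window
    weighted = np.convolve(values, kernel, mode="same")
    coverage = np.convolve(np.ones_like(values), kernel, mode="same")
    return weighted / coverage


def segment(posteriors, threshold, window=1):
    """Return (start, end, decode) runs; end is exclusive, decode is a bool."""
    conf = smooth(frame_confidence(posteriors), window)
    keep = conf >= threshold
    segments = []
    start = 0
    for i in range(1, len(keep) + 1):
        # Close the current run at the end of the data or when the label flips.
        if i == len(keep) or keep[i] != keep[start]:
            segments.append((start, i, bool(keep[start])))
            start = i
    return segments


# Toy input: three frames with sharp posteriors, then three with flat ones.
toy_posteriors = [[0.9, 0.05, 0.05]] * 3 + [[0.4, 0.3, 0.3]] * 3
toy_segments = segment(toy_posteriors, threshold=np.log(0.6))
```

In this sketch only the runs flagged for decoding would be passed to the recogniser, which is where a saving in decoding time would come from. The threshold and window length are free parameters that would need tuning on held-out data.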
Bibliographic reference. Barker, Jon / Williams, Gethin / Renals, Steve (1998): "Acoustic confidence measures for segmenting broadcast news", In ICSLP-1998, paper 0643.