ISCA Archive Interspeech 2007

Evaluating acoustic distance measures for template based recognition

Mathias De Wachter, Kris Demuynck, Patrick Wambacq, Dirk Van Compernolle

In this paper we investigate the behaviour of different acoustic distance measures for template based speech recognition when acoustic distances are combined with linguistic knowledge and template concatenation fluency costs. To that end, different acoustic distance measures are compared on tasks with varying levels of fluency and linguistic constraints. We show that adopting those constraints invariably results in a template sequence that is clearly suboptimal acoustically being chosen as the winning hypothesis. This has strong implications for the design of acoustic distance measures: distance measures that are optimal for frame based classification may prove to be suboptimal for full sentence recognition. In particular, we show this is the case when comparing the Euclidean and the recently introduced adaptive kernel local Mahalanobis distance measures.
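As a rough illustration of the two families of frame-level distance named in the abstract, the sketch below compares a plain Euclidean distance with a Mahalanobis distance that uses a fixed diagonal covariance. The feature vectors, the variances, and the function names are hypothetical and chosen only for this example. The adaptive kernel local Mahalanobis measure evaluated in the paper estimates its local covariance in its own way, which is not reproduced here.

```python
import math

def euclidean(x, y):
    """Euclidean distance between two feature vectors of equal length."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(x, y)))

def diag_mahalanobis(x, y, variances):
    """Mahalanobis distance under a diagonal covariance.

    `variances` holds one variance per feature dimension. This is a
    simplified stand-in (an assumption), not the paper's adaptive
    kernel local Mahalanobis measure.
    """
    return math.sqrt(sum((a - b) ** 2 / v for a, b, v in zip(x, y, variances)))

# Hypothetical 2-dimensional frames and local variances, for illustration only.
frame = [1.0, 2.0]
template_frame = [4.0, 6.0]
local_variances = [9.0, 4.0]

d_euc = euclidean(frame, template_frame)
d_mah = diag_mahalanobis(frame, template_frame, local_variances)
print(d_euc, d_mah)
```

The Mahalanobis variant rescales each dimension by its variance, so two template frames can rank differently under the two measures even for the same input frame.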

doi: 10.21437/Interspeech.2007-321

Cite as: De Wachter, M., Demuynck, K., Wambacq, P., Van Compernolle, D. (2007) Evaluating acoustic distance measures for template based recognition. Proc. Interspeech 2007, 874-877, doi: 10.21437/Interspeech.2007-321

  author={Mathias De Wachter and Kris Demuynck and Patrick Wambacq and Dirk Van Compernolle},
  title={{Evaluating acoustic distance measures for template based recognition}},
  booktitle={Proc. Interspeech 2007},