September 22-25, 1997
Recent work in automatic speech recognition (ASR) shows that band-splitting, in which the signal is processed along multiple sub-band paths that are recombined at the decision stage, can give recognition accuracy comparable with the conventional full-band approach. One of the many interesting questions in band-splitting concerns the bandwidth of each sub-band and the use of frequency warping functions such as mel. This paper examines the use of mel and linear frequency scales in the context of band-splitting and speaker recognition. We demonstrate how sub-band error profiles can lead to a new scale, lying between linear and mel, that gives both an equalised sub-band error profile and improved overall recognition accuracy.
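As an illustration of what a scale "between linear and mel" might look like, the sketch below computes sub-band edge frequencies on a linear scale, on a mel scale, and on a simple blend of the two. It is only a sketch under stated assumptions: the abstract does not give the paper's scale, which is derived from measured sub-band error profiles. The mel formula used is the common 2595 log10(1 + f/700) approximation, and the function `band_edges` and the blending parameter `alpha` are hypothetical names introduced here for illustration.

```python
import math


def hz_to_mel(f_hz):
    """Common mel approximation (an assumption; the abstract names no formula)."""
    return 2595.0 * math.log10(1.0 + f_hz / 700.0)


def mel_to_hz(m):
    """Inverse of hz_to_mel."""
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)


def band_edges(n_bands, f_max_hz, alpha):
    """Return n_bands + 1 sub-band edge frequencies in Hz.

    alpha = 0.0 gives equal-width bands on a linear scale,
    alpha = 1.0 gives equal-width bands on the mel scale,
    and values in between linearly blend the two sets of edges.
    This blend is a hypothetical stand-in, not the paper's scale.
    """
    if n_bands < 1:
        raise ValueError("n_bands must be at least 1")
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must lie in [0, 1]")
    m_max = hz_to_mel(f_max_hz)
    edges = []
    for i in range(n_bands + 1):
        frac = i / n_bands
        linear_edge = frac * f_max_hz           # equal spacing in Hz
        mel_edge = mel_to_hz(frac * m_max)      # equal spacing in mel
        edges.append((1.0 - alpha) * linear_edge + alpha * mel_edge)
    return edges


if __name__ == "__main__":
    # Compare linear, blended, and mel band edges for 4 bands up to 4 kHz.
    for a in (0.0, 0.5, 1.0):
        print(a, [round(e, 1) for e in band_edges(4, 4000.0, a)])
```

In an experiment of the kind the abstract describes, a parameter such as `alpha` could in principle be tuned until the per-band error rates are roughly equal, but the procedure actually used by the authors is not stated here.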
Bibliographic reference. Auckenthaler, Roland / Mason, John S. (1997): "Equalizing sub-band error rates in speaker recognition", In EUROSPEECH-1997, 2303-2306.