ISCA Archive Eurospeech 1999
ISCA Archive Eurospeech 1999

Auditory masking threshold estimation for broadband noise sources with application to speech enhancement

Ruhi Sarikaya, John H. L. Hansen

This paper addresses issues encountered in the use of an Auditory Masking Threshold (AMT) for speech enhancement and proposes an algorithm to improve AMT estimation for broadband noise sources. We determined that while AMT estimation is fairly accurate, and hence an enhancement scheme based on AMT can suppress audible noise to a greater extent for low frequency colored noise sources, the algorithm fails to converge to the clean speech AMT for broadband communication channel noise. We propose a new AMT estimation scheme and incorporate the proposed algorithm into a previously developed enhancement framework [2].We evaluate our algorithm on a set of sentences obtained from the standard TIMIT database for at communications channel noise (FLN), and automobile highway noise (HWY) at 5 dB and 0 dB SNR levels, respectively. Evaluations were performed for 8 kHz and 16 kHz sampled speech and performance is measured with both objective and subjective assessment methods. The results show that the new AMT codebook based enhancement method is more effective than traditional AMT methods. Also, that traditional AMT methods may not be as effective for reduced bandwidth speech (4 kHz), or broadband interference, but that alternative AMT estimation methods can help improve convergence properties.


doi: 10.21437/Eurospeech.1999-565

Cite as: Sarikaya, R., Hansen, J.H.L. (1999) Auditory masking threshold estimation for broadband noise sources with application to speech enhancement. Proc. 6th European Conference on Speech Communication and Technology (Eurospeech 1999), 2571-2574, doi: 10.21437/Eurospeech.1999-565

@inproceedings{sarikaya99_eurospeech,
  author={Ruhi Sarikaya and John H. L. Hansen},
  title={{Auditory masking threshold estimation for broadband noise sources with application to speech enhancement}},
  year=1999,
  booktitle={Proc. 6th European Conference on Speech Communication and Technology (Eurospeech 1999)},
  pages={2571--2574},
  doi={10.21437/Eurospeech.1999-565}
}