![]() |
Workshop on the Auditory Basis of Speech PerceptionKeele University, UK |
![]() |
Auditory representations of clean speech contain much redundancy. Arguably, it is this redundancy which enables listeners to recognise speech in adverse conditions. Under the assumption that some time-frequency regions are too heavily masked to derive any useful estimate of speech level, the auditory system faces the missing data problem: how to achieve robust performance with incomplete evidence. This paper develops several techniques for overcoming the missing data problem and demonstrates that high-performance automatic speech recognition can be achieved for quite high degrees of data masking, but that further information, such as provided by context or through auditory induction constraints, will be required to handle realistic masking conditions.
Bibliographic reference. Cooke, Martin / Morris, Andrew / Green, Phil (1996): "Recognising occluded speech", In ABSP-1996, 297-300.