This study focuses on the detection of shouted speech in realistic noisy conditions. An automatic system based on modified mel-frequency cepstral coefficient (MFCC) feature extraction and Gaussian mixture model (GMM) classification is developed. The performance of the automatic system is compared against human perception as measured by a listening test. At moderate noise levels, the automatic system outperforms humans. In severe noise conditions, classification by humans is clearly better.
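As a rough illustration of GMM-based classification, the sketch below scores feature frames against two diagonal-covariance mixtures and thresholds the average log-likelihood ratio. It is not the authors' implementation: the abstract does not specify the MFCC modification, the model sizes, or the decision rule, so the function names, the two-dimensional toy parameters, and the zero threshold are all assumptions made for this example, and the frames are assumed to be already-extracted feature vectors.

```python
import math

def log_gaussian_diag(x, mean, var):
    """Log density of a diagonal-covariance Gaussian at feature vector x."""
    return sum(
        -0.5 * (math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v)
        for xi, m, v in zip(x, mean, var)
    )

def gmm_log_likelihood(x, gmm):
    """Log-likelihood of x under a GMM given as (weight, mean, var) tuples."""
    terms = [math.log(w) + log_gaussian_diag(x, m, v) for w, m, v in gmm]
    peak = max(terms)  # log-sum-exp trick for numerical stability
    return peak + math.log(sum(math.exp(t - peak) for t in terms))

def detect_shout(frames, shout_gmm, normal_gmm, threshold=0.0):
    """Return True if the mean per-frame log-likelihood ratio exceeds threshold."""
    llr = sum(
        gmm_log_likelihood(f, shout_gmm) - gmm_log_likelihood(f, normal_gmm)
        for f in frames
    ) / len(frames)
    return llr > threshold

# Hypothetical toy models with made-up parameters (illustration only).
SHOUT_GMM = [(0.5, [3.0, 3.0], [1.0, 1.0]), (0.5, [4.0, 2.0], [1.0, 1.0])]
NORMAL_GMM = [(0.5, [0.0, 0.0], [1.0, 1.0]), (0.5, [-1.0, 1.0], [1.0, 1.0])]
```

In a real system the mixture parameters would be estimated from labeled training data, and the threshold would be tuned to trade off missed detections against false alarms at the noise level of interest.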
Bibliographic reference. Pohjalainen, Jouni / Raitio, Tuomo / Alku, Paavo (2011): "Detection of shouted speech in the presence of ambient noise", In INTERSPEECH-2011, 2621-2624.