ISCA Archive Interspeech 2021

Self-Paced Ensemble Learning for Speech and Audio Classification

Nicolae-Cătălin Ristea, Radu Tudor Ionescu

Combining multiple machine learning models into an ensemble is known to yield better performance than the individual components forming the ensemble. This is because the models can complement each other in making better decisions. Instead of just combining the models, we propose a self-paced ensemble learning scheme in which models learn from each other over several iterations. During the self-paced learning process, which is based on pseudo-labeling, our ensemble not only improves its individual models but also gains knowledge about the target domain. To demonstrate the generality of our self-paced ensemble learning (SPEL) scheme, we conduct experiments on three audio tasks. Our empirical results indicate that SPEL significantly outperforms the baseline ensemble models. We also show that applying self-paced learning to individual models is less effective, which supports the idea that models in the ensemble actually learn from each other.
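
As a rough illustration of the idea described in the abstract, the following toy sketch runs a self-paced pseudo-labeling loop over a small ensemble. It is not the authors' implementation: the nearest-centroid models, the feature subsets, the confidence-based selection, the growing selection schedule (`step`), and all names (`CentroidModel`, `ensemble_proba`, `self_paced_ensemble`) are assumptions made for this example, and the data points are made up.

```python
import math


class CentroidModel:
    """Toy nearest-centroid classifier that sees only a subset of features (assumption)."""

    def __init__(self, feature_idx):
        self.feature_idx = feature_idx
        self.centroids = {}

    def fit(self, X, y):
        sums, counts = {}, {}
        for x, label in zip(X, y):
            v = [x[i] for i in self.feature_idx]
            acc = sums.setdefault(label, [0.0] * len(v))
            for j, value in enumerate(v):
                acc[j] += value
            counts[label] = counts.get(label, 0) + 1
        self.centroids = {c: [s / counts[c] for s in sums[c]] for c in sums}

    def predict_proba(self, x):
        v = [x[i] for i in self.feature_idx]
        # Softmax over negative squared distances to each class centroid.
        scores = {c: -sum((a - b) ** 2 for a, b in zip(v, mu))
                  for c, mu in self.centroids.items()}
        top = max(scores.values())
        exps = {c: math.exp(s - top) for c, s in scores.items()}
        total = sum(exps.values())
        return {c: e / total for c, e in exps.items()}


def ensemble_proba(models, x):
    """Average the class probabilities of all models in the ensemble."""
    classes = list(models[0].centroids)
    return {c: sum(m.predict_proba(x)[c] for m in models) / len(models)
            for c in classes}


def self_paced_ensemble(models, X_lab, y_lab, X_unlab, iterations=3, step=0.25):
    """Retrain every model on pseudo-labels produced by the whole ensemble.

    Assumed schedule: iteration t keeps the top t * step fraction of the
    unlabeled target samples, ranked by ensemble confidence.
    """
    for m in models:
        m.fit(X_lab, y_lab)
    for t in range(1, iterations + 1):
        scored = []
        for x in X_unlab:
            p = ensemble_proba(models, x)
            label = max(p, key=p.get)
            scored.append((p[label], label, x))
        # Most confident pseudo-labels first.
        scored.sort(key=lambda item: item[0], reverse=True)
        k = min(len(scored), max(1, int(t * step * len(scored))))
        chosen = scored[:k]
        X_aug = X_lab + [x for _, _, x in chosen]
        y_aug = y_lab + [label for _, label, _ in chosen]
        # Each model learns from labels agreed on by the ensemble.
        for m in models:
            m.fit(X_aug, y_aug)
    return models


# Made-up labeled source data and slightly shifted unlabeled target data.
X_lab = [[0.0, 0.0, 0.0], [0.5, 0.2, 0.1], [4.0, 4.0, 4.0], [4.2, 3.8, 4.1]]
y_lab = [0, 0, 1, 1]
X_unlab = [[1.0, 1.0, 1.0], [1.2, 0.8, 1.1], [5.0, 5.0, 5.0], [4.8, 5.2, 5.1]]

models = [CentroidModel([0, 1]), CentroidModel([1, 2]), CentroidModel([0, 2])]
models = self_paced_ensemble(models, X_lab, y_lab, X_unlab)
probs_low = ensemble_proba(models, [1.0, 1.0, 1.0])
probs_high = ensemble_proba(models, [5.0, 5.0, 5.0])
```

In this sketch the pseudo-labels come from the averaged ensemble output rather than from any single model, which is one simple way to let the models learn from each other while adapting to the unlabeled target samples.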

doi: 10.21437/Interspeech.2021-155

Cite as: Ristea, N.-C., Ionescu, R.T. (2021) Self-Paced Ensemble Learning for Speech and Audio Classification. Proc. Interspeech 2021, 2836-2840, doi: 10.21437/Interspeech.2021-155

  author={Nicolae-Cătălin Ristea and Radu Tudor Ionescu},
  title={{Self-Paced Ensemble Learning for Speech and Audio Classification}},
  booktitle={Proc. Interspeech 2021},