Music Genre Classification Using Duplicated Convolutional Layers in Neural Networks

Hansi Yang, Wei-Qiang Zhang


Music genres are conventional categories that identify some pieces of music as belonging to a shared tradition or set of conventions. In this paper, we proposed an approach to improve music genre classification with convolutional neural networks (CNN). Using mel-scale spectrogram as the input, we used duplicate convolutional layers whose output will be applied to different pooling layers to provide more statistical information for classification. Also, we made some modifications on residual learning by taking more outputs from convolutional layers. By comparing two different network topologies, our experimental results on the GTZAN dataset show that the proposed method can effectively improve the classification accuracy.


 DOI: 10.21437/Interspeech.2019-1298

Cite as: Yang, H., Zhang, W. (2019) Music Genre Classification Using Duplicated Convolutional Layers in Neural Networks. Proc. Interspeech 2019, 3382-3386, DOI: 10.21437/Interspeech.2019-1298.


@inproceedings{Yang2019,
  author={Hansi Yang and Wei-Qiang Zhang},
  title={{Music Genre Classification Using Duplicated Convolutional Layers in Neural Networks}},
  year=2019,
  booktitle={Proc. Interspeech 2019},
  pages={3382--3386},
  doi={10.21437/Interspeech.2019-1298},
  url={http://dx.doi.org/10.21437/Interspeech.2019-1298}
}