A Neural Model to Predict Parameters for a Generalized Command Response Model of Intonation

Bastian Schnell, Philip N. Garner


The Generalised Command Response (GCR) model is a time-local model of intonation that has been shown to lend itself to (cross-language) transfer of emphasis. In order to generalise the model to longer prosodic sequences, we show that it can be driven by a recurrent neural network emulating a spiking neural network. We show that a loss function for error backpropagation can be formulated analogously to that of the Spike Pattern Association Neuron (SPAN) method for spiking networks. The resulting system is able to generate prosody comparable to a state-of-the-art deep neural network implementation, but potentially retaining the transfer capabilities of the GCR model.


 DOI: 10.21437/Interspeech.2018-1904

Cite as: Schnell, B., Garner, P.N. (2018) A Neural Model to Predict Parameters for a Generalized Command Response Model of Intonation. Proc. Interspeech 2018, 3147-3151, DOI: 10.21437/Interspeech.2018-1904.


@inproceedings{Schnell2018,
  author={Bastian Schnell and Philip N. Garner},
  title={A Neural Model to Predict Parameters for a Generalized Command Response Model of Intonation},
  year=2018,
  booktitle={Proc. Interspeech 2018},
  pages={3147--3151},
  doi={10.21437/Interspeech.2018-1904},
  url={http://dx.doi.org/10.21437/Interspeech.2018-1904}
}