8th Annual Conference of the International Speech Communication Association

Antwerp, Belgium
August 27-31, 2007

Speeding-Up Neural Network Training Using Sentence and Frame Selection

Stefano Scanzio (1), Pietro Laface (1), Roberto Gemello (2), Franco Mana (2)

(1) Politecnico di Torino, Italy
(2) Loquendo, Italy

Training Artificial Neural Networks (ANNs) with large amounts of speech data is a time intensive task due to the intrinsically sequential nature of the back-propagation algorithm.

This paper presents an approach for training ANNs using sentence and frame selection. The goal is to speed-up the training process, and to balance the phonetic coverage of the selected frames, trying to mitigate the classification problems related to the prior probabilities of the individual phonetic classes.

These techniques, together with a three-step training approach and software optimizations, reduced by an order of magnitude the training time of our models.

Full Paper

Bibliographic reference.  Scanzio, Stefano / Laface, Pietro / Gemello, Roberto / Mana, Franco (2007): "Speeding-up neural network training using sentence and frame selection", In INTERSPEECH-2007, 1725-1728.