The segmentation of the training data has become the most used initialization method for HMM training. The Viterbi reestimation has been widely applied for that purpose. We introduce a new segmentation method, based on the maximization of the Entropic Cohesion Measure (ECM) between segments and observations, which is equivalent to minimize the entropy model. This maximization is carried out by looking for the optimal boundaries between segments of the training utterances. Thus, we obtain an optimal segmentation (in the ECM sense) that achieves similar performance to the Viterbi reestimation with a considerable computational saving.
Bibliographic reference. Peinado, Antonio M. / Roman, Ramon / Segura, Jose C. / Rubio, Antonio J. / Garcia, Pedro / Diaz, Jesus E. (1991): "Entropic training for HMM speech recognition", In EUROSPEECH-1991, 651-654.