ISCA Archive Eurospeech 1999

Minimum confusibility training of context dependent demiphones

Albino Nogueiras-Rodríguez, José B. Marino

In recent years, two different approaches have been widely used to improve acoustic modeling in continuous speech recognition systems: discriminative training algorithms and context dependent subword units. However, while each of these techniques on its own gives much better results than standard maximum likelihood trained phone models, their combination, i.e. discriminative training of context dependent units, has proved to be a much more difficult task. In this paper we deal with minimum confusibility training of demiphones using the TIMIT database. By applying this approach, recently introduced by the authors, the string error rate in the recognition of TIDIGITS using demiphones is reduced by some 24% with respect to maximum likelihood training. This improvement comes in addition to the 8% reduction already provided by demiphones with respect to minimum confusibility trained phones.


doi: 10.21437/Eurospeech.1999-603

Cite as: Nogueiras-Rodríguez, A., Marino, J.B. (1999) Minimum confusibility training of context dependent demiphones. Proc. 6th European Conference on Speech Communication and Technology (Eurospeech 1999), 2741-2744, doi: 10.21437/Eurospeech.1999-603

@inproceedings{nogueirasrodriguez99_eurospeech,
  author={Albino Nogueiras-Rodríguez and José B. Marino},
  title={{Minimum confusibility training of context dependent demiphones}},
  year=1999,
  booktitle={Proc. 6th European Conference on Speech Communication and Technology (Eurospeech 1999)},
  pages={2741--2744},
  doi={10.21437/Eurospeech.1999-603}
}