Second European Conference on Speech Communication and Technology

Genova, Italy
September 24-26, 1991


Selection of Speech Units for a Speaker-Independent CSR Task

Lorenzo Fissore (1), Egidio P. Giachin (1), P. Laface (2), G. Micca (1)

(1) CSELT - Centro Studi e Laboratori Telecomunicazioni, Torino, Italy
(2) Dipartimento di Automatica e Informatica - Politecnico di Torino, Torino, Italy

This paper focuses on the problem of finding a set of Hidden Markov Models that can be trained to model context dependencies with good statistical accuracy, given the constraint of a fixed amount of training data. Two aspects have been investigated in this work: clustering of intra-word context-dependent units with similar contexts on the basis of different similarity measures, and definition of inter-word coarticulation units. A Dynamic Programming procedure is presented that allows a large set of context-dependent units to be clustered into a given number of units while optimizing a global cost measure. Inter-word units were found to provide better phonetic representations of word junctures and to increase recognition accuracy, though by a smaller margin than has been reported for the English language.
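
The abstract does not spell out the Dynamic Programming procedure, so the following is only a minimal sketch of the general idea of clustering items into a given number of groups while minimizing a global cost. Everything in it is an assumption made for illustration: each unit is reduced to a single scalar feature, clusters are contiguous runs of the sorted features, and the global cost is the sum of within-cluster squared deviations from the cluster mean. The names `dp_cluster` and `segment_cost` are hypothetical, and the similarity measures used in the paper may differ entirely.

```python
from typing import List, Tuple


def segment_cost(prefix: List[float], prefix_sq: List[float], i: int, j: int) -> float:
    """Sum of squared deviations from the mean for the sorted items i..j-1."""
    n = j - i
    s = prefix[j] - prefix[i]
    sq = prefix_sq[j] - prefix_sq[i]
    return sq - s * s / n


def dp_cluster(values: List[float], k: int) -> Tuple[float, List[List[float]]]:
    """Split the sorted values into k contiguous clusters of minimal total cost.

    Assumption (not from the paper): units are scalars and the cost is the
    within-cluster sum of squared deviations.
    """
    xs = sorted(values)
    n = len(xs)
    if not 1 <= k <= n:
        raise ValueError("k must be between 1 and the number of items")

    # Prefix sums let each segment cost be evaluated in constant time.
    prefix = [0.0]
    prefix_sq = [0.0]
    for x in xs:
        prefix.append(prefix[-1] + x)
        prefix_sq.append(prefix_sq[-1] + x * x)

    inf = float("inf")
    # best[c][j]: minimal cost of splitting the first j items into c clusters.
    best = [[inf] * (n + 1) for _ in range(k + 1)]
    # back[c][j]: start index of the last cluster in that optimal split.
    back = [[0] * (n + 1) for _ in range(k + 1)]
    best[0][0] = 0.0

    for c in range(1, k + 1):
        for j in range(c, n + 1):
            for i in range(c - 1, j):
                cand = best[c - 1][i] + segment_cost(prefix, prefix_sq, i, j)
                if cand < best[c][j]:
                    best[c][j] = cand
                    back[c][j] = i

    # Follow the back-pointers from the end to recover the clusters.
    clusters: List[List[float]] = []
    j = n
    for c in range(k, 0, -1):
        i = back[c][j]
        clusters.append(xs[i:j])
        j = i
    clusters.reverse()
    return best[k][n], clusters
```

Under these assumptions the recurrence guarantees a globally optimal split for the chosen cost, at a time cost that grows with the number of clusters times the square of the number of items. Real context-dependent units would need a multidimensional similarity measure, which this sketch does not attempt to model.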


