5th International Conference on Spoken Language Processing
This paper presents a trial study on the use of context-dependent segmental duration for continuous speech recognition in a domain-specific application. Different modelling strategies are proposed for function words and content words. Stress status, word position in the utterance, and phone position in the word are identified as the three most important factors affecting segmental duration in this particular application. In addition, speaking rate normalization is applied to further reduce duration variability. Experimental results show that the normalized duration models can help improve the rank of the correct sentence in the N-best hypothesis list.
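As an illustration only, the general idea of rescoring an N-best list with rate-normalized, context-dependent duration models might be sketched as follows. The abstract does not specify the model form, so everything here is an assumption: the Gaussian duration model, the mean-ratio speaking-rate estimate, the additive score combination, and all names (`Gaussian`, `rate_factor`, `duration_score`, `rescore`) are hypothetical and are not taken from the paper.

```python
import math
from dataclasses import dataclass
from typing import Dict, List, Tuple

# Hypothetical context key: (stressed, word position in utterance, phone position in word).
Context = Tuple[bool, str, str]
# One segment of a hypothesis: its context and its observed duration in seconds.
Segment = Tuple[Context, float]


@dataclass
class Gaussian:
    """Assumed duration model for one context (not the paper's stated model)."""
    mean: float  # expected duration in seconds
    std: float   # standard deviation in seconds

    def log_prob(self, x: float) -> float:
        z = (x - self.mean) / self.std
        return -0.5 * z * z - math.log(self.std * math.sqrt(2.0 * math.pi))


def rate_factor(segments: List[Segment], models: Dict[Context, Gaussian]) -> float:
    """Assumed rate estimate: mean ratio of observed to expected duration."""
    ratios = [dur / models[ctx].mean for ctx, dur in segments]
    return sum(ratios) / len(ratios)


def duration_score(segments: List[Segment], models: Dict[Context, Gaussian]) -> float:
    """Sum of log-likelihoods of the rate-normalized segment durations."""
    r = rate_factor(segments, models)
    return sum(models[ctx].log_prob(dur / r) for ctx, dur in segments)


def rescore(nbest, models, weight: float = 1.0):
    """Re-rank (sentence, recognizer_score, segments) entries, best first.

    The combined score is recognizer_score + weight * duration_score,
    which is an assumed combination rule.
    """
    scored = [
        (sentence, score + weight * duration_score(segments, models))
        for sentence, score, segments in nbest
    ]
    return sorted(scored, key=lambda item: item[1], reverse=True)
```

In this sketch, a hypothesis whose segment durations are uniformly stretched or compressed relative to the context means is not penalized, because the rate factor absorbs the global tempo; only durations that are inconsistent with each other lower the score.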
Bibliographic reference. Lee, Tan / Carlson, Rolf / Granström, Björn (1998): "Context-dependent duration modelling for continuous speech recognition", In ICSLP-1998, paper 0441.