Sixth International Conference on Spoken Language Processing
(ICSLP 2000)

Beijing, China
October 16-20, 2000

The Use of Syllable Segmentation Information in Continuous Speech Recognition Hybrid Systems Applied to the Portuguese Language

Hugo Meinedo, Joao P. Neto

INESC - IST, Lisboa, Portugal

Recent works have shown that the use of syllables as the basic unit in a speech recognition system could be very useful. These works introduced methods exploiting syllable information as a mean to add robustness in "traditional" systems that use phonemes/phones as the basic unit. Being the Portuguese a highly syllabic language we expected that information from syllables would introduce potential benefits in speech recognition tasks. Following these ideas we started by developing different methods of automatic syllable segmentation. Next we applied the best segmentation method to our large vocabulary continuous speech corpus (BD-PUBLICO) achieving an accuracy of 72%. We developed a process to use the segmentation information in the acoustic models of our baseline speech recognisers for the Portuguese language. The results obtained by the modified recognition systems on 5k and 27k vocabulary tasks showed that the use of basic syllable segmentation information helps the systems to improve their overall performance by roughly 10%.


Full Paper

Bibliographic reference.  Meinedo, Hugo / Neto, Joao P. (2000): "The use of syllable segmentation information in continuous speech recognition hybrid systems applied to the Portuguese language", In ICSLP-2000, vol.2, 927-930.