Sixth International Conference on Spoken Language Processing
October 16-20, 2000
The Use of Syllable Segmentation Information in Continuous Speech Recognition Hybrid Systems Applied to the Portuguese Language
Hugo Meinedo, Joao P. Neto
INESC - IST, Lisboa, Portugal
Recent works have shown that the use of syllables as the
basic unit in a speech recognition system could be very useful.
These works introduced methods exploiting syllable
information as a mean to add robustness in "traditional"
systems that use phonemes/phones as the basic unit. Being
the Portuguese a highly syllabic language we expected that
information from syllables would introduce potential benefits
in speech recognition tasks. Following these ideas we
started by developing different methods of automatic syllable
segmentation. Next we applied the best segmentation
method to our large vocabulary continuous speech corpus
(BD-PUBLICO) achieving an accuracy of 72%. We developed
a process to use the segmentation information in
the acoustic models of our baseline speech recognisers for
the Portuguese language. The results obtained by the modified
recognition systems on 5k and 27k vocabulary tasks
showed that the use of basic syllable segmentation information
helps the systems to improve their overall performance
by roughly 10%.
Meinedo, Hugo / Neto, Joao P. (2000):
"The use of syllable segmentation information in continuous speech recognition hybrid systems applied to the Portuguese language",
In ICSLP-2000, vol.2, 927-930.