Large-vocabulary, speaker-independent continuous speech recognition systems have advanced enormously, but essentially only for US English, and this has created strong demand for comparable systems in other languages. In this paper we present our work on developing a large-vocabulary, speaker-independent continuous speech recognition hybrid system for European Portuguese. The task is difficult because this technology is still at an early stage of development for European Portuguese. Developing such a system for a new language depends on the availability of appropriate source components, mainly a speech corpus and large amounts of text. This work became possible through the creation of a new database, BD-PUBLICO, a large-vocabulary speech corpus for European Portuguese that we developed over the last two years.
Cite as: Neto, J.P., Martins, C., Almeida, L.B. (1998) A large vocabulary continuous speech recognition hybrid system for the Portuguese language. Proc. 5th International Conference on Spoken Language Processing (ICSLP 1998), paper 0562, doi: 10.21437/ICSLP.1998-659
@inproceedings{neto98_icslp,
  author={Joao P. Neto and Ciro Martins and Luis B. Almeida},
  title={{A large vocabulary continuous speech recognition hybrid system for the Portuguese language}},
  year=1998,
  booktitle={Proc. 5th International Conference on Spoken Language Processing (ICSLP 1998)},
  pages={paper 0562},
  doi={10.21437/ICSLP.1998-659}
}