This paper describes a method of collecting multilingual speech data for use in compiling spoken command vocabularies for ICT devices and services in the EU, the EFTA countries, Turkey, and Russia. The resulting vocabularies will be published as a European standard for industry to use when producing such devices and services. The work is set in the context of the EU i2010 framework, which addresses the main challenges and developments in ICT up to 2010.
Bibliographic reference. Orr, Rosemary / Llinares, Bernat González i / Petersen, Françoise / Hüttenrauch, Helge / Böcker, Martin / Tate, Michael (2007): "Collection of empirical data for standardization of generic vocabularies in speech driven ICT devices and services", In INTERSPEECH-2007, 1310-1313.