1st Joint SIG-IL/Microsoft Workshop on Speech and Language Technologies for Iberian Languages
Porto Salvo, Portugal
In this paper, an XML resource definition is presented fitting in with the architecture of a multilingual (Spanish, English, Basque) spoken document retrieval system. The XML resource not only stores all the information extracted from the audio signal, but also adds the structure required to create an index database and retrieve information according to various criteria. The XML resource is based on the concept of segment and provides generic but powerful mechanisms to characterize segments and group segments into sections. Audio and video files described through this XML resource can be easily exploited in other tasks, such as topic tracking, speaker diarization, etc.
Bibliographic reference. Bordel, Germán / Casillas, Arantza / Penagarikano, Mikel / Rodríguez-Fuentes, Luis J. / Varona, Amparo (2009): "An XML resource definition for spoken document retrieval", In SLTECH-2009, 31-34.