ISCA Archive Interspeech 2008
ISCA Archive Interspeech 2008

A data format enabling interoperation of speech recognition, translation and information extraction engines: the GALE type system

John F. Pitrelli, Burn L. Lewis, Edward A. Epstein, Jerome L. Quinn, Ganesh Ramaswamy

Live interoperation of several speech- and text-processing engines is key to tasks such as real-time cross-language story segmentation, topic clustering, and captioning of video. One requirement for interoperation is a common data format shared across engines, so that the output of one can be understood as the input of another. The GALE Type System has been created to serve this purpose for interoperating language-identification, speaker-recognition, speech-recognition, named-entity-detection, translation, story-segmentation, topic-clustering, summarization, and headline-generation engines in the context of Unstructured Information Management Architecture. GTS includes types designed to bridge across the domains of these engines, for example, linking the text-only domain of translation to the time-domain types needed for speech processing, and the monolingual domain of information-extraction engines to the cross-language types needed for translation.


doi: 10.21437/Interspeech.2008-459

Cite as: Pitrelli, J.F., Lewis, B.L., Epstein, E.A., Quinn, J.L., Ramaswamy, G. (2008) A data format enabling interoperation of speech recognition, translation and information extraction engines: the GALE type system. Proc. Interspeech 2008, 1654-1657, doi: 10.21437/Interspeech.2008-459

@inproceedings{pitrelli08_interspeech,
  author={John F. Pitrelli and Burn L. Lewis and Edward A. Epstein and Jerome L. Quinn and Ganesh Ramaswamy},
  title={{A data format enabling interoperation of speech recognition, translation and information extraction engines: the GALE type system}},
  year=2008,
  booktitle={Proc. Interspeech 2008},
  pages={1654--1657},
  doi={10.21437/Interspeech.2008-459}
}