ISCA Archive Interspeech 2009
ISCA Archive Interspeech 2009

SplaSH (spoken language search hawk): integrating time-aligned with text-aligned annotations

Sara Romano, Elvio Cecere, Francesco Cutugno

In this work we present SpLaSH (Spoken Language Search Hawk), a toolkit used to perform complex queries on spoken language corpora. In SpLaSH, tools for the integration of time aligned annotations (TMA), by means of annotation graphs, with text aligned ones (TXA), by means of generic XML files, are provided. SpLaSH imposes a very limited number of constraints to the data model design, allowing the integration of annotations developed separately within the same dataset and without any relative dependency. It also provides a GUI allowing three types of queries: simple query on TXA or TMA structures, sequence query on TMA structure and cross query on both TXA and TMA integrated structures.


doi: 10.21437/Interspeech.2009-453

Cite as: Romano, S., Cecere, E., Cutugno, F. (2009) SplaSH (spoken language search hawk): integrating time-aligned with text-aligned annotations. Proc. Interspeech 2009, 1487-1490, doi: 10.21437/Interspeech.2009-453

@inproceedings{romano09_interspeech,
  author={Sara Romano and Elvio Cecere and Francesco Cutugno},
  title={{SplaSH (spoken language search hawk): integrating time-aligned with text-aligned annotations}},
  year=2009,
  booktitle={Proc. Interspeech 2009},
  pages={1487--1490},
  doi={10.21437/Interspeech.2009-453}
}