ISCA Archive SpeechProsody 2010
ISCA Archive SpeechProsody 2010

Combining greedy algorithms with expert guided manipulation for the definition of a balanced prosodic Spanish-catalan radio news corpus

David Escudero-Mancebo, C. González-Ferreras, Juan María Garrido Almiñana, E. Rodero, Lourdes Aguilar, Antonio Bonafonte

This article reports the process of building a bilingual (Spanish-Catalan) text corpus balanced in parallel taking into account prosodic features for both languages. We propose an expert guideline for text manipulation that in combination with greedy algorithms significantly improves the quality of the selected corpus. The application of this methodology to a radio news corpus empirically supports the proposed strategy.


Cite as: Escudero-Mancebo, D., González-Ferreras, C., Garrido Almiñana, J.M., Rodero, E., Aguilar, L., Bonafonte, A. (2010) Combining greedy algorithms with expert guided manipulation for the definition of a balanced prosodic Spanish-catalan radio news corpus. Proc. Speech Prosody 2010, paper 061

@inproceedings{escuderomancebo10_speechprosody,
  author={David Escudero-Mancebo and C. González-Ferreras and Juan María {Garrido Almiñana} and E. Rodero and Lourdes Aguilar and Antonio Bonafonte},
  title={{Combining greedy algorithms with expert guided manipulation for the definition of a balanced prosodic Spanish-catalan radio news corpus}},
  year=2010,
  booktitle={Proc. Speech Prosody 2010},
  pages={paper 061}
}