Speech and language researchers need to manage and analyze increasing
quantities of material. Tools are available for the individual stages
of the work, but they often require the researcher to switch between
interfaces and to convert the output of each tool into suitable input
for the next one.
The Language Bank of Finland (Kielipankki) is developing an on-line
platform called Mylly for processing speech and language data in a
graphical user interface that integrates different tools into a single
workflow. Mylly provides tools and computational resources for processing
material and for inspecting the results. The tools plugged into
Mylly include a parser, morphological analyzers, generic finite-state
technology, and a speech recognizer. Users can upload data and download
any intermediate results in the tool chain. Mylly runs on CSC’s
Taito cluster and is an instance of the Chipster platform. Access
to Mylly is granted for academic use.
The Language Bank of Finland is a collection of corpora, tools and other
services maintained
by FIN-CLARIN, a consortium of Finnish universities and research organizations
coordinated by the University of Helsinki. The technological infrastructure
for the Language Bank of Finland is provided by CSC – IT Center
for Science.
Cite as: Lennes, M., Piitulainen, J., Matthiesen, M. (2017) Mylly — The Mill: A New Platform for Processing Speech and Text Corpora Easily and Efficiently. Proc. Interspeech 2017, 829-830
@inproceedings{lennes17_interspeech, author={Mietta Lennes and Jussi Piitulainen and Martin Matthiesen}, title={{Mylly — The Mill: A New Platform for Processing Speech and Text Corpora Easily and Efficiently}}, year=2017, booktitle={Proc. Interspeech 2017}, pages={829--830} }