We present an improvement to our previous work to create a toolkit for integrating automated speech recognition, prosodic feature analysis, and machine learning to create models for identifying and classifying speech characteristics such as filled pauses. The toolkit provides a modular and extensible platform for intaking, analyzing, and formatting data for use in a wide variety of other tools.
Index Terms: speech recognition, machine learning, toolkit, prosody
Bibliographic reference. Okamoto, Jacob / Pakhomov, Serguei / Shriberg, Elizabeth / Stolcke, Andreas (2012): "ProTK: an improved prosody toolkit", In INTERSPEECH-2012, 1892-1893.