Interspeech'2005 - Eurospeech
Recognizing live baseball commentary is difficult because the speech is fast, noisy, emotional, and disfluent: the spontaneous speaking style produces rephrasing, repetition, mistakes, and grammatical deviations. To address these problems, we have been studying a speech recognition method that incorporates task-dependent knowledge of the baseball game as well as the announcer's emotion in the commentary speech. In this paper, we additionally propose a situation prediction model based on word co-occurrence. Together, these models effectively prevent speech recognition errors. The method is formalized in a probabilistic framework and implemented within the conventional speech decoding (Viterbi) algorithm. Experimental results show that the proposed approach improves structuring and segmentation accuracy as well as keyword accuracy.
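The abstract does not specify the form of the situation prediction model. As a rough illustration only, the sketch below assumes a naive-Bayes-style scorer that predicts a game situation from the words co-occurring in an utterance, using add-one smoothed counts. The function names (`train_cooccurrence`, `predict_situation`), the situation labels, and the toy data are all hypothetical and are not taken from the paper.

```python
import math
from collections import Counter, defaultdict


def train_cooccurrence(samples):
    """Count how often each word co-occurs with each situation label.

    samples: iterable of (situation_label, list_of_words) pairs.
    """
    word_counts = defaultdict(Counter)  # situation -> word -> count
    situation_counts = Counter()        # situation -> number of utterances
    vocab = set()
    for situation, words in samples:
        situation_counts[situation] += 1
        for w in words:
            word_counts[situation][w] += 1
            vocab.add(w)
    return word_counts, situation_counts, vocab


def predict_situation(words, word_counts, situation_counts, vocab):
    """Return the highest-scoring situation and all log scores.

    Assumption: words are treated as conditionally independent given the
    situation, with add-one (Laplace) smoothing. Out-of-vocabulary words
    are ignored.
    """
    total = sum(situation_counts.values())
    scores = {}
    for s, n in situation_counts.items():
        denom = sum(word_counts[s].values()) + len(vocab)
        logp = math.log(n / total)  # log prior of the situation
        for w in words:
            if w in vocab:
                logp += math.log((word_counts[s][w] + 1) / denom)
        scores[s] = logp
    return max(scores, key=scores.get), scores


# Toy, made-up training data for illustration.
samples = [
    ("strike", ["swing", "miss"]),
    ("hit", ["ball", "left", "field"]),
    ("strike", ["swing", "strike"]),
]
model = train_cooccurrence(samples)
best, scores = predict_situation(["swing"], *model)
```

In a decoder, log scores of this kind could be combined with the acoustic and language model scores during the Viterbi search; how the paper actually performs that combination is not stated in the abstract.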
Bibliographic reference. Sako, Atsushi / Takiguchi, Tetsuya / Ariki, Yasuo (2005): "Situation based speech recognition for structuring baseball live games", In INTERSPEECH-2005, 3453-3456.