EUROSPEECH 2003 - INTERSPEECH 2003
Choosing appropriate voices for synthesizing children's stories requires text analysis techniques that can identify which portions of the text should be read by which speakers. Our work presents techniques to take raw text stories and automatically identify the quoted speech, identify the characters within the stories and assign characters to each quote. The resulting marked-up story may then be rendered with a standard speech synthesizer with appropriate voices for the characters. This paper presents each of the basic stages in identification, and the algorithms, both rule-driven and data-driven, used to achieve this. A variety of story texts are used to test our system. Results are presented with a discussion of the limitations and recommendations on how to improve speaker assignment in further texts.
Bibliographic reference. Zhang, Jason Y. / Black, Alan W. / Sproat, Richard (2003): "Identifying speakers in children's stories for speech synthesis", In EUROSPEECH-2003, 2041-2044.