ISCA Archive Interspeech 2009
ISCA Archive Interspeech 2009

Human voice or prompt generation? can they co-exist in an application?

Géza Németh, Csaba Zainkó, Mátyás Bartalis, Gábor Olaszy, Géza Kiss

This paper describes an R&D project regarding procedures for the automatic maintenance of the interactive voice response (IVR) system of a mobile telecom operator. The original plan was to create a generic voice prompt generation system for the customer service department. The challenge was to create a solution that is hard to distinguish from the human speaker (i.e. passing a sort of Turing-test) so its output can be freely mixed with original human recordings. The domain of the solution at the first step had to be narrowed down to the price lists of available mobile phones and services. This is updated weekly, so the final operational system generates about 3 hours of speech at each weekend. It operates under human supervision but without intervention in the speech generation process. It was tested both by academic procedures and company customers and was accepted as fulfilling the original requirements.


doi: 10.21437/Interspeech.2009-221

Cite as: Németh, G., Zainkó, C., Bartalis, M., Olaszy, G., Kiss, G. (2009) Human voice or prompt generation? can they co-exist in an application? Proc. Interspeech 2009, 620-623, doi: 10.21437/Interspeech.2009-221

@inproceedings{nemeth09_interspeech,
  author={Géza Németh and Csaba Zainkó and Mátyás Bartalis and Gábor Olaszy and Géza Kiss},
  title={{Human voice or prompt generation? can they co-exist in an application?}},
  year=2009,
  booktitle={Proc. Interspeech 2009},
  pages={620--623},
  doi={10.21437/Interspeech.2009-221}
}