ISCA Archive Interspeech 2013
ISCA Archive Interspeech 2013

The AT&t speech API: a study on practical challenges for customized speech to text service

E. Gouvêa, A. Moreno-Daniel, A. Reddy, R. Chengalvarayan, D. Thomson, A. Ljolje

AT&T has recently opened its extensive portfolio of state-of-the-art Speech Technology to external end-developers as a platform called "The AT&T Speech API". This study discusses a series of practical challenges found in an industrial deployment of speech to text services, particularly, we examine different strategies for customizing the speech to text process by considering intrinsic factors, inherent to the audio signal, or extrinsic factors, available from other sources, in an industry-grade implementation.


Cite as: Gouvêa, E., Moreno-Daniel, A., Reddy, A., Chengalvarayan, R., Thomson, D., Ljolje, A. (2013) The AT&t speech API: a study on practical challenges for customized speech to text service. Proc. Interspeech 2013, 2071-2073

@inproceedings{gouvea13_interspeech,
  author={E. Gouvêa and A. Moreno-Daniel and A. Reddy and R. Chengalvarayan and D. Thomson and A. Ljolje},
  title={{The AT&t speech API: a study on practical challenges for customized speech to text service}},
  year=2013,
  booktitle={Proc. Interspeech 2013},
  pages={2071--2073}
}