ISCA Archive ICSLP 2000
ISCA Archive ICSLP 2000

An experimental study of an audio indexing system for the web

Beth Logan, Pedro Moreno, Jean-Manuel van Thong, Ed Whittaker

We have developed a speech recognition based audio search engine for indexing spoken documents found on the World Wide Web. Our site (http://www.compaq.com/speechbot) indexes around 20 news and talk radio shows covering a wide range of topics, speaking styles and acoustic conditions from a selection of public Web sites with multimedia archives. In this paper, we describe our system and its performance, focusing on the speech recognition and retrieval aspects. We describe our training procedure in some detail and report our historical error rate since the site launch. We also investigate the impact of Out Of Vocabulary (OOV) words. Finally we report the results of retrieval experiments which demonstrate that our system can index effectively.


Cite as: Logan, B., Moreno, P., Thong, J.-M.v., Whittaker, E. (2000) An experimental study of an audio indexing system for the web. Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000), vol. 2, 676-679

@inproceedings{logan00_icslp,
  author={Beth Logan and Pedro Moreno and Jean-Manuel van Thong and Ed Whittaker},
  title={{An experimental study of an audio indexing system for the web}},
  year=2000,
  booktitle={Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000)},
  pages={vol. 2, 676-679}
}