ISCA Archive Eurospeech 1999
ISCA Archive Eurospeech 1999

Focus detection by comparison of speech waveforms

Satoshi Kitagawa, Nick Campbell

For the eficient translation of speech by machine, the word sequence alone is not always sufficient to convey the intended meaning. Prosodic information can be lost in the speech recognition process. This paper presents methods by which focus can be detected in the input speech using timing and pitch information. By comparing the prosodic characteristics of an input utterance against profiles generated by components of a speech synthesiser for a default rendition of the same sequence of words, we are able to detect areas in the signal where prominence has been added.


doi: 10.21437/Eurospeech.1999-408

Cite as: Kitagawa, S., Campbell, N. (1999) Focus detection by comparison of speech waveforms. Proc. 6th European Conference on Speech Communication and Technology (Eurospeech 1999), 1867-1870, doi: 10.21437/Eurospeech.1999-408

@inproceedings{kitagawa99_eurospeech,
  author={Satoshi Kitagawa and Nick Campbell},
  title={{Focus detection by comparison of speech waveforms}},
  year=1999,
  booktitle={Proc. 6th European Conference on Speech Communication and Technology (Eurospeech 1999)},
  pages={1867--1870},
  doi={10.21437/Eurospeech.1999-408}
}