Using periodic energy to enrich acoustic representations of pitch in speech: A demonstration

Aviad Albert, Francesco Cangemi, Martine Grice


This paper aims to strengthen the link between acoustic and perceptual representations of intonation, a link that has been weakened by the over-reliance on the F0 trajectory, which can only be interpreted in relation to landmarks in the segmental string, placed manually or semi-automatically at a separate stage in the analysis. Only then can F0 events be identified as linguistically relevant (e.g. early, medial or late peaks, accentual tones or edge tones etc.). We provide an analysis and visualization of two acoustic dimensions contributing towards the perceived pitch contour, F0 rate over time and, crucially, periodic energy. Periodic energy reflects the degree to which pitch is intelligible, a higher value representing a stronger F0 signal that is consequently more easily perceived. A representation that includes F0 strength is thus able to flag portions of the speech signal that are relevant for the analysis of intonation, without the need for a separate segmentation of the signal into phones and syllables.


 DOI: 10.21437/SpeechProsody.2018-162

Cite as: Albert, A., Cangemi, F., Grice, M. (2018) Using periodic energy to enrich acoustic representations of pitch in speech: A demonstration. Proc. 9th International Conference on Speech Prosody 2018, 804-808, DOI: 10.21437/SpeechProsody.2018-162.


@inproceedings{Albert2018,
  author={Aviad Albert and Francesco Cangemi and Martine Grice},
  title={Using periodic energy to enrich acoustic representations of pitch in speech: A demonstration},
  year=2018,
  booktitle={Proc. 9th International Conference on Speech Prosody 2018},
  pages={804--808},
  doi={10.21437/SpeechProsody.2018-162},
  url={http://dx.doi.org/10.21437/SpeechProsody.2018-162}
}