The OMe (Octave-Median) scale: a natural scale for speech melody.

Céline De Looze, Daniel Hirst


Fundamental frequency, the primary acoustic correlate of speech melody, is generally analysed and displayed using a linear scale (Hertz) or a logarithmic one, generally in semitones and usually offset to an arbitrary reference level such as 100 Hz. In this paper we argue that a more natural scale for analysing speech is the OME (Octave-MEdian) scale, using the octave (o) as the basic unit, offset to the median value of the speaker’s range. We present results showing that a reasonable estimate of a speaker’s pitch range can be obtained directly from the median.


 DOI: 10.21437/SpeechProsody.2014-170

Cite as: Looze, C.D., Hirst, D. (2014) The OMe (Octave-Median) scale: a natural scale for speech melody.. Proc. 7th International Conference on Speech Prosody 2014, 910-914, DOI: 10.21437/SpeechProsody.2014-170.


@inproceedings{Looze2014,
  author={Céline De Looze and Daniel Hirst},
  title={{The OMe (Octave-Median) scale: a natural scale for speech melody.}},
  year=2014,
  booktitle={Proc. 7th International Conference on Speech Prosody 2014},
  pages={910--914},
  doi={10.21437/SpeechProsody.2014-170},
  url={http://dx.doi.org/10.21437/SpeechProsody.2014-170}
}