This paper uses the method of mutual information to estimate the distribution of partial phonetic information in the time-frequency plane relative to an acoustic landmark. TIMIT transcriptions are parsed to estimate the locations of consonant closure landmarks, consonant release landmarks, manner change landmarks, and vowel or glide pivot landmarks. A mel-scale spectrogram is computed over the 250ms centered at each landmark, and the logarithmic energy of each point in time-frequency space is linearly quantized. The phoneme label associated with a landmark determines the values of 25 binary distinctive features. Finally, coincidences between feature and spectral energy values are counted, and the average log probabilities are calculated in order to produce an \infogram" of each distinctive feature: a measurement of the mutual information between the value of the feature and the energy of each point in the time-frequency plane.
Cite as: Hasegawa-Johnson, M. (2000) Time-frequency distribution of partial phonetic information measured using mutual information. Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000), vol. 4, 133-136, doi: 10.21437/ICSLP.2000-769
@inproceedings{hasegawajohnson00_icslp, author={Mark Hasegawa-Johnson}, title={{Time-frequency distribution of partial phonetic information measured using mutual information}}, year=2000, booktitle={Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000)}, pages={vol. 4, 133-136}, doi={10.21437/ICSLP.2000-769} }