ISCA Archive Interspeech 2013
ISCA Archive Interspeech 2013

Vocal tract cross-distance estimation from real-time MRI using region-of-interest analysis

Adam Lammert, Vikram Ramanarayanan, Michael Proctor, Shrikanth Narayanan

Real-Time Magnetic Resonance Imaging affords speech articulation data with good spatial and temporal resolution and complete midsagittal views of the moving vocal tract, but also brings many challenges in the domain of image processing and analysis. Region-of-interest analysis has previously been proposed for simple, efficient and robust extraction of linguistically-meaningful constriction degree information. However, the accuracy of such methods has not been rigorously evaluated, and no method has been proposed to calibrate the pixel intensity values or convert them into absolute measurements of length. This work provides such an evaluation, as well as insights into the placement of regions in the image plane and calibration of the resultant pixel intensity measurements. Measurement errors are shown to be generally at or below the spatial resolution of the imaging protocol with a high degree of consistency across time and overall vocal tract configuration, validating the utility of this method of image analysis.


doi: 10.21437/Interspeech.2013-168

Cite as: Lammert, A., Ramanarayanan, V., Proctor, M., Narayanan, S. (2013) Vocal tract cross-distance estimation from real-time MRI using region-of-interest analysis. Proc. Interspeech 2013, 959-962, doi: 10.21437/Interspeech.2013-168

@inproceedings{lammert13_interspeech,
  author={Adam Lammert and Vikram Ramanarayanan and Michael Proctor and Shrikanth Narayanan},
  title={{Vocal tract cross-distance estimation from real-time MRI using region-of-interest analysis}},
  year=2013,
  booktitle={Proc. Interspeech 2013},
  pages={959--962},
  doi={10.21437/Interspeech.2013-168}
}