ISCA Archive Interspeech 2017
ISCA Archive Interspeech 2017

Database of Volumetric and Real-Time Vocal Tract MRI for Speech Science

Tanner Sorensen, Zisis Skordilis, Asterios Toutios, Yoon-Chul Kim, Yinghua Zhu, Jangwon Kim, Adam Lammert, Vikram Ramanarayanan, Louis Goldstein, Dani Byrd, Krishna Nayak, Shrikanth S. Narayanan

We present the USC Speech and Vocal Tract Morphology MRI Database, a 17-speaker magnetic resonance imaging database for speech research. The database consists of real-time magnetic resonance images (rtMRI) of dynamic vocal tract shaping, denoised audio recorded simultaneously with rtMRI, and 3D volumetric MRI of vocal tract shapes during sustained speech sounds. We acquired 2D real-time MRI of vocal tract shaping during consonant-vowel-consonant sequences, vowel-consonant-vowel sequences, read passages, and spontaneous speech. We acquired 3D volumetric MRI of the full set of vowels and continuant consonants of American English. Each 3D volumetric MRI was acquired in one 7-second scan in which the participant sustained the sound. This is the first database to combine rtMRI of dynamic vocal tract shaping and 3D volumetric MRI of the entire vocal tract. The database provides a unique resource with which to examine the relationship between vocal tract morphology and vocal tract function. The USC Speech and Vocal Tract Morphology MRI Database is provided free for research use at http://sail.usc.edu/span/morphdb.


doi: 10.21437/Interspeech.2017-608

Cite as: Sorensen, T., Skordilis, Z., Toutios, A., Kim, Y.-C., Zhu, Y., Kim, J., Lammert, A., Ramanarayanan, V., Goldstein, L., Byrd, D., Nayak, K., Narayanan, S.S. (2017) Database of Volumetric and Real-Time Vocal Tract MRI for Speech Science. Proc. Interspeech 2017, 645-649, doi: 10.21437/Interspeech.2017-608

@inproceedings{sorensen17_interspeech,
  author={Tanner Sorensen and Zisis Skordilis and Asterios Toutios and Yoon-Chul Kim and Yinghua Zhu and Jangwon Kim and Adam Lammert and Vikram Ramanarayanan and Louis Goldstein and Dani Byrd and Krishna Nayak and Shrikanth S. Narayanan},
  title={{Database of Volumetric and Real-Time Vocal Tract MRI for Speech Science}},
  year=2017,
  booktitle={Proc. Interspeech 2017},
  pages={645--649},
  doi={10.21437/Interspeech.2017-608}
}