16th Annual Conference of the International Speech Communication Association

Dresden, Germany
September 6-10, 2015

Error Analysis of Extracted Tongue Contours from 2D Ultrasound Images

Tamás Gábor Csapó (1), Steven M. Lulich (2)

(1) BME, Hungary
(2) Indiana University, USA

The goal of this study was to characterize errors involved in obtaining midsagittal tongue contours from two-dimensional ultrasound image sequences. Toward that end, two basic experiments were conducted. First, manual tongue contours were obtained from 1,145 tongue ultrasound images recorded from four speakers during production of the sentence 'I owe you a yoyo', and the uncertainty associated with the contours was quantified. Second, tongue contours from the same images were obtained using the EdgeTrak, TongueTrack, and AutoTrace algorithms, and these were compared quantitatively with the manual tongue contours. Three basic error types associated with the tongue contours are identified, indicating areas in need of improvement in future algorithmic developments. Depending on the speaker, RMS errors for the algorithmically obtained contours ranged from 1.76 to 7.11 mm, and the standard deviation of manual contours ranged from 0.97 to 2.07 mm.
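
The abstract does not state how the RMS error or the standard deviation of the manual contours was computed. As a rough illustration only, the sketch below assumes that each contour is a list of (x, y) points in millimetres, and that all contours for one image have been resampled to the same corresponding points. The function names rms_error and pointwise_std are hypothetical and do not come from the paper or from any of the tracking tools.

```python
import math
import statistics


def rms_error(manual, auto):
    """RMS of point-wise Euclidean distances between two contours (mm).

    Assumption: both contours hold the same number of corresponding
    (x, y) points; the paper may pair points differently.
    """
    if not manual or len(manual) != len(auto):
        raise ValueError("contours must be non-empty and of equal length")
    squared = [
        (mx - ax) ** 2 + (my - ay) ** 2
        for (mx, my), (ax, ay) in zip(manual, auto)
    ]
    return math.sqrt(sum(squared) / len(squared))


def pointwise_std(tracings):
    """Mean over contour points of the population std. dev. of y (mm).

    Assumption: every tracing of the same image shares the same x
    positions, so only the y values vary between tracings.
    """
    if len(tracings) < 2:
        raise ValueError("need at least two tracings of the same image")
    per_point = [
        statistics.pstdev(point[1] for point in points)
        for points in zip(*tracings)
    ]
    return sum(per_point) / len(per_point)


# Made-up example values, not data from the study.
manual = [(0.0, 10.0), (5.0, 14.0), (10.0, 15.0)]
auto = [(0.0, 11.0), (5.0, 14.0), (10.0, 13.0)]
print(f"RMS error: {rms_error(manual, auto):.2f} mm")

repeated = [[(0.0, 10.0), (5.0, 14.0)], [(0.0, 12.0), (5.0, 14.0)]]
print(f"Manual tracing spread: {pointwise_std(repeated):.2f} mm")
```

Pairing points by index keeps the example short. A nearest-neighbour pairing between contours would be an equally plausible reading of the abstract and would give different values.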


Bibliographic reference. Csapó, Tamás Gábor / Lulich, Steven M. (2015): "Error analysis of extracted tongue contours from 2D ultrasound images", in INTERSPEECH-2015, 2157-2161.