Automatic Glottis Localization and Segmentation in Stroboscopic Videos Using Deep Neural Network

Achuth Rao MV, Rahul Krishnamurthy, Pebbili Gopikishore, Veeramani Priyadharshini, Prasanta Kumar Ghosh


Exact analysis of the glottal vibration patten is vital for assessing voice pathologies. One of the primary steps in this analysis is automatic glottis segmentation, which, in turn, has two main parts, namely, glottis localization and the glottis segmentation. In this paper, we propose a deep neural network (DNN) based automatic glottis localization and segmentation scheme. We pose the problem as a classification problem where colors of each pixel and its neighborhood is classified as belonging to inside or outside the glottis region. We further process the classification result to get the biggest cluster, which is declared as the segmented glottis. The proposed algorithm is evaluated on a dataset comprising of stroboscopic videos from 18 subjects where the glottis region is marked by the three Speech Language Pathologists (SLPs). On average, the proposed DNN based segmentation scheme achieves a localization performance of 65.33% and segmentation DICE score of 0.74 (absolute), which is better than the baseline scheme by 22.66% and 0.09 respectively. We also find that the DICE score obtained by the DNN based segmentation scheme correlates well with the average DICE score computed between annotation provided by any two SLPs suggesting the robustness of the proposed glottis segmentation scheme.


 DOI: 10.21437/Interspeech.2018-2572

Cite as: Rao MV, A., Krishnamurthy, R., Gopikishore, P., Priyadharshini, V., Ghosh, P.K. (2018) Automatic Glottis Localization and Segmentation in Stroboscopic Videos Using Deep Neural Network. Proc. Interspeech 2018, 3007-3011, DOI: 10.21437/Interspeech.2018-2572.


@inproceedings{Rao MV2018,
  author={Achuth {Rao MV} and Rahul Krishnamurthy and Pebbili Gopikishore and Veeramani Priyadharshini and Prasanta Kumar Ghosh},
  title={Automatic Glottis Localization and Segmentation in Stroboscopic Videos Using Deep Neural Network},
  year=2018,
  booktitle={Proc. Interspeech 2018},
  pages={3007--3011},
  doi={10.21437/Interspeech.2018-2572},
  url={http://dx.doi.org/10.21437/Interspeech.2018-2572}
}