A Knowledge Driven Structural Segmentation Approach for Play-Talk Classification During Autism Assessment

Manoj Kumar, Pooja Chebolu, So Hyun Kim, Kassandra Martinez, Catherine Lord, Shrikanth Narayanan


Automatically segmenting conversational audio into semantically relevant components has both computational and analytical significance. In this paper, we segment play activities and conversational portions interspersed during clinically administered interactions between a psychologist and a child with autism spectrum disorder (ASD). We show that various acoustic-prosodic and turn-taking features commonly used in the literature differ between these segments and hence can possibly influence further inference tasks. We adopt a two-step approach for the segmentation problem by taking advantage of the structural relation between the two segments. First, we use a supervised machine learning algorithm to estimate class posteriors at frame-level. Next, we use an explicit-duration hidden Markov model (EDHMM) to align the states using the posteriors from the previous step. The durational distributions for both play and talk regions are learnt from training data and modeled using the EDHMM. Our results show that speech features can be used to successfully discriminate between play and talk activities, each providing important insights into the child’s condition.


 DOI: 10.21437/Interspeech.2018-1516

Cite as: Kumar, M., Chebolu, P., Kim, S.H., Martinez, K., Lord, C., Narayanan, S. (2018) A Knowledge Driven Structural Segmentation Approach for Play-Talk Classification During Autism Assessment. Proc. Interspeech 2018, 2763-2767, DOI: 10.21437/Interspeech.2018-1516.


@inproceedings{Kumar2018,
  author={Manoj Kumar and Pooja Chebolu and So Hyun Kim and Kassandra Martinez and Catherine Lord and Shrikanth Narayanan},
  title={A Knowledge Driven Structural Segmentation Approach for Play-Talk Classification During Autism Assessment},
  year=2018,
  booktitle={Proc. Interspeech 2018},
  pages={2763--2767},
  doi={10.21437/Interspeech.2018-1516},
  url={http://dx.doi.org/10.21437/Interspeech.2018-1516}
}