Automatically Measuring L2 Speech Fluency without the Need of ASR: A Proof-of-concept Study with Japanese Learners of French

Lionel Fontan, Maxime Le Coz, Sylvain Detey


This study investigates whether automatic acoustic measures can be used to assess speech fluency in the context of second language (L2) acquisition. To this end, three experts rated speech recordings of Japanese learners of French who had been instructed to read aloud a 21-sentence text. A Forward-Backward Divergence Segmentation (FBDS) algorithm was used to segment the speech recordings (sentences) into acoustically homogeneous units at a subphonemic scale. The FBDS output was used, along with more classic measures such as the raw percentage of speech and the length and standard deviation of silent pauses, to estimate speech rate and the regularity of speech rate, while a formant tracking algorithm was used to estimate speech fluidity (i.e., quality of coarticulation). Finally, a stepwise multiple linear regression was computed to predict the experts’ mean fluency ratings. Results show that FBDS-derived measures, the raw percentage of speech, and the standard deviation of the first formant curve derivative can be combined to yield accurate estimates of speakers’ fluency scores (R = .92; P < .001). As only low-level signal features were used in the study, the method could also be relevant for assessing speakers of other target languages, as well as for assessing disordered speech.
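As an illustration of the "classic" measures named in the abstract, the following minimal Python sketch computes the raw percentage of speech, the mean and standard deviation of silent pause lengths, and the standard deviation of the first formant (F1) curve derivative. It is not the authors' implementation. The function names, the input format (a list of speech segment boundaries and a frame-wise F1 track), the use of a first-order difference as the derivative, and the choice of the population standard deviation are all assumptions made for this example. The FBDS segmentation and the formant tracker themselves are not reproduced here.

```python
from statistics import mean, pstdev


def pause_measures(speech_segments, total_duration):
    """Pause-based measures from speech segment boundaries (hypothetical helper).

    speech_segments: sorted, non-overlapping (start, end) times in seconds.
    total_duration: length of the recording in seconds.
    Only gaps between consecutive speech segments count as silent pauses;
    leading and trailing silence is ignored (an assumption of this sketch).
    """
    speech_time = sum(end - start for start, end in speech_segments)
    pauses = [
        next_start - end
        for (_, end), (next_start, _) in zip(speech_segments, speech_segments[1:])
        if next_start > end
    ]
    return {
        "percent_speech": 100.0 * speech_time / total_duration,
        "mean_pause": mean(pauses) if pauses else 0.0,
        "sd_pause": pstdev(pauses) if pauses else 0.0,
    }


def f1_derivative_sd(f1_track, frame_step):
    """Standard deviation of the F1 curve derivative (hypothetical helper).

    f1_track: F1 values in Hz, one per analysis frame.
    frame_step: time between frames in seconds.
    The derivative is approximated by a first-order difference (Hz per second).
    """
    if len(f1_track) < 2:
        raise ValueError("need at least two F1 frames")
    derivative = [(b - a) / frame_step for a, b in zip(f1_track, f1_track[1:])]
    return pstdev(derivative)


# Toy input: three speech segments in a 4-second recording.
measures = pause_measures([(0.0, 1.0), (1.5, 2.5), (3.5, 4.0)], total_duration=4.0)
f1_sd = f1_derivative_sd([500.0, 510.0, 530.0, 520.0], frame_step=0.01)
```

In a pipeline like the one the abstract describes, values of this kind would be computed per speaker and entered as candidate predictors in the regression on the experts' mean fluency ratings.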


DOI: 10.21437/Interspeech.2018-1336

Cite as: Fontan, L., Le Coz, M., Detey, S. (2018) Automatically Measuring L2 Speech Fluency without the Need of ASR: A Proof-of-concept Study with Japanese Learners of French. Proc. Interspeech 2018, 2544-2548, DOI: 10.21437/Interspeech.2018-1336.


@inproceedings{Fontan2018,
  author={Lionel Fontan and Maxime {Le Coz} and Sylvain Detey},
  title={Automatically Measuring L2 Speech Fluency without the Need of ASR: A Proof-of-concept Study with Japanese Learners of French},
  year=2018,
  booktitle={Proc. Interspeech 2018},
  pages={2544--2548},
  doi={10.21437/Interspeech.2018-1336},
  url={http://dx.doi.org/10.21437/Interspeech.2018-1336}
}