The Airbus Air Traffic Control Speech Recognition 2018 Challenge: Towards ATC Automatic Transcription and Call Sign Detection

Thomas Pellegrini, Jérôme Farinas, Estelle Delpech, François Lancelot


In this paper, we describe the outcomes of the challenge organized and run by Airbus and partners in 2018 on Air Traffic Control (ATC) speech recognition. The challenge consisted of two tasks applied to English ATC speech: 1) automatic speech-to-text transcription, 2) call sign detection (CSD). The registered participants were provided with 40 hours of speech along with manual transcriptions. Twenty-two teams submitted predictions on a five hour evaluation set. ATC speech processing is challenging for several reasons: high speech rate, foreign-accented speech with a great diversity of accents, noisy communication channels. The best ranked team achieved a 7.62% Word Error Rate and a 82.41% CSD F1-score. Transcribing pilots’ speech was found to be twice as harder as controllers’ speech. Remaining issues towards solving ATC ASR are also discussed in the paper.


 DOI: 10.21437/Interspeech.2019-1962

Cite as: Pellegrini, T., Farinas, J., Delpech, E., Lancelot, F. (2019) The Airbus Air Traffic Control Speech Recognition 2018 Challenge: Towards ATC Automatic Transcription and Call Sign Detection. Proc. Interspeech 2019, 2993-2997, DOI: 10.21437/Interspeech.2019-1962.


@inproceedings{Pellegrini2019,
  author={Thomas Pellegrini and Jérôme Farinas and Estelle Delpech and François Lancelot},
  title={{The Airbus Air Traffic Control Speech Recognition 2018 Challenge: Towards ATC Automatic Transcription and Call Sign Detection}},
  year=2019,
  booktitle={Proc. Interspeech 2019},
  pages={2993--2997},
  doi={10.21437/Interspeech.2019-1962},
  url={http://dx.doi.org/10.21437/Interspeech.2019-1962}
}