|
ISCA Best Paper Awards - 2025We would like to highlight the award-winning papers! Each year ISCA awards 3 best student papers at Interspeech based on anonymous reviewing and presentation at the conference. The Interspeech Area Chairs nominate candidate papers that are assessed by a jury with representatives from the ISCA Board, Area Chairs and the Interspeech Technical Program Chairs. The jury for the best student paper award is impartial, i.e. members cannot participate in the voting if (s)he is in any way involved in/with any of the award candidate. Each paper is awarded 500 euros to be split between the student authors. Best Papers of the journals Speech Communication, and Computer Speech and Language are also announced by ISCA during Interspeech. Please see best paper awards going back to 2000 here. Please see recent best paper awards: 2024. |
ISCA Award for Best Student Paper (students in bold)On the Relationship between Accent Strength and Articulatory Features Kevin Huang, Sean Foley, Jihwan Lee, Yoonjeong Lee, Dani Byrd, Shrikanth Narayanan [pdf] OWSM v4: Improving Open Whisper-Style Speech Models via Data Scaling and Cleaning Models Yifan Peng, Muhammad Shakeel, Yui Sudo, William Chen, Jinchuan Tian, Chyi-Jiunn Lin and Shinji Watanabe [pdf] Attention Models and Auditory Transduction Features for Noise Robustness Cathal Ó Faoláin and Andrew Hines [pdf]
|
Transfer learning from adult to children for speech recognition: Evaluation, analysis and recommendations
Prashanth Gurunath Shivakumar and Panayiotis Georgiou, Computer Speech & Language, Volume 63, 2020 [link]
|
Abstract: Children speech recognition is challenging mainly due to the inherent high variability in children’s physical and articulatory characteristics and expressions. This variability manifests in both acoustic constructs and linguistic usage due to the rapidly changing developmental stage in children’s life. Part of the challenge is due to the lack of large amounts of available children speech data for efficient modeling. This work attempts to address the key challenges using transfer learning from adult’s models to children’s models in a Deep Neural Network (DNN) framework for children’s Automatic Speech Recognition (ASR) task evaluating on multiple children’s speech corpora with a large vocabulary. The paper presents a systematic and an extensive analysis of the proposed transfer learning technique considering the key factors affecting children’s speech recognition from prior literature. Evaluations are presented on (i) comparisons of earlier GMM-HMM and the newer DNN Models, (ii) effectiveness of standard adaptation techniques versus transfer learning, (iii) various adaptation configurations in tackling the variabilities present in children speech, in terms of (a) acoustic spectral variability, and (b) pronunciation variability and linguistic constraints. Our Analysis spans over (i) number of DNN model parameters (for adaptation), (ii) amount of adaptation data, (iii) ages of children, (iv) age dependent-independent adaptation. Finally, we provide Recommendations on (i) the favorable strategies over various aforementioned - analyzed parameters, and (ii) potential future research directions and relevant challenges/problems persisting in DNN based ASR for children’s speech. |
|
ISCA Award for the Best Paper published in Computer Speech and Language (2020-2024)Turn-taking in Conversational Systems and Human-Robot Interaction: A Review Gabriel Skantze, Computer Speech & Language, Computer Speech and Language, Volume 67, 2021 [link]
|