ISCA - International Speech Communication Association
ISCA Archive
Back to list of job offers
We offer a 3-year position (starting, 01.07.2026, fulltime) to a speech scientist or engineer within the Transregional Collaborative Research Center (TRR 318) “Constructing Explainability,” which is jointly run by the Universities of Paderborn and Bielefeld. The TRR investigates how algorithmic transparency can be promoted, particularly in the context of black-box models as used in modern artificial intelligence systems. The position can be used for further academic qualification.
Our subproject deals with explanatory strategies for recognizing stress in clinical explanatory situations based on multimodal signals (facial expressions and voice). We are developing tools that help clinical staff to better recognize the presence of stress in neurodiverse and neurotypical populations.
For more details, and a link to the application process, see https://jobs.uni-bielefeld.de/job/view/4874/research-position-m-f-d-in-the-sfb-trr-318-in-the-field-of-phonetics?page_lang=en
The Phonetics group at Lancaster University, UK is looking to appoint a Senior Research Associate (i.e. postdoc) in Machine Learning for Speech Processing. The position is available from 1 July 2026 for 18 months.
The goal is to recover vocal tract movements from the acoustic signal. We are developing ways to integrate physical knowledge into the models, so the inversions are not just accurate but also reveal underlying principles of speech production.
More information
4-year fully PhD funded position, supervised by Prof. Naomi Harte in School of Engineering, Trinity College Dublin, Ireland. Research will explore how multimodal cues used in speech-based interaction can be used to track an active speaker in conversations that go beyond controlled, 2-person scenarios. Full details, including how to apply are here:
https://www.adaptcentre.ie/careers/phd-studentship-speaker-tracking-in-complex-conversations/
The University of Granada invites applications for a 3.5-year fully funded postdoctoral position in the area of Artificial Intelligence and Neuroscience, within the research project “Boosting AI-driven multimodal decoding of Speech and Language from Brain Signals (brAIn2lang)”.
AI, Deep Learning, NeuroAI, Speech and Language Processing, EEG/sEEG, Brain–Computer Interfaces.
The successful candidate will develop advanced deep learning methods to decode speech and language from brain signals, using non-invasive (EEG) and invasive (sEEG) data. The position is highly interdisciplinary, combining machine learning, neuroscience, and language technologies, and will be carried out at the University of Granada in collaboration with national and international partners.
Development of deep learning models for neural decoding of speech and language.
Analysis and modeling of neural representations from EEG and sEEG data.
Interdisciplinary collaboration between AI and neuroscience.
Publication of results in peer-reviewed journals and presentation at international conferences.
Contribution to scientific dissemination and communication activities.
Applicants must hold a PhD in Computer Science, Artificial Intelligence, Computational Neuroscience, Biomedical Engineering, or a related field.
Strong background in deep learning, experience or interest in neuroimaging data, and familiarity with speech or language technologies are highly desirable.
Duration: 3.5 years (full time)
Start date: Approximately March 2026
Gross salary: ~€2,300 per month
Location: Granada, Spain
Reference: 1768
Applications must be submitted through the official University of Granada platform.
Deadline: 30 January
Application website: https://investigacion.ugr.es/recursos-humanos/personal/contratos
For informal inquiries: Jose A. Gonzalez-Lopez joseangl@ugr.es
The speech group at Technische Hochschule Nürnberg is looking for a Postdoc to work on speech modeling, recognition and/or understanding in the context of atypical or pathological speech. Position part of a large-scale DFG project funded 2026-2031 and will join a team of 10 researchers (and the rest of the speech group ;-) working on foundations and applications of speech technology in health sciences.
Learn more under https://ki-zentrum.bayern or https://karriere.service.th-nuernberg.de/jobposting/65b80c74e56bf35abccf9f72ec3bcfb4eafab5960?ref=homepage
English language is requires, German helpful!
The speech recognition group at Aalto University, Finland, focuses on new machine learning methods in automatic speech recognition and language modelling. We are now looking for a postdoc for 2-3 years to start as soon as possible. The position requires a relevant doctoral degree in CS or EE and skills for doing excellent research in an (English-speaking) group. Research experience in second language learners' speech recognition and assessment is a merit. The candidate is expected to participate in our research projects and supervision of excellent MSc and PhD students. The main focus is in a new DALAI project to develop AI-based tools by which immigrants can improve their spoken language skills. The application, CV, list of publications, references and requests for further information should be sent by email to Prof. Mikko Kurimo (mikko.kurimo at aalto.fi). The DL for applications is November 30, but they will be processed as soon as they come, and recruitment may happen before the DL. We operate in a well-connected academic environment using excellent GPU and CPU computing facilities (including access to Europe’s fastest supercomputer LUMI) and well equipped office space at Aalto University Otaniemi campus that is only 10 minutes subway connection away from downtown Helsinki. In addition to a competitive salary, the contract includes occupational health benefits, and Finland has a comprehensive social security system. The Helsinki Metropolitan area forms a world-class information technology hub, attracting leading scientists and researchers in various fields of ICT and related disciplines. As a living and working environment, Finland consistently ranks high in quality of life, and Helsinki, the capital of Finland, is regularly ranked as one of the most livable cities in the world. See more at https://finland.fi
Applications are invited for a postdoctoral research scientist position in the Spoken Language Processing group at the Institute of Phonetics and Speech Processing, LMU Munich (https://www.phonetik.uni-muenchen.de/). The role of the postdoctoral researcher is to conduct research on the application of speech technology to deepening our understanding of the principle underpinning human speech and language processing.
The candidate should hold a PhD in computer science, computational linguistics, or a related field. The degree must be completed prior to the starting date. Demonstrated experience with speech technology applications is essential, including but not limited to any of: training and fine-tuning of self-supervised speech models; analysis of self-supervised speech representations; text-to-speech synthesis and voice cloning; and acoustic or articulatory signal processing.
Evidence of good programming skills, as well as background in statistics, mathematics, and/or machine learning, is essential. However, the successful candidate will have a strong interest a scientific approach to the study of speech and language, not simply engineering applications. Good spoken and written command of English is required. Salary is determined according to the German pay scale for state employees (Tarifvertrag der Länder TV-L West), group E13. The specific level within E13 depends on prior experience. Positions are for 2 or 3 years with possibility of extension.
To apply, please send your application including the documents below to Prof. Dr. James Kirby (jkirby@phonetik.uni-munchen.de). All supporting documents for the application should be emailed electronically as pdf files. Application should include:
Applications will be reviewed as received. The position is open until filled. LMU is an equal opportunity employer and is committed to increasing the diversity of its staff. Part-time employment is possible. We strongly encourage applications from qualified women. Disabled applicants will be preferred if essentially equally qualified.
For question or further information please contact Prof. Dr. James Kirby (jkirby@phonetik.uni-munchen.de).
The Signal Processing and Speech Communication Laboratory (www.spsc.tugraz.at) of Graz University of Technology (TU Graz, Austria) invites applications for a
Research and Teaching Associate– PhD Position – in Signal Processing and Speech Communication
with appointments planned for November 2025.
The associate is expected to perform excellent research towards a PhD degree (in cooperation with international partners) under the guidance of professors Gernot Kubin and Barbara Schuppler. Furthermore, the associate will co-advise Bachelor’s and Master’s student projects and develop and teach laboratory and problem classes on various aspects of signal processing. Fluency in English is a must, knowledge of German is an asset. A strong background in signal processing and/or speech communication (e.g., Speech LLMs) as well as an excellent Master’s degree in Electrical Engineering, Information Engineering, or similar are required. Entry-level gross yearly salaries are about EUR 52,000 for 40 hours per week and initial contract durations may span up to 4 years.
The Signal Processing and Speech Communication Laboratory was the main organizer of INTERSPEECH 2019 in Graz (interspeech2019.org) and takes the lead in building a Graduate School of Speech Language and AI Technologies together with eight partners of the Unite! University Alliance (speechlanguageai.unite-university.eu/). TU Graz is the longest-established University of Technology in Austria where equal opportunity takes center stage.
Graz (www.tugraz.at/go/welcome-center) is the second largest city of Austria located in the south-eastern province of Styria and enjoys a vibrant student life with eight universities.
Applications are due by Sept 12, 2025, please consult our official job posting at www.euraxess.at/jobs/366642 and follow the instructions there to upload you application to the TU Graz job portal. Interviews with selected candidates are planned for Sep 29-30, 2025. For further information, please contact the two advisors Gernot Kubin at gernot.kubin@tugraz.at and Barbara Schuppler at b.schuppler@tugraz.at.
The Audio Security and Privacy Group at EURECOM, France has openings for 3 PhD candidates in speech deepfake detection and automatic speaker verification (ASV). If you have a Master's degree, an excellent academic track record, strong proficiency in English, also have expertise in computer science, machine learning, artificial intelligence, data science, speech processing, deepfake/spoofing detection, text-to-speech synthesis or voice conversion, and are keen on international collaboration, please consider applying.
Topics include, but are not limited to:
For these particular PhD positions, applications may undergo administration security checks in compliance with French law and regulations. Restrictions on certain nationalities may apply.
In the first instance, please send your CV by email to Nicholas Evans (evans@eurecom.fr) with the subject line 'PhD opportunities'.
Learn about EURECOM by visiting our website https://www.eurecom.fr and about other job opportunities at https://www.eurecom.fr/en/eurecom/job-opportunities/job-opportunities.
Large language models(LLMs) have demonstrated increasingly powerful capabilities for reasoning tasks, especially in text. The project aims to explore and advance these capabilities in reasoning across multiple data modalities, including but not limited to text, speech and audio. The integration of multiple modalities can lead to more robust and general systems capable of understading and reasoning about the world in a more human-like manner. The project will involve fine-tuning pre-trained models and developing self-supervised learning techniques to adapt LLMs for multimodal tasks.
Application deadline: 16 March 2025.
Apply here.
© Copyright 2024 - ISCA International Speech Communication Association - All right reserved.