doi: 10.21437/IberSPEECH.2022
Discrete Acoustic Space for an Efficient Sampling in Neural Text-To-Speech
Marek Strelec, Jonas Rohnke, Antonio Bonafonte, Mateusz Lajszczak, Trevor Wood
An animated realistic head with vocal tract for the finite element simulation of vowel /a/
Marc Arnela, Leonardo Pereira-Vivas, Jorge Egea
Exploring the limits of neural voice cloning: A case study on two well-known personalities
Ander González-Docasal, Aitor Álvarez, Haritz Arzelus
Analysis of iterative adaptive and quasi closed phase inverse filtering techniques on OPENGLOT synthetic vowels
Marc Freixes, Joan Claudi Socoró, Francesc Alías
Galician’s Language Technologies in the Digital Age
José Manuel Ramírez Sánchez, Laura Docio-Fernandez, Carmen Garcia Mateo
Contextual-Utterance Training for Automatic Speech Recognition
Alejandro Gomez-Alanis, Lukas Drude, Andreas Schwarz, Rupak Vignesh Swaminathan, Simon Wiesler
Phone classification using electromyographic signals
Eder Del Blanco, Inge Salomons, Eva Navas, Inma Hernáez
Semisupervised training of a fully bilingual ASR system for Basque and Spanish
Mikel Penagarikano, Amparo Varona, German Bordel, Luis J. Rodriguez-Fuentes
Speaker-Adapted End-to-End Visual Speech Recognition for Continuous Spanish
David Gimeno-Gomez, Carlos David Martinez Hinarejos
Iterative pseudo-forced alignment by acoustic CTC loss for self-supervised ASR domain adaptation
Fernando López, Jordi Luque
On the potential of jointly-optimised solutions to spoofing attack detection and automatic speaker verification
Wanying Ge, Hemlata Tak, Massimiliano Todisco, Nicholas Evans
A Study on the Use of wav2vec Representations for Multiclass Audio Segmentation
Pablo Gimeno, Alfonso Ortega, Antonio Miguel, Eduardo Lleida
Respiratory Sound Classification Using an Attention LSTM Model with Mixup Data Augmentation
Noelia Salor-Burdalo, Ascension Gallardo-Antolin
The Vicomtech Spoofing-Aware Biometric System for the SASV Challenge
Juan Manuel Martín-Doñas, Iván González Torre, Aitor Álvarez, Joaquin Arellano
VoxCeleb-PT – a dataset for a speech processing course
John Mendonca, Isabel Trancoso
Cross-Corpus Speech Emotion Recognition with HuBERT Self-Supervised Representation
Miguel Pastor, Dayana Ribas, Alfonso Ortega, Antonio Miguel, Eduardo Lleida
Analysis of Trustworthiness Recognition models from an aural and emotional perspective
Cristina Luna Jiménez, Ricardo Kleinlein, Syaheerah Lebai Lutfi, Juan M. Montero, Fernando Fernández-Martínez
Speech and Text Processing for Major Depressive Disorder Detection
Edward L. Campbell, Laura Docío Fernández, Nicholas Cummins, Carmen García Mateo
Bridging the Semantic Gap with Affective Acoustic Scene Analysis: an Information Retrieval-based Approach
Clara Luis-Mingueza, Esther Rituerto-González, Carmen Peláez-Moreno
Detecting Gender-based Violence aftereffects from Emotional Speech Paralinguistic Features
Emma Reyner Fuentes, Esther Rituerto González, Clara Luis Mingueza, Carmen Peláez Moreno, Celia López Ongil
Extraction of structural and semantic features for the identification of Psychosis in European Portuguese
Rodrigo Sousa, Helena Sofia Pinto, Alberto Abad, Daniel Neto, Joaquim Gago
An Attentional Extractive Summarization Framework
José Ángel González, Encarna Segarra, Fernando García-Granada, Emilio Sanchis, Lluis-F Hurtado
SUMBot: Summarizing Context in Open-Domain Dialogue Systems
Rui Ribeiro, Luísa Coheur
Automatic Detection of Inconsistencies in Open-Domain Chatbots
Jorge Mira Prats, Marcos Estecha-Garitagoitia, Mario Rodríguez-Cantelar, Luis Fernando D’Haro
Ethics Guidelines for the Development of Virtual Assistants for e-Health
Andrés Piñeiro Martín, Carmen García Mateo, Laura Docío Fernández, María del Carmen López Pérez
esCorpius: A Massive Spanish Crawling Corpus
Asier Gutiérrez-Fandiño, David Pérez-Fernández, Jordi Armengol-Estapé, David Griol, Zoraida Callejas
An Experimental Study on Light Speech Features for Small-Footprint Keyword Spotting
Iván López-Espejo, Zheng-Hua Tan, Jesper Jensen
S3prl-Disorder: Open-Source Voice Disorder Detection System based in the Framework of S3PRL-toolkit
Dayana Ribas, Miguel Angel Pastor Yoldi, Antonio Miguel, David Martínez, Alfonso Ortega, Eduardo Lleida
Active Learning Improves the Teacher’s Experience: A Case Study in a Language Grounding Scenario
Filipe Reynaud, Eugénio Ribeiro, David Martins de Matos
The role of window length and shift in complex-domain DNN-based speech enhancement
Celia García-Ruiz, Angel M. Gomez, Juan M. Martín-Doñas
Neural Detection of Cross-lingual Syntactic Knowledge
Yongjian Chen, Mireia Farrús
Efficient Transformers for End-to-End Neural Speaker Diarization
Sergio Izquierdo del Alamo, Beltrán Labrador, Alicia Lozano-Diez, Doroteo T. Toledano
CORAA NURC-SP Minimal Corpus: a manually annotated corpus of Brazilian Portuguese spontaneous speech
Vinícius G. Santos, Caroline Adriane Alves, Bruno Baldissera Carlotto, Bruno Angelo Papa Dias, Lucas Rafael Stefanel Gris, Renan de Lima Izaias, Maria Luiza Azevedo de Morais, Paula Marin de Oliveira, Rafael Sicoli, Flaviane Romani Fernandes Svartman, Marli Quadros Leite, Sandra Maria Aluísio
Speaker Characterization by means of Attention Pooling
Federico Costa, Miquel India, Javier Hernando
Enhancing the Design of a Conversational Agent for an Ethical Interaction with Children
Marina Escobar Planas, Emilia Gómez, Carlos-D Martínez-Hinarejos
Sentiment Analysis in Portuguese Dialogues
Isabel Carvalho, Hugo Gonçalo Oliveira, Catarina Silva
On the application of conformers to logical access voice spoofing attack detection
Eros Rosello, Alejandro Gomez-Alanis, Manuel Chica, Angel M. Gomez, Jose A. Gonzalez, Antonio M. Peinado
Speech emotion recognition in Spanish TV Debates
Irune Zubiaga, Raquel Justo, M. Inés Torres, Mikel De Velasco
Assessing Transfer Learning and automatically annotated data in the development of Named Entity Recognizers for new domains
Emanuel Matos, Mário Rodrigues, António Teixeira
On the detection of acoustic events for public security: the challenges of the counter-terrorism domain
Anna Pompili, Tiago Luís, Nuno Monteiro, João Miranda, Carlo Mendes, Sérgio Paulo
Database dependence comparison in detection of physical access voice spoofing attacks
Manuel Chica, Alejandro Gomez-Alanis, Eros Rosello, Angel M. Gomez, Jose A. Gonzalez, Antonio M. Peinado
Measuring trust at zero-acquaintance using acted-emotional videos
Cristina Luna Jiménez, Syaheerah Lebai Lutfi, Manuel Gil-Martín, Ricardo Kleinlein, Juan M. Montero, Fernando Fernández-Martínez
Representation and Metric Learning Advances for Deep Neural Network Face and Speaker Biometric Systems
Victoria Mingote, Antonio Miguel
Voice Biometric Systems based on Deep Neural Networks: A Ph.D. Thesis Overview
Alejandro Gomez-Alanis, Jose Andres Gonzalez-Lopez, Antonio Miguel Peinado Herreros
Online Multichannel Speech Enhancement combining Statistical Signal Processing and Deep Neural Networks: A Ph.D. Thesis Overview
Juan Manuel Martín-Doñas, Antonio M. Peinado, Angel M. Gomez
ReSSInt project: voice restoration using Silent Speech Interfaces
Inma Hernaez, Jose Andres Gonzalez Lopez, Eva Navas, Jose Luis Pérez Córdoba, Ibon Saratxaga, Gonzalo Olivares, Jon Sanchez de la Fuente, Alberto Galdón, Victor Garcia, Jesús del Castillo, Inge Salomons, Eder del Blanco Sierra
ELE Project: an overview of the desk research
Itziar Aldabe, Aritz Farwell, Eva Navas, Inma Hernaez, German Rigau
Snorble: An Interactive Children Companion
Mike Rizkalla, Thomas Chan, Emilio Granell, Chara Tsoukala, Aitor Carricondo, Carlos Bailon, María Teresa González, Vicent Alabau
Fusion of Classical Digital Signal Processing and Deep Learning methods (FTCAPPS)
Angel M. Gómez, Victoria E. Sanchez, Antonio M. Peinado, Juan M. Martín-Doñas, Alejandro Gómez-Alanis, Amelia Villegas-Morcillo, Eros Rosello, Manuel Chica, Celia García, Ivan López-Espejo
Spanish Lipreading in Realistic Scenarios: the LLEER project
Carlos David Martinez Hinarejos, David Gimeno-Gomez, Francisco Casacuberta, Emilio Granell, Roberto Paredes, Moisés Pastor, Enrique Vidal
Clinical Applications of Neuroscience: Locating Language Areas in Epileptic Patients and Restoring Speech in Paralyzed People
Jose Andres Gonzalez Lopez, Alberto Galdón, Gonzalo Olivares, Sneha Raman, David Murcia, Daniela Paolieri, Pedro Macizo, José L. Pérez-Córdoba, Antonio M. Peinado, Angel Gomez, Victoria E. Sanchez, Ana B. Chica
ORKESTA Comprehensive Solution for the Orchestration of Services and Soci-Sanitary Care at Home
Juan Alos, Julien Boullié, M. Inés Torres, Eneko Ruiz, Andoni Beristain, Jacobo López Fernández, Iñaki Tellería, Janeth Carolina Carreño, Iker Garay, Arkaitz Carbajo, Amaia Santamaría, Urtzi Zubiate, Jon Ander Arzallus, Francisco Martínez, Adriana Martínez
The CITA GO-ON trial: A person-centered, digital, intergenerational, and cost-effective dementia prevention multi-modal intervention model to guide strategic policies facing the demographic challenges of progressive aging
Mikel Tainta, Javier Mikel Olaso, M. Inés Torres, Mirian Ecay-Torres, Nekane Balluerka, Naia Ros, Mikel Izquierdo, Mikel Saéz de Asteasu, Usune Etxebarria, Lucía Gayoso, Maider Mateo, Oliver Ibarrondo, Elena Alberdi, Estíbaliz Capetillo-Zárate, Jesus Angel Bravo, Pablo Martínez-Lage
The BioVoz Project: Secure Speech Biometrics by Deep Processing Techniques
Antonio M. Peinado, Alejandro Gomez-Alanis, Jose Andres Gonzalez-Lopez, Angel M. Gomez, Eros Rosello, Manuel Chica-Villar, Jose C. Sanchez-Valera, Jose L. Perez-Cordoba, Victoria Sanchez
Automatic evaluation of the pronunciation of people with Down syndrome in an educational video game (EvaProDown)
César González-Ferreras, Valentín Cardeñoso-Payo, David Escudero-Mancebo, Carlos Enrique Vivaracho-Pascual, Lourdes Aguilar, Valle Flores-Lucas, Mario Corrales-Astorgano
SONOC Platform for Audio and Speech Analytics in Call Centers
Dayana Ribas, Antonio Miguel, Luis Guillen, Jose Javier Castejon, Juan Antonio Navarro, Alfonso Ortega, Luis Benavente
The Vicomtech-UPM Speech Transcription Systems for the Albayzín-RTVE 2022 Speech to Text Transcription Challenge
Haritz Arzelus, Iván G. Torres, Juan Manuel Martín-Doñas, Ander González-Docasal, Aitor Alvarez
TID Spanish ASR system for the Albayzin 2022 Speech-to-Text Transcription Challenge
Fernando López, Jordi Luque
BCN2BRNO: ASR System Fusion for Albayzin 2022 Speech to Text Challenge
Martin Kocour, Jahnavi Umesh, Martin Karafiat, Ján Švec, Fernando López, Jordi Luque, Karel Beneš, Mireia Diez, Igor Szoke, Karel Veselý, Lukáš Burget, Jan Černocký
Intelligent Voice Speaker Recognition and Diarization System for IberSpeech 2022 Albayzin Evaluations Speaker Diarization and Identity Assignment Challenge
Roman Shrestha, Cornelius Glackin, Julie Wall, Nigel Cannings
ViVoLAB System Description for the S2TC IberSPEECH-RTVE 2022 challenge
Antonio Miguel, Alfonso Ortega, Eduardo Lleida
GTTS Systems for the Albayzin 2022 Speech and Text Alignment Challenge
Germán Bordel, Luis Javier Rodriguez-Fuentes, Mikel Peñagarikano, Amparo Varona