Odyssey 2016

21-24 Jun 2016, Bilbao, Spain

Luis Javier Rodriguez-Fuentes and Eduardo Lleida

ISSN: 2312-2846  DOI: 10.21437/Odyssey.2016


Text-Dependent Speaker Verification


A Low-Power Text-Dependent Speaker Verification System with Narrow-Band Feature Pre-Selection and Weighted Dynamic Time Warping
Qing He, Gregory Wornell, Wei Ma

Deep Neural Network based Text-Dependent Speaker Verification: Preliminary Results
Gautam Bhattacharya, Patrick Kenny, Jahangir Alam, Themos Stafylakis

Uncertainty Modeling Without Subspace Methods For Text-Dependent Speaker Recognition
Patrick Kenny, Themos Stafylakis, Jahangir Alam, Vishwa Gupta, Marcel Kockmann

Deep Neural Networks and Hidden Markov Models in i-vector-based Text-Dependent Speaker Verification
Hossein Zeinali, Lukas Burget, Hossein Sameti, Ondrej Glembek, Oldrich Plchot


Speaker Recognition: i-vector approaches


Fast Scoring for PLDA with Uncertainty Propagation
Weiwei Lin, Man-Wai Mak

I-vector transformation and scaling for PLDA based speaker recognition
Sandro Cumani, Pietro Laface

Rapid Computation of I-vector
Longting Xu, Kong Aik Lee, Haizhou Li, Zhen Yang

Constrained discriminative speaker verification specific to normalized i-vectors
Pierre-Michel Bousquet, Jean-Francois Bonastre

Iterative Bayesian and MMSE-based noise compensation techniques for speaker recognition in the i-vector space
Waad Ben Kheder, Driss Matrouf, Moez Ajili, Jean-Francois Bonastre


Language Recognition


Between-Class Covariance Correction For Linear Discriminant Analysis in Language Recognition
Abhinav Misra, Qian Zhang, Finnian Kelly, John H.L. Hansen

Incorporating uncertainty as a Quality Measure in I-Vector Based Language Recognition
Amir Hossein Poorjam, Rahim Saeidi, Tomi Kinnunen, Ville Hautamäki

Discriminating Languages in a Probabilistic Latent Subspace
Aleksandr Sizov, Kong Aik Lee, Tomi Kinnunen

Investigation of Senone-based Long-Short Term Memory RNNs for Spoken Language Recognition
Yao Tian, Liang He, Yi Liu, Jia Liu

Automatic Accent Recognition Systems and the Effects of Data on Performance
Georgina Brown

The “Sprekend Nederland” project and its application to accent location
David van Leeuwen, Rosemary Orr

Deep Language: a comprehensive deep learning approach to end-to-end language recognition
Trung Ngo Trong, Ville Hautamäki, Kong Aik Lee

On the use of phone-gram units in recurrent neural networks for language identification
Christian Salamea, Luis Fernando D'Haro, Ricardo Cordoba, Rubén San-Segundo

Language Recognition for Dialects and Closely Related Languages
Gregory Gelly, Jean-Luc Gauvain, Lori Lamel, Antoine Laurent, Viet Bac Le, Abdel Messaoudi

Identification of British English regional accents using fusion of i-vector and multi-accent phonotactic systems
Maryam Najafian, Saeid Safavi, Phil Weber, Martin Russell

Improvements on Deep Bottleneck Network based I-Vector Representation for Spoken Language Identification
Yan Song, Ruilian Cui, Ian McLoughlin, Lirong Dai


Special Session 1: Speaker Recognition in Multimedia Content


Deep complementary features for speaker identification in TV broadcast data
Mateusz Budnik, Ali Khodabakhsh, Laurent Besacier, Cenk Demiroglu

First investigations on self trained speaker diarization
Gaël Le Lan, Sylvain Meignier, Delphine Charlet, Anthony Larcher

Soft VAD in Factor Analysis Based Speaker Segmentation of Broadcast News
Brecht Desplanques, Kris Demuynck, Jean-Pierre Martens


Speaker and Language Recognition Systems


BAT System Description for NIST LRE 2015
Oldrich Plchot, Pavel Matejka, Ondrej Glembek, Radek Fer, Ondrej Novotny, Jan Pesan, Lukas Burget, Niko Brummer, Sandro Cumani

The IBM 2016 Speaker Recognition System
Seyed Omid Sadjadi, Sriram Ganapathy, Jason Pelecanos

The Sheffield language recognition system in NIST LRE 2015
Raymond W. M. Ng, Mauro Nicolao, Oscar Saz, Madina Hasan, Bhusan Chettri, Mortaza Doulaty, Tan Lee, Thomas Hain

Analyzing the Effect of Channel Mismatch on the SRI Language Recognition Evaluation 2015 System
Mitchell McLaren, Diego Castán, Luciana Ferrer

The MITLL NIST LRE 2015 Language Recognition System
Pedro Torres-Carrasquillo, Najim Dehak, Elizabeth Godoy, Douglas Reynolds, Fred Richardson, Stephen Shum, Elliot Singer, Douglas Sturim


Speaker and Language Recognition: deep learning approaches


Augmented Data Training of Joint Acoustic/Phonotactic DNN i-vectors for NIST LRE15
Alan McCree, Greg Sell, Daniel Garcia-Romero

LID-senone Extraction via Deep Neural Networks for End-to-End Language Identification
Ma Jin, Yan Song, Ian McLoughlin, Lirong Dai, Zhongfu Ye

On autoencoders in the i-vector space for speaker recognition
Timur Pekhovsky, Sergey Novoselov, Aleksei Sholohov, Oleg Kudashev

Channel Compensation for Speaker Recognition using MAP Adapted PLDA and Denoising DNNs
Fred Richardson, Brian Nemsick, Douglas Reynolds

Evaluation of an LSTM-RNN System in Different NIST Language Recognition Frameworks
Ruben Zazo, Alicia Lozano-Diez, Joaquin Gonzalez-Rodriguez


Speaker Recognition I


Feature-based likelihood ratios for speaker recognition from linguistically-constrained formant-based i-vectors
Javier Franco-Pedroso, Joaquin Gonzalez-Rodriguez

Improving Robustness of Speaker Verification Against Mimicked Speech
Kuruvachan K George, Santhosh Kumar C, Ramachandran K I, Ashish Panda

Multi-channel i-vector combination for robust speaker verification in multi-room domestic environments
Alessio Brutti, Alberto Abad

Voice Liveness Detection for Speaker Verification based on a Tandem Single/Double-channel Pop Noise Detector
Sayaka Shiota, Fernando Villavicencio, Junichi Yamagishi, Nobutaka Ono, Isao Echizen, Tomoko Matsui

A PLDA Approach for Language and Text Independent Speaker Recognition
Abbas Khosravani, Mohammad Mehdi Homayounpour, Dijana Petrovska-Delacrétaz, Gérard Chollet

Spoofing Detection on the ASVspoof2015 Challenge Corpus Employing Deep Neural Networks
Md Jahangir Alam, Patrick Kenny, Vishwa Gupta, Themos Stafylakis

Age-Related Voice Disguise and its Impact on Speaker Verification Accuracy
Rosa González Hautamäki, Md Sahidullah, Tomi Kinnunen, Ville Hautamäki

A New Feature for Automatic Speaker Verification Anti-Spoofing: Constant Q Cepstral Coefficients
Massimiliano Todisco, Héctor Delgado, Nicholas Evans

Multi-Bit Allocation: Preparing Voice Biometrics for Template Protection
Marco Paulini, Christian Rathgeb, Andreas Nautsch, Hermine Reichau, Herbert Reininger, Christoph Busch


Special Session 2: NIST 2015 Language Recognition i-Vector Machine Learning Challenge


Summary of the 2015 NIST Language Recognition i-Vector Machine Learning Challenge
Audrey Tong, Craig Greenberg, Alvin Martin, Desire Banse, John Howard, Hui Zhao, George Doddington, Daniel Garcia-Romero, Alan McCree, Douglas Reynolds, Elliot Singer, Jaime Hernandez-Cordero, Lisa Mason

Out-of-Set i-Vector Selection for Open-set Language Identification
Hamid Behravan, Tomi Kinnunen, Ville Hautamäki

I2R Submission to the 2015 NIST Language Recognition I-vector Challenge
Hanwu Sun, Trung Hieu Nguyen, Guangsen Wang, Kong Aik Lee, Bin Ma, Haizhou Li

A Semisupervised Approach for Language Identification based on Ladder Networks
Ehud Ben-Reuven, Jacob Goldberger


Speaker Recognition II


Cantonese forensic voice comparison with higher-level features: likelihood ratio-based validation using F-pattern and tonal F0 trajectories over a disyllabic hexaphone
Phil Rose, Bruce Xiao Wang

I-Vectors for speech activity detection
Elie Khoury, Matt Garland

Compensation for phonetic nuisance variability in speaker recognition using DNNs
Themos Stafylakis, Patrick Kenny, Vishwa Gupta, Jahangir Alam, Marcel Kockmann

Local binary patterns as features for speaker recognition
Waad Ben Kheder, Driss Matrouf, Moez Ajili, Jean-Francois Bonastre

Analysis and Optimization of Bottleneck Features for Speaker Recognition
Alicia Lozano-Diez, Anna Silnova, Pavel Matejka, Ondrej Glembek, Oldrich Plchot, Jan Pesan, Lukas Burget, Joaquin Gonzalez-Rodriguez

Robustness of Quality-based Score Calibration of Speaker Recognition Systems with respect to low-SNR and short-duration conditions
Andreas Nautsch, Rahim Saeidi, Christian Rathgeb, Christoph Busch

From Features to Speaker Vectors by means of Restricted Boltzmann Machine Adaptation
Pooyan Safari, Omid Ghahabi, Javier Hernando

Reducing Noise Bias in the i-Vector Space for Speaker Recognition
Yosef Solewicz, Hagai Aronowitz, Timo Becker


Speaker Clustering and Diarization


Semi-supervised On-line Speaker Diarization for Meeting Data with Incremental Maximum A-posteriori Adaptation
Giovanni Soldi, Massimiliano Todisco, Héctor Delgado, Christophe Beaugeant, Nicholas Evans

Influence of transition cost in the segmentation stage of speaker diarization
Beatriz Martínez-González, José M. Pardo, Rubén San-Segundo, J.M. Montero

Analysis of the Impact of the Audio Database Characteristics in the Accuracy of a Speaker Clustering System
Jesús Jorrín Prieto, Carlos Vaquero, Paola García

Short- and Long-Term Speech Features for Hybrid HMM-i-Vector based Speaker Diarization System
Abraham Woubie Zewoudie, Jordi Luque, Javier Hernando

On the Use of PLDA i-vector Scoring for Clustering Short Segments
Itay Salmun, Irit Opher, Itshak Lapidot