Odyssey 2012 - The Speaker and Language Recognition Workshop

Singapore
June 25-28, 2012


Bibliographic Reference

[Odyssey-2012] Odyssey 2012 - The Speaker and Language Recognition Workshop, Singapore, June 25-28, 2012; ed. by Haizhou Li, Bin Ma, and Kong Aik Lee; ISBN 978-981-07-3093-2: ISCA Archive, http://www.isca-speech.org/archive/odyssey_2012

Introduction to the Workshop


Author Index and Quick Access to Abstracts

Names written in boldface refer to first authors, in CAPITAL letters to keynote and invited papers. Full papers can be accessed from the abstracts. Please note that each abstract opens in a separate window.

Agrawal   Alam   Alku   Ambikairajah   Aronowitz (122)   Aronowitz (312)   Becker   BenZeghiba   Bonastre (138)   Bonastre (157)   Bordel   Borgström (180)   Borgström (187)   Bousquet   Brümmer (216)   Brümmer (330)   BRÜMMER   Burget   Campbell   Carré   Černocký (216)   Černocký (330)   Chen   Cieri   Cumani (7)   Cumani (216)   Dean (28)   Dean (34)   Dehak, Najim (117)   Dehak, Najim (209)   Dehak, Réda   DENG   Diez   Doddington   Dumouchel (109)   Dumouchel (117)   Dumouchel (324)   Dunn   Enzinger   Ertaş   Ferrer (298)   Ferrer (317)   Galibert   Ganapathy (98)   Ganapathy (229)   Gannu   Garimella   Gauvain   Gfroerer   Giraudel   Glembek (216)   Glembek (330)   Graciarena   Graff   Greenberg   Hanilçi   Hansen (224)   Hansen (243)   Haris   Hasan   Hautamäki   He   Hermansky (92)   Hermansky (98)   Hermansky (229)   Hernando   Hurmalainen   Jardine   Joly   Jones   Kahn   Kajarekar   Kanagasundaram (28)   Kanagasundaram (34)   Karafiát (216)   Karafiát (330)   Katsouros   Kenny (1)   Kenny (109)   Kenny (117)   Kenny (256)   Kenny (324)   Khare   Kinnunen (236)   Kinnunen (268)   Kinnunen (304)   Laface   Lamel   Lapidot   Larcher (157)   Larcher (268)   Lee (268)   Lee (338)   van Leeuwen (55)   van Leeuwen (248)   Leppänen   Li, Haizhou (268)   Li, Haizhou (338)   Li, Zhi-Yi   Liu, Gang   Liu, Jia   Lleida   Luque   Ma (268)   Ma (338)   Madikeri   Mak   Mallidi   Mandasari   Martin (275)   MARTIN   Matějka (216)   Matějka (330)   Matrouf   McCree (180)   McCree (187)   McCree (209)   McLaren   Meignier   Morrison (62)   Morrison (78)   O'Shaughnessy   Ochoa   Panchapagesan   Paulik   Pelecanos   Penagarikano   Pešán   Plchot (157)   Plchot (216)   Plchot (317)   Plchot (330)   Pohjalainen   Pratt   Przybocki   Quatieri   Quintard   Rao   Reynolds (180)   Reynolds (209)   Richardson   Rodríguez-Fuentes   Rouvier   Saarinen   Saeidi (236)   Saeidi (248)   Saeidi (304)   Sankar   Sarikaya   Scheffer   Senoussaoui (109)   Senoussaoui (117)   Singer   Sinha   Solewicz (86)   Solewicz (122)   Soufifar   Sridharan (28)   Sridharan (34)   Stafylakis (109)   Stafylakis (324)   Stolcke   Strassel (202)   Strassel (291)   Sturim (180)   Sturim (209)   Thiruvaran   Thomas (98)   Thomas (229)   Toledo-Ronen   Torres-Carrasquillo   Vaquero   Varona   Vasilakakis   Villalba   de Villiers (216)   de Villiers (330)   Virtanen   Vogt (28)   Vogt (34)   Walker (202)   Walker (291)   Xu   Yaman   You   Zhang, Chi   Zhang, Cuiling   Zhang, Wei-Qiang  


Table of Contents and Access to Abstracts

Plenary Session

Brümmer, Niko: "The role of proper scoring rules in training and evaluating probabilistic speaker and language recognizers" (abstract).

Deng, Li: "Being deep and being dynamic - new-generation models and methodology for advancing speech technology" (abstract).

Martin, Alvin: "The NIST speaker recognition evaluations" (abstract).

Speaker Recognition – Compact Representation

Kenny, Patrick: "A small footprint i-vector extractor", 1-6.

Cumani, Sandro / Laface, Pietro / Vasilakakis, Vasileios: "Memory and computation effective approaches for i–vector extraction", 7-13.

Madikeri, Srikanth: "A hybrid factor analysis and probabilistic PCA-based system for dictionary learning and encoding for robust speaker recognition", 14-20.

Haris, B. C. / Sinha, R.: "On exploring the similarity and fusion of i-vector and sparse representation based speaker verification systems", 21-27.

Speaker Recognition – Generative Modeling

Kanagasundaram, Ahilan / Vogt, Robbie / Dean, David / Sridharan, Sridha: "PLDA based speaker recognition on short utterances", 28-33.

Kanagasundaram, Ahilan / Dean, David / Sridharan, Sridha / Vogt, Robbie: "PLDA based speaker verification with weighted LDA techniques", 34-38.

Vaquero, Carlos: "Dataset shift in PLDA based speaker verification", 39-46.

Villalba, Jesús / Lleida, Eduardo: "Bayesian adaptation of PLDA based speaker recognition to domains with scarce development data", 47-54.

McLaren, Mitchell / Mandasari, Miranti Indar / Leeuwen, David A. van: "Source normalization for language-independent speaker recognition using i-vectors", 55-61.

Forensic Speaker Recognition

Morrison, Geoffrey Stewart / Ochoa, Felipe / Thiruvaran, Tharmarajah: "Database selection for forensic voice comparison", 62-77.

Enzinger, Ewald / Zhang, Cuiling / Morrison, Geoffrey Stewart: "Voice source features for forensic voice comparison - an evaluation of the GLOTTEX software package", 78-85.

Solewicz, Yosef A. / Becker, Timo / Jardine, Gaëlle / Gfroerer, Stefan: "Comparison of speaker recognition systems on a real forensic benchmark", 86-91.

Neural Network for Speaker Recognition

Garimella, Sri / Hermansky, Hynek: "Factor analysis of mixture of auto-associative neural networks for speaker verification", 92-97.

Thomas, Samuel / Mallidi, Sri Harish / Ganapathy, Sriram / Hermansky, Hynek: "Adaptation transforms of auto-associative neural networks as features for speaker verification", 98-104.

Yaman, Sibel / Pelecanos, Jason / Sarikaya, Ruhi: "Bottleneck features for speaker recognition", 105-108.

Stafylakis, Themos / Kenny, Patrick / Senoussaoui, Mohammed / Dumouchel, Pierre: "Preliminary investigation of Boltzmann machine classifiers for speaker recognition", 109-116.

Senoussaoui, Mohammed / Dehak, Najim / Kenny, Patrick / Dehak, Réda / Dumouchel, Pierre: "First attempt of boltzmann machines for speaker verification", 117-121.

Speaker Diarization

Aronowitz, Hagai / Solewicz, Yosef A. / Toledo-Ronen, Orith: "Online two speaker diarization", 122-129.

Luque, Jordi / Hernando, Javier: "On the use of agglomerative and spectral clustering in speaker diarization of meetings", 130-137.

Lapidot, Itshak / Bonastre, Jean-François: "Generalized Viterbi-based models for time-series segmentation applied to speaker diarization", 138-145.

Rouvier, Mickael / Meignier, Sylvain: "A global optimization framework for speaker diarization", 146-150.

Kajarekar, Sashin / Khare, Aparna / Paulik, Matthias / Agrawal, Neha / Panchapagesan, Panchi / Sankar, Ananth / Gannu, Satish: "Cisco's speaker segmentation and recognition system", 151-156.

Speaker Recognition – Channel Robustness

Bousquet, Pierre-Michel / Larcher, Anthony / Matrouf, Driss / Bonastre, Jean-François / Plchot, Oldřich: "Variance-spectra based normalization for i-vector standard and probabilistic linear discriminant analysis", 157-164.

Rao, Wei / Mak, Man-Wai: "Utterance partitioning with acoustic vector resampling for i-vector based speaker verification", 165-171.

Chen, Sheng / Xu, Mingxing / Pratt, Emlyn: "Study on the effects of intrinsic variation using i-vectors in text-independent speaker verification", 172-179.

Campbell, William M. / Sturim, Doug / Borgström, Bengt Jonas / Dunn, Robert / McCree, Alan / Quatieri, Thomas F. / Reynolds, Douglas A.: "Exploring the impact of advanced front-end processing on NIST speaker recognition microphone tasks", 180-186.

Borgström, Bengt Jonas / McCree, Alan: "Linear prediction modulation filtering for speaker recognition of reverberant speech", 187-193.

Language Recognition Evaluation

Rodríguez-Fuentes, Luis Javier / Varona, Amparo / Diez, Mireia / Penagarikano, Mikel / Bordel, Germán: "Evaluation of spoken language recognition technology using broadcast speech: performance and challenges", 194-201.

Strassel, Stephanie / Walker, Kevin / Jones, Karen / Graff, Dave / Cieri, Christopher: "New resources for recognition of confusable linguistic varieties: the LRE11 corpus", 202-208.

Singer, Elliot / Torres-Carrasquillo, Pedro / Reynolds, Douglas A. / McCree, Alan / Richardson, Fred / Dehak, Najim / Sturim, Doug: "The MITLL NIST LRE 2011 language recognition system", 209-215.

Brümmer, Niko / Cumani, Sandro / Glembek, Ondřej / Karafiát, Martin / Matějka, Pavel / Pešán, Jan / Plchot, Oldřich / Soufifar, Mehdi / Villiers, Edward de / Černocký, Jan "Honza": "Description and analysis of the Brno276 system for LRE2011", 216-223.

Liu, Gang / Zhang, Chi / Hansen, John H. L.: "A linguistic data acquisition front-end for language recognition evaluation", 224-228.

Features for Speaker Recognition

Ganapathy, Sriram / Thomas, Samuel / Hermansky, Hynek: "Feature extraction using 2-d autoregressive models for speaker recognition", 229-235.

Hanilçi, Cemal / Kinnunen, Tomi / Saeidi, Rahim / Pohjalainen, Jouni / Alku, Paavo / Ertaş, Figen: "Regularization of all-pole models for speaker verification under additive noise", 236-242.

Hasan, Taufiq / Hansen, John H. L.: "Factor analysis of acoustic features using a mixture of probabilistic principal component analyzers for robust speaker verification", 243-247.

Saeidi, Rahim / Hurmalainen, Antti / Virtanen, Tuomas / Leeuwen, David A. van: "Exemplar-based sparse representation and sparse discrimination for noise robust speaker identification", 248-255.

Alam, Md Jahangir / Kenny, Patrick / O'Shaughnessy, Douglas: "On the use of asymmetric-shaped tapers for speaker verification using i-vectors", 256-262.

Speaker Recognition Evaluation

Doddington, George: "The effect of target/non-target age difference on speaker recognition performance", 263-267.

Hautamäki, Ville / Lee, Kong Aik / Larcher, Anthony / Kinnunen, Tomi / Ma, Bin / Li, Haizhou: "Variational Bayes logistic regression as regularized fusion for NIST SRE 2010", 268-274.

Greenberg, Craig / Martin, Alvin / Przybocki, Mark: "The 2011 BEST speaker recognition interim assessment", 275-282.

Kahn, Juliette / Galibert, Olivier / Carré, Matthieu / Giraudel, Aude / Joly, Philippe / Quintard, Ludovic: "The REPERE challenge: finding people in a multimodal context", 283-290.

Walker, Kevin / Strassel, Stephanie: "The RATS radio traffic collection system", 291-297.

Speaker Recognition – Application

Stolcke, Andreas / Graciarena, Martin / Ferrer, Luciana: "Effects of audio and ASR quality on cepstral and high-level speaker verification systems", 298-303.

Kinnunen, Tomi / Saeidi, Rahim / Leppänen, Jussi / Saarinen, Jukka P.: "Audio context recognition in variable mobile environments from short segments using speaker and language recognizers", 304-311.

Aronowitz, Hagai: "Text dependent speaker verification using a small development set", 312-316.

Ferrer, Luciana / Burget, Lukas / Plchot, Oldřich / Scheffer, Nicolas: "A unified approach for audio characterization and its application to speaker recognition", 317-323.

Stafylakis, Themos / Katsouros, Vassilis / Kenny, Patrick / Dumouchel, Pierre: "Mean shift algorithm for exponential families with applications to speaker clustering", 324-329.

Language Recognition – Feature, Classifier and Fusion

Plchot, Oldřich / Karafiát, Martin / Brümmer, Niko / Glembek, Ondřej / Matějka, Pavel / Villiers, Edward de / Černocký, Jan "Honza": "Speaker vectors from subspace Gaussian mixture model as complementary features for language identification", 330-333.

Li, Zhi-Yi / Zhang, Wei-Qiang / He, Liang / Liu, Jia: "Complementary combination in i-vector level for language recognition", 334-337.

You, Chang Huai / Li, Haizhou / Ambikairajah, Eliathamby / Lee, Kong Aik / Ma, Bin: "Bhattacharyya-based GMM-SVM system with adaptive relevance factor for pair language recognition", 338-345.

BenZeghiba, Mohamed Faouzi / Gauvain, Jean-Luc / Lamel, Lori: "Fusing language information from diverse data sources for phonotactic language recognition", 346-352.