SAPA-SCALE Conference 2012

Portland, OR, USA
September 7-8, 2012



Bibliographic Reference

[SAPA_SCALE-2012] SAPA-SCALE Conference 2012, Portland, OR, USA, September 7-8, 2012; ISCA Archive, http://www.isca-speech.org/archive/sapa_2012


Author Index

Asaei   Barras   Basha Shaik   ten Bosch   Bourlard (52)   Bourlard (74)   Boves   Cabral   Carson-Berndsen   Cevher   Cranen   Dighe   Do   Drepper   Driesen   Ellis   Faubel (68)   Faubel (86)   Gemmeke   Ghoshal   Goto   Hahn   Heckmann   Huang   Kawahara   King   Klakow (68)   Klakow (86)   Krishnan   Lu   Magimai-Doss (52)   Magimai-Doss (68)   McDermott   Mirbagheri   Moore   Nakano   Ney   Ngouoko M   Nicolao   Ogbureke   Oualil   Raj (74)   Raj (110)   Renals   Rybach   Sahni   Sakaue   Schlüter (34)   Schlüter (46)   Seelamantula   Shamma   Singh   Soldo   Toroghi   Tüske   Valente   Valentini-Botinhao   Van hamme   Vijayasenan   VIRTANEN   Wrede   Xu   Yamagishi   Yoshioka  

Names in CAPITAL letters refer to keynote papers. A number in parentheses is the first page of the paper, given where an author appears on more than one paper.



Table of Contents

Keynote Paper

Virtanen, Tuomas: "Human sound perception - what can we learn from it when developing audio analysis algorithms?"

Contributed Papers

Mirbagheri, Majid / Xu, Yanbo / Shamma, Shihab: "Pitch estimation using mutual information", 1-4.

Nicolao, Mauro / Moore, Roger K.: "Establishing some principles of human speech production through two-dimensional computational models", 5-10.

Nakano, Tomoyasu / Goto, Masataka: "A spectral envelope estimation method based on F0-adaptive multi-frame integration analysis", 11-16.

Do, Cong-Thanh / Barras, Claude: "Cochlear implant-like processing of speech signal for speaker verification", 17-21.

Valentini-Botinhao, Cassia / Yamagishi, Junichi / King, Simon: "Evaluating speech intelligibility enhancement for HMM-based synthetic speech in noise", 22-27.

Krishnan, Sunder Ram / Seelamantula, Chandra Sekhar: "A generalized Stein’s estimation approach for speech enhancement based on perceptual criteria", 28-33.

Tüske, Zoltán / Drepper, Friedhelm R. / Schlüter, Ralf: "Non-stationary signal processing and its application in speech recognition", 34-39.

Lu, Liang / Ghoshal, Arnab / Renals, Steve: "Joint uncertainty decoding with unscented transform for noise robust subspace Gaussian mixture models", 40-45.

Basha Shaik, M. Ali / Rybach, David / Hahn, Stefan / Schlüter, Ralf / Ney, Hermann: "Hierarchical hybrid language models for open vocabulary continuous speech recognition using WFST", 46-51.

Soldo, Serena / Magimai-Doss, Mathew / Bourlard, Hervé: "Template-based ASR using posterior features and synthetic references: comparing different TTS systems", 52-57.

Ogbureke, Kalu U. / Cabral, João P. / Carson-Berndsen, Julie: "Explicit duration modelling in HMM-based speech synthesis using a hybrid hidden Markov model-multilayer perceptron", 58-63.

Vijayasenan, Deepu / Valente, Fabio: "Dimensionality reduction of large TDOA vectors for speaker diarization", 64-67.

Oualil, Youssef / Magimai-Doss, Mathew / Faubel, Friedrich / Klakow, Dietrich: "Joint detection and localization of multiple speakers using a probabilistic interpretation of the steered response power", 68-73.

Asaei, Afsaneh / Raj, Bhiksha / Bourlard, Hervé / Cevher, Volkan: "Structured sparse coding for microphone array location calibration", 74-79.

Yoshioka, Takuya / Sakaue, Daichi: "Log-normal matrix factorization with application to speech-music separation", 80-85.

Toroghi, Rahil Mahdian / Faubel, Friedrich / Klakow, Dietrich: "Multi-channel speech separation with soft time-frequency masking", 86-91.

Huang, Heyun / Bosch, Louis ten / Cranen, Bert / Boves, Lou: "Smoothing speech trajectories by regularization", 92-97.

Driesen, Joris / Gemmeke, Jort F. / Van hamme, Hugo: "Data-driven speech representations for NMF-based word learning", 98-103.

Ngouoko M, Samuel K. / Heckmann, Martin / Wrede, Britta: "Spectro-temporal features with distribution equalization", 104-109.

Sahni, Kamal / Dighe, Pranay / Singh, Rita / Raj, Bhiksha: "Language identification using spectro-temporal patch features", 110-113.

McDermott, Josh H. / Ellis, Daniel P. W. / Kawahara, Hideki: "Inharmonic speech: a tool for the study of speech perception and separation", 114-117.