Introduction to the Conference
[INTERSPEECH-2012] INTERSPEECH 2012, 13th Annual Conference of the International Speech Communication Association, Portland, OR, USA, September 9-13, 2012; ISSN 1990-9770; ISCA Archive, http://www.isca-speech.org/archive/interspeech_2012
Adaptation & Robust Modeling Adaptation for ASR
ASR: Bayesian Modeling ASR: Deep Neural Networks I, II ASR: Discriminative Training
ASR: Noise Robustness ASR: Robust Features I, II ASR: Robust Modeling
Audio Analysis, Estimation and Classification
Communication Disorders and Assistive Technologies
Computer Assisted Language Learning I, II Conversation and Interaction I, II
Degraded Speech and Enhancement Development of Speech Production and Perception
Dialog Systems Dynamic Decoding
Enhancement Enhancement and Coding
Hearing HMM Synthesis I, II Language and Accent Recognition
Language Learning and Cross-Language Production and Perception
Language Modeling Language Modeling: New Models and Features
Language Recognition Multi-Channel Speech Enhancement
Paralinguistics I-III Perception and Production
Perceptual Learning and Perceptual Cues to Segments and Tones
Phonetics and Phonology I, II Pitch and Harmonic Analysis Prosody I, II
Rich Transcription I, II Robust Speech Recognition I, II Search and Decoding
Single Channel Speech Enhancement Source Separation and Computational Auditory Scene Analysis
Sparse, Template-Based Representations Speaker Diarization Speaker Diarization and Age Recognition
Speaker Recognition I-III Speaker Verification
Speech Analysis Speech Analysis and Modeling Speech and Age Differences
Speech and Speaker Segmentation Speech Intelligibility in Quiet and in Noise
Speech Production: Imaging and Models Speech Synthesis
Speech Synthesis: Adaptation Speech Synthesis: Intelligibility Speech Synthesis: Prosody
Speech Synthesis: Selected Topics
Spoken Language Applications Spoken Language Understanding
Spoken Language Understanding and Dialog I, II Spoken Term and Unseen Word Detection
Voice Activity Detection Voice Conversion Voice Search and Spoken Document Retrieval I, II
Analysis of Spoken Disorders in Health Applications I, II
Glottal Source Processing: from Analysis to Applications
New Trends in Vowel Nasalization: The Articulation of Nasal Vowels
Prosodic Prominence: Annotation, Prediction, Applications
Speaker Trait Challenge I, II Speech and Language Technologies for STEM
Speech Tools and Systems Demo
Lee, Chin-Hui: "An information-extraction approach to speech analysis and processing", 1-5.
Dannenberg, Roger B.: "Music understanding and the future of music performance", 550.
Riley, Michael: "Weighted transducers in speech and language processing", 1347.
Lahvis, Garet: "Finding meaning in rodent ultrasonic vocalizations", 2129.
Yu, Dong / Deng, Li / Seide, Frank: "Large vocabulary speech recognition using deep tensor neural networks", 6-9.
Kingsbury, Brian / Sainath, Tara N. / Soltau, Hagen: "Scalable minimum Bayes risk training of deep neural network acoustic models using distributed Hessian-free optimization", 10-13.
Saon, George / Kingsbury, Brian: "Discriminative feature-space transforms using deep neural networks", 14-17.
Tüske, Zoltán / Sundermeyer, Martin / Schlüter, Ralf / Ney, Hermann: "Context-dependent MLPs for LVCSR: TANDEM, hybrid or both?", 18-21.
Maas, Andrew L. / Le, Quoc V. / O'Neil, Tyler M. / Vinyals, Oriol / Nguyen, Patrick / Ng, Andrew Y.: "Recurrent neural networks for noise reduction in robust ASR", 22-25.
Chen, Xie / Eversole, Adam / Li, Gang / Yu, Dong / Seide, Frank: "Pipelined back-propagation for context-dependent deep neural networks", 26-29.
Vinyals, Oriol / Deng, Li: "Are sparse representations rich enough for acoustic modeling?", 2570-2573.
Xiao, Yeming / Zhang, Zhen / Cai, Shang / Pan, Jielin / Yan, Yonghong: "An initial attempt on task-specific adaptation for deep neural network-based large vocabulary continuous speech recognition", 2574-2577.
Jaitly, Navdeep / Nguyen, Patrick / Senior, Andrew / Vanhoucke, Vincent: "Application of pretrained deep neural networks to large vocabulary speech recognition", 2578-2581.
Qian, Yanmin / Liu, Jia: "Cross-lingual and ensemble MLPs strategies for low-resource speech recognition", 2582-2585.
Vu, Ngoc Thang / Breiter, Wojtek / Metze, Florian / Schultz, Tanja: "Initialization schemes for multilayer perceptron training and their impact on ASR performance using multilingual data", 2586-2589.
Siniscalchi, Sabato Marco / Li, Jinyu / Lee, Chin-Hui: "Hermitian based hidden activation functions for adaptation of hybrid HMM/ANN models", 2590-2593.
Kubo, Yotaro / Hori, Takaaki / Nakamura, Atsushi: "Integrating deep neural networks into structural classification approach based on weighted finite-state transducers", 2594-2597.
Deng, Li / Hutchinson, Brian / Yu, Dong: "Parallel training for deep stacking networks", 2598-2601.
Qian, Yanmin / Liu, Jia: "Articulatory feature based multilingual MLPs for low-resource speech recognition", 2602-2605.
Fernandez Astudillo, Ramón / Abad, Alberto / Neto, João Paulo da Silva: "Uncertainty-driven compensation of multi-stream MLP acoustic models for robust ASR", 2606-2609.
Bořil, Hynek / Sangwan, Abhijeet / Hansen, John H. L.: "Arabic dialect identification - 'is the secret in the silence?' and other observations", 30-33.
Greenberg, Craig S. / Martin, Alvin F. / Przybocki, Mark A.: "The 2011 NIST language recognition evaluation", 34-37.
Rodríguez-Fuentes, Luis Javier / Penagarikano, Mikel / Varona, Amparo / Diez, Mireia / Bordel, Germán / Abad, Alberto / Martínez, David / Villalba, Jesus / Ortega, Alfonso / Lleida, Eduardo: "The BLZ submission to the NIST 2011 LRE: data collection, system development and performance", 38-41.
D'Haro, Luis Fernando / Glembek, Ondřej / Plchot, Oldřich / Matějka, Pavel / Soufifar, Mehdi / Cordoba, Ricardo / Černocký, Jan: "Phonotactic language recognition using i-vectors and phoneme posteriogram counts", 42-45.
McCree, Alan / Borgström, Bengt: "Supervector LDA: a new approach to reduced-complexity i-vector language recognition", 46-49.
Matějka, Pavel / Plchot, Oldřich / Soufifar, Mehdi / Glembek, Ondřej / D'Haro, Luis Fernando / Veselý, Karel / Grézl, František / Ma, Jeff / Matsoukas, Spyros / Dehak, Najim: "Patrol team language identification system for DARPA RATS P1 evaluation", 50-53.
Hu, Fang / Wu, Yungang / Xu, Wen / Han, Demin: "Articulatory strategies in obstruent production in Mandarin esophageal speech", 54-57.
Béchet, Marion / Hirsch, Fabrice / Fauth, Camille / Sock, Rudolph: "Consonantal space area in children with a cleft palate: an acoustic study", 58-61.
Paja, Milton Sarria / Falk, Tiago H.: "Automated dysarthria severity classification for improved objective intelligibility assessment of spastic dysarthric speech", 62-65.
Kacha, Abdellah / Grenez, Francis / Schoentgen, Jean: "Assessment of disordered voices using empirical mode decomposition in the log-spectral domain", 66-69.
Fuchs, Anna Katharina / Hagmüller, Martin: "Learning an artificial F0-contour for ALT speech", 70-73.
Richmond, Korin / Renals, Steve: "Ultrax: an animated midsagittal vocal tract display for speech therapy", 74-77.
Hwang, Hsin-Te / Tsao, Yu / Wang, Hsin-Min / Wang, Yih-Ru / Chen, Sin-Horng: "A study of mutual information for GMM-based spectral conversion", 78-81.
Li, Na / Qiao, Yu: "Bayesian mixture of probabilistic linear regressions for voice conversion", 82-85.
Erro, Daniel / Navas, Eva / Hernáez, Inma: "Iterative MMSE estimation of vocal tract length normalization factors for voice transformation", 86-89.
Percybrooks, Winston / Moore, Elliot: "An HMM approach to residual estimation for high resolution voice conversion", 90-93.
Toda, Tomoki / Muramatsu, Takashi / Banno, Hideki: "Implementation of computationally efficient real-time voice conversion", 94-97.
Saito, Daisuke / Minematsu, Nobuaki / Hirose, Keikichi: "Effects of speaker adaptive training on tensor-based arbitrary speaker conversion", 98-101.
Weninger, Felix / Schuller, Björn: "Discrimination of linguistic and non-linguistic vocalizations in spontaneous speech: intra- and inter-corpus perspectives", 102-105.
Avanzi, Mathieu / Dubosson, Pauline / Schwab, Sandra / Obin, Nicolas: "Accentual transfer from Swiss-German to French: a study of 'français fédéral'", 106-109.
Jannedy, Stefanie / Weirich, Melanie: "Phonology & the interpretation of fine phonetic detail in Berlin German", 110-113.
Ishi, Carlos T. / Liu, Chaoran / Ishiguro, Hiroshi / Hagita, Norihiro: "Evaluation of a formant-based speech-driven lip motion generation", 114-117.
Kallay, Jeffrey / Holliday, Jeffrey: "Using spectral measures to differentiate Mandarin and Korean sibilant fricatives", 118-121.
Jian, Hua-Li / Konopka, Richard: "EFL conversational triads: foreigner-directed speech and hyperarticulation", 122-125.
Ouyang, Iris Chuoying / Iskarous, Khalil: "Syllable perception depends on tone perception", 126-129.
DiCanio, Christian T. / Nam, Hosung / Whalen, Douglas H. / Bunnell, H. Timothy / Amith, Jonathan D. / Castillo Garcia, Rey: "Assessing agreement level between forced alignment models with data from endangered language documentation corpora", 130-133.
Fujimoto, Masako / Funatsu, Seiya / Fujimoto, Ichiro: "How consonants, dialect and speech rate affect vowel devoicing?", 134-137.
Nadeu, Marianna: "Effects of stress and speech rate on vowel quality in Catalan and Spanish", 1396-1399.
McAuliffe, Michael / Babel, Molly: "Predictability affects vowel dispersion and dynamics in the Buckeye corpus", 1400-1403.
Fox, Robert Allen / Jacewicz, Ewa: "Dialectal and generational variations in vowels in spontaneous speech", 1404-1407.
Scarborough, Rebecca / Zellou, Georgia: "Perceiving listener-directed speech: effects of authenticity and lexical neighborhood density", 1408-1411.
Chen, Ying / Kapatsinski, Vsevolod / Guion-Anderson, Susan: "Acoustic cues of vowel quality to coda nasal perception in southern Min", 1412-1415.
Simonet, Miquel / Hualde, José I. / Nadeu, Marianna: "Lenition of /d/ in spontaneous Spanish and Catalan", 1416-1419.
Fehér, Thomas / Richter, Dietmar / Jokisch, Oliver / Hoffmann, Rüdiger: "Distance-dependent noise reduction for two-channel microphones", 138-141.
Xue, Wei / Liu, Wenju: "Direction of arrival estimation based on subband weighting for noisy conditions", 142-145.
Marin-Hurtado, Jorge I. / Anderson, David V.: "Binaural noise reduction using frequency-warped FIR filters", 146-149.
Yu, Meng / Xin, Jack: "Exploring off time nature for speech enhancement", 150-153.
Bao, Xulei / Zhu, Jie: "Model-based single-channel dereverberation in noisy acoustical environments", 154-157.
Mirbagheri, Majid / Akram, Sahar / Shamma, Shihab: "An auditory inspired multimodal framework for speech enhancement", 158-161.
Hazrati, Oldooz / Lee, Jaewook / Loizou, Philipos C.: "Binary mask estimation for improved speech intelligibility in reverberant environments", 162-165.
Petkov, Petko N. / Kleijn, W. Bastiaan / Henter, Gustav Eje: "Enhancing subjective speech intelligibility using a statistical model of speech", 166-169.
Mousa, Amr El-Desoky / Basha Shaik, M. Ali / Schlüter, Ralf / Ney, Hermann: "Morpheme level feature-based language models for German LVCSR", 170-173.
Yamamoto, Hitoshi / Dixon, Paul R. / Matsuda, Shigeki / Hori, Chiori / Kashioka, Hideki: "Tied-state mixture language model for WFST-based speech recognition", 174-177.
Alumäe, Tanel / Kaljurand, Kaarel: "Maximum entropy language model adaptation for mobile speech input", 178-181.
Lecorvé, Gwénolé / Dines, John / Hain, Thomas / Motlicek, Petr: "Supervised and unsupervised web-based language model domain adaptation", 182-185.
Tam, Yik-Cheung / Vozila, Paul: "A hierarchical Bayesian approach for semi-supervised discriminative language modeling", 186-189.
Wu, Youzheng / Abe, Kazuhiko / Dixon, Paul R. / Hori, Chiori / Kashioka, Hideki: "Leveraging social annotation for topic language model adaptation", 190-193.
Sundermeyer, Martin / Schlüter, Ralf / Ney, Hermann: "LSTM neural networks for language modeling", 194-197.
Xu, Puyang / Roark, Brian / Khudanpur, Sanjeev: "Phrasal cohort based unsupervised discriminative language modeling", 198-201.
Karakos, Damianos / Roark, Brian / Shafran, Izhak / Sagae, Kenji / Lehr, Maider / Prud'hommeaux, Emily / Xu, Puyang / Glenn, Nathan / Khudanpur, Sanjeev / Saraclar, Murat / Bikel, Dan / Dredze, Mark / Callison-Burch, Chris / Cao, Yuan / Hall, Keith / Hasler, Eva / Koehn, Philip / Lopez, Adam / Post, Matt / Riley, Darcey: "Deriving conversation-based features from unlabeled speech for discriminative language modeling", 202-205.
Dikici, Erinç / Çelebi, Arda / Saraçlar, Murat: "Performance comparison of training algorithms for semi-supervised discriminative language modeling", 206-209.
Thadani, Kapil / Biadsy, Fadi / Bikel, Dan: "On-the-fly topic adaptation for YouTube video transcription", 210-213.
Jabaian, Bassam / Lefèvre, Fabrice / Besacier, Laurent: "Portability of semantic annotations for fast development of dialogue corpora", 214-217.
Griol, David / Callejas, Zoraida / López-Cózar, Ramón: "Optimization of dialog strategies using automatic dialog simulation and statistical dialog management techniques", 218-221.
Sugiyama, Hiroaki / Meguro, Toyomi / Minami, Yasuhiro: "Preference-learning based inverse reinforcement learning for dialog control", 222-225.
Meena, Raveesh / Skantze, Gabriel / Gustafson, Joakim: "A data-driven approach to understanding spoken route directions in human-robot dialogue", 226-229.
Komatani, Kazunori / Hirano, Akira / Nakano, Mikio: "Detecting system-directed utterances using dialogue-level features", 230-233.
Planells, Joaquin / Hurtado, Lluís-F. / Sanchis, Emilio / Segarra, Encarna: "An online generated transducer to increase dialog manager coverage", 234-237.
Kazemzadeh, Abe / Gibson, James / Li, Juanchen / Lee, Sungbok / Georgiou, Panayiotis G. / Narayanan, Shrikanth: "A sequential Bayesian dialog agent for computational ethnography", 238-241.
Seide, Frank / McDirmid, Sean: "Clippyscript: a programming language for multi-domain dialogue systems", 242-245.
Engelbrecht, Klaus-Peter / Möller, Sebastian: "Correlation between model-based approximations of grounding-related cognition and user judgments", 246-249.
Callejas, Zoraida / Griol, David / Engelbrecht, Klaus-Peter: "Assessment of user simulators for spoken dialogue systems by means of subspace multidimensional clustering", 250-253.
Kretzschmar, Florian / Möller, Sebastian: "Help me, I need more user tests! User simulations as supportive tool in the development process of spoken dialogue systems", 322-325.
Witt, Silke M.: "Caller response timing patterns in spoken dialog systems", 326-329.
Hakkani-Tür, Dilek / Tur, Gokhan / Heck, Larry / Fidler, Ashley / Celikyilmaz, Asli: "A discriminative classification-based approach to information state updates for a multi-domain dialog system", 330-333.
Shriberg, Elizabeth / Stolcke, Andreas / Hakkani-Tür, Dilek / Heck, Larry: "Learning when to listen: detecting system-addressed speech in human-human-computer dialog", 334-337.
Tur, Gokhan / Jeong, Minwoo / Wang, Ye-Yi / Hakkani-Tür, Dilek / Heck, Larry: "Exploiting the semantic web for unsupervised natural language semantic parsing", 338-341.
Fandrianto, Andrew / Eskenazi, Maxine: "Prosodic entrainment in an information-driven dialog system", 342-345.
Schuller, Björn / Steidl, Stefan / Batliner, Anton / Nöth, Elmar / Vinciarelli, Alessandro / Burkhardt, Felix / Son, Rob van / Weninger, Felix / Eyben, Florian / Bocklet, Tobias / Mohammadi, Gelareh / Weiss, Benjamin: "The INTERSPEECH 2012 speaker trait challenge", 254-257.
Polzehl, Tim / Schoenenberg, Katrin / Möller, Sebastian / Metze, Florian / Mohammadi, Gelareh / Vinciarelli, Alessandro: "On speaker-independent personality perception and prediction from speech", 258-261.
Audhkhasi, Kartik / Metallinou, Angeliki / Li, Ming / Narayanan, Shrikanth S.: "Speaker personality classification using systems based on acoustic-lexical cues and an optimal tree-structured Bayesian network", 262-265.
Chastagnol, Clément / Devillers, Laurence: "Personality traits detection using a parallelized modified SFFS algorithm", 266-269.
Pohjalainen, Jouni / Kadioglu, Serdar / Räsänen, Okko: "Feature selection for speaker traits", 270-273.
Wagner, Johannes / Lingenfelser, Florian / André, Elisabeth: "A frame pruning approach for paralinguistic recognition tasks", 274-277.
Ivanov, Alexei / Chen, Xin: "Modulation spectrum analysis for speaker personality trait recognition", 278-281.
Cummins, Nicholas / Epps, Julien / Kua, Jia Min Karen: "A comparison of classification paradigms for speaker likeability determination", 282-285.
Lu, Dingchao / Sha, Fei: "Predicting likability of speakers with Gaussian processes", 286-289.
Brueckner, Raymond / Schuller, Björn: "Likability classification - a not so deep neural network approach", 290-293.
Wu, Dongrui: "Genetic algorithm based feature selection for speaker trait classification", 294-297.
Weiss, Benjamin / Burkhardt, Felix: "Is 'not bad' good enough? aspects of unknown voices' likability", 510-513.
Sanchez, Michelle Hewlett / Lawson, Aaron / Vergyri, Dimitra / Bratt, Harry: "Multi-system fusion of extended context prosodic and cepstral features for paralinguistic speaker trait classification", 514-517.
Buisman, Harm / Postma, Eric: "The log-Gabor method: speech classification using spectrogram image analysis", 518-521.
Attabi, Yazid / Dumouchel, Pierre: "Anchor models and WCCN normalization for speaker trait classification", 522-525.
Montacié, Claude / Caraty, Marie-José: "Pitch and intonation contribution to speakers' traits classification", 526-529.
Anumanchipalli, Gopala Krishna / Meinedo, Hugo / Bugalho, Miguel / Trancoso, Isabel / Oliveira, Luís C. / Black, Alan W.: "Text-dependent pathological voice detection", 530-533.
Kim, Jangwon / Kumar, Naveen / Tsiartas, Andreas / Li, Ming / Narayanan, Shrikanth: "Intelligibility classification of pathological speech using fusion of multiple high level descriptors", 534-537.
Stark, Anthony / Bayestehtashk, Alireza / Asgari, Meysam / Shafran, Izhak: "Interspeech pathology challenge: investigations into speaker and sentence specific effects", 538-541.
Zhou, Xinhui / Garcia-Romero, Daniel / Mesgarani, Nima / Stone, Maureen / Espy-Wilson, Carol / Shamma, Shihab: "Automatic intelligibility assessment of pathologic speech in head and neck cancer based on auditory-inspired spectro-temporal modulations", 542-545.
Huang, Dong-Yan / Zhu, Yongwei / Wu, Dajun / Yu, Rongshan: "Detecting intelligibility by linear dimensionality reduction and normalized voice quality hierarchical features", 546-549.
Kumatani, Kenichi / Raj, Bhiksha / Singh, Rita / McDonough, John: "Microphone array post-filter based on spatially-correlated noise measurements for distant speech recognition", 298-301.
Weninger, Felix / Wöllmer, Martin / Schuller, Björn: "Combining bottleneck-BLSTM and semi-supervised sparse NMF for recognition of conversational speech in highly instationary noise", 302-305.
Lu, Liang / Chin, K. K. / Ghoshal, Arnab / Renals, Steve: "Noise compensation for subspace Gaussian mixture models", 306-309.
Sun, Yang / Doss, Mathew M. / Gemmeke, Jort F. / Cranen, Bert / Bosch, Louis ten / Boves, Lou: "Combination of sparse classification and multilayer perceptron for noise-robust ASR", 310-313.
Li, Weifeng / Bourlard, Hervé: "Sub-band based log-energy and its dynamic range stretching for robust in-car speech recognition", 314-317.
Bouallegue, Mohamed / Rouvier, Mickael / Matrouf, Driss / Linarès, Georges: "Noise compensation for speech recognition using subspace Gaussian mixture models", 318-321.
Ringeval, Fabien / Chetouani, Mohamed / Schuller, Björn: "Novel metrics of speech rhythm for the assessment of emotion", 346-349.
Wöllmer, Martin / Eyben, Florian / Schuller, Björn / Rigoll, Gerhard: "Temporal and situational context modeling for improved dominance recognition in meetings", 350-353.
Swerts, Marc / Leuverink, Kitty / Munnik, Madelene / Nijveld, Vera: "Audiovisual correlates of basic emotions in blind and sighted people", 354-357.
Cao, Houwei / Verma, Ragini / Nenkova, Ani: "Combining ranking and classification to improve emotion recognition in spontaneous speech", 358-361.
Zhang, Zixing / Schuller, Björn: "Active learning by sparse instance tracking and classifier confidence in acoustic emotion recognition", 362-365.
Rozgić, Viktor / Ananthakrishnan, Sankaranarayanan / Saleem, Shirin / Kumar, Rohit / Vembu, Aravind Namandi / Prasad, Rohit: "Emotion recognition using acoustic and lexical features", 366-369.
Weninger, Felix / Marchi, Erik / Schuller, Björn: "Improving recognition of speaker states and traits by cumulative evidence: intoxication, sleepiness, age and gender", 1159-1162.
Ding, Ni / Epps, Julien: "Speaker clustering in emotion recognition", 1163-1166.
Kim, Samuel / Yella, Sree Harsha / Valente, Fabio: "Automatic detection of conflict escalation in spoken conversations", 1167-1170.
Reichel, Uwe D. / Kisler, Thomas: "The entropy of intoxicated speech - lexical creativity and heavy tongues", 1171-1174.
Bone, Daniel / Lee, Chi-Chun / Narayanan, Shrikanth S.: "A robust unsupervised arousal rating framework using prosody with cross-corpora evaluation", 1175-1178.
Busso, Carlos / Rahman, Tauhidur: "Unveiling the acoustic properties that describe the valence dimension", 1179-1182.
Valente, Fabio / Kim, Samuel / Motlicek, Petr: "Annotation and recognition of personality traits in spoken conversations from the AMI meetings corpus", 1183-1186.
Lyu, Shao-ren: "The effects of lexical tones and nasal coda /-n/ to sadness in Taiwan Hakka", 1187-1190.
Deng, Jun / Schuller, Björn: "Confidence measures in speech emotion recognition based on semi-supervised learning", 2226-2229.
Xia, Rui / Liu, Yang: "Using i-vector space model for emotion recognition", 2230-2233.
Obin, Nicolas: "Cries and whispers - classification of vocal effort in expressive speech", 2234-2237.
Fewzee, Pouria / Karray, Fakhri: "Emotional speech: a spectral analysis", 2238-2241.
Rosenberg, Andrew: "Classifying skewed data: importance weighting to optimize average recall", 2242-2245.
Oertel, Catharine / Włodarczak, Marcin / Edlund, Jens / Wagner, Petra / Gustafson, Joakim: "Gaze patterns in turn-taking", 2246-2249.
Fecher, Natalie: "The 'audio-visual face cover corpus': investigations into audio-visual speech and speaker recognition when the speaker's face is occluded by facewear", 2250-2253.
Can, Doğan / Georgiou, Panayiotis G. / Atkins, David C. / Narayanan, Shrikanth S.: "A case study: detecting counselor reflections in psychotherapy for addictions using linguistic features", 2254-2257.
Leon, Phillip L. De / Stewart, Bryan / Yamagishi, Junichi: "Synthetic speech discrimination using pitch pattern statistics derived from image analysis", 370-373.
Wen, Zhengqi / Kawahara, Hideki / Tao, Jianhua: "Pitch-scaled analysis based residual reconstruction for speech analysis and synthesis", 374-377.
Huang, Feng / Lee, Tan: "Robust pitch estimation using l1-regularized maximum likelihood estimation", 378-381.
Degottex, Gilles / Stylianou, Yannis: "A full-band adaptive harmonic representation of speech", 382-385.
Kawahara, Hideki / Morise, Masanori / Nisimura, Ryuichi / Irino, Toshio: "Deviation measure of waveform symmetry and its application to high-speed and temporally-fine F0 extraction for vocal sound texture manipulation", 386-389.
Yoshizato, Kota / Kameoka, Hirokazu / Saito, Daisuke / Sagayama, Shigeki: "Hidden Markov convolutive mixture model for pitch contour analysis of speech", 390-393.
Sjerps, Matthias / McQueen, James M. / Mitterer, Holger: "Extrinsic normalization for vocal tracts depends on the signal, not on attention", 394-397.
Scharenborg, Odette / Janse, Esther / Weber, Andrea: "Perceptual learning of /f/-/s/ by older listeners", 398-401.
Hatano, Hiroaki / Kitamura, Tatsuya / Takemoto, Hironori / Mokhtari, Parham / Honda, Kiyoshi / Masaki, Shinobu: "Correlation between vocal tract length, body height, formant frequencies, and pitch frequency for the five Japanese vowels uttered by fifteen male speakers", 402-405.
Jagbandhu, J. / Nataraj, K. S. / Pandey, Prem C.: "Detection of transition segments in VCV utterances for estimation of the place of closure of oral stops for speech training", 406-409.
Dubois, Cyril / Sock, Rudolph: "Audiovisual discrimination of CV syllables: a simultaneous fMRI-EEG study", 410-413.
Kertkeidkachorn, Natthawut / Vorapatratorn, Surapol / Tangruamsub, Sirinart / Punyabukkana, Proadpran / Suchato, Atiwong: "Contribution of spectral shapes to tone perception", 414-417.
Tantibundhit, Charturong / Onsuwan, Chutamanee / Phienphanich, P. / Wutiwiwatchai, Chai: "Methodological issues in assessing perceptual representation of consonant sounds in Thai", 418-421.
Meyer, Julien: "Pitch and phonological perception of tone in the Suruí language of Rondônia (Brazil): identification task of LHL and LHH tonal patterns", 422-425.
Cao, Rui / Wayland, Ratree / Kaan, Edith: "The role of creaky voice in Mandarin tone 2 and tone 3 perception", 426-429.
Tyler, Michael D. / Faris, Mona M.: "Can litheners retune native categories acroth a thoneme boundary?", 430-433.
Morley, Eric / Klabbers, Esther / Santen, Jan P. H. van / Kain, Alexander / Mohammadi, Seyed Hamidreza: "Synthetic F0 can effectively convey speaker ID in delexicalized speech", 434-437.
Baumann, Timo / Schlangen, David: "Evaluating prosodic processing for incremental speech synthesis", 438-441.
Iwata, Kazuhiko / Kobayashi, Tetsunori: "Expressing speaker's intentions through sentence-final intonations for Japanese conversational speech synthesis", 442-445.
Parlikar, Alok / Black, Alan W.: "Modeling pause-duration for style-specific speech synthesis", 446-449.
Gruber, Martin: "Enumerating differences between various communicative functions for purposes of Czech expressive speech synthesis in limited domain", 450-453.
Norrenbrock, Christoph R. / Hinterleitner, Florian / Heute, Ulrich / Möller, Sebastian: "Quality analysis of macroprosodic F0 dynamics in text-to-speech signals", 454-457.
Hashimoto, Hiroya / Hirose, Keikichi / Minematsu, Nobuaki: "Improved automatic extraction of generation process model commands and its use for generating fundamental frequency contours for training HMM-based speech synthesis", 458-461.
Koriyama, Tomoki / Nose, Takashi / Kobayashi, Takao: "Discontinuous observation HMM for prosodic-event-based F0 generation", 462-465.
Meng, Fanbo / Wu, Zhiyong / Meng, Helen / Jia, Jia / Cai, Lianhong: "Hierarchical English emphatic speech synthesis based on HMM with limited training data", 466-469.
Hoffmann, Sarah / Pfister, Beat: "Employing sentence structure: syntax trees as prosody generators", 470-473.
Ohishi, Yasunori / Kameoka, Hirokazu / Mochihashi, Daichi / Kashino, Kunio: "A stochastic model of singing voice F0 contours for characterizing expressive dynamic components", 474-477.
Silovsky, Jan / Cerva, Petr / Zdansky, Jindrich / Nouza, Jan: "Study on integration of speaker diarization with speaker adaptive speech recognition for broadcast transcription", 478-481.
Shum, Stephen / Dehak, Najim / Glass, James: "On the use of spectral and iterative methods for speaker diarization", 482-485.
Knox, Mary Tai / Mirghafori, Nikki / Friedland, Gerald: "Where did I go wrong?: identifying troublesome segments for speaker diarization systems", 486-489.
Yella, Sree Harsha / Valente, Fabio: "Speaker diarization of overlapping speech based on silence distribution in meeting recordings", 490-493.
Bozonnet, Simon / Vipperla, Ravichander / Evans, Nicholas: "Phone adaptive training for speaker diarization", 494-497.
Kelly, Finnian / Drygajlo, Andrzej / Harte, Naomi: "Compensating for ageing and quality variation in speaker verification", 498-501.
Leeuwen, David van / Bahari, Mohamad Hasan: "Calibration of probabilistic age recognition", 502-505.
Bahari, Mohamad Hasan / McLaren, Mitchell / Van hamme, Hugo / Leeuwen, David van: "Age estimation from telephone speech using i-vectors", 506-509.
Rath, Shakti P. / Karafiát, Martin / Glembek, Ondřej / Černocký, Jan: "A factorized representation of FMLLR transform based on QR-decomposition", 551-554.
Tomar, Vikrant Singh / Rose, Richard C.: "A correlational discriminant approach to feature extraction for robust speech recognition", 555-558.
Weng, Chao / Juang, Biing-Hwang (Fred) / Povey, Daniel: "Discriminative training using non-uniform criteria for keyword spotting on spontaneous speech", 559-562.
Suzuki, Masayuki / Kurata, Gakuto / Nishimura, Masafumi / Minematsu, Nobuaki: "Discriminative reranking for LVCSR leveraging invariant structure", 563-566.
Hu, Ting-yao / Tsao, Yu / Lee, Lin-shan: "Discriminative fuzzy clustering maximum a posteriori linear regression for speaker adaptation", 567-570.
Tahir, Muhammad Ali / Nussbaum-Thom, Markus / Schlüter, Ralf / Ney, Hermann: "Simultaneous discriminative training and mixture splitting of HMMs for speech recognition", 571-574.
Boucheron, Laura / Leon, Phillip L. De: "Low-SNR, speaker-dependent speech enhancement using GMMs and MFCCs", 575-578.
Koutsogiannaki, Maria / Pettinato, Michelle / Mayo, Cassie / Kandia, Varvara / Stylianou, Yannis: "Can modified casual speech reach the intelligibility of clear speech?", 579-582.
Carlin, Michael A. / Malyska, Nicolas / Quatieri, Thomas F.: "Speech enhancement using sparse convolutive non-negative matrix factorization with basis adaptation", 583-586.
Kolossa, Dorothea / Nickel, Robert / Zeiler, Steffen / Martin, Rainer: "Inventory-based audio-visual speech enhancement", 587-590.
Jokinen, Emma / Alku, Paavo / Vainio, Martti: "Utilization of the Lombard effect in post-filtering for intelligibility enhancement of telephone speech", 591-594.
Duan, Zhiyao / Mysore, Gautham J. / Smaragdis, Paris: "Speech enhancement by online non-negative spectrogram decomposition in nonstationary noise environments", 595-598.
Varnet, Léo / Meyer, Julien / Hoen, Michel / Meunier, Fanny: "Phoneme resistance during speech-in-speech comprehension", 599-602.
Quené, Hugo / Schuerman, Will: "smile with a smile", 603-606.
Lunsford, Rebecca / Heeman, Peter A. / Santen, Jan P. H. van: "Interactions between turn-taking gaps, disfluencies and social obligation", 607-610.
Garnier, Maëva / Ménard, Lucie / Richard, Gabrielle: "Effect of being seen on the production of visible speech cues: a pilot study on Lombard speech", 611-614.
Włodarczak, Marcin / Šimko, Juraj / Wagner, Petra: "Temporal entrainment in overlapped speech: cross-linguistic study", 615-618.
Lee, Chi-Chun / Katsamanis, Athanasios / Georgiou, Panayiotis G. / Narayanan, Shrikanth S.: "Based on isolated saliency or causal integration? toward a better understanding of human annotation process using multiple instance learning and sequential probability ratio test", 619-622.
Levow, Gina-Anne / Duncan, Susan: "Contrasting cues to verbal and non-verbal backchannels in multi-lingual dyadic rapport", 835-838.
Strömbergsson, Sofia / Edlund, Jens / House, David: "Prosodic measurements and question types in the Spontal corpus of Swedish dialogues", 839-842.
Truong, Khiet P. / Heylen, Dirk: "Measuring prosodic alignment in cooperative task-based conversations", 843-846.
Laskowski, Kornel / Heldner, Mattias / Edlund, Jens: "On the dynamics of overlap in multi-party conversation", 847-850.
Truong, Khiet P. / Trouvain, Jürgen: "On the acoustics of overlapping laughter in conversational speech", 851-854.
Gravano, Agustín / Hirschberg, Julia: "A corpus-based study of interruptions in spoken dialogue", 855-858.
Syrdal, Ann K. / Bunnell, H. Timothy / Hertz, Susan R. / Mishra, Taniya / Spiegel, Murray / Bickley, Corine / Rekart, Deborah / Makashay, Matthew J.: "Text-to-speech intelligibility across speech rates", 623-626.
Wang, Linfang / Wang, Lijuan / Teng, Yan / Geng, Zhe / Soong, Frank K.: "Objective intelligibility assessment of text-to-speech system using template constrained generalized posterior probability", 627-630.
Valentini-Botinhao, Cassia / Yamagishi, Junichi / King, Simon: "Mel cepstral coefficient modification based on the glimpse proportion measure for improving the intelligibility of HMM-generated synthetic speech in noise", 631-634.
Zorila, Tudor-Catalin / Kandia, Varvara / Stylianou, Yannis: "Speech-in-noise intelligibility improvement based on spectral shaping and dynamic range compression", 635-638.
Erro, Daniel / Stylianou, Yannis / Navas, Eva / Hernáez, Inma: "Implementation of simple spectral techniques to enhance the intelligibility of speech using a harmonic model", 639-642.
Mohammadi, Seyed Hamidreza / Kain, Alexander / Santen, Jan P. H. van: "Making conversational vowels more clear", 643-646.
Tsurutani, Chiharu / Ishihara, Shunichi: "Naturalness judgement of prosodic variation of Japanese utterances with prosody modified stimuli", 647-650.
Avanzi, Mathieu / Dubosson, Pauline / Schwab, Sandra: "Effects of dialectal origin on articulation rate in French", 651-654.
Hsieh, Chiao-Hua / Chiang, Chen-Yu / Wang, Yih-Ru / Yu, Hsiu-Min / Chen, Sin-Horng: "A new approach of speaking rate modeling for Mandarin speech prosody", 655-658.
Doukhan, David / Rilliard, Albert / Rosset, Sophie / D'Alessandro, Christophe: "Modelling pause duration as a function of contextual length", 659-662.
Wang, Bei / Li, Chenxia / Wu, Qian / Zhang, Xiaxia / Wang, Baofeng / Xu, Yi: "Production and perception of focus in PFC and non-PFC languages: comparing Beijing Mandarin and Hainan Tsat", 663-666.
Zhang, Xiaxia / Wang, Bei / Wu, Qian / Xu, Yi: "Prosodic realization of focus in statement and question in Tibetan (Lhasa dialect)", 667-670.
Vainio, Martti / Aalto, Daniel / Suni, Antti / Arnhold, Anja / Raitio, Tuomo / Seijo, Henri / Järvikivi, Juhani / Alku, Paavo: "Effect of noise type and level on focus related fundamental frequency changes", 671-674.
Warsi, Anal / Basu, Tulika / Mazumdar, Debasis: "Role of prosody in automatic modality recognition of Bangla speech", 675-678.
Braun, Bettina: "Where to associate stressed additive particles? evidence from speech prosody", 679-682.
Benton, Matthew: "From PVI to perception: a return to the roots of rhythm in broadcast news", 683-686.
Meyer, Julien / Dentel, Laure / Seifart, Frank: "A methodology for the study of rhythm in drummed forms of languages: application to Bora Manguare of Amazon", 687-690.
Wayland, Ratree / Laphasradakul, Donruethai / Kaan, Edith / Cao, Rui: "Perception of pitch contours among native tone listeners", 1946-1948.
Igarashi, Yosuke / Koiso, Hanae: "Pitch range control of Japanese boundary pitch movements", 1949-1952.
Kuo, Grace: "Perceived prosodic boundaries in Taiwanese and their acoustic correlates", 1953-1956.
Hon, Laying / Jia, Yuan / Li, Aijun: "Phonetic foreignization of Mandarin for dubbing in imported Western movies", 1957-1960.
Moniz, Helena / Batista, Fernando / Trancoso, Isabel / Mata, Ana Isabel: "Prosodic context-based analysis of disfluencies", 1961-1964.
Lintfert, Britta / Möbius, Bernd: "Describing the development of intonational categories using a target-oriented parametric approach", 1965-1968.
Pohjalainen, Jouni / Raitio, Tuomo / Pulakka, Hannu / Alku, Paavo: "Automatic detection of high vocal effort in telephone speech", 691-694.
Gomathi, D. / Thati, Sathya Adithya / Sridaran, Karthik Venkat / Yegnanarayana, Bayya: "Analysis of mimicry speech", 695-698.
Kasess, Christian H. / Kreuzer, Wolfgang / Enzinger, Ewald / Kerschhofer-Puhalo, Nadja: "Estimation of the vocal tract shape of nasals using a Bayesian scheme", 699-702.
Birkholz, Peter / Dächert, Philippe / Neuschaefer-Rube, Christiane: "Advances in combined electro-optical palatography", 703-706.
Lee, Byung Suk / Ellis, Daniel P. W.: "Noise robust pitch tracking by subband autocorrelation classification", 707-710.
Sepulveda, Alexander / Capobianco-Guido, Rodrigo / Castellanos-Dominguez, German: "Inference of critical articulator position for fricative consonants", 711-714.
Brückl, Markus: "Vocal tremor measurement based on autocorrelation of contours", 715-718.
Hansakunbuntheung, Chatchawarn / Chotimongkol, Ananlada / Thatphithakkul, Sumonmas / Chootrakool, Patcharika: "Model-based duration-difference approach on accent evaluation of L2 learner", 719-722.
Hueber, Thomas / Bailly, Gérard / Denby, Bruce: "Continuous articulatory-to-acoustic mapping using phone-based trajectory HMM for a silent speech interface", 723-726.
Kawahara, Tatsuya / Iwatate, Takuma / Takanashi, Katsuya: "Prediction of turn-taking by combining prosodic and eye-gaze information in poster conversations", 727-730.
Wechsung, Ina / Engelbrecht, Klaus-Peter / Möller, Sebastian: "Using quality ratings to predict modality choice in multimodal systems", 731-734.
Fang, Fuming / Shinozaki, Takahiro / Horiuchi, Yasuo / Kuroiwa, Shingo / Furui, Sadaoki / Musha, Toshimitsu: "HMM based continuous EOG recognition for eye-input speech interface", 735-738.
Lilley, Jason / Stent, Amanda / Zeljkovic, Ilija: "A random, semantically appropriate sentence generator for speaker verification", 739-742.
Macias-Galindo, Daniel / Wong, Wilson / Cavedon, Lawrence / Thangarajah, John: "Coherent topic transition in a conversational agent", 743-746.
Heeman, Peter A. / Fryer, Jordan / Lunsford, Rebecca / Rueckert, Andrew / Selfridge, Ethan: "Using reinforcement learning for dialogue management policies: towards understanding MDP violations and convergence", 747-750.
López-Cózar, Ramón / Callejas, Zoraida / Griol, David: "Enhancing speech understanding in spoken dialogue systems by means of a new frame-correction technique", 751-754.
Litman, Diane / Friedberg, Heather / Forbes-Riley, Kate: "Prosodic cues to disengagement and uncertainty in physics tutorial dialogues", 755-758.
Ward, Wayne H. / Bolanos, Daniel / Cole, Ronald A.: "Spoken dialogs with a virtual science tutor", 759-762.
Cerva, Petr / Silovsky, Jan / Zdansky, Jindrich / Nouza, Jan / Malek, Jiri: "Real-time lecture transcription using ASR for Czech hearing impaired or deaf students", 763-766.
Chen, Lei / Yoon, Su-Youn: "Application of structural events detected on ASR outputs for automated speaking assessment", 767-770.
Saz, Oscar / Eskenazi, Maxine: "Addressing confusions in spoken language in ESL pronunciation tutors", 771-774.
Qian, Xiaojun / Meng, Helen / Soong, Frank K.: "The use of DBN-HMMs for mispronunciation detection and diagnosis in L2 English to support computer-aided pronunciation training", 775-778.
Cucchiarini, Catia / Doremalen, Joost van / Strik, Helmer: "Practice and feedback in L2 speaking: an evaluation of the DISCO CALL system", 779-782.
Hueber, Thomas / Ben-Youssef, Atef / Bailly, Gérard / Badin, Pierre / Elisei, Frédéric: "Cross-speaker acoustic-to-articulatory inversion using phone-based trajectory HMM for pronunciation training", 783-786.
Kintzley, Keith / Jansen, Aren / Hermansky, Hynek: "MAP estimation of whole-word acoustic models with dictionary priors", 787-790.
Thomas, Samuel / Ganapathy, Sriram / Jansen, Aren / Hermansky, Hynek: "Data-driven posterior features for low resource speech recognition applications", 791-794.
Cui, Xiaodong / Afify, Mohamed / Saon, George / Goel, Vaibhava: "Sparse Bayesian factor analysis for stereo-based stochastic mapping", 795-798.
Vanhainen, Niklas / Salvi, Giampiero: "Word discovery with beta process factor analysis", 799-802.
Hahm, Seong-Jun / Ogawa, Atsunori / Fujimoto, Masakiyo / Hori, Takaaki / Nakamura, Atsushi: "Speaker adaptation using variational Bayesian linear regression in normalized feature space", 803-806.
Krueger, Alexander / Walter, Oliver / Leutnant, Volker / Haeb-Umbach, Reinhold: "Bayesian feature enhancement for ASR of noisy reverberant real-world data", 807-810.
Yilmaz, Emre / Compernolle, Dirk van / Van hamme, Hugo: "Robust tracking for automatic reading tutors", 811-814.
Huang, Hao / Wang, Jianming / Abudureyimu, Halidan: "Maximum F1-score discriminative training for automatic mispronunciation detection in computer-assisted language learning", 815-818.
Wang, Yow-Bang / Lee, Lin-Shan: "Error pattern detection integrating generative and discriminative learning for computer-aided pronunciation training", 819-822.
Hönig, Florian / Bocklet, Tobias / Riedhammer, Korbinian / Batliner, Anton / Nöth, Elmar: "The automatic assessment of non-native prosody: combining classical prosodic analysis with acoustic modelling", 823-826.
Stanley, Theban / Hacioglu, Kadri: "Improving L1-specific phonological error diagnosis in computer assisted pronunciation training", 827-830.
Gemmeke, Jort F. / Loo, Janneke van de / Pauw, Guy de / Driesen, Joris / Van hamme, Hugo / Daelemans, Walter: "A self-learning assistive vocal interface based on vocabulary learning and grammar induction", 831-834.
Iribe, Yurie / Mori, Takurou / Katsurada, Kouichi / Kawai, Goh / Nitta, Tsuneo: "Real-time visualization of English pronunciation on an IPA chart based on articulatory feature extraction", 1271-1274.
Jeon, Je Hun / Yoon, Su-Youn: "Acoustic feature-based non-scorable response detection for an automated speaking proficiency assessment", 1275-1278.
Wuth, Jorge / Yoma, Néstor Becerra / Benavides, Leopoldo / Vivanco, Hiram: "Pronunciation quality evaluation of sentences by combining word based scores", 1279-1282.
Bell, Peter / Dzikovska, Myroslava / Isard, Amy: "Designing a spoken language interface for a tutorial dialogue system", 1283-1286.
Zhang, Long / Li, Haifeng / Ma, Lin: "Automatic pronunciation error detection based on extended pronunciation space using the unsupervised clustering of pronunciation errors", 1287-1290.
Pellegrini, Thomas / Costa, Ângela / Trancoso, Isabel: "Less errors with TTS? a dictation experiment with foreign language learners", 1291-1294.
Chen, Liang-Yu / Jang, Jyh-Shing Roger: "Improvement in automatic pronunciation scoring using additional basic scores and learning to rank", 1295-1298.
Cheng, Jian: "Automatic tone assessment of non-native Mandarin speakers", 1299-1302.
Kafentzis, George P. / Rosec, Olivier / Stylianou, Yannis: "On the modeling of voiceless stop sounds of speech using adaptive quasi-harmonic models", 859-862.
Ng, Raymond W. M. / Hain, Thomas / Hirose, Keikichi: "An alignment matching method to explore pseudosyllable properties across different corpora", 863-866.
Uria, Benigno / Murray, Iain / Renals, Steve / Richmond, Korin: "Deep architectures for articulatory inversion", 867-870.
Henry, Katharine / Sonderegger, Morgan / Keshet, Joseph: "Automatic measurement of positive and negative voice onset time", 871-874.
Khanagha, Vahid / Daoudi, Khalid: "Efficient multipulse approximation of speech excitation using the most singular manifold", 875-878.
Jansen, Aren / Thomas, Samuel / Hermansky, Hynek: "Intrinsic spectral analysis for zero and high resource speech recognition", 879-882.
Scharenborg, Odette / Witteman, Marijt / Weber, Andrea: "Computational modelling of the recognition of foreign-accented speech", 883-886.
Meister, Lya / Meister, Einar: "The production and perception of Estonian quantity degrees by native and non-native speakers", 887-890.
Sadakata, Makiko / Shingai, Mizuki / Brandmeyer, Alex / Sekiyama, Kaoru: "Perception of the moraic obstruent /q/: a cross-linguistic study", 891-894.
Nariai, Tomoko / Tanaka, Kazuyo / Kawahara, Tatsuya: "Comparative analysis of intensity between native speakers and Japanese speakers of English", 895-898.
Koniaris, Christos / Engwall, Olov / Salvi, Giampiero: "Auditory and dynamic modeling paradigms to detect L2 mispronunciations", 899-902.
Li, Sheng / Wang, Lan: "Cross linguistic comparison of Mandarin and English EMA articulatory data", 903-906.
Zeroual, Chakir / Gafos, Diamantis / Hoole, Phil / Esling, John: "Physiological and acoustic study of word initial post-lexical gemination in Moroccan Arabic", 907-910.
Tyler, Michael D. / Fenwick, Sarah: "Perceptual assimilation of Arabic voiceless fricatives by English monolinguals", 911-914.
Räsänen, Okko: "Non-auditory cognitive capabilities in computational modeling of early language acquisition", 915-918.
Räsänen, Okko / Rasilo, Heikki / Laine, Unto K.: "Modeling spoken language acquisition with a generic cognitive architecture for associative learning", 919-922.
Wang, Dongmei / Loizou, Philipos C.: "Pitch estimation based on long frame harmonic model and short frame average correlation coefficient", 923-926.
Möller, Sebastian / Wältermann, Marcel / Côté, Nicolas: "Diagnostic prediction of transmitted speech quality: a new framework for signal-based and parametric models", 927-930.
Bäckström, Tom: "Enumerative algebraic coding for ACELP", 931-934.
Saha, Atanu / Shimamura, Tetsuya: "Speech enhancement with bivariate gamma model", 935-938.
Trawicki, Marek / Johnson, Michael: "Improvements of the beta-order minimum mean-square error (MMSE) spectral amplitude estimator using chi priors", 939-942.
Harding, Philip / Milner, Ben: "Enhancing speech by reconstruction from robust acoustic features", 943-946.
Chetupally, Srikanth Raj / Sreenivas, Thippur V.: "Joint pitch-analysis formant-synthesis framework for CS recovery of speech", 947-950.
Liang, Shan / Jiang, Wei / Liu, Wenju: "A new noise-tracking algorithm for generalizing binary time-frequency (t-f) masking to ratio masking", 951-954.
Tang, Yan / Cooke, Martin: "Optimised spectral weightings for noise-dependent speech intelligibility enhancement", 955-958.
Chen, Langzhou / Gales, Mark J. F. / Wan, Vincent / Latorre, Javier / Akamine, Masami: "Exploring rich expressive information from audiobook data using cluster adaptive training", 959-962.
He, Ji / Qian, Yao / Soong, Frank K. / Zhao, Sheng: "Turning a monolingual speaker into multilingual for a mixed-language TTS", 963-966.
Veaux, Christophe / Yamagishi, Junichi / King, Simon: "Using HMM-based speech synthesis to reconstruct the voice of individuals with degenerative speech disorders", 967-970.
Latorre, Javier / Wan, Vincent / Gales, Mark J. F. / Chen, Langzhou / Chin, K. K. / Knill, Kate / Akamine, Masami: "Speech factorization for HMM-TTS based on cluster adaptive training", 971-974.
Sung, June Sig / Hong, Doo Hwa / Koo, Hyun Woo / Kim, Nam Soo: "Factored MLLR adaptation algorithm for HMM-based expressive TTS", 975-978.
Schabus, Dietmar / Pucher, Michael / Hofer, Gregor: "Speaker-adaptive visual speech synthesis in the HMM-framework", 979-982.
Oliveira, Viviane de Franca / Shiota, Sayaka / Nankaku, Yoshihiko / Tokuda, Keiichi: "Cross-lingual speaker adaptation for HMM-based speech synthesis based on perceptual characteristics and speaker interpolation", 983-986.
Nicolao, Mauro / Latorre, Javier / Moore, Roger K.: "C2H: a computational model of H&H-based phonetic contrast in synthetic speech", 987-990.
Ling, Zhen-Hua / Richmond, Korin / Yamagishi, Junichi: "Vowel creation by articulatory control in HMM-based parametric speech synthesis", 991-994.
Dall, Rasmus / Veaux, Christophe / Yamagishi, Junichi / King, Simon: "Analysis of speaker clustering strategies for HMM-based speech synthesis", 995-998.
Chen, Kuan-Yu / Chang, Hao-Chin / Chen, Berlin / Wang, Hsin-Min: "Word relevance modeling for speech recognition", 999-1002.
Duckhorn, Frank / Hoffmann, Rüdiger: "Using context-free grammars for embedded speech recognition with weighted finite-state transducers", 1003-1006.
Dufour, Richard / Damnati, Géraldine / Charlet, Delphine / Béchet, Frédéric: "Automatic transcription error recovery for person name recognition", 1007-1010.
Kobashikawa, Satoshi / Hori, Takaaki / Yamaguchi, Yoshikazu / Asami, Taichi / Masataki, Hirokazu / Takahashi, Satoshi: "Efficient beam width control to suppress excessive speech recognition computation time based on prior score range normalization", 1011-1014.
Nolden, David / Schlüter, Ralf / Ney, Hermann: "Search space pruning based on anticipated path recombination in LVCSR", 1015-1018.
McGraw, Ian / Gruenstein, Alexander: "Estimating word-stability during incremental speech recognition", 1019-1022.
Ziegler, Stefan / Ludusan, Bogdan / Gravier, Guillaume: "Using broad phonetic classes to guide search in automatic speech recognition", 1023-1026.
Miranda, João / Neto, João Paulo da Silva / Black, Alan W.: "Parallel combination of multilingual speech streams for improved ASR", 1027-1030.
Bougares, Fethi / Rouvier, Mickael / Estève, Yannick / Linarès, Georges: "Low latency combination of parallelized single-pass LVCSR systems", 1031-1034.
Kim, Jungsuk / Chong, Jike / Lane, Ian: "Efficient on-the-fly hypothesis rescoring in a hybrid GPU/CPU-based large vocabulary continuous speech recognition engine", 1035-1038.
Lehr, Maider / Prud'hommeaux, Emily / Shafran, Izhak / Roark, Brian: "Fully automated neuropsychological assessment for detecting mild cognitive impairment", 1039-1042.
Bone, Daniel / Black, Matthew P. / Lee, Chi-Chun / Williams, Marian E. / Levitt, Pat / Lee, Sungbok / Narayanan, Shrikanth: "Spontaneous-speech acoustic-prosodic features of children with autism and the interacting psychologist", 1043-1046.
Kaland, Constantijn / Krahmer, Emiel / Swerts, Marc: "Contrastive intonation in autism: the effect of speaker- and listener-perspective", 1047-1050.
Hagedorn, Christina / Proctor, Michael / Goldstein, Louis / Tempini, Maria Luisa Gorno / Narayanan, Shrikanth S.: "Characterizing covert articulation in apraxic speech using real-time MRI", 1051-1054.
Abad, Alberto / Pompili, Anna / Costa, Angela / Trancoso, Isabel: "Automatic word naming recognition for treatment and assessment of aphasia", 1055-1058.
Quatieri, Thomas F. / Malyska, Nicolas: "Vocal-source biomarkers for depression: a link to psychomotor activity", 1059-1062.
Drugman, Thomas / Urbain, Jerome / Bauwens, Nathalie / Chessini, Ricardo / Aubriot, Anne-Sophie / Lebecque, Patrick / Dutoit, Thierry: "Audio and contact microphones for cough detection", 1303-1306.
Chen, Nancy F. / Shen, Wade / Campbell, Joseph P.: "Analyzing and interpreting automatically learned rules across dialects", 1307-1310.
Raev, Andrey / Matveev, Yuri / Goloshchapova, Tatiana: "The effect of use of drugs on speaker's fundamental frequency and formants", 1311-1314.
Swerts, Marc / Bie, Cees de: "On the assessment of audiovisual cues to speaker confidence by preteens with typical development (TD) and a-typical development (AD)", 1315-1318.
Chaspari, Theodora / Lee, Chi-Chun / Narayanan, Shrikanth: "Interplay between verbal response latency and physiology of children with autism during ECA interactions", 1319-1322.
Kim, Myung Jong / Kim, Hoirin: "Combination of multiple speech dimensions for automatic assessment of dysarthric speech intelligibility", 1323-1326.
Wang, Jun / Samal, Ashok / Green, Jordan R. / Rudzicz, Frank: "Whole-word recognition from articulatory movements for silent speech interfaces", 1327-1330.
Yin, Shou-Chun / Rose, Richard C. / Tang, Yun: "Verifying session level pronunciation accuracy in a speech therapy application", 1331-1334.
Mehta, Daryush D. / Listfield, Rebecca Woodbury / Cheyne II, Harold A. / Heaton, James T. / Feng, Shengran W. / Zañartu, Matías / Hillman, Robert E.: "Duration of ambulatory monitoring needed to accurately estimate voice use", 1335-1338.
Hassanali, Khairun-nisa / Liu, Yang / Solorio, Thamar: "Evaluating NLP features for automatic prediction of language impairment using child speech transcripts", 1339-1342.
Kiss, Géza / Santen, Jan P. H. van / Prud'hommeaux, Emily / Black, Lois M.: "Quantitative analysis of pitch in speech of children with neurodevelopmental disorders", 1343-1346.
Jyothi, Preethi / Fosler-Lussier, Eric / Livescu, Karen: "Discriminatively learning factorized finite state pronunciation models from dynamic Bayesian networks", 1063-1066.
Deoras, Anoop / Sarikaya, Ruhi / Tur, Gokhan / Hakkani-Tür, Dilek: "Joint decoding for speech recognition and semantic tagging", 1067-1070.
Basha Shaik, M. Ali / Mousa, Amr El-Desoky / Schlüter, Ralf / Ney, Hermann: "Investigation of maximum entropy hybrid language models for open vocabulary German and Polish LVCSR", 1071-1074.
Dixon, Paul R. / Hori, Chiori / Kashioka, Hideki: "A specialized WFST approach for class models and dynamic vocabulary", 1075-1078.
Novak, Josef R. / Minematsu, Nobuaki / Hirose, Keikichi: "Dynamic grammars with lookahead composition for WFST-based speech recognition", 1079-1082.
Shore, Todd / Faubel, Friedrich / Helmke, Hartmut / Klakow, Dietrich: "Knowledge-based word lattice rescoring in a dynamic context", 1083-1086.
McClanahan, Richard D. / Leon, Phillip L. De: "Mixture component clustering for efficient speaker verification", 1087-1090.
Hasan, Taufiq / Hansen, John H. L.: "Front-end channel compensation using mixture-dependent feature transformations for i-vector speaker recognition", 1091-1094.
Campbell, William M. / Singer, Elliot: "Query-by-example using speaker content graphs", 1095-1098.
Sun, Hanwu / Ma, Bin: "Unsupervised NAP training data design for speaker recognition", 1099-1102.
Doddington, George: "The role of score calibration in speaker recognition", 1103-1106.
Hattori, Takafumi / Hashimoto, Kei / Nankaku, Yoshihiko / Tokuda, Keiichi: "A Bayesian approach to speaker recognition based on GMMs using multiple model structures", 1107-1110.
Wang, Jianglin / Johnson, Michael: "Residual phase cepstrum coefficients with application to cross-lingual speaker verification", 1556-1559.
Liang, Chunyan / Yang, Jinchao / Yang, Lin / Yan, Yonghong: "Speaker verification using neighborhood preserving embedding", 1560-1563.
Liang, Chunyan / Zhang, Xiang / Yang, Lin / Yan, Yonghong: "Discriminative decision function based scoring method in joint factor analysis for speaker verification", 1564-1567.
Hasan, Taufiq / Hansen, John H. L.: "Integrated feature normalization and enhancement for robust speaker recognition using acoustic factor analysis", 1568-1571.
Machlica, Lukáš / Zajic, Zbyněk: "Factor analysis and nuisance attribute projection revisited", 1572-1575.
Chen, Sheng / Xu, Mingxing: "Compensation of intrinsic variability with factor analysis modeling for robust speaker verification", 1576-1579.
Larcher, Anthony / Lee, Kong Aik / Ma, Bin / Li, Haizhou: "RSR2015: database for text-dependent speaker verification using multiple pass-phrases", 1580-1583.
Dellwo, Volker / Leemann, Adrian / Kolly, Marie-José: "Speaker idiosyncratic rhythmic features in the speech signal", 1584-1587.
Lei, Yun / Burget, Lukáš / Scheffer, Nicolas: "Bilinear factor analysis for i-vector based speaker verification", 1588-1591.
Poignant, Johann / Bredin, Hervé / Le, Viet Bac / Besacier, Laurent / Barras, Claude / Quénot, Georges: "Unsupervised speaker identification using overlaid texts in TV broadcast", 2650-2653.
Zhao, Yali / Xie, Lie / Fu, Zhonghua: "Mask estimation and refinement for MFT-based robust speaker verification", 2654-2657.
Yang, Hai / Liang, Chunyan / Xu, Yunfei / Yang, Lin / Yan, Yonghong: "Sparse probabilistic linear discriminant analysis for speaker verification", 2658-2661.
Sarkar, Achintya Kumar / Matrouf, Driss / Bousquet, Pierre Michel / Bonastre, Jean-François: "Study of the effect of i-vector modeling on short and mismatch utterance duration for speaker verification", 2662-2665.
Huang, Chien-Lin / Hori, Chiori / Kashioka, Hideki / Ma, Bin: "Ensemble classifiers using unsupervised data selection for speaker recognition", 2666-2669.
Hyon, Songgun / Wang, Hongcui / Zhao, Chen / Wei, Jianguo / Dang, Jianwu: "A method of speaker identification based on phoneme mean F-Ratio contribution", 2670-2673.
Remus, Jeremiah J. / Estrada, Jenniffer M. / Schuckers, Stephanie A. C.: "Mitigating effects of recording condition mismatch in speaker recognition using partial least squares", 2674-2677.
Marklund, Ellen / Lacerda, Francisco / Schwarz, Iris-Corinna / Sundberg, Ulla: "Similarities in fundamental frequency in infant speech segmentation models", 1111-1114.
Marklund, Ulrika / Sundberg, Ulla / Schwarz, Iris-Corinna / Lacerda, Francisco: "Phonological complexity and vocabulary size in 30-month-old Swedish children", 1115-1118.
Kim, Jeesun / Davis, Chris / Kitamura, Christine: "Auditory-visual speech to infants and adults: signals and correlations", 1119-1122.
Xu, Dongxin / Gilkerson, Jill / Richards, Jeffery A.: "Objective child vocal development measurement with naturalistic daylong audio recording", 1123-1126.
Nagao, Kyoko / Paullin, Mark / Livinsky, Vilena / Polikoff, James B. / Vallino, Linda D. / Morlet, Thierry G. / Schanen, N. Carolyn / Bunnell, H. Timothy: "Speech production-perception relationships in children with speech delay", 1127-1130.
Strömbergsson, Sofia: "Synthetic correction of deviant speech: children's perception of phonologically modified recordings of their own speech", 1131-1134.
Wan, Vincent / Latorre, Javier / Chin, K. K. / Chen, Langzhou / Gales, Mark J. F. / Zen, Heiga / Knill, Kate / Akamine, Masami: "Combining multiple high quality corpora for improving HMM-TTS", 1135-1138.
Takamichi, Shinnosuke / Toda, Tomoki / Shiga, Yoshinori / Kawai, Hisashi / Sakti, Sakriani / Nakamura, Satoshi: "An evaluation of parameter generation methods with rich context models in HMM-based speech synthesis", 1139-1142.
Lu, Heng / King, Simon: "Using Bayesian networks to find relevant context features for HMM-based speech synthesis", 1143-1146.
Yin, Xiang / Ling, Zhen-Hua / Lei, Ming / Dai, Lirong: "Considering global variance of the log power spectrum derived from mel-cepstrum in HMM-based parametric speech synthesis", 1147-1150.
Chunwijitra, Vataya / Nose, Takashi / Kobayashi, Takao: "A speech parameter generation algorithm using local variance for HMM-based speech synthesis", 1151-1154.
Ohtani, Yamato / Tamura, Masatsune / Morita, Masahiro / Kagoshima, Takehiko / Akamine, Masami: "Histogram-based spectral equalization for HMM-based speech synthesis using mel-LSP", 1155-1158.
Raitio, Tuomo / Suni, Antti / Vainio, Martti / Alku, Paavo: "Wideband parametric speech synthesis using warped linear prediction", 1420-1423.
Drugman, Thomas / Kane, John / Gobl, Christer: "Modeling the creaky excitation for parametric speech synthesis", 1424-1427.
Wen, Zhengqi / Tao, Jianhua: "Amplitude spectrum based excitation model for HMM-based speech synthesis", 1428-1431.
Nishizawa, Nobuyuki / Kato, Tsuneo: "Speech synthesis using a non-maximally decimated filter bank for embedded systems", 1432-1435.
Silén, Hanna / Helander, Elina / Nurminen, Jani / Gabbouj, Moncef: "Ways to implement global variance in statistical speech synthesis", 1436-1439.
Ohtani, Yamato / Tamura, Masatsune / Morita, Masahiro / Kagoshima, Takehiko / Akamine, Masami: "HMM-based speech synthesis using sub-band basis spectrum model", 1440-1443.
Imseng, David / Dines, John / Motlicek, Petr / Garner, Philip N. / Bourlard, Hervé: "Comparing different acoustic modeling techniques for multilingual boosting", 1191-1194.
Wang, Yongqiang / Gales, Mark J. F.: "Model-based approaches to adaptive training in reverberant environments", 1195-1198.
Gales, Mark J. F. / Flego, Federico: "Model-based approaches for degraded channel modelling in robust ASR", 1199-1202.
Hartmann, William / Fosler-Lussier, Eric: "Improved model selection for the ASR-driven binary mask", 1203-1206.
Wiesler, Simon / Schlüter, Ralf / Ney, Hermann: "Accelerated batch learning of convex log-linear models for LVCSR", 1207-1210.
Pylkkönen, Janne / Kurimo, Mikko: "Improving discriminative training for robust acoustic models in large vocabulary continuous speech recognition", 1211-1214.
Novotney, Scott / Bulyko, Ivan / Schwartz, Richard / Khudanpur, Sanjeev / Kimball, Owen: "Semi-supervised methods for improving keyword search of unseen terms", 1215-1218.
Li, Xiangang / Su, Dan / Pang, Zaihu / Wu, Xihong: "Probabilistic speaker-class based acoustic modeling for large vocabulary continuous speech recognition", 1219-1222.
Yao, Xiao / Jitsuhiro, Takatoshi / Miyajima, Chiyomi / Kitaoka, Norihide / Takeda, Kazuya: "Classification of stressed speech using physical parameters derived from two-mass model", 1223-1226.
Du, Jun / Huo, Qiang: "IVN-based joint training of GMM and HMMs using an improved VTS-based feature compensation for noisy speech recognition", 1227-1230.
Moritz, Niko / Anemüller, Jörn / Kollmeier, Birger: "Amplitude modulation filters as feature sets for robust ASR: constant absolute or relative bandwidth?", 1231-1234.
Demir, Cemil / Cemgil, A. Taylan / Saraçlar, Murat: "Effect of speech priors in single-channel speech-music separation for ASR", 1235-1238.
Narayanan, Arun / Wang, DeLiang: "On the role of binary mask pattern in automatic speech recognition", 1239-1242.
Gomez, Randy / Kawahara, Tatsuya: "Dereverberation based on wavelet packet filtering for robust automatic speech recognition", 1243-1246.
Kristjansson, Trausti / Hughes, Thad: "Spectral intersections for non-stationary signal separation", 1247-1250.
Odani, Kyohei / Wang, Longbiao / Kai, Atsuhiko: "Speech recognition by denoising and dereverberation based on spectral subtraction in a real noisy reverberant environment", 1251-1254.
Pardede, Hilman F. / Shinoda, Koichi / Iwano, Koji: "Q-Gaussian based spectral subtraction for robust speech recognition", 1255-1258.
Meyer, Bernd T. / Spille, Constantin / Kollmeier, Birger / Morgan, Nelson: "Hooking up spectro-temporal filters with auditory-inspired representations for robust automatic speech recognition", 1259-1262.
Li, Qi Peter / Sun, Xie: "Feature extraction based on hearing system signal processing for robust large vocabulary speech recognition", 1263-1266.
Arsikere, Harish / Leung, Gary K. F. / Lulich, Steven M. / Alwan, Abeer: "Automatic estimation of the first two subglottal resonances in children's speech with application to speaker normalization in limited-data conditions", 1267-1270.
Carlin, Michael A. / Patil, Kailash / Nemala, Sridhar Krishna / Elhilali, Mounya: "Robust phoneme recognition based on biomimetic speech contours", 1348-1351.
Yao, Kaisheng / Gong, Yifan / Liu, Chaojun: "A feature space transformation method for personalization using generalized i-vector clustering", 1352-1355.
Tsai, T. J. / Morgan, Nelson: "Longer features: they do a speech detector good", 1356-1359.
Alam, Md Jahangir / Kenny, Patrick / O'Shaughnessy, Douglas: "Robust feature extraction for speech recognition by enhancing auditory spectrum", 1360-1363.
Müller, Florian / Mertins, Alfred: "Enhancing vocal tract length normalization with elastic registration for automatic speech recognition", 1364-1367.
Pessentheiner, Hannes / Petrik, Stefan / Romsdorfer, Harald: "Beamforming using uniform circular arrays for distant speech recognition in reverberant environments and double talk scenarios", 1368-1371.
Pražák, Aleš / Loos, Zdeněk / Trmal, Jan / Psutka, Josef V. / Psutka, Josef: "Novel approach to live captioning through re-speaking: tailoring speech recognition to re-speaker's needs", 1372-1375.
Kolář, Jáchym / Lamel, Lori: "Development and evaluation of automatic punctuation for French and English speech-to-text", 1376-1379.
Ikbal, Shajith / Joshi, Sachindra / Verma, Ashish / Deshmukh, Om D.: "Spoken document clustering using word confusion networks", 1380-1383.
Wang, Xuancong / Ng, Hwee Tou / Sim, Khe Chai: "Dynamic conditional random fields for joint sentence boundary and punctuation prediction", 1384-1387.
Brugnara, Fabio / Falavigna, Daniele / Giuliani, Diego / Gretter, Roberto: "Analysis of the characteristics of talk-show TV programs", 1388-1391.
Rosenberg, Andrew: "Rethinking the corpus: moving towards dynamic linguistic resources", 1392-1395.
Safavi, Saeid / Najafian, Maryam / Hanani, Abualsoud / Russell, Martin / Jančovič, Peter / Carey, Michael: "Speaker recognition for children's speech", 1836-1839.
Bordel, Germán / Penagarikano, Mikel / Rodriguez-Fuentes, Luis Javier / Varona, Amparo: "A simple and efficient method to align very long speech signals to acoustically imperfect transcriptions", 1840-1843.
Takashima, Ryoichi / Takiguchi, Tetsuya / Ariki, Yasuo: "Estimation of talker's head orientation based on discrimination of the shape of cross-power spectrum phase coefficients", 1844-1847.
Lee, Ann / Glass, James: "Sentence detection using multiple annotations", 1848-1851.
Charlet, Delphine / Damnati, Geraldine: "A speaker-role based approach for detecting politicians in TV broadcast news", 1852-1855.
Mai, Guangting: "Relative importance of temporal envelope and fine structure cues in low- and high-order harmonic regions for Mandarin lexical-tone recognition", 1856-1859.
Tiwari, Nitya / Pandey, Prem C. / Kulkarni, Pandurangarao N.: "Real-time implementation of multi-band frequency compression for listeners with moderate sensorineural impairment", 1860-1863.
Mishra, Taniya / Sridhar, Vivek Rangarajan / Conkie, Alistair: "Word prominence detection using robust yet simple prosodic features", 1864-1867.
Srivastava, Amit / Khanwalkar, Saurabh / Markiewicz, Gretchen / Saikumar, Guruprasad: "Online story segmentation of multilingual streaming broadcast news", 1868-1871.
Räsänen, Okko: "Average spectrotemporal structure of continuous speech matches with the frequency resolution of human hearing", 1444-1447.
Saratxaga, Ibon / Hernáez, Inma / Pucher, Michael / Navas, Eva / Sainz, Iñaki: "Perceptual importance of the phase related information in speech", 1448-1451.
Grigorescu, Andrea / Rudnicki, Marek / Isik, Michael / Hemmert, Werner / Rini, Stefano: "Improving the entropy estimate of neuronal firings of modeled cochlear nucleus neurons", 1452-1455.
Nagao, Kyoko / Paullin, Mark / Polikoff, James B. / Lilley, Jason / Bunnell, H. Timothy: "Perception of synthetic speech in adult users of cochlear implants", 1456-1459.
Scharenborg, Odette / Janse, Esther: "Hearing loss and the use of acoustic cues in phonetic categorisation of fricatives", 1460-1463.
Hodoshima, Nao / Arai, Takayuki / Kurisu, Kiyohiro: "Intelligibility of speech spoken in noise/reverberation for older adults in reverberant environments", 1464-1467.
Hines, Andrew / Harte, Naomi: "Improved speech intelligibility with a chimaera hearing aid algorithm", 1468-1471.
Godoy, Elizabeth / Stylianou, Yannis: "Unsupervised acoustic analyses of normal and lombard speech, with spectral envelope transformation to improve intelligibility", 1472-1475.
Amano-Kusumoto, Akiko / Aronoff, Justin M. / Itoh, Motokuni / Soli, Sigfrid D.: "The effect of dichotic processing on the perception of binaural cues", 1476-1479.
Mesgarani, Nima / Chang, Edward: "Speech and speaker separation in human auditory cortex", 1480-1483.
Edlund, Jens / Heldner, Mattias / Gustafson, Joakim: "On the effect of the acoustic environment on the accuracy of perception of speaker orientation from auditory cues alone", 1484-1487.
Gonzalez, Sira / Brookes, Mike: "Sibilant speech detection in noise", 1488-1491.
Thambiratnam, Kit / Zhu, Weiwu / Seide, Frank: "Voice activity detection using speech recognizer feedback", 1492-1495.
Sharma, Dushyant / Hilkhuysen, Gaston / Naylor, Patrick A. / Gaubitch, Nikolay D. / Huckvale, Mark / Brookes, Mike: "Descriptive vocabulary development for degraded speech", 1496-1499.
Yokoyama, Ryo / Nasu, Yu / Shinoda, Koichi / Iwano, Koji: "Overlapped speech detection in meeting using cross-channel spectral subtraction and spectrum similarity", 1500-1503.
Lu, Xugang / Matsuda, Shigeki / Hori, Chiori / Kashioka, Hideki: "Speech restoration based on deep learning autoencoder with layer-wised pretraining", 1504-1507.
Chakraborty, Rupayan / Nadeu, Climent / Butko, Taras: "Detection and positioning of overlapped sounds in a room environment", 1508-1511.
Deepak, K. T. / Sarma, Biswajit Dev / Prasanna, S. R. Mahadeva: "Foreground speech segmentation using zero frequency filtered signal", 1512-1515.
Reidy, Patrick / Beckman, Mary: "The effect of spectral estimator on common spectral measures for sibilant fricatives", 1516-1519.
Grais, Emad M. / Erdogan, Hakan: "Gaussian mixture gain priors for regularized nonnegative matrix factorization in single-channel source separation", 1520-1523.
Ranjan, Shivesh / Payton, Karen L. / Mowlaee, Pejman: "Speaker independent single channel source separation using sinusoidal features", 1524-1527.
Wang, Yuxuan / Wang, DeLiang: "Boosting classification based speech separation using temporal dynamics", 1528-1531.
Wang, Yuxuan / Han, Kun / Wang, DeLiang: "Acoustic features for classification based speech separation", 1532-1535.
Grais, Emad M. / Erdogan, Hakan: "Hidden Markov models as priors for regularized nonnegative matrix factorization in single-channel source separation", 1536-1539.
Ji, Ming / Srinivasan, Ramji / Crookes, Danny: "Unconstrained speech separation by composition of longest segments", 1540-1543.
Zhang, Yi / Zhao, Yunxin: "Modulation domain blind source separation for noisy speech mixture", 1544-1547.
Mowlaee, Pejman / Saeidi, Rahim / Martin, Rainer: "Phase estimation for signal reconstruction in single-channel source separation", 1548-1551.
Chien, Jen-Tzung / Hsieh, Hsin-Lung: "Bayesian group sparse learning for nonnegative matrix factorization", 1552-1555.
Drugman, Thomas / Kane, John / Gobl, Christer: "Resonator-based creaky voice detection", 1592-1595.
Mittal, V. K. / Dhananjaya, N. / Yegnanarayana, Bayya: "Effect of tongue tip trilling on the glottal excitation source", 1596-1599.
Chen, Gang / Shue, Yen-Liang / Kreiman, Jody / Alwan, Abeer: "Estimating the voice source in noise", 1600-1603.
Pinheiro, Alan / Raitio, Tuomo / Gomes, Danyane / Alku, Paavo: "Voice source analysis using biomechanical modeling and glottal inverse filtering", 1604-1607.
Drioli, Carlo / Calanca, Andrea: "Speech modeling and processing by low-dimensional dynamic glottal models", 1608-1611.
Alku, Paavo / Pohjalainen, Jouni / Vainio, Martti / Laukkanen, Anne-Maria / Story, Brad: "Improved formant frequency estimation from high-pitched vowels by downgrading the contribution of the glottal source with weighted linear prediction", 1612-1615.
Sasou, Akira: "Automatic topology generation of glottal source HMM", 1616-1619.
Lorenzo-Trueba, Jaime / Barra-Chicote, Roberto / Raitio, Tuomo / Obin, Nicolas / Alku, Paavo / Yamagishi, Junichi / Montero, Juan M.: "Towards glottal source controllability in expressive speech synthesis", 1620-1623.
Alpan, Ali / Schoentgen, Jean / Grenez, Francis: "Combining temporal and cepstral features for the automatic perceptual categorization of disordered connected speech", 1624-1627.
Sun, Rui / Moore II, Elliot: "A preliminary study on cross-databases emotion recognition using the glottal features in speech", 1628-1631.
Maia, Ranniery / Akamine, Masami: "Analysis on the importance of short-term speech parameterizations for emotional statistical parametric speech synthesis", 1632-1635.
Mertens, Christophe / Grenez, Francis / Schoentgen, Jean: "Analysis of vocal tremor and jitter by empirical mode decomposition of glottal cycle length time series", 1636-1639.
Auvinen, Harri / Raitio, Tuomo / Siltanen, Samuli / Alku, Paavo: "Utilizing Markov chain Monte Carlo (MCMC) method for improved glottal inverse filtering", 1640-1643.
Huber, Stefan / Roebel, Axel / Degottex, Gilles: "Glottal source shape parameter estimation using phase minimization variants", 1644-1647.
Godin, Keith W. / Hasan, Taufiq / Hansen, John H. L.: "Glottal waveform analysis of physical task stress speech", 1648-1651.
Torres, Juan Félix / Moore, Elliot: "Speaker discrimination ability of glottal waveform features", 1652-1655.
Liu, Xunying / Gales, Mark J. F. / Woodland, Phillip C.: "Paraphrastic language models", 1656-1659.
Rastrow, Ariya / Dredze, Mark / Khudanpur, Sanjeev: "Efficient structured language modeling for speech recognition", 1660-1663.
Shi, Yangyang / Wiggers, Pascal / Jonker, Catholijn M.: "Towards recurrent neural networks language models with linguistic and contextual features", 1664-1667.
Lecorvé, Gwénolé / Motlicek, Petr: "Conversion of recurrent neural network language models to weighted finite state transducers for automatic speech recognition", 1668-1671.
Kuo, Hong-Kwang / Arısoy, Ebru / Emami, Ahmad / Vozila, Paul: "Large scale hierarchical neural network language models", 1672-1675.
Hutchinson, Brian / Ostendorf, Mari / Fazel, Maryam: "A sparse plus low rank maximum entropy language model", 1676-1679.
Jiang, Ye / Lee, Kong Aik / Tang, Zhenmin / Ma, Bin / Larcher, Anthony / Li, Haizhou: "PLDA modeling in i-vector and supervector space for speaker verification", 1680-1683.
Simonchik, Konstantin / Pekhovsky, Timur / Shulipa, Andrey / Afanasyev, Anton: "Supervized mixture of PLDA models for cross-channel speaker verification", 1684-1687.
Alegre, Federico / Vipperla, Ravichander / Evans, Nicholas: "Spoofing countermeasures for the protection of automatic speaker recognition systems against attacks with artificial signals", 1688-1691.
Stafylakis, Themos / Kenny, Patrick / Senoussaoui, Mohammed / Dumouchel, Pierre: "PLDA using Gaussian restricted boltzmann machines with application to speaker verification", 1692-1695.
Sadjadi, Seyed Omid / Hasan, Taufiq / Hansen, John H. L.: "Mean hilbert envelope coefficients (MHEC) for robust speaker recognition", 1696-1699.
Wu, Zhizheng / Chng, Eng Siong / Li, Haizhou: "Detecting converted speech and natural speech for anti-spoofing attack in speaker recognition", 1700-1703.
Villegas, Julián / Cooke, Martin: "Maximising objective speech intelligibility by local F0 modulation", 1704-1707.
Mayo, Catherine / Aubanel, Vincent / Cooke, Martin: "Effect of prosodic changes on speech intelligibility", 1708-1711.
Kawase, Saya / Wang, Yue: "Effects of visual speech information on native listener judgments of L2 consonant intelligibility", 1712-1715.
Brown, Guy J. / Beeston, Amy V. / Palomäki, Kalle J.: "Perceptual compensation for the effects of reverberation on consonant identification: a comparison of human and machine performance", 1716-1719.
Fitzpatrick, Michael / Kim, Jeesun / Davis, Chris: "The intelligibility of lombard speech: communicative setting matters", 1720-1723.
Santos, João Felipe / Cosentino, Stefano / Hazrati, Oldooz / Loizou, Philipos C. / Falk, Tiago H.: "Performance comparison of intrusive objective speech intelligibility and quality metrics for cochlear implant users", 1724-1727.
Chaudhuri, Sourish / Singh, Rita / Raj, Bhiksha: "Exploiting temporal sequence structure for semantic analysis of multimedia", 1728-1731.
Liu, Hong / Li, Xiaofei: "Time delay estimation for speech signal based on FOC-spectrum", 1732-1735.
Shi, Ziqiang / Zheng, Tieran / Han, Jiqing / Deng, Shiwen: "Low-rank audio signal classification under soft margin and trace norm constraints", 1736-1739.
Segura, Carlos / Hernando, Javier: "GCC-PHAT based head orientation estimation", 1740-1743.
De, Soham / Roy, Indradyumna / Prabhakar, Tarunima / Suneja, Kriti / Chaudhuri, Sourish / Singh, Rita / Raj, Bhiksha: "Plagiarism detection in polyphonic music using monaural signal separation", 1744-1747.
Bouafif, Mariem / Lachiri, Zied: "TDOA estimation for multiple speakers in underdetermined case", 1748-1751.
Nakashika, Toru / Garcia, Christophe / Takiguchi, Tetsuya: "Local-feature-map integration using convolutional neural networks for music genre classification", 1752-1755.
Berry, Jeff / Fasel, Ian / Fadiga, Luciano / Archangeli, Diana: "Training deep nets with imbalanced and unlabeled data", 1756-1759.
Asami, Taichi / Kobashikawa, Satoshi / Masataki, Hirokazu / Yoshioka, Osamu / Takahashi, Satoshi: "Speech data clustering based on phoneme error trend for unsupervised acoustic model adaptation", 1760-1763.
Kim, Wooil / Hansen, John H. L.: "Gaussian map based acoustic model adaptation using untranscribed data for speech recognition in severely adverse environments", 1764-1767.
Jiang, Danning / Kanevsky, Dimitri / Goel, Vaibhava / Qin, Yong: "Investigating performance of the discriminative methods for long-term speaker adaptation", 1768-1771.
Li, Bo / Sim, Khe Chai: "A two-stage speaker adaptation approach for subspace Gaussian mixture model based nonnative speech recognition", 1772-1775.
Christensen, Heidi / Cunningham, Stuart / Fox, Charles / Green, Phil / Hain, Thomas: "A comparative study of adaptive, automatic recognition of disordered speech", 1776-1779.
Uluskan, Seçkin / Hansen, John H. L.: "Phoneme class based adaptation for mismatch acoustic modeling of distant noisy speech", 1780-1783.
Roupakia, Zoi / Ragni, Anton / Gales, Mark J. F.: "Rapid nonlinear speaker adaptation for large-vocabulary continuous speech recognition", 1784-1787.
Chen, I-Fan / Lee, Chin-Hui: "A study on using word-level HMMs to improve ASR performance over state-of-the-art phone-level acoustic modeling for LVCSR", 1788-1791.
Seltzer, Michael / Acero, Alex: "Factored adaptation using a combination of feature-space and model-space transforms", 1792-1795.
Huang, Heyun / Bosch, Louis ten / Cranen, Bert / Boves, Lou: "Exploring discriminative speech trajectory structures", 1796-1799.
Variani, Ehsan / Hermansky, Hynek: "Estimating classifier performance in unknown noise", 1800-1803.
Jalalvand, Azarakhsh / Triefenbach, Fabian / Martens, Jean-Pierre: "Continuous digit recognition in noise: reservoirs can do an excellent job!", 1804-1807.
Pylkkönen, Janne / Kurimo, Mikko: "Optimization-based control for the extended baum-welch algorithm", 1808-1811.
Schädler, Marc René / Kollmeier, Birger: "Normalization of spectro-temporal Gabor filter bank features for improved robust automatic speech recognition systems", 1812-1815.
Li, Feipeng / Mallidi, Sri Harish / Hermansky, Hynek: "Phone recognition in critical bands using sub-band temporal modulations", 1816-1819.
Rasipuram, Ramya / Doss, Mathew M.: "Combining acoustic data driven G2p and letter-to-sound rules for under resource lexicon generation", 1820-1823.
Al-Shareef, Sarah / Hain, Thomas: "CRF-based diacritisation of colloquial Arabic for automatic speech recognition", 1824-1827.
Ganapathy, Sriram / Hermansky, Hynek: "Analysis of temporal resolution in frequency domain linear prediction", 1828-1831.
Zhang, Bing / Schwartz, Richard / Tsakalidis, Stavros / Nguyen, Long / Matsoukas, Spyros: "White listing and score normalization for keyword spotting of noisy speech", 1832-1835.
Diehl, Frank / Woodland, Phillip C.: "Complementary phone error training", 2610-2613.
Nussbaum-Thom, Markus / Tuske, Zoltan / Heigold, Georg / Schlüter, Ralf / Ney, Hermann: "Posterior-scaled MPE: novel discriminative training criteria", 2614-2617.
Ding, Pei / He, Liqiang: "Improve the implementation of pitch features for Mandarin digit string recognition task", 2618-2621.
Hsieh, Hsin-Ju / Hung, Jeih-weih / Chen, Berlin: "Exploring joint equalization of spatial-temporal contextual statistics of speech features for robust speech recognition", 2622-2625.
Matsuda, Shigeki / Ito, Naoya / Tsujino, Kosuke / Kashioka, Hideki / Sagayama, Shigeki: "Speaker-dependent voice activity detection robust to background speech noise", 2626-2629.
González, Jose A. / Peinado, Antonio M. / Gómez, Angel M. / Ma, Ning: "Log-spectral feature reconstruction based on an occlusion model for noise robust speech recognition", 2630-2633.
Abdelaziz, Ahmed Hussen / Kolossa, Dorothea: "Decoding of uncertain features using the posterior distribution of the clean data for robust speech recognition", 2634-2637.
Ma, Ning / Barker, Jon: "Coupling identification and reconstruction of missing features for noise-robust automatic speech recognition", 2638-2641.
Ludusan, Bogdan / Ziegler, Stefan / Gravier, Guillaume: "Integrating stress information in large vocabulary continuous speech recognition", 2642-2645.
Chien, Jen-Tzung / Chiang, Cheng-Chun: "Group sparse hidden Markov models for speech recognition", 2646-2649.
Metze, Florian / Fosler-Lussier, Eric: "The speech recognition virtual kitchen: an initial prototype", 1872-1873.
Reichel, Uwe D.: "Perma and Balloon: tools for string alignment and text processing", 1874-1877.
Ouni, Slim / Mangeonjean, Loïc / Steiner, Ingmar: "Visartico: a visualization tool for articulatory data", 1878-1881.
Lenkiewicz, Przemyslaw / Uytvanck, Dieter van / Wittenburg, Peter / Drude, Sebastian: "Towards automated annotation of audio and video recordings by application of advanced web-services", 1882-1885.
Ashby, Simone / Barbosa, Sílvia / Brandão, Silvia / Ferreira, José Pedro / Janssen, Maarten / Silva, Catarina / Viaro, Mário Eduardo: "A rule based pronunciation generator and regional accent databank for Portuguese", 1886-1887.
Chappel, Roger / Paliwal, Kuldip: "Speech enhancement for android (SEA): a speech processing demonstration tool for android based smart phones and tablets", 1888-1891.
Okamoto, Jacob / Pakhomov, Serguei / Shriberg, Elizabeth / Stolcke, Andreas: "ProTK: an improved prosody toolkit", 1892-1893.
Boyce, Suzanne / Fell, Harriet / MacAuslan, Joel: "Speechmark: landmark detection tool for speech analysis", 1894-1897.
Bell, Peter / Dzikovska, Myroslava / Isard, Amy: "A tutorial dialogue system with unrestricted spoken input", 2113-2114.
Sun, Xie / Li, Qi Peter / Zhu, Manli / Zhou, Qiru: "Integrating adaptive beam-forming and auditory features for robust large vocabulary speech recognition", 2115-2116.
Hofmann, Hansjörg / Ehrlich, Ute / Bader, Klaus / Nothelfer, Ilona / Berton, André: "A natural in-car speech interface to internet services using hybrid ASR", 2117-2118.
Cole, Ronald A. / Bolanos, Daniel / Ward, Wayne H. / Carmer, J. T. / Borts, Eric / Svirsky, Edward: "How marni helps English language learners acquire oral reading fluency", 2119-2120.
Finomore Jr, Victor / Stewart, John / Singh, Rita / Raj, Bhiksha / Dallman, Ron: "Demonstration of advanced multi-modal, network-centric communication management suite", 2121-2122.
Pelemans, Joris / Demuynck, Kris / Wambacq, Patrick: "Dutch automatic speech recognition on the web: towards a general purpose system", 2123-2126.
Tejedor, Javier / López-Colino, Fernando / Porta, Jordi / Colás, José: "An on-line, cloud-based Spanish-Spanish sign language translation system", 2127-2128.
He, Yanzhang / Fosler-Lussier, Eric: "Efficient segmental conditional random fields for one-pass phone recognition", 1898-1901.
Nallasamy, Udhyakumar / Metze, Florian / Schultz, Tanja: "Enhanced polyphone decision tree adaptation for accented speech recognition", 1902-1905.
Li, Jinyu / Seltzer, Michael L. / Gong, Yifan: "Efficient VTS adaptation using jacobian approximation", 1906-1909.
Cerňak, Miloš / Imseng, David / Bourlard, Hervé: "Robust triphone mapping for acoustic modeling", 1910-1913.
Zhang, Weibin / Fung, Pascale: "Sparse banded precision matrices for low resource speech recognition", 1914-1917.
Mohammed, Abdul Waheed / Matassoni, Marco / Maganti, Harikrishna / Omologo, Maurizio: "Semi-blind model adaptation using piece-wise energy decay curve for large reverberant environments", 1918-1921.
Bispo, Bruno C. / Freitas, Diamantino S.: "Developments of a hybrid pre-processor based on frequency shifting for stereophonic acoustic echo cancellation", 1922-1925.
Kinoshita, Keisuke / Delcroix, Marc / Souden, Mehrez / Nakatani, Tomohiro: "Example-based speech enhancement with joint utilization of spatial, spectral & temporal cues of speech and noise", 1926-1929.
Zhao, Shengkui / Jones, Douglas L.: "A fast-converging adaptive frequency-domain MVDR beamformer for speech enhancement", 1930-1933.
Singh, Rita / Kumatani, Kenichi / McDonough, John / Liu, Chen: "A signal-separation-based array postfilter for distant speech recognition", 1934-1937.
Yu, Meng / Soong, Frank K.: "Constrained multichannel speech dereverberation", 1938-1941.
Ritch, Ryan / Yu, Meng / Xin, Jack: "A triple-microphone real-time speech enhancement algorithm based on approximate array analytical solutions", 1942-1945.
Ng, Tim / Zhang, Bing / Nguyen, Long / Matsoukas, Spyros / Zhou, Xinhui / Mesgarani, Nima / Veselý, Karel / Matějka, Pavel: "Developing a speech activity detection system for the DARPA RATS program", 1969-1972.
Omar, Mohamed Kamal: "Speech activity detection for noisy data using adaptation techniques", 1973-1976.
Misra, Ananya: "Speech/nonspeech segmentation in web videos", 1977-1980.
Harding, Philip / Milner, Ben: "On the use of machine learning methods for speech and voicing classification", 1981-1984.
Thomas, Samuel / Mallidi, Sri Harish / Janu, Thomas / Hermansky, Hynek / Mesgarani, Nima / Zhou, Xinhui / Shamma, Shihab / Ng, Tim / Zhang, Bing / Nguyen, Long / Matsoukas, Spyros: "Acoustic and data-driven features for robust speech activity detection", 1985-1988.
Wang, Shuo / Wu, Wenjun: "A two-step NMF based algorithm for single channel speech separation", 1989-1992.
Yip, Michael C. W.: "Meaning inhibition and sentence processing in Chinese: evidence from negative priming", 1993-1996.
Ijima, Yusuke / Isogai, Mitsuaki / Mizuno, Hideyuki: "Similar speaker selection technique based on distance metric learning with perceptual voice quality similarity", 1997-2000.
Babel, Molly / McGuire, Grant: "Gendered sound symbolism and masking effects in speech processing", 2001-2004.
Bosch, Louis ten / Scharenborg, Odette: "Modeling cue trading in human word recognition", 2005-2008.
Li, David Cheng-Huan / Kaiser, Elsi: "Accounting for speech rate in spoken word recognition", 2009-2012.
Hanique, Iris / Ernestus, Mirjam: "The processes underlying two frequent casual speech phenomena in Dutch: a production experiment", 2013-2016.
Birkholz, Peter / Hoole, Phil: "Intrinsic velocity differences of lip and jaw movements: preliminary results", 2017-2020.
Viebahn, Malte C. / Ernestus, Mirjam / McQueen, James M.: "Co-occurrence of reduced word forms in natural speech", 2021-2024.
Yoshinaga, Ikuyo / Kong, Jiangping: "Voice production mechanisms of vibrato in Noh", 2025-2028.
Orozco-Arroyave, Juan Rafael / Arias-Londoño, Julian David / Vargas-Bonilla, Jesús Francisco / Nöth, Elmar: "Automatic detection of hypernasal speech signals using nonlinear and entropy measurements", 2029-2032.
Aubanel, Vincent / Cooke, Martin / Foster, Emma / Garcia Lecumberri, Maria Luisa / Mayo, Catherine: "Effects of the availability of visual information and presence of competing conversations on speech production", 2033-2036.
Huang, Shuai / Coppersmith, Glen A. / Karakos, Damianos: "Constrained maximum mutual information dimensionality reduction for language identification", 2037-2040.
BenZeghiba, Mohamed Faouzi / Gauvain, Jean-Luc / Lamel, Lori: "Phonotactic language recognition using MLP features", 2041-2044.
Penagarikano, Mikel / Varona, Amparo / Rodriguez-Fuentes, Luis Javier / Diez, Mireia / Bordel, German: "The EHU systems for the NIST 2011 language recognition evaluation", 2045-2048.
Penagarikano, Mikel / Varona, Amparo / Diez, Mireia / Rodriguez-Fuentes, Luis Javier / Bordel, German: "Study of different backends in a state-of-the-art language recognition system", 2049-2052.
Yaman, Sibel / Pelecanos, Jason / Omar, Mohamed Kamal: "On the use of non-linear polynomial kernel SVMs in language recognition", 2053-2056.
Jiang, Bing / Song, Yan / Guo, Wu / Dai, Lirong: "Exemplar-based sparse representation for language recognition on i-vectors", 2057-2060.
Shih, Yu-Chin / Lee, Hung-Shin / Wang, Hsin-Min / Jeng, Shyh-Kang: "Subspace-based feature representation and learning for language recognition", 2061-2064.
You, Changhuai / Li, Haizhou / Ma, Bin / Lee, Kong Aik: "Effect of relevance factor of maximum a posteriori adaptation for GMM-SVM in speaker and language recognition", 2065-2068.
Varona, Amparo / Penagarikano, Mikel / Rodriguez-Fuentes, Luis Javier / Bordel, German / Diez, Mireia: "Using time-synchronous phone co-occurrences in a SVM-phonotactic dialect recognition system", 2069-2072.
Mehrabani, Mahnoosh / Tepperman, Joseph / Nava, Emily: "Nativeness classification with suprasegmental features on the accent group level", 2073-2076.
Lee, Hung-yi / Chou, Po-wei / Lee, Lin-shan: "Open-vocabulary retrieval of spoken content with shorter/longer queries considering word/subword-based acoustic feature similarity", 2077-2080.
Byun, Byungki / Kim, Ilseo / Siniscalchi, Sabato Marco / Lee, Chin-Hui: "Consumer-level multimedia event detection through unsupervised audio signal modeling", 2081-2084.
Jin, Qin / Schulam, Peter / Rawat, Shourabh / Burger, Susanne / Ding, Duo / Metze, Florian: "Event-based video retrieval using audio", 2085-2088.
Zhuang, Xiaodan / Tsakalidis, Stavros / Wu, Shuang / Natarajan, Pradeep / Prasad, Rohit / Natarajan, Prem: "Compact audio representation for event detection in consumer media", 2089-2092.
Liu, Chao / Wang, Dong / Tejedor, Javier: "N-gram FST indexing for spoken term detection", 2093-2096.
Majima, Haruka / Torres, Rafael / Fujita, Yoko / Kawanami, Hiromichi / Matsui, Tomoko / Saruwatari, Hiroshi / Shikano, Kiyohiro: "Spoken inquiry discrimination using bag-of-words for speech-oriented guidance system", 2097-2100.
Tsakalidis, Stavros / Zhuang, Xiaodan / Hsiao, Roger / Wu, Shuang / Natarajan, Pradeep / Prasad, Rohit / Natarajan, Prem: "Robust event detection from spoken content in consumer domain videos", 2101-2104.
Pancoast, Stephanie / Akbacak, Murat: "Bag-of-audio-words approach for multimedia event classification", 2105-2108.
Iso, Ken-ichi / Whittaker, Edward / Emori, Tadashi / Miyake, Junpei: "Improvements in Japanese voice search", 2109-2112.
Liu, Jingjing / Cyphers, Scott / Pasupat, Panupong / McGraw, Ian / Glass, James: "A conversational movie search system based on conditional random fields", 2454-2457.
Wen, Tsung-Hsien / Lee, Hung-Yi / Lee, Lin-Shan: "Interactive spoken content retrieval with different types of actions optimized by a Markov decision process", 2458-2461.
Allauzen, Cyril / Benson, Edward / Chelba, Ciprian / Riley, Michael / Schalkwyk, Johan: "Voice query refinement", 2462-2465.
Jansen, Aren / Durme, Benjamin Van: "Indexing raw acoustic features for scalable zero resource search", 2466-2469.
Fayolle, Julien / Saraçlar, Murat / Moreau, Fabienne / Raymond, Christian / Gravier, Guillaume: "Lexical-phonetic automata for spoken utterance indexing and retrieval", 2470-2473.
McGraw, Ian / Cyphers, Scott / Pasupat, Panupong / Liu, Jingjing / Glass, James: "Automating crowd-supervised learning for spoken language systems", 2474-2477.
Sainath, Tara N. / Nahamoo, David / Kanevsky, Dimitri / Ramabhadran, Bhuvana: "Enhancing exemplar-based posteriors for speech recognition tasks", 2130-2133.
Gemmeke, Jort F. / Van hamme, Hugo: "Advances in noise robust digit recognition using hybrid exemplar-based techniques", 2134-2137.
Hurmalainen, Antti / Saeidi, Rahim / Virtanen, Tuomas: "Group sparsity for speaker identity discrimination in factorisation-based speech recognition", 2138-2141.
Sun, Yang / Cranen, Bert / Gemmeke, Jort F. / Boves, Lou / Bosch, Louis ten / Doss, Mathew M.: "Using sparse classification outputs as feature observations for noise-robust ASR", 2142-2145.
Soldo, Serena / Magimai-Doss, Mathew / Bourlard, Hervé: "Synthetic references for template-based ASR using posterior features", 2146-2149.
Wang, Dong / Tejedor, Javier: "Heterogeneous convolutive non-negative sparse coding", 2150-2153.
Geiger, Jürgen T. / Vipperla, Ravichander / Bozonnet, Simon / Evans, Nicholas / Schuller, Björn / Rigoll, Gerhard: "Convolutive non-negative sparse coding and new features for speech overlap handling in speaker diarization", 2154-2157.
Martínez-González, Beatriz / Pardo, José M. / Echeverry-Correa, Julián D. / Vallejo-Pinto, José A. / Barra-Chicote, Roberto: "Selection of TDOA parameters for MDM speaker diarization", 2158-2161.
Toledo-Ronen, Orith / Aronowitz, Hagai: "Confidence for speaker diarization using PCA spectral ratio", 2162-2165.
Tawara, Naohiro / Ogawa, Tetsuji / Watanabe, Shinji / Nakamura, Atsushi / Kobayashi, Tetsunori: "Fully Bayesian speaker clustering based on hierarchically structured utterance-oriented Dirichlet process mixture model", 2166-2169.
Vijayasenan, Deepu / Valente, Fabio: "Diartk: an open source toolkit for research in multistream speaker diarization and its application to meetings recordings", 2170-2173.
Dupuy, Grégor / Rouvier, Mickael / Meignier, Sylvain / Estève, Yannick: "I-vectors and ILP clustering adapted to cross-show speaker diarization", 2174-2177.
Israel, Assaf / Proctor, Michael / Goldstein, Louis / Iskarous, Khalil / Narayanan, Shrikanth: "Emphatic segments and emphasis spread in Lebanese Arabic: a real-time magnetic resonance imaging study", 2178-2181.
Shosted, Ryan K. / Sutton, Bradley P. / Benmamoun, Abbas: "Using magnetic resonance to image the pharynx during Arabic speech: static and dynamic aspects", 2182-2185.
Vargas, Julián Andrés Valdés / Badin, Pierre / Lamalle, Laurent: "Articulatory speaker normalisation based on MRI-data using three-way linear decomposition methods", 2186-2189.
Arai, Takayuki: "Vowels produced by sliding three-tube model with different lengths", 2190-2193.
Kaburagi, Tokihiko / Takano, Tetsuro / Sakamoto, Yuki: "Estimating the vocal-tract area function from formants using a sensitivity function and least square", 2194-2197.
Lucero, Jorge C. / Koenig, Laura L. / Fuchs, Susanne: "Modeling source-tract interaction in speech production: voicing onset vs. vowel height after a voiceless obstruent", 2198-2201.
Bollepalli, Bajibabu / Black, Alan W. / Prahallad, Kishore: "Modelling a noisy-channel for voice conversion using articulatory features", 2202-2205.
Janska, Anna C. / Schröger, Erich / Jacobsen, Thomas / Clark, Robert A. J.: "Asymmetries in the perception of synthesized speech", 2206-2209.
Greene, Erica / Mishra, Taniya / Haffner, Patrick / Conkie, Alistair: "Predicting character-appropriate voices for a TTS-based storyteller system", 2210-2213.
Sorin, Alexander / Shechtman, Slava / Pollet, Vincent: "Psychoacoustic segment scoring for multi-form speech synthesis", 2214-2217.
Bailly, Gérard / Gouvernayre, Cécilia: "Pauses and respiratory markers of the structure of book reading", 2218-2221.
Potard, Blaise / Aylett, Matthew P. / Pidcock, Christopher J.: "Proper name splicing in computer games with TTS", 2222-2225.
Mehrabani, Mahnoosh / Hansen, John H. L.: "Speaker clustering for a mixture of singing and reading", 2258-2261.
Ghosh, Sayan / Sreenivas, Thippur V.: "Automatic speech segmentation using probabilistic latent component modeling", 2262-2265.
Dennis, Jonathan / Tran, Huy Dat / Chng, Eng Siong: "Overlapping sound event recognition using local spectrogram features with the generalised hough transform", 2266-2269.
Kalinli, Ozlem: "Automatic phoneme segmentation using auditory attention features", 2270-2273.
Kua, Jia Min Karen / Thiruvaran, Tharmarajah / Ambikairajah, Eliathamby: "A non-uniform filterbank for speaker recognition", 2274-2277.
Lorenzo-Trueba, Jaime / Martinez-Gonzalez, Beatriz / Lopez-Ludeña, Veronica / Barra-Chicote, Roberto / Ferreiros, Javier / Yamagishi, Junichi / Montero, Juan M.: "Towards an unsupervised speaking style voice building framework: multi-style speaker diarization", 2278-2281.
Mohammadi, Seyed Hamidreza / Sameti, Hossein / Langarani, Mahsa Sadat Elyasi / Tavanaei, Amirhossein: "KNNDIST: a non-parametric distance measure for speaker segmentation", 2282-2285.
Feng, Wei / Nie, Xuecheng / Wan, Liang / Xie, Lei / Jiang, Jianmin: "Lexical story co-segmentation of Chinese broadcast news", 2286-2289.
Karnjanadecha, Montri / Zahorian, Stephen A.: "Toward an optimum feature set and HMM model parameters for automatic phonetic alignment of spontaneous speech", 2290-2293.
Vertanen, Keith / Kristensson, Per Ola: "Spelling as a complementary strategy for speech recognition", 2294-2297.
Schlippe, Tim / Ochs, Sebastian / Vu, Ngoc Thang / Schultz, Tanja: "Automatic error recovery for pronunciation dictionaries", 2298-2301.
Senay, Grégory / Linarès, Georges: "Confidence measure for speech indexing based on latent dirichlet allocation", 2302-2305.
Cerisara, Christophe / Lorenzo, Alejandra: "Mixed probabilistic and deterministic dependency parsing", 2306-2309.
Yamahata, Shoko / Yamaguchi, Yoshikazu / Ogawa, Atsunori / Masataki, Hirokazu / Yoshioka, Osamu / Takahashi, Satoshi: "Automatic vocabulary adaptation based on semantic similarity and speech recognition confidence measure", 2310-2313.
Ward, Nigel G. / Vega, Alejandro: "Towards empirical dialog-state modeling and its use in language modeling", 2314-2317.
Kubo, Keigo / Kawanami, Hiromichi / Saruwatari, Hiroshi / Shikano, Kiyohiro: "Evaluation of many-to-many alignment algorithm by automatic pronunciation annotation using web text mining", 2318-2321.
Koço, Sokol / Capponi, Cécile / Béchet, Frédéric: "Applying multiview learning algorithms to human-human conversation classification", 2322-2325.
Akita, Yuya / Watanabe, Makoto / Kawahara, Tatsuya: "Automatic transcription of lecture speech using language model based on speaking-style transformation of proceeding texts", 2326-2329.
Li, Chen / Liu, Yang: "Normalization of text messages using character- and phone-based machine translation approaches", 2330-2333.
Azim, Aisha S. / Wang, Xiaoxuan / Chai, Sim Khe: "A weighted combination of speech with text-based models for Arabic diacritization", 2334-2337.
Seigel, Matthew S. / Woodland, Phillip C.: "Using sub-word-level information for confidence estimation with conditional random field models", 2338-2341.
Lee, Hung-yi / Chou, Yu-yu / Wang, Yow-Bang / Lee, Lin-shan: "Supervised spoken document summarization jointly considering utterance importance and redundancy by structured support vector machine", 2342-2345.
Chen, Yun-Nung / Metze, Florian: "Integrating intra-speaker topic modeling and temporal-based inter-speaker topic modeling in random walk for improved multi-party meeting summarization", 2346-2349.
Feng, Junlan / Renger, Bernard: "Language modeling for voice-enabled social TV using tweets", 2350-2353.
Kumar, Rohit / Prasad, Rohit / Ananthakrishnan, Sankaranarayanan / Vembu, Aravind Namandi / Stallard, Dave / Tsakalidis, Stavros / Natarajan, Prem: "Detecting OOV named-entities in conversational speech", 2354-2357.
Maskey, Sameer / Zhou, Bowen: "Unsupervised deep belief features for speech translation", 2358-2361.
Pérez, Alicia / Alcaide, José M. / Torres, María-Inés: "Euskoparl: a speech and text Spanish-Basque parallel corpus", 2362-2365.
Ryu, Hyuksu / Kim, Sunhee / Chung, Minhwa: "Comparing transcription agreement on non-native English speech corpus between native and non-native annotators", 2366-2369.
Ogata, Jun / Goto, Masataka: "Podcastle: collaborative training of language models on the basis of wisdom of crowds", 2370-2373.
Xie, Lei / Xu, Yinqing / Zheng, Lilei / Huang, Qiang / Li, Bingfeng: "Speech pattern discovery using audio-visual fusion and canonical correlation analysis", 2374-2377.
Maskey, Sameer / Rosenberg, Andrew: "Power mean pyramid scores for summarization evaluation", 2378-2381.
Escudero-Mancebo, David / Estebas-Vilaplana, Eva: "Visualizing tool for evaluating inter-label similarity in prosodic labeling experiments", 2382-2385.
Wagner, Petra / Tamburini, Fabio / Windmann, Andreas: "Objective, subjective and linguistic roads to perceptual prominence - how are they compared and why?", 2386-2389.
Heckmann, Martin: "Audio-visual evaluation and detection of word prominence in a human-machine interaction scenario", 2390-2393.
Arnold, Denis / Wagner, Petra / Möbius, Bernd: "Obtaining prominence judgments from naive listeners - influence of rating scales, linguistic levels and normalisation", 2394-2397.
Badino, Leonardo / Clark, Robert A. J. / Wester, Mirjam: "Towards hierarchical prosodic prominence generation in TTS synthesis", 2398-2401.
Cutugno, Francesco / Leone, Enrico / Ludusan, Bogdan / Origlia, Antonio: "Investigating syllabic prominence with conditional random fields and latent-dynamic conditional random fields", 2402-2405.
Samlowski, Barbara / Wagner, Petra / Möbius, Bernd: "Disentangling lexical, morphological, syntactic and semantic influences on German prominence - evidence from a production study", 2406-2409.
Rosenberg, Andrew: "Using prominence and phrasing predictions to improve weighted dictionary pronunciation models", 2410-2413.
Goldman, Jean-Philippe / Avanzi, Mathieu / Auchlin, Antoine / Simon, Anne Catherine: "A continuous prominence score based on acoustic features", 2414-2417.
Sappok, Christopher / Arnold, Denis: "More on the normalization of syllable prominence ratings", 2418-2421.
Mahrt, Tim / Cole, Jennifer / Fleck, Margaret / Hasegawa-Johnson, Mark: "F0 and the perception of prominence", 2422-2425.
Andreeva, Bistra / Barry, William / Wolska, Magdalena: "Language differences in the perceptual weight of prominence-lending properties", 2426-2429.
Li, Haiyang / Han, Jiqing / Zheng, Tieran / Zheng, Guibin: "A novel confidence measure based on context consistency for spoken term detection", 2430-2433.
Karanasou, Panagiota / Burget, Lukas / Vergyri, Dimitra / Akbacak, Murat / Mandal, Arindam: "Discriminatively trained phoneme confusion model for keyword spotting", 2434-2437.
Kintzley, Keith / Jansen, Aren / Church, Kenneth / Hermansky, Hynek: "Inverting the point process model for fast phonetic keyword search", 2438-2441.
Norouzian, Atta / Jansen, Aren / Rose, Richard C. / Thomas, Samuel: "Exploiting discriminative point process models for spoken term detection", 2442-2445.
Bulyko, Ivan / Herrero, José / Mihelich, Chris / Kimball, Owen: "Subword speech recognition for detection of unseen words", 2446-2449.
Qin, Long / Rudnicky, Alexander: "OOV word detection using hybrid models with mixed types of fragments", 2450-2453.
Vosoughi, Soroush / Roy, Deb: "An automatic child-directed speech detector for the study of child language development", 2478-2481.
Plummer, Andrew R.: "Aligning manifolds to model the earliest phonological abstraction in infant-caretaker vocal imitation", 2482-2485.
Saikachi, Yoko / Kitahara, Mafuyu / Nishikawa, Ken'ya / Kanato, Ai / Mazuka, Reiko: "The F0 fall delay of lexical pitch accent in Japanese infant-directed speech", 2486-2489.
Shport, Irina A.: "Children's productions of multi-syllabic lexical stress patterns in different prosodic positions", 2490-2493.
Redford, Melissa A. / Dilley, Laura C. / Gamache, Jessica L. / Wieland, Elizabeth A.: "Prosodic marking of continuation versus completion in children's narratives", 2494-2497.
Fogerty, Daniel / Kewley-Port, Diane / Humes, Larry E.: "Judging temporal onset differences for concurrent vowels: results for young, middle-aged, and older adults", 2498-2501.
Hu, Pengfei / Liu, Wenju / Jiang, Wei: "Combining frame and segment based models for environmental sound classification", 2502-2505.
Leng, Yi Ren / Tran, Huy Dat: "Using blob detection in missing feature linear-frequency cepstral coefficients for robust sound event recognition", 2506-2509.
Patil, Kailash / Elhilali, Mounya: "Goal-oriented auditory scene recognition", 2510-2513.
Ziaei, Ali / Sangwan, Abhijeet / Hansen, John H. L.: "Prof-life-log: audio environment detection for naturalistic audio streams", 2514-2517.
Huang, Po-Sen / Yang, Jianchao / Hasegawa-Johnson, Mark / Liang, Feng / Huang, Thomas S.: "Pooling robust shift-invariant sparse representations of acoustic signals", 2518-2521.
Tan, Lee Ngee / Kaewtip, Kantapon / Cody, Martin L. / Taylor, Charles E. / Alwan, Abeer: "Evaluation of a sparse representation-based classifier for bird phrase classification under limited data conditions", 2522-2525.
Novak, Josef R. / Dixon, Paul R. / Minematsu, Nobuaki / Hirose, Keikichi / Hori, Chiori / Kashioka, Hideki: "Improving WFST-based G2P conversion with alignment constraints and RNNLM n-best rescoring", 2526-2529.
Luan, Jian / He, Bolei / Xia, Hairong / Wang, Linfang / Braga, Daniela / Zhao, Sheng: "Expand CRF to model long distance dependencies in prosodic break prediction", 2530-2533.
Veilleux, Nanette / Barnes, Jonathan / Brugos, Alejna / Shattuck-Hufnagel, Stefanie: "Perceptual foundations for naturalistic variability in the prosody of synthetic speech", 2534-2537.
Hahn, Stefan / Vozila, Paul / Bisani, Maximilian: "Comparison of grapheme-to-phoneme methods on large pronunciation dictionaries and LVCSR tasks", 2538-2541.
Berthommier, Frédéric / Girin, Laurent / Boë, Louis-Jean: "A simple hybrid acoustic/morphologically-constrained technique for the synthesis of stop consonants in various vocalic contexts", 2542-2545.
Prahallad, Kishore / Kumar, E. Naresh / Keri, Venkatesh / Rajendran, S. / Black, Alan W.: "The IIIT-H Indic speech databases", 2546-2549.
San-Segundo, Rubén / Montero, Juan M. / López-Ludeña, Verónica / King, Simon: "Detecting acronyms from capital letter sequences in Spanish", 2550-2553.
Lehnen, Patrick / Hahn, Stefan / Guta, Vlad-Andrei / Ney, Hermann: "Hidden conditional random fields with M-to-N alignments for grapheme-to-phoneme conversion", 2554-2557.
Rosenberg, Andrew / Fernandez, Raul / Ramabhadran, Bhuvana: "Phrase boundary assignment from text in multiple domains", 2558-2561.
Minematsu, Nobuaki / Kobayashi, Shumpei / Shimizu, Shinya / Hirose, Keikichi: "Improved prediction of Japanese word accent sandhi using CRF", 2562-2565.
Toutios, Asterios / Maeda, Shinji: "Articulatory VCV synthesis from EMA data", 2566-2569.
Zellou, Georgia: "Nasality from Moroccan Arabic nasal and pharyngeal consonants: patterns of airflow and nasalance", 2678-2681.
Delvaux, Véronique / Huet, Kathy / Piccaluga, Myriam / Harmegnies, Bernard: "Inter-gestural timing in French nasal vowels: a comparative study of (Liège, Tournai) northern French vs. (Marseille, Toulouse) southern French", 2682-2685.
Zellou, Georgia / Scarborough, Rebecca: "Nasal coarticulation and contrastive stress", 2686-2689.
Oliveira, Catarina / Martins, Paula / Silva, Samuel / Teixeira, António: "An MRI study of the oral articulation of European Portuguese nasal vowels", 2690-2693.
Scarborough, Rebecca / Zellou, Georgia: "Acoustic and perceptual similarity in coarticulatorily nasalized vowels", 2694-2697.
Rong, Panying / Shosted, Ryan K. / Kuehn, David: "Articulatory differences between oral and nasal vowels based on the simulation of a speaker-adaptive articulatory model", 2698-2701.