Table of Contents and Access to Abstracts
Keynotes
Hirschberg, Julia:
"Speaking more like you: entrainment in conversational speech",
4001.
Mitchell, Tom M.:
"Neural representations of word meanings",
4002.
Pentland, Alex:
"Signals and speech",
1-4.
Speaker Recognition - Modeling
Matza, Avi / Bistritz, Yuval:
"Skew Gaussian mixture models for speaker recognition",
5-8.
Toledo-Ronen, Orith / Aronowitz, Hagai / Hoory, Ron / Pelecanos, Jason / Nahamoo, David:
"Towards goat detection in text-dependent speaker verification",
9-12.
Bonastre, Jean-François / Anguera, Xavier / Sierra, Gabriel H. / Bousquet, Pierre-Michel:
"Speaker modeling using local binary decisions",
13-16.
Aronowitz, Hagai / Hoory, Ron / Pelecanos, Jason / Nahamoo, David:
"New developments in voice biometrics for user authentication",
17-20.
Mandasari, Miranti Indar / McLaren, Mitchell / Leeuwen, David A. van:
"Evaluation of i-vector speaker recognition systems for forensic application",
21-24.
Senoussaoui, Mohammed / Kenny, Patrick / Brümmer, Niko / Villiers, Edward de / Dumouchel, Pierre:
"Mixture of PLDA models in i-vector space for gender-independent speaker recognition",
25-28.
Speech Perception - Speech Intelligibility
Iyer, Nandini / Brungart, Douglas S. / Simpson, Brian D.:
"Segregation of whispered speech interleaved with noise or speech maskers",
29-32.
Kliper, Roi / Kayser, Hendrik / Weinshall, Daphna / Nelken, Israel / Anemüller, Jörn:
"Monaural azimuth localization using spectral dynamics of speech",
33-36.
Rennies, Jan / Brand, Thomas / Kollmeier, Birger:
"Prediction of binaural intelligibility level differences in reverberation",
37-40.
Gautreau, Aurore / Hoen, Michel / Meunier, Fanny:
"Let's all speak together! exploring the impact of various languages on the comprehension of speech in multi-linguistic babble",
41-44.
Shafiro, Valeriy / Sheft, Stanley / Risley, Robert:
"Cross-rate variation in the intelligibility of dual-rate gated speech in older listeners",
45-48.
Lee, Chia-ying / Glass, James / Ghitza, Oded:
"An efferent-inspired auditory model front-end for speech recognition",
49-52.
Speech Representation and Modelling
Ali, Faten Ben / Girin, Laurent / Larbi, Sonia Djaziri:
"A long-term harmonic plus noise model for speech signals",
53-56.
Cinnéide, Alan Ó / Dorran, David / Gainza, Mikel / Coyle, Eugene:
"A frequency domain approach to ARX-LF voiced speech parameterization and synthesis",
57-60.
Ramanarayanan, Vikram / Katsamanis, Athanasios / Narayanan, Shrikanth:
"Automatic data-driven learning of articulatory primitives from real-time MRI data using convolutive NMF with sparseness constraints",
61-64.
Wang, Dong / Vipperla, Ravichander / Evans, Nicholas:
"Online pattern learning for non-negative convolutive sparse coding",
65-68.
Malyska, Nicolas / Quatieri, Thomas F. / Dunn, Robert:
"Sinewave representations of nonmodality",
69-72.
Raj, Ch. Srikanth / Sreenivas, T. V.:
"Time-varying signal adaptive transform and IHT recovery of compressive sensed speech",
73-76.
Emotion, Speaking Style, and Social Behavior
Wöllmer, Martin / Weninger, Felix / Eyben, Florian / Schuller, Björn:
"Acoustic-linguistic recognition of interest in speech with bottleneck-BLSTM nets",
77-80.
Erden, Mustafa / Arslan, Levent M.:
"Automatic detection of anger in human-human call center dialogs",
81-84.
Chang, Keng-hao / Lei, Howard / Canny, John:
"Improved classification of speaking styles for mental health monitoring using phoneme dynamics",
85-88.
Black, Matthew P. / Georgiou, Panayiotis G. / Katsamanis, Athanasios / Baucom, Brian R. / Narayanan, Shrikanth:
"“you made me do it”: classification of blame in married couples' interactions by fusing automatically derived speech and language information",
89-92.
Goudbeek, Martijn / Nilsenová, Marie:
"Context and priming effects in the recognition of emotion of old and young listeners",
93-96.
Gravano, Agustín / Levitan, Rivka / Willson, Laura / Beňuš, Štefan / Hirschberg, Julia / Nenkova, Ani:
"Acoustic and prosodic correlates of social behavior",
97-100.
HMM-Based Speech Synthesis I, II
Oh, Kyung Hwan / Sung, June Sig / Hong, Doo Hwa / Kim, Nam Soo:
"Decision tree-based clustering with outlier detection for HMM-based speech synthesis",
101-104.
Silén, Hanna / Helander, Elina / Gabbouj, Moncef:
"Prediction of voice aperiodicity based on spectral representations in HMM speech synthesis",
105-108.
Nose, Takashi / Kobayashi, Takao:
"A perceptual expressivity modeling technique for speech synthesis based on multiple-regression HSMM",
109-112.
Hashimoto, Kei / Nankaku, Yoshihiko / Tokuda, Keiichi:
"Multi-speaker modeling with shared prior distributions and model structures for Bayesian speech synthesis",
113-116.
Ling, Zhen-Hua / Richmond, Korin / Yamagishi, Junichi:
"Feature-space transform tying in unified acoustic-articulatory modelling for articulatory control of HMM-based speech synthesis",
117-120.
Shannon, Matt / Zen, Heiga / Byrne, William:
"The effect of using normalized models in statistical speech synthesis",
121-124.
Picart, Benjamin / Drugman, Thomas / Dutoit, Thierry:
"Continuous control of the degree of articulation in HMM-based speech synthesis",
1797-1800.
Chen, Ling-Hui / Nankaku, Yoshihiko / Zen, Heiga / Tokuda, Keiichi / Ling, Zhen-Hua / Dai, Li-Rong:
"Estimation of window coefficients for dynamic feature extraction for HMM-based speech synthesis",
1801-1804.
Wen, Zhengqi / Tao, Jianhua:
"Inverse filtering based harmonic plus noise excitation model for HMM-based speech synthesis",
1805-1808.
Erro, Daniel / Sainz, Iñaki / Navas, Eva / Hernáez, Inma:
"Improved HNM-based vocoder for statistical synthesizers",
1809-1812.
Anumanchipalli, Gopala Krishna / Oliveira, Luís C. / Black, Alan W.:
"A statistical phrase/accent model for intonation modeling",
1813-1816.
Henter, Gustav Eje / Kleijn, W. Bastiaan:
"Intermediate-state HMMs to capture continuously-changing signal features",
1817-1820.
Braunschweiler, Norbert / Buchholz, Sabine:
"Automatic sentence selection from speech corpora including diverse speech for improved HMM-TTS synthesis quality",
1821-1824.
Liang, Hui / Dines, John:
"Phonological knowledge guided HMM state mapping for cross-lingual speaker adaptation",
1825-1828.
Obin, Nicolas / Lanchantin, Pierre / Lacheret, Anne / Rodet, Xavier:
"Reformulating prosodic break model into segmental HMMs and information fusion",
1829-1832.
Maia, Ranniery / Zen, Heiga / Knill, Kate / Gales, M. J. F. / Buchholz, Sabine:
"Multipulse sequences for residual signal modeling",
1833-1836.
Valentini-Botinhao, Cassia / Yamagishi, Junichi / King, Simon:
"Can objective measures predict the intelligibility of modified HMM-based synthetic speech in noise?",
1837-1840.
Nitta, Tsuneo / Onoda, Takayuki / Kimura, Masashi / Iribe, Yurie / Katsurada, Kouichi:
"Speech synthesis based on articulatory-movement HMMs with voice-source codebooks",
1841-1844.
Kato, Tsuneo / Yamada, Makoto / Nishizawa, Nobuyuki / Oura, Keiichiro / Tokuda, Keiichi:
"Large-scale subjective evaluations of speech rate control methods for HMM-based speech synthesizers",
1845-1848.
Maeno, Yu / Nose, Takashi / Kobayashi, Takao / Ijima, Yusuke / Nakajima, Hideharu / Mizuno, Hideyuki / Yoshioka, Osamu:
"HMM-based emphatic speech synthesis using unsupervised context labeling",
1849-1852.
Speaker Recognition - Modeling, Automatic Procedures, Analysis I-III
Zhang, Ce / Zheng, Rong / Xu, Bo:
"Restoring the residual speaker information in total variability modeling for speaker verification",
125-128.
Aronowitz, Hagai / Barkan, Oren:
"New developments in joint factor analysis for speaker verification",
129-132.
Gonzalez-Rodriguez, Joaquin:
"Speaker recognition using temporal contours in linguistic units: the case of formant and formant-bandwidth trajectories",
133-136.
Glembek, Ondřej / Burget, Lukáš / Brümmer, Niko / Plchot, Oldřich / Matějka, Pavel:
"Discriminatively trained i-vector extractor for speaker verification",
137-140.
Sanchez, Michelle Hewlett / Ferrer, Luciana / Shriberg, Elizabeth / Stolcke, Andreas:
"Constrained cepstral speaker recognition using matched UBM and JFA training",
141-144.
McCree, Alan / Sturim, Douglas / Reynolds, Douglas:
"A new perspective on GMM subspace compensation based on PPCA and wiener filtering",
145-148.
Zhang, Ce / Zheng, Rong / Xu, Bo:
"Data-driven Gaussian component selection for fast GMM-based speaker verification",
245-248.
Garcia-Romero, Daniel / Espy-Wilson, Carol Y.:
"Analysis of i-vector length normalization in speaker recognition systems",
249-252.
Jiang, Weiwu / Li, Zhifeng / Meng, Helen:
"An analysis framework based on random subspace sampling for speaker verification",
253-256.
Scheffer, Nicolas / Lei, Yun / Ferrer, Luciana:
"Factor analysis back ends for MLLR transforms in speaker recognition",
257-260.
Greenberg, Craig S. / Martin, Alvin F. / Barr, Bradford N. / Doddington, George R.:
"Report on performance results in the NIST 2010 speaker recognition evaluation",
261-264.
Kockmann, Marcel / Ferrer, Luciana / Burget, Lukáš / Černocký, Jan:
"ivector fusion of prosodic and cepstral features for speaker verification",
265-268.
Kanagasundaram, Ahilan / Vogt, Robbie / Dean, David / Sridharan, Sridha / Mason, Michael:
"i-vector based speaker recognition on short utterances",
2341-2344.
Sun, Hanwu / Ma, Bin:
"Study of overlapped speech detection for NIST SRE summed channel speaker recognition",
2345-2348.
Ma, Zhanyu / Leijon, Arne:
"Super-dirichlet mixture models using differential line spectral frequencies for text-independent speaker identification",
2349-2352.
Yu, Hon-Bill / Mak, Man-Wai:
"Comparison of voice activity detectors for interview speech in NIST speaker recognition evaluation",
2353-2356.
Sarkar, A. K. / Umesh, S.:
"Eigen-voice based anchor modeling system for speaker identification using MLLR super-vector",
2357-2360.
Wang, Wen / Kathol, Andreas / Bratt, Harry:
"Automatic detection of speaker attributes based on utterance text",
2361-2364.
Cumani, Sandro / Batzu, Pier Domenico / Colibro, Daniele / Vair, Claudio / Laface, Pietro / Vasilakakis, Vasileios:
"Comparison of speaker recognition approaches for real applications",
2365-2368.
Polzehl, Tim / Möller, Sebastian / Metze, Florian:
"Modeling speaker personality using voice",
2369-2372.
Ferràs, Marc / Shinoda, Koichi / Furui, Sadaoki:
"Structural joint factor analysis for speaker recognition",
2373-2376.
Biswas, Sangeeta / Ferràs, Marc / Shinoda, Koichi / Furui, Sadaoki:
"Acoustic forest for SMAP-based speaker verification",
2377-2380.
Sivaram, G. S. V. S. / Thomas, Samuel / Hermansky, Hynek:
"Mixture of auto-associative neural networks for speaker verification",
2381-2384.
Speech Perception - Perceptual Learning and Cross-Language Perception
Scharenborg, Odette / Mitterer, Holger / McQueen, James M.:
"Perceptual learning of liquids",
149-152.
Tuinman, Annelie / Mitterer, Holger / Cutler, Anne:
"The efficiency of cross-dialectal word recognition",
153-156.
Tsuzaki, Minoru / Tokuda, Keiichi / Kawai, Hisashi / Ni, Jinfu:
"Estimation of perceptual spaces for speaker identities based on the cross-lingual discrimination task",
157-160.
Peperkamp, Sharon / Bouchon, Camillia:
"The relation between perception and production in L2 phonological processing",
161-164.
Bissiri, Maria Paola / Garcia Lecumberri, Maria Luisa / Cooke, Martin / Volín, Jan:
"The role of word-initial glottal stops in recognizing English words",
165-168.
Zhang, Caicai / Peng, Gang / Wang, William S.-Y.:
"Effect of language experience on the categorical perception of Cantonese vowel duration",
169-172.
Speech Analysis
Pedersen, C. F. / Andersen, Ove / Dalsgaard, Paul:
"Adaptive estimation of zeros of time-varying z-transforms",
173-176.
Kane, John / Gobl, Christer:
"Identifying regions of non-modal phonation using features of the wavelet transform",
177-180.
Fan, Xing / Godin, Keith W. / Hansen, John H. L.:
"Acoustic analysis of whispered speech for phoneme and speaker dependency",
181-184.
Asaei, Afsaneh / Taghizadeh, Mohammad J. / Bourlard, Hervé / Cevher, Volkan:
"Multi-party speech recovery exploiting structured sparsity models",
185-188.
Mallidi, Sri Harish / Ganapathy, Sriram / Hermansky, Hynek:
"Modulation spectrum analysis for recognition of reverberant speech",
189-192.
Petkov, Petko N. / Kleijn, W. Bastiaan / Vries, Bert de:
"Discrete choice models for non-intrusive quality assessment",
193-196.
Speech Enhancement and Dereverberation
Kinoshita, Keisuke / Souden, Mehrez / Delcroix, Marc / Nakatani, Tomohiro:
"Single channel dereverberation using example-based speech enhancement with uncertainty decoding technique",
197-200.
Erkelens, Jan S. / Heusdens, Richard:
"A statistical room impulse response model with frequency dependent reverberation time for single-microphone late reverberation suppression",
201-204.
Zheng, Chenxi / Falk, Tiago H. / Chan, Wai-Yip:
"An assessment of the improvement potential of time-frequency masking for speech dereverberation",
205-208.
Prego, Thiago de M. / Lima, Amaro A. de / Netto, Sergio L.:
"Perceptual improvement of a two-stage algorithm for speech dereverberation",
209-212.
Hadir, Najib / Faubel, Friedrich / Klakow, Dietrich:
"A model-based spectral envelope wiener filter for perceptually motivated speech enhancement",
213-216.
Marin-Hurtado, Jorge I. / Parikh, Devangi N. / Anderson, David V.:
"Binaural noise-reduction method based on blind source separation and perceptual post processing",
217-220.
ASR - Feature Extraction I, II
Ng, Tim / Zhang, Bing / Matsoukas, Spyros / Nguyen, Long:
"Region dependent transform on MLP features for speech recognition",
221-224.
Heckmann, Martin / Gläser, Claudius:
"Discriminant sub-space projection of spectro-temporal speech features based on maximizing mutual information",
225-228.
Fukuda, Takashi / Ichikawa, Osamu / Nishimura, Masafumi:
"Combining feature space discriminative training with long-term spectro-temporal features for noise-robust speech recognition",
229-232.
Chopra, Sumit / Haffner, Patrick / Dimitriadis, Dimitrios:
"Combining frame and segment level processing via temporal pooling for phonetic classification",
233-236.
Yu, Dong / Seltzer, Michael L.:
"Improved bottleneck features using pretrained deep neural networks",
237-240.
Liao, Yuan-Fu / Lin, Chia-Hsing / Fang, We-Der:
"Minimum classification error based spectro-temporal feature extraction for robust audio classification",
241-244.
Grézl, František / Karafiát, Martin:
"Integrating recent MLP feature extraction techniques into TRAP architecture",
1229-1232.
Wöllmer, Martin / Schuller, Björn / Rigoll, Gerhard:
"Feature frame stacking in RNN-based tandem ASR systems - learned vs. predefined context",
1233-1236.
Plahl, Christian / Schlüter, Ralf / Ney, Hermann:
"Improved acoustic feature combination for LVCSR by neural networks",
1237-1240.
Pinto, Joel / Magimai-Doss, Mathew / Bourlard, Hervé:
"Hierarchical tandem features for ASR in Mandarin",
1241-1244.
Valente, Fabio / Magimai-Doss, Mathew / Wang, Wen:
"Analysis and comparison of recent MLP features for LVCSR systems",
1245-1248.
Lee, Jaehyung / Lee, Soo-Young:
"Deep learning of speech features for improved phonetic recognition",
1249-1252.
Huang, Heyun / Liu, Yang / Gemmeke, Jort F. / Bosch, Louis ten / Cranen, Bert / Boves, Lou:
"Globality-locality consistent discriminant analysis for phone classification",
1253-1256.
Bořil, Hynek / Grézl, František / Hansen, John H. L.:
"Front-end compensation methods for LVCSR under lombard effect",
1257-1260.
Lee, Jung-Won / Choi, Jeung-Yoon / Kang, Hong-Goo:
"Classification of fricatives using feature extrapolation of acoustic-phonetic features in telephone speech",
1261-1264.
Keronen, Sami / Pohjalainen, Jouni / Alku, Paavo / Kurimo, Mikko:
"Noise robust feature extraction based on extended weighted linear prediction in LVCSR",
1265-1268.
Meyer, Bernd T. / Ravuri, Suman V. / Schädler, Marc René / Morgan, Nelson:
"Comparing different flavors of spectro-temporal features for ASR",
1269-1272.
Variani, Ehsan / Schaaf, Thomas:
"VTLN in the MFCC domain: band-limited versus local interpolation",
1273-1276.
Nemala, Sridhar Krishna / Patil, Kailash / Elhilali, Mounya:
"Multistream bandpass modulation features for robust speech recognition",
1277-1280.
Marino, Davide / Hain, Thomas:
"An analysis of automatic speech recognition with multiple microphones",
1281-1284.
Speech Production - Articulatory Measurements
Kim, Yoon-Chul / Proctor, Michael / Narayanan, Shrikanth / Nayak, Krishna S.:
"Visualization of vocal tract shape using interleaved real-time MRI of multiple scan planes",
269-272.
Winkler, Ralf / Fuchs, Susanne / Perrier, Pascal / Tiede, Mark:
"Biomechanical tongue models: an approach to studying inter-speaker variability",
273-276.
Wang, Jun / Green, Jordan R. / Samal, Ashok / Marx, David B.:
"Quantifying articulatory distinctiveness of vowels",
277-280.
Proctor, Michael / Lammert, Adam / Katsamanis, Athanasios / Goldstein, Louis / Hagedorn, Christina / Narayanan, Shrikanth:
"Direct estimation of articulatory kinematics from real-time magnetic resonance image sequences",
281-284.
Birkholz, Peter / Neuschaefer-Rube, Christiane:
"Combined optical distance sensing and electropalatography to measure articulation",
285-288.
Prom-on, Santitham / Xu, Yi / Liu, Fang:
"Simulating post-l F0 bouncing by modeling articulatory dynamics",
289-292.
Acoustic Event Detection
Geiger, Jürgen T. / Lakhal, Mohamed Anouar / Schuller, Björn / Rigoll, Gerhard:
"Learning new acoustic events in an HMM-based system using MAP adaptation",
293-296.
Leng, Yi Ren / Tran, Huy Dat / Kitaoka, Norihide / Li, Haizhou:
"Alternative frequency scale cepstral coefficient for robust sound event recognition",
297-300.
Ito, Akinori / Aiba, Akihito / Ito, Masashi / Makino, Shozo:
"Evaluation of abnormal sound detection using multi-stage GMM in various environments",
301-304.
Schmalenstroeer, Joerg / Bartek, Markus / Haeb-Umbach, Reinhold:
"Unsupervised learning of acoustic events using dynamic time warping and hierarchical k-means++ clustering",
305-308.
Mejía-Navarrete, David / Gallardo-Antolín, Ascensión / Peláez-Moreno, Carmen / Valverde-Albacete, Francisco J.:
"Feature extraction assessment for an acoustic-event classification task using the entropy triangle",
309-312.
Natarajan, Pradeep / Tsakalidis, Stavros / Manohar, Vasant / Prasad, Rohit / Natarajan, Premkumar:
"Unsupervised audio analysis for categorizing heterogeneous consumer domain videos",
313-316.
Speech Synthesis - Unit Selection and Hybrid approaches
Sridhar, Vivek Kumar Rangarajan / Syrdal, Ann / Conkie, Alistair D. / Bangalore, Srinivas:
"Enriching text-to-speech synthesis using automatic dialog act tags",
317-320.
Latacz, Lukas / Mattheyses, Wesley / Verhelst, Werner:
"Joint target and join cost weight training for unit selection synthesis",
321-324.
Windmann, Andreas / Jauk, Igor / Tamburini, Fabio / Wagner, Petra:
"Prominence-based prosody prediction for unit selection speech synthesis",
325-328.
Pammi, Sathish / Schröder, Marc:
"Evaluating the meaning of synthesized listener vocalizations",
329-332.
Sainz, Iñaki / Erro, Daniel / Navas, Eva / Hernáez, Inma:
"A hybrid TTS approach for prosody and acoustic modules",
333-336.
Sorin, Alexander / Shechtman, Slava / Pollet, Vincent:
"Uniform speech parameterization for multi-form segment synthesis",
337-340.
Speech Enhancement Analysis and Evaluation
Miyazaki, Ryoichi / Saruwatari, Hiroshi / Shikano, Kiyohiro:
"Theoretical analysis of musical noise and speech distortion in structure-generalized parametric blind spatial subtraction array",
341-344.
Tang, Yan / Cooke, Martin:
"Subjective and objective evaluation of speech intelligibility enhancement under constant energy and duration constraints",
345-348.
Muraka, Nagarjuna Reddy / Seelamantula, Chandra Sekhar:
"A risk-estimation-based comparison of mean square error and itakura-saito distortion measures for speech enhancement",
349-352.
Triki, Mahdi:
"On noise tracking for noise floor estimation",
353-356.
Milner, Ben:
"Maximum a posteriori estimation of noise from non-acoustic reference signals in very low signal-to-noise ratio environments",
357-360.
Wakisaka, Ryo / Saruwatari, Hiroshi / Shikano, Kiyohiro / Takatani, Tomoya:
"Blind speech prior estimation for generalized minimum mean-square error short-time spectral amplitude estimator",
361-364.
Speaker Recognition - Analysis and Statistics I-III
Laskowski, Kornel / Jin, Qin:
"Harmonic structure transform for speaker recognition",
365-368.
Patil, Hemant A. / Madhavi, Maulik C. / Parhi, Keshab K.:
"Combining evidence from spectral and source-like features for person recognition from humming",
369-372.
Long, Yanhua / Yan, Zhi-Jie / Soong, Frank K. / Dai, Li-Rong / Guo, Wu:
"Improvements in speaker characterization using spectral subband energy based on harmonic plus noise model",
373-376.
Solewicz, Yosef A. / Aronowitz, Hagai:
"Implicit segmentation in two-wire speaker recognition",
377-380.
Yaman, Sibel / Pelecanos, Jason / Omar, Mohamed Kamal:
"Boosting speaker recognition performance with compact representations",
381-384.
Vaquero, Carlos / Ortega, Alfonso / Lleida, Eduardo:
"Partitioning of two-speaker conversation datasets",
385-388.
Bousquet, Pierre-Michel / Matrouf, Driss / Bonastre, Jean-François:
"Intersession compensation and scoring methods in the i-vectors space for speaker recognition",
485-488.
Drgas, Szymon / Dabrowski, Adam:
"Kernel alignment maximization for speaker recognition based on high-level features",
489-492.
Srinivasan, Balaji Vasan / Garcia-Romero, Daniel / Zotkin, Dmitry N. / Duraiswami, Ramani:
"Kernel partial least squares for speaker recognition",
493-496.
Omar, Mohamed Kamal / Pelecanos, Jason:
"Conversational-side-specific inter-session variability compensation",
497-500.
Leeuwen, David A. van / Brümmer, Niko:
"A speaker line-up for the likelihood ratio",
501-504.
Villalba, Jesús / Brümmer, Niko:
"Towards fully Bayesian speaker recognition: integrating out the between-speaker covariance",
505-508.
Pekhovsky, Timur / Lokhanova, Alexandra:
"Variational Bayesian model selection for GMM-speaker verification using universal background model",
2705-2708.
McLaren, Mitchell / Leeuwen, David A. van:
"To weight or not to weight: source-normalised LDA for speaker recognition using i-vectors",
2709-2712.
Huang, Chien-Lin / Ma, Bin:
"Maximum entropy based data selection for speaker recognition",
2713-2716.
Rao, Wei / Mak, Man-Wai:
"Addressing the data-imbalance problem in kernel-based speaker verification via utterance partitioning and speaker comparison",
2717-2720.
Takashima, Ryoichi / Takiguchi, Tetsuya / Ariki, Yasuo:
"Single-channel head orientation estimation based on discrimination of acoustic transfer function",
2721-2724.
Lei, Zhenchun / Yang, Yingchun:
"Maximum likelihood i-vector space using PCA for speaker verification",
2725-2728.
Li, Ming / Zhang, Xiang / Yan, Yonghong / Narayanan, Shrikanth:
"Speaker verification using sparse representations on total variability i-vectors",
2729-2732.
Hasan, Taufiq / Hansen, John H. L.:
"Robust speaker recognition in non-stationary room environments based on empirical mode decomposition",
2733-2736.
Even, Jani / Heracleous, Panikos / Ishi, Carlos T. / Hagita, Norihiro:
"Range based multi microphone array fusion for speaker activity detection in small meetings",
2737-2740.
Ogawa, Tetsuji / Hino, Hideitsu / Murata, Noboru / Kobayashi, Tetsunori:
"Speaker verification robust to talking style variation using multiple kernel learning based on conditional entropy minimization",
2741-2744.
Hautamäki, Ville / Lee, Kong Aik / Kinnunen, Tomi / Ma, Bin / Li, Haizhou:
"Regularized logistic regression fusion for speaker verification",
2745-2748.
Jafari, Ayeh / Srinivasan, Ramji / Crookes, Danny / Ming, Ji:
"A longest matching segment approach with Bayesian adaptation - application to noise-robust speaker recognition",
2749-2752.
Lei, Howard / Mirghafori, Nikki:
"Data selection with kurtosis and nasality features for speaker recognition",
2753-2756.
Hernáez, Inma / Saratxaga, Ibon / Sanchez, Jon / Navas, Eva / Luengo, Iker:
"Use of the harmonic phase in speaker recognition",
2757-2760.
Speech Production - Coarticulation and Speech Timing
Beňuš, Štefan / Pouplier, Marianne:
"Jaw movement in vowels and liquids forming the syllable nucleus",
389-392.
Fivela, Barbara Gili / Stella, Antonio / D'Apolito, Sonia / Sigona, Francesco:
"Coarticulation across prosodic domains in Italian: an ultrasound investigation",
393-396.
Šimko, Juraj / Cummins, Fred / Beňuš, Štefan:
"Investigating the stability of intergestural timing relations",
397-400.
Zmarich, Claudio / Fivela, Barbara Gili / Perrier, Pascal / Savariaux, Christophe / Tisato, Graziano:
"Speech timing organization for the phonological length contrast in Italian consonants",
401-404.
Celata, Chiara / Calamai, Silvia:
"Timing in Italian VNC sequences at different speech rates",
405-408.
Hagedorn, Christina / Proctor, Michael / Goldstein, Louis:
"Automatic analysis of singleton and geminate consonant articulation using real-time magnetic resonance imaging",
409-412.
Speech Segmentation
Wang, Yih-Ru:
"A two-stage sample-based phone boundary detector using segmental similarity features",
413-416.
Huang, Qiang / Cox, Stephen J.:
"Iterative improvement of speaker segmentation in a noisy environment using high-level knowledge",
417-420.
Castán, Diego / Vaquero, Carlos / Ortega, Alfonso / Martínez, David / Villalba, Jesús / Lleida, Eduardo:
"Hierarchical audio segmentation with HMM and factor analysis in broadcast news domain",
421-424.
Kalinli, Ozlem:
"Syllable segmentation of continuous speech using auditory attention cues",
425-428.
Peddinti, Vijayaditya / Prahallad, Kishore:
"Exploiting phone-class specific landmarks for refinement of segment boundaries in TTS databases",
429-432.
Pedone, Agnès / Burred, Juan José / Maller, Simon / Leveau, Pierre:
"Phoneme-level text to audio synchronization on speech signals with background music",
433-436.
ASR - Acoustic Models I-III
Seide, Frank / Li, Gang / Yu, Dong:
"Conversational speech transcription using context-dependent deep neural networks",
437-440.
Wang, Guangsen / Sim, Khe Chai:
"Sequential classification criteria for NNs in automatic speech recognition",
441-444.
Magimai-Doss, Mathew / Rasipuram, Ramya / Aradilla, Guillermo / Bourlard, Hervé:
"Grapheme-based automatic speech recognition using KL-HMM",
445-448.
Keshet, Joseph / Cheng, Chih-Chieh / Stoehr, Mark / McAllester, David:
"Direct error rate minimization of hidden Markov models",
449-452.
Sun, Xie / Chen, Xin / Zhao, Yunxin:
"On the effectiveness of statistical modeling based template matching approach for continuous speech recognition",
453-456.
Wang, Guangsen / Sim, Khe Chai:
"Comparison of smoothing techniques for robust context dependent acoustic modelling in hybrid NN/HMM systems",
457-460.
Hsiao, Roger / Schultz, Tanja:
"Generalized Baum-welch algorithm and its implication to a new extended Baum-welch algorithm",
773-776.
Diehl, F. / Gales, M. J. F. / Liu, X. / Tomalin, M. / Woodland, P. C.:
"Word boundary modelling and full covariance Gaussians for Arabic speech-to-text systems",
777-780.
Ko, Tom / Mak, Brian:
"A fully automated derivation of state-based eigentriphones for triphone modeling with no tied states using regularization",
781-784.
Sainath, Tara N. / Ramabhadran, Bhuvana / Nahamoo, David / Kanevsky, Dimitri:
"Reducing computational complexities of exemplar-based sparse representations with applications to large vocabulary speech recognition",
785-788.
Zhang, Yu / Xu, Jian / Yan, Zhi-Jie / Huo, Qiang:
"An i-vector based approach to training data clustering for improved speech recognition",
789-792.
Buthpitiya, Senaka / Lane, Ian / Chong, Jike:
"Rapid training of acoustic models using graphics processing unit",
793-796.
Alessandrini, Michele / Biagetti, Giorgio / Curzi, Alessandro / Turchetti, Claudio:
"Semi-automatic acoustic model generation from large unsynchronized audio and text chunks",
1681-1684.
Strope, Brian / Beeferman, Doug / Gruenstein, Alexander / Lei, Xin:
"Unsupervised testing strategies for ASR",
1685-1688.
Kurata, Gakuto / Itoh, Nobuyasu / Nishimura, Masafumi:
"Acoustic model training with detecting transcription errors in the training data",
1689-1692.
Jansen, Aren / Church, Kenneth:
"Towards unsupervised training of speaker independent acoustic models",
1693-1696.
Cui, Xiaodong / Chen, Xin / Xue, Jian / Olsen, Peder A. / Hershey, John R. / Zhou, Bowen:
"Acoustic modeling with bootstrap and restructuring based on full covariance",
1697-1700.
Xu, Jian / Zhang, Yu / Yan, Zhi-Jie / Huo, Qiang:
"An i-vector based approach to acoustic sniffing for irrelevant variability normalization based acoustic model training and speech recognition",
1701-1704.
Tahir, Muhammad Ali / Schlüter, Ralf / Ney, Hermann:
"Log-linear optimization of second-order polynomial features with subsequent dimension reduction for speech recognition",
1705-1708.
Zhang, Qingqing / Lamel, Lori / Gauvain, Jean-Luc:
"Genre categorization and modeling for broadcast speech transcription",
1709-1712.
Shin, Sunghwan / Jung, Ho-Young / Juang, Biing-Hwang:
"Individual error minimization learning framework and its applications to speech recognition and utterance verification",
1713-1716.
Darjaa, Sakhia / Cerňak, Miloš / Trnka, Marián / Rusko, Milan / Sabo, Róbert:
"Effective triphone mapping for acoustic modeling in speech recognition",
1717-1720.
Nallasamy, Udhyakumar / Garbus, Michael / Metze, Florian / Jin, Qin / Schaaf, Thomas / Schultz, Tanja:
"Analysis of dialectal influence in pan-Arabic ASR",
1721-1724.
Jalalvand, Azarakhsh / Triefenbach, Fabian / Verstraeten, David / Martens, Jean-Pierre:
"Connected digit recognition by means of reservoir computing",
1725-1728.
Ratnagiri, Madhavi V. / Juang, Biing-Hwang / Rabiner, Lawrence:
"Large margin - minimum classification error using sum of shifted sigmoids as the loss function",
1729-1732.
Olaso, Javier M. / Torres, M. Inés / Justo, Raquel:
"Representing phonological features through a two-level finite state model",
1733-1736.
Vaněk, Jan / Trmal, Jan / Psutka, Josef V. / Psutka, Josef:
"Optimization of the Gaussian mixture model evaluation on GPU",
1737-1740.
Robust Speech Recognition I-III
Astudillo, Ramón Fernandez / Neto, João Paulo da Silva:
"Propagation of uncertainty through multilayer perceptrons for robust automatic speech recognition",
461-464.
Mahkonen, Katariina / Hurmalainen, Antti / Virtanen, Tuomas / Gemmeke, Jort F.:
"Mapping sparse representation to state likelihoods in noise-robust automatic speech recognition",
465-468.
Kallasjoki, Heikki / Remes, Ulpu / Gemmeke, Jort F. / Virtanen, Tuomas / Palomäki, Kalle J.:
"Uncertainty measures for improving exemplar-based source separation",
469-472.
Liao, Hsien-Cheng / Liao, Yuan-Fu / Lee, Chin-Hui:
"Maximum confidence measure based interaural phase difference estimation for noise masking in dual-microphone robust speech recognition",
473-476.
Badiezadegan, Shirin / Rose, Richard:
"A performance monitoring approach to fusing enhanced spectrogram channels in robust speech recognition",
477-480.
Cheng, Ning / Liu, X. / Wang, Lan:
"Generalized variable parameter HMMs for noise robust speech recognition",
481-484.
Mowlaee, P. / Saeidi, R. / Tan, Zheng-Hua / Christensen, M. G. / Kinnunen, Tomi / Fränti, P. / Jensen, S. H.:
"Sinusoidal approach for the single-channel speech separation and recognition challenge",
677-680.
Demir, Cemil / Cemgil, A. Taylan / Saraçlar, Murat:
"Semi-supervised single-channel speech-music separation for automatic speech recognition",
681-684.
Maganti, HariKrishna / Matassoni, Marco:
"A level-dependent auditory filter-bank for speech recognition in reverberant environments",
685-688.
Souden, Mehrez / Kinoshita, Keisuke / Delcroix, Marc / Nakatani, Tomohiro:
"A multichannel feature-based processing for robust speech recognition",
689-692.
Xiao, Xiong / Li, Jinyu / Chng, Eng Siong / Li, Haizhou:
"Feature normalization using structured full transforms for robust speech recognition",
693-696.
Fujimoto, Masakiyo / Watanabe, Shinji / Nakatani, Tomohiro:
"A robust estimation method of noise mixture model for noise suppression",
697-700.
Leutnant, Volker / Krueger, Alexander / Haeb-Umbach, Reinhold:
"A versatile Gaussian splitting approach to non-linear state estimation and its application to noise-robust ASR",
1641-1644.
Pardede, Hilman F. / Shinoda, Koichi:
"Generalized-log spectral mean normalization for speech recognition",
1645-1648.
Kim, Young-Ik / Cho, Hoon-Young / Kim, Sang-Hun:
"Zero-crossing-based channel attentive weighting of cepstral features for robust speech recognition: the ETRI 2011 CHiME challenge system",
1649-1652.
Kim, Wooil / Hansen, John H. L.:
"Feature compensation for speech recognition in severely adverse environments due to background noise and channel distortion",
1653-1656.
Ma, Ning / Barker, Jon / Christensen, Heidi / Green, Phil D.:
"Binaural cues for fragment-based speech recognition in reverberant multisource environments",
1657-1660.
Joshi, Vikas / Bilgi, Raghavendra / Umesh, S. / Garcia, L. / Benitez, C.:
"Sub-band level histogram equalization for robust speech recognition",
1661-1664.
Remes, Ulpu / Nankaku, Yoshihiko / Tokuda, Keiichi:
"GMM-based missing-feature reconstruction on multi-frame windows",
1665-1668.
Sun, Yang / Gemmeke, Jort F. / Cranen, Bert / Bosch, Louis ten / Boves, Lou:
"Improvements of a dual-input DBN for noise robust ASR",
1669-1672.
Gomez, Randy / Kawahara, Tatsuya:
"Denoising using optimized wavelet filtering for automatic speech recognition",
1673-1676.
Müller, Florian / Mertins, Alfred:
"Noise robust speaker-independent speech recognition with invariant-integration features using power-bias subtraction",
1677-1680.
Physiology and Pathology of Spoken Language
Patil, Hemant A. / Baljekar, Pallavi N.:
"Novel VTEO based mel cepstral features for classification of normal and pathological voices",
509-512.
Shimura, Eiji / Kakehi, Kazuhiko:
"Temporal performance of dysarthric patients in speech and tapping tasks",
513-516.
Zhou, Xinhui / Stone, Maureen / Espy-Wilson, Carol Y.:
"A comparative acoustic study on speech of glossectomy patients and normal subjects",
517-520.
Alpan, Ali / Grenez, Francis / Schoentgen, Jean:
"Dysperiodicity analysis of perceptually assessed synthetic speech stimuli",
521-524.
Ghio, Alain / Weisz, Frédérique / Baracca, Giovanna / Cantarella, Giovanna / Robert, Danièle / Woisard, Virginie / Fussi, Franco / Giovanni, Antoine:
"Is the perception of voice quality language-dependent? a comparison of French and Italian listeners and dysphonic speakers",
525-528.
Orozco-Arroyave, J. R. / Murillo-Rendón, S. / Álvarez-Meza, A. M. / Arias-Londoño, J. D. / Delgado-Trejos, E. / Vargas-Bonilla, J. F. / Castellanos-Domínguez, C. G.:
"Automatic selection of acoustic and non-linear dynamic features in voice signals for hypernasality detection",
529-532.
ASR - Lexical, Prosodic and Multi-Lingual Models
Reddy, Sravana / Gouvêa, Evandro:
"Learning from mistakes: expanding pronunciation lexicons using word recognition errors",
533-536.
Imseng, David / Bourlard, Hervé / Dines, John / Garner, Philip N. / Magimai-Doss, Mathew:
"Improving non-native ASR through stochastic multilingual phoneme space transformations",
537-540.
Novotney, Scott / Schwartz, Rich / Khudanpur, Sanjeev:
"Unsupervised Arabic dialect adaptation with self-training",
541-544.
Seppi, Dino / Demuynck, Kris / Compernolle, Dirk Van:
"Template-based automatic speech recognition meets prosody",
545-548.
Badr, Ibrahim / McGraw, Ian / Glass, James:
"Pronunciation learning from continuous speech",
549-552.
Qian, Yanmin / Povey, Daniel / Liu, Jia:
"State-level data borrowing for low-resource speech recognition based on subspace GMMs",
553-556.
Source Separation
Benabderrahmane, Y. / Selouani, Sid-Ahmed / O'Shaughnessy, Douglas:
"Blind speech separation in multiple environments using a frequency oriented PCA method for convolutive mixtures",
557-560.
Koldovský, Zbyněk / Málek, Jiří / Tichavský, Petr:
"Blind speech separation in time-domain using block-toeplitz structure of reconstructed signal matrices",
561-564.
Sarmiento, Auxiliadora / Durán, Iván / Cruces, Sergio / Aguilera, Pablo:
"Generalized method for solving the permutation problem in frequency-domain blind source separation of convolved speech signals",
565-568.
Grais, Emad M. / Erdogan, Hakan:
"Adaptation of speaker-specific bases in non-negative matrix factorization for single channel speech-music separation",
569-572.
Zhang, Shuhua / Girin, Laurent:
"An informed source separation system for speech signals",
573-576.
Tran, Ngoc Thuy / Cowley, William / Pollok, André:
"Adaptive blocking beamformer for speech separation",
577-580.
Multimodal Signal Processing
Kristensson, Per Ola / Vertanen, Keith:
"Asynchronous multimodal text entry using speech and gesture keyboards",
581-584.
McLaughlin, Niall / Ming, Ji / Crookes, Danny:
"Robust bimodal person identification using face and speech with limited training data and corruption of both modalities",
585-588.
Youssef, Atef Ben / Hueber, Thomas / Badin, Pierre / Bailly, Gérard:
"Toward a multi-speaker visual articulatory feedback system",
589-592.
Hueber, Thomas / Benaroya, Elie-Laurent / Denby, Bruce / Chollet, Gérard:
"Statistical mapping between articulatory and acoustic data for an ultrasound-based silent speech interface",
593-596.
Schmalenstroeer, Joerg / Jacob, Florian / Haeb-Umbach, Reinhold / Hennecke, Marius H. / Fink, Gernot A.:
"Unsupervised geometry calibration of acoustic sensor networks using source correspondences",
597-600.
Wand, Michael / Janke, Matthias / Schultz, Tanja:
"Investigations on speaking mode discrepancies in EMG-based speech recognition",
601-604.
ASR - Language Models I, II
Mikolov, Tomáš / Deoras, Anoop / Kombrink, Stefan / Burget, Lukáš / Černocký, Jan:
"Empirical evaluation and combination of advanced language modeling techniques",
605-608.
Zweig, Geoffrey / Chang, Shuangyu:
"Personalizing model M for voice-search",
609-612.
Shinozaki, Takahiro / Kubota, Yu / Furui, Sadaoki / Utsunomiya, Eiji / Shindoh, Yasutaka:
"Sentence selection by direct likelihood maximization for language model adaptation",
613-616.
Arısoy, Ebru / Ramabhadran, Bhuvana / Kuo, Hong-Kwang Jeff:
"Feature combination approaches for discriminative language models",
617-620.
Ananthakrishnan, Sankaranarayanan / Tsakalidis, Stavros / Prasad, Rohit / Natarajan, Premkumar:
"On-line language model biasing for multi-pass automatic speech recognition",
621-624.
Kang, Moonyoung / Ng, Tim / Nguyen, Long:
"Mandarin word-character hybrid-input neural network language model",
625-628.
Sorensen, Jeffrey / Allauzen, Cyril:
"Unary data structures for language models",
1425-1428.
Allauzen, Cyril / Riley, Michael:
"Bayesian language model interpolation for mobile speech input",
1429-1432.
Sundermeyer, Martin / Schlüter, Ralf / Ney, Hermann:
"On the estimation of discount parameters for language model smoothing",
1433-1436.
Lehnen, Patrick / Hahn, Stefan / Ney, Hermann:
"N-grams for conditional random fields or a failure-transition(ϕ) posterior for acyclic FSTs",
1437-1440.
Shaik, M. Ali Basha / Mousa, Amr El-Desoky / Schlüter, Ralf / Ney, Hermann:
"Hybrid language models using mixed types of sub-lexical units for open vocabulary German LVCSR",
1441-1444.
Mousa, Amr El-Desoky / Shaik, M. Ali Basha / Schlüter, Ralf / Ney, Hermann:
"Morpheme based factored language models for German LVCSR",
1445-1448.
Nußbaum-Thom, Markus / Mousa, Amr El-Desoky / Schlüter, Ralf / Ney, Hermann:
"Compound word recombination for German LVCSR",
1449-1452.
Kobayashi, Akio / Oku, Takahiro / Homma, Shinichi / Imai, Toru / Nakagawa, Seiichi:
"Lattice-based risk minimization training for unsupervised language model adaptation",
1453-1456.
Gillot, Christian / Cerisara, Christophe:
"Similarity language model",
1457-1460.
Dikici, Erinç / Semerci, Murat / Saraçlar, Murat / Alpaydın, Ethem:
"Data sampling and dimensionality reduction approaches for reranking ASR outputs using discriminative language models",
1461-1464.
Masumura, Ryo / Hahm, Seongjun / Ito, Akinori:
"Training a language model using webdata for large vocabulary Japanese spontaneous speech recognition",
1465-1468.
Le, Hai-Son / Oparin, Ilya / Messaoudi, Abdel / Allauzen, Alexandre / Gauvain, Jean-Luc / Yvon, François:
"Large vocabulary SOUL neural network language models",
1469-1472.
Mamou, Jonathan / Sethy, Abhinav / Ramabhadran, Bhuvana / Hoory, Ron / Vozila, Paul:
"Improved spoken query transcription using co-occurrence information",
1473-1476.
Tam, Yik-Cheung / Vozila, Paul:
"Unsupervised latent speaker language modeling",
1477-1480.
Phonology and Phonetics
Sadeghi, Vahid:
"Laryngealization and breathiness in persian",
629-632.
Müller, Viola / Harrington, Jonathan / Kleber, Felicitas / Reubold, Ulrich:
"Age-dependent differences in the neutralization of the intervocalic voicing contrast: evidence from an apparent-time study on east franconian",
633-636.
Samlowski, Barbara / Möbius, Bernd / Wagner, Petra:
"Comparing syllable frequencies in corpora of written and spoken language",
637-640.
Iacoponi, Luca / Savy, Renata:
"Sylli: automatic phonological syllabification for Italian",
641-644.
Xavier, André N. / Barbosa, Plínio A.:
"A preliminary study on the production of signs in brazilian sign language when one of the manual articulators is unavailable",
645-648.
Pan, Ho-hsien / Chen, Mao-Hsu / Lyu, Shao-Ren:
"Electroglottograph and acoustic cues for phonation contrasts in taiwan min falling tones",
649-652.
Voice Conversion
Saito, Daisuke / Yamamoto, Keisuke / Minematsu, Nobuaki / Hirose, Keikichi:
"One-to-many voice conversion based on tensor representation of speaker space",
653-656.
Qiao, Yu / Tong, Tong / Minematsu, Nobuaki:
"A study on bag of Gaussian model with application to voice conversion",
657-660.
Li, Lei / Nankaku, Yoshihiko / Tokuda, Keiichi:
"A Bayesian approach to voice conversion based on GMMs using multiple model structures",
661-664.
Eslami, Mahdi / Sheikhzadeh, Hamid / Sayadiyan, Abolghasem:
"Quality improvement of voice conversion systems based on trellis structured vector quantization",
665-668.
Benisty, Hadas / Malah, David:
"Voice conversion using GMM with enhanced global variance",
669-672.
Godoy, Elizabeth / Rosec, Olivier / Chonavel, Thierry:
"Spectral envelope transformation using DFW and amplitude scaling for voice conversion with parallel or nonparallel corpora",
673-676.
Spoken Language Understanding
Li, Xiao / Wang, Ye-Yi / Tur, Gokhan:
"Multi-task learning for spoken language understanding with shared slots",
701-704.
Hillard, Dustin / Celikyilmaz, Asli / Hakkani-Tür, Dilek / Tur, Gokhan:
"Learning weighted entity lists from web click logs for spoken language understanding",
705-708.
Hakkani-Tür, Dilek / Tur, Gokhan / Heck, Larry / Shriberg, Elizabeth:
"Bootstrapping domain detection using query click logs for new domains",
709-712.
Celikyilmaz, Asli / Hakkani-Tür, Dilek / Tur, Gokhan:
"Approximate inference for domain detection in spoken language understanding",
713-716.
Huang, Chien-Lin / Ma, Bin / Li, Haizhou / Wu, Chung-Hsien:
"Speech indexing using semantic context inference",
717-720.
Ju, Yun-Cheng / Droppo, Jasha:
"Automatically optimizing utterance classification performance without human in the loop",
721-724.
Dialect and Accent Identification
Boula de Mareüil, Philippe / Rouas, Jean-Luc / Yapomo, Manuela:
"In search of cues discriminating West-african accents in French",
725-728.
Hanani, Abualsoud / Russell, Martin / Carey, Michael J.:
"Computer and human recognition of regional accents of british English",
729-732.
Tong, Rong / Ma, Bin / Li, Haizhou / Chng, Eng Siong:
"Target-aware lattice rescoring for dialect recognition",
733-736.
Akbacak, Murat / Vergyri, Dimitra / Stolcke, Andreas / Scheffer, Nicolas / Mandal, Arindam:
"Effective Arabic dialect classification using diverse phonotactic models",
737-740.
Chen, Nancy F. / Shen, Wade / Campbell, Joseph P.:
"Characterizing deletion transformations across dialects using a sophisticated tying mechanism",
741-744.
Biadsy, Fadi / Hirschberg, Julia / Ellis, Daniel P. W.:
"Dialect and accent recognition using phonetic-segmentation supervectors",
745-748.
First Language Acquisition
Miyazawa, Kouki / Miura, Hideaki / Kikuchi, Hideaki / Mazuka, Reiko:
"The multi timescale phoneme acquisition model of the self-organizing based on the dynamic features",
749-752.
Brown, Helen / Gaskell, M. Gareth:
"The time-course of talker-specificity effects for newly-learned pseudowords: evidence for a hybrid model of lexical representation",
753-756.
Lintfert, Britta / Schweitzer, Antje / Möbius, Bernd:
"A parametric approach to intonation acquisition research: validation on child-directed speech data",
757-760.
Versteegh, Maarten / Bosch, Louis ten / Boves, Lou:
"Modelling novelty preference in word learning",
761-764.
Ananthakrishnan, G. / Salvi, Giampiero:
"Using imitation to learn infant-adult acoustic mappings",
765-768.
Bergmann, Christina / Bosch, Louis ten / Boves, Lou:
"Thresholding word activations for response scoring - modelling psycholinguistic data",
769-772.
Spoken Dialogue Systems I, II
Misu, Teruhisa / Ohtake, Kiyonori / Hori, Chiori / Kawai, Hisashi / Nakamura, Satoshi:
"User study of spoken decision support system",
797-800.
Raux, Antoine / Ma, Yi:
"Efficient probabilistic tracking of user goal and dialog history for spoken dialog systems",
801-804.
Schmitt, Alexander / Zgorzelski, Alexander / Minker, Wolfgang:
"Tackling a shilly-shally classifier for predicting task success in spoken dialogue interaction",
805-808.
Meguro, Toyomi / Minami, Yasuhiro / Higashinaka, Ryuichiro / Dohsaka, Kohji:
"Evaluation of listening-oriented dialogue control rules based on the analysis of HMMs",
809-812.
Suendermann, D. / Liscombe, J. / Bloom, J. / Li, G. / Pieraccini, Roberto:
"Large-scale experiments on data-driven design of commercial spoken dialog systems",
813-816.
Kronlid, Fredrik / Villing, Jessica / Berman, Alexander / Larsson, Staffan:
"Comparing system-driven and free dialogue in in-vehicle interaction",
817-820.
Cuayáhuitl, Heriberto / Dethlefs, Nina:
"Optimizing situated dialogue management in unknown environments",
1009-1012.
Deshmukh, Om D. / Ikbal, Shajith / Verma, Ashish / Marcheret, Etienne:
"Acoustic-similarity based technique to improve concept recognition",
1013-1016.
Peters, Doug / Stubley, Peter:
"Dialog methods for improved alphanumeric string capture",
1017-1020.
DeVault, David / Sagae, Kenji / Traum, David:
"Detecting the status of a predictive incremental speech understanding model for real-time decision-making in a spoken dialogue system",
1021-1024.
Chandramohan, Senthilkumar / Geist, Matthieu / Lefèvre, Fabrice / Pietquin, Olivier:
"User simulation in dialogue systems using inverse reinforcement learning",
1025-1028.
Crook, Paul A. / Lemon, Oliver:
"Lossless value directed compression of complex user goal states for statistical spoken dialogue systems",
1029-1032.
Spoken Language Resources, Evaluation and Standardization I, II
Carlin, Michael A. / Thomas, Samuel / Jansen, Aren / Hermansky, Hynek:
"Rapid evaluation of speech representations for spoken term discovery",
821-824.
Hixon, Ben / Schneider, Eric / Epstein, Susan L.:
"Phonemic similarity metrics to compare pronunciation methods",
825-828.
Skowronek, Janto / Raake, Alexander:
"Investigating the effect of number of interlocutors on the quality of experience for multi-party audio conferencing",
829-832.
Kolář, Jáchym / Lamel, Lori:
"On development of consistently punctuated speech corpora",
833-836.
Narayanan, Shrikanth / Bresch, Erik / Ghosh, Prasanta Kumar / Goldstein, Louis / Katsamanis, Athanasios / Kim, Yoon / Lammert, Adam / Proctor, Michael / Ramanarayanan, Vikram / Zhu, Yinghua:
"A multimodal real-time MRI articulatory corpus for speech research",
837-840.
Burnham, Denis / Estival, Dominique / Fazio, Steven / Viethen, Jette / Cox, Felicity / Dale, Robert / Cassidy, Steve / Epps, Julien / Togneri, Roberto / Wagner, Michael / Kinoshita, Yuko / Göcke, Roland / Arciuli, Joanne / Onslow, Marc / Lewis, Trent / Butcher, Andrew / Hajek, John:
"Building an audio-visual corpus of Australian English: large corpus collection with an economical portable and replicable black box",
841-844.
Spoken Language Resources, Evaluation and Standardization I
Minematsu, Nobuaki / Okabe, Koji / Ogaki, Keisuke / Hirose, Keikichi:
"Measurement of objective intelligibility of Japanese accented English using ERJ (English read by Japanese) database",
1481-1484.
Möller, Sebastian / Bang, Chihuy / Tamme, Teele / Vaalgamaa, Markus / Weiss, Benjamin:
"From single-call to multi-call quality: a study on long-term quality integration in audio-visual speech communication",
1485-1488.
Lin, Hui / Bilmes, Jeff:
"Optimal selection of limited vocabulary speech corpora",
1489-1492.
Zahorian, Stephen A. / Wu, Jiang / Karnjanadecha, Montri / SekharVootkuri, Chandra / Wong, Brian / Hwang, Andrew / Tokhtamyshev, Eldar:
"Open source multi-language audio database for spoken language processing applications",
1493-1496.
Black, Matthew P. / Bone, Daniel / Williams, Marian E. / Gorrindo, Phillip / Levitt, Pat / Narayanan, Shrikanth:
"The USC CARE corpus: child-psychologist interactions of children with autism spectrum disorders",
1497-1500.
Barbot, Nelly / Barreaud, Vincent / Boëffard, Olivier / Charonnat, Laure / Delhay, Arnaud / Maguer, Sébastien Le / Lolive, Damien:
"Towards a versatile multi-layered description of speech corpora using algebraic relations",
1501-1504.
Richmond, Korin / Hoole, Phil / King, Simon:
"Announcing the electromagnetic articulography (day 1) subset of the mngu0 articulatory corpus",
1505-1508.
Pirker, Gregor / Wohlmayr, Michael / Petrik, Stefan / Pernkopf, Franz:
"A pitch tracking corpus with evaluation on multipitch tracking scenario",
1509-1512.
Butko, Taras / Nadeu, Climent:
"On building and evaluating a broadcast-news audio segmentation system",
1513-1516.
Dobrišek, Simon / Mihelič, France:
"Time- and acoustic-mediated alignment algorithms for speech recognition evaluation",
1517-1520.
Niemann, Julia / Schulz, Kati / Wechsung, Ina:
"Effects of shortening speech prompts of in-car voice user interfaces on users mental models",
1521-1524.
Werff, Laurens van der / Kraaij, Wessel / Jong, Franciska de:
"Speech transcript evaluation for information retrieval",
1525-1528.
Rodriguez-Fuentes, Luis Javier / Penagarikano, Mikel / Varona, Amparo / Diez, Mireia / Bordel, Germán:
"The Albayzin 2010 language recognition evaluation",
1529-1532.
Moore, Roger K.:
"Progress and prospects for speech technology: results from three sexennial surveys",
1533-1536.
Novak, Josef R. / Minematsu, Nobuaki / Hirose, Keikichi:
"Painless WFST cascade construction for LVCSR - transducersaurus",
1537-1540.
Language Identification
Zheng, Rong / Zhang, Ce / Xu, Bo:
"Data-driven UBM generation via tied Gaussians for GMM-supervector based accent identification",
845-848.
Martínez, David / Villalba, Jesús / Miguel, Antonio / Ortega, Alfonso / Lleida, Eduardo:
"I3a language recognition system for albayzin 2010 LRE",
849-852.
Penagarikano, Mikel / Varona, Amparo / Rodriguez-Fuentes, Luis Javier / Bordel, Germán:
"Dimensionality reduction for using high-order n-grams in SVM-based phonotactic language recognition",
853-856.
Dehak, Najim / Torres-Carrasquillo, Pedro A. / Reynolds, Douglas / Dehak, Reda:
"Language recognition via i-vectors and dimensionality reduction",
857-860.
Martínez, David / Plchot, Oldřich / Burget, Lukáš / Glembek, Ondřej / Matějka, Pavel:
"Language recognition in ivectors space",
861-864.
Second Language Acquisition, Development and Learning I, II
Qian, Xiaojun / Meng, Helen / Soong, Frank K.:
"On mispronunciation lexicon generation using joint-sequence multigrams in computer-aided pronunciation training (CAPT)",
865-868.
Sisinni, Bianca / Grimaldi, Mirko:
"Validating a second language perception model for classroom context - a longitudinal study within the perceptual assimilation model",
869-872.
Sadakata, Makiko / McQueen, James M.:
"The role of variability in non-native perceptual learning of a Japanese geminate-singleton fricative contrast",
873-876.
Bernstein, Jared / Cheng, Jian / Suzuki, Masanori:
"Fluency changes with general progress in L2 proficiency",
877-880.
Ouni, Slim:
"Tongue gestures awareness and pronunciation training",
881-884.
Dommelen, Wim A. van / Hazan, Valerie:
"Impact of speaker variability on speech perception in non-native listeners",
885-888.
Ordin, Mikhail / Polyanskaya, Leona / Ulbrich, Christiane:
"Acquisition of timing patterns in second language",
1129-1132.
Li, Hongyan / Huang, Shen / Wang, Shijin / Xu, Bo:
"Context-dependent duration modeling with backoff strategy and look-up tables for pronunciation assessment and mispronunciation detection",
1133-1136.
Sonu, Mee / Tajima, Keiichi / Kato, Hiroaki / Sagisaka, Yoshinori:
"Perceptual training of vowel length contrast of Japanese by L2 listeners: effects of an isolated word versus a word embedded in sentences",
1137-1140.
Wu, E-Chin:
"Similar vowels in L1/L2 production: confused or discerned in early L2 English learners with different amount of exposure",
1141-1144.
Meister, Lya / Meister, Einar:
"Production and perception of estonian vowels by native and non-native speakers",
1145-1148.
Kibishi, Hiroshi / Nakagawa, Seiichi:
"New feature parameters for pronunciation evaluation in English presentations at international conferences",
1149-1152.
Bailly, Gérard / Barbour, Will:
"Synchronous reading: learning French orthography by audiovisual training",
1153-1156.
Koniaris, Christos / Engwall, Olov:
"Phoneme level non-native pronunciation analysis by an auditory model-based native assessment scheme",
1157-1160.
Šturm, Pavel / Skarnitzl, Radek:
"The open front vowel /æ/ in the production and perception of Czech students of English",
1161-1164.
Cucchiarini, Catia / Heuvel, Henk van den / Sanders, Eric / Strik, Helmer:
"Error selection for ASR-based English pronunciation training in `my pronunciation coach'",
1165-1168.
Nariai, Tomoko / Tanaka, Kazuyo:
"An experimental analysis of pitch patterns in Japanese speakers of English with verification by speech re-synthesis",
1169-1172.
Nariai, Tomoko / Tanaka, Kazuyo / Ito, Yoshiaki:
"An analysis of word duration in native speakers and Japanese speakers of English",
1173-1176.
ASR - Search, Keyword Spotting and Confidence Measures I, II
Kurniawati, Evelyn / Ng, Samsudin / Muralidhar, Karthik / George, Sapna:
"A template based voice trigger system using bhattacharyya edit distance",
889-892.
Nolden, D. / Schlüter, Ralf / Ney, Hermann:
"Acoustic look-ahead for more efficient decoding in LVCSR",
893-896.
Duckhorn, Frank / Wolff, Matthias / Hoffmann, Rüdiger:
"A new epsilon filter for efficient composition of weighted finite-state transducers",
897-900.
Siniscalchi, Sabato Marco / Svendsen, Torbjørn / Lee, Chin-Hui:
"A bottom-up stepwise knowledge-integration approach to large vocabulary continuous speech recognition using weighted finite state machines",
901-904.
Seigel, M. S. / Woodland, P. C.:
"Combining information sources for confidence estimation with CRF models",
905-908.
Katsurada, Kouichi / Sawada, Shinta / Teshima, Shigeki / Iribe, Yurie / Nitta, Tsuneo:
"Evaluation of fast spoken term detection using a suffix array",
909-912.
Kintzley, Keith / Jansen, Aren / Hermansky, Hynek:
"Event selection from phone posteriorgrams using matched filters",
1905-1908.
Zhang, Yaodong / Glass, James:
"A piecewise aggregate approximation lower-bound estimate for posteriorgram-based dynamic time warping",
1909-1912.
Qin, Long / Sun, Ming / Rudnicky, Alexander:
"OOV detection and recovery using hybrid models with different fragments",
1913-1916.
Li, Haiyang / Han, Jiqing / Zheng, Tieran:
"AUC optimization based confidence measure for keyword spotting",
1917-1920.
Ma, Zejun / Wang, Xiaorui / Xu, Bo:
"An empirical study of multilingual spoken term detection",
1921-1924.
Ma, Zejun / Wang, Xiaorui / Xu, Bo:
"Fusing multiple confidence measures for Chinese spoken term detection",
1925-1928.
Yang, Zhanlei / Chao, Hao / Liu, Wenju:
"Response probability based decoding algorithm for large vocabulary continuous speech recognition",
1929-1932.
Shan, Yuxiang / Deng, Yan / Liu, Jia:
"Combining lattice-based language dependent and independent approaches for out-of-language detection in LVCSR",
1933-1936.
Ito, Naoaki / Nankaku, Yoshihiko / Lee, Akinobu / Tokuda, Keiichi:
"Evaluation of tree-trellis based decoding in over-million LVCSR",
1937-1940.
Huang, Hao / Li, Bing Hu:
"Lattice based discriminative model combination using automatically induced phonetic contexts",
1941-1944.
Mishra, Taniya / Ljolje, Andrej / Gilbert, Mazin:
"Predicting human perceived accuracy of ASR systems",
1945-1948.
Vasilescu, I. / Yahia, D. / Snoeren, N. / Adda-Decker, Martine / Lamel, Lori:
"Cross-lingual study of ASR errors: on the role of the context in human perception of near-homophones",
1949-1952.
Saito, Tatsuhiko / Nose, Takashi / Kobayashi, Takao / Okato, Yohei / Horii, Akio:
"Performance prediction of speech recognition using average-voice-based speech synthesis",
1953-1956.
Haznedaroglu, Ali / Arslan, Levent M.:
"Confidence measures for turkish call center conversations",
1957-1960.
Asami, Taichi / Nomoto, Narichika / Kobashikawa, Satoshi / Yamaguchi, Yoshikazu / Masataki, Hirokazu / Takahashi, Satoshi:
"Spoken document confidence estimation using contextual coherence",
1961-1964.
SLP for Information Extraction and Retrieval I, II
Hazen, Timothy J.:
"Latent topic modeling for audio corpus summarization",
913-916.
Dufour, Richard / Estève, Yannick / Deléglise, Paul:
"Investigation of spontaneous speech characterization applied to speaker role recognition",
917-920.
Muscariello, Armando / Gravier, Guillaume / Bimbot, Frédéric:
"Zero-resource audio-only spoken term detection based on a combination of template matching techniques",
921-924.
Kim, Yeon-Jun / Gibbon, David C.:
"Automatic learning in content indexing service using phonetic alignment",
925-928.
Chen, Pei-Ning / Chen, Kuan-Yu / Chen, Berlin:
"Leveraging relevance cues for improved spoken document retrieval",
929-932.
Chen, Yun-Nung / Huang, Yu / Yeh, Ching-Feng / Lee, Lin-shan:
"Spoken lecture summarization by random walk over a graph constructed with automatically extracted key terms",
933-936.
Claveau, Vincent / Lefèvre, Sébastien:
"Topic segmentation of TV-streams by mathematical morphology and vectorization",
1105-1108.
Lu, Mimi / Leung, Cheung-Chi / Xie, Lei / Ma, Bin / Li, Haizhou:
"Probabilistic latent semantic analysis for broadcast news story segmentation",
1109-1112.
Gouvêa, Evandro:
"Hybrid speech recognition for voice search: a comparative study",
1113-1116.
Peng, Bo / Qian, Yao / Soong, Frank K. / Zhang, Bo:
"A new phonetic candidate generator for improving search query efficiency",
1117-1120.
Suzuki, Yukiko / Aikawa, Kiyoaki:
"Towards voice-input symbolic pattern retrieval using parameter-based search",
1121-1124.
Gupta, Vikram / Ajmera, Jitendra / Kumar, Arun / Verma, Ashish:
"A language independent approach to audio search",
1125-1128.
Speaker Diarization I, II
Aronowitz, Hagai:
"Speaker diarization using a priori acoustic information",
937-940.
Boakye, Kofi / Vinyals, Oriol / Friedland, Gerald:
"Improved overlapped speech handling for speaker diarization",
941-944.
Shum, Stephen / Dehak, Najim / Chuangsuwanich, Ekapol / Reynolds, Douglas / Glass, James:
"Exploiting intra-conversation variability for speaker diarization",
945-948.
Nishida, Masafumi / Yamamoto, Seiichi:
"Speaker clustering based on non-negative matrix factorization",
949-952.
Yella, Sree Harsha / Valente, Fabio:
"Information bottleneck features for HMM/GMM speaker diarization of meetings recordings",
953-956.
Wang, D. / Vogt, Robbie / Sridharan, Sridha / Dean, David:
"Cross likelihood ratio based speaker clustering using eigenvoice models",
957-960.
Žibert, Janez / Mihelič, France:
"Prosodic and phonetic features for speaker clustering in speaker diarization systems",
1033-1036.
Huijbregts, Marijn / Leeuwen, David A. van:
"Diarization-based speaker retrieval for broadcast television archives",
1037-1040.
Zelenák, Martin / Hernando, Javier:
"The detection of overlapping speech with prosodic features for speaker diarization",
1041-1044.
Parthasarathi, Sree Hari Krishnan / Bourlard, Hervé / Gatica-Perez, Daniel:
"LP residual features for robust, privacy-sensitive speaker diarization",
1045-1048.
Ghaemmaghami, Houman / Dean, David / Vogt, Robbie / Sridharan, Sridha:
"Extending the task of diarization to speaker attribution",
1049-1052.
Tran, Viet-Anh / Le, Viet Bac / Barras, Claude / Lamel, Lori:
"Comparing multi-stage approaches for cross-show speaker diarization",
1053-1056.
Prosody I, II
Turco, Giuseppina / Gubian, Michele / Schertz, Jessamyn:
"A quantitative investigation of the prosody of verum focus in Italian",
961-964.
Dorn, Amelie / Chasaide, Ailbhe Ní:
"Effects of focus on f_0 and duration in irish (gaelic) declaratives",
965-968.
Cole, Jennifer / Shattuck-Hufnagel, Stefanie:
"The phonology and phonetics of perceived prosody: what do listeners imitate?",
969-972.
Michelas, Amandine / Nguyen, Noël:
"Uncovering the effect of imitation on tonal patterns of French accentual phrases",
973-976.
Prieto, Pilar / Pugliesi, Cecilia / Borràs-Comes, Joan / Arroyo, Ernesto / Blat, Josep:
"Crossmodal prosodic and gestural contribution to the perception of contrastive focus",
977-980.
Cvejic, Erin / Kim, Jeesun / Davis, Chris:
"Temporal relationship between auditory and visual prosodic cues",
981-984.
Szaszák, György / Nagy, Katalin / Beke, András:
"Analysing the correspondence between automatic prosodic segmentation and syntactic structure",
1057-1060.
Tepperman, Joseph / Nava, Emily:
"Long-distance rhythmic dependencies and their application to automatic language identification",
1061-1064.
Rosenberg, Andrew:
"Symbolic and direct sequential modeling of prosody for classification of speaking-style and nativeness",
1065-1068.
Gu, Wentao / Zhang, Ting / Fujisaki, Hiroya:
"Prosodic analysis and perception of Mandarin utterances conveying attitudes",
1069-1072.
Cheng, Chierh / Gubian, Michele:
"Predicting taiwan Mandarin tone shapes from their duration",
1073-1076.
Wollermann, Charlotte / Schade, Ulrich / Schröder, Bernhard:
"Variation of accent type and of context - influences on pragmatic focus interpretation",
1077-1080.
ASR - New Paradigms
Sun, Xie / Zhao, Yunxin:
"New methods for template selection and compression in continuous speech recognition",
985-988.
Zhang, Shi-Xiong / Gales, M. J. F.:
"Structured support vector machines for noise robust continuous speech recognition",
989-992.
Suzuki, Masayuki / Kurata, Gakuto / Nishimura, Masafumi / Minematsu, Nobuaki:
"Continuous digits recognition leveraging invariant structure",
993-996.
Kanevsky, Dimitri / Nahamoo, David / Sainath, Tara N. / Ramabhadran, Bhuvana:
"Convergence of line search a-function methods",
997-1000.
Fujii, Yasuhisa / Yamamoto, Kazumasa / Nakagawa, Seiichi:
"Hidden boosted MMI and hierarchical state posterior feature for automatic speech recognition based on hidden conditional neural fields",
1001-1004.
Cai, Jun / Denby, Bruce / Roussel, Pierre / Dreyfus, Gérard / Crevier-Buchman, Lise:
"Recognition and real time performances of a lightweight ultrasound based silent speech interface employing a language model",
1005-1008.
Adaptation for ASR
Watanabe, Shinji / Nakamura, Atsushi / Juang, Biing-Hwang:
"Model adaptation for automatic speech recognition based on multiple time scale evolution",
1081-1084.
Breslin, C. / Chin, K. K. / Gales, M. J. F. / Knill, Kate:
"Integrated online speaker clustering and adaptation",
1085-1088.
Tüske, Zoltán / Plahl, Christian / Schlüter, Ralf:
"A study on speaker normalized MLP features in LVCSR",
1089-1092.
Jeong, Yongwon / Kim, Young Kuk:
"Matrix-variate distribution of training models for robust speaker adaptation",
1093-1096.
Seltzer, Michael L. / Acero, Alex:
"Separating speaker and environmental variability using factored transforms",
1097-1100.
Gilbert, Mazin / Arizmendi, Iker / Bocchieri, Enrico / Caseiro, Diamantino / Goffin, Vincent / Ljolje, Andrej / Phillips, Mike / Wang, Chao / Wilpon, Jay:
"Your mobile virtual assistant just got smarter!",
1101-1104.
Speech Enhancement
Laaksonen, Laura / Myllylä, Ville / Niemistö, Riitta:
"Evaluating artificial bandwidth extension by conversational tests in car using mobile devices with integrated hands-free functionality",
1177-1180.
Pulakka, Hannu / Remes, Ulpu / Yrttiaho, Santeri / Palomäki, Kalle J. / Kurimo, Mikko / Alku, Paavo:
"Low-frequency bandwidth extension of telephone speech using sinusoidal synthesis and Gaussian mixture model",
1181-1184.
Nour-Eldin, Amr H. / Kabal, Peter:
"Memory-based approximation of the Gaussian mixture model framework for bandwidth extension of narrowband speech",
1185-1188.
Harding, Philip / Milner, Ben:
"Speech enhancement by reconstruction from cleaned acoustic features",
1189-1192.
Choi, Jae-Hun / Kim, Sang-Kyun / Chang, Joon-Hyuk:
"A soft decision-based speech enhancement using acoustic noise classification",
1193-1196.
Li, Chao / Liu, Wenju:
"A noise estimation method based on speech presence probability and spectral sparseness",
1197-1200.
Li, Chao / Liu, Wenju:
"Improved a posteriori speech presence probability estimation based on cepstro-temporal smoothing and time-frequency correlation",
1201-1204.
Chowdhury, Md Foezur Rahman / Selouani, Sid-Ahmed / O'Shaughnessy, Douglas:
"A rapid adaptation algorithm for tracking highly non-stationary noises based on Bayesian inference for on-line spectral change point detection",
1205-1208.
Paliwal, Kuldip / Schwerin, Belinda / Wójcicki, Kamil:
"Single channel speech enhancement using MMSE estimation of short-time modulation magnitude spectrum",
1209-1212.
Saha, Atanu / Shimamura, Tetsuya:
"Speech enhancement using masking properties in adverse environments",
1213-1216.
Raj, Bhiksha / Singh, Rita / Virtanen, Tuomas:
"Phoneme-dependent NMF for speech enhancement in monaural mixtures",
1217-1220.
Leitner, Christina / Pernkopf, Franz / Kubin, Gernot:
"Kernel PCA for speech enhancement",
1221-1224.
Gomez, Angel M. / Schwerin, Belinda / Paliwal, Kuldip:
"Objective intelligibility prediction of speech by combining correlation and distortion based techniques",
1225-1228.
Spoken Dialogue & Spoken Language Understanding Systems
Damnati, Géraldine / Charlet, Delphine:
"Multi-view approach for speaker turn role labeling in TV broadcast news shows",
1285-1288.
Gandhe, Sudeep / Rushforth, Michael / Aggarwal, Priti / Traum, David:
"Evaluation of an integrated authoring tool for building advanced question-answering characters",
1289-1292.
Tur, Gokhan / Hakkani-Tür, Dilek / Hillard, Dustin / Celikyilmaz, Asli:
"Towards unsupervised spoken language understanding: exploiting query click logs for slot filling",
1293-1296.
Lee, Donghyeon / Lee, Cheongjae / Jeong, Minwoo / Kim, Kyungduk / Kim, Seokhwan / Choi, Junhwi / Lee, Gary Geunbae:
"Web-enhanced content retrieval for information access dialogue system",
1297-1300.
Daubigney, Lucie / Gašić, Milica / Chandramohan, Senthilkumar / Geist, Matthieu / Pietquin, Olivier / Young, Steve:
"Uncertainty management for on-line optimisation of a POMDP-based large-scale spoken dialogue system",
1301-1304.
Hara, Sunao / Kitaoka, Norihide / Takeda, Kazuya:
"Detection of task-incomplete dialogs based on utterance-and-behavior tag n-gram for spoken dialog systems",
1305-1308.
Sarikaya, Ruhi / Chen, Stanley F. / Ramabhadran, Bhuvana:
"Shrinkage-based features for natural language call routing",
1309-1312.
Rachevsky, Leonid / Kanevsky, Dimitri / Sarikaya, Ruhi / Ramabhadran, Bhuvana:
"Clustering with modified cosine distance learned from constraints",
1313-1316.
Fandrianto, Andrew / Langner, Brian / Black, Alan W.:
"Using speaker ID to discover repeat callers of a spoken dialog system",
1317-1320.
Pinault, Florian / Lefèvre, Fabrice:
"Semantic graph clustering for POMDP-based spoken dialog systems",
1321-1324.
Taguchi, Ryo / Yamada, Yuji / Hattori, Koosuke / Umezaki, Taizo / Hoguro, Masahiro / Iwahashi, Naoto / Funakoshi, Kotaro / Nakano, Mikio:
"Learning place-names from spoken utterances and localization results by mobile robot",
1325-1328.
Gambäck, Björn / Olsson, Fredrik / Täckström, Oscar:
"Active learning for dialogue act classification",
1329-1332.
Bazillon, Thierry / Maza, Benjamin / Rouvier, Michael / Bechet, Frederic / Nasr, Alexis:
"Speaker role recognition using question detection and characterization",
1333-1336.
Huang, Qiang / Cox, Stephen J.:
"Learning score structure from spoken language for a tennis game",
1337-1340.
Witt, Silke M.:
"Semi-automated classifier adaptation for natural language call routing",
1341-1344.
Liang, Wei-Bin / Wu, Chung-Hsien / Wang, Chih-Hung / Wang, Jhing-Fa:
"Interactional style detection for versatile dialogue response using prosodic and semantic features",
1345-1348.
Kühnel, Christine / Weiss, Benjamin / Schulz, Matthias / Möller, Sebastian:
"Quality aspects of multimodal dialog systems: identity, stimulation and success",
1349-1352.
Prosodic Structure
Tepperman, Joseph / Nava, Emily:
"Where should pitch accents and phrase breaks go? a syntax tree transducer solution",
1353-1356.
Bocci, Giuliano / Avesani, Cinzia:
"Phrasal prominences do not need pitch movements: postfocal phrasal heads in Italian",
1357-1360.
Gac, David Le / Yoo, Hiyon:
"Intonation of left dislocated topics in modern greek",
1361-1364.
Thompson, Laura / Watson, Catherine I. / Harlow, Ray / King, Jeanette / Maclagan, Margaret / Charters, Helen / Keegan, Peter:
"Phrases, pitch and perceived prominence in māori",
1365-1368.
Duběda, Tomáš:
"Perceptual sensitivity to prenuclear and nuclear intonational patterns",
1369-1372.
Kalaldeh, Raya:
"Tonal alignment defined: the case of southern irish English",
1373-1376.
Rosenberg, Andrew:
"Using mutual information to identify regions of analysis for prosodic analysis",
1377-1380.
Tseng, Chiu-yu / Su, Chao-yu / Huang, Chi-Feng:
"Prosodic highlights in Mandarin continuous speech - cross-genre attributes and implications",
1381-1384.
Sulpizio, Simone / McQueen, James M.:
"When two newly-acquired words are one: new words differing in stress alone are not automatically represented differently",
1385-1388.
Bu, Shehui / Zhuo, Zhenjie / Yang, Lingling / Itahashi, Shuichi:
"Automatic determination of the standard Chinese prosodic phrase boundaries by f_0 generation model",
1389-1392.
Looze, Céline De / Rauzy, Stéphane:
"Measuring speakers' similarity in speech by means of prosodic cues: methods and potential",
1393-1396.
Yang, Li-chiung:
"Tonal variations in Mandarin: new evidence from spontaneous and read speech",
1397-1400.
Language Processing
Guinaudeau, Camille / Hirschberg, Julia:
"Accounting for prosodic information to improve ASR-based topic tracking for TV broadcast news",
1401-1404.
Imamura, Kenji / Izumi, Tomoko / Sadamitsu, Kugatsu / Saito, Kuniko / Kobashikawa, Satoshi / Masataki, Hirokazu:
"Morpheme conversion for connecting speech recognizer and language analyzers in unsegmented languages",
1405-1408.
Fang, Ren-Ying / Chen, Bo-Wei / Wang, Jhing-Fa / Wu, Chung-Hsien:
"Emotion detection based on concept inference and spoken sentence analysis for customer service",
1409-1412.
Cerisara, Christophe / Král, Pavel / Gardent, Claire:
"Commas recovery with syntactic features in French and in Czech",
1413-1416.
Falavigna, Daniele:
"Redundancy reduction in ASR of spontaneous speech through statistical machine translation",
1417-1420.
Chiang, Chin-Chih:
"From interview to news text: a study of taiwan TV Political interviews in newspaper reports",
1421-1424.
Paralinguistic Information - Classification and Detection
Oertel, Catharine / Scherer, Stefan / Campbell, Nick:
"On the use of multimodal cues for the prediction of degrees of involvement in spontaneous conversation",
1541-1544.
Nomoto, Narichika / Tamoto, Masafumi / Masataki, Hirokazu / Yoshioka, Osamu / Takahashi, Satoshi:
"Anger recognition in spoken dialog using linguistic and para-linguistic information",
1545-1548.
Ivanov, A. V. / Riccardi, G. / Sporka, A. J. / Franc, J.:
"Recognition of personality traits from human spoken conversations",
1549-1552.
Schuller, Björn / Zhang, Zixing / Weninger, Felix / Rigoll, Gerhard:
"Using multiple databases for training in emotion recognition: to unite or to vote?",
1553-1556.
Burkhardt, Felix / Schuller, Björn / Weiss, Benjamin / Weninger, Felix:
"“would you buy a car from me?” - on the likability of telephone voices",
1557-1560.
Gibson, James / Katsamanis, Athanasios / Black, Matthew P. / Narayanan, Shrikanth:
"Automatic identification of salient acoustic instances in couples' behavioral interactions using diverse density support vector machines",
1561-1564.
Neiberg, Daniel / Gustafson, Joakim:
"Predicting speaker changes and listener responses with and without eye-contact",
1565-1568.
Amarakeerthi, Senaka / Nwe, Tin Lay / Silva, Liyanage C. De / Cohen, Michael:
"Emotion classification using inter- and intra-subband energy variation",
1569-1572.
Kitahara, K. / Michiwiki, S. / Sato, M. / Matsunaga, S. / Yamashita, M. / Shinohara, K.:
"Emotion classification of infants' cries using duration ratios of acoustic segments",
1573-1576.
Vlasenko, Bogdan / Prylipko, Dmytro / Philippou-Hübner, David / Wendemuth, Andreas:
"Vowels formants analysis allows straightforward detection of high arousal acted and spontaneous emotions",
1577-1580.
Neiberg, Daniel / Laukka, Petri / Elfenbein, Hillary Anger:
"Intra-, inter-, and cross-cultural classification of vocal affect",
1581-1584.
Applications for Learning, Education, Aged and Handicapped Persons
Shirali-Shahreza, Sajad / Ganjali, Yashar / Balakrishnan, Ravin:
"Verifying human users in speech-based interactions",
1585-1588.
Cheng, Jian:
"Automatic assessment of prosody in high-stakes English tests",
1589-1592.
Luo, Dean / Yang, Xuesong / Wang, Lan:
"Improvement of segmental mispronunciation detection with prior knowledge extracted from large L2 speech corpus",
1593-1596.
Cheng, Jian / Shen, Jianqiang:
"Off-topic detection in automated speech assessment applications",
1597-1600.
Stüker, Sebastian / Fay, Johanna / Berkling, Kay:
"Towards context-dependent phonetic spelling error correction in children's freely composed text for diagnostic and pedagogical purposes",
1601-1604.
López-Ludeña, V. / San-Segundo, R. / Córdoba, R. / Ferreiros, J. / Montero, J. M. / Pardo, J. M.:
"Factored translation models for improving a speech into sign language translation system",
1605-1608.
Abari, Kálmán / Rácz, Zsuzsanna Zsófia / Olaszy, Gábor:
"Formant maps in Hungarian vowels - online data inventory for research, and education",
1609-1612.
Bordel, Germán / Nieto, Silvia / Penagarikano, Mikel / Rodriguez-Fuentes, Luis Javier / Varona, Amparo:
"Automatic subtitling of the basque parliament plenary sessions videos",
1613-1616.
Iribe, Yurie / Manosavanh, Silasak / Katsurada, Kouichi / Hayashi, Ryoko / Zhu, Chunyue / Nitta, Tsuneo:
"Generating animated pronunciation from speech through articulatory feature extraction",
1617-1620.
Chen, Wei / Mostow, Jack:
"A tale of two tasks: detecting children's off-task speech in a reading tutor",
1621-1624.
Isei-Jaakkola, Toshiko / Naka, Takatoshi / Hirose, Keikichi:
"Problems encountered by Japanese EL2 with English short vowels as illustrated on a 3d vowel chart",
1625-1628.
Pellegrini, Thomas / Correia, Rui / Trancoso, Isabel / Baptista, Jorge / Mamede, Nuno:
"Automatic generation of listening comprehension learning material in european portuguese",
1629-1632.
Liu, Chao-Hong / Wu, Chung-Hsien / Sarwono, David / Wang, Jhing-Fa:
"Candidate generation for ASR output error correction using a context-dependent syllable cluster-based confusion matrix",
1633-1636.
Huynh, Thai Hoa / Tran, Vu An / Tran, Huy Dat:
"Semi-supervised tree support vector machine for online cough recognition",
1637-1640.
Source Separation and Speech Enhancement
Zhang, Xueliang / Liu, Wenju:
"Monaural voiced speech segregation based on pitch and comb filter",
1741-1744.
Hirasawa, Yasuharu / Yasuraoka, Naoki / Takahashi, Toru / Ogata, Tetsuya / Okuno, Hiroshi G.:
"Fast and simple iterative algorithm of lp-norm minimization for under-determined speech separation",
1745-1748.
Rabiee, Azam / Setayeshi, Saeed / Lee, Soo-Young:
"Monaural speech separation based on a 2d processing and harmonic analysis",
1749-1752.
Jafari, Ingrid / Haque, Serajul / Togneri, Roberto / Nordholm, Sven:
"Underdetermined blind source separation with fuzzy clustering for arbitrarily arranged sensors",
1753-1756.
Vu, Dang Hai Tran / Haeb-Umbach, Reinhold:
"On initial seed selection for frequency domain blind speech separation",
1757-1760.
Tanaka, Nobuaki / Ogawa, Tetsuji / Kobayashi, Tetsunori:
"Spatial filter calibration based on minimization of modified LSD",
1761-1764.
Nakashika, Toru / Takiguchi, Tetsuya / Ariki, Yasuo:
"Probabilistic spectrum envelope: categorized audio-features representation for NMF-based sound decomposition",
1765-1768.
Choi, Jinho / Yoo, Chang D.:
"A high resolution multiple source localization based on generalized cumulant structure (GCS) matrix",
1769-1772.
Grais, Emad M. / Erdogan, Hakan:
"Single channel speech music separation using nonnegative matrix factorization with sliding windows and spectral masks",
1773-1776.
Marin-Hurtado, Jorge I. / Anderson, David V.:
"Perceptually-inspired processing for multichannel Wiener filter",
1777-1780.
Nakano, Shoichi / Yamamoto, Kazumasa / Nakagawa, Seiichi:
"Speech recognition in mixed sound of speech and music based on vector quantization and non-negative matrix factorization",
1781-1784.
Nakatani, Tomohiro / Araki, Shoko / Delcroix, Marc / Yoshioka, Takuya / Fujimoto, Masakiyo:
"Reduction of highly nonstationary ambient noise by integrating spectral and locational characteristics of speech and noise for robust ASR",
1785-1788.
Drioli, Carlo / Calanca, Andrea:
"Voice processing by dynamic glottal models with applications to speech enhancement",
1789-1792.
Sang, Jinqiu / Li, Guoping / Hu, Hongmei / Lutman, Mark E. / Bleeck, Stefan:
"Supervised sparse coding strategy in cochlear implants",
1793-1796.
Phonetics and Phonology, Stress, Accent, Rhythm
Bertini, Chiara / Bertinetto, Pier Marco / Zhi, Na:
"Chinese and Italian speech rhythm: normalization and the CCI algorithm",
1853-1856.
Mairano, Paolo / Romano, Antonio:
"Rhythm metrics on syllables and feet do not work as expected",
1857-1860.
Chen, Lei / Zechner, Klaus:
"Applying rhythm features to automatically assess non-native speech",
1861-1864.
Vaughan, Brian:
"Prosodic synchrony in co-operative task-based dialogues: a measure of agreement and disagreement",
1865-1868.
Niebuhr, Oliver / Wolf, Astrid:
"Low and high, short and long by crook or by hook?",
1869-1872.
Heinrich, Christian / Schiel, Florian:
"Estimating speaking rate by means of rhythmicity parameters",
1873-1876.
Arnold, Denis / Möbius, Bernd / Wagner, Petra:
"Comparing word and syllable prominence rated by naïve listeners",
1877-1880.
Tokuma, Shinichi / Xu, Yi:
"L1/L2 perception of lexical stress with F0 peak-delay: effect of an extra syllable added",
1881-1884.
Seng, Kheang / Iribe, Yurie / Nitta, Tsuneo:
"Letter-to-phoneme conversion based on two-stage neural network focusing on letter and phoneme contexts",
1885-1888.
Orr, Rosemary / Quené, Hugo / Beek, Roeland van / Diefenbach, Thari / Leeuwen, David A. van / Huijbregts, Marijn:
"An international English speech corpus for longitudinal study of accent development",
1889-1892.
Kim, Sunhee / Lee, Kyuwhan / Chung, Minhwa:
"A corpus-based study of English pronunciation variations",
1893-1896.
Stoakes, Hywel / Butcher, Andrew / Fletcher, Janet / Tabain, Marija:
"Long term average speech spectra in yolngu matha and pitjantjatjara speaking females and males",
1897-1900.
Gráczi, Tekla Etelka / Lulich, Steven M. / Csapó, Tamás Gábor / Beke, András:
"Context and speaker dependency in the relation of vowel formants and subglottal resonances - evidence from Hungarian",
1901-1904.
Pitch Processing - Singing Voice Analysis
Pawi, Alipah / Vaseghi, Saeed / Milner, Ben / Ghorshi, Seyed:
"Fundamental frequency estimation using modified higher order moments and multiple windows",
1965-1968.
Wohlmayr, Michael / Pernkopf, Franz:
"EM-based gain adaptation for probabilistic multipitch tracking",
1969-1972.
Drugman, Thomas / Alwan, Abeer:
"Joint robust voicing detection and pitch estimation based on residual harmonics",
1973-1976.
Govind, D. / Prasanna, S. R. M. / Pati, Debadatta:
"Epoch extraction in high pass filtered speech using hilbert envelope",
1977-1980.
Pavlovets, Alexander / Petrovsky, Alexander:
"Robust HNR-based closed-loop pitch and harmonic parameters estimation",
1981-1984.
Prakash, Chetana / , Dhananjaya N. / Gangashetty, Suryakanth V.:
"Exploring bessel features for detection of glottal closure instants",
1985-1988.
Cabral, João P. / Kane, John / Gobl, Christer / Carson-Berndsen, Julie:
"Evaluation of glottal epoch detection algorithms on different voice types",
1989-1992.
Origlia, Antonio / Abete, Giovanni / Cutugno, Francesco / Alfano, Iolanda / Savy, Renata / Ludusan, Bogdan:
"A divide et impera algorithm for optimal pitch stylization",
1993-1996.
Sousa, Ricardo / Ferreira, Aníbal:
"Singing voice analysis using relative harmonic delays",
1997-2000.
Lee, S. W. / Dong, Minghui:
"Singing voice synthesis: singer-dependent vibrato modeling and coherent processing of spectral envelope",
2001-2004.
Beux, Sylvain Le / Feugère, Lionel / d'Alessandro, Christophe:
"Chorus digitalis: experiments in chironomic choir singing",
2005-2008.
Prosodic Modeling
Li, Kun / Zhang, Shuang / Li, Mingxing / Lo, Wai-Kit / Meng, Helen:
"Prominence model for prosodic features in automatic lexical stress and pitch accent detection",
2009-2012.
Li, Ya / Tao, Jianhua / Xu, Xiaoying:
"Hierarchical stress modeling in Mandarin text-to-speech",
2013-2016.
Ni, Chong-Jia / Liu, Wenju / Xu, Bo:
"Automatic prosodic events detection by using syllable-based acoustic, lexical and syntactic features",
2017-2020.
Rilliard, Albert / Allauzen, Alexandre / Boula de Mareüil, Philippe:
"Using dynamic time warping to compute prosodic similarity measures",
2021-2024.
Barbosa, Plínio A. / Mixdorff, Hansjörg / Madureira, Sandra:
"Applying the quantitative target approximation model (qTA) to German and brazilian portuguese",
2025-2028.
Obin, Nicolas / Lacheret, Anne / Rodet, Xavier:
"Stylization and trajectory modelling of short and long term speech prosody variations",
2029-2032.
Avanzi, Mathieu / Obin, Nicolas / Lacheret-Dujour, Anne / Victorri, Bernard:
"Toward a continuous modeling of French prosodic structure: using acoustic features to predict prominence location and prominence degree",
2033-2036.
Mahrt, Tim / Huang, Jui-Ting / Mo, Yoonsook / Fleck, Margaret / Hasegawa-Johnson, Mark / Cole, Jennifer:
"Optimal models of prosodic prominence using the Bayesian information criterion",
2037-2040.
Hussein, Hussein / Mixdorff, Hansjörg / Do, Hue San / Hoffmann, Rüdiger:
"Quantitative analysis of tone coarticulation in Mandarin",
2041-2044.
Neiberg, Daniel / Ananthakrishnan, G. / Gustafson, Joakim:
"Tracking pitch contours using minimum jerk trajectories",
2045-2048.
Discourse and Dialogue
Maza, Benjamin / El-Beze, Marc / Linares, Georges / Mori, Renato De:
"On the use of linguistic features in an automatic system for speech analytics of telephone conversations",
2049-2052.
Kazemzadeh, Abe / Lee, Sungbok / Georgiou, Panayiotis G. / Narayanan, Shrikanth:
"Determining what questions to ask, with the help of spectral graph theory",
2053-2056.
Buschmeier, Hendrik / Malisz, Zofia / Włodarczak, Marcin / Kopp, Stefan / Wagner, Petra:
"`are you sure you're paying attention?' - `uh-huh' communicating understanding as a marker of attentiveness",
2057-2060.
Ishimoto, Yuichi / Enomoto, Mika / Iida, Hitoshi:
"Projectability of transition-relevance places using prosodic features in Japanese spontaneous conversation",
2061-2064.
Hjalmarsson, Anna / Laskowski, Kornel:
"Measuring final lengthening for speaker-change prediction",
2065-2068.
Laskowski, Kornel / Edlund, Jens / Heldner, Mattias:
"Incremental learning and forgetting in stochastic turn-taking models",
2069-2072.
Georgila, Kallirroi / Traum, David:
"Reinforcement learning of argumentation dialogue Policies in negotiation",
2073-2076.
Heinroth, Tobias / Koleva, Savina / Minker, Wolfgang:
"Topic switching strategies for spoken dialogue systems",
2077-2080.
Higashinaka, Ryuichiro / Kawamae, Noriaki / Sadamitsu, Kugatsu / Minami, Yasuhiro / Meguro, Toyomi / Dohsaka, Kohji / Inagaki, Hirohito:
"Unsupervised clustering of utterances using non-parametric Bayesian methods",
2081-2084.
SLP for Speech Translation, Information Extraction and Retrieval
Parada, Carolina / Dredze, Mark / Jelinek, Frederick:
"OOV sensitive named-entity recognition in speech",
2085-2088.
Saers, Markus / Wu, Dekai / Lo, Chi-kiu / Addanki, Karteek:
"Speech translation with grammar driven probabilistic phrasal bilexica extraction",
2089-2092.
Tillmann, Christoph / Hewavitharana, Sanjika:
"An efficient unified extraction algorithm for bilingual data",
2093-2096.
Huang, Songfang / Zhou, Bowen:
"Using features from topic models to alleviate over-generation in hierarchical phrase-based translation",
2097-2100.
Huang, Songfang / Zhou, Bowen:
"An empirical study on improving hierarchical phrase-based translation using alignment features",
2101-2104.
He, Xiaodong / Deng, Li:
"Robust speech translation by domain adaptation",
2105-2108.
Ettelaie, Emil / Georgiou, Panayiotis G. / Narayanan, Shrikanth:
"Enhancements to the training process of classifier-based speech translator via topic modeling",
2109-2112.
Sridhar, Vivek Kumar Rangarajan / Barbosa, Luciano / Bangalore, Srinivas:
"A scalable approach to building a parallel corpus from the web",
2113-2116.
Itoh, Yoshiaki / Iwata, Kohei / Ishigame, Masaaki / Tanaka, Kazuyo / Lee, Shi-wook:
"Spoken term detection results using plural subword models by estimating detection performance for each query",
2117-2120.
Barbosa, Luciano / Caseiro, Diamantino / Fabbrizio, Giuseppe Di / Stent, Amanda:
"Speechforms: from web to speech and back",
2121-2124.
Noritake, Kazuyuki / Nanjo, Hiroaki / Yoshimi, Takehiko:
"Image processing filters for line detection-based spoken term detection",
2125-2128.
Polifroni, Joe / Mairesse, François:
"Using latent topic features for named entity extraction in search queries",
2129-2132.
Masumura, Ryo / Hahm, Seongjun / Ito, Akinori:
"Language model expansion using webdata for spoken document retrieval",
2133-2136.
Akiba, Tomoyosi / Honda, Koichiro:
"Effects of query expansion for spoken document passage retrieval",
2137-2140.
Chan, Chun-an / Lee, Lin-shan:
"Unsupervised hidden Markov modeling of spoken queries for spoken term detection without speech recognition",
2141-2144.
Gemello, Roberto / Mana, Franco / Batzu, Pier Domenico:
"Topic identification from audio recordings using rich recognition results and neural network based classifiers",
2145-2148.
Speech Synthesis - Selected Topics
Parlikar, Alok / Black, Alan W.:
"A grammar based approach to style specific phrase prediction",
2149-2152.
Watts, Oliver / Zhou, Bowen:
"Unsupervised features from text for speech synthesis in a speech-to-speech translation system",
2153-2156.
Watts, Oliver / Yamagishi, Junichi / King, Simon:
"Unsupervised continuous-valued word features for phrase-break prediction without a part-of-speech tagger",
2157-2160.
Campillo, Francisco / Méndez, Francisco / Arza, Montserrat / Docío, Laura / Bonafonte, Antonio / Navas, Eva / Sainz, Iñaki:
"Albayzín 2010: a Spanish text to speech evaluation",
2161-2164.
Shen, Binbin / Wu, Zhiyong / Wang, Yongxin / Cai, Lianhong:
"Combining active and semi-supervised learning for homograph disambiguation in Mandarin text-to-speech synthesis",
2165-2168.
Ewender, Thomas / Pfister, Beat:
"Automatically creating a diphone set from a speech database",
2169-2172.
Mattheyses, Wesley / Latacz, Lukas / Verhelst, Werner:
"Automatic viseme clustering for audiovisual speech synthesis",
2173-2176.
Hinterleitner, Florian / Möller, Sebastian / Norrenbrock, Christoph / Heute, Ulrich:
"Perceptual quality dimensions of text-to-speech systems",
2177-2180.
Mori, Shinsuke / Neubig, Graham:
"A pointwise approach to pronunciation estimation for a TTS front-end",
2181-2184.
Abou-Zleikha, Mohamed / Carson-Berndsen, Julie:
"Correlating text with prosody",
2185-2188.
Rosenberg, Andrew / Fernandez, Raul / Ramabhadran, Bhuvana:
"“what is… dengue fever?” - modeling and predicting pronunciation errors in a text-to-speech system",
2189-2192.
Norrenbrock, Christoph / Heute, Ulrich / Hinterleitner, Florian / Möller, Sebastian:
"Aperiodicity analysis for quality estimation of text-to-speech signals",
2193-2196.
Human Speech and Sound Perception I, II
Klintfors, Eeva / Marklund, Ellen / Lacerda, Francisco:
"Parallels in infants' attention to speech articulation and to physical changes in speech-unrelated objects",
2197-2200.
Duran, Daniel / Bruni, Jagoda / Dogil, Grzegorz / Schütze, Hinrich:
"Speech events are recoverable from unlabeled articulatory data: using an unsupervised clustering approach on data obtained from electromagnetic midsaggital articulography (EMA)",
2201-2204.
Strömbergsson, Sofia:
"Children's recognition of their own voice: influence of phonological impairment",
2205-2208.
Kagomiya, Takayuki / Nakagawa, Seiji:
"Evaluation of bone-conducted ultrasonic hearing-aid regarding transmission of speaker discrimination information",
2209-2212.
Herff, Christian / Janke, Matthias / Wand, Michael / Schultz, Tanja:
"Impact of different feedback mechanisms in EMG-based speech recognition",
2213-2216.
Yip, Michael C. W.:
"Phonotactic constraints and the segmentation of Cantonese speech",
2217-2220.
Schneider, Katrin / Dogil, Grzegorz / Möbius, Bernd:
"Reaction time and decision difficulty in the perception of intonation",
2221-2224.
Honbolygó, Ferenc / Csépe, Valéria:
"Processing of stress related acoustic cues as indexed by ERPs",
2225-2228.
Witteman, Marijt J. / Weber, Andrea / McQueen, James M.:
"On the relationship between perceived accentedness, acoustic similarity, and processing difficulty in foreign-accented speech",
2229-2232.
Amano, Shigeaki / Hirata, Yukari:
"The perception boundary between single and geminate stops in 3- and 4-mora Japanese words",
2233-2236.
Ijima, Yusuke / Isogai, Mitsuaki / Mizuno, Hideyuki:
"Correlation analysis of acoustic features with perceptual voice quality similarity for similar speaker selection",
2237-2240.
Jesse, Alexandra / Mitterer, Holger:
"Pointing gestures do not influence the perception of lexical stress",
2445-2448.
Cushing, Ian R. / Li, Francis F. / Worrall, Ken / Jackson, Tim:
"Relationships between phonetic features and speech perception - a statistical investigation from a large anechoic british English corpus",
2449-2452.
Brown, Guy J. / Jürgens, Tim / Meddis, Ray / Robertson, Matthew / Clark, Nicholas R.:
"The representation of speech in a nonlinear auditory model: time-domain analysis of simulated auditory-nerve firing patterns",
2453-2456.
Coelho, Luis / Braga, Daniela / Sales-Dias, Miguel / Garcia-Mateo, Carmen:
"An automatic voice pleasantness classification system based on prosodic and acoustic patterns of voice preference",
2457-2460.
Carré, René / Divenyi, Pierre / Serniclaes, Willy / Ferragne, Emmanuel / Marsico, Egidio / Nguyen, Viet-Son:
"Contributions of F1 and F2 (F2') to the perception of plosive consonants",
2461-2464.
Kim, Jeesun / Davis, Chris:
"Auditory speech processing is affected by visual speech in the periphery",
2465-2468.
Paris, Tim / Kim, Jeesun / Davis, Chris:
"Visual speech speeds up auditory identification responses",
2469-2472.
Takashima, Ryoichi / Nagano, Tohru / Tachibana, Ryuki / Nishimura, Masafumi:
"Agglomerative hierarchical clustering of emotions in speech based on subjective relative similarity",
2473-2476.
Mai, Guangting / Peng, Gang:
"Optimal syllabic rates and processing units in perceiving Mandarin spoken sentences",
2477-2480.
Wester, Mirjam / Liang, Hui:
"Cross-lingual speaker discrimination using natural and synthetic speech",
2481-2484.
Multilingual and Multimodal Approaches to Spoken Language
Navarathna, Rajitha / Kleinschmidt, Tristan / Dean, David / Sridharan, Sridha / Lucey, Patrick:
"Can audio-visual speech recognition outperform acoustically enhanced speech recognition in automotive environment?",
2241-2244.
Alabau, Vicent / Romero, Verónica / Lagarda, Antonio-L. / Martínez-Hinarejos, Carlos-D.:
"A multimodal approach to dictation of handwritten historical documents",
2245-2248.
Toutios, Asterios / Musti, Utpala / Ouni, Slim / Colotte, Vincent:
"Weight optimization for bimodal unit-selection talking head synthesis",
2249-2252.
Schaffer, Stefan / Jöckel, Benjamin / Wechsung, Ina / Schleicher, Robert / Möller, Sebastian:
"Modality selection and perceived mental effort in a mobile application",
2253-2256.
Ajmera, Jitendra / Verma, Ashish:
"A cross-lingual spoken content search system",
2257-2260.
Girardi, C. / Gretter, Roberto / Falavigna, Daniele / Brugnara, Fabio / Giuliani, Diego / Federico, M.:
"Nemo: a platform for multilingual news monitoring",
2261-2264.
Chaudhuri, Sourish / Harvilla, Mark / Raj, Bhiksha:
"Unsupervised learning of acoustic unit descriptors for audio content representation and classification",
2265-2268.
Glodek, Michael / Scherer, Stefan / Schwenker, Friedhelm:
"Conditioned hidden Markov model fusion for multimodal classification",
2269-2272.
Lecouteux, Benjamin / Vacher, Michel / Portet, François:
"Distant speech recognition in a smart home: comparison of several multisource ASRs in realistic conditions",
2273-2276.
Chen, Jiansong / Zhu, Lei / Feng, Bailan / Ding, Peng / Xu, Bo:
"A robust approach to mining repeated sequence in audio stream",
2277-2280.
ASR - New Paradigms and Other Topics
Yu, Dong / Deng, Li:
"Accelerated parallelizable neural network learning algorithm for speech recognition",
2281-2284.
Deng, Li / Yu, Dong:
"Deep convex net: a scalable architecture for speech pattern classification",
2285-2288.
Wang, Siwei / Levow, Gina-Anne:
"Modeling broad context for tone recognition with conditional random fields",
2289-2292.
Li, Shang-wen / Wang, Yow-bang / Sun, Liang-che / Lee, Lin-shan:
"Improved tonal language speech recognition by integrating spectro-temporal evidence and pitch information with properly chosen tonal acoustic units",
2293-2296.
Gouvêa, Evandro / Davel, Marelie H.:
"Kullback-leibler divergence-based ASR training data selection",
2297-2300.
Næss, Arild Brandrud / Livescu, Karen / Prabhavalkar, Rohit:
"Articulatory feature classification using nearest neighbors",
2301-2304.
Demange, Sébastien / Ouni, Slim:
"Continuous episodic memory based speech recognition using articulatory dynamics",
2305-2308.
Li, T. / Woodland, P. C. / Diehl, F. / Gales, M. J. F.:
"Graphone model interpolation and Arabic pronunciation generation",
2309-2312.
Illina, Irina / Fohr, Dominique / Jouvet, Denis:
"Grapheme-to-phoneme conversion using conditional random fields",
2313-2316.
Yeh, Ching-Feng / Huang, Chao-Yu / Lee, Lin-shan:
"Bilingual acoustic model adaptation by unit merging on different levels and cross-level integration",
2317-2320.
Schraagen, Marijn / Bloothooft, Gerrit:
"A qualitative evaluation of phoneme-to-phoneme technology",
2321-2324.
Falavigna, Daniele / Gretter, Roberto:
"Cheap bootstrap of multi-lingual hidden Markov models",
2325-2328.
Mesgarani, Nima / Thomas, Samuel / Hermansky, Hynek:
"Adaptive stream fusion in multistream recognition of speech",
2329-2332.
Siu, Man-hung / Gish, Herbert / Lowe, Steve / Chan, Arthur:
"Unsupervised audio patterns discovery using HMM-based self-organized units",
2333-2336.
Labiak, John / Livescu, Karen:
"Nearest neighbors with learned distances for phonetic frame classification",
2337-2340.
Speech Audio Analysis and Classification
Fagerlund, Seppo / Laine, Unto K.:
"Stop consonant recognition by temporal fine structure of burst",
2385-2388.
Kirchhoff, Katrin / Alexandrescu, Andrei:
"Phonetic classification using controlled random walks",
2389-2392.
Marujo, Luís / Viveiros, Márcio / Neto, João Paulo da Silva:
"Keyphrase cloud generation of broadcast news",
2393-2396.
Canterla, Alfonso M. / Johnsen, Magne H.:
"Optimized feature extraction and HMMs in subword detectors",
2397-2400.
Shi, Ziqiang / Han, Jiqing / Zheng, Tieran:
"Real-world speech/non-speech audio classification based on sparse representation features and GPCs",
2401-2404.
Pathak, Manas A. / Raj, Bhiksha:
"Privacy preserving speaker verification using adapted GMMs",
2405-2408.
Székely, Éva / Cabral, João P. / Cahill, Peter / Carson-Berndsen, Julie:
"Clustering expressive speech styles in audiobooks using glottal source parameters",
2409-2412.
Ludusan, Bogdan / Origlia, Antonio / Cutugno, Francesco:
"On the use of the rhythmogram for automatic syllabic prominence detection",
2413-2416.
Sam, Sethserey / Xiao, Xiong / Besacier, Laurent / Castelli, Eric / Li, Haizhou / Chng, Eng Siong:
"Speech modulation features for robust nonnative speech accent detection",
2417-2420.
Zhang, Chi / Hansen, John H. L.:
"Frame-level vocal effort likelihood space modeling for improved whisper-island detection",
2421-2424.
Fan, Xing / Hansen, John H. L.:
"Speaker identification for whispered speech using a training feature transformation from neutral to whisper",
2425-2428.
DeMarco, Andrea / Cox, Stephen J.:
"An accurate and robust gender identification algorithm",
2429-2432.
Yang, Xiaohong / Chen, Qingcai / Zhou, Shusen / Wang, Xiaolong:
"Deep belief networks for automatic music genre classification",
2433-2436.
Dennis, Jonathan / Tran, Huy Dat / Li, Haizhou:
"Image representation of the subband power distribution for robust sound classification",
2437-2440.
Xiao, Bo / Rozgić, Viktor / Katsamanis, Athanasios / Baucom, Brian R. / Georgiou, Panayiotis G. / Narayanan, Shrikanth:
"Acoustic and visual cues of turn-taking dynamics in dyadic interactions",
2441-2444.
Speech Audio Analysis
Shi, Yong-zhe / Zhang, Wei-Qiang / Liu, Jia:
"Robust audio fingerprinting based on local spectral luminance maxima scheme",
2485-2488.
Laine, Unto K.:
"Entropy-rate driven inference of stochastic grammars",
2489-2492.
Lee, Sheng-Chieh / Bharanitharan, K. / Chen, Bo-Wei / Wang, Jhing-Fa / Wu, Chung-Hsien / Liao, Min-Jian:
"An efficient pre-processing scheme to improve the sound source localization system in noisy environment",
2493-2496.
Le-Jan, Guylaine / Benezeth, Yannick / Gravier, Guillaume / Bimbot, Frédéric:
"A study on auditory feature spaces for speech-driven lip animation",
2497-2500.
Loweimi, Erfan / Ahadi, Seyed Mohammad / Sheikhzadeh, Hamid:
"Phase-only speech reconstruction using very short frames",
2501-2504.
Skogstad, Trond / Svendsen, Torbjørn:
"Frequency-warped and stabilized time-varying cepstral coefficients",
2505-2508.
William, Freddy / Sangwan, Abhijeet / Hansen, John H. L.:
"Using human perception for automatic accent assessment",
2509-2512.
Molina, Carlos / Lee, Sungbok / Narayanan, Shrikanth / Yoma, Néstor Becerra:
"A study of the effectiveness of articulatory strokes for phonemic recognition",
2513-2516.
Okamoto, Erika / Irino, Toshio / Nisimura, Ryuichi / Kawahara, Hideki:
"Auditory filterbank improves voice morphing",
2517-2520.
Fuchs, Anna Katharina / Feldbauer, Christian / Stark, Michael:
"Monaural sound localization",
2521-2524.
Speech Coding
Fukui, Masahiro / Sasaki, Shigeaki / Hiwasaki, Yusuke / Sachiko, Kurihara / Haneda, Yoichi:
"Dual-mode AVQ coding based on spectral masking and sparseness detection for ITU-t g.711.1/g.722 super-wideband extensions",
2525-2528.
Taufique, Azar / Vijayasankar, Kumaran / Kim, Wooil / Hansen, John H. L. / Tacca, Marco / Fumagalli, Andrea:
"Phone impact based speech transmission technique for reliable speech recognition in poor wireless network conditions",
2529-2532.
Zhou, Jingting / Garcia-Romero, Daniel / Espy-Wilson, Carol Y.:
"Automatic speech codec identification with applications to tampering detection of speech recordings",
2533-2536.
Lee, Chang-Heon / Rosec, Olivier / Stylianou, Yannis:
"A hybrid quasi-harmonic/CELP wideband speech coding scheme for unit selection TTS synthesis",
2537-2540.
Rämö, Anssi / Toukomaa, Henri:
"Voice quality characterization of IETF opus codec",
2541-2544.
Pedersen, C. F.:
"Leja ordering LSFs for accurate estimation of predictor coefficients",
2545-2548.
Gong, Qipeng / Kabal, Peter:
"Improved quality for conversational voIP using path diversity",
2549-2552.
Khan, Abdul Hannan / Kabal, Peter:
"Tree encoding for the ITU-t g.711.1 speech coder",
2553-2556.
Wang, Dong / Vipperla, Ravichander / Evans, Nicholas:
"Parallel and hierarchical decision making for sparse coding in speech recognition",
2557-2560.
Chiang, Chen-Yu / Yang, Jyh-Her / Liu, Ming-Chieh / Wang, Yih-Ru / Liao, Yuan-Fu / Chen, Sin-Horn:
"A new model-based Mandarin-speech coding system",
2561-2564.
Robustness and Adaptation for ASR
Cerva, Petr / Palecek, Karel / Silovsky, Jan / Nouza, Jan:
"Using unsupervised feature-based speaker adaptation for improved transcription of spoken archives",
2565-2568.
Fischer, Volker / Kunzmann, Siegfried:
"Online speaker adaptation with pre-computed FMLLR transformations",
2569-2572.
Giuliani, Diego / Brugnara, Fabio:
"Instantaneous speaker adaptation through selection and combination of fMLLR transformation matrices",
2573-2576.
Song, Hwa Jeon / Lee, Yunkeun / Kim, Hyung Soon:
"Joint bilinear transformation space based maximum a posteriori linear regression adaptation using prior with variance function",
2577-2580.
Sanand, D. R. / Kurimo, Mikko:
"A study on combining VTLN and SAT to improve the performance of automatic speech recognition",
2581-2584.
Tsao, Yu / Dixon, Paul R. / Hori, Chiori / Kawai, Hisashi:
"Incorporating regional information to enhance MAP-based stochastic feature compensation for robust speech recognition",
2585-2588.
Ghai, Shweta / Sinha, Rohit:
"A study on the effect of pitch on LPCC and PLPC features for children's ASR in comparison to MFCC",
2589-2592.
Jouvet, Denis / Fohr, Dominique / Illina, Irina:
"About handling boundary uncertainty in a speaking rate dependent modeling approach",
2593-2596.
Wu, Ji / He, Zhiyang / Lv, Ping:
"An active learning approach to task adaptation",
2597-2600.
Joshi, Vikas / Bilgi, Raghavendra / Umesh, S. / Benitez, C. / Garcia, L.:
"Efficient speaker and noise normalization for robust speech recognition",
2601-2604.
Winkler, Thomas:
"How realistic is artificially added noise?",
2605-2608.
Voice Activity Detection
Unoki, Masashi / Lu, Xugang / Petrick, Rico / Morita, Shota / Akagi, Masato / Hoffmann, Rüdiger:
"Voice activity detection in MTF-based power envelope restoration",
2609-2612.
Espi, Miquel / Miyabe, Shigeki / Nishimoto, Takuya / Ono, Nobutaka / Sagayama, Shigeki:
"Using spectral fluctuation of speech in multi-feature HMM-based voice activity detection",
2613-2616.
Mehta, Kannu / Pham, Chau Khoa / Chng, Eng Siong:
"Linear dynamic models for voice activity detection",
2617-2620.
Pohjalainen, Jouni / Raitio, Tuomo / Alku, Paavo:
"Detection of shouted speech in the presence of ambient noise",
2621-2624.
Fukuda, Takashi / Ichikawa, Osamu / Nishimura, Masafumi:
"Breath-detection-based telephony speech phrasing",
2625-2628.
Kim, Gibak:
"Multi-channel voice activity detection based on conic constraints",
2629-2632.
Petsatodis, Theodoros / Talantzis, Fotios / Boukis, Christos / Tan, Zheng-Hua / Prasad, Ramjee:
"Multi-sensor voice activity detection based on multiple observation hypothesis testing",
2633-2636.
Gao, Chao / Saikumar, Guruprasad / Khanwalkar, Saurabh / Herscovici, Avi / Kumar, Anoop / Srivastava, Amit / Natarajan, Premkumar:
"Online speech activity detection in broadcast news",
2637-2640.
Reich, Daniel / Putze, Felix / Heger, Dominic / Ijsselmuiden, Joris / Stiefelhagen, Rainer / Schultz, Tanja:
"A real-time speech command detector for a smart control room",
2641-2644.
Chuangsuwanich, Ekapol / Glass, James:
"Robust voice activity detector for real world applications using harmonicity and modulation frequency",
2645-2648.
Dekens, Tomas / Verhelst, Werner:
"On noise robust voice activity detection",
2649-2652.
Lu, Xugang / Unoki, Masashi / Isotani, Ryosuke / Kawai, Hisashi / Nakamura, Satoshi:
"Adaptive regularization framework for robust voice activity detection",
2653-2656.
Human Speech Production I
Koriyama, Tomoki / Nose, Takashi / Kobayashi, Takao:
"On the use of extended context for HMM-based spontaneous conversational speech synthesis",
2657-2660.
Toutios, Asterios / Ouni, Slim:
"Predicting tongue positions from acoustics and facial features",
2661-2664.
Bosch, Louis ten / Hämäläinen, Annika / Ernestus, Mirjam:
"Assessing acoustic reduction: exploiting local structure in speech",
2665-2668.
Andreeva, Bistra / Wolska, Magdalena:
"The “fortis-lenis” distinction in Bulgarian and German",
2669-2672.
Chen, Gang / Kreiman, Jody / Shue, Yen-Liang / Alwan, Abeer:
"Acoustic correlates of glottal gaps",
2673-2676.
Bush, Brian O. / Hosom, John-Paul / Kain, Alexander / Amano-Kusumoto, Akiko:
"Using a genetic algorithm to estimate parameters of a coarticulation model",
2677-2680.
Birkholz, Peter / Kröger, Bernd J. / Neuschaefer-Rube, Christiane:
"Synthesis of breathy, normal, and pressed phonation using a two-mass model with a triangular glottis",
2681-2684.
Ghosh, Prasanta Kumar / Narayanan, Shrikanth:
"Analysis of inter-articulator correlation in acoustic-to-articulatory inversion using generalized smoothness criterion",
2685-2688.
Kaburagi, Tokihiko:
"Frequency-domain representation of source-filter coupling and its effect in the production of voice",
2689-2692.
Rasilo, Heikki / Laine, Unto K. / Räsänen, Okko / Altosaar, Toomas:
"Method for speech inversion with large scale statistical evaluation",
2693-2696.
Braun, Bettina / Geiselmann, Sabine:
"Italian in the no-man's land between stress-timing and syllable-timing? speakers are more stress-timed than listeners",
2697-2700.
Folk, Laura / Schiel, Florian:
"The lombard effect in spontaneous dialog speech",
2701-2704.
Voice Conversion and Speech Synthesis
Pilkington, Nicholas C. V. / Zen, Heiga / Gales, M. J. F.:
"Gaussian process experts for voice conversion",
2761-2764.
Veaux, Christophe / Rodet, Xavier:
"Intonation conversion from neutral to expressive speech",
2765-2768.
Hattori, Nobuhiko / Toda, Tomoki / Kawai, Hisashi / Saruwatari, Hiroshi / Shikano, Kiyohiro:
"Speaker-adaptive speech synthesis based on eigenvoice conversion and language-dependent prosodic conversion in speech-to-speech translation",
2769-2772.
Pérez, Javier / Bonafonte, Antonio:
"Adding glottal source information to intra-lingual voice conversion",
2773-2776.
Lei, Ming / Yamagishi, Junichi / Richmond, Korin / Ling, Zhen-Hua / King, Simon / Dai, Li-Rong:
"Formant-controlled HMM-based speech synthesis",
2777-2780.
Raitio, Tuomo / Suni, Antti / Vainio, Martti / Alku, Paavo:
"Analysis of HMM-based lombard speech synthesis",
2781-2784.
Obin, Nicolas / Lanchantin, Pierre / Lacheret, Anne / Rodet, Xavier:
"Discrete/continuous modelling of speaking style in HMM-based speech synthesis: design and evaluation",
2785-2788.
Sung, June Sig / Hong, Doo Hwa / Kang, Shin Jae / Kim, Nam Soo:
"Factored MLLR adaptation for singing voice generation",
2789-2792.
Hirose, Keikichi / Ochi, Keiko / Mihara, Ryusuke / Hashimoto, Hiroya / Saito, Daisuke / Minematsu, Nobuaki:
"Adaptation of prosody in speech synthesis by changing command values of the generation process model of fundamental frequency",
2793-2796.
Wen, Miaomiao / Wang, Miaomiao / Hirose, Keikichi / Minematsu, Nobuaki:
"Prosody conversion for emotional Mandarin speech synthesis using the tone nucleus model",
2797-2800.
Karhila, Reima / Wester, Mirjam:
"Rapid adaptation of foreign-accented HMM-based speech synthesis",
2801-2804.
Tóth, Bálint / Fegyó, Tibor / Németh, Géza:
"The effects of phoneme errors in speaker adaptation for HMM speech synthesis",
2805-2808.
Human Speech Production II
Berry, Jeffrey / Ji, Sunjing / Fasel, Ian / Archangeli, Diana:
"Articulatory reduction in Mandarin Chinese words",
2809-2812.
Lammert, Adam / Proctor, Michael / Katsamanis, Athanasios / Narayanan, Shrikanth:
"Morphological variation in the adult vocal tract: a modeling study of its potential acoustic impact",
2813-2816.
Lulich, Steven M. / Arsikere, Harish / Morton, John R. / Leung, Gary K. F. / Alwan, Abeer / Sommers, Mitchell S.:
"Analysis and automatic estimation of children's subglottal resonances",
2817-2820.
Wokurek, Wolfgang / Madsack, Andreas:
"Acceleration sensor based estimates of subglottal resonances: short vs. long vowels",
2821-2824.
Audibert, Nicolas / Amelot, Angélique:
"Comparison of nasalance measurements from accelerometers and microphones and preliminary development of novel features",
2825-2828.
Fitzpatrick, Michael / Kim, Jeesun / Davis, Chris:
"The effect of seeing the interlocutor on speech production in different noise types",
2829-2832.
Aubanel, Vincent / Cooke, Martin / Villegas, Julián / Lecumberri, Maria Luisa Garcia:
"Conversing in the presence of a competing conversation: effects on speech production",
2833-2836.
Heldner, Mattias / Edlund, Jens / Hjalmarsson, Anna / Laskowski, Kornel:
"Very short utterances and timing in turn-taking",
2837-2840.
Katsamanis, Athanasios / Bresch, Erik / Ramanarayanan, Vikram / Narayanan, Shrikanth:
"Validating rt-MRI based articulatory representations via articulatory recognition",
2841-2844.
Li, Yinghao / Kong, Jiangping:
"An electropalatographic and acoustic study on anticipatory coarticulation in V1#C2V2 sequences in standard Chinese",
2845-2848.
Hanique, Iris / Ernestus, Mirjam:
"Final /t/ reduction in dutch past-participles: the role of word predictability and morphological decomposability",
2849-2852.
Raeesy, Zeynab / Baghai-Ravary, Ladan / Coleman, John:
"Parametrising degree of articulator movement from dynamic MRI data",
2853-2856.
Systems for LVCSR and Rich Transcription
Liu, X. / Gales, M. J. F. / Woodland, P. C.:
"Improving LVCSR system combination using neural network language model cross adaptation",
2857-2860.
Xue, Jian / Cui, Xiaodong / Daggett, Gregg / Marcheret, Etienne / Zhou, Bowen:
"Towards high performance LVCSR in speech-to-speech translation system on smart phones",
2861-2864.
Sung, Yun-Hsuan / Jansche, Martin / Moreno, Pedro J.:
"Deploying google search by voice in Cantonese",
2865-2868.
Al-Shareef, Sarah / Hain, Thomas:
"An investigation in speech recognition for colloquial Arabic",
2869-2872.
Brugnara, Fabio:
"A multithreaded implementation of Viterbi decoding on recursive transition networks",
2873-2876.
Kombrink, Stefan / Mikolov, Tomáš / Karafiát, Martin / Burget, Lukáš:
"Recurrent neural network based language modeling in meeting recognition",
2877-2880.
Cossalter, Michele / Sundararajan, Priya / Lane, Ian:
"Ad-hoc meeting transcription on clusters of mobile devices",
2881-2884.
Abida, Kacem / Karray, Fakhri:
"ROVER enhancement with automatic error detection",
2885-2888.
Akita, Yuya / Kawahara, Tatsuya:
"Automatic comma insertion of lecture transcripts based on multiple annotations",
2889-2892.
Language, Dialect Identification and Speaker Diarization
You, Chang Huai / Li, Haizhou / Lee, Kong Aik:
"Study on the relevance factor of maximum a posteriori with GMM for language recognition",
2893-2896.
Habib, Tania / Romsdorfer, Harald:
"Improving multiband position-pitch algorithm for localization and tracking of multiple concurrent speakers by using a frequency selective criterion",
2897-2900.
Varona, Amparo / Penagarikano, Mikel / Rodriguez-Fuentes, Luis Javier / Bordel, Germán:
"On the use of lattices of time-synchronous cross-decoder phone co-occurrences in a SVM-phonotactic language recognition system",
2901-2904.
Tawara, Naohiro / Watanabe, Shinji / Ogawa, Tetsuji / Kobayashi, Tetsunori:
"Speaker clustering based on utterance-oriented dirichlet process mixture model",
2905-2908.
Silovsky, Jan / Prazak, Jan / Cerva, Petr / Zdansky, Jindrich / Nouza, Jan:
"PLDA-based clustering for speaker diarization of broadcast streams",
2909-2912.
Soufifar, Mehdi / Kockmann, Marcel / Burget, Lukáš / Plchot, Oldřich / Glembek, Ondřej / Svendsen, Torbjørn:
"ivector approach to phonotactic language recognition",
2913-2916.
Alberti, Chris / Bacchiani, Michiel:
"Discriminative features for language identification",
2917-2920.
Fox, Robert Allen / Jacewicz, Ewa:
"Perceptual sensitivity to dialectal and generational variations in vowels",
2921-2924.
Yang, Qian / Jin, Qin / Schultz, Tanja:
"Investigation of cross-show speaker diarization",
2925-2928.
Siivola, Vesa / Pellom, Bryan / Sills, Meagan:
"Language identification for text chats",
2929-2932.
Lee, Kong Aik / You, Chang Huai / Hautamäki, Ville / Larcher, Anthony / Li, Haizhou:
"Spoken language recognition in the latent topic simplex",
2933-2936.
Paralinguistic Information - Analysis and Tools
Gottsmann, Frederike / Harwardt, Corinna:
"Investigating robustness of spectral moments on normal- and high-effort speech",
2937-2940.
Harwardt, Corinna:
"Comparing the impact of raised vocal effort on various spectral parameters",
2941-2944.
Godin, Keith W. / Hansen, John H. L.:
"Vowel context and speaker interactions influencing glottal open quotient and formant frequency shifts in physical task stress",
2945-2948.
Pakhomov, Serguei / Kotlyar, Michael:
"Prosodic correlates of individual physiological response to stress",
2949-2952.
Charfuelan, Marcela / Schröder, Marc:
"The vocal effort of dominance in scenario meetings",
2953-2956.
Patel, Sona / Shrivastav, Rahul:
"A preliminary model of emotional prosody using multidimensional scaling",
2957-2960.
Kim, Jangwon / Lee, Sungbok / Narayanan, Shrikanth:
"An exploratory study of the relations between perceived emotion strength and articulatory kinematics",
2961-2964.
Ishi, Carlos T. / Ishiguro, Hiroshi / Hagita, Norihiro:
"Improved acoustic characterization of breathy and whispery voices",
2965-2968.
Govind, D. / Prasanna, S. R. M. / Yegnanarayana, B.:
"Neutral to target emotion conversion using source and suprasegmental information",
2969-2972.
Truong, Khiet P. / Poppe, Ronald / Kok, Iwan de / Heylen, Dirk:
"A multimodal analysis of vocal and visual backchannels in spontaneous dialogs",
2973-2976.
Malandrakis, Nikos / Potamianos, Alexandros / Iosif, Elias / Narayanan, Shrikanth:
"Kernel models for affective lexicon creation",
2977-2980.
Speech and Language Processing-Based Assistive Technologies and Health Applications (Special Session)
Sturim, Douglas / Torres-Carrasquillo, Pedro A. / Quatieri, Thomas F. / Malyska, Nicolas / McCree, Alan:
"Automatic detection of depression in speech using Gaussian mixture modeling with factor analysis",
2981-2984.
Bunnell, H. Timothy / Lilley, Jason / Soli, Sigfrid D. / Pal, Ivan:
"Utterance verification for automating the hearing in noise test (HINT)",
2985-2988.
Mower, Emily / Lee, Chi-Chun / Gibson, James / Chaspari, Theodora / Williams, Marian E. / Narayanan, Shrikanth:
"Analyzing the nature of ECA interactions in children with autism",
2989-2992.
Athanaselis, Theologos / Bakamidis, Stelios / Dologlou, Ioannis / Argyriou, Evmorfia N. / Symvonis, Antonis:
"Incorporating speech recognition engine into an intelligent assistive reading system for dyslexic students",
2993-2996.
Cummins, Nicholas / Epps, Julien / Breakspear, Michael / Goecke, Roland:
"An investigation of depressed speech detection: features and normalization",
2997-3000.
Sanchez, Michelle Hewlett / Vergyri, Dimitra / Ferrer, Luciana / Richey, Colleen / Garcia, Pablo / Knoth, Bruce / Jarrold, William:
"Using prosodic and spectral features in detecting depression in elderly males",
3001-3004.
Middag, Catherine / Bocklet, Tobias / Martens, Jean-Pierre / Nöth, Elmar:
"Combining phonological and acoustic ASR-free features for pathological speech intelligibility assessment",
3005-3008.
Hofe, Robin / Ell, Stephen R. / Fagan, Michael J. / Gilbert, James M. / Green, Phil D. / Moore, Roger K. / Rybchenko, Sergey I.:
"Speech synthesis parameter generation for the assistive silent speech interface MVOCA",
3009-3012.
Heeman, Peter A. / McMillin, Andy / Yaruss, J. Scott:
"Computer-assisted disfluency counts for stuttered speech",
3013-3016.
Hummel, Richard / Chan, Wai-Yip / Falk, Tiago H.:
"Spectral features for automatic blind intelligibility estimation of spastic dysarthric speech",
3017-3020.
Prud'hommeaux, Emily T. / Roark, Brian:
"Extraction of narrative recall patterns for neuropsychological assessment",
3021-3024.
Kunikoshi, Aki / Qiao, Yu / Saito, Daisuke / Minematsu, Nobuaki / Hirose, Keikichi:
"Gesture design of hand-to-speech converter derived from speech-to-hand converter based on probabilistic integration model",
3025-3028.
Sasou, Akira:
"Powered wheelchair control using acoustic-based recognition of head gesture accompanying speech",
3029-3032.
Blanco, José Luis / Fernández, Rubén / Torre, Doroteo / Caminero, F. Javier / López, Eduardo:
"Analyzing training dependencies and posterior fusion in discriminant classification of apnea patients based on sustained and connected speech",
3033-3036.
Crowdsourcing for Speech Processing (Special Session)
Parent, Gabriel / Eskenazi, Maxine:
"Speaking to the crowd: looking at past achievements in using crowdsourcing for speech and predicting future challenges",
3037-3040.
Lee, Chia-ying / Glass, James:
"A transcription task for crowdsourcing with automatic quality control",
3041-3044.
Audhkhasi, Kartik / Georgiou, Panayiotis G. / Narayanan, Shrikanth:
"Reliability-weighted acoustic model adaptation using crowd sourced transcriptions",
3045-3048.
Cooke, Martin / Barker, Jon / Lecumberri, Maria Luisa Garcia / Wasilewski, Krzysztof:
"Crowdsourcing for word recognition in noise",
3049-3052.
Buchholz, Sabine / Latorre, Javier:
"Crowdsourcing preference tests, and how to detect cheating",
3053-3056.
McGraw, Ian / Glass, James / Seneff, Stephanie:
"Growing a spoken language interface on Amazon Mechanical Turk",
3057-3060.
Jurčíček, F. / Keizer, S. / Gašić, Milica / Mairesse, François / Thomson, B. / Yu, K. / Young, Steve:
"Real user evaluation of spoken dialogue systems using Amazon Mechanical Turk",
3061-3064.
Gelas, Hadrien / Abate, Solomon Teferra / Besacier, Laurent / Pellegrino, François:
"Quality assessment of crowdsourcing transcriptions for african languages",
3065-3068.
Evanini, Keelan / Zechner, Klaus:
"Using crowdsourcing to provide prosodic annotations for non-native speech",
3069-3072.
Goto, Masataka / Ogata, Jun:
"Podcastle: recent advances of a spoken document retrieval service improved by anonymous user contributions",
3073-3076.
Spoken Language Processing of Human-Human Conversations (Special Session)
Valente, Fabio / Vinciarelli, Alessandro:
"Language-independent socio-emotional role recognition in the AMI meetings corpus",
3077-3080.
Levitan, Rivka / Hirschberg, Julia:
"Measuring acoustic-prosodic entrainment with respect to multiple levels and dimensions",
3081-3084.
Park, Youngja:
"Automatic call quality monitoring using cost-sensitive classification",
3085-3088.
Iwata, Tomoharu / Watanabe, Shinji:
"Learning influences from word use in polylogue",
3089-3092.
Wang, Wen / Precoda, Kristin / Richey, Colleen / Raymond, Geoffrey:
"Identifying agreement/disagreement in conversational speech: a cross-lingual study",
3093-3096.
Neiberg, Daniel / Gustafson, Joakim:
"A dual channel coupled decoder for fillers and feedback",
3097-3100.
Lee, Chi-Chun / Katsamanis, Athanasios / Black, Matthew P. / Baucom, Brian R. / Georgiou, Panayiotis G. / Narayanan, Shrikanth:
"An analysis of PCA-based vocal entrainment measures in married couples' affective spoken interactions",
3101-3104.
Speech and Audio Processing for Human-Robot Interaction (Special Session)
Schillingmann, Lars / Wagner, Petra / Munier, Christian / Wrede, Britta / Rohlfing, Katharina:
"Using prominence detection to generate acoustic feedback in tutoring scenarios",
3105-3108.
Otsuka, Takuma / Nakadai, Kazuhiro / Ogata, Tetsuya / Okuno, Hiroshi G.:
"Bayesian extension of MUSIC for sound source localization and tracking",
3109-3112.
Wöllmer, Martin / Weninger, Felix / Steidl, Stefan / Batliner, Anton / Schuller, Björn:
"Speech-based non-prototypical affect recognition for child-robot interaction in reverberated environments",
3113-3116.
Maazaoui, Mounira / Grenier, Yves / Abed-Meraim, Karim:
"Blind source separation for robot audition using fixed beamforming with HRTFs",
3117-3120.
Tahon, Marie / Delaborde, Agnes / Devillers, Laurence:
"Real-life emotion detection from speech in human-robot interaction: experiments across diverse corpora with child and adult voices",
3121-3124.
Attabi, Yazid / Dumouchel, Pierre:
"Weighted ordered classes - nearest neighbors: a new framework for automatic emotion recognition from speech",
3125-3128.
Doukhan, David / Rilliard, Albert / Rosset, Sophie / Adda-Decker, Martine / d'Alessandro, Christophe:
"Prosodic analysis of a corpus of tales",
3129-3132.
Ishi, Carlos T. / Ishiguro, Hiroshi / Hagita, Norihiro:
"Analysis of acoustic-prosodic features related to paralinguistic information carried by interjections in dialogue speech",
3133-3136.
Heckmann, Martin / Nakadai, Kazuhiro / Nakajima, Hirofumi:
"Robust intonation pattern classification in human robot interaction",
3137-3140.
Sumiyoshi, Takashi / Togami, Masahito / Obuchi, Yasunari:
"ASR for human-symbiotic robot “EMIEW2” with mechanical noise and floor-level noise reduction",
3141-3144.
Speech Technology for Under-Resourced Languages (Special Session)
Vu, Ngoc Thang / Kraus, Franziska / Schultz, Tanja:
"Rapid building of an ASR system for under-resourced languages based on multilingual unsupervised training",
3145-3148.
Mandal, Shyamal Kr. Das / Chandra, Somnath / Lata, Swaran / Datta, A. K.:
"Places and manner of articulation of Bangla consonants: an EPG based study",
3149-3152.
Davel, Marelie H. / Heerden, Charl van / Kleynhans, Neil / Barnard, Etienne:
"Efficient harvesting of internet audio for resource-scarce ASR",
3153-3156.
Sečujski, Milan / Pekar, Darko / Jakovljević, Nikša:
"Automatic prosody generation for serbo-croatian speech synthesis based on regression trees",
3157-3160.
Karpov, Alexey / Kipyatkova, Irina / Ronzhin, Andrey:
"Very large vocabulary ASR for spoken Russian with syntactic and morphemic analysis",
3161-3164.
Kempton, Timothy / Moore, Roger K. / Hain, Thomas:
"Cross-language phone recognition when the target language phoneme inventory is not known",
3165-3168.
Chaudhuri, Sourish / Raj, Bhiksha / Ezzat, Tony:
"A paradigm for limited vocabulary speech recognition based on redundant spectro-temporal feature sets",
3169-3172.
Barroso, N. / López de Ipiña, K. / Ezeiza, A. / Hernández, C. / Ezeiza, N. / Barroso, O. / Susperregi, U. / Barroso, S.:
"Gorup: an ontology-driven audio information retrieval system that suits the requirements of under-resourced languages",
3173-3176.
Vries, Nic J. de / Badenhorst, Jaco / Davel, Marelie H. / Barnard, Etienne / Waal, Alta de:
"Woefzela - an open-source platform for ASR data collection in the developing world",
3177-3180.
Mixdorff, Hansjörg / Mohasi, Lehlohonolo / Machobane, 'Malillo / Niesler, Thomas:
"A study on the perception of tone and intonation in Sesotho",
3181-3184.
Wet, Febe de / Waal, Alta de / Huyssteen, Gerhard B. van:
"Developing a broadband automatic speech recognition system for Afrikaans",
3185-3188.
Kamper, Herman / Niesler, Thomas:
"Multi-accent speech recognition of Afrikaans, black and white varieties of south african English",
3189-3192.
Tantibundhit, C. / Onsuwan, C. / Saimai, T. / Saimai, N. / Thatphithakkul, S. / Chootrakool, P. / Kosawat, K. / Thatphithakkul, N.:
"Perceptual representation of consonant sounds in Thai",
3193-3196.
Mustafa, Mumtaz B. / Ainon, Raja N. / Zainuddin, Roziati / Don, Zuraidah M. / Knowles, Gerry:
"A cross-lingual approach to the development of an HMM-based speech synthesis system for malay",
3197-3200.
Speaker State Challenge - Intoxication and Sleepiness I, II (Special Session)
Schuller, Björn / Steidl, Stefan / Batliner, Anton / Schiel, Florian / Krajewski, Jarek:
"The INTERSPEECH 2011 speaker state challenge",
3201-3204.
Montacié, Claude / Caraty, Marie-José:
"Combining multiple phoneme-based classifiers with audio feature-based classifier for the detection of alcohol intoxication",
3205-3208.
Biadsy, Fadi / Wang, William Yang / Rosenberg, Andrew / Hirschberg, Julia:
"Intoxication detection using phonetic, phonotactic and prosodic cues",
3209-3212.
Bocklet, Tobias / Riedhammer, Korbinian / Nöth, Elmar:
"Drink and speak: on the automatic classification of alcohol intoxication by acoustic, prosodic and text-based features",
3213-3216.
Bone, Daniel / Black, Matthew P. / Li, Ming / Metallinou, Angeliki / Lee, Sungbok / Narayanan, Shrikanth:
"Intoxicated speech detection by fusion of speaker normalized hierarchical features and GMM supervectors",
3217-3220.
Ultes, Stefan / Schmitt, Alexander / Minker, Wolfgang:
"Attention, sobriety checkpoint! can humans determine by means of voice, if someone is drunk… and can automatic classifiers compete?",
3221-3224.
Hönig, Florian / Batliner, Anton / Nöth, Elmar:
"Does it groove or does it stumble - automatic classification of alcoholic intoxication using prosodic features",
3225-3228.
Schiel, Florian:
"Perception of alcoholic intoxication in speech",
3281-3284.
Rahman, Tauhidur / Mariooryad, Soroosh / Keshavamurthy, Shalini / Liu, Gang / Hansen, John H. L. / Busso, Carlos:
"Detecting sleepiness by fusing classifiers trained with novel acoustic features",
3285-3288.
Rodríguez, Albino Nogueiras:
"An HMM-based approach to the INTERSPEECH 2011 speaker state challenge",
3289-3292.
Bozkurt, Elif / Erzin, Engin / Erdem, Çiğdem Eroğlu / Erdem, A. Tanju:
"RANSAC-based training data selection for speaker state recognition",
3293-3296.
Gajšek, Rok / Dobrišek, Simon / Mihelič, France:
"University of Ljubljana system for interspeech 2011 speaker state challenge",
3297-3300.
Huang, Dong-Yan / Ge, Shuzhi Sam / Zhang, Zhengchen:
"Speaker state classification based on fusion of asymmetric SIMPLS and support vector machines",
3301-3304.
Speech Processing Tools (Special Session)
Draxler, Christoph / Altosaar, Toomas / Furui, Sadaoki / Liberman, Mark / Wittenburg, Peter:
"Speech processing tools - an introduction to interoperability",
3229-3232.
Goldman, Jean-Philippe:
"Easyalign: an automatic phonetic alignment tool under praat",
3233-3236.
Villegas, Julián / Cooke, Martin / Aubanel, Vincent / Piccolino-Boniforti, Marco A.:
"Mtrans: a multi-channel, multi-tier speech annotation tool",
3237-3240.
Cerisara, Christophe / Gardent, Claire:
"The JSafran platform for semi-automatic speech processing",
3241-3244.
Wagner, Johannes / Lingenfelser, Florian / André, Elisabeth:
"The social signal interpretation framework (SSI) for real time signal processing and recognition",
3245-3248.
Sloetjes, Han / Wittenburg, Peter / Somasundaram, Aarthy:
"ELAN - aspects of interoperability and functionality",
3249-3252.
Schröder, Marc / Charfuelan, Marcela / Pammi, Sathish / Steiner, Ingmar:
"Open source voice creation toolkit for the MARY TTS platform",
3253-3256.
Steidl, Stefan / Riedhammer, Korbinian / Bocklet, Tobias / Hönig, Florian / Nöth, Elmar:
"Java visual speech components for rapid application development of GUI based speech processing applications",
3257-3260.
Johnston, Michael / Fabbrizio, Giuseppe Di / Urbanek, Simon:
"mtalk - a multimodal browser for mobile services",
3261-3264.
Wrigley, Stuart N. / Hain, Thomas:
"Web-based automatic speech recognition service - webASR",
3265-3268.
Klehr, Markus / Ratzka, Andreas / Roß, Thomas:
"A web based speech transcription workplace",
3269-3272.
Martin, Philippe:
"Winpitch: a multimodal tool for speech analysis of endangered languages",
3273-3276.
Huckvale, Mark:
"Recording caregiver interactions for machine acquisition of spoken language using the KLAIR virtual infant",
3277-3280.
Show & Tell Demonstration - Speech Systems and Applications (Special Session)
Burkhardt, Felix:
"An affective spoken storyteller",
3305-3306.
Wang, Lijuan / Han, Wei / Soong, Frank K. / Huo, Qiang:
"Text driven 3d photo-realistic talking head",
3307-3308.
Arai, Takayuki:
"Physical models producing vowels with pitch variation",
3309-3310.
Mieskes, Margot:
"An engine-independent text-to-speech workplace",
3311-3312.
Carcone, Simone / Giovannella, Carlo:
"An application to test the emotion conveyed by vocal and musical signals",
3313-3314.
Ziółko, Mariusz / Gałka, Jakub / Ziółko, Bartosz / Jadczyk, Tomasz / Skurzok, Dawid / Masior, Mariusz:
"Automatic speech recognition system dedicated for Polish",
3315-3316.
Lee, Kong Aik / Larcher, Anthony / Thai, Helen / Ma, Bin / Li, Haizhou:
"Joint application of speech and speaker recognition for automation and security in smart home",
3317-3318.
Larsson, Staffan / Berman, Alexander / Villing, Jessica:
"Adding a speech cursor to a multimodal dialogue system",
3319-3320.
Christie, S. Thomas / Pakhomov, Serguei:
"Prosody toolkit: integrating HTK, praat and WEKA",
3321-3322.
Francesconi, F. / Ghosh, A. / Riccardi, G. / Ronchetti, M. / Vagin, A.:
"Collecting life logs for experience-based corpora",
3323-3324.
Show & Tell Demonstration - Mobility and Web-Services (Special Session)
Wrigley, Stuart N. / Hain, Thomas:
"Making an automatic speech recognition service freely available on the web",
3325-3326.
Kim, Yeon-Jun / Okken, Thomas / Conkie, Alistair D. / Fabbrizio, Giuseppe Di:
"AT&t voicebuilder: a cloud-based text-to-speech voice builder tool",
3327-3328.
Tucker, Roger / Fry, Dan / Wan, Vincent / Wrigley, Stuart N. / Hain, Thomas:
"Extending audio notetaker to browse webASR transcriptions",
3329-3330.
Ainsley, Samantha / Ha, Linne / Jansche, Martin / Kim, Ara / Nanzawa, Masayuki:
"A web-based tool for developing multilingual pronunciation lexicons",
3331-3332.
Johnston, Michael / Ehlen, Patrick:
"Speak4it and the multimodal semantic interpretation system",
3333-3334.
Alumäe, Tanel / Kitsik, Ahti:
"TSAB - web interface for transcribed speech collections",
3335-3336.
Ljolje, Andrej / Goffin, Vincent / Caseiro, Diamantino / Mishra, Taniya / Gilbert, Mazin:
"Visual voice mail to text on the iphone/ipad",
3337-3338.
Draxler, Christoph:
"Percy - an HTML5 framework for media rich web experiments on mobile devices",
3339-3340.
Huckvale, Mark:
"The KLAIR toolkit for recording interactive dialogues with a virtual infant",
3341-3342.
Nesta, Francesco / Matassoni, Marco / Maganti, HariKrishna:
"Real-time prototype for integration of blind source extraction and robust automatic speech recognition",
3343-3344.