Table of Contents and Access to Abstracts
Keynote Sessions
Fujisaki, Hiroya:
"In search of models in speech communication research",
1-10.
Alwan, Abeer:
"Dealing with limited and noisy data in ASR: a hybrid knowledge-based and statistical approach",
11-15.
Gonzalez-Rodriguez, Joaquin:
"Forensic automatic speaker recognition: fiction or science?",
16-17.
Cassell, Justine:
"Modelling rapport in embodied conversational agents",
18-19.
Segmentation and Classification
Han, Kyu J. / Narayanan, Shrikanth S.:
"Agglomerative hierarchical speaker clustering using incremental Gaussian mixture cluster modeling",
20-23.
Ben-Harush, Oshry / Lapidot, Itshak / Guterman, Hugo:
"Weighted segmental k-means initialization for SOM-based speaker clustering",
24-27.
Ikbal, Shajith / Visweswariah, Karthik:
"Learning essential speaker sub-space using hetero-associative neural networks for speaker clustering",
28-31.
Boakye, Kofi / Vinyals, Oriol / Friedland, Gerald:
"Two's a crowd: improving speaker diarization by automatically identifying and excluding overlapped speech",
32-35.
Nguyen, Trung Hieu / Chng, Eng Siong / Li, Haizhou:
"T-test distance and clustering criterion for speaker diarization",
36-39.
Vijayasenan, Deepu / Valente, Fabio / Bourlard, Hervé:
"Integration of TDOA features in information bottleneck framework for fast speaker diarization",
40-43.
Speech Coding
Ramasubramanian, V. / Harish, D.:
"Low complexity near-optimal unit-selection algorithm for ultra low bit-rate speech coding based on n-best lattice and Viterbi search",
44.
Eksler, Vaclav / Salami, Redwan / Jelinek, Milan:
"A new fast algebraic fixed codebook search algorithm in CELP speech coding",
45-48.
Xu, Hao / Bao, Changchun:
"A novel transcoding algorithm between 3GPP AMR-NB (7.95kbit/s) and ITU-t g.729a (8kbit/s)",
49-52.
Nour-Eldin, Amr H. / Kabal, Peter:
"Mel-frequency cepstral coefficient-based bandwidth extension of narrowband speech",
53-56.
Garcia, Jean-Luc / Marro, Claude / Kövesi, Balazs:
"A PCM coding noise reduction for ITU-t g.711.1",
57-60.
Wältermann, Marcel / Scholz, Kirstin / Möller, Sebastian / Huo, Lu / Raake, Alexander / Heute, Ulrich:
"An instrumental measure for end-to-end speech transmission quality based on perceptual dimensions: framework and realization",
61-64.
Human Conversation and Communication
Peters, Benno / Pfitzinger, Hartmut R.:
"Duration and F0 interval of utterance-final intonation contours in the perception of German sentence modality",
65-68.
Braun, Bettina / Tagliapietra, Lara / Cutler, Anne:
"Contrastive utterances make alternatives salient - cross-modal priming evidence",
69.
Ishizaki, Masato / Den, Yasuharu / Fukashiro, Senshi:
"Exploring a mechanism of speech sychronization using auditory delayed experiments",
70-73.
Pon-Barry, Heather:
"Prosodic manifestations of confidence and uncertainty in spoken language",
74-77.
Fernandez, Raquel / Frampton, Matthew / Dowding, John / Adukuzhiyil, Anish / Ehlen, Patrick / Peters, Stanley:
"Identifying relevant phrases to summarize decisions in spoken meetings",
78-81.
Laskowski, Kornel / Schultz, Tanja:
"Recovering participant identities in meetings from a probabilistic description of vocal interaction",
82-85.
OzPhon08 - Phonetics and Phonology of Australian Aboriginal Languages (Special Session)
Fletcher, Janet / Loakes, Deborah / Butcher, Andrew:
"Coarticulation in nasal and lateral clusters in Warlpiri",
86-89.
Loakes, Deborah / Butcher, Andrew / Fletcher, Janet / Stoakes, Hywel:
"Phonetically prestopped laterals in Australian languages: a preliminary investigation of Warlpiri",
90-93.
Ingram, John / Laughren, Mary / Chapman, Jeff:
"Connected speech processes in Warlpiri",
94.
Pentland, Christina:
"Consonant enhancement in Lamalama, an initial-dropping language of Cape York Peninsula, North Queensland",
95.
Turpin, Myfany:
"Text, rhythm and metrical form in an Aboriginal song series",
96-98.
Acoustic Activity Detection, Pitch Tracking and Analysis
Ishizuka, Kentaro / Araki, Shoko / Kawahara, Tatsuya:
"Statistical speech activity detection based on spatial power distribution for analyses of poster presentations",
99-102.
Kang, Sang-Ick / Song, Ji-Hyun / Lee, Kye-Hwan / Park, Yun-Sik / Chang, Joon-Hyuk:
"A statistical model-based voice activity detection employing minimum classification error technique",
103-106.
Ding, Hongfei / Yamamoto, Koichi / Akamine, Masami:
"Comparative evaluation of different methods for voice activity detection",
107-110.
Shafiee, Soheil / Almasganj, Farshad / Jafari, Ayyoob:
"Speech/non-speech segments detection based on chaotic and prosodic features",
111-114.
Zieger, Christian / Omologo, Maurizio:
"Acoustic event classification using a distributed microphone network with a GMM/SVM combined algorithm",
115-118.
Obuchi, Yasunari / Togami, Masahito / Sumiyoshi, Takashi:
"Intentional voice command detection for completely hands-free speech interface in home environments",
119-122.
Butko, Taras / Temko, Andrey / Nadeu, Climent / Canton, Cristian:
"Fusion of audio and video modalities for detection of acoustic events",
123-126.
Weiss, Ron J. / Kristjansson, Trausti:
"DySANA: dynamic speech and noise adaptation for voice activity detection",
127-130.
Petrick, Rico / Unoki, Masashi / Mittal, Anish / Segura, Carlos / Hoffmann, Rüdiger:
"A comprehensive study on the effects of room reverberation on fundamental frequency estimation",
131-134.
Hussein, H. / Wolff, M. / Jokisch, Oliver / Duckhorn, F. / Strecha, G. / Hoffmann, Rüdiger:
"A hybrid speech signal based algorithm for pitch marking using finite state machines",
135-138.
Ohishi, Yasunori / Kameoka, Hirokazu / Kashino, Kunio / Takeda, Kazuya:
"Parameter estimation method of F0 control model for singing voices",
139-142.
Vishnubhotla, Srikanth / Espy-Wilson, Carol Y.:
"An algorithm for multi-pitch tracking in co-channel speech",
143-146.
Wohlmayr, Michael / Pernkopf, Franz:
"Multipitch tracking using a factorial hidden Markov model",
147-150.
Li, Ming / Cao, Chuan / Wang, Di / Lu, Ping / Fu, Qiang / Yan, Yonghong:
"Cochannel speech separation using multi-pitch estimation and model based voiced sequential grouping",
151-154.
Martin, Philippe:
"Crosscorrelation of adjacent spectra enhances fundamental frequency tracking",
155-158.
Single- and Multichannel Speech Enhancement I, II
Malek, Jiri / Koldovsky, Zbynek / Zdansky, Jindrich / Nouza, Jan:
"Enhancement of noisy speech recordings via blind source separation",
159-162.
Ishibashi, Takaaki / Nakashima, Hidetoshi / Gotanda, Hiromu:
"Studies on estimation of the number of sources in blind source separation",
163-166.
Ramasubramanian, V. / Vijaywargi, Deepak:
"Speech enhancement based on hypothesized Wiener filtering",
167-170.
Li, Junfeng / Jiang, Hui / Akagi, Masato:
"Psychoacoustically-motivated adaptive β-order generalized spectral subtraction based on data-driven optimization",
171-174.
Nand K, Krishna / Sreenivas, T. V.:
"Two stage iterative Wiener filtering for speech enhancement",
175-178.
Ding, Pei / Hao, Jie:
"Assessment of correlation between objective measures and speech recognition performance in the evaluation of speech enhancement",
179-182.
Lyons, James G. / Paliwal, Kuldip K.:
"Effect of compressing the dynamic range of the power spectrum in modulation filtering based speech enhancement",
387-390.
So, Stephen / Paliwal, Kuldip K.:
"A long state vector kalman filter for speech enhancement",
391-394.
Kundu, Achintya / Chatterjee, Saikat / Sreenivas, T. V.:
"Subspace based speech enhancement using Gaussian mixture model",
395-398.
Das, Amit / Hansen, John H. L.:
"Generalized parametric spectral subtraction using weighted Euclidean distortion",
399-402.
Miyake, Nobuyuki / Takiguchi, Tetsuya / Ariki, Yasuo:
"Sudden noise reduction based on GMM with noise power estimation",
403-406.
Alam, Md. Jahangir / Selouani, Sid-Ahmed / O'Shaughnessy, Douglas / Jebara, Sofia Ben:
"Speech enhancement using a wiener denoising technique and musical noise reduction",
407-410.
Wilson, Kevin W. / Raj, Bhiksha / Smaragdis, Paris:
"Regularized non-negative matrix factorization with temporal dependencies for speech denoising",
411-414.
Zou, Xin / Jančovič, Peter / Kokuer, Munevver / Russell, Martin J.:
"ICA-based MAP speech enhancement with multiple variable speech distribution models",
415-418.
Weiss, Ron J. / Mandel, Michael I. / Ellis, Daniel P. W.:
"Source separation based on binaural cues and source model constraints",
419-422.
Kumatani, Kenichi / McDonough, John / Rauch, Barbara / Garner, Philip N. / Li, Weifeng / Dines, John:
"Maximum kurtosis beamforming with the generalized sidelobe canceller",
423-426.
Furuya, Ken'ichi / Kataoka, Akitoshi / Haneda, Yoichi:
"Noise robust speech dereverberation using constrained inverse filter",
427-430.
Rahmani, Mohsen / Akbari, Ahmad / Ayad, Beghdad:
"A dual microphone coherence based method for speech enhancement in headsets",
431-434.
Tashev, Ivan / Mihov, Slavy / Gleghorn, Tyler / Acero, Alex:
"Sound capture system and spatial filter for small devices",
435-438.
Cheng, Ning / Liu, Wen-ju / Li, Peng / Xu, Bo:
"An effective microphone array post-filter in arbitrary environments",
439-442.
Cho, Kook / Okumura, Hajime / Nishiura, Takanobu / Yamashita, Yoichi:
"Localization of multiple sound sources based on inter-channel correlation using a distributed microphone system",
443-446.
Zhang, Heng / Fu, Qiang / Yan, Yonghong:
"A frequency domain approach for speech enhancement with directionality using compact microphone array",
447-450.
Spoken Language Systems I, II
Komatani, Kazunori / Kawahara, Tatsuya / Okuno, Hiroshi G.:
"Predicting ASR errors by exploiting barge-in rate of individual users for spoken dialogue systems",
183-186.
Katsumaru, Masaki / Komatani, Kazunori / Ogata, Tetsuya / Okuno, Hiroshi G.:
"Expanding vocabulary for recognizing user's abbreviations of proper nouns without increasing ASR error rates in spoken dialogue systems",
187-190.
Williams, Jason D.:
"Exploiting the ASR n-best by tracking multiple dialog state hypotheses",
191-194.
Makalic, Enes / Zukerman, Ingrid / Niemann, Michael:
"A spoken language interpretation component for a robot dialogue system",
195-198.
Cesari, Federico / Franco, Horacio / Myers, Gregory K. / Bratt, Harry:
"MUESLI: multiple utterance error correction for a spoken language interface",
199-202.
Conrod, Sarah / Basson, Sara / Kanevsky, Dimitri:
"Methods to optimize transcription of on-line media",
203-206.
Ito, Akinori / Meguro, Toyomi / Makino, Shozo / Suzuki, Motoyuki:
"Discrimination of task-related words for vocabulary design of spoken dialog systems",
207-210.
Hori, Chiori / Ohtake, Kiyonori / Misu, Teruhisa / Kashioka, Hideki / Nakamura, Satoshi:
"Dialog management using weighted finite-state transducers",
211-214.
Yoshimi, Yoshitaka / Kakitsuba, Ryota / Nankaku, Yoshihiko / Lee, Akinobu / Tokuda, Keiichi:
"Probabilistic answer selection based on conditional random fields for spoken dialog system",
215-218.
Eskenazi, Maxine / Black, Alan W. / Raux, Antoine / Langner, Brian:
"Let's go lab: a platform for evaluation of spoken dialog systems with real world users",
219.
Batista, Fernando / Mamede, Nuno / Trancoso, Isabel:
"The impact of language dynamics on the capitalization of broadcast news",
220-223.
Paulik, Matthias / Waibel, Alex:
"Lightly supervised acoustic model training on EPPS recordings",
224-227.
Servan, Christophe / Bechet, Frédéric:
"Fast call-classification system development without in-domain training data",
228-231.
Hoffmeister, Björn / Schlüter, Ralf / Ney, Hermann:
"iCNC and iROVER: the limits of improving system combination with classification?",
232-235.
Hahn, Stefan / Lehnen, Patrick / Ney, Hermann:
"System combination for spoken language understanding",
236-239.
Takeuchi, Shota / Cincarek, Tobias / Kawanami, Hiromichi / Saruwatari, Hiroshi / Shikano, Kiyohiro:
"Question and answer database optimization using speech recognition results",
451-454.
Saruwatari, Hiroshi / Takahashi, Yu / Sakai, Hiroyuki / Takeuchi, Shota / Cincarek, Tobias / Kawanami, Hiromichi / Shikano, Kiyohiro:
"Development and evaluation of hands-free spoken dialogue system for railway station guidance",
455-458.
Stent, Amanda J. / Bangalore, Srinivas:
"Statistical shared plan-based dialog management",
459-462.
Herm, Ota / Schmitt, Alexander / Liscombe, Jackson:
"When calls go wrong: how to detect problematic calls based on log-files and emotions?",
463-466.
Gillick, Dan / Hakkani-Tür, Dilek / Levit, Michael:
"Unsupervised learning of edit parameters for matching name variants",
467-470.
Cevik, Mert / Weng, Fuliang / Lee, Chin-Hui:
"Detection of repetitions in spontaneous speech in dialogue sessions",
471-474.
Camelin, Nathalie / Damnati, Geraldine / Bechet, Frédéric / Mori, Renato De:
"Automatic customer feedback processing: alarm detection in open question spoken messages",
475-478.
Balakrishna, Mithun / Tatu, Marta / Moldovan, Dan:
"Minimal training based semantic categorization in a voice activated question answering (VAQA) system",
479-482.
Thomson, B. / Gašić, M. / Keizer, S. / Mairesse, F. / Schatzmann, J. / Yu, K. / Young, Steve:
"User study of the Bayesian update of dialogue state approach to dialogue management",
483-486.
Ikeda, Satoshi / Komatani, Kazunori / Ogata, Tetsuya / , Hiroshi G. Okuno / Okuno, Hiroshi G.:
"Extensibility verification of robust domain selection against out-of-grammar utterances in multi-domain spoken dialogue system",
487-490.
Jan, Ea-Ee / Stewart, Osamuyimen / Co, Raymond / Lubensky, David:
"Improving large scale alphanumeric string recognition using redundant information",
491-494.
Demuynck, Kris / Roelens, Jan / Compernolle, Dirk Van / Wambacq, Patrick:
"SPRAAK: an open source "SPeech recognition and automatic annotation kit"",
495.
Vacher, Michel / Fleury, Anthony / Serignat, Jean-François / Noury, Norbert / Glasson, Hubert:
"Preliminary evaluation of speech/sound recognition for telemedicine application in a real environment",
496-499.
Turunen, Markku / Melto, Aleksi / Kainulainen, Anssi / Hakulinen, Jaakko:
"Mobidic - a mobile dictation and notetaking application",
500-503.
Hain, Thomas / Hannani, Asmaa El / Wrigley, Stuart N. / Wan, Vincent:
"Automatic speech recognition for scientific purposes - webASR",
504-507.
Meinedo, Hugo / Viveiros, Marcio / Neto, Joao:
"Evaluation of a live broadcast news subtitling system for portuguese",
508-511.
Emotion and Expression I, II
Suzuki, Tomoko / Ikemoto, Machiko / Sano, Tomoko / Kinoshita, Toshihiko:
"Multidimensional features of emotional speech",
240.
Boufaden, Narjes / Dumouchel, Pierre:
"Leveraging emotion detection using emotions from yes-no answers",
241-244.
Millhouse, Thomas J. / Kenny, Dianna T.:
"Vowel placement during operatic singing: 'come si parla' or 'aggiustamento'?",
245-248.
Kato, Yumiko O. / Hirose, Yoshifumi / Kamai, Takahiro:
"Study on strained rough voice as a conveyer of rage",
249-252.
Begum, Mumtaz / Ainon, Raja N. / Zainuddin, Roziati / Don, Zuraidah M. / Knowles, Gerry:
"Integrating rule and template-based approaches for emotional Malay speech synthesis",
253-256.
Busso, Carlos / Narayanan, Shrikanth S.:
"The expression and perception of emotions: comparing assessments of self versus others",
257-260.
Krahmer, Emiel / Swerts, Marc:
"On the role of acting skills for the collection of simulated emotional speech",
261-264.
Schuller, Björn / Wimmer, Matthias / Arsic, Dejan / Moosmayr, Tobias / Rigoll, Gerhard:
"Detection of security related affect and behaviour in passenger transport",
265-268.
Goudbeek, Martijn / Goldman, Jean Philippe / Scherer, Klaus R.:
"Emotions and articulatory precision",
317.
Truong, Khiet P. / Neerincx, Mark A. / Leeuwen, David A. van:
"Assessing agreement of observer- and self-annotations in spontaneous multimodal emotion data",
318-321.
Arimoto, Yoshiko / Kawatsu, Hiromi / Ohno, Sumio / Iida, Hitoshi:
"Emotion recognition in spontaneous emotional speech for anonymity-protected voice chat systems",
322-325.
Shaikh, Mostafa Al Masum / Molla, Md. Khademul Islam / Hirose, Keikichi:
"Assigning suitable phrasal tones and pitch accents by sensing affective information from text to synthesize human-like speech",
326-329.
Yanushevskaya, Irena / Ní Chasaide, Ailbhe / Gobl, Christer:
"Cross-language study of vocal correlates of affective states",
330-333.
Swerts, Marc / Krahmer, Emiel:
"Gender-related differences in the production and perception of emotion",
334-337.
Automatic Speech Recognition: Acoustic Models I-III
Li, Jinyu / Yan, Zhi-Jie / Lee, Chin-Hui / Wang, Ren-Hua:
"Soft margin estimation with various separation levels for LVCSR",
269-272.
Heigold, Georg / Lehnen, Patrick / Schlüter, Ralf / Ney, Hermann:
"On the equivalence of Gaussian and log-linear HMMs",
273-276.
Kanevsky, Dimitri / Sainath, Tara N. / Ramabhadran, Bhuvana / Nahamoo, David:
"Generalization of extended baum-welch parameter estimation for discriminative training and decoding",
277-280.
Liu, Peng / Soong, Frank K.:
"An ellipsoid constrained quadratic programming perspective to discriminative training of HMMs",
281-284.
Yu, Dong / Deng, Li / Gong, Yifan / Acero, Alex:
"Discriminative training of variable-parameter HMMs for noise robust speech recognition",
285-288.
Droppo, Jasha / Seltzer, Michael L. / Acero, Alex / Chiu, Yu-Hsiang Bosco:
"Towards a non-parametric acoustic model: an acoustic decision tree for observation probability calculation",
289-292.
Bell, Peter / King, Simon:
"A shrinkage estimator for speech recognition with full covariance HMMs",
910-913.
Bell, Peter / King, Simon:
"Covariance updates for discriminative training by constrained line search",
914.
Mak, Brian / Ko, Tom:
"Min-max discriminative training of decoding parameters using iterative linear programming",
915-918.
Willett, Daniel / He, Chuang:
"Discriminative training for complementariness in system combination",
919.
Saon, George / Povey, Daniel:
"Penalty function maximization for large margin HMM training",
920-923.
Bolaños, Daniel / Ward, Wayne:
"Implicit state-tying for support vector machines based speech recognition",
924-927.
Aradilla, Guillermo / Bourlard, Hervé / Doss, Mathew Magimai:
"Using KL-based acoustic models in a large vocabulary recognition task",
928-931.
Shiota, Sayaka / Hashimoto, Kei / Zen, Heiga / Nankaku, Yoshihiko / Lee, Akinobu / Tokuda, Keiichi:
"Acoustic modeling based on model structure annealing for speech recognition",
932-935.
Hashimoto, Kei / Zen, Heiga / Nankaku, Yoshihiko / Lee, Akinobu / Tokuda, Keiichi:
"Bayesian context clustering using cross valid prior distribution for HMM-based speech recognition",
936-939.
Ajmera, Jitendra / Akamine, Masami:
"Speech recognition using soft decision trees",
940-943.
Shi, Yu / Seide, Frank / Soong, Frank K.:
"GPU-accelerated Gaussian clustering for fMPE discriminative training",
944-947.
Hifny, Yasser / Gao, Yuqing:
"Discriminative training using the trusted expectation maximization",
948-951.
Huang, Jui-Ting / Hasegawa-Johnson, Mark:
"Maximum mutual information estimation with unlabeled data for phonetic classification",
952-955.
Tyagi, Vivek:
"Maximum accept and reject (MARS) training of HMM-GMM speech recognition systems",
956-959.
Srinivasan, Sundar / Ma, Tao / May, Daniel / Lazarou, Georgios / Picone, Joseph:
"Nonlinear mixture autoregressive hidden Markov models for speech recognition",
960-963.
Cardinal, Patrick / Dumouchel, Pierre / Boulianne, Gilles / Comeau, Michel:
"GPU accelerated acoustic likelihood computations",
964-967.
Zhang, Qingqing / Li, Ta / Pan, Jielin / Yan, Yonghong:
"Nonnative speech recognition based on state-candidate bilingual model modification",
2366-2369.
Schuller, Björn / Zhang, Xiaohua / Rigoll, Gerhard:
"Prosodic and spectral features within segment-based acoustic modeling",
2370-2373.
Ma, Jeff / Schwartz, Richard:
"Unsupervised versus supervised training of acoustic models",
2374-2377.
Sainath, Tara N. / Zue, Victor:
"A comparison of broad phonetic and acoustic units for noise robust segment-based phonetic recognition",
2378-2381.
Shinozaki, Takahiro / Furui, Sadaoki / Kawahara, Tatsuya:
"Aggregated cross-validation and its efficient application to Gaussian mixture optimization",
2382-2385.
Matton, Mike / Compernolle, Dirk Van / Cools, Ronald:
"A minimum classification error based distance measure for template based speech recognition",
2386-2389.
Siniscalchi, Sabato Marco / Svendsen, Torbjørn / Lee, Chin-Hui:
"A penalized logistic regression approach to detection based phone classification",
2390-2393.
Abad, Alberto / Neto, João:
"Incorporating acoustical modelling of phone transitions in an hybrid ANN/HMM speech recognizer",
2394-2397.
McDermott, Erik / Nakamura, Atsushi:
"Flexible discriminative training based on equal error group scores obtained from an error-indexed forward-backward algorithm",
2398-2401.
Garau, Giulia / Renals, Steve:
"Pitch adaptive features for LVCSR",
2402-2405.
Bartels, Chris D. / Bilmes, Jeff A.:
"Using syllable nuclei locations to improve automatic speech recognition in the presence of burst noise",
2406-2409.
Hong, Hyejin / Kim, Sunhee / Chung, Minhwa:
"Effects of allophones on the performance of Korean speech recognition",
2410-2413.
Pinto, Joel / Hermansky, Hynek:
"Combining evidence from a generative and a discriminative model in phoneme recognition",
2414-2417.
Thambiratnam, K. / Seide, Frank:
"Fragmented context-dependent syllable acoustic models",
2418-2421.
Hu, Hongwei / Russell, Martin J.:
"Speech recognition using non-linear trajectories in a formant-based articulatory layer of a multiple-level segmental HMM",
2422-2425.
Plahl, Ch. / Hoffmeister, Björn / Hwang, M.-Y. / Lu, D. / Heigold, Georg / Loof, Jonas / Schlüter, Ralf / Ney, Hermann:
"Recent improvements of the RWTH GALE Mandarin LVCSR system",
2426-2429.
Vicsi, Klára / Szaszák, György:
"Using prosody for the improvement of ASR - sentence modality recognition",
2877-2880.
Accent and Language Identification
D'Arcy, Shona / Russell, Martin J.:
"Experiments with the ABI (accents of the british isles) speech corpus",
293-296.
Castaldo, Fabio / Dalmasso, Emanuele / Laface, Pietro / Colibro, Daniele / Vair, Claudio:
"Politecnico di Torino system for the 2007 NIST language recognition evaluation",
297-300.
Hubeika, Valiantsina / Burget, Lukáš / Matějka, Pavel / Schwarz, Petr:
"Discriminative training and channel compensation for acoustic language recognition",
301-304.
Wu, Tingyao / Karsmakers, Peter / Van hamme, Hugo / Compernolle, Dirk Van:
"Comparison of variable selection methods and classifiers for native accent identification",
305-308.
Campbell, W. M. / Sturim, Douglas E. / Torres-Carrasquillo, Pedro A. / Reynolds, Douglas A.:
"A comparison of subspace feature-domain methods for language recognition",
309-312.
BenZeghiba, Mohamed Faouzi / Gauvain, Jean-Luc / Lamel, Lori:
"Context-dependent phone models and models adaptation for phonotactic language recognition",
313-316.
Special Session: PANZE 2008 - Phonetics and Phonology of Australian and New Zealand English
Watson, Catherine I. / Maclagan, Margaret / King, Jeanette / Harlow, Ray:
"The English pronunciation of successive groups of Maori speakers",
338-341.
Cox, Felicity / Palethorpe, Sallyanne:
"Reversal of short front vowel raising in Australian English",
342-345.
Price, Jennifer:
"GOOSE on the move: a study of /u/-fronting in Australian news speech",
346.
Butcher, Andrew / Anderson, Victoria:
"The vowels of Australian Aboriginal English",
347-350.
Mannell, Robert H.:
"Perception and production of /i:/, /i@/ and /e:/ in australian English",
351-354.
Speaker Recognition and Diarisation
Zajíc, Zbyněk / Machlica, Lukáš / Padrta, Aleš / Vaněk, Jan / Radová, Vlasta:
"An expert system in speaker verification task",
355-358.
Dean, David / Sridharan, Sridha / Lucey, Patrick:
"Cascading appearance-based features for visual speaker verification",
359-362.
Markov, Konstantin / Nakamura, Satoshi:
"Improved novelty detection for online GMM based speaker diarization",
363-366.
Mezaache, Salah Eddine / Bonastre, Jean-François / Matrouf, Driss:
"Analysis of impostor tests with high scores in NIST-SRE context",
367-370.
Larcher, Anthony / Bonastre, Jean-François / Mason, John S. D.:
"Reinforced temporal structure information for embedded utterance-based speaker recognition",
371-374.
Gerber, Michael / Pfister, Beat:
"Fast search for common segments in speech signals for speaker verification",
375-378.
Chetty, Girija / Wagner, Michael:
"Audio-visual multilevel fusion for speech and speaker recognition",
379-382.
Luque, J. / Segura, Carlos / Hernando, Javier:
"Clustering initialization based on spatial information for speaker diarization of meetings",
383-386.
Perception, Production, Discourse and Dialog
Beskow, Jonas / Bruce, Gösta / Enflo, Laura / Granström, Björn / Schötz, Susanne:
"Recognizing and modelling regional varieties of Swedish",
512-515.
Hajek, John / Stevens, Mary:
"Vowel duration, compression and lengthening in stressed syllables in central and southern varieties of standard Italian",
516-519.
Ma, Joan K.-Y. / Ciocca, Valter / Whitehill, Tara L.:
"Acoustic cues for the perception of intonation in Cantonese",
520-523.
Leemann, Adrian / Siebenhaar, Beat:
"Perception of dialectal prosody",
524-527.
Kroos, Christian / Dreves, Ashlie:
"Does the Mcgurk effect rely on processing time constraints?",
528.
Kuratate, Takaaki / Ayers, Kathryn / Kim, Jeesun / Burnham, Denis:
"Exploring the Uncanny Valley Effect with talking heads",
529.
Kvale, Knut / Halvorsrud, Ragnhild:
"How do the elderly talk to a natural language call routing system?",
530-533.
Nishimura, Ryota / Kitaoka, Norihide / Nakagawa, Seiichi:
"Analysis of relationship between impression of human-to-human conversations and prosodic change and its modeling",
534-537.
Saarni, Tuomo / Hakokari, Jussi / Isoaho, Jouni / Salakoski, Tapio:
"Utterance-level normalization for relative articulation rate analysis",
538-541.
Tietze, Martin / Demberg, Vera / Moore, Johanna D.:
"Syntactic complexity induces explicit grounding in the Maptask corpus",
542.
Winterboer, Andi / Moore, Johanna D. / Ferreira, Fernanda:
"Do discourse cues facilitate recall in information presentation messages?",
543.
Hattori, Noriko:
"Structured heterogeneity of English stress variants",
544.
Sato, Shota / Kimura, Taro / Horiuchi, Yasuo / Nishida, Masafumi / Kuroiwa, Shingo / Ichikawa, Akira:
"A method for automatically estimating F0 model parameters and a speech re-synthesis tool using F0 model and STRAIGHT",
545-548.
Single-Channel Speech Enhancement
Stark, Anthony P. / Wojcicki, Kamil K. / Lyons, James G. / , Kuldip K. Paliwal / Paliwal, Kuldip K.:
"Noise driven short-time phase spectrum compensation procedure for speech enhancement",
549-552.
Faubel, Friedrich / McDonough, John / Klakow, Dietrich:
"A phase-averaged model for the relationship between noisy speech, clean speech and noise in the log-mel domain",
553-556.
Brouckxon, Henk / Verhelst, Werner / Schuymer, Bart De:
"Time and frequency dependent amplification for speech intelligibility enhancement in noisy environments",
557-560.
Mohammadi, Mahdi / Zamani, Behzad / Nasersharif, Babak / Rahmani, Mohsen / Akbari, Ahmad:
"A wavelet based speech enhancement method using noise classification and shaping",
561-564.
Alam, Md. Jahangir / O'Shaughnessy, Douglas / Selouani, Sid-Ahmed:
"Speech enhancement based on novel two-step a priori SNR estimators",
565-568.
Du, Jun / Huo, Qiang:
"A speech enhancement approach using piecewise linear approximation of an explicit model of environmental distortions",
569-572.
Speech Synthesis Methods I, II
Ling, Zhen-Hua / Richmond, Korin / Yamagishi, Junichi / Wang, Ren-Hua:
"Articulatory control of HMM-based parametric speech synthesis driven by phonetic knowledge",
573-576.
Wu, Yi-Jian / Tokuda, Keiichi:
"Minimum generation error training with direct log spectral distortion on LSPs for HMM-based speech synthesis",
577-580.
Yamagishi, Junichi / Ling, Zhen-Hua / King, Simon:
"Robustness of HMM-based speech synthesis",
581-584.
Conkie, Alistair / Syrdal, Ann / Kim, Yeon-Jun / Beutnagel, Mark:
"Improving preselection in unit selection synthesis",
585-588.
Ding, Feng / Nurminen, Jani / Tian, Jilei:
"Efficient join cost computation for unit selection based TTS systems",
589-592.
Yanagisawa, Kayoko / Huckvale, Mark:
"A phonetic assessment of cross-language voice conversion",
593-596.
Pollet, Vincent / Breen, Andrew:
"Synthesis by generation and concatenation of multiform segments",
1825-1828.
Cabral, João P. / Renals, Steve / Richmond, Korin / Yamagishi, Junichi:
"Glottal spectral separation for parametric speech synthesis",
1829-1832.
Kominek, John / Badaskar, Sameer / Schultz, Tanja / Black, Alan W.:
"Improving speech systems built from very little data",
1833-1836.
Saito, Daisuke / Asakawa, Satoshi / Minematsu, Nobuaki / Hirose, Keikichi:
"Structure to speech conversion - speech generation based on infant-like vocal imitation",
1837-1840.
Tiomkin, Stas / Malah, David:
"Statistical text-to-speech synthesis with improved dynamics",
1841-1844.
Webster, Gabriel / Braunschweiler, Norbert:
"An evaluation of non-standard features for grapheme-to-phoneme conversion",
1845-1848.
Agiomyrgiannakis, Yannis / Rosec, Olivier:
"Towards flexible speech coding for speech synthesis: an LF + modulated noise vocoder",
1849-1852.
Silen, Hanna / Helander, Elina / Nurminen, Jani / Gabbouj, Moncef:
"Evaluation of Finnish unit selection and HMM-based speech synthesis",
1853-1856.
Theobald, Barry-John / Wilkinson, Nicholas:
"A probabilistic trajectory synthesis system for synthesising visual speech",
1857-1860.
Cadic, Didier / Segalen, Lionel:
"Paralinguistic elements in speech synthesis",
1861-1864.
Raghavendra E., Veera / Yegnanarayana, B. / Black, Alan W. / Prahallad, Kishore:
"Building sleek synthesizers for multi-lingual screen reader",
1865-1868.
King, Simon / Tokuda, Keiichi / Zen, Heiga / Yamagishi, Junichi:
"Unsupervised adaptation for HMM-based speech synthesis",
1869-1872.
Strom, Volker / King, Simon:
"Investigating festival's target cost function using perceptual experiments",
1873-1876.
Neubarth, Friedrich / Pucher, Michael / Kranzler, Christian:
"Modeling Austrian dialect varieties for TTS",
1877-1880.
Raitio, Tuomo / Suni, Antti / Pulakka, Hannu / Vainio, Martti / Alku, Paavo:
"HMM-based Finnish text-to-speech system utilizing glottal inverse filtering",
1881-1884.
Sarkar, Tanuja / Joshi, Sachin / Pammi, Sathish Chandra / Prahallad, Kishore:
"LTS using decision forest of regression trees and neural networks",
1885-1888.
Rustullet, Silvia / Braga, Daniela / Nogueira, João / Sales Dias, Miguel:
"Automatic word stress marking and syllabification for Catalan TTS",
1889-1892.
Speaking Style and Emotion Recognition
Wöllmer, Martin / Eyben, Florian / Reiter, Stephan / Schuller, Björn / Cox, Cate / Douglas-Cowie, Ellen / Cowie, Roddy:
"Abandoning emotion classes - towards continuous emotion recognition with modelling of long-range dependencies",
597-600.
Seppi, Dino / Batliner, Anton / Schuller, Björn / Steidl, Stefan / Vogt, Thurid / Wagner, Johannes / Devillers, Laurence / Vidrascu, Laurence / Amir, Noam / Aharonson, Vered:
"Patterns, prototypes, performance: classifying emotional user states",
601-604.
He, Ling / Lech, Margaret / Memon, Sheeraz / Allen, Nicholas:
"Recognition of stress in speech using wavelet analysis and Teager energy operator",
605-608.
Shriberg, Elizabeth / Graciarena, Martin / Bratt, Harry / Kathol, Andreas / Kajarekar, Sachin S. / Jameel, Huda / Richey, Colleen / Goodman, Fred:
"Effects of vocal effort and speaking style on text-independent speaker verification",
609-612.
Farrús, Mireia / Wagner, Michael / Anguita, Jan / Hernando, Javier:
"Robustness of prosodic features to voice imitation",
613-616.
Sethu, Vidhyasaharan / Ambikairajah, Eliathamby / Epps, Julien:
"Phonetic and speaker variations in automatic emotion classification",
617-620.
Special Session: Cross-Linguistic and Developmental Issues in the Perception and Production of Lexical Tone
Mattock, Karen:
"Infants' native and nonnative tone perception",
621.
Krishnan, Ananthanarayan / Gandour, Jackson / Swaminathan, Jayaganesh:
"Language experience dependent plasticity for pitch representation in the human brainstem",
622.
Ciocca, Valter / Ip, Vivian W.-K.:
"Development of tone perception and tone production in Cantonese-learning children aged 2 to 5 years",
623.
Xu, Nan / Burnham, Denis:
"Tone hyperarticulation in Cantonese infant-directed speech",
624.
Zerbian, Sabine / Barnard, Etienne:
"Influences on tone in Sepedi, a southern Bantu language",
625.
Ishihara, Shunichi:
"An acoustic-phonetic comparative analysis of Osaka and Kagoshima Japanese tonal phenomena",
626-629.
Special Session: Auditory-Inspired Spectro-Temporal Features I, II
Vinyals, Oriol / Friedland, Gerald:
"Modulation spectrogram features for improved speaker diarization",
630-633.
Falk, Tiago H. / Chan, Wai-Yip:
"Spectro-temporal features for robust far-field speaker identification",
634-637.
Wu, Siqing / Falk, Tiago H. / Chan, Wai-Yip:
"Long-term spectro-temporal information for improved automatic speech emotion classification",
638-641.
Kubo, Yotaro / Okawa, Shigeki / Kurematsu, Akira / Shirai, Katsuhiko:
"A comparative study on AM and FM features",
642-645.
Markaki, Maria / Stylianou, Yannis:
"Dimensionality reduction of modulation frequency features for speech discrimination",
646-649.
Kawahara, Hideki / Morise, Masanori / Banno, Hideki / Takahashi, Toru / Nisimura, Ryuichi / Irino, Toshio:
"Spectral envelope recovery beyond the nyquist limit for high-quality manipulation of speech sounds",
650-653.
Yin, Hui / Xie, Xiang / Kuang, Jingming:
"Adaptive-order fractional Fourier transform features for speech recognition",
654-657.
Petrick, Rico / Lu, Xugang / Unoki, Masashi / Akagi, Masato / Hoffmann, Rüdiger:
"Robust front end processing for speech recognition in reverberant environments: utilization of speech characteristics",
658-661.
Sivaram, G. S. V. S. / Hermansky, Hynek:
"Introducing temporal asymmetries in feature extraction for automatic speech recognition",
890-893.
Heckmann, Martin / Domont, Xavier / Joublin, Frank / Goerick, Christian:
"A closer look on hierarchical spectro-temporal features (HIST)",
894-897.
Zhao, Sherry Y. / Morgan, Nelson:
"Multi-stream spectro-temporal features for robust speech recognition",
898-901.
Wang, Huan / Gelbart, David / Hirsch, Hans-Günter / Hemmert, Werner:
"The value of auditory offset adaptation and appropriate acoustic modeling",
902-905.
Meyer, Bernd T. / Kollmeier, Birger:
"Optimization and evaluation of Gabor feature sets for ASR",
906-909.
Speech Coding, Quality Measurement and Auditory Modelling
Nguyen, Binh Phu / Shibata, Takeshi / Akagi, Masato:
"High-quality analysis/synthesis method based on temporal decomposition for speech modification",
662-665.
Gournay, Philippe:
"Improved frame loss recovery using closed-loop estimation of very low bit rate side information",
666-669.
Happel, Max F. K. / Müller, Simon / Anemüller, Jörn / Ohl, Frank W.:
"Predictability of STRFs in auditory cortex neurons depends on stimulus class",
670.
Mittal, Udar / Ashley, James P. / Gibbs, Jonathan:
"Higher layer coding of non-speech like signals using factorial pulse codebook",
671-674.
Ganapathy, Sriram / Motlicek, Petr / Hermansky, Hynek / Garudadri, Harinath:
"Spectral noise shaping: improvements in speech/audio codec based on linear prediction in spectral domain",
675-678.
Flax, Matthew R. / Holmes, W. Harvey:
"Introducing the compression wave cochlear amplifier",
679-682.
Flax, Matthew R. / Holmes, W. Harvey:
"Goldman-hodgkin-katz cochlear hair cell models - a foundation for nonlinear cochlear mechanics",
683-686.
Bao, Changchun / Li, Hai-ting / Liu, Ze-xin / Fan, Rui / Zhu, Heng / Jia, Mao-shen / Li, Rui:
"A 8.32 kb/s embedded wideband speech coding candidate for ITU-t EV-VBR standardization",
687-690.
Kim, Jong Kyu / Park, Seung Seop / Han, Chang Woo / Kim, Nam Soo:
"Decision tree based frame mode selection for AMR-WB+",
695-698.
Liu, W. M. / Jellyman, K. A. / Evans, N. W. D. / Mason, John S. D.:
"Assessment of objective quality measures for speech intelligibility",
699-702.
Scholz, Kirstin / Kühnel, Christine / Waltermann, Marcel / Möller, Sebastian / Heute, Ulrich:
"Assessment of the speech-quality dimension "noisiness" for the instrumental estimation and analysis of telephone-band speech quality",
703-706.
Gomez, Angel M. / Carmona, Jose L. / Peinado, Antonio M. / Sanchez, Victoria / Gonzalez, Jose A.:
"Intelligibility evaluation of Ramsey-derived interleavers for internet voice streaming with the iLBC codec",
707-710.
Accent and Language Recognition
Lyu, Dau-Cheng / Lyu, Ren-Yuan:
"Language identification on code-switching utterances using multiple cues",
711-714.
Tong, Rong / Ma, Bin / Li, Haizhou / Chng, Eng Siong:
"Target-oriented phone selection from universal phone set for spoken language recognition",
715-718.
Torres-Carrasquillo, Pedro A. / Singer, Elliot / Campbell, W. M. / Gleason, Terry / McCree, Alan / Reynolds, Douglas A. / Richardson, Fred / Shen, Wade / Sturim, Douglas E.:
"The MITLL NIST LRE 2007 language recognition system",
719-722.
Torres-Carrasquillo, Pedro A. / Sturim, Douglas E. / Reynolds, Douglas A. / McCree, Alan:
"Eigen-channel compensation and discriminatively trained Gaussian mixture models for dialect and accent recognition",
723-726.
Lopez-Moreno, Ignacio / Ramos, Daniel / Gonzalez-Rodriguez, Joaquin / Toledano, Doroteo T.:
"Anchor-model fusion for language recognition",
727-730.
Yin, Bo / Thiruvaran, Tharmarajah / Ambikairajah, Eliathamby / Chen, Fang:
"Introducing a FM based feature to hierarchical language identification",
731-734.
Lei, Yun / Hansen, John H. L.:
"Dialect classification via discriminative training",
735-738.
Matějka, Pavel / Burget, Lukáš / Glembek, Ondřej / Schwarz, Petr / Hubeika, Valiantsina / Fapšo, Michal / Mikolov, Tomáš / Plchot, Oldřich / Černocký, Jan:
"BUT language recognition system for NIST 2007 evaluations",
739-742.
Glembek, Ondřej / Matějka, Pavel / Burget, Lukáš / Mikolov, Tomáš:
"Advances in phonotactic language recognition",
743-746.
Mehrabani, Mahnoosh / Hansen, John H. L.:
"Dialect separation assessment using log-likelihood score distributions",
747-750.
Alotaibi, Yousef A. / Abdullah-Al-Mamun, Khondaker / Muhammad, Ghulam:
"Study on unique pharyngeal and uvular consonants in foreign accented Arabic",
751-754.
Bi, Fukun / Yang, Jian / Xu, Dan:
"Automatic accent classification using ensemble methods",
755-758.
Piat, Marina / Fohr, Dominique / Illina, Irina:
"Foreign accent identification based on prosodic parameters",
759-762.
Shen, Wade / Chen, Nancy / Reynolds, Douglas A.:
"Dialect recognition using adapted phonetic models",
763-766.
McCree, Alan / Richardson, Fred / Singer, Elliot / Reynolds, Douglas A.:
"Beyond frame independence: parametric modelling of time duration in speaker and language recognition",
767-770.
Prosody: Prosodic Structure, Paralinguistic, Non-linguistic and Other Cues
Dockendorf, Liz / Almubayei, Dalal / Benton, Matthew:
"Testing a large corpus of natural standard Arabic for rhythm class",
771.
Benton, Matthew / Dockendorf, Liz:
"A comparison of two acoustic measurement approaches to the rhythm continuum of natural Chinese and English speech",
772-775.
Nariai, Tomoko / Tanaka, Kazuyo:
"A study of pitch patterns of Japanese English analyzed via comparative linguistic features of English and Japanese",
776-779.
Woehrling, Cécile / Boula de Mareüil, Philippe / Adda-Decker, Martine / Lamel, Lori:
"A corpus-based prosodic study of Alsatian, Belgian and Swiss French",
780-783.
Nakamura, Mitsuhiro:
"Prosodic position effects and function words in English: a pilot study",
784.
Ruiter, Laura E. de:
"How useful are polynomials for analyzing intonation?",
785-788.
Chen, Qingcai / Zhou, Shusen / Wang, Dandan / Yang, Xiaohong:
"Adaptive filter based prosody modification approach",
789-792.
Khine, Swe Zin Kalayar / Nwe, Tin Lay / Li, Haizhou:
"Speech/laughter classification in meeting audio",
793-796.
Knox, Mary Tai / Morgan, Nelson / Mirghafori, Nikki:
"Getting the last laugh: automatic laughter segmentation in meetings",
797-800.
Wrigley, Stuart N. / Tucker, Simon / Brown, Guy J. / Whittaker, Steve:
"The influence of audio presentation style on multitasking during teleconferences",
801-804.
Vlasenko, Bogdan / Schuller, Björn / Mengistu, Kinfe Tadesse / Rigoll, Gerhard / Wendemuth, Andreas:
"Balancing spoken content adaptation and unit length in the recognition of emotion and interest",
805-808.
Krahmer, Emiel / Schaafsma, Juliette / Swerts, Marc / Vingerhoets, Ad:
"Nonverbal responses to social inclusion and exclusion",
809-812.
Kitamura, Tatsuya:
"Acoustic analysis of imitated voice produced by a professional impersonator",
813-816.
Patil, Sanjay A. / Hansen, John H. L.:
"Detection of speech under physical stress: model development, sensor selection, and feature fusion",
817-820.
Automatic Speech Recognition: Language Models I, II
Chen, Langzhou / Nagae, Hisayoshi / Stuttle, Matt:
"Improving Japanese language models using POS information",
821-824.
Arısoy, Ebru / Roark, Brian / Shafran, Izhak / Saraçlar, Murat:
"Discriminative n-gram language modeling for Turkish",
825-828.
Emami, Ahmad / Zitouni, Imed / Mangu, Lidia:
"Rich morphology based n-gram language models for Arabic",
829-832.
Huang, Songfang / Renals, Steve:
"Unsupervised language model adaptation based on topic and role information in multiparty meetings",
833-836.
Liu, X. / Gales, M. J. F. / Woodland, P. C.:
"Context dependent language model adaptation",
837-840.
Hsu, Bo-June / Glass, James:
"Iterative language model estimation: efficient data structure & algorithms",
841-844.
Ohta, Kengo / Tsuchiya, Masatoshi / Nakagawa, Seiichi:
"Evaluating spoken language model based on filler prediction model in speech recognition",
1558-1561.
Duta, Nicolae:
"Transcription-less call routing using unsupervised language model adaptation",
1562-1565.
Pan, Zhen-Yu / Jiang, Hui:
"Large margin multinomial mixture model for text categorization",
1566-1569.
Yeung, Yu Ting / Cao, Houwei / Zheng, N. H. / Lee, Tan / Ching, P. C.:
"Language modeling for speech recognition of spoken Cantonese",
1570-1573.
Kobayashi, Akio / Oku, Takahiro / Homma, Shinichi / Sato, Shoei / Imai, Toru / Takagi, Tohru:
"Discriminative rescoring based on minimization of word errors for transcribing broadcast news",
1574-1577.
Shi, Qin / Chu, Stephen M. / Liu, Wen / Kuo, Hong-Kwang Jeff / Liu, Yi / Qin, Yong:
"Search and classification based language model adaptation",
1578-1581.
Huijbregts, Marijn / Ordelman, Roeland / Jong, Franciska de:
"Fast n-gram language model look-ahead for decoders with static pronunciation prefix trees",
1582-1585.
Saykhum, Kwanchiva / Boonpiam, Vataya / Thatphithakkul, Nattanun / Wutiwiwatchai, Chai / Natthee, Cholwich:
"Thai named-entity recognition using class-based language modeling on multiple-sized subword units",
1586-1589.
Schwarzler, S. / Geiger, J. / Schenk, J. / Al-Hames, M. / Hornler, B. / Ruske, Günther / Rigoll, Gerhard:
"Combining statistical and syntactical systems for spoken language understanding with graphical models",
1590-1593.
Sethy, Abhinav / Ramabhadran, Bhuvana:
"Bag-of-word normalized n-gram models",
1594-1597.
Hahn, Sangyun / Sethy, Abhinav / Kuo, Hong-Kwang Jeff / Ramabhadran, Bhuvana:
"A study of unsupervised clustering techniques for language modeling",
1598-1601.
Martins, Ciro / Teixeira, António / Neto, João:
"Automatic estimation of language model parameters for unseen words using morpho-syntactic contextual information",
1602-1605.
Ward, Nigel G. / Vega, Alejandro:
"Modeling the effects on time-into-utterance on word probabilities",
1606-1609.
Wang, Ye-Yi / Li, Xiao / Acero, Alex:
"Inductive and example-based learning for text classification",
1610-1613.
Wilson, Theresa / Raaijmakers, Stephan:
"Comparing word, character, and phoneme n-grams for subjective utterance recognition",
1614-1617.
Federico, Marcello / Bertoldi, Nicola / Cettolo, Mauro:
"IRSTLM: an open source toolkit for handling large scale language models",
1618-1621.
Speaker Identification and Verification
Kajarekar, Sachin S.:
"Phone-based cepstral polynomial SVM system for speaker recognition",
845-848.
Zhu, Donglai / Ma, Bin / Li, Haizhou:
"Using MAP estimation of feature transformation for speaker recognition",
849-852.
Vogt, Robbie / Baker, Brendan / Sridharan, Sridha:
"Factor analysis subspace estimation for speaker verification with short utterances",
853-856.
McLaren, Mitchell / Matrouf, Driss / Vogt, Robbie / Bonastre, Jean-François:
"Combining continuous progressive model adaptation and factor analysis for speaker verification",
857-860.
Hsieh, Chia-Hsin / Wu, Chung-Hsien / Shen, Han-Ping:
"Adaptive decision tree-based phone cluster models for speaker clustering",
861-864.
Aronowitz, Hagai / Solewicz, Yosef A.:
"Speaker recognition in two-wire test sessions",
865-868.
Prosodic Structure and Processing
Bishop, Jason B.:
"The effect of position on the realization of second occurrence focus",
869-872.
Shue, Yen-Liang / Shattuck-Hufnagel, Stefanie / Iseli, Markus / Jun, Sun-Ah / Veilleux, Nanette / Alwan, Abeer:
"Effects of intonational phrase boundaries on pitch-accented syllables in american English",
873-876.
Walsh, Michael / Schweitzer, Katrin / Möbius, Bernd / Schütze, Hinrich:
"Examining pitch-accent variability from an exemplar-theoretic perspective",
877-880.
Hakokari, Jussi / Saarni, Tuomo / Isoaho, Jouni / Salakoski, Tapio:
"Correlation of utterance length and segmental duration in Finnish is questionable",
881-884.
Yuan, Jiahong / Isard, Stephen / Liberman, Mark:
"Different roles of pitch and duration in distinguishing word stress in English",
885.
O'Reilly, Maria / Chasaide, Ailbhe Ní / Gobl, Christer:
"Cross-dialect Irish prosody: linguistic constraints on Fujisaki modelling",
886-889.
Robust Automatic Speech Recognition I-III
Nakayama, Masato / Nishiura, Takanobu / Denda, Yuki / Kitaoka, Norihide / Yamamoto, Kazumasa / Yamada, Takeshi / Tsuge, Satoru / Miyajima, Chiyomi / Fujimoto, Masakiyo / Takiguchi, Tetsuya / Tamura, Satoshi / Ogawa, Tetsuji / Matsuda, Shigeki / Kuroiwa, Shingo / Takeda, Kazuya / Nakamura, Satoshi:
"CENSREC-4: development of evaluation framework for distant-talking speech recognition under reverberant environments",
968-971.
Tsujikawa, Masanori / Arakawa, Takayuki / Isotani, Ryosuke:
"In-car speech recognition using model-based wiener filter and multi-condition training",
972-975.
Kühne, Marco / Togneri, Roberto / Nordholm, Sven:
"Adaptive beamforming and soft missing data decoding for robust speech recognition in reverberant environments",
976-979.
BabaAli, Bagher / Sameti, Hossein / Safayani, Mehran:
"Spectral subtraction in likelihood-maximizing framework for robust speech recognition",
980-983.
Ganapathy, Sriram / Thomas, Samuel / Hermansky, Hynek:
"Front-end for far-field speech recognition based on frequency domain linear prediction",
984-987.
Park, Ji Hun / Yoon, Jae Sam / Kim, Hong Kook:
"Mask estimation incorporating time-frequency trajectories for a CASA-based ASR front-end",
988-991.
Takahashi, Toru / Yamamoto, Shun'ichi / Nakadai, Kazuhiro / Komatani, Kazunori / Ogata, Tetsuya / Okuno, Hiroshi G.:
"Soft missing-feature mask generation for simultaneous speech recognition system in robots",
992-995.
Wang, Dong / Himawan, Ivan / Frankel, Joe / King, Simon:
"A posterior approach for microphone array based speech recognition",
996-999.
Chiu, Yu-Hsiang Bosco / Stern, Richard M.:
"Analysis of physiologically-motivated signal processing for robust speech recognition",
1000-1003.
Sun, Liang-che / Hsu, Chang-wen / Lee, Lin-shan:
"Evaluation of modulation spectrum equalization techniques for large vocabulary robust speech recognition",
1004-1007.
Chen, Yi / Wan, Chia-yu / Lee, Lin-shan:
"Confusion-based entropy-weighted decoding for robust speech recognition",
1008-1011.
Pettersen, Svein Gunnar / Johnsen, Magne Hallstein:
"Cepstral domain voice activity detection for improved noise modeling in MMSE feature enhancement for ASR",
1012-1015.
Molina, Carlos / Yoma, Nestor Becerra / Huenupan, Fernando / Garreton, Claudio:
"Unsupervised re-scoring of observation probability based on maximum entropy criterion by using confidence measure with telephone speech",
1016-1019.
Liao, Yuan-Fu / Hsu, Chi-Hui / Yang, Chi-Min / Lin, Jeng-Shien / Chang, Sen-Chia:
"Within-class feature normalization for robust speech recognition",
1020-1023.
Tan, Zheng-Hua / Lindberg, Børge:
"A posteriori SNR weighted energy based variable frame rate analysis for speech recognition",
1024-1027.
Wang, Chieh-cheng / Pan, Chi-an / Hung, Jeih-weih:
"Silence feature normalization for robust speech recognition in additive noise environments",
1028-1031.
Wang, L. / Nakagawa, Seiichi / Kitaoka, Norihide:
"Blind dereverberation based on CMN and spectral subtraction by multi-channel LMS algorithm",
1032-1035.
Liao, Yuan-Fu / Fang, Hung-Hsiang / Hsu, Chi-Hui:
"Eigen-MLLR environment/speaker compensation for robust speech recognition",
1249-1252.
Yu, Dong / Deng, Li / Gong, Yifan / Acero, Alex:
"Parameter clustering and sharing in variable-parameter HMMs for noise robust speech recognition",
1253-1256.
Du, Jun / Huo, Qiang:
"A feature compensation approach using high-order vector taylor series approximation of an explicit distortion model for noisy speech recognition",
1257-1260.
Cui, Xiaodong / Afify, Mohamed / Gao, Yuqing:
"N-best based stochastic mapping on stereo HMM for noise robust speech recognition",
1261-1264.
Tsao, Yu / Lee, Chin-Hui:
"Improving the ensemble speaker and speaking environment modeling approach by enhancing the precision of the online estimation process",
1265-1268.
Lu, Jianhua / Ming, Ji / Woods, Roger:
"Combining noise compensation and missing-feature decoding for large vocabulary speech recognition in noise",
1269-1272.
Pettersen, Svein Gunnar:
"Joint Bayesian predictive classification and parallel model combination with prior scaling for robust ASR",
1273-1276.
Kumar, Abhishek / Hansen, John H. L.:
"Environment mismatch compensation using average eigenspace for speech recognition",
1277-1280.
Povey, Daniel / Kingsbury, Brian:
"Monte Carlo model-space noise adaptation for speech recognition",
1281-1284.
Ma, Ning / Green, Phil:
"A 'speechiness' measure to improve speech decoding in the presence of other sound sources",
1285-1288.
Buera, Luis / Miguel, Antonio / Saz, Oscar / Ortega, Alfonso / Lleida, Eduardo:
"Feature vector normalization with combined standard and throat microphones for robust ASR",
1289-1292.
Fukuda, Takashi / Ichikawa, Osamu / Nishimura, Masafumi:
"Phone-duration-dependent long-term dynamic features for a stochastic model-based voice activity detection",
1293-1296.
Ijima, Yusuke / Tachibana, Makoto / Nose, Takashi / Kobayashi, Takao:
"An on-line adaptation technique for emotional speech recognition using style estimation with multiple-regression HMM",
1297-1300.
Berkovitch, Michael / Shallom, Ilan D.:
"HMM adaptation using statistical linear approximation for robust automatic speech recognition",
1301-1304.
Rennie, Steven J. / Dognin, Pierre L.:
"Beyond linear transforms: efficient non-linear dynamic adaptation for noise robust speech recognition",
1305-1308.
Gomez, Randy / Even, Jani / Shikano, Kiyohiro:
"Rapid unsupervised speaker adaptation robust in reverberant environment conditions",
1309-1312.
Li, Jinyu / Lee, Chin-Hui:
"On a generalization of margin-based discriminative training to robust speech recognition",
1992-1995.
Gales, M. J. F. / Longworth, C.:
"Discriminative classifiers with generative kernels for noise robust ASR",
1996-1999.
Dalen, R. C. van / Gales, M. J. F.:
"Covariance modelling for noise-robust speech recognition",
2000-2003.
Chen, Wei-Hau / Lin, Shih-Hsiang / Chen, Berlin:
"Exploiting spatial-temporal feature distribution characteristics for robust speech recognition",
2004-2007.
Fujimoto, Masakiyo / Ishizuka, Kentaro / Nakatani, Tomohiro:
"Study of integration of statistical model-based voice activity detection and noise suppression",
2008-2011.
Li, Weifeng / Dines, John / Doss, Mathew Magimai / Bourlard, Hervé:
"Neural network based regression for robust overlapping speech recognition using microphone arrays",
2012-2015.
Speech Analysis and Processing, Voice Conversion and Modification
Pfitzinger, Hartmut R. / Kaernbach, Christian:
"Amplitude and amplitude variation of emotional speech",
1036-1039.
Krishnamurthy, Nitish / Ikeno, Ayako / Hansen, John H. L.:
"Babble speech: acoustic and perceptual variability",
1040-1043.
Pantazis, Yannis / Rosec, Olivier / Stylianou, Yannis:
"On the properties of a time-varying quasi-harmonic model of speech",
1044-1047.
Lu, Wenliang / Sen, D.:
"Extraction and tracking of formant response jitter in the cochlea for objective prediction of SB/SF DAM attributes",
1048-1051.
Messing, David P. / Delhorne, Lorraine / Bruckert, Ed / Braida, Louis D. / Ghitza, Oded:
"Consonant discrimination of degraded speech using an efferent-inspired closed-loop cochlear model",
1052-1055.
Tomar, Vikrant / Patil, Hemant A.:
"On the development of variable length Teager energy operator (VTEO)",
1056-1059.
Qiao, Yu / Minematsu, Nobuaki:
"Metric learning for unsupervised phoneme segmentation",
1060-1063.
Kalinli, Ozlem / Narayanan, Shrikanth S.:
"Combining task-dependent information with auditory attention cues for prominence detection in speech",
1064-1067.
Zen, Heiga / Nankaku, Yoshihiko / Tokuda, Keiichi:
"Probabilistic feature mapping based on trajectory HMMs",
1068-1071.
Yutani, Kaori / Uto, Yosuke / Nankaku, Yoshihiko / Toda, Tomoki / Tokuda, Keiichi:
"Simultaneous conversion of duration and spectrum based on statistical models including time-sequence matching",
1072-1075.
Muramatsu, Takashi / Ohtani, Yamato / Toda, Tomoki / Saruwatari, Hiroshi / Shikano, Kiyohiro:
"Low-delay voice conversion based on maximum likelihood estimation of spectral parameter trajectory",
1076-1079.
Ohtani, Yamato / Toda, Tomoki / Saruwatari, Hiroshi / Shikano, Kiyohiro:
"An improved one-to-many eigenvoice conversion system",
1080-1083.
Uchimura, Yoshinori / Banno, Hideki / Itakura, Fumitada / Kawahara, Hideki:
"Study on manipulation method of voice quality based on the vocal tract area function",
1084-1087.
Toth, Arthur / Black, Alan W.:
"Incorporating durational modification in voice transformation",
1088-1091.
Dashiell, Amy / Hutchinson, Brian / Margolis, Anna / Ostendorf, Mari:
"Non-segmental duration feature extraction for prosodic classification",
1092-1095.
Special Session: Tonality in Production and Perception, Language in Australia and New Zealand
Zheng, Hong-Ying / Wang, William S.-Y.:
"An ERP study on categorical perception of lexical tones and nonspeech pitches",
1096.
Otake, Takashi / Higuchi, Marii:
"The role of Japanese pitch accent in spoken-word recognition: evidence from middle-aged accentless dialect listeners",
1097-1100.
Wang, Siwei / Levow, Gina-Anne:
"Mandarin Chinese tone nucleus detection with landmarks",
1101-1104.
Hu, Weixiang / Jian, Jin / Li, Aijun / Wang, Xia:
"A comparative study on dissyllabic stress patterns of Mandarin and Cantonese",
1105-1108.
Ho, Rerrario Shui-Ching / Sagisaka, Yoshinori:
"Three-sectional-staff characterization of Cantonese level tones",
1109-1112.
Zhu, Xiaonong / Zhang, Caicai:
"A seven-tone dialect in southern China with falling-rising-falling contour: a linguistic acoustic analysis",
1113-1115.
Prom-on, Santitham:
"Pitch target analysis of Thai tones using quantitative target approximation model and unsupervised clustering",
1116-1119.
So, Connie K. / Best, Catherine T.:
"Do English speakers assimilate Mandarin tones to English prosodic categories?",
1120.
Bundgaard-Nielsen, Rikke L. / Best, Catherine T. / Tyler, Michael D. / Kroos, Christian:
"Evidence of a near-merger in western sydney australian English vowels",
1121.
Tabain, Marija / Rickard, Kristine / Breen, Gavan / Dobson, Veronica:
"Central vowels in Arrernte: metrical prominence and pitch accent",
1122.
Ross, Bella:
"Pausing and phrase length in two australian languages",
1123.
Stevens, Mary / Hajek, John:
"Positional effects on the characterization of ejectives in Waima'a",
1124-1127.
Starks, Donna / Thompson, Laura / Watson, Catherine I.:
"A Niuean variant of New Zealand English?",
1128.
Automatic Speech Recognition: Tone Languages
Ding, Guo-Hong:
"Phonetic confusion analysis and robust phone set generation for Shanghai-accented Mandarin speech recognition",
1129-1132.
Yeung, Yu Ting / Qian, Yao / Lee, Tan / Soong, Frank K.:
"Prosody for Mandarin speech recognition: a comparative study of read and spontaneous speech",
1133-1136.
Cheng, Li-Wei / Lee, Lin-shan:
"Improved large vocabulary Mandarin speech recognition by selectively using tone information with a two-stage prosodic model",
1137-1140.
Ru, Tingting / Xie, Xiang / Yin, Hui / Kuang, Jingming:
"Mandarin connected digits recognition for whispered speech",
1141-1144.
Bao, Changchun / Xu, Weiqun / Yan, Yonghong:
"Recognizing named entities in spoken Chinese dialogues with a character-level maximum entropy tagger",
1145-1148.
Nguyen, Hong Quang / Nocera, Pascal / Castelli, Eric / Trinh, Van Loan:
"A novel approach in continuous speech recognition for Vietnamese, an isolating tonal language",
1149-1152.
Spoken Dialogue Systems
Thomson, B. / Yu, K. / Gašić, M. / Keizer, S. / Mairesse, F. / Schatzmann, J. / Young, Steve:
"Evaluating semantic-level confidence scores with multiple hypotheses",
1153-1156.
Zweig, Geoffrey / Bohus, Dan / Li, Xiao / Nguyen, Patrick:
"Structured models for joint decoding of repeated utterances",
1157-1160.
Meurs, Marie-Jean / Lefevre, Fabrice / Mori, Renato De:
"A Bayesian approach to semantic composition for spoken language interpretation",
1161-1164.
Paek, Tim / Ju, Yun-Cheng:
"Accommodating explicit user expressions of uncertainty in voice search or something like that",
1165-1168.
Kim, Dongho / Sim, Hyeong Seop / Kim, Kee-Eung / Kim, Jin Hyung / Kim, Hyunjeong / Sung, Joo Won:
"Effects of user modeling on POMDP-based dialogue systems",
1169-1172.
Williams, Jason D.:
"The best of both worlds: unifying conventional dialog systems and POMDPs",
1173-1176.
Cross-Language and Language-Specific Phonetics
Bundgaard-Nielsen, Rikke L. / Best, Catherine T. / Tyler, Michael D.:
"The assimilation of L2 australian English vowels to L1 Japanese vowel categories: vocabulary size matters",
1177.
Ali, Azra N. / Lahrouchi, Mohamed / Ingleby, Michael:
"Vowel epenthesis, acoustics and phonology patterns in Moroccan Arabic",
1178-1181.
Wang, Gaowu / Dang, Jianwu / Kong, Jiangping:
"Estimation of vocal tract area function for Mandarin vowel sequences using MRI",
1182-1185.
Tsukada, Kimiko / Nguyen, Thu T. A.:
"The effect of first language (L1) dialects on the identification of Vietnamese word-final stops",
1186-1189.
Antoniou, Mark / Best, Catherine T. / Tyler, Michael D.:
"Perceptual evidence of modern Greek voiced stops as phonological categories",
1190.
Hazan, Valerie / Li, Enid:
"The effect of auditory and visual degradation on audiovisual perception of native and non-native speakers",
1191-1194.
Special Session: Prosody of Spontaneous Speech I, II
Mixdorff, Hansjörg:
"Quantitative prosodic analysis of spontaneous speech",
1195.
Lindström, Anders / Villing, Jessica / Larsson, Staffan / Seward, Alexander / Åberg, Nina / Holtelius, Cecilia:
"The effect of cognitive load on disfluencies during in-vehicle spoken dialogue",
1196-1199.
Tseng, Chiu-yu / Su, Zhao-yu:
"Discourse prosody context - global F0 and tempo modulations",
1200-1203.
Obin, Nicolas / Lacheret-Dujour, Anne / Veaux, Christophe / Rodet, Xavier / Simon, Anne-Catherine:
"A method for automatic and dynamic estimation of discourse genre typology with prosodic features",
1204-1207.
Ishi, Carlos Toshinori / Ishiguro, Hiroshi / Hagita, Norihiro:
"The meanings carried by interjections in spontaneous speech",
1208-1211.
Jones, Christian M. / Deeming, Andrew:
"Speech interaction with an emotional robotic dog",
1212-1215.
Ochi, Keiko / Hirose, Keikichi / Minematsu, Nobuaki:
"Control of prosodic focus in corpus-based generation of fundamental frequency based on the generation process model",
1216.
Godin, Keith W. / Hansen, John H. L.:
"Analysis and perception of speech under physical task stress",
1674-1677.
Lee, Chi-Chun / Lee, Sungbok / Narayanan, Shrikanth S.:
"An analysis of multimodal cues of interruption in dyadic spoken interactions",
1678-1681.
Mori, Hiroki / Kasuya, Hideki:
"Paralinguistic effects on turn-taking behavior in expressive conversation",
1682.
Yin, Zhigang / Li, Aijun / Xiong, Ziyu:
"Study on "ng, a" type of discourse markers in standard Chinese",
1683-1686.
Moniz, Helena / Mata, Ana Isabel / Trancoso, Isabel / Viana, M. Ceu:
"How can you use disfluencies and still sound as a good speaker?",
1687.
Strangert, Eva / Gustafson, Joakim:
"What makes a good speaker? subject ratings, acoustic measurements and perceptual evaluations",
1688-1691.
Kousidis, Spyros / Dorran, David / Wang, Yi / Vaughan, Brian / Cullen, Charlie / Campbell, Dermot / McDonnell, Ciaran / Coyle, Eugene:
"Towards measuring continuous acoustic feature convergence in unconstrained spoken dialogues",
1692-1695.
Kawahara, Tatsuya / Toyokura, Masayoshi / Misu, Teruhisa / Hori, Chiori:
"Detection of feeling through back-channels in spoken dialogue",
1696.
Automatic Speech Recognition: Adaptation I, II
Karafiát, Martin / Burget, Lukáš / Hain, Thomas / Černocký, Jan:
"Discrimininative training of narrow band - wide band adapted systems for meeting recognition",
1217-1220.
Hahm, Seongjun / Ito, Akinori / Makino, Shozo / Suzuki, Motoyuki:
"A fast speaker adaptation method using aspect model",
1221-1224.
Su, Dan / Wu, Xihong / Chi, Huisheng:
"Probabilistic latent speaker training for large vocabulary speech recognition",
1225-1228.
Tanji, Shutaro / Shinoda, Koichi / Furui, Sadaoki / Ortega, Antonio:
"Improvement of eigenvoice-based speaker adaptation by parameter space clustering",
1229-1232.
Sanand, D. R. / Umesh, S.:
"Study of jacobian compensation using linear transformation of conventional MFCC for VTLN",
1233-1236.
Ting, Chuan-Wei / Lee, Kuo-Yuan / Chien, Jen-Tzung:
"Adaptive HMM topology for speech recognition",
1237-1240.
Chen, Liang-Yu / Lee, Chun-Jen / Jang, Jyh-Shing Roger:
"Minimum phone error discriminative training for Mandarin Chinese speaker adaptation",
1241-1244.
Povey, Daniel / Kuo, Hong-Kwang Jeff / Soltau, Hagen:
"Fast speaker adaptive training for speech recognition",
1245-1248.
Raut, C. K. / Yu, K. / Gales, M. J. F.:
"Adaptive training using discriminative mapping transforms",
1697-1700.
Loof, Jonas / Gollan, Christian / Ney, Hermann:
"Speaker adaptive training using shift-MLLR",
1701-1704.
Povey, Daniel / Kuo, Hong-Kwang Jeff:
"XMLLR for improved speaker adaptation in speech recognition",
1705-1708.
Huang, Jing / Epstein, Mark / Matassoni, Marco:
"Effective acoustic adaptation for a distant-talking interactive TV system",
1709-1712.
Akhil, P. T. / Rath, S. P. / Umesh, S. / Sanand, D. R.:
"A computationally efficient approach to warp factor estimation in VTLN using EM algorithm and sufficient statistics",
1713-1716.
Wang, Shizhen / Lulich, Steven M. / Alwan, Abeer:
"A reliable technique for detecting the second subglottal resonance and its use in cross-language speaker adaptation",
1717-1720.
Features for Speech and Speaker Recognition
Fan, Xing / Hansen, John H. L.:
"Speaker identification for whispered speech based on frequency warping and score competition",
1313-1316.
Habib, Tania / Ottowitz, Lukas / Képesi, Marián:
"Experimental evaluation of multi-band position-pitch estimation (m-popi) algorithm for multi-speaker localization",
1317-1320.
Dhananjaya, N. / Rajendran, S. / Yegnanarayana, B.:
"Features for automatic detection of voice bars in continuous speech",
1321-1324.
Segura, Carlos / Abad, Alberto / Hernando, Javier / Nadeu, Climent:
"Speaker orientation estimation based on hybridation of GCC-PHAT and HLBR",
1325-1328.
Hou, Jun / Rabiner, Lawrence R. / Dusan, Sorin:
"Parallel and hierarchical speech feature classification using frame and segment-based methods",
1329-1332.
Varadarajan, Balakrishnan / Khudanpur, Sanjeev:
"Automatically learning speaker-independent acoustic subword units",
1333-1336.
Abdulla, Waleed H. / Zhang, Yushi:
"Human-like ears versus two-microphone array, which works better for speaker identification?",
1337-1340.
Kobayashi, Kenji / Somiya, Mitsuhiro / Nishizaki, Hiromitsu / Sekiguchi, Yoshihiro:
"Is a speech recognizer useful for characteristic analysis of classroom lecture speech?",
1341-1344.
Golipour, Ladan / O'Shaughnessy, Douglas:
"An intuitive class discriminability measure for feature selection in a speech recognition system",
1345-1348.
Qiao, Yu / Minematsu, Nobuaki:
"f-divergence is a generalized invariant measure between distributions",
1349-1352.
Giacobello, Daniele / Christensen, Mads Græsbøll / Dahl, Joachim / Jensen, Søren Holdt / Moonen, Marc:
"Sparse linear predictors for speech processing",
1353-1356.
Zhang, J. X. / Christensen, Mads Græsbøll / Dahl, Joachim / Jensen, Søren Holdt / Moonen, Marc:
"Frequency-domain parameter estimations for binary masked signals",
1357-1360.
Saito, Daisuke / Minematsu, Nobuaki / Hirose, Keikichi:
"Decomposition of rotational distortion caused by VTL difference using eigenvalues of its transformation matrix",
1361-1364.
Maragakis, Michail G. / Potamianos, Alexandros:
"Region-based vocal tract length normalization for ASR",
1365-1368.
Speaker Recognition: Kernel-Based and Session Mismatch
Okamoto, Hideki / Matsui, Tomoko / Kawanami, Hiromichi / Saruwatari, Hiroshi / Shikano, Kiyohiro:
"Speaker verification with non-audible murmur segments by combining global alignment kernel and penalized logistic regression machine",
1369-1372.
Lu, Liang / Dong, Yuan / Zhao, Xianyu / Zhao, Jian / , Chengyu Dong (2), Haila Wang (2) / Dong, Chengyu / Wang, Haila:
"Analysis of subspace within-class covariance normalization for SVM-based speaker verification",
1373-1376.
Zhao, Xianyu / Dong, Yuan / Zhao, Jian / Lu, Liang / Liu, Jiqing / Wang, Haila:
"Comparison of input and feature space nonlinear kernel nuisance attribute projections for speaker verification",
1377-1380.
Longworth, C. / Gales, M. J. F.:
"A generalised derivative kernel for speaker verification",
1381-1384.
Ferrer, Luciana:
"Modeling prior belief for speaker verification SVM systems",
1385-1388.
Charlet, Delphine / Zhao, Xianyu / Dong, Yuan:
"Convergence between SVM-based and distance-based paradigms for speaker recognition",
1389-1392.
Zhang, Shi-Xiong / Mak, Man-Wai:
"High-level speaker verification via articulatory-feature based sequence kernels and SVM",
1393-1396.
Lee, Kong-Aik / You, Changhuai / Li, Haizhou / Kinnunen, Tomi / Zhu, Donglai:
"Characterizing speech utterances for speaker verification with sequence kernel SVM",
1397-1400.
Kenny, Patrick / Dehak, Najim / Ouellet, Pierre / Gupta, Vishwa / Dumouchel, Pierre:
"Development of the primary CRIM system for the NIST 2008 speaker recognition evaluation",
1401-1404.
Vogt, Robbie / Sridharan, Sridha / Mason, Michael:
"Making confident speaker verification decisions with minimal speech",
1405-1408.
Luo, Jun / Leung, Cheung-Chi / Ferràs, Marc / Barras, Claude:
"Parallelized factor analysis and feature normalization for automatic speaker verification",
1409-1412.
Garcia-Romero, Daniel / Espy-Wilson, Carol Y.:
"Intersession variability in speaker recognition: a behind the scene analysis",
1413-1416.
Ito, Tatsuya / Hashimoto, Kei / Nankaku, Yoshihiko / Lee, Akinobu / Tokuda, Keiichi:
"Speaker recognition based on variational Bayesian method",
1417-1420.
Matrouf, Driss / Bonastre, Jean-François / Mezaache, Salah Eddine:
"Factor analysis multi-session training constraint in session compensation for speaker verification",
1421-1424.
Liu, Ying / Russell, Martin J. / Carey, Michael J.:
"The role of 'delta' features in speaker verification",
1425-1428.
Broadcast Transcription Systems
Lamel, Lori / Messaoudi, Abdel. / Gauvain, Jean-Luc:
"Investigating morphological decomposition for transcription of Arabic broadcast news and broadcast conversation data",
1429-1432.
Fousek, Petr / Lamel, Lori / Gauvain, Jean-Luc:
"Transcribing broadcast data using MLP features",
1433-1436.
Vergyri, D. / Mandal, A. / Wang, Wen / Stolcke, Andreas / Zheng, Jing / Graciarena, Martin / Rybach, D. / Gollan, Christian / Schlüter, Ralf / Kirchhoff, Katrin / Faria, A. / Morgan, Nelson:
"Development of the SRI/nightingale Arabic ASR system",
1437-1440.
Gollan, Christian / Ney, Hermann:
"Towards automatic learning in LVCSR: rapid development of a Persian broadcast transcription system",
1441-1444.
Hsiao, Roger / Fuhs, Mark / Tam, Yik-Cheung / Jin, Qin / Schultz, Tanja:
"The CMU-interACT 2008 Mandarin transcription system",
1445-1448.
Deoras, Anoop / Fritsch, Jürgen:
"Decoding-time prediction of non-verbalized punctuation",
1449-1452.
Voice Conversion and Modification
Helander, Elina / Schwarz, Jan / Nurminen, Jani / Silen, Hanna / Gabbouj, Moncef:
"On the impact of alignment on voice conversion performance",
1453-1456.
Pozo, Arantza del / Young, Steve:
"The linear transformation of LF glottal waveforms for voice conversion",
1457-1460.
Tani, Daisuke / Toda, Tomoki / Ohtani, Yamato / Saruwatari, Hiroshi / Shikano, Kiyohiro:
"Maximum a posteriori adaptation for many-to-one eigenvoice conversion",
1461-1463.
Tran, Viet-Anh / Bailly, Gérard / Loevenbruck, Hélène / Jutten, Christian:
"Improvement to a NAM captured whisper-to-speech system",
1465-1468.
Tran, Huy Dat / Li, Haizhou:
"Speaker identification in noise mismatch conditions based on jump function Kolmogorov analysis in wavelet domain",
1469-1472.
Phonetics: General
Scharenborg, Odette:
"Modelling fine-phonetic detail in a computational model of word recognition",
1473-1476.
Strik, Helmer / Doremalen, Joost van / Cucchiarini, Catia:
"Pronunciation reduction: how it relates to speech style, gender, and age",
1477-1480.
Yegnanarayana, B. / Rajendran, S. / Worku, Hussien Seid / Dhananjaya, N.:
"Analysis of glottal stops in speech signals",
1481-1484.
Neiberg, Daniel / Ananthakrishnan, G. / Engwall, Olov:
"The acoustic to articulation mapping: non-linear or non-unique?",
1485-1488.
Zhuang, Xiaodan / Nam, Hosung / Hasegawa-Johnson, Mark / Goldstein, Louis M. / Saltzman, Elliot:
"The entropy of the articulatory phonological code: recognizing gestures from tract variables",
1489-1492.
Special Session: Forensic Speaker Recognition - Traditional and Automatic Approaches
Ramos, Daniel / Gonzalez-Rodriguez, Joaquin / Gonzalez-Dominguez, Javier / Lucena-Molina, Jose Juan:
"Addressing database mismatch in forensic speaker recognition with Ahumada III: a public real-casework database in Spanish",
1493-1496.
Thiruvaran, Tharmarajah / Ambikairajah, Eliathamby / Epps, Julien:
"FM features for automatic forensic speaker recognition",
1497-1500.
Morrison, Geoffrey Stewart / Kinoshita, Yuko:
"Automatic-type calibration of traditionally derived likelihood ratios: forensic analysis of australian English /o/ formant trajectories",
1501-1504.
Becker, Timo / Jessen, Michael / Grigoras, Catalin:
"Forensic speaker verification using formant features and Gaussian mixture models",
1505-1508.
Shriberg, Elizabeth / Stolcke, Andreas:
"The case for automatic higher-level features in forensic speaker recognition",
1509-1512.
Automatic Speech Recognition: Features I, II
Lee, Kye-Hwan / Kang, Sang-Ick / Song, Ji-Hyun / Chang, Joon-Hyuk:
"Group delay function for improved gender identification",
1513-1516.
Razik, Joseph / Mella, Odile / Fohr, Dominique / Haton, Jean-Paul:
"Frame-synchronous and local confidence measures for on-the-fly automatic speech recognition",
1517-1520.
Thomas, Samuel / Ganapathy, Sriram / Hermansky, Hynek:
"Hilbert envelope based spectro-temporal features for phoneme recognition in telephone speech",
1521-1524.
Sangwan, Abhijeet / Ikeno, Ayako / Hansen, John H. L.:
"Evidence of coarticulation in a phonological feature detection system",
1525-1528.
Huda, Mohammad Nurul / Katsurada, Kouichi / Nitta, Tsuneo:
"Phoneme recognition based on hybrid neural networks with inhibition/enhancement of distinctive phonetic feature (DPF) trajectories",
1529-1532.
Hu, Hongbing / Zahorian, Stephen A.:
"A neural network based nonlinear feature transformation for speech recognition",
1533-1536.
Ramya, R. / Hegde, Rajesh M. / Murthy, Hema A.:
"Significance of group delay based acoustic features in the linguistic search space for robust speech recognition",
1537-1540.
Abbasian, Houman / Nasersharif, Babak / Akbari, Ahmad:
"Genetic programming based optimization of class-dependent PCA for extracting robust MFCC",
1541-1544.
Narayana, K. V. S. / Sreenivas, T. V.:
"Comparison of AM-FM based features for robust speech recognition",
1545-1548.
Frankel, Joe / Wang, Dong / King, Simon:
"Growing bottleneck features for tandem ASR",
1549.
Karjigi, Veena / Rao, Preeti:
"Landmark based recognition of stops: acoustic attributes versus smoothed spectra",
1550-1553.
Kogure, Satoru / Nishizaki, Hiromitsu / Tsuchiya, Masatoshi / Yamamoto, Kazumasa / Togashi, Shingo / Nakagawa, Seiichi:
"Speech recognition performance of CJLC: corpus of Japanese lecture contents",
1554-1557.
Valente, Fabio / Hermansky, Hynek:
"On the combination of auditory and modulation frequency channels for ASR applications",
2242-2245.
Tyagi, Vivek:
"Tandem processing of fepstrum features",
2246-2249.
Chang, Shuo-Yiin / Lee, Lin-shan:
"Data-driven clustered hierarchical tandem system for LVCSR",
2250-2253.
Lee, Hung-Shin / Chen, Berlin:
"Linear discriminant feature extraction using weighted classification confusion information",
2254-2257.
Sanand, D. R. / Balaji, V. / Sandhya, Rani R. / Umesh, S.:
"Use of spectral centre of gravity for generating speaker invariant features for automatic speech recognition",
2258-2261.
Fukuda, Takashi / Ichikawa, Osamu / Nishimura, Masafumi:
"Short- and long-term dynamic features for robust speech recognition",
2262-2265.
Speech Resources and Technology Evaluation
Kawahara, Tatsuya / Setoguchi, Hisao / Takanashi, Katsuya / Ishizuka, Kentaro / Araki, Shoko:
"Multi-modal recording, analysis and indexing of poster sessions",
1622-1625.
Matoušek, Jindřich / Romportl, Jan:
"Automatic pitch-synchronous phonetic segmentation",
1626-1629.
Shen, Wade / Olive, Joseph / Jones, Douglas:
"Two protocols comparing human and machine phonetic recognition performance in conversational speech",
1630-1633.
Kato, Tomoyuki / Okamoto, Jun / Shozakai, Makoto:
"Analysis of drivers' speech in a car environment",
1634-1637.
Schuppler, Barbara / Ernestus, Mirjam / Scharenborg, Odette / Boves, Lou:
"Preparing a corpus of dutch spontaneous dialogues for automatic phonetic analysis",
1638-1641.
Kotnik, Bojan / Sendorek, Pierre / Astrov, Sergey / Koc, Turgay / Ciloglu, Tolga / Fernández, Laura Docío / Banga, Eduardo Rodríguez / Höge, Harald / Kačič, Zdravko:
"Evaluation of voice activity and voicing detection",
1642-1645.
Draxler, Christoph / Jänsch, Klaus:
"Wikispeech - a content management system for speech databases",
1646-1649.
Demenko, Grazyna / Bachan, J. / Möbius, Bernd / Klessa, K. / Szymański, M. / Grocholewski, Stefan:
"Development and evaluation of Polish speech corpus for unit selection speech synthesis systems",
1650-1653.
Pitrelli, John F. / Lewis, Burn L. / Epstein, Edward A. / Quinn, Jerome L. / Ramaswamy, Ganesh:
"A data format enabling interoperation of speech recognition, translation and information extraction engines: the GALE type system",
1654-1657.
Li, Wei / Huo, Qiang:
"A rank-predicted pseudo-greedy approach to efficient text selection from large-scale corpus for maximum coverage of target units",
1658-1661.
Engelbrecht, Klaus-Peter / Kruppa, Michael / Möller, Sebastian / Quade, Michael:
"Memo workbench for semi-automated usability testing",
1662-1665.
Yamakawa, Kimiko / Matsui, Tomoko / Itahashi, Shuichi:
"MDS-based visualization method for multiple speech corpora",
1666-1669.
Busso, Carlos / Narayanan, Shrikanth S.:
"Scripted dialogs versus improvisation: lessons learned about emotional elicitation techniques from the IEMOCAP database",
1670-1673.
Applications in Education and Learning I, II
Deshmukh, Om D. / Joshi, Sachindra / Verma, Ashish:
"Automatic pronunciation evaluation and classification",
1721-1724.
Bolanos, Daniel / Ward, Wayne / Wise, Barbara / Vuuren, Sarel van:
"Pronunciation error detection techniques for children's speech",
1725-1728.
Wang, Lan / Feng, Xin / Meng, Helen M.:
"Automatic generation and pruning of phonetic mispronunciations to support computer-aided pronunciation training",
1729-1732.
Li, Xiaolong / Deng, Li / Ju, Yun-Cheng / Acero, Alex:
"Automatic children's reading tutor on hand-held devices",
1733-1736.
Wang, Hongcui / Kawahara, Tatsuya:
"A Japanese CALL system based on dynamic question generation and error prediction for ASR",
1737-1740.
Black, Matthew / Tepperman, Joseph / Lee, Sungbok / Narayanan, Shrikanth S.:
"Estimation of children's reading ability by fusion of automatic pronunciation verification and fluency detection",
2779-2782.
Black, Matthew / Tepperman, Joseph / Kazemzadeh, Abe / Lee, Sungbok / Narayanan, Shrikanth S.:
"Pronunciation verification of English letter-sounds in preliterate children",
2783-2786.
Harrison, Alissa M. / Lau, Wing Yiu / Meng, Helen M. / Wang, Lan:
"Improving mispronunciation detection and diagnosis of learners' speech with context-sensitive phonological rules based on language transfer",
2787-2790.
Cucchiarini, Catia / Doremalen, Joost van / Strik, Helmer:
"DISCO: development and integration of speech technology into courseware for language learning",
2791-2794.
Samir, Abdurrahman / Duchateau, Jacques / Van hamme, Hugo:
"Discriminative model combination and language model selection in a reading tutor for children",
2795-2798.
Pedersen, Jakob Schou / Larsen, Lars Bo / Lindberg, Børge:
"Usability of ASR-based reading training for dyslexics",
2799-2802.
Togashi, Shingo / Nakagawa, Seiichi:
"A browsing system for classroom lecture speech",
2803-2806.
Luo, Dean / Shimomura, Naoya / Minematsu, Nobuaki / Yamauchi, Yutaka / Hirose, Keikichi:
"Automatic pronunciation evaluation of language learners' utterances generated through shadowing",
2807-2810.
Chevalier, Sylvain / Cao, Zhenhai:
"Application and evaluation of speech technologies in language learning: experiments with the Saybot player",
2811-2814.
Ge, Fengpei / Pan, Fuping / Liu, Changliang / Dong, Bin / Yan, Yonghong:
"Forward optimal modeling of acoustic confusions in Mandarin CALL system",
2815-2818.
Ito, Akinori / Tsutsui, Ryohei / Makino, Shozo / Suzuki, Motoyuki:
"Recognition of English utterances with grammatical and lexical mistakes for dialogue-based CALL system",
2819-2822.
Speech Pathologies
Kim, Heejin / Hasegawa-Johnson, Mark / Perlman, Adrienne / Gunderson, Jon / Huang, Thomas S. / Watkin, Kenneth / Frame, Simone:
"Dysarthric speech database for universal access research",
1741-1744.
Middag, Catherine / Nuffelen, Gwen Van / Martens, Jean-Pierre / Bodt, Marc De:
"Objective intelligibility assessment of pathological speakers",
1745-1748.
Ma, Joan K.-Y. / Whitehill, Tara L.:
"Quantitative analysis of intonation patterns produced by Cantonese speakers with Parkinson's disease: a preliminary study",
1749-1752.
Bruijn, Marieke de / Leeuw, Irma Verdonck de / Bosch, Louis ten / Kuik, Joop / Quene, Hugo / Boves, Lou / Langendijk, Hans / Leemans, Rene:
"Phonetic-acoustic and feature analyses by a neural network to assess speech quality in patients treated for head and neck cancer",
1753-1756.
Maier, Andreas / Hönig, Florian / Hacker, Christian / Schuster, Maria / Nöth, Elmar:
"Automatic evaluation of characteristic speech disorders in children with cleft lip and palate",
1757-1760.
Morales, Omar Caballero / Cox, Stephen:
"Application of weighted finite-state transducers to improve recognition accuracy for dysarthric speech",
1761-1764.
Special Session: Consonant Challenge . Human-Machine Comparisons of Consonant Recognition in Noise
Cooke, Martin / Scharenborg, Odette:
"The interspeech 2008 consonant challenge",
1765-1768.
Borgström, Bengt J. / Alwan, Abeer:
"HMM-based estimation of unreliable spectral components for noise robust speech recognition",
1769-1772.
Yoon, Jae Sam / Park, Ji Hun / Kim, Hong Kook:
"Gammatone-domain model combination for consonant recognition in noisy environments",
1773-1776.
Jančovič, Peter / Kokuer, Munevver:
"On the mask modeling and feature representation in the missing-feature ASR: evaluation on the Consonant Challenge",
1777-1780.
Garcia Lecumberri, M. Luisa / Cooke, Martin / Cutugno, Francesco / Giurgiu, Mircea / Meyer, Bernd T. / Scharenborg, Odette / Dommelen, Wim van / Volin, Jan:
"The non-native consonant challenge for european languages",
1781-1784.
Gemmeke, J. F. / Cranen, Bert:
"Noise reduction through compressed sensing",
1785-1788.
Schuller, Björn / Wöllmer, Martin / Moosmayr, Tobias / Rigoll, Gerhard:
"Speech recognition in noisy environments using a switching linear dynamic model for feature enhancement",
1789-1792.
Hodoshima, Nao / Yoshida, Wataru / Arai, Takayuki:
"Improving consonant identification in noise and reverberation by steady-state suppression as a preprocessing approach",
1793-1796.
Lobdell, Bryce E. / Hasegawa-Johnson, Mark / Allen, Jont B.:
"Human speech perception and feature extraction",
1797-1800.
Automatic Speech Recognition: Lexical and Prosodic Models
Tan, Tien-Ping / Besacier, Laurent:
"Improving pronunciation modeling for non-native speech recognition",
1801-1804.
Aronowitz, Hagai:
"Online vocabulary adaptation using contextual information and information retrieval",
1805-1808.
Onishi, Yoshifumi:
"Lexicon expansion using pronunciation variations extracted on the basis of speaker-related deviation in recognition error statistics",
1809-1812.
Tepperman, Joseph / Narayanan, Shrikanth S.:
"Better nonnative intonation scores through prosodic theory",
1813-1816.
Garner, Philip N.:
"Silence models in weighted finite-state transducers",
1817-1820.
Sasada, Tetsuro / Mori, Shinsuke / Kawahara, Tatsuya:
"Extracting word-pronunciation pairs from comparable set of text and speech",
1821-1824.
Speaker Recognition: Adverse Conditions and Forensics
Jin, Qin / Schultz, Tanja:
"Robust far-field speaker identification under mismatched conditions",
1893-1896.
Huang, Chien-Lin / Ma, Bin / Wu, Chung-Hsien / Mak, Brian / Li, Haizhou:
"Robust speaker verification using short-time frequency with long-time window and fusion of multi-resolutions",
1897-1900.
Kwon, C. H. / Choi, J. K. / Ambikairajah, Eliathamby:
"Performance improvement of text-independent speaker verification systems based on histogram enhancement in noisy environments",
1901-1904.
Suh, Jun-Won / Angkititrakul, Pongtep / Hansen, John H. L.:
"Filling acoustic holes through leveraged uncorellated GMMs for in-set/out-of-set speaker recognition",
1905-1908.
Kim, Wooil / Hansen, John H. L.:
"Missing-feature method for speaker recognition in band-restricted conditions",
1909-1912.
Zhang, Yushi / Abdulla, Waleed H.:
"Robust speaker identification using cross-correlation GTF-ICA feature",
1913-1916.
Amino, Kanae / Arai, Takayuki:
"Perceptual speaker identification using monosyllabic stimuli - effects of the nucleus vowels and speaker characteristics contained in nasals",
1917-1920.
Das, Amitava / Chittaranjan, Gokul:
"Text-dependent speaker recognition by efficient capture of speaker dynamics in compressed time-frequency representations of speech",
1921-1924.
Das, Amitava / Chittaranjan, Gokul / Anumanchipalli, Gopala K.:
"Usefulness of text-conditioning and a new database for text-dependent speaker recognition research",
1925-1928.
Tsuge, Satoru / Osanai, Takashi / Makinae, Hisanori / Kamada, Toshiaki / Fukumi, Minoru / Kuroiwa, Shingo:
"Combination method of bone-conduction speech and air-conduction speech for speaker recognition",
1929-1932.
Toledano, Doroteo T. / Hernandez-Lopez, Daniel / Esteve-Elizalde, Cristina / Gonzalez-Rodriguez, Joaquin / Pozo, Ruben Fernandez / Gomez, Luis Hernandez:
"MAP and sub-word level t-norm for text-dependent speaker recognition",
1933-1936.
Zhang, Cuiling / Morrison, Geoffrey Stewart / Rose, Philip:
"Forensic speaker recognition in Chinese: a multivariate likelihood ratio discrimination on /i/ and /y/",
1937-1940.
Ishihara, Shunichi / Kinoshita, Yuko:
"How many do we need? exploration of the population size effect on the performance of forensic speaker classification",
1941-1944.
Leung, Cheung-Chi / Ferras, Marc / Barras, Claude / Gauvain, Jean-Luc:
"Comparing prosodic models for speaker recognition",
1945-1948.
Zieger, Christian / Omologo, Maurizio:
"Combination of clean and contaminated GMM/SVM for far-field text-independent speaker verification",
1949-1952.
Phonetics: Development, Learning, Cross-Language and Language-Specific
Braun, Bettina / Lemhofer, Kristin / Cutler, Anne:
"English word stress as produced by English and dutch speakers: the role of segmental and suprasegmental differences",
1953.
Reinisch, Eva / Jesse, Alexandra / McQueen, James M.:
"The strength of stress-related lexical competition depends on the presence of first-syllable stress",
1954.
Ishikawa, Keiichi / Nomura, Jun:
"Word stress placement by native speakers and Japanese learners of English",
1955-1958.
Bunnell, H. Timothy / Lilley, Jason:
"Schwa variants in american English",
1959-1962.
Yuan, Jiahong:
"Covariations of English segmental durations across speakers",
1963.
Joto, Akiyo:
"The intelligibility of the English vowel /ʌ/ produced by native speakers of Japanese and its relations to the acoustic characteristics",
1964-1967.
Weiss, Benjamin:
"Rate dependent spectral reduction for voiceless fricatives",
1968.
Ojala, Stina / Aaltonen, Olli / Salakoski, Tapio:
"Investigating perception of places of articulation in sign and speech",
1969.
Tyler, Michael D. / Best, Catherine T. / Goldstein, Louis M. / Antoniou, Mark / Krebs-Lazendic, Lidija:
"Six- and twelve-month-olds' discrimination of native versus non-native between- and within-organ fricative place contrasts",
1970.
Lam, Christa / Kitamura, Christine:
""your baby can't hear you": how mothers talk to infants with simulated hearing loss",
1971.
Klintfors, Eeva / Sundberg, Ulla / Lacerda, Francisco / Marklund, Ellen / Gustavsson, Lisa / Bjursäter, Ulla / Schwarz, Iris-Corinna / Söderlund, Göran:
"Development of communicative skills in 8- to 16-month-old children: a longitudinal study",
1972-1975.
Gustavsson, Lisa / Lacerda, Francisco:
"Vocal imitation in early language acquisition",
1976-1979.
Rasanen, Okko / Laine, Unto K. / Altosaar, Toomas:
"Computational language acquisition by statistical bottom-up processing",
1980-1983.
Katagiri, Noriaki / Kawai, Goh:
"Lexical analyses of native and non-native English language instructor speech based on a six-month co-taught classroom video corpus",
1984-1987.
Masuda, Hinako / Arai, Takayuki:
"Perception and production of consonant clusters in Japanese-English bilingual and Japanese monolingual speakers",
1988-1991.
Multimodal Signal Processing
Moubayed, Samer Al / Smet, Michael De / Van hamme, Hugo:
"Lip synchronization: from phone lattice to PCA eigen-projections using neural networks",
2016-2019.
Takahashi, Ryoei / Ohishi, Yasunori / Kitaoka, Norihide / Takeda, Kazuya:
"Building and combining document and music spaces for music query-by-webpage system",
2020-2023.
Wang, Lei / Huang, Shen / Hu, Sheng / Liang, Jiaen / Xu, Bo:
"Improving searching speed and accuracy of query by humming system based on three methods: feature fusion, candidates set reduction and multiple similarity measurement rescoring",
2024-2027.
Hueber, Thomas / Chollet, Gérard / Denby, Bruce / Dreyfus, Gérard / Stone, Maureen:
"Towards a segmental vocoder driven by ultrasound and optical images of the tongue and lips",
2028-2031.
Hueber, Thomas / Chollet, Gérard / Denby, Bruce / Dreyfus, Gérard / Stone, Maureen:
"Phone recognition from ultrasound and optical video sequences for a silent speech interface",
2032-2035.
Trmal, Jan / Hrúz, Marek / Zelinka, Jan / Campr, Pavel / , Luděk Müller / Müller, Luděk:
"Feature space transforms for Czech sign-language recognition",
2036-2039.
-Speech Perception
Davis, Chris / Kim, Jeesun / Barbaro, Angelo:
"Masked speech priming: no priming in dense neighbourhoods",
2040-2043.
Ali, Azra N.:
"Integration of audiovisual speech and priming effects",
2044-2047.
Zevin, Jason D. / Farmer, Thomas A.:
"Similarity between vowels influences response execution in word identification",
2048-2051.
Lentz, Tom:
"Phonotactically well-formed onset clusters as processing units in word recognition",
2052-2055.
Cutler, Anne / McQueen, James M. / Butterfield, Sally / Norris, Dennis:
"Prelexically-driven perceptual retuning of phoneme boundaries",
2056.
Cvejic, Erin / Kim, Jeesun / Davis, Chris:
"Visual speech modifies the phoneme restoration effect",
2057.
Evaluation and Standardisation of Spoken-Language Technology
Cao, Chuan / Li, Ming / Liu, Jian / Yan, Yonghong:
"An objective singing evaluation approach by relating acoustic measurements to perceptual ratings",
2058-2061.
Gautier-Turbin, Valerie / Gros, Laetitia:
"On the perceived quality of noise reduced signals",
2062-2065.
Murthy, Uma / Pitrelli, John F. / Ramaswamy, Ganesh / , Martin Franz (2), Burn L. Lewis (2) / Franz, Martin / Lewis, Burn L.:
"A methodology and tool suite for evaluation of accuracy of interoperating statistical natural language processing engines",
2066-2069.
Park, Youngja / Patwardhan, Siddharth / Visweswariah, Karthik / Gates, Stephen C.:
"An empirical analysis of word error rate and keyword error rate",
2070-2073.
Durin, Virginie / Gros, Laetitia:
"Measuring speech quality impact on tasks performance",
2074-2077.
Soronen, Hannu / Turunen, Markku / Hakulinen, Jaakko:
"Voice commands in home environment - a consumer survey",
2078-2081.
Automatic Speech Recognition: Search Methods
Bouselmi, Ghazi / Cai, Jun:
"Extended partial distance elimination and dynamic Gaussian selection for fast likelihood computation",
2082-2085.
Driesen, Joris / Van hamme, Hugo:
"Improving the multigram algorithm by using lattices as input",
2086-2089.
Tang, Min / Cristo, Philippe Di:
"Backward Viterbi beam search for utilizing dynamic task complexity information",
2090-2093.
Bertoldi, Nicola / Federico, Marcello / Falavigna, Daniele / Gerosa, Matteo:
"Fast speech decoding through phone confusion networks",
2094-2097.
Gu, Liang / Xue, Jian / Cui, Xiaodong / Gao, Yuqing:
"High-performance low-latency speech recognition via multi-layered feature streaming and fast Gaussian computation",
2098-2101.
Bourke, Patrick J. / Rutenbar, Rob A.:
"A low-power hardware search architecture for speech recognition",
2102-2105.
Mamou, Jonathan / Ramabhadran, Bhuvana:
"Phonetic query expansion for spoken document retrieval",
2106-2109.
Oonishi, Tasuku / Dixon, Paul R. / Iwano, Koji / Furui, Sadaoki:
"Implementation and evaluation of fast on-the-fly WFST composition algorithms",
2110-2113.
Speech Synthesis: Prosody and Emotion I, II
Takeda, Shoichi / Yasuda, Yuuri / Isobe, Risako / Kiryu, Shogo / Tsuru, Makiko:
"Analysis of voice-quality features of speech that expresses "anger", "joy", and "sadness" uttered by radio actors and actresses",
2114-2117.
Badino, Leonardo / Clark, Robert A. J. / Strom, Volker:
"Including pitch accent optionality in unit selection text-to-speech synthesis",
2118-2121.
Inanoglu, Zeynep / Young, Steve:
"Emotion conversion using F0 segment selection",
2122-2125.
Qian, Yao / Liang, Hui / Soong, Frank K.:
"Generating natural F0 trajectory with additive trees",
2126-2129.
Boidin, Cédric / Boeffard, Olivier:
"Generating intonation from a mixed CART-HMM model for speech synthesis",
2130-2133.
Aguero, Pablo Daniel / Bonafonte, Antonio / Yu, Lu / Tulli, Juan Carlos:
"Intonation modeling of Mandarin Chinese using a superpositional approach",
2134-2137.
Tang, Hao / Zhou, Xi / Odisio, Matthias / Hasegawa-Johnson, Mark / Huang, Thomas S.:
"Two-stage prosody prediction for emotional text-to-speech synthesis",
2138-2141.
Hu, Yue-Ning / Chu, Min / Huang, Chao / Zhang, Yan-Ning:
"Prosody boundary detection through context-dependent position models",
2142-2145.
Gao, Boyang / Qian, Yao / Wu, Zhizheng / Soong, Frank K.:
"Duration refinement by jointly optimizing state and longer unit likelihood",
2266-2269.
Thangthai, Ausdang / Thatphithakkul, Nattanun / Wutiwiwatchai, Chai / Rugchatjaroen, Anocha / Saychum, Sittipong:
"T-tilt: a modified tilt model for F0 analysis and synthesis in tonal languages",
2270-2273.
Latorre, Javier / Akamine, Masami:
"Multilevel parametric-base F0 model for speech synthesis",
2274-2277.
Adell, Jordi / Bonafonte, Antonio / Escudero-Mancebo, David:
"On the generation of synthetic disfluent speech: local prosodic modifications caused by the insertion of editing terms",
2278-2281.
Türk, Oytun / Schröder, Marc:
"A comparison of voice conversion methods for transforming voice quality in emotional speech synthesis",
2282-2285.
Tepperman, Joseph / Narayanan, Shrikanth S.:
"Tree grammars as models of prosodic structure",
2286-2289.
Language Information Retrieval Systems
Meng, Sha / Shao, Jian / Yu, Roger Peng / Liu, Jia / Seide, Frank:
"Addressing the out-of-vocabulary problem for large-scale Chinese spoken term detection",
2146-2149.
Shao, Jian / Yu, Roger Peng / Zhao, Qingwei / Yan, Yonghong / Seide, Frank:
"Towards vocabulary-independent speech indexing for large-scale repositories",
2150-2153.
Moreno-Daniel, A. / Wilpon, J. / Juang, B.-H. / Parthasarathy, S.:
"Towards the integration of automatic speech recognition and information retrieval for spoken query processing",
2154-2157.
Turunen, Ville T.:
"Reducing the effect of OOV query words by using morph-based spoken document retrieval",
2158-2161.
Wu, Meng-Sung / Chien, Jen-Tzung:
"Bayesian latent topic clustering model",
2162-2165.
Akiba, Tomoyosi / Yokota, Yusuke:
"Spoken document retrieval by translating recognition candidates into correct transcriptions",
2166-2169.
Drioli, Carlo / Cosi, Piero:
"Audio indexing for an interactive Italian literature management system",
2170.
Terao, Makoto / Koshinaka, Takafumi / Ando, Shinichi / Isotani, Ryosuke / Okumura, Akitoshi:
"Open-vocabulary spoken-document retrieval based on query expansion using related web documents",
2171-2174.
Chaudhari, Upendra / Kuo, Hong-Kwang Jeff / Kingsbury, Brian:
"Discriminative graph training for ultra-fast low-footprint speech indexing",
2175-2178.
Ju, Yun-Cheng / Odell, Julian:
"A language-modeling approach to inverse text normalization and data cleanup for multimodal voice search applications",
2179-2182.
Amaral, Rui / Trancoso, Isabel:
"Topic segmentation and indexation in a media watch system",
2183-2186.
Olsson, J. Scott:
"Vocabulary independent discriminative term frequency estimation",
2187-2190.
Lin, Hui / Stupakov, Alex / Bilmes, Jeff A.:
"Spoken keyword spotting via multi-lattice alignment",
2191-2194.
Iwata, Kenji / Shinoda, Koichi / Furui, Sadaoki:
"Robust spoken term detection using combination of phone-based and word-based recognition",
2195-2198.
Applications for the Aged and Handicapped
D'Haro, Luis Fernando / San-Segundo, Ruben / Cordoba, Ricardo de / Bungeroth, Jan / Stein, Daniel / Ney, Hermann:
"Language model adaptation for a speech to sign language translation system using web frequencies and a MAP framework",
2199-2202.
Beskow, Jonas / Granström, Björn / Nordqvist, Peter / Al Moubayed, Samer / Salvi, Giampiero / Herzke, Tobias / Schulz, Arne:
"Hearing at home - communication support in home environments for hearing impaired persons",
2203-2206.
Taft, Daniel A. / Grayden, David B. / Burkitt, Anthony N.:
"Traveling wave based group delays for cochlear implant speech processing",
2207.
Smith, Damien J. / Burnham, Denis:
"Multimodal perception of Mandarin tone for cochlear implant users",
2208.
Nakamura, Keigo / Toda, Tomoki / Nakajima, Yoshitaka / Saruwatari, Hiroshi / Shikano, Kiyohiro:
"Evaluation of speaking-aid system with voice conversion for laryngectomees toward its use in practical environments",
2209-2212.
McKechnie, Jacqueline / Ballard, Kirrie J. / Robin, Donald A. / Jacks, Adam / Palethorpe, Sallyanne / Rosen, Kristin M.:
"An acoustic typology of apraxic speech - toward reliable diagnosis",
2213.
Pouchoulin, G. / Fredouille, C. / Bonastre, Jean-François / Ghio, A. / Giovanni, A.:
"Dysphonic voices and the 0-3000 hz frequency band",
2214-2217.
Yin, Shou-Chun / Rose, Richard C. / Saz, Oscar / Lleida, Eduardo:
"Verifying pronunciation accuracy from speakers with neuromuscular disorders",
2218-2221.
Alpan, A. / Maryn, Y. / Grenez, F. / Kacha, A. / Schoentgen, J.:
"Multi-band and multi-cue analyses of disordered connected speech",
2222-2225.
Carmichael, James / Wan, Vincent / Green, Phil:
"Combining neural network and rule-based systems for dysarthria diagnosis",
2226-2229.
D'Arcy, Shona / Rapcan, Viliam / Penard, Nils / Morris, Margaret E. / Robertson, Ian H. / Reilly, Richard B.:
"Speech as a means of monitoring cognitive function of elderly speakers",
2230-2233.
Matsumasa, Hironori / Takiguchi, Tetsuya / Ariki, Yasuo / Li, Ichao / Nakabayashi, Toshitaka:
"Integration of metamodel and acoustic model for speech recognition",
2234-2237.
Fraga, Francisco J. / Prates, Leticia P. Costa S. / , Maria Cecilia M. Iorio (2) / Iorio, Maria Cecilia M.:
"Frequency compression/transposition of fricative consonants for the hearing impaired with high-frequency dead regions",
2238-2241.
Human Speech Production
Lee, Sungbok / Kato, Tsuneo / Narayanan, Shrikanth S.:
"Relation between geometry and kinematics of articulatory trajectory associated with emotional speech production",
2290-2293.
Carne, Michael J.:
"Intrinsic consonantal F0 perturbation in 3-way VOT contrast and its implications for aspiration-conditioned tonal split: evidence from Vietnamese",
2294-2297.
Fang, Qiang / Fujita, Satoru / Lu, Xugang / Dang, Jianwu:
"A model based investigation of activation patterns of the tongue muscles for vowel production",
2298-2301.
Garnier, Maeva / Wolfe, Joe / Henrich, Nathalie / Smith, John:
"Interrelationship between vocal effort and vocal tract acoustics: a pilot study",
2302-2305.
Qin, Chao / Carreira-Perpinan, Miguel A. / Richmond, Korin / Wrench, Alan / Renals, Steve:
"Predicting tongue shapes from a few landmark locations",
2306-2309.
Special Session: LIPS 2008 - Visual Speech Synthesis Challenge
Theobald, Barry-John / Fagel, Sascha / Bailly, Gérard / Elisei, Frédéric:
"LIPS2008: visual speech synthesis challenge",
2310-2313.
Hofer, Gregor / Yamagishi, Junichi / Shimodaira, Hiroshi:
"Speech-driven lip motion generation with a trajectory HMM",
2314-2317.
Bailly, Gérard / Govokhina, Oxana / Breton, Gaspard / Elisei, Frédéric / Savariaux, Christophe:
"A trainable trajectory formation model TD-HMM parameterized for the LIPS 2008 challenge",
2318-2321.
Theobald, Barry-John / Cawley, Gavin / Bangham, Andrew / Matthews, Iain / Wilkinson, Nicholas:
"Comparing text-driven and speech-driven visual speech synthesisers",
2322.
Zoric, Goranka / Cerekovic, Aleksandra / Pandzic, Igor S.:
"Automatic lip synchronization by speech signal analysis",
2323.
Fagel, Sascha:
"MASSY speaks English: adaptation and evaluation of a talking head",
2324.
Fagel, Sascha / Elisei, Frédéric / Bailly, Gérard:
"From 3-d speaker cloning to text-to-audiovisual-speech",
2325.
Krňoul, Zdeněk / Železný, Miloš:
"A development of Czech talking head",
2326-2329.
Liu, Kang / Ostermann, Joern:
"Realistic facial animation system for interactive services",
2330-2333.
Yan, Juan / Xie, Xiang / Hu, Hao:
"Speech-driven 3d facial animation for mobile entertainment",
2334-2337.
Wang, Lijuan / Qian, Xiaojun / Ma, Lei / Qian, Yao / Chen, Yining / Soong, Frank K.:
"A real-time text to audio-visual speech synthesis system",
2338-2341.
Matusov, Evgeny / Hoffmeister, Björn / Ney, Hermann:
"Spoken language translation systems ************ ASR word lattice translation with exhaustive reordering is possible",
2342-2345.
Zheng, Jing / Wang, Wen / Ayan, Necip Fazil:
"Development of SRI's translation systems for broadcast news and broadcast conversations",
2346-2349.
Sarikaya, Ruhi / Deng, Yonggang / Afify, Mohamed / Kingsbury, Brian / Gao, Yuqing:
"Machine translation in continuous space",
2350-2353.
Lavecchia, Caroline / Langlois, David / Smaïli, Kamel:
"Discovering phrases in machine translation by simulated annealing",
2354-2357.
Reddy, Aarthi / Rose, Richard C.:
"Towards domain independence in machine aided human translation",
2358-2361.
Lane, Ian R. / Waibel, Alex:
"Class-based statistical machine translation for field maintainable speech-to-speech translation",
2362-2365.
Spoken Language: Parsing and Summarisation
Maskey, Sameer R. / Rosenberg, Andrew / Hirschberg, Julia:
"Intonational phrases for speech summarization",
2430-2433.
Riedhammer, Korbinian / Gillick, Dan / Favre, Benoit / Hakkani-Tür, Dilek:
"Packing the meeting summarization knapsack",
2434-2437.
Fujii, Yasuhisa / Yamamoto, Kazumasa / Kitaoka, Norihide / Nakagawa, Seiichi:
"Class lecture summarization taking into account consecutiveness of important sentences",
2438-2441.
Zhu, Xiaodan / He, Xuming / Munteanu, Cosmin / Penn, Gerald:
"Using latent Dirichlet allocation to incorporate domain knowledge for topic transition detection",
2443-2445.
Wang, Wen:
"Weakly supervised training for parsing Mandarin broadcast transcripts",
2446-2449.
Plank, Barbara / Sima'an, Khalil:
"Parsing with subdomain instance weighting from raw corpora",
2540.
Ohno, Tomohiro / Matsubara, Shigeki / Kashioka, Hideki / Inagaki, Yasuyoshi:
"Dependency parsing of Japanese spoken monologue based on clause-starts detection",
2454-2457.
Gajjar, Mrugesh R. / Govindarajan, R. / Sreenivas, T. V.:
"Online unsupervised pattern discovery in speech using parallelization",
2458-2461.
Multimodal Interfaces
Melto, Aleksi / Turunen, Markku / Hakulinen, Jaakko / Kainulainen, Anssi / Heimonen, Tomi:
"A comparison of input entry rates in a multimodal mobile application",
2462-2465.
Turunen, Markku / Hakulinen, Jaakko / Smith, Cameron / Charlton, Daniel / Zhang, Li / Cavazza, Marc:
"Physically embodied conversational agents as health and fitness companions",
2466-2469.
Metze, Florian / Englert, Roman / Bub, Udo / Kliche, Ingmar / Scheerbarth, Thomas:
"User perception of multi-modal interfaces for mobile applications",
2470-2473.
Nakano, Teppei / Kumai, Tomoyuki / Kobayashi, Tetsunori / Ishikawa, Yasushi:
"Design and formulation for speech interface based on flexible shortcuts",
2474-2477.
Yin, Bo / Ruiz, Natalie / Chen, Fang / Ambikairajah, Eliathamby:
"Exploring classification techniques in speech based cognitive load monitoring",
2478-2481.
Okamoto, Masayuki / Iketani, Naoki / Nishimura, Keisuke / Kikuchi, Masaaki / Cho, Kenta / Hattori, Masanori / Tsuboi, Sougo:
"Finding two-level interpersonal context: proximity and conversation detection from personal audio feature data",
2482-2485.
Gandhe, Sudeep / DeVault, David / Roque, Antonio / Martinovski, Bilyana / Artstein, Ron / Leuski, Anton / Gerten, Jillian / Traum, David:
"From domain specification to virtual humans: an integrated approach to authoring tactical questioning characters",
2486-2489.
Rozak, Mike:
"Designing a massively multiplayer online role-playing game around text-to-speech",
2490-2493.
Speech, Music, Audio Segmentation and Classification
Gao, Jie / Zhang, Xiang / Zhao, Qingwei / Yan, Yonghong:
"Robust speaker change detection using Kernel-Gaussian model",
2494-2497.
Ntalampiras, Stavros / Fakotakis, Nikos:
"A comparative study in automatic recognition of broadcast audio",
2498-2501.
Tantibundhit, Charturong / Kubin, Gernot:
"Joint time-frequency segmentation for transient decomposition",
2502-2505.
Mitra, Vikramjit / Garcia-Romero, Daniel / Espy-Wilson, Carol Y.:
"Language and genre detection in audio content analysis",
2506-2509.
Zhang, Chi / Hansen, John H. L.:
"An entropy based feature for whisper-island detection within audio streams",
2510-2513.
Grašič, Matej / Kos, Marko / Žgank, Andrej / Kačič, Zdravko:
"Two step speaker segmentation method using Bayesian information criterion and adapted Gaussian mixtures models",
2514-2517.
Germesin, Sebastian / Becker, Tilman / Poller, Peter:
"Domain-specific classification methods for disfluency detection",
2518-2521.
Nwe, Tin Lay / Dong, Minghui / Khine, Swe Zin Kalayar / Li, Haizhou:
"Multi-speaker meeting audio segmentation",
2522-2525.
Maddage, Namunu C. / Li, Haizhou:
"Rhythm based music segmentation and octave scale cepstral features for sung language recognition",
2526-2529.
Molla, Md. Khademul Islam / Hirose, Keikichi / Minematsu, Nobuaki:
"Robust voiced/unvoiced speech classification using empirical mode decomposition and periodic correlation model",
2530-2533.
Wu, Qiong / Yan, Qin / Wang, Jun / Hong, Jun:
"A combination of data mining method with decision trees building for speech/music discrimination",
2534-2537.
Gupta, Vishwa / Boulianne, Gilles / Kenny, Patrick / Dumouchel, Pierre:
"Advertisement detection in French broadcast news using acoustic repetition and Gaussian mixture models",
2538-2541.
Hazen, Timothy J. / Richardson, Fred:
"A hybrid SVM/MCE training approach for vector space topic identification of spoken audio recordings",
2542-2545.
Trancoso, Isabel / Portelo, Jose / Bugalho, Miguel / Neto, João / Serralheiro, Antonio:
"Training audio events detectors with a sound effects corpus",
2546-2549.
Automatic Speech Recognition: New Paradigms
Vipperla, Ravichander / Renals, Steve / Frankel, Joe:
"Longitudinal study of ASR performance on ageing voices",
2550-2553.
Van hamme, Hugo:
"HAC-models: a novel approach to continuous speech recognition",
2554-2557.
Mohapatra, Prateeti / Fosler-Lussier, Eric:
"Investigations into phonological attribute classifier representations for CRF phone recognition",
2558-2561.
Subramanya, Amarnag / Bilmes, Jeff A.:
"Applications of virtual-evidence based speech recognizer training",
2562-2565.
Doremalen, Joost van / Boves, Lou:
"Spoken digit recognition using a hierarchical temporal memory",
2566-2569.
Bosch, Louis ten / Van hamme, Hugo / Boves, Lou:
"A computational model of language acquisition: focus on word discovery",
2570-2573.
Speech and Acoustic Activity Detection
Kaushik, Lakshmish / O'Shaughnessy, Douglas:
"Voice activity detection using modified Wigner-ville distribution",
2574-2577.
Chaitanya, Krishna / Sinha, Rohit:
"Energy and entropy based switching algorithm for speech endpoint detection in varying SNR conditions",
2578-2581.
Anemüller, Jörn / Schmidt, Denny / Bach, Jörg-Hendrik:
"Detection of speech embedded in real acoustic background based on amplitude modulation spectrogram features",
2582-2585.
Pham, Tuan Van / Stadtschnitzer, Michael / Pernkopf, Franz / Kubin, Gernot:
"Voice activity detection algorithms using subband power distance feature for noisy environments",
2586-2589.
Müller, Christian / Biel, Joan-Isaac / Kim, Edward / Rosario, Daniel:
"Speech-overlapped acoustic event detection for automotive applications",
2590-2593.
Temko, Andrey / Nadeu, Climent:
"Detection of acoustic events in interactive seminar data with temporal overlaps",
2594-2597.
Speech Analysis and Processing
Kim, Chanwoo / Stern, Richard M.:
"Robust signal-to-noise ratio estimation based on waveform amplitude distribution analysis",
2598-2601.
Stark, Anthony P. / Paliwal, Kuldip K.:
"Speech analysis using instantaneous frequency deviation",
2602-2605.
Gläser, Claudius / Heckmann, Martin / Joublin, Frank / Goerick, Christian:
"Auditory-based formant estimation in noise using a probabilistic framework",
2606-2609.
Rama Murty, K. Sri / Khurana, Saurav / Itankar, Yogendra Umesh / Kesheorey, M. R. / Yegnanarayana, B.:
"Efficient representation of throat microphone speech",
2610-2613.
Deshmukh, Om D. / Verma, Ashish:
"Acoustic-phonetic approach for automatic evaluation of spoken grammar",
2614-2617.
Cox, Stephen:
"On estimation of a speaker's confusion matrix from sparse data",
2618-2621.
Special Session: Talking Heads and Pronunciation Training
Hazan, Valerie:
"Talking heads and pronunciation training: a review",
2622.
Massaro, Dominic W. / Bigler, Stephanie / Chen, Trevor / Perlman, Marcus / Ouni, Slim:
"Pronunciation training: the role of eye and ear",
2623-2626.
Wik, Preben / Engwall, Olov:
"Can visualization of internal articulators support speech perception?",
2627-2630.
Engwall, Olov:
"Can audio-visual instructions help learners improve their articulation? - an ultrasound study of short term changes",
2631-2634.
Badin, Pierre / Tarabalka, Yuliya / Elisei, Frédéric / Bailly, Gérard:
"Can you "read tongue movements"?",
2635-2638.
Kröger, Bernd J. / Graf-Borttscheller, Verena / Lowit, Anja:
"Two- and three-dimensional visual articulatory models for pronunciation training and for treatment of speech disorders",
2639-2642.
Fagel, Sascha / Madany, Katja:
"A 3-d virtual head as a tool for speech therapy for children",
2643-2646.
Hofe, Robin / Moore, Roger K.:
"Anton: an animatronic model of a human tongue and vocal tract",
2647-2650.
Arai, Takayuki:
"Physical models of the human vocal tract with gel-type material",
2651-2654.
Huang, Chao / Zhang, Feng / Soong, Frank K. / Chu, Min:
"Mispronunciation detection for Mandarin Chinese",
2655-2658.
Multimodal Speech Processing
Wang, Lijuan / Hu, Tao / Liu, Peng / Soong, Frank K.:
"Efficient handwriting correction of speech recognition errors with template constrained posterior (TCP)",
2659-2662.
Ejarque, Pascual / Hernando, Javier:
"Bi-Gaussian score equalization in an audio-visual SVM-based person verification system",
2663-2666.
Meltzner, Geoffrey S. / Sroka, Jason / Heaton, James T. / Gilmore, L. Donald / Colby, Glen / Roy, Serge / Chen, Nancy / Luca, Carlo J. De:
"Speech recognition for vocalized and subvocal modes of production using surface EMG signals from the neck and face",
2667-2670.
Lewis, Trent W. / Powers, David M. W.:
"Distinctive feature fusion for recognition of australian English consonants",
2671-2674.
Watanabe, Yasushi / Shinoda, Koichi / Furui, Sadaoki:
"Time-lag adaptation for semi-synchronous speech and pen input",
2675-2678.
Lucey, Patrick / Sridharan, Sridha / Dean, David:
"Continuous pose-invariant lipreading",
2679-2682.
Cross-Lingual and Multilingual Automatic Speech Recognition, Speech Translation
Nouza, Jan / Silovsky, Jan / Zdansky, Jindrich / Cerva, Petr / Kroul, Martin / Chaloupka, Josef:
"Czech-to-slovak adapted broadcast news transcription system",
2683-2686.
Lyu, Dau-Cheng / Siniscalchi, Sabato Marco / Kim, Tae-Yoon / Lee, Chin-Hui:
"Continuous phone recognition without target language training data",
2687-2690.
White, Christopher M. / Khudanpur, Sanjeev / Baker, James K.:
"An investigation of acoustic models for multilingual code-switching",
2691-2694.
Tóth, Lászlá / Frankel, Joe / Gosztolya, Gábor / King, Simon:
"Cross-lingual portability of MLP-based tandem features - a case study for English and Hungarian",
2695-2698.
Zhao, Xufang / O'Shaughnessy, Douglas:
"Seed models combination and state level mappings of cross-lingual transfer for rapid HMM development: from English to Mandarin",
2699-2702.
Bouselmi, Ghazi / Fohr, Dominique / Illina, Irina:
"Multi-accent and accent-independent non-native speech recognition",
2703-2706.
Singla, Adish Kumar / Hakkani-Tür, Dilek:
"Cross-lingual sentence extraction for information distillation",
2707-2710.
Scanzio, Stefano / Laface, Pietro / Fissore, Luciano / Gemello, Roberto / Mana, Franco:
"On the use of a multilingual neural network front-end",
2711-2714.
Sim, Khe Chai / Li, Haizhou:
"Context-sensitive probabilistic phone mapping model for cross-lingual speech recognition",
2715-2718.
Liu, Chen / Melnar, Lynette:
"A non-acoustic approach to crosslingual speech recognition performance prediction",
2719-2722.
Sridhar, Vivek Kumar Rangarajan / Bangalore, Srinivas / Narayanan, Shrikanth S.:
"Factored translation models for enriching spoken language translation with prosody",
2723-2726.
Schwenk, Holger / Esteve, Yannick:
"Data selection and smoothing in an open-source system for the 2008 NIST machine translation evaluation",
2727-2730.
Kathol, Andreas / Zheng, Jing:
"Strategies for building a Farsi-English SMT system from limited resources",
2731-2734.
Kolss, Muntsin / Vogel, Stephan / Waibel, Alex:
"Stream decoding for simultaneous spoken language translation",
2735-2738.
Ettelaie, Emil / Georgiou, Panayiotis G. / Narayanan, Shrikanth S.:
"Towards unsupervised training of the classifier-based speech translator",
2739-2742.
Pitrelli, John F. / Lewis, Burn L. / Epstein, Edward A. / Franz, Martin / Kiecza, Daniel / Quinn, Jerome L. / Ramaswamy, Ganesh / Srivastava, Amit / Virga, Paola:
"Aggregating distributed STT, MT, and information extraction engines: the GALE interoperability-demo system",
2743-2746.
Expression, Emotion and Personality Recognition
Kazemzadeh, Abe / Lee, Sungbok / Narayanan, Shrikanth S.:
"An interval type-2 fuzzy logic system to translate between emotion-related vocabularies",
2747-2750.
Huang, Ting / Yang, Yingchun:
"Applying pitch-dependent difference detection and modification to emotional speaker recognition",
2751-2754.
Neiberg, Daniel / Elenius, Kjell:
"Automatic recognition of anger in spontaneous speech",
2755-2758.
Nose, Takashi / Kato, Yoichi / Tachibana, Makoto / Kobayashi, Takao:
"An estimation technique of style expressiveness for emotional speech using model adaptation based on multiple-regression HSMM",
2759-2762.
Ringeval, Fabien / Chetouani, Mohamed:
"A vowel based approach for acted emotion recognition",
2763-2766.
McIntyre, Gordon / Goecke, Roland:
"A composite framework for affective sensing",
2767-2770.
Shaukat, Arslan / Chen, Ke:
"Towards automatic emotional state categorization from speech signals",
2771-2774.
Park, Jeong-Sik / Kim, Ji-Hwan / Yoon, Sang-Min / Oh, Yung-Hwan:
"Speaker-independent emotion recognition based on feature vector classification",
2775-2778.
Human Speech Production and Speech Perception
Bresch, Erik / Riggs, Daylen / Goldstein, Louis M. / Byrd, Dani / Lee, Sungbok / Narayanan, Shrikanth S.:
"An analysis of vocal tract shaping in English sibilant fricatives using real-time magnetic resonance imaging",
2823-2826.
Arai, Takayuki:
"Science workshop with sliding vocal-tract model",
2827-2830.
Bagou, Odile / Frauenfelder, Ulrich H.:
"Segmentation cues in lexical identification and in lexical acquisition: same or different?",
2831-2834.
Kuijpers, Cecile / Bosch, Louis ten:
"Phonological representations in poor readers",
2835-2838.
Buchaillard, Stephanie / Perrier, Pascal / Payan, Yohan:
"To what extent does tagged-MRI technique allow to infer tongue muscles' activation pattern? a modelling study",
2839-2842.
Aboutabit, Noureddine / Beautemps, Denis / Mathieu, Olivier / Besacier, Laurent:
"Feature adaptation of hearing-impaired lip shapes: the vowel case in the cued speech context",
2843-2846.
Veilleux, Nanette / Shattuck-Hufnagel, Stefanie:
"Automatic detection of the context of acoustic landmark deletion",
2847-2850.
Ouni, Slim:
"Aspects of pharyngealized phonemes in Arabic using articulography",
2851.
Beach, Elizabeth / Kitamura, Christine / Dillon, Harvey / Ching, Teresa / Burnham, Denis:
"The effect of spectral tilt on infants' discrimination of fricatives",
2852.
Knoll, Monja / Scharrer, Lisa:
""look at the shark": evaluation of student produced standardized sentences of infant- and foreigner-directed speech",
2853-2856.
Panchapagesan, Sankaran / Alwan, Abeer:
"Vocal tract inversion by cepstral analysis-by-synthesis using chain matrices",
2857-2860.
Alku, Paavo / Magi, Carlo / Bäckström, Tom:
"DC-constrained linear prediction for glottal inverse filtering",
2861-2864.
Alm, Magnus / Behne, Dawn:
"Voicing influences the saliency of place of articulation in audio-visual speech perception in babble",
2865-2868.
Amano, Shigeaki / Hirata, Yukari:
"Correspondence of perception and production boundaries between single and geminate stops in Japanese",
2869-2872.
Yip, Michael C. W.:
"Inhibitory processes of Chinese spoken word recognition",
2873-2876.