Table of Contents and Access to Abstracts
Keynote Papers
All three keynote speakers provided their slides for the Archive. See
the papers' abstracts for access.
Clark, Graeme M.:
"The multiple-channel cochlear implant: interfacing electronic technology to human consciousness",
1-4.
Pereira, Fernando C. N.:
"Linear models for structure prediction",
717-720.
Shriberg, Elizabeth:
"Spontaneous speech: how people really talk and why engineers should care",
1781-1784.
Speech Recognition - Language Modelling I-III
Tam, Yik-Cheung / Schultz, Tanja:
"Dynamic language model adaptation using variational Bayes inference",
5-8.
Seneviratne, Vidura / Young, Steve:
"The hidden vector state language model",
9-12.
Mori, Shinsuke / Kurata, Gakuto:
"Class-based variable memory length Markov model",
13-16.
Gruenstein, Alexander / Wang, Chao / Seneff, Stephanie:
"Context-sensitive statistical language modeling",
17-20.
Wang, Chao / Seneff, Stephanie / Chung, Grace:
"Language model data filtering via user simulation and dialogue resynthesis",
21-24.
Chien, Jen-Tzung / Wu, Meng-Sung / Wu, Chia-Sheng:
"Bayesian learning for latent semantic analysis",
25-28.
Chueh, Chuang-Hua / Chien, To-Chang / Chien, Jen-Tzung:
"Discriminative maximum entropy language model for speech recognition",
721-724.
Bisani, Maximilian / Ney, Hermann:
"Open vocabulary speech recognition with flat hybrid models",
725-728.
Jeong, Minwoo / Eun, Jihyun / Jung, Sangkeun / Lee, Gary Geunbae:
"An error-corrective language-model adaptation for automatic speech recognition",
729-732.
Lin, Shiuan-Sung / Yvon, François:
"Discriminative training of finite state decoding graphs",
733-736.
Schwenk, Holger / Gauvain, Jean-Luc:
"Building continuous space language models for transcribing european languages",
737-740.
Xu, Peng / Mangu, Lidia:
"Using random forest language models in the IBM RT-04 CTS system",
741-744.
Kuo, Jen-Wei / Chen, Berlin:
"Minimum word error based discriminative training of language models",
1277-1280.
Ghaoui, A. / Yvon, François / Mokbel, C. / Chollet, Gérard:
"On the use of morphological constraints in n-gram statistical language model",
1281-1284.
Sicilia-Garcia, Elvira I. / Ming, Ji / Smith, F. Jack:
"A posteriori multiple word-domain language model",
1285-1288.
Dieguez-Tirado, Javier / Mateo, Carmen García / Cardenal-Lopez, Antonio:
"Effective topic-tree based language model adaptation",
1289-1292.
Sethy, Abhinav / Georgiou, Panayiotis G. / Narayanan, Shrikanth:
"Building topic specific language models from webdata using competitive models",
1293-1296.
Troncoso, Carlos / Kawahara, Tatsuya:
"Trigger-based language model adaptation for automatic meeting transcription",
1297-1300.
Duchateau, Jacques / Uytsel, Dong Hoon Van / Hamme, Hugo Van / Wambacq, Patrick:
"Statistical language models for large vocabulary spontaneous speech recognition in dutch",
1301-1304.
Allauzen, Alexandre / Gauvain, Jean-Luc:
"Diachronic vocabulary adaptation for broadcast news transcription",
1305-1308.
Siivola, Vesa / Pellom, Bryan L.:
"Growing an n-gram language model",
1309-1312.
Hüning, Harald / Kirschner, Manuel / Class, Fritz / Berton, Andre / Haiber, Udo:
"Embedding grammars into statistical language models",
1313-1316.
Broman, Simo / Kurimo, Mikko:
"Methods for combining language models in speech recognition",
1317-1320.
Vaiciunas, Airenas / Raskinis, Gailius:
"Review of statistical modeling of highly inflected lithuanian using very large vocabulary",
1321-1324.
Gorrell, Genevieve / Webb, Brandyn:
"Generalized hebbian algorithm for incremental latent semantic analysis",
1325-1328.
Jensson, Arnar Thor / Whittaker, Edward W. D. / Iwano, Koji / Furui, Sadaoki:
"Language model adaptation for resource deficient languages using translated data",
1329-1332.
Witschel, Petra / Astrov, Sergey / Bakenecker, Gabriele / Bauer, Josef G. / Höge, Harald:
"POS-based language models for large vocabulary speech recognition on embedded systems",
1333-1336.
Prosody in Language Performance I, II
Hirst, Daniel / Bouzon, Caroline:
"The effect of stress and boundaries on segmental duration in a corpus of authentic speech (british English)",
29-32.
Ohsuga, Tomoko / Nishida, Masafumi / Horiuchi, Yasuo / Ichikawa, Akira:
"Investigation of the relationship between turn-taking and prosodic features in spontaneous dialogue",
33-36.
Watanabe, Michiko / Hirose, Keikichi / Den, Yasuharu / Minematsu, Nobuaki:
"Filled pauses as cues to the complexity of following phrases",
37-40.
Schneider, Katrin / Möbius, Bernd:
"Perceptual magnet effect in German boundary tones",
41-44.
Grimm, Angela / Trommer, Jochen:
"Constraints on the acquisition of simplex and complex words in German",
45-48.
Meyer, Julien:
"Whistled speech: a natural phonetic description of languages adapted to human perception and to the acoustical environment",
49-52.
Kim, Heejin / Cole, Jennifer:
"The stress foot as a unit of planned timing: evidence from shortening in the prosodic phrase",
2365-2368.
Welby, Pauline / Loevenbruck, Hélène:
"Segmental "anchorage" and the French late rise",
2369-2372.
Chow, Ivan:
"Prosodic cues for syntactically-motivated junctures",
2373-2376.
Falé, Isabel / Hub Faria, Isabel:
"A glimpse of the time-course of intonation processing in European Portuguese",
2377-2380.
Wagner, Petra:
"Great expectations - introspective vs. perceptual prominence ratings and their acoustic correlates",
2381-2384.
Jensen, Christian / Tøndering, John:
"Choosing a scale for measuring perceived prominence",
2385-2388.
Edlund, Jens / House, David / Skantze, Gabriel:
"The effects of prosodic features on the interpretation of clarification ellipses",
2389-2392.
Jilka, Matthias:
"Exploration of different types of intonational deviations in foreign-accented and synthesized speech",
2393-2396.
Bröggelwirth, Jörg:
"A rhythmic-prosodic model of poetic speech",
2397-2400.
Biersack, Sonja / Kempe, Vera / Knapton, Lorna:
"Fine-tuning speech registers: a comparison of the prosodic features of child-directed and foreigner-directed speech",
2401-2404.
Arbisi-Kelm, Timothy:
"An analysis of the intonational structure of stuttered speech",
2405-2408.
Lintfert, Britta / Wokurek, Wolfgang:
"Voice quality dimensions of pitch accents",
2409-2412.
Dohen, Marion / Loevenbruck, Hélène:
"Audiovisual production and perception of contrastive focus in French: a multispeaker study",
2413-2416.
Barkhuysen, Pashiera / Krahmer, Emiel / Swerts, Marc:
"Predicting end of utterance in multimodal and unimodal conditions",
2417-2420.
Tanaka, Saori / Nishida, Masafumi / Horiuchi, Yasuo / Ichikawa, Akira:
"Production of prominence in Japanese sign language",
2421-2424.
Spoken Language Extraction / Retrieval I, II
Siohan, Olivier / Bacchiani, Michiel:
"Fast vocabulary-independent audio search using path-based graph indexing",
53-56.
Makhoul, John / Baron, Alex / Bulyko, Ivan / Nguyen, Long / Ramshaw, Lance / Stallard, David / Schwartz, Richard / Xiang, Bing:
"The effects of speech recognition and punctuation on information extraction performance",
57-60.
Chelba, Ciprian / Acero, Alex:
"Indexing uncertainty for spoken document search",
61-64.
Akiba, Tomoyosi / Abe, Hiroyuki:
"Exploiting passage retrieval for n-best rescoring of spoken questions",
65-68.
Kolluru, BalaKrishna / Christensen, Heidi / Gotoh, Yoshihiko:
"Multi-stage compaction approach to broadcast news summarisation",
69-72.
Huang, Chien-Lin / Hsieh, Chia-Hsin / Wu, Chung-Hsien:
"Audio-video summarization of TV news using speech recognition and shot change detection",
73-76.
Taniguchi, Toru / Adachi, Akishige / Okawa, Shigeki / Honda, Masaaki / Shirai, Katsuhiko:
"Discrimination of speech, musical instruments and singing voices using the temporal patterns of sinusoidal segments in audio signals",
589-592.
Murray, Gabriel / Renals, Steve / Carletta, Jean:
"Extractive summarization of meeting recordings",
593-596.
Hessen, Arjan van / Hinke, Jaap:
"IR-based classification of customer-agent phone calls",
597-600.
Favre, Benoît / Béchet, Frédéric / Nocéra, Pascal:
"Mining broadcast news data: robust information extraction from word lattices",
601-604.
Kurimo, Mikko / Turunen, Ville:
"To recover from speech recognition errors in spoken document retrieval",
605-608.
Gonzàlez, Edgar / Turmo, Jordi:
"Unsupervised clustering of spontaneous speech documents",
609-612.
Yamaguchi, Masahide / Yamashita, Masaru / Matsunaga, Shoichi:
"Spectral cross-correlation features for audio indexing of broadcast news and meetings",
613-616.
Hori, Chiori / Waibel, Alex:
"Spontaneous speech consolidation for spoken language applications",
617-620.
Maskey, Sameer / Hirschberg, Julia:
"Comparing lexical, acoustic/prosodic, structural and discourse features for speech summarization",
621-624.
Li, Te-Hsuan / Lee, Ming-Han / Chen, Berlin / Lee, Lin-Shan:
"Hierarchical topic organization and visual presentation of spoken documents using probabilistic latent semantic analysis (PLSA) for efficient retrieval/browsing applications",
625-628.
Zibert, Janez / Mihelic, France / Martens, Jean-Pierre / Meinedo, Hugo / Neto, Joao / Docio, Laura / Mateo, Carmen Garcia / David, Petr / Zdansky, Jindrich / Pleva, Matus / Cizmar, Anton / Zgank, Andrej / Kacic, Zdravko / Teleki, Csaba / Vicsi, Klara:
"The COST278 broadcast news segmentation and speaker clustering evaluation - overview, methodology, systems, results",
629-632.
Szoke, Igor / Schwarz, Petr / Matejka, Pavel / Burget, Lukas / Karafiat, Martin / Fapso, Michal / Cernocky, Jan:
"Comparison of keyword spotting approaches for informal continuous speech",
633-636.
Misu, Teruhisa / Kawahara, Tatsuya:
"Dialogue strategy to clarify user's queries for document retrieval system with speech interface",
637-640.
Moreau, Nicolas / Jin, Shan / Sikora, Thomas:
"Comparison of different phone-based spoken document retrieval methods with text and spoken queries",
641-644.
The Blizzard Challenge 2005
Black, Alan W. / Tokuda, Keiichi:
"The blizzard challenge - 2005: evaluating corpus-based speech synthesis on common datasets",
77-80.
Sakai, Shinsuke / Shu, Han:
"A probabilistic approach to unit selection for corpus-based speech synthesis",
81-84.
Kominek, John / Bennett, Christina L. / Langner, Brian / Toth, Arthur R.:
"The blizzard challenge 2005 CMU entry - a method for improving speech synthesis systems",
85-88.
Bunnell, H. Timothy / Pennington, Chris / Yarrington, Debra / Gray, John:
"Automatic personal synthetic voice construction",
89-92.
Zen, Heiga / Toda, Tomoki:
"An overview of nitech HMM-based speech synthesis system for blizzard challenge 2005",
93-96.
Hamza, Wael / Bakis, Raimo / Shuang, Zhi Wei / Zen, Heiga:
"On building a concatenative speech synthesis system from the blizzard challenge speech databases",
97-100.
Clark, Robert A. J. / Richmond, Korin / King, Simon:
"Multisyn voices from ARCTIC data for the blizzard challenge",
101-104.
Bennett, Christina L.:
"Large scale evaluation of corpus-based synthesizers: results and lessons from the blizzard challenge 2005",
105-108.
New Applications
Chen, Berlin / Chen, Yi-Ting / Chang, Chih-Hao / Chen, Hung-Bin:
"Speech retrieval of Mandarin broadcast news via mobile devices",
109-112.
Katoh, Michiaki / Yamamoto, Kiyoshi / Ogata, Jun / Yoshimura, Takashi / Asano, Futoshi / Asoh, Hideki / Kitawaki, Nobuhiko:
"State estimation of meetings by information fusion using Bayesian network",
113-116.
Moore, Roger K.:
"Results from a survey of attendees at ASRU 1997 and 2003",
117-120.
Haeb-Umbach, Reinhold / Kladis, Basilis / Schmalenstroeer, Joerg:
"Speech processing in the networked home environment - a view on the amigo project",
121-124.
Sugiyama, Masahide:
"Fixed distortion segmentation in efficient sound segment searching",
125-128.
Nwe, Tin Lay / Li, Haizhou:
"Identifying singers of popular songs",
129-132.
Ogata, Jun / Goto, Masataka:
"Speech repair: quick error correction just by using selection operation for speech input interfaces",
133-136.
Olszewski, Dirk / Prasetyo, Fransiskus / Linhard, Klaus:
"Steerable highly directional audio beam loudspeaker",
137-140.
Ezzaidi, Hassan / Rouat, Jean:
"Automatic music genre classification using second-order statistical measures for the prescriptive approach",
141-144.
Abad, Alberto / Macho, Dusan / Segura, Carlos / Hernando, Javier / Nadeu, Climent:
"Effect of head orientation on the speaker localization performance in smart-room environment",
145-148.
Fredouille, Corinne / Pouchoulin, G. / Bonastre, Jean-François / Azzarello, M. / Giovanni, A. / Ghio, A.:
"Application of automatic speaker recognition techniques to pathological voice assessment (dysphonia)",
149-152.
Chaudhari, Upendra V. / Ramaswamy, Ganesh N. / Epstein, Eddie / Caskey, Sasha P. / Omar, Mohamed Kamal:
"Adaptive speech analytics: system, infrastructure, and behavior",
153-156.
E-learning and Spoken Language Processing
Forbes-Riley, Katherine / Litman, Diane J.:
"Correlating student acoustic-prosodic profiles with student learning in spoken tutoring dialogues",
157-160.
Litman, Diane J. / Forbes-Riley, Katherine:
"Speech recognition performance and learning in spoken dialogue tutoring",
161-164.
Asakawa, Satoshi / Minematsu, Nobuaki / Isei-Jaakkola, Toshiko / Hirose, Keikichi:
"Structural representation of the non-native pronunciations",
165-168.
Chou, Fu-chiang:
"Ya-ya language box - a portable device for English pronunciation training with speech recognition technologies",
169-172.
Ito, Akinori / Lim, Yen-Ling / Suzuki, Motoyuki / Makino, Shozo:
"Pronunciation error detection method based on error rule clustering using a decision tree",
173-176.
Sethy, Abhinav / Narayanan, Shrikanth / Mote, Nicolaus / Johnson, W. Lewis:
"Modeling and automating detection of errors in Arabic language learner speech",
177-180.
Zhang, Felicia / Wagner, Michael:
"Effects of F0 feedback on the learning of Chinese tones by native speakers of English",
181-184.
E-inclusion and Spoken Language Processing I, II
Brøndsted, Tom / Aaskoven, Erik:
"Voice-controlled internet browsing for motor-handicapped users. design and implementation issues",
185-188.
Williams, Briony / Prys, Delyth / Ní Chasaide, Ailbhe:
"Creating an ongoing research capability in speech technology for two minority languages: experiences from the WISPR project",
189-192.
Vovos, A. / Kladis, Basilis / Fakotakis, Nikolaos:
"Speech operated smart-home control system for users with special needs",
193-196.
Jitsuhiro, Takatoshi / Matsuda, Shigeki / Ashikari, Yutaka / Nakamura, Satoshi / Yairi, Ikuko Eguchi / Igi, Seiji:
"Spoken dialog system and its evaluation of geographic information system for elderly persons' mobility support",
197-200.
Falavigna, Daniele / Giorgino, Toni / Gretter, Roberto:
"A frame based spoken dialog system for home care",
201-204.
Hawley, Mark S. / Green, Phil / Enderby, Pam / Cunningham, Stuart / Moore, Roger K.:
"Speech technology for e-inclusion of people with physical disabilities and disordered speech",
445-448.
Granström, Björn:
"Speech technology for language training and e-inclusion",
449-452.
Tucker, Roger / Shalonova, Ksenia:
"Supporting the creation of TTS for local language voice information systems",
453-456.
Andersen, Ove / Hjulmand, Christian:
"Access for all - a talking internet service",
457-460.
Kvale, Knut / Warakagoda, Narada:
"A speech centric mobile multimodal service useful for dyslectics and aphasics",
461-464.
Acoustic Processing for ASR I-III
Wölfel, Matthias:
"Frame based model order selection of spectral envelopes",
205-208.
Tyagi, Vivek / Wellekens, Christian / Bourlard, Hervé:
"On variable-scale piecewise stationary spectral analysis of speech signals for ASR",
209-212.
Faria, Arlo / Gelbart, David:
"Efficient pitch-based estimation of VTLN warp factors",
213-216.
Zheng, Yanli / Sproat, Richard / Gu, Liang / Shafran, Izhak / Zhou, Haolang / Su, Yi / Jurafsky, Daniel / Starr, Rebecca / Yoon, Su-Youn:
"Accent detection and speech recognition for Shanghai-accented Mandarin",
217-220.
Barrault, Loic / Mori, Renato de / Gemello, Roberto / Mana, Franco / Matrouf, Driss:
"Variability of automatic speech recognition systems using different features",
221-224.
Lihan, Slavomir / Juhar, Jozef / Cizmar, Anton:
"Crosslingual and bilingual speech recognition with Slovak and Czech speechdat-e databases",
225-228.
Pelaez-Moreno, Carmen / Zhu, Qifeng / Chen, Barry Y. / Morgan, Nelson:
"Automatic data selection for MLP-based feature extraction for ASR",
229-232.
Kohler, Thilo W. / Fugen, Christian / Stüker, Sebastian / Waibel, Alex:
"Rapid porting of ASR-systems to mobile devices",
233-236.
Meinedo, Hugo / Neto, Joao:
"A stream-based audio segmentation, classification and clustering pre-processing system for broadcast news using ANN models",
237-240.
Marcheret, Etienne / Visweswariah, Karthik / Potamianos, Gerasimos:
"Speech activity detection fusing acoustic phonetic and energy features",
241-244.
Tuske, Zoltan / Mihajlik, Peter / Tobler, Zoltan / Fegyo, Tibor:
"Robust voice activity detection based on the entropy of noise-suppressed spectrum",
245-248.
Murase, Masamitsu / Yamamoto, Shunichi / Valin, Jean-Marc / Nakadai, Kazuhiro / Yamada, Kentaro / Komatani, Kazunori / Ogata, Tetsuya / Okuno, Hiroshi G.:
"Multiple moving speaker tracking by microphone array on mobile robot",
249-252.
Deng, Li / Yu, Dong / Acero, Alex:
"Learning statistically characterized resonance targets in a hidden trajectory model of speech coarticulation and reduction",
1097-1100.
Kocharov, Daniil / Zolnay, András / Schlüter, Ralf / Ney, Hermann:
"Articulatory motivated acoustic features for speech recognition",
1101-1104.
Watanabe, Shinji / Nakamura, Atsushi:
"Effects of Bayesian predictive classification using variational Bayesian posteriors for sparse training data in speech recognition",
1105-1108.
Tsao, Yu / Li, Jinyu / Lee, Chin-Hui:
"A study on separation between acoustic models and its applications",
1109-1112.
Afify, Mohamed:
"Extended baum-welch reestimation of Gaussian mixture models based on reverse Jensen inequality",
1113-1116.
Gunawardana, Asela / Mahajan, Milind / Acero, Alex / Platt, John C.:
"Hidden conditional random fields for phone classification",
1117-1120.
Jonas, Michael / Schmolze, James G.:
"Hierarchical clustering of mixture tying using a partially observable Markov decision process",
2953-2956.
Ouellet, Pierre / Boulianne, Gilles / Kenny, Patrick:
"Flavors of Gaussian warping",
2957-2960.
Keshet, Joseph / Shalev-Shwartz, Shai / Singer, Yoram / Chazan, Dan:
"Phoneme alignment based on discriminative learning",
2961-2964.
Leppänen, Jussi / Kiss, Imre:
"Comparison of low footprint acoustic modeling techniques for embedded ASR systems",
2965-2968.
Suchato, Atiwong / Punyabukkana, Proadpran:
"Factors in classification of stop consonant place of articulation",
2969-2972.
Toth, Arthur R. / Black, Alan W.:
"Cross-speaker articulatory position data for phonetic feature prediction",
2973-2976.
Povey, Daniel:
"Improvements to fMPE for discriminative training of features",
2977-2980.
Lei, Xin / Hwang, Mei-Yuh / Ostendorf, Mari:
"Incorporating tone-related MLP posteriors in the feature representation for Mandarin ASR",
2981-2984.
Han, Yan / Veth, Johan de / Boves, Louis:
"Speech trajectory clustering for improved speech recognition",
2985-2988.
Temko, Andrey / Macho, Dusan / Nadeu, Climent:
"Selection of features and combination of classifiers using a fuzzy approach for acoustic event classification",
2989-2992.
Stadermann, Jan / Koska, Wolfram / Rigoll, Gerhard:
"Multi-task learning strategies for a recurrent neural net in a hybrid tied-posteriors acoustic model",
2993-2996.
Hönig, Florian / Stemmer, Georg / Hacker, Christian / Brugnara, Fabio:
"Revising Perceptual Linear Prediction (PLP)",
2997-3000.
Pinto, Joel / Sitaram, R. N. V.:
"Confidence measures in speech recognition based on probability distribution of likelihoods",
3001-3004.
Diehl, Frank / Moreno, Asuncion / Monte, Enric:
"Continuous local codebook features for multi- and cross-lingual acoustic phonetic modelling",
3005-3008.
Miguel, Antonio / Lleida, Eduardo / Rose, Richard / Buera, Luis / Ortega, Alfonso:
"Augmented state space acoustic decoding for modeling local variability in speech",
3009-3012.
Dimitriadis, Dimitrios / Maragos, Petros / Potamianos, Alexandros:
"Auditory Teager energy cepstrum coefficients for robust speech recognition",
3013-3016.
Hifny, Yasser / Renals, Steve / Lawrence, Neil D.:
"A hybrid Maxent/HMM based ASR system",
3017-3020.
Erdogan, Hakan:
"Regularizing linear discriminant analysis for speech recognition",
3021-3024.
Wang, Yadong / Greenberg, Steven / Swaminathan, Jayaganesh / Kumaresan, Ramdas / Poeppel, David:
"Comprehensive modulation representation for automatic speech recognition",
3025-3028.
Fu, Qiang / Juang, Biing-Hwang:
"Segment-based phonetic class detection using minimum verification error (MVE) training",
3029-3032.
Liu, Yi / Fung, Pascale:
"Acoustic and phonetic confusions in accented speech recognition",
3033-3036.
Munich, Mario E. / Lin, Qiguang:
"Auditory image model features for automatic speech recognition",
3037-3040.
Heracleous, Panikos / Kaino, Tomomi / Saruwatari, Hiroshi / Shikano, Kiyohiro:
"Applications of NAM microphones in speech recognition for privacy in human-machine communication",
3041-3044.
Frankel, Joe / King, Simon:
"A hybrid ANN/DBN approach to articulatory feature recognition",
3045-3048.
Speech Recognition - Adaptation I, II
Zhang, Yaxin / Wu, Bian / Ren, Xiaolin / He, Xin:
"A speaker biased SI recognizer for embedded mobile applications",
253-256.
Bakker, Bart / Meyer, Carsten / Aubert, Xavier:
"Fast unsupervised speaker adaptation through a discriminative eigen-MLLR algorithm",
257-260.
Hu, Rusheng / Xue, Jian / Zhao, Yunxin:
"Incremental largest margin linear regression and MAP adaptation for speech separation in telemedicine applications",
261-264.
Garau, Giulia / Renals, Steve / Hain, Thomas:
"Applying vocal tract length normalization to meeting recordings",
265-268.
Umesh, S. / Zolnay, András / Ney, Hermann:
"Implementing frequency-warping and VTLN through linear transformation of conventional MFCC",
269-272.
Cui, Xiaodong / Alwan, Abeer:
"MLLR-like speaker adaptation based on linearization of VTLN with MFCC features",
273-276.
Raut, Chandra Kant / Nishimoto, Takuya / Sagayama, Shigeki:
"Model adaptation by state splitting of HMM for long reverberation",
277-280.
Liu, Daben / Kiecza, Daniel / Srivastava, Amit / Kubala, Francis:
"Online speaker adaptation and tracking for real-time speech recognition",
281-284.
Nishida, Masafumi / Horiuchi, Yasuo / Ichikawa, Akira:
"Automatic speech recognition based on adaptation and clustering using temporal-difference learning",
285-288.
Ye, Hui / Young, Steve:
"Improving the speech recognition performance of beginners in spoken conversational interaction for language learning",
289-292.
Gomez, Randy / Lee, Akinobu / Saruwatari, Hiroshi / Shikano, Kiyohiro:
"Rapid unsupervised speaker adaptation based on multi-template HMM sufficient statistics in noisy environments",
293-296.
Choi, Dong-jin / Oh, Yung-Hwan:
"Rapid speaker adaptation for continuous speech recognition using merging eigenvoices",
297-300.
Visweswariah, Karthik / Olsen, Peder:
"Feature adaptation using projection of Gaussian posteriors",
1785-1788.
Li, Xiao / Bilmes, Jeff / Malkin, Jonathan:
"Maximum margin learning and adaptation of MLP classifiers",
1789-1792.
Mandal, Arindam / Ostendorf, Mari / Stolcke, Andreas:
"Leveraging speaker-dependent variation of adaptation",
1793-1796.
Hsiao, Roger / Mak, Brian:
"A comparative study of two kernel eigenspace-based speaker adaptation methods on large vocabulary continuous speech recognition",
1797-1800.
Wang, Xuechuan / O'Shaughnessy, Douglas:
"Environmental compensation using ASR model adaptation by a Bayesian parametric representation method",
1801-1804.
Luo, Jun / Ou, Zhijian / Wang, Zuoying:
"Discriminative speaker adaptation with eigenvoices",
1805-1808.
Signal Analysis, Processing and Feature Estimation I-III
Liu, Jian / Zheng, Thomas Fang / Deng, Jing / Wu, Wenhu:
"Real-time pitch tracking based on combined SMDSF",
301-304.
Bánhalmi, András / Kovács, Kornél / Kocsor, András / Tóth, László:
"Fundamental frequency estimation by least-squares harmonic model fitting",
305-308.
Lee, S. W. / Soong, Frank K. / Ching, P. C.:
"Harmonic filtering for joint estimation of pitch and voiced source with single-microphone input",
309-312.
Képesi, Marián / Weruaga, Luis:
"High-resolution noise-robust spectral-based pitch estimation",
313-316.
Hosom, John-Paul:
"F0 estimation for adult and children's speech",
317-320.
Milner, Ben / Shao, Xu / Darch, Jonathan:
"Fundamental frequency and voicing prediction from MFCCs for speech reconstruction from unconstrained speech",
321-324.
Barbot, N. / Boëffard, Olivier / Lolive, D.:
"F0 stylisation with a free-knot b-spline model and simulated-annealing optimization",
325-328.
Drepper, F. R.:
"Voiced excitation as entrained primary response of a reconstructed glottal master oscillator",
329-332.
Vincent, Damien / Rosec, Olivier / Chonavel, Thierry:
"Estimation of LF glottal source parameters based on an ARX model",
333-336.
Alsteris, Leigh D. / Paliwal, Kuldip K.:
"Some experiments on iterative reconstruction of speech from STFT phase and magnitude spectra",
337-340.
Muralishankar, R. / Sangwan, Abhijeet / O'Shaughnessy, Douglas:
"Statistical properties of the warped discrete cosine transform cepstrum compared with MFCC",
341-344.
Ferreira, Aníbal J. S.:
"New signal features for robust identification of isolated vowels",
345-348.
Pincas, Jonathan / Jackson, Philip J. B.:
"Amplitude modulation of frication noise by voicing saturates",
349-352.
Hecht, Ron M. / Tishby, Naftali:
"Extraction of relevant speech features using the information bottleneck method",
353-356.
Firouzmand, Mohammad / Girin, Laurent / Marchand, Sylvain:
"Comparing several models for perceptual long-term modeling of amplitude and phase trajectories of sinusoidal speech",
357-360.
Hermansky, Hynek / Fousek, Petr:
"Multi-resolution RASTA filtering for TANDEM-based ASR",
361-364.
Jeon, Woojay / Juang, Biing-Hwang:
"A category-dependent feature selection method for speech signals",
365-368.
Kristjansson, Trausti / Deligne, Sabine / Olsen, Peder:
"Voicing features for robust speech detection",
369-372.
Gomez, Pedro / Diaz, Francisco / Alvarez, Agustin / Martinez, Rafael / Rodellar, Victoria / Fernandez-Baillo, Roberto / Nieto, Alberto / Fernandez, Francisco J.:
"PCA of perturbation parameters in voice pathology detection",
645-648.
Sarkar, Anindya / Sreenivas, T. V.:
"Dynamic programming based segmentation approach to LSF matrix reconstruction",
649-652.
Nagarajan, T. / O'Shaughnessy, Douglas:
"Explicit segmentation of speech based on frequency-domain AR modeling",
653-656.
Motlícek, Petr / Burget, Lukás / Cernocký, Jan:
"Non-parametric speaker turn segmentation of meeting data",
657-660.
Korhonen, Petri / Laine, Unto K.:
"Unsupervised segmentation of continuous speech using vector autoregressive time-frequency modeling errors",
661-664.
Vijayalakshmi, P. / RamasubbaReddy, M.:
"The analysis on band-limited hypernasal speech using group delay based formant extraction technique",
665-668.
Zdánský, Jindrich / Nouza, Jan:
"Detection of acoustic change-points in audio records via global BIC maximization and dynamic programming",
669-672.
Molla, Md. Khademul Islam / Hirose, Keikichi / Minematsu, Nobuaki:
"Multi-band approach of audio source discrimination with empirical mode decomposition",
673-676.
Tsuzaki, Minoru / Tanaka, Satomi / Kato, Hiroaki / Sagisaka, Yoshinori:
"Application of auditory image model for speech event detection",
677-680.
Arias, José Anibal:
"Unsupervised identification of speech segments using kernel methods for clustering",
681-684.
Evangelopoulos, Georgios / Maragos, Petros:
"Speech event detection using multiband modulation energy",
685-688.
Kominek, John / Black, Alan W.:
"Measuring unsupervised acoustic clustering through phoneme pair merge-and-split tests",
689-692.
Valente, Fabio / Wellekens, Christian:
"Variational Bayesian speaker change detection",
693-696.
Borys, Sarah / Hasegawa-Johnson, Mark:
"Distinctive feature based SVM discriminant features for improvements to phone recognition on telephone band speech",
697-700.
Vijayalakshmi, P. / RamasubbaReddy, M.:
"Detection of hypernasality using statistical pattern classifiers",
701-704.
Weruaga, Luis / Képesi, Marián:
"Self-organizing chirp-sensitive artificial auditory cortical model",
705-708.
Karabetsos, Sotiris / Tsiakoulis, Pirros / Fotinea, Stavroula-Evita / Dologlou, Ioannis:
"On the use of a decimative spectral estimation method based on eigenanalysis and SVD for formant and bandwidth tracking of speech signals",
709-712.
Ivanov, Alexei V. / Parfieniuk, Marek / Petrovsky, Alexander A.:
"Frequency-domain auditory suppression modelling (FASM) - a WDFT-based anthropomorphic noise-robust feature extraction algorithm for speech recognition",
713-716.
Gianfelici, Francesco / Biagetti, Giorgio / Crippa, Paolo / Turchetti, Claudio:
"Asymptotically exact AM-FM decomposition based on iterated hilbert transform",
1121-1124.
Katsamanis, Athanassios / Maragos, Petros:
"Advances in statistical estimation and tracking of AM-FM speech components",
1125-1128.
Darch, Jonathan / Milner, Ben / Vaseghi, Saeed:
"Formant frequency prediction from MFCC vectors in noisy environments",
1129-1132.
Prasanna, S. R. Mahadeva / Yegnanarayana, B.:
"Detection of vowel onset point events using excitation information",
1133-1136.
Cabral, João P. / Oliveira, Luís C.:
"Pitch-synchronous time-scaling for prosodic and voice quality transformations",
1137-1140.
Ohishi, Yasunori / Goto, Masataka / Itou, Katunobu / Takeda, Kazuya:
"Discrimination between singing and speaking voices",
1141-1144.
Robust Speech Recognition I-IV
Pettersen, Svein G. / Johnsen, Magne H. / Myrvoll, Tor A.:
"Joint Bayesian predictive classification and parallel model combination for robust speech recognition",
373-376.
Yared, Glauco F. G. / Violaro, Fábio / Sousa, Lívio C.:
"Gaussian elimination algorithm for HMM complexity reduction in continuous speech recognition systems",
377-380.
Buera, Luis / Lleida, Eduardo / Miguel, Antonio / Ortega, Alfonso:
"Robust speech recognition in cars using phoneme dependent multi-environment linear normalization",
381-384.
Chen, Yi / Lee, Lin-Shan:
"Energy-based frame selection for reliable feature normalization and transformation in robust speech recognition",
385-388.
Nakajima, Yoshitaka / Kashioka, Hideki / Shikano, Kiyohiro / Campbell, Nick:
"Remodeling of the sensor for non-audible murmur (NAM)",
389-392.
Subramanya, Amarnag / Bilmes, Jeff / Chen, Chia-Ping:
"Focused word segmentation for ASR",
393-396.
Haeb-Umbach, Reinhold / Schmalenstroeer, Joerg:
"A comparison of particle filtering variants for speech feature enhancement",
913-916.
Potamitis, Ilyas / Fakotakis, Nikolaos:
"Enhancement of mel log-power spectrum of speech using particle filtering",
917-920.
Shozakai, Makoto / Nagino, Goshu:
"Improving robustness of speech recognition performance to aggregate of noises by two-dimensional visualization",
921-924.
Lim, Woohyung / Kim, Bong Kyoung / Kim, Nam Soo:
"Feature compensation based on switching linear dynamic model and soft decision",
925-928.
Huang, Shilei / Xie, Xiang / Kuang, Jingming:
"Using output probability distribution for improving speech recognition in adverse environment",
929-932.
Choi, Eric H. C.:
"A generalized framework for compensation of mel-filterbank outputs in feature extraction for robust ASR",
933-936.
Tolba, Hesham / Li, Zili / O'Shaughnessy, Douglas:
"Robust automatic speech recognition using a perceptually-based optimal spectral amplitude estimator speech enhancement algorithm in various low-SNR environments",
937-940.
So, Stephen / Paliwal, Kuldip K.:
"Improved noise-robustness in distributed speech recognition via perceptually-weighted vector quantisation of filterbank energies",
941-944.
Nasersharif, Babak / Akbari, Ahmad:
"Sub-band weighted projection measure for robust sub-band speech recognition",
945-948.
Deng, Jianping / Bouchard, Martin / Yeap, Tet Hin:
"Noise compensation using interacting multiple kalman filters",
949-952.
Stouten, Veronique / Hamme, Hugo Van / Wambacq, Patrick:
"Kalman and unscented kalman filter feature enhancement for noise robust ASR",
953-956.
Wan, Chia-yu / Lee, Lin-Shan:
"Histogram-based quantization (HQ) for robust and scalable distributed speech recognition",
957-960.
Chung, Yong-Joo:
"A data-driven approach for the model parameter compensation in noisy speech recognition",
961-964.
Kobashikawa, Satoshi / Takahashi, Satoshi / Yamaguchi, Yoshikazu / Ogawa, Atsunori:
"Rapid response and robust speech recognition by preliminary model adaptation for additive and convolutional noise",
965-968.
Prasad, Saurabh / Zahorian, Stephen A.:
"Nonlinear and linear transformations of speech features to compensate for channel and noise effects",
969-972.
Suzuki, Motoyuki / Kato, Yusuke / Ito, Akinori / Makino, Shozo:
"Construction method of acoustic models dealing with various background noises based on combination of HMMs",
973-976.
Xu, Haitian / Tan, Zheng-Hua / Dalsgaard, Paul / Lindberg, Børge:
"Robust speech recognition based on noise and SNR classification - a multiple-model framework",
977-980.
Song, Hwa Jeon / Kim, Hyung Soon:
"Eigen-environment based noise compensation method for robust speech recognition",
981-984.
Graciarena, Martin / Franco, Horacio / Myers, Greg / Abrash, Victor:
"Robust feature compensation in nonstationary and multiple noise environments",
985-988.
Droppo, Jasha / Acero, Alex:
"Maximum mutual information SPLICE transform for seen and unseen conditions",
989-992.
Krüger, Sven E. / Schafföner, Martin / Katz, Marcel / Andelic, Edin / Wendemuth, Andreas:
"Speech recognition with support vector machines in a hybrid system",
993-996.
Barreaud, Vincent / O'Shaughnessy, Douglas / Dahan, Jean-Guy:
"Experiments on speaker profile portability",
997-1000.
Colibro, Daniele / Fissore, Luciano / Vair, Claudio / Dalmasso, Emanuele / Laface, Pietro:
"A confidence measure invariant to language and grammar",
1001-1004.
Schutte, Ken / Glass, James:
"Robust detection of sonorant landmarks",
1005-1008.
Ma, Ning / Green, Phil:
"Context-dependent word duration modelling for robust speech recognition",
2609-2612.
Epps, Julien / Choi, Eric H. C.:
"An energy search approach to variable frame rate front-end processing for robust ASR",
2613-2616.
Gemello, Roberto / Mana, Franco / Mori, Renato de:
"Non-linear estimation of voice activity to improve automatic recognition of noisy speech",
2617-2620.
Kida, Yusuke / Kawahara, Tatsuya:
"Voice activity detection based on optimally weighted combination of multiple features",
2621-2624.
Ding, Pei:
"Soft decision strategy and adaptive compensation for robust speech recognition against impulsive noise",
2625-2628.
Morales, Nicolás / Torre Toledano, Doroteo / Hansen, John H. L. / Colás, José / Garrido, Javier:
"Statistical class-based MFCC enhancement of filtered and band-limited speech for robust ASR",
2629-2632.
Misra, Hemant / Bourlard, Hervé:
"Spectral entropy feature in full-combination multi-stream for robust ASR",
2633-2636.
Kim, Wooil / Stern, Richard M. / Ko, Hanseok:
"Environment-independent mask estimation for missing-feature reconstruction",
2637-2640.
Coy, André / Barker, Jon:
"Soft harmonic masks for recognising speech in the presence of a competing speaker",
2641-2644.
Szymanski, Lech / Bouchard, Martin:
"Comb filter decomposition for robust ASR",
2645-2648.
Heracleous, Panikos / Kaino, Tomomi / Saruwatari, Hiroshi / Shikano, Kiyohiro:
"Investigating the role of the Lombard reflex in non-audible murmur (NAM) recognition",
2649-2652.
Ruzanski, Evan / Hansen, John H. L. / Finan, Don / Meyerhoff, James / Norris, William / Wollert, Terry:
"Improved "TEO" feature-based automatic stress detection using physiological and acoustic speech sensors",
2653-2656.
Kobayakawa, Takeshi S.:
"Spectral subtraction using elliptic integral for multiplication factor",
2657-2660.
Wang, Longbiao / Kitaoka, Norihide / Nakagawa, Seiichi:
"Robust distant speech recognition based on position dependent CMN using a novel multiple microphone processing technique",
2661-2664.
Tanaka, H. / Fujimura, H. / Miyajima, C. / Nishino, T. / Itou, Katunobu / Takeda, Kazuya:
"Data collection and evaluation of speech recognition for motorbike riders",
2665-2668.
Álvarez, Agustín / Gómez, Pedro / Nieto, V. / Martínez, Rafael / Rodellar, Victoria:
"Application of a first-order differential microphone for efficient voice activity detection in a car platform",
2669-2672.
Setiawan, Panji / Suhadi, Suhadi / Fingscheidt, Tim / Stan, Sorel:
"Robust speech recognition for mobile devices in car noise",
2673-2676.
Mihajlik, Péter / Tobler, Zoltán / Tüske, Zoltán / Gordos, Géza:
"Evaluation and optimization of noise robust front-end technologies for the automatic recognition of Hungarian telephone speech",
2677-2680.
Chen, Gang / O'Shaughnessy, Douglas / Tolba, Hesham:
"A performance investigation of noisy voice recognition over IP telephony networks",
2681-2684.
Ito, Akinori / Kanayama, Takashi / Suzuki, Motoyuki / Makino, Shozo:
"Internal noise suppression for speech recognition by small robots",
2685-2688.
Kraft, Florian / Malkin, Robert / Schaaf, Thomas / Waibel, Alex:
"Temporal ICA for classification of acoustic events i a kitchen environment",
2689-2692.
Krebber, Jan Felix:
""hello - is anybody at home?" - about the minimum word accuracy of a smart home spoken dialogue system",
2693-2696.
Hirsch, H. Gunter / Finster, Harald:
"The simulation of realistic acoustic input scenarios for speech recognition systems",
2697-2700.
Walsh, Michael / O'Hare, Gregory M. P. / Carson-Berndsen, Julie:
"An agent-based framework for speech investigation",
2701-2704.
Liao, H. / Gales, M. J. F.:
"Joint uncertainty decoding for noise robust speech recognition",
3129-3132.
Vanhoucke, Vincent:
"Confidence scoring and rejection using multi-pass speech recognition",
3133-3136.
Lee, Cheng-Lung / Chang, Wen-Whei:
"Memory-enhanced MMSE-based channel error mitigation for distributed speech recognition",
3137-3140.
Fukuda, Takashi / Ghulam, Muhammad / Nitta, Tsuneo:
"Designing multiple distinctive phonetic feature extractors for canonicalization by using clustering technique",
3141-3144.
Kinoshita, Keisuke / Nakatani, Tomohiro / Miyoshi, Masato:
"Efficient blind dereverberation framework for automatic speech recognition",
3145-3148.
Wölfel, Matthias / McDonough, John:
"Combining multi-source far distance speech recognition strategies: beamforming, blind channel and confusion network combination",
3149-3152.
Speech Perception I, II
Alexander, Jennifer A. / Wong, Patrick C. M. / Bradlow, Ann R.:
"Lexical tone perception in musicians and non-musicians",
397-400.
Ma, Joan K.-Y. / Ciocca, Valter / Whitehill, Tara:
"Contextual effect on perception of lexical tones in Cantonese",
401-404.
Mixdorff, Hansjörg / Hu, Yu / Burnham, Denis:
"Visual cues in Mandarin tone perception",
405-408.
Mixdorff, Hansjörg / Hu, Yu:
"Cross-language perception of word stress",
409-412.
Cutler, Anne:
"The lexical statistics of word recognition problems caused by L2 phonetic confusion",
413-416.
Huang, Chun-Fang / Akagi, Masato:
"A multi-layer fuzzy logical model for emotional speech perception",
417-420.
Tran, Do Dat / Castelli, Eric / Serignat, Jean-François / Trinh, Van Loan / Le, Xuan Hung:
"Influence of F0 on Vietnamese syllable perception",
1697-1700.
Schwanhäußer, Barbara / Burnham, Denis:
"Lexical tone and pitch perception in tone and non-tone language speakers",
1701-1704.
Falé, Isabel / Hub Faria, Isabel:
"Intonational contrasts in EP: a categorical perception approach",
1705-1708.
Braun, Bettina / Weber, Andrea / Crocker, Matthew:
"Does narrow focus activate alternative referents?",
1709-1712.
Aikawa, Kiyoaki / Hashimoto, Hayato:
"Audiovisual interaction on the perception of frequency glide of linear sweep tones",
1713-1716.
Omata, Kei / Mogi, Ken:
"Audiovisual integration in dichotic listening",
1717-1720.
Svanfeldt, Gunilla / Olszewski, Dirk:
"Perception experiment combining a parametric loudspeaker and a synthetic talking head",
1721-1724.
Mayo, Catherine / Clark, Robert A. J. / King, Simon:
"Multidimensional scaling of listener responses to synthetic speech",
1725-1728.
Terasawa, Hiroko / Slaney, Malcolm / Berger, Jonathan:
"A timbre space for speech",
1729-1732.
Kacha, A. / Grenez, Francis / Schoentgen, Jean:
"Voice quality assessment by means of comparative judgments of speech tokens",
1733-1736.
Irino, Toshio / Satou, Satoru / Nomura, Shunsuke / Banno, Hideki / Kawahara, Hideki:
"Speech intelligibility derived from time-frequency and source smearing",
1737-1740.
Hayashi, Nahoko / Arai, Takayuki / Hodoshima, Nao / Miyauchi, Yusuke / Kurisu, Kiyohiro:
"Steady-state pre-processing for improving speech intelligibility in reverberant environments: evaluation in a hall with an electrical reverberator",
1741-1744.
Wong, Patrick C.M. / Lee, Kiara M. / Parrish, Todd B.:
"Neural bases of listening to speech in noise",
1745-1748.
Jongmans, P. / Hilgers, F. J. M. / Pols, Louis C. W. / As-Brooks, C. J. van:
"The intelligibility of tracheoesophageal speech: first results",
1749-1752.
Brown, Guy J. / Palomäki, Kalle J.:
"A computational model of the speech reception threshold for laterally separated speech and noise",
1753-1756.
Janse, Esther:
"Lexical inhibition effects in time-compressed speech",
1757-1760.
Jacquier, Caroline / Meunier, Fanny:
"Perception of time-compressed rapid acoustic cues in French CV syllables",
1761-1764.
Grataloup, C. / Hoen, M. / Pellegrino, F. / Veuillet, E. / Collet, L. / Meunier, Fanny:
"Reversed speech comprehension depends on the auditory efferent system functionality",
1765-1768.
Tokuma, Won / Tokuma, Shinichi:
"Perceptual space of English fricatives for Japanese learners",
1769-1772.
Vasilescu, Ioana / Candea, Maria / Adda-Decker, Martine:
"Perceptual salience of language-specific acoustic differences in autonomous fillers across eight languages",
1773-1776.
Pell, Marc D.:
"Effects of cortical and subcortical brain damage on the processing of emotional prosody",
1777-1780.
Spoken Language Understanding I, II
Lane, Ian R. / Kawahara, Tatsuya:
"Utterance verification incorporating in-domain confidence and discourse coherence measures",
421-424.
Boulis, Constantinos / Ostendorf, Mari:
"Using symbolic prominence to help design feature subsets for topic classification and clustering of natural human-human conversations",
425-428.
Sudoh, Katsuhito / Tsukada, Hajime:
"Tightly integrated spoken language understanding using word-to-concept translation",
429-432.
Sarikaya, Ruhi / Kuo, Hong-Kwang Jeff / Goel, Vaibhava / Gao, Yuqing:
"Exploiting unlabeled data using multiple classifiers for improved natural language call-routing",
433-436.
Kuo, Hong-Kwang Jeff / Goel, Vaibhava:
"Active learning with minimum expected error for spoken language understanding",
437-440.
Thomae, Matthias / Fabian, Tibor / Lieb, Robert / Ruske, Günther:
"Lexical out-of-vocabulary models for one-stage speech interpretation",
441-444.
Thomae, Matthias / Fabian, Tibor / Lieb, Robert / Ruske, Günther:
"Hierarchical language models for one-stage speech interpretation",
3425-3428.
Wang, Nick J. C.:
"Spoken language understanding using layered n-gram modeling",
3429-3432.
Surdeanu, Mihai / Turmo, Jordi / Comelles, Eli:
"Named entity recognition from spontaneous open-domain speech",
3433-3436.
Zitouni, Imed / Jiang, Hui / Zhou, Qiru:
"Discriminative training and support vector machine for natural language call routing",
3437-3440.
Eun, Jihyun / Jeong, Minwoo / Lee, Gary Geunbae:
"A multiple classifier-based concept-spotting approach for robust spoken language understanding",
3441-3444.
Lieb, Robert / Thomae, Matthias / Ruske, Günther / Bobbert, Daniel / Althoff, Frank:
"A flexible and integrated interface between speech recognition, speech interpretation and dialog management",
3445-3448.
Ohno, Tomohiro / Matsubara, Shigeki / Kashioka, Hideki / Kato, Naoto / Inagaki, Yasuyoshi:
"Incremental dependency parsing of Japanese spoken monologue based on clause boundaries",
3449-3452.
Sako, Atsushi / Takiguchi, Tetsuya / Ariki, Yasuo:
"Situation based speech recognition for structuring baseball live games",
3453-3456.
Bonneau-Maynard, H. / Rosset, Sophie / Ayache, C. / Kuhn, A. / Mostefa, Djamel:
"Semantic annotation of the French media dialog corpus",
3457-3460.
Engel, Ralf:
"Robust and efficient semantic parsing of free word order languages in spoken dialogue systems",
3461-3464.
Kobus, Catherine / Damnati, Géraldine / Delphin-Poulat, Lionel / Mori, Renato de:
"Conceptual language model design for spoken language understanding",
3465-3468.
Seabra Lopes, Luís / Teixeira, António J. S. / Quinderé, Marcelo / Rodrigues, Mário:
"From robust spoken language understanding to knowledge acquisition and management",
3469-3472.
Wu, Cheng / Li, Xiang / Kuo, Hong-Kwang Jeff / Jan, E. E. / Goel, Vaibhava / Lubensky, David:
"Improving end-to-end performance of call classification through data confusion reduction and model tolerance enhancement",
3473-3476.
Paralinguistic and Nonlinguistic Information in Speech
Campbell, Nick / Kashioka, Hideki / Ohara, Ryo:
"No laughing matter",
465-468.
Blouin, C. / Maffiolo, V.:
"A study on the automatic detection and characterization of emotion in a voice service context",
469-472.
Fernandez, Raul / Picard, Rosalind W.:
"Classical and novel discriminant features for affect recognition from speech",
473-476.
Cichosz, Jaroslaw / Slot, Krzysztof:
"Low-dimensional feature space derivation for emotion recognition",
477-480.
Ishi, Carlos Toshinori / Ishiguro, Hiroshi / Hagita, Norihiro:
"Proposal of acoustic measures for automatic detection of vocal fry",
481-484.
Truong, Khiet P. / Leeuwen, David A. van:
"Automatic detection of laughter",
485-488.
Batliner, Anton / Steidl, Stefan / Hacker, Christian / Nöth, Elmar / Niemann, Heinrich:
"Tales of tuning - prototyping for automatic classification of emotional user states",
489-492.
Luengo, Iker / Navas, Eva / Hernáez, Inmaculada / Sánchez, Jon:
"Automatic emotion recognition using prosodic parameters",
493-496.
Lee, Sungbok / Yildirim, Serdar / Kazemzadeh, Abe / Narayanan, Shrikanth:
"An articulatory study of emotional speech production",
497-500.
Hofer, Gregor O. / Richmond, Korin / Clark, Robert A. J.:
"Informed blending of databases for emotional speech synthesis",
501-504.
Tesser, Fabio / Cosi, Piero / Drioli, Carlo / Tisato, Graziano:
"Emotional FESTIVAL-MBROLA TTS synthesis",
505-508.
Burkhardt, Felix:
"Emofilt: the simulation of emotional speech by prosody-transformation",
509-512.
Rosenberg, Andrew / Hirschberg, Julia:
"Acoustic/prosodic and lexical correlates of charismatic speech",
513-516.
Greenberg, Yoko / Tsuzaki, Minoru / Kato, Hiroaki / Sagisaka, Yoshinori:
"Communicative speech synthesis using constituent word attributes",
517-520.
Braun, Angelika / Katerbow, Matthias:
"Emotions in dubbed speech: an intercultural approach with respect to F0",
521-524.
Audibert, Nicolas / Aubergé, Véronique / Rilliard, Albert:
"The prosodic dimensions of emotion in speech: the relative weights of parameters",
525-528.
Schötz, Susanne:
"Stimulus duration and type in perception of female and male speaker age",
529-532.
Alm, Cecilia Ovesdotter / Sproat, Richard:
"Perceptions of emotions in expressive storytelling",
533-536.
Kawahara, Hideki / Cheveigné, Alain de / Banno, Hideki / Takahashi, Toru / Irino, Toshio:
"Nearly defect-free F0 trajectory extraction for expressive speech modifications based on STRAIGHT",
537-540.
Yonezawa, Tomoko / Suzuki, Noriko / Mase, Kenji / Kogure, Kiyoshi:
"Gradually changing expression of singing voice based on morphing",
541-544.
Issues in Large Vocabulary Decoding
Hetherington, I. Lee:
"A multi-pass, dynamic-vocabulary approach to real-time, large-vocabulary speech recognition",
545-548.
Saon, George / Povey, Daniel / Zweig, Geoffrey:
"Anatomy of an extremely fast LVCSR decoder",
549-552.
Yu, Dong / Deng, Li / Acero, Alex:
"Evaluation of a long-contextual-Span hidden trajectory model and phonetic recognizer using a* lattice search",
553-556.
Hori, Takaaki / Nakamura, Atsushi:
"Generalized fast on-the-fly composition algorithm for WFST-based speech recognition",
557-560.
Nanjo, Hiroaki / Misu, Teruhisa / Kawahara, Tatsuya:
"Minimum Bayes-risk decoding considering word significance for information retrieval system",
561-564.
Chan, Arthur / Ravishankar, Mosur / Rudnicky, Alexander I.:
"On improvements to CI-based GMM selection",
565-568.
Massonie, Dominique / Nocera, Pascal / Linares, Georges:
"Scalable language model look-ahead for LVCSR",
569-572.
Novak, Miroslav:
"Memory efficient approximative lattice generation for grammar based decoding",
573-576.
Ahn, Dong-Hoon / Oh, Su-Byeong / Chung, Minhwa:
"Improved semi-dynamic network decoding using WFSTs",
577-580.
Pylkkönen, Janne:
"New pruning criteria for efficient decoding",
581-584.
Fabian, Tibor / Lieb, Robert / Ruske, Günther / Thomae, Matthias:
"A confidence-guided dynamic pruning approach - utilization of confidence measurement in speech recognition",
585-588.
Spoken Language Acquisition, Development and Learning I, II
Heeren, Willemijn:
"Perceptual development of the duration cue in dutch /a-a:/",
745-748.
You, Hong / Alwan, Abeer / Kazemzadeh, Abe / Narayanan, Shrikanth:
"Pronunciation variations of Spanish-accented English spoken by young children",
749-752.
Heeren, Willemijn:
"L2 development of quantity perception: dutch listeners learning Finnish /t-t:/",
753-756.
Zmarich, Claudio / Bonifacio, Serena:
"Phonetic inventories in Italian children aged 18-27 months: a longitudinal study",
757-760.
Hirano, Hiroko / Kawai, Goh:
"Pitch patterns of intonational phrases and intonational phrase groups in native and non-native speech",
761-764.
Hincks, Rebecca:
"Measuring liveliness in presentation speech",
765-768.
Amano, Shigeaki:
"Developmental change of phoneme duration in a Japanese infant and mother",
2217-2220.
Jia, Haiping / Mori, Hiroki / Kasuya, Hideki:
"Mora timing organization in producing contrastive geminate/single consonants and long/short vowels by native and non-native speakers of Japanese: effects of speaking rate",
2221-2224.
Wang, Hongyan / Heuven, Vincent J. van:
"Mutual intelligibility of american, Chinese and dutch-accented speakers of English",
2225-2228.
Henrichsen, Peter Juel:
"Deriving a bi-lingual dictionary from raw transcription data",
2229-2232.
Ohta, Kei / Nakagawa, Seiichi:
"A statistical method of evaluating pronunciation proficiency for Japanese words",
2233-2236.
Multi-modal / Multi-media Processing I, II
Campbell, Nick:
"Non-verbal speech processing for a communicative agent",
769-772.
Wrigley, Stuart N. / Brown, Guy J.:
"Physiologically motivated audio-visual localisation and tracking",
773-776.
Huang, Jing / Povey, Daniel:
"Discriminatively trained features using fMPE for multi-stream audio-visual speech recognition",
777-780.
Tisato, Graziano / Cosi, Piero / Drioli, Carlo / Tesser, Fabio:
"INTERFACE: a new tool for building emotive/expressive talking heads",
781-784.
Ejarque, P. / Hernando, Javier:
"Variance reduction by using separate genuine- impostor statistics in multimodal biometrics",
785-788.
Schubert, Volker / Hamerich, Stefan W.:
"The dialog application metalanguage GDialogXML",
789-792.
Kumaran, Raghunandan S. / Narayanan, Karthik / Gowdy, John N.:
"Myoelectric signals for multimodal speech recognition",
1189-1192.
Daubias, Philippe:
"Is color information really useful for lip-reading ? (or what is lost when color is not used)",
1193-1196.
Shdaifat, I. / Grigat, R.-R.:
"A system for audio-visual speech recognition",
1197-1200.
Kitaoka, Norihide / Oshikawa, Hironori / Nakagawa, Seiichi:
"Multimodal interface for organization name input based on combination of isolated word recognition and continuous base-word recognition",
1201-1204.
Matsusaka, Yosuke:
"Recognition of (3) party conversation using prosody and gaze",
1205-1208.
Li, Dongdong / Yang, Yingchun / Wu, Zhaohui:
"Combining voiceprint and face biometrics for speaker identification using SDWS",
1209-1212.
Cooke, Neil / Russell, Martin:
"Using the focus of visual attention to improve spontaneous speech recognition",
1213-1216.
Gurbuz, Sabri:
"Real-time outer lip contour tracking for HCI applications",
1217-1220.
Huang, Jing / Visweswariah, Karthik:
"Improving lip-reading with feature space transforms for multi-stream audio-visual speech recognition",
1221-1224.
Mixdorff, Hansjörg / Burnham, Denis / Vignali, Guillaume / Charnvivit, Patavee:
"Are there facial correlates of Thai syllabic tones?",
1225-1228.
Seymour, Rowan / Ming, Ji / Stewart, Darryl:
"A new posterior based audio-visual integration method for robust speech recognition",
1229-1232.
Beskow, Jonas / Nordenberg, Mikael:
"Data-driven synthesis of expressive visual speech using an MPEG-4 talking head",
793-796.
Turk, Oytun / Schröder, Marc / Bozkurt, Baris / Arslan, Levent M.:
"Voice quality interpolation for emotional text-to-speech synthesis",
797-800.
Bulut, Murtaza / Busso, Carlos / Yildirim, Serdar / Kazemzadeh, Abe / Lee, Chul Min / Lee, Sungbok / Narayanan, Shrikanth:
"Investigating the role of phoneme-level modifications in emotional speech resynthesis",
801-804.
Schuller, Björn / Müller, Ronald / Lang, Manfred / Rigoll, Gerhard:
"Speaker independent emotion recognition by early fusion of acoustic and linguistic features within ensembles",
805-808.
Kim, Jonghwa / André, Elisabeth / Rehm, Matthias / Vogt, Thurid / Wagner, Johannes:
"Integrating information from speech and physiological signals to achieve emotional sensitivity",
809-812.
Douglas-Cowie, Ellen / Devillers, Laurence / Martin, Jean-Claude / Cowie, Roddy / Savvidou, Suzie / Abrilian, Sarkis / Cox, Cate:
"Multimodal databases of everyday emotion: facing up to complexity",
813-816.
Spoken / Multi-modal Dialogue Systems I, II
Torres, Francisco / Sanchis, Emilio / Segarra, Encarna:
"Learning of stochastic dialog models through a dialog simulation technique",
817-820.
Black, Lesley-Ann / McTear, Michael / Black, Norman / Harper, Roy / Lemon, Michelle:
"Evaluating the DI@l-log system on a cohort of elderly, diabetic patients: results from a preliminary study",
821-824.
Král, Pavel / Cerisara, Christophe / Klecková, Jana:
"Combination of classifiers for automatic recognition of dialog acts",
825-828.
Wu, Xiaojun / Zheng, Thomas Fang / Brasser, Michael / Song, Zhanjiang:
"Rapidly developing spoken Chinese dialogue systems with the d-ear SDS SDK",
829-832.
Oria, Daniela / Vetek, Akos:
"Robust algorithms and interaction strategies for voice spelling",
833-836.
Toptsis, Ioannis / Haasch, Axel / Hüwel, Sonja / Fritsch, Jannik / Fink, Gernot A.:
"Modality integration and dialog management for a robotic assistant",
837-840.
Reithinger, Norbert / Sonntag, Daniel:
"An integration framework for a mobile multimodal dialogue system accessing the semantic web",
841-844.
Nisimura, Ryuichi / Lee, Akinobu / Yamada, Masashi / Shikano, Kiyohiro:
"Operating a public spoken guidance system in real environment",
845-848.
Salonen, Esa-Pekka / Turunen, Markku / Hakulinen, Jaakko / Helin, Leena / Prusi, Perttu / Kainulainen, Anssi:
"Distributed dialogue management for smart terminal devices",
849-852.
Hakulinen, Jaakko / Turunen, Markku / Salonen, Esa-Pekka:
"Visualization of spoken dialogue systems for demonstration, debugging and tutoring",
853-856.
González-Ferreras, César / Cardeñoso-Payo, Valentín:
"Development and evaluation of a spoken dialog system to access a newspaper web site",
857-860.
Pietquin, Olivier / Beaufort, Richard:
"Comparing ASR modeling methods for spoken dialogue simulation and optimal strategy learning",
861-864.
Chu, Shiu-Wah / O'Neill, Ian / Hanna, Philip / McTear, Michael:
"An approach to multi-strategy dialogue management",
865-868.
Hjalmarsson, Anna:
"Towards user modelling in conversational dialogue systems: a qualitative study of the dynamics of dialogue parameters",
869-872.
Katsurada, Kouichi / Aoki, Kazumine / Yamada, Hirobumi / Nitta, Tsuneo:
"Reducing the description amount in authoring MMI applications",
873-876.
Komatani, Kazunori / Kanda, Naoyuki / Ogata, Tetsuya / Okuno, Hiroshi G.:
"Contextual constraints based on dialogue models in database search task for spoken dialogue systems",
877-880.
Rotaru, Mihai / Litman, Diane J.:
"Using word-level pitch features to better predict student emotions during spoken tutoring dialogues",
881-884.
Raux, Antoine / Langner, Brian / Bohus, Dan / Black, Alan W. / Eskenazi, Maxine:
"Let's go public! taking a spoken dialog system to the real world",
885-888.
Fujie, Shinya / Fukushima, Kenta / Kobayashi, Tetsunori:
"Back-channel feedback generation using linguistic and nonlinguistic information and its application to spoken dialogue system",
889-892.
Georgila, Kallirroi / Henderson, James / Lemon, Oliver:
"Learning user simulations for information state update dialogue systems",
893-896.
Martín-Iglesias, Darío / Pereiro-Estevan, Yago / García-Moral, Ana I. / Gallardo-Antolín, Ascensión / Díaz-de-María, Fernando:
"Design of a voice-enabled interface for real-time access to stock exchange from a PDA through GPRS",
897-900.
Schuler, William / Miller, Tim:
"Integrating denotational meaning into a DBN language model",
901-904.
Bosch, Louis ten:
"Improving out-of-coverage language modelling in a multimodal dialogue system using small training sets",
905-908.
Galibert, Olivier / Illouz, Gabriel / Rosset, Sophie:
"Ritel: an open-domain, human-computer dialog system",
909-912.
Bernsen, Niels Ole / Dybkjaer, Laila:
"User evaluation of conversational agent h. c. Andersen",
2473-2476.
Goronzy, Silke / Beringer, Nicole:
"Integrated development and on-the-fly simulation of multimodal dialogs",
2477-2480.
Rotaru, Mihai / Litman, Diane J. / Forbes-Riley, Katherine:
"Interactions between speech recognition problems and user emotions",
2481-2484.
Feng, Junlan / Reddy, Srihari / Saraçlar, Murat:
"Webtalk: mining websites for interactively answering questions",
2485-2488.
Möller, Sebastian:
"Towards generic quality prediction models for spoken dialogue systems - a case study",
2489-2492.
Parthasarathy, S. / Allauzen, Cyril / Munkong, R.:
"Robust access to large structured data using voice form-filling",
2493-2496.
Speech Production I
Rochet-Capellan, Amélie / Schwartz, Jean-Luc:
"The labial-coronal effect and CVCV stability during reiterant speech production: an acoustic analysis",
1009-1012.
Rochet-Capellan, Amélie / Schwartz, Jean-Luc:
"The labial-coronal effect and CVCV stability during reiterant speech production: an articulatory analysis",
1013-1016.
Nakamura, Mitsuhiro:
"Articulatory constraints and coronal stops: an EPG study",
1017-1020.
Robert, Vincent / Wrobel-Dautcourt, Brigitte / Laprie, Yves / Bonneau, Anne:
"Strategies of labial coarticulation",
1021-1024.
Dang, Jianwu / Wei, Jianguo / Suzuki, Takeharu / Perrier, Pascal:
"Investigation and modeling of coarticulation during speech",
1025-1028.
Hu, Fang:
"Tongue kinematics in diphthong production in Ningbo Chinese",
1029-1032.
Arai, Takayuki:
"Comparing tongue positions of vowels in oral and nasal contexts",
1033-1036.
Ouni, Slim:
"Can we retrieve vocal tract dynamics that produced speech? toward a speaker articulatory strategy model",
1037-1040.
Perrier, Pascal / Ma, Liang / Payan, Yohan:
"Modeling the production of VCV sequences via the inversion of a biomechanical model of the tongue",
1041-1044.
Niu, Xiaochuan / Kain, Alexander / Santen, Jan P. H. van:
"Estimation of the acoustic properties of the nasal tract during the production of nasalized vowels",
1045-1048.
Ogata, Kohichi:
"A web-based articulatory speech synthesis system for distance education",
1049-1052.
Alku, Paavo / Airas, Matti / Bäckström, Tom / Pulakka, Hannu:
"Group delay function as a means to assess quality of glottal inverse filtering",
1053-1056.
Björkner, Eva / Sundberg, Johan / Alku, Paavo:
"Subglottal pressure and NAQ variation in voice production of classically trained baritone singers",
1057-1060.
Fant, Gunnar / Kruckenberg, Anita:
"Covariation of subglottal pressure, F0 and intensity",
1061-1064.
Pérez, Javier / Bonafonte, Antonio:
"Automatic voice-source parameterization of natural speech",
1065-1068.
Zeroual, Chakir / Esling, John H. / Crevier-Buchman, Lise:
"Physiological study of whispered speech in Moroccan Arabic",
1069-1072.
Moura, C. P. / Andrade, D. / Cunha, L. M. / Cunha, M. J. / Vilarinho, H. / Barros, H. / Freitas, Diamantino / Pais-Clemente, M.:
"Voice quality in down syndrome children treated with rapid maxillary expansion",
1073-1076.
Hanquinet, Julien / Grenez, Francis / Schoentgen, Jean:
"Synthesis of disordered speech",
1077-1080.
Fontecave, Julie / Berthommier, Frédéric:
"Quasi-automatic extraction of tongue movement from a large existing speech cineradiographic database",
1081-1084.
Sapir, Shimon / Mimran, Ravit Cohen:
"The working memory token test (WMTT): preliminary findings in young adults with and without dyslexia",
1085-1088.
Paulo, Sérgio / Oliveira, Luís C.:
"Reducing the corpus-based TTS signal degradation due to speaker's word pronunciations",
1089-1092.
Lee, Wai-Sum:
"A phonetic study of the "er-hua" rimes in Beijing Mandarin",
1093-1096.
Airas, Matti / Pulakka, Hannu / Bäckström, Tom / Alku, Paavo:
"A toolkit for voice inverse filtering and parametrisation",
2145-2148.
Sciamarella, Denisse / d'Alessandro, Christophe:
"Stylization of glottal-flow spectra produced by a mechanical vocal-fold model",
2149-2152.
Nomura, Hideyuki / Funada, Tetsuo:
"Numerical glottal sound source model as coupled problem between vocal cord vibration and glottal flow",
2153-2156.
Pouplier, Marianne / Stone, Maureen:
"A tagged-cine MRI investigation of German vowels",
2157-2160.
Serrurier, Antoine / Badin, Pierre:
"A three-dimensional linear articulatory model of velum based on MRI data",
2161-2164.
Cros, Anne / Demolin, Didier / Flesia, Ana Georgina / Galves, Antonio:
"On the relationship between intra-oral pressure and speech sonority",
2165-2168.
Spoken Language Resources and Technology Evaluation I, II
Jones, Douglas / Shen, Wade / Shriberg, Elizabeth / Stolcke, Andreas / Kamm, Teresa / Reynolds, Douglas:
"Two experiments comparing reading with listening for human processing of conversational telephone speech",
1145-1148.
Galliano, Sylvain / Geoffrois, Edouard / Mostefa, Djamel / Choukri, Khalid / Bonastre, Jean-François / Gravier, Guillaume:
"The ESTER phase II evaluation campaign for the rich transcription of French broadcast news",
1149-1152.
Saito, Takashi:
"A method of multi-layered speech segmentation tailored for speech synthesis",
1153-1156.
Paulo, Sérgio / Oliveira, Luís C.:
"Generation of word alternative pronunciations using weighted finite state transducers",
1157-1160.
Strik, Helmer / Binnenpoorte, Diana / Cucchiarini, Catia:
"Multiword expressions in spontaneous speech: do we really speak like that?",
1161-1164.
Kolár, Jáchym / Svec, Jan / Strassel, Stephanie / Walker, Christopher / Kozlíková, Dagmar / Psutka, Josef:
"Czech spontaneous speech corpus with structural metadata",
1165-1168.
Burkhardt, Felix / Paeschke, A. / Rolfes, M. / Sendlmeier, Walter F. / Weiss, Benjamin:
"A database of German emotional speech",
1517-1520.
Mareuil, Philippe Boula de / d'Alessandro, Christophe / Bailly, Gerard / Bechet, Frederic / Garcia, Marie-Neige / Morel, Michel / Prudon, Romain / Veronis, Jean:
"Evaluating the pronunciation of proper names by four French grapheme-to-phoneme converters",
1521-1524.
Jurcicek, Filip / Zahradil, Jiri / Jelinek, Libor:
"A human-human train timetable dialogue corpus",
1525-1528.
Branco, Gloria / Almeida, Luis / Gomes, Rui / Beires, Nuno:
"A Portuguese spoken and multi-modal dialog corpora",
1529-1532.
Chan, Joyce Y. C. / Ching, P. C. / Lee, Tan:
"Development of a Cantonese-English code-mixing speech corpus",
1533-1536.
Zgank, Andrej / Verdonik, Darinka / Markus, Aleksandra Zögling / Kacic, Zdravko:
"BNSI Slovenian broadcast news database - speech and text corpus",
1537-1540.
Volín, Jan / Skarnitzl, Radek / Pollák, Petr:
"Confronting HMM-based phone labelling with human evaluation of speech production",
1541-1544.
Strassel, Stephanie / Kolár, Jáchym / Song, Zhiyi / Barclay, Leila / Glenn, Meghan:
"Structural metadata annotation: moving beyond English",
1545-1548.
Charlet, Delphine / Krstulovic, Sacha / Bimbot, Frédéric / Boëffard, Olivier / Fohr, Dominique / Mella, Odile / Korkmazsky, Filip / Mostefa, Djamel / Choukri, Khalid / Vallée, Arnaud:
"Neologos: an optimized database for the development of new speech processing algorithms",
1549-1552.
Lin, Cheng-Yuan / Chen, Kuan-Ting / Jang, J.-S. Roger:
"A hybrid approach to automatic segmentation and labeling for Mandarin Chinese speech corpus",
1553-1556.
Chiang, Yuang-Chin / Liang, Min-Siong / Lin, Hong-Yi / Lyu, Ren-Yuan:
"The multiple pronunciations in Taiwanese and the automatic transcription of Buddhist sutra with augmented read speech",
1557-1560.
Davel, Marelie / Barnard, Etienne:
"Bootstrapping pronunciation dictionaries: practical issues",
1561-1564.
Ward, Nigel G. / Rivera, Anais G. / Ward, Karen / Novick, David G.:
"Root causes of lost time and user stress in a simple dialog system",
1565-1568.
Parisi, Julie A. / Brungart, Douglas S.:
"Evaluating communication effectiveness in team collaboration",
1569-1572.
Conejero, David / Lounds, Alan / Mateo, Carmen Garcia / Rodriguez-Linares, Leandro / Mochales, Raquel / Moreno, Asuncion:
"Bilingual aligned corpora for speech to speech translation for Spanish, English and Catalan",
1573-1576.
Boril, Hynek / Pollak, Petr:
"Design and collection of Czech Lombard speech database",
1577-1580.
Kazemzadeh, Abe / You, Hong / Iseli, Markus / Jones, Barbara / Cui, Xiaodong / Heritage, Margaret / Price, Patti / Anderson, Elaine / Narayanan, Shrikanth / Alwan, Abeer:
"TBALL data collection: the making of a young children's speech corpus",
1581-1584.
Tohyama, Hitomi / Matsubara, Shigeki / Kawaguchi, Nobuo / Inagaki, Yasuyoshi:
"Construction and utilization of bilingual speech corpus for simultaneous machine interpretation research",
1585-1588.
Bates, Rebecca / Menning, Patrick / Willingham, Elizabeth / Kuyper, Chad:
"Meeting acts: a labeling system for group interaction in meetings",
1589-1592.
Silaghi, Marius C. / Vargiya, Rachna:
"A new evaluation criteria for keyword spotting techniques and a new algorithm",
1593-1596.
Draxler, Christoph / Steffen, Alexander:
"Phattsessionz: recording 1000 adolescent speakers in schools in Germany",
1597-1600.
Abate, Solomon Teferra / Menzel, Wolfgang / Tafila, Bairu:
"An Amharic speech corpus for large vocabulary continuous speech recognition",
1601-1604.
Dolfing, Hans / Reitter, David / Almeida, Luís / Beires, Nuno / Cody, Michael / Gomes, Rui / Robinson, Kerry / Zielinski, Roman:
"The FASil speech and multimodal corpora",
1605-1608.
Müller, Karin:
"Revealing phonological similarities between German and dutch",
1609-1612.
Early Language Acquisition
Ishizuka, Kentaro / Mugitani, Ryoko / Kato, Hiroko / Amano, Shigeaki:
"A longitudinal analysis of the spectral peaks of vowels for a Japanese infant",
1169-1172.
Zajdó, Krisztina / Stelt, Jeannette M. van der / Wempe, Ton G. / Pols, Louis C. W.:
"Cross-linguistic comparison of two-year-old children's acoustic vowel spaces: contrasting Hungarian with dutch",
1173-1176.
Lintfert, Britta / Schneider, Katrin:
"Acoustic correlates of contrastive stress in German children",
1177-1180.
Salvi, Giampiero:
"Ecological language acquisition via incremental model-based clustering",
1181-1184.
Sudo, Tamami / Mogi, Ken:
"Perceptual and linguistic category formation in infants",
1185-1188.
Bridging the Gap ASR-HSR
Dusan, Sorin / Rabiner, Larry R.:
"On integrating insights from human speech perception into automatic speech recognition",
1233-1236.
Scharenborg, Odette:
"Parallels between HSR and ASR: how ASR can contribute to HSR",
1237-1240.
Bosch, Louis ten / Scharenborg, Odette:
"ASR decoding in a computational model of human word recognition",
1241-1244.
Maier, Viktoria / Moore, Roger K.:
"An investigation into a simulation of episodic memory for automatic speech recognition",
1245-1248.
Fosler-Lussier, Eric / Rytting, C. Anton / Srinivasan, Soundararajan:
"Phonetic ignorance is bliss: investigating the effects of phonetic information reduction on ASR performance",
1249-1252.
Holmberg, Marcus / Gelbart, David / Ramacher, Ulrich / Hemmert, Werner:
"Automatic speech recognition with neural spike trains",
1253-1256.
Carey, Michael J. / Quang, Tuan P.:
"A speech similarity distance weighting for robust recognition",
1257-1260.
Murakami, Takao / Maruyama, Kazutaka / Minematsu, Nobuaki / Hirose, Keikichi:
"Japanese vowel recognition based on structural representation of speech",
1261-1264.
Srinivasan, Soundararajan / Wang, DeLiang:
"Modeling the perception of multitalker speech",
1265-1268.
Harding, Sue / Barker, Jon / Brown, Guy J.:
"Binaural feature selection for missing data speech recognition",
1269-1272.
Wesker, Thorsten / Meyer, Bernd / Wagener, Kirsten / Anemüller, Jörn / Mertins, Alfred / Kollmeier, Birger:
"Oldenburg logatome speech corpus (OLLO) for speech recognition experiments with humans and machines",
1273-1276.
Speech Recognition - Pronunciation Modelling
Jeon, Je Hun / Chung, Minhwa:
"Automatic generation of domain-dependent pronunciation lexicon with data-driven rules and rule adaptation",
1337-1340.
Tjalve, Michael / Huckvale, Mark:
"Pronunciation variation modelling using accent features",
1341-1344.
Truong, Khiet P. / Neri, Ambra / Wet, Febe de / Cucchiarini, Catia / Strik, Helmer:
"Automatic detection of frequent pronunciation errors made by L2-learners",
1345-1348.
Psutka, Josef / Ircing, Pavel / Psutka, J. V. / Hajic, Jan / Byrne, William J. / Mírovský, Jirí:
"Automatic transcription of Czech, Russian, and Slovak spontaneous speech in the MALACH project",
1349-1352.
Dupont, Stéphane / Ris, Christophe / Couvreur, Laurent / , Jean-Marc Boite / Boite, Jean-Marc:
"A study of implicit and explicit modeling of coarticulation and pronunciation variation",
1353-1356.
Takahashi, Shin-ya / Morimoto, Tsuyoshi / Maeda, Sakashi / Tsuruta, Naoyuki:
"Detection of coughs from user utterances using imitated phoneme model",
1357-1360.
Ramasubramanian, V. / Srinivas, P. / Sreenivas, T. V.:
"Stochastic pronunciation modeling by ergodic-HMM of acoustic sub-word units",
1361-1364.
Liu, Chen / Melnar, Lynette:
"An automated linguistic knowledge-based cross-language transfer method for building acoustic models for a language without native training data",
1365-1368.
Bouselmi, Ghazi / Fohr, Dominique / Illina, Irina / Haton, Jean-Paul:
"Fully automated non-native speech recognition using confusion-based acoustic model integration",
1369-1372.
Prosodic Structure
Aubergé, Véronique / Rilliard, Albert:
"The focus prosody: more than a simple binary function",
1373-1376.
Dalton, Martha / Ní Chasaide, Ailbhe:
"Peak timing in two dialects of connaught irish",
1377-1380.
Fletcher, Janet:
"Compound rises and "uptalk" in spoken English",
1381-1384.
Yang, Li-chiung:
"Duration and the temporal structure of Mandarin discourse",
1385-1388.
Wang, Bei:
"Prosodic realization of split noun phrases in Mandarin Chinese compared in topic and focus contexts",
1389-1392.
Xiong, Ziyu:
"Downstep effect on disyllabic words of citation forms in standard Chinese",
1393-1396.
Ni, Jinfu / Kawai, Hisashi / Hirose, Keikichi:
"Estimation of intonation variation with constrained tone transformations",
1397-1400.
Pan, Ho-hsien:
"Voice quality of falling tones in taiwan min",
1401-1404.
Tseng, Chiu-yu / Fu, Bau-Ling:
"Duration, intensity and pause predictions in relation to prosody organization",
1405-1408.
Yuan, Jiahong / Brenier, Jason M. / Jurafsky, Daniel:
"Pitch accent prediction: effects of genre and speaker",
1409-1412.
Fujisaki, Hiroya / Ohno, Sumio:
"Analysis and modeling of fundamental frequency contours of hindi utterances",
1413-1416.
Govender, Natasha / Barnard, Etienne / Davel, Marelie:
"Fundamental frequency and tone in isizulu: initial experiments",
1417-1420.
Bishop, Judith / Peake, Marc / Sityaev, Dmitry:
"Intonational sequences in tuscan Italian",
1421-1424.
Petrone, Caterina:
"Effects of raddoppiamento sintattico on tonal alignment in Italian",
1425-1428.
Dubeda, Tomás / Votrubec, Jan:
"Acoustic analysis of Czech stress: intonation, duration and intensity revisited",
1429-1432.
Yeou, Mohamed:
"Variability of F0 peak alignment in moroccan Arabic accentual focus",
1433-1436.
Lacheret, Anne / Lyche, Ch. / Morel, Michel:
"Phonological analysis of schwa and liaison within the PFC project (phonologie du franais contemporain): how determinant are the prosodic factors?",
1437-1440.
Barbosa, Plínio A. / Arantes, Pablo / Meireles, Alexsandro R. / Vieira, Jussara M.:
"Abstractness in speech-metronome synchronisation: P-centres as cyclic attractors",
1441-1444.
Applications of Confidence Related Measures to ASR
Yamada, Makoto / Kato, Tsuneo / Naito, Masaki / Kawai, Hisashi:
"Improvement of rejection performance of keyword spotting using anti-keywords derived from large vocabulary considering acoustical similarity to keywords",
1445-1448.
Schlüter, Ralf / Scharrenbach, T. / Steinbiss, Volker / Ney, Hermann:
"Bayes risk minimization using metric loss functions",
1449-1452.
Kobayashi, Akio / Onoe, Kazuo / Sato, Shoei / Imai, Toru:
"Word error rate minimization using an integrated confidence measure",
1453-1456.
Dong, Bin / Zhao, Qingwei / Yan, Yonghong:
"Fast confidence measure algorithm for continuous speech recognition",
1457-1460.
Ketabdar, Hamed / Vepa, Jithendra / Bengio, Samy / Bourlard, Hervé:
"Developing and enhancing posterior based speech recognition systems",
1461-1464.
Liu, Peng / Tian, Ye / Zhou, Jian-Lai / Soong, Frank K.:
"Background model based posterior probability for measuring confidence",
1465-1468.
Multilingual TTS
Tomokiyo, Laura Mayfield / Black, Alan W. / Lenzo, Kevin A.:
"Foreign accents in synthetic speech: development and evaluation",
1469-1472.
Fernandez, Raul / Zhang, Wei / Eide, Ellen / Bakis, Raimo / Hamza, Wael / Liu, Yi / Picheny, Michael / Pitrelli, John F. / Qing, Yong / Shuang, Zhi Wei / Shen, Li Qin:
"Toward multiple-language TTS: experiments in English and Mandarin",
1473-1476.
Latorre, Javier / Iwano, Koji / Furui, Sadaoki:
"Cross-language synthesis with a polyglot synthesizer",
1477-1480.
Gakuru, Mucemi / Iraki, Frederick K. / Tucker, Roger / Shalonova, Ksenia / Ngugi, Kamanda:
"Development of a Kiswahili text to speech system",
1481-1484.
Ordinas, J. Botella / Fischer, V. / Waast-Richard, C.:
"Multilingual models in the IBM bilingual text-to-speech systems",
1485-1488.
Janicki, Artur / Herman, Piotr:
"Reconstruction of Polish diacritics in a text-to-speech system",
1489-1492.
Speech Bandwidth Extension
Ehara, Hiroyuki / Morii, Toshiyuki / Oshikiri, Masahiro / Yoshida, Koji / Honma, Kouichi:
"Design of bandwidth scalable LSF quantization using interframe and intraframe prediction",
1493-1496.
Geiser, Bernd / Jax, Peter / Vary, Peter:
"Artificial bandwidth extension of speech supported by watermark-transmitted side information",
1497-1500.
Hu, Rongqiang / Krishnan, Venkatesh / Anderson, David V.:
"Speech bandwidth extension by improved codebook mapping towards increased phonetic classification",
1501-1504.
Bansal, Dhananjay / Raj, Bhiksha / Smaragdis, Paris:
"Bandwidth expansion of narrowband speech using non-negative matrix factorization",
1505-1508.
Seltzer, Michael L. / Acero, Alex / Droppo, Jasha:
"Robust bandwidth extension of noise-corrupted narrowband speech",
1509-1512.
Cabral, Joao P. / Oliveira, Luis C.:
"Pitch-synchronous time-scaling for high-frequency excitation regeneration",
1513-1516.
Large Vocabulary Speech Recognition Systems
Vergyri, Dimitra / Kirchhoff, Katrin / Gadde, R. / Stolcke, Andreas / Zheng, Jing:
"Development of a conversational telephone speech recognizer for Levantine Arabic",
1613-1616.
Ramabhadran, Bhuvana:
"Exploiting large quantities of spontaneous speech for unsupervised training of acoustic models",
1617-1620.
Lin, Che-Kuang / Lee, Lin-Shan:
"Improved spontaneous Mandarin speech recognition by disfluency interruption point (IP) detection using prosodic features",
1621-1624.
Ma, Jeff Z. / Matsoukas, Spyros:
"Improvements to the BBN RT04 Mandarin conversational telephone speech recognition system",
1625-1628.
Sakti, Sakriani / Nakamura, Satoshi / Markov, Konstantin:
"Incorporating a Bayesian wide phonetic context model for acoustic rescoring",
1629-1632.
Messaoudi, Abdel / Lamel, Lori / Gauvain, Jean-Luc:
"Modeling vowels for Arabic BN transcription",
1633-1636.
Afify, Mohamed / Nguyen, Long / Xiang, Bing / Abdou, Sherif / Makhoul, John:
"Recent progress in Arabic broadcast news transcription at BBN",
1637-1640.
Matsoukas, Spyros / Prasad, Rohit / Laxminarayan, Srinivas / Xiang, Bing / Nguyen, Long / Schwartz, Richard:
"The 2004 BBN 1xRT recognition systems for English broadcast news and conversational telephone speech",
1641-1644.
Prasad, Rohit / Matsoukas, Spyros / Kao, C.-L. / Ma, Jeff Z. / Xu, D.-X. / Colthurst, T. / Kimball, O. / Schwartz, Richard / Gauvain, Jean-Luc / Lamel, Lori / Schwenk, Holger / Adda, G. / Lefevre, F.:
"The 2004 BBN/LIMSI 20xRT English conversational telephone speech recognition system",
1645-1648.
Xiang, Bing / Nguyen, Long / Guo, Xuefeng / Xu, Dongxin:
"The BBN Mandarin broadcast news transcription system",
1649-1652.
Deléglise, Paul / Estève, Yannick / Meignier, Sylvain / Merlin, Teva:
"The LIUM speech transcription system: a CMU Sphinx III-based system for French broadcast news",
1653-1656.
Lamel, Lori / Adda, G. / Bilinski, E. / Gauvain, Jean-Luc:
"Transcribing lectures and seminars",
1657-1660.
Hain, Thomas / Dines, John / Garau, Giulia / Karafiát, Martin / Moore, Darren / Wan, Vincent / Ordelman, Roeland / Renals, Steve:
"Transcription of conference room meetings: an investigation",
1661-1664.
Gauvain, Jean-Luc / Adda, G. / Adda-Decker, Martine / Allauzen, Alexandre / Gendner, V. / Lamel, Lori / Schwenk, Holger:
"Where are we in transcribing French broadcast news?",
1665-1668.
Scharenborg, Odette / Seneff, Stephanie:
"Two-pass strategy for handling OOVs in a large vocabulary recognition task",
1669-1672.
Nguyen, Long / Xiang, Bing / Afify, Mohamed / Abdou, Sherif / Matsoukas, Spyros / Schwartz, Richard / Makhoul, John:
"The BBN RT04 English broadcast news transcription system",
1673-1676.
Zhang, Rong / Bawab, Ziad Al / Chan, Arthur / Chotimongkol, Ananlada / Huggins-Daines, David / Rudnicky, Alexander I.:
"Investigations on ensemble based semi-supervised acoustic model training",
1677-1680.
Nouza, Jan / Zdánský, Jindrich / David, Petr / Cerva, Petr / Kolorenc, Jan / Nejedlová, Dana:
"Fully automated system for Czech spoken broadcast transcription with very large (300k+) lexicon",
1681-1684.
Schuster, Mike / Hori, Takaaki / Nakamura, Atsushi:
"Experiments with probabilistic principal component analysis in LVCSR",
1685-1688.
Vu, Thang Tat / Nguyen, Dung Tien / Luong, Mai Chi / Hosom, John-Paul:
"Vietnamese large vocabulary continuous speech recognition",
1689-1692.
Shinozaki, Takahiro / Ostendorf, Mari / Atlas, Les:
"Data sampling for improved speech recognizer training",
1693-1696.
Prosody Modelling and Speech Technology I, II
Levow, Gina-Anne:
"Context in multi-lingual tone and pitch accent recognition",
1809-1812.
Tamburini, Fabio:
"Automatic prominence identification and prosodic typology",
1813-1816.
Ingulfsen, Tommy / Burrows, Tina / Buchholz, Sabine:
"Influence of syntax on prosodic boundary prediction",
1817-1820.
Gretter, Roberto / Seppi, Dino:
"Using prosodic information for disambiguation purposes",
1821-1824.
Gu, Wentao / Hirose, Keikichi / Fujisaki, Hiroya:
"Analysis of the effects of word emphasis and echo question on F0 contours of Cantonese utterances",
1825-1828.
Burrows, Tina / Jackson, Peter / Knill, Katherine / Sityaev, Dmitry:
"Combining models of prosodic phrasing and pausing",
1829-1832.
Hirst, Daniel / Auran, Cyril:
"Analysis by synthesis of speech prosody: the Prozed environment",
3225-3228.
Cox, Stephen:
"A discriminative approach to phrase break modelling",
3229-3232.
Read, Ian / Cox, Stephen:
"Stochastic and syntactic techniques for predicting phrase breaks",
3233-3236.
Xydas, Gerasimos / Zervas, Panagiotis / Kouroupetroglou, Georgios / Fakotakis, Nikolaos / Kokkinakis, George:
"Tree-based prediction of prosodic phrase breaks on top of shallow textual features",
3237-3240.
Dong, Honghui / Tao, Jianhua / Xu, Bo:
"Chinese prosodic phrasing with a constraint-based approach",
3241-3244.
Dong, Minghui / Lua, Kim-Teng / Li, Haizhou:
"A probabilistic approach to prosodic word prediction for Mandarin Chinese TTS",
3245-3248.
Teixeira, João Paulo / Freitas, Diamantino / Fujisaki, Hiroya:
"Evaluation of a system for F0 contour prediction for european Portuguese",
3249-3252.
Li, Ke / Sagisaka, Yoshinori:
"Analysis on command sequences of a F0 generation model for Mandarin speech and its application to their automatic extraction",
3253-3256.
Hirose, Keikichi / Furuyama, Yusuke / Minematsu, Nobuaki:
"Corpus-based extraction of F0 contour generation process model parameters",
3257-3260.
Escudero, David / Cardeñoso-Payo, Valentín:
"Optimized selection of intonation dictionaries in corpus based intonation modelling",
3261-3264.
Sun, Qinghua / Hirose, Keikichi / Gu, Wentao / Minematsu, Nobuaki:
"Generation of fundamental frequency contours for Mandarin speech synthesis based on tone nucleus model",
3265-3268.
Chiang, Chen-Yu / Wang, Yih-Ru / Chen, Sin-Horng:
"On the inter-syllable coarticulation effect of pitch modeling for Mandarin speech",
3269-3272.
Rojc, Matej / Aguero, Pablo Daniel / Bonafonte, Antonio / Kacic, Zdravko:
"Training the tilt intonation model using the JEMA methodology",
3273-3276.
Wang, Dagen / Narayanan, Shrikanth:
"Piecewise linear stylization of pitch via wavelet analysis",
3277-3280.
Romsdorfer, Harald / Pfister, Beat:
"Phonetic labeling and segmentation of mixed-lingual prosody databases",
3281-3284.
Morais, Edmilson / Violaro, Fábio:
"Exploratory analysis of linguistic data based on genetic algorithm for robust modeling of the segmental duration of speech",
3285-3288.
Gibbon, Dafydd / Fernandes, Flaviane Romani:
"Annotation-mining for rhythm model comparison in Brazilian portuguese",
3289-3292.
Nagano, Tohru / Mori, Shinsuke / Nishimura, Masafumi:
"A stochastic approach to phoneme and accent estimation",
3293-3296.
Brenier, Jason M. / Cer, Daniel M. / Jurafsky, Daniel:
"The detection of emphatic words using acoustic and lexical features",
3297-3300.
Surendran, Dinoj / Levow, Gina-Anne / Xu, Yi:
"Tone recognition in Mandarin using focus",
3301-3304.
Wypych, Mikolaj:
"An automatic intonation recognizer for the Polish language based on machine learning and expert knowledge",
3305-3308.
Sakurai, Atsuhiro:
"Generalized envelope matching technique for time-scale modification of speech (GEM-TSM)",
3309-3312.
Detecting and Synthesizing Speaker State
Hirschberg, Julia / Benus, Stefan / Brenier, Jason M. / Enos, Frank / Friedman, Sarah / Gilman, Sarah / Girand, Cynthia / Graciarena, Martin / Kathol, Andreas / Michaelis, Laura / Pellom, Bryan L. / Shriberg, Elizabeth / Stolcke, Andreas:
"Distinguishing deceptive from non-deceptive speech",
1833-1836.
Liscombe, Jackson / Hirschberg, Julia / Venditti, Jennifer J.:
"Detecting certainness in spoken tutorial dialogues",
1837-1840.
Vidrascu, Laurence / Devillers, Laurence:
"Detection of real-life emotions in call centers",
1841-1844.
Liscombe, Jackson / Riccardi, Giuseppe / Hakkani-Tür, Dilek:
"Using context to improve emotion detection in spoken dialog systems",
1845-1848.
Yanushevskaya, Irena / Gobl, Christer / Ní Chasaide, Ailbhe:
"Voice quality and f0 cues for affect expression: implications for synthesis",
1849-1852.
Takahashi, Toru / Fujii, Takeshi / Nishi, Masashi / Banno, Hideki / Irino, Toshio / Kawahara, Hideki:
"Voice and emotional expression transformation based on statistics of vowel parameters in an emotional speech database",
1853-1856.
Rapid Development of Spoken Dialogue Systems
Fabbrizio, Giuseppe Di / Tur, Gokhan / Hakkani-Tür, Dilek:
"Automated wizard-of-oz for spoken dialogue systems",
1857-1860.
Katsurada, Kouichi / Sato, Kunitoshi / Adachi, Hiroaki / Yamada, Hirobumi / Nitta, Tsuneo:
"A rapid prototyping tool for constructing web-based MMI applications",
1861-1864.
Hanna, Philip / O'Neill, Ian / Liu, Xingkun / McTear, Michael:
"Developing extensible and reusable spoken dialogue components: an examination of the Queen's communicator",
1865-1868.
Wang, Ye-Yi / Acero, Alex:
"SGStudio: rapid semantic grammar development for spoken language understanding",
1869-1872.
Akbacak, Murat / Gao, Yuqing / Gu, Liang / Kuo, Hong-Kwang Jeff:
"Rapid transition to new spoken dialogue domains: language model training using knowledge from previous domain applications and web text resources",
1873-1876.
Rayner, Manny / Bouillon, Pierrette / Chatzichrisafis, Nikos / Hockey, Beth Ann / Santaholma, Marianne / Starlander, Marianne / Isahara, Hitoshi / Kanzaki, Kyoko / Nakao, Yukie:
"A methodology for comparing grammar-based and robust approaches to speech understanding",
1877-1880.
Text-to-Speech I, II
Mairesse, François / Walker, Marilyn:
"Learning to personalize spoken generation for dialogue systems",
1881-1884.
Revelin, S. / Cadic, D. / Waast-Richard, C.:
"Optimization of text-to-speech phonetic transcriptions using a-posteriori signal comparison",
1885-1888.
Salor, Özgül / Demirekler, Mübeccel:
"Voice transformation using principle component analysis based LSF quantization and dynamic programming approach",
1889-1892.
Li, Hai Ping / Zhang, Wei:
"Adapt Mandarin TTS system to Chinese dialect TTS systems",
1893-1896.
Zheng, Min / Shi, Qin / Zhang, Wei / Cai, Lianhong:
"Grapheme-to-phoneme conversion based on TBL algorithm in Mandarin TTS system",
1897-1900.
Massimino, Paolo / Pacchiotti, Alberto:
"An automaton-based machine learning technique for automatic phonetic transcription",
1901-1904.
Soonklang, Tasanawan / Damper, Robert I. / Marchand, Yannick:
"Comparative objective and subjective evaluation of three data-driven techniques for proper name pronunciation",
1905-1908.
Engwall, Olov:
"Articulatory synthesis using corpus-based estimation of line spectrum pairs",
1909-1912.
Chen, Aoju / Os, Els den:
"Effects of pitch accent type on interpreting information status in synthetic speech",
1913-1916.
Prusi, Perttu / Kainulainen, Anssi / Hakulinen, Jaakko / Turunen, Markku / Salonen, Esa-Pekka / Helin, Leena:
"Towards generic spatial object model and route guidance grammar for speech-based systems",
1917-1920.
Hsia, Chi-Chun / Wu, Chung-Hsien / Liu, Te-Hsien:
"Duration-embedded bi-HMM for expressive voice conversion",
1921-1924.
Hirai, Toshio / Kawai, Hisashi / Tsuzaki, Minoru / Nishizawa, Nobuyuki:
"Analysis of major factors of naturalness degradation in concatenative synthesis",
1925-1928.
Tian, Jilei / Nurminen, Jani / Kiss, Imre:
"Duration modeling and memory optimization in a Mandarin TTS system",
1929-1932.
Liang, Min-Siong / Chuang, Ke-Chun / Yang, Rhuei-Cheng / Chiang, Yuang-Chin / Lyu, Ren-Yuan:
"A bi-lingual Mandarin-to-taiwanese text-to-speech system",
1933-1936.
Reichel, Uwe D. / Schiel, Florian:
"Using morphology and phoneme history to improve grapheme-to-phoneme conversion",
1937-1940.
Goubanova, Olga / King, Simon:
"Predicting consonant duration with Bayesian belief networks",
1941-1944.
Jande, Per-Anders:
"Inducing decision tree pronunciation variation models from annotated speech data",
1945-1948.
Wang, Lijuan / Zhao, Yong / Chu, Min / Soong, Frank K. / Cao, Zhigang:
"Phonetic transcription verification with generalized posterior probability",
1949-1952.
Cheng, Hua / Weng, Fuliang / Hantaweepant, Niti / Cavedon, Lawrence / Peters, Stanley:
"Training a maximum entropy model for surface realization",
1953-1956.
Toda, Tomoki / Shikano, Kiyohiro:
"NAM-to-speech conversion with Gaussian mixture models",
1957-1960.
Savino, Michelina / Refice, Mario / Mitaritonna, Massimo:
"Which Italian do current systems speak? a first step towards pronunciation modelling of Italian varieties",
1961-1964.
Oliver, Dominika / Clark, Robert A. J.:
"Modelling pitch accent types for Polish speech synthesis",
1965-1968.
Hansakunbuntheung, C. / Thangthai, Ausdang / Wutiwiwatchai, Chai / Siricharoenchai, Rungkarn:
"Learning methods and features for corpus-based phrase break prediction on Thai",
1969-1972.
Taylor, Paul:
"Hidden Markov models for grapheme to phoneme conversion",
1973-1976.
Reubold, Ulrich / Steffen, Alexander:
"Pitch-effects in diphone recording: are logatomes inappropriate?",
2797-2800.
Toda, Tomoki / Tokuda, Keiichi:
"Speech parameter generation algorithm considering global variance for HMM-based speech synthesis",
2801-2804.
Tachibana, Makoto / Yamagishi, Junichi / Masuko, Takashi / Kobayashi, Takao:
"Performance evaluation of style adaptation for hidden semi-Markov model based speech synthesis",
2805-2808.
Webster, Gabriel / Burrows, Tina / Knill, Katherine:
"A comparison of methods for speaker-dependent pronunciation tuning for text-to-speech synthesis",
2809-2812.
Syrdal, Ann K. / Conkie, Alistair D.:
"Perceptually-based data-driven join costs: comparing join types",
2813-2816.
Pantazis, Yannis / Stylianou, Yannis / Klabbers, Esther:
"Discontinuity detection in concatenated speech synthesis based on nonlinear speech analysis",
2817-2820.
Speaker Characterization and Recognition I-IV
Wang, Longbiao / Kitaoka, Norihide / Nakagawa, Seiichi:
"Robust distant speaker recognition based on position dependent cepstral mean normalization",
1977-1980.
Leeuwen, David A. van:
"Speaker adaptation in the NIST speaker recognition evaluation 2004",
1981-1984.
Goldberger, Jacob / Aronowitz, Hagai:
"A distance measure between GMMs based on the unscented transform and its application to speaker recognition",
1985-1988.
Dusan, Sorin:
"Estimation of speaker's height and vocal tract length from speech signal",
1989-1992.
Torre Toledano, Doroteo / Fombella, Carlos / Gonzalez Rodriguez, Joaquin / Hernandez Gomez, Luis:
"On the relationship between phonetic modeling precision and phonetic speaker recognition accuracy",
1993-1996.
Fortuna, J. / Sivakumaran, P. / Ariyaeeinia, A. / Malegaonkar, A.:
"Open-set speaker identification using adapted Gaussian mixture models",
1997-2000.
McAuley, James / Ming, Ji / Corr, Pat:
"Speaker verification in noisy conditions using correlated subband features",
2001-2004.
Collet, Mikaël / Mam, Yassine / Charlet, Delphine / Bimbot, Frédéric:
"Probabilistic anchor models approach for speaker verification",
2005-2008.
Arcienega, Mijail / Alexander, Anil / Zimmermann, Philipp / Drygajlo, Andrzej:
"A Bayesian network approach combining pitch and spectral envelope features to reduce channel mismatch in speaker verification and forensic speaker recognition",
2009-2012.
Yiu, Kwok-Kwong / Mak, Man-Wai / Kung, Sun-Yuan:
"Channel robust speaker verification via Bayesian blind stochastic feature transformation",
2013-2016.
Matsui, Tomoko / Tanabe, Kunio:
"dPLRM-based speaker identification with log power spectrum",
2017-2020.
Zhang, Xianxian / Hansen, John H. L. / Angkititrakul, Pongtep / Takeda, Kazuya:
"Speaker verification using Gaussian mixture models within changing real car environments",
2021-2024.
Amino, Kanae / Sugawara, Tsutomu / Arai, Takayuki:
"The correspondences between the perception of the speaker individualities contained in speech sounds and their acoustic properties",
2025-2028.
Kim, Samuel / Yoon, Sungwan / Eriksson, Thomas / Kang, Hong-Goo / Youn, Dae Hee:
"A noise-robust pitch synchronous feature extraction algorithm for speaker recognition systems",
2029-2032.
Deng, Jing / Zheng, Thomas Fang / Song, Zhanjiang / Liu, Jian:
"Modeling high-level information by using Gaussian mixture correlation for GMM-UBM based speaker recognition",
2033-2036.
Zhang, Xianxian / Hansen, John H. L.:
"In-set/out-of-set speaker identification based on discriminative speech frame selection",
2037-2040.
Lei, Zhenchun / Yang, Yingchun / Wu, Zhaohui:
"Mixture of support vector machines for text-independent speaker recognition",
2041-2044.
Zhang, Shilei / Bai, Junmei / Zhang, Shuwu / Xu, Bo:
"Optimal model order selection based on regression tree in speaker identification",
2045-2048.
Faúndez-Zanuy, Marcos / Solé-Casals, Jordi:
"Speaker verification improvement using blind inversion of distortions",
2049-2052.
Omar, Mohamed Kamal / Navrátil, Jiri / Ramaswamy, Ganesh N.:
"Maximum conditional mutual information modeling for speaker verification",
2169-2172.
Ferrer, Luciana / Sönmez, Kemal / Kajarekar, Sachin:
"Class-dependent score combination for speaker recognition",
2173-2176.
Aronowitz, Hagai / Irony, Dror / Burshtein, David:
"Modeling intra-speaker variability for speaker recognition",
2177-2180.
Chetty, Girija / Wagner, Michael:
"Liveness detection using cross-modal correlations in face-voice person authentication",
2181-2184.
Asami, Taichi / Iwano, Koji / Furui, Sadaoki:
"Stream-weight optimization by LDA and adaboost for multi-stream speaker verification",
2185-2188.
Solewicz, Yosef A. / Koppel, Moshe:
"Considering speech quality in speaker verification fusion",
2189-2192.
Stolcke, Andreas / Ferrer, Luciana / Kajarekar, Sachin / Shriberg, Elizabeth / Venkataraman, Anand:
"MLLR transforms as features in speaker recognition",
2425-2428.
Baker, Brendan / Vogt, Robbie / Sridharan, Sridha:
"Gaussian mixture modelling of broad phonetic and syllabic events for text-independent speaker verification",
2429-2432.
Aronowitz, Hagai / Burshtein, David:
"Efficient speaker identification and retrieval",
2433-2436.
Sinha, R. / Tranter, S. E. / Gales, M. J. F. / Woodland, P. C.:
"The Cambridge University March 2005 speaker diarisation system",
2437-2440.
Zhu, Xuan / Barras, Claude / Meignier, Sylvain / Gauvain, Jean-Luc:
"Combining speaker identification and BIC for speaker diarization",
2441-2444.
Istrate, Dan / Scheffer, Nicolas / Fredouille, Corinne / Bonastre, Jean-François:
"Broadcast news speaker tracking for ESTER 2005 campaign",
2445-2448.
Moraru, Daniel / Ben, Mathieu / Gravier, Guillaume:
"Experiments on speaker tracking and segmentation in radio broadcast news",
3049-3052.
Dalmasso, Emanuele / Laface, Pietro / Colibro, Daniele / Vair, Claudio:
"Unsupervised segmentation and verification of multi-speaker conversational speech",
3053-3056.
Krstulovic, Sacha / Bimbot, Frédéric / Charlet, Delphine / Boëffard, Olivier:
"Focal speakers: a speaker selection method able to deal with heterogeneous similarity criteria",
3057-3060.
Ben, Mathieu / Gravier, Guillaume / Bimbot, Frédéric:
"A model space framework for efficient speaker detection",
3061-3064.
Scheffer, Nicolas / Bonastre, Jean-François:
"Speaker detection using acoustic event sequences",
3065-3068.
Tsai, Wei-Ho / Wang, Hsin-Min:
"Speaker clustering of unknown utterances based on maximum purity estimation",
3069-3072.
Zochová, Petra / Radová, Vlasta:
"Modified DISTBIC algorithm for speaker change detection",
3073-3076.
Gonon, Gilles / Gribonval, Rémi / Bimbot, Frédéric:
"Decision trees with improved efficiency for fast speaker verification",
3077-3080.
Eveno, Nicolas / Besacier, Laurent:
"A speaker independent "liveness" test for audio-visual biometrics",
3081-3084.
Kuroiwa, Shingo / Umeda, Yoshiyuki / Tsuge, Satoru / Ren, Fuji:
"Distributed speaker recognition using speaker-dependent VQ codebook and earth mover's distance",
3085-3088.
Leung, Ka-Yee / Mak, Man-Wai / Siu, Manhung / Kung, Sun-Yuan:
"Speaker verification via articulatory feature-based conditional pronunciation modeling with vowel and consonant mixture models",
3089-3092.
Chen, Jixu / Dai, Beiqian / Sun, Jun:
"Prosodic features based on wavelet analysis for speaker verification",
3093-3096.
Mihoubi, M. / O'Shaughnessy, Douglas / Dumouchel, P.:
"Relevant information extraction for discriminative training applied to speaker identification",
3097-3100.
Louradour, Jérôme / Daoudi, Khalid:
"Conceiving a new sequence kernel and applying it to SVM speaker verification",
3101-3104.
Deng, Jing / Zheng, Thomas Fang / Liu, Jian / Wu, Wenhu:
"The predictive differential amplitude spectrum for robust speaker recognition in stationary noises",
3105-3108.
Mason, Michael / Vogt, Robbie / Baker, Brendan / Sridharan, Sridha:
"Data-driven clustering for blind feature mapping in speaker verification",
3109-3112.
Zhou, Xi / Yao, Zhi-qiang / Dai, Beiqian:
"Improved covariance modeling for GMM in speaker identification",
3113-3116.
Vogt, Robbie / Baker, Brendan / Sridharan, Sridha:
"Modelling session variability in text-independent speaker verification",
3117-3120.
Siafarikas, Mihalis / Ganchev, Todor / Fakotakis, Nikolaos / Kokkinakis, George:
"Overlapping wavelet packet features for speaker verification",
3121-3124.
Yin, An-rong / Xie, Xiang / Kuang, Jingming:
"Using Hadamard ECOC in multi-class problems based on SVM",
3125-3128.
Single-channel Speech Enhancement
Cohen, Israel:
"Supergaussian GARCH models for speech signals",
2053-2056.
Mouchtaris, A. / Spiegel, J. Van der / Mueller, P. / Tsakalides, P.:
"A spectral conversion approach to feature denoising and speech enhancement",
2057-2060.
Ortega, Alfonso / Lleida, Eduardo / Masgrau, Enrique / Buera, Luis / Miguel, Antonio:
"Acoustic feedback cancellation in speech reinforcement systems for vehicles",
2061-2064.
Bourgeois, Julien / Freudenberger, Jürgen / Lathoud, Guillaume:
"Implicit control of noise canceller for speech enhancement",
2065-2068.
Kumar, T. M. Sunil / Sreenivas, T. V.:
"Speech enhancement using Markov model of speech segments",
2069-2072.
Braquet, Vladimir / Kobayashi, Takao:
"A wavelet based noise reduction algorithm for speech signal corrupted by coloured noise",
2073-2076.
Zavarehei, Esfandiar / Vaseghi, Saeed:
"Speech enhancement in temporal DFT trajectories using Kalman filters",
2077-2080.
Yan, Qin / Vaseghi, Saeed / Zavarehei, Esfandiar / Milner, Ben:
"Formant-tracking linear prediction models for speech processing in noisy environments",
2081-2084.
Jiang, Hui / Fu, Qian-Jie:
"Statistical noise compensation for cochlear implant processing",
2085-2088.
Pham, Tuan Van / Kubin, Gernot:
"WPD-based noise suppression using nonlinearly weighted threshold quantile estimation and optimal wavelet shrinking",
2089-2092.
Li, Weifeng / Itou, Katunobu / Takeda, Kazuya / Itakura, Fumitada:
"Subjective and objective quality assessment of regression-enhanced speech in real car environments",
2093-2096.
Unoki, Masashi / Kubo, Masaaki / Haniu, Atsushi / Akagi, Masato:
"A model for selective segregation of a target instrument sound from the mixed sound of various instruments",
2097-2100.
Hendriks, Richard C. / Heusdens, Richard / Jensen, Jesper:
"Improved decision directed approach for speech enhancement using an adaptive time segmentation",
2101-2104.
Lollmann, Heinrich W. / Vary, Peter:
"Generalized filter-bank equalizer for noise reduction with reduced signal delay",
2105-2108.
Roman, Nicoleta / Wang, DeLiang:
"A pitch-based model for separation of reverberant speech",
2109-2112.
Zhao, David Y. / Kleijn, W. Bastiaan:
"On noise gain estimation for HMM-based speech enhancement",
2113-2116.
Deshmukh, Om / Espy-Wilson, Carol:
"Speech enhancement using auditory phase opponency model",
2117-2120.
Acoustic Modelling for LVCSR
Mak, Brian / Yeung, Siu-Kei Au / Lai, Yiu-Pong / Siu, Manhung:
"High-density discrete HMM with the use of scalar quantization indexing",
2121-2124.
Zheng, Jing / Stolcke, Andreas:
"Improved discriminative training using phone lattices",
2125-2128.
Zhu, Qifeng / Chen, Barry Y. / Grezl, Frantisek / Morgan, Nelson:
"Improved MLP structures for data-driven feature extraction for ASR",
2129-2132.
Macherey, Wolfgang / Haferkamp, Lars / Schlüter, Ralf / Ney, Hermann:
"Investigations on error minimizing training criteria for discriminative training in automatic speech recognition",
2133-2136.
Sim, K. C. / Gales, M. J. F.:
"Temporally varying model parameters for large vocabulary continuous speech recognition",
2137-2140.
Zhu, Qifeng / Stolcke, Andreas / Chen, Barry Y. / Morgan, Nelson:
"Using MLP features in SRI's conversational speech recognition system",
2141-2144.
Gender and Age Issues in Speech and Language Research I, II
Gerosa, Matteo / Giuliani, Diego / Brugnara, Fabio:
"Speaker adaptive acoustic modeling with mixture of adult and children's speech",
2193-2196.
D'Arcy, Shona / Russell, Martin:
"A comparison of human and computer recognition accuracy for children's speech",
2197-2200.
Cosi, Piero / Pellom, Bryan L.:
"Italian children's speech recognition for advanced interactive literacy tutors",
2201-2204.
Adda-Decker, Martine / Lamel, Lori:
"Do speech recognizers prefer female speakers?",
2205-2208.
Yildirim, Serdar / Lee, Chul Min / Lee, Sungbok / Potamianos, Alexandros / Narayanan, Shrikanth:
"Detecting Politeness and frustration state of a child in a conversational computer game",
2209-2212.
Binnenpoorte, Diana / Bael, Christophe Van / Os, Els den / Boves, Louis:
"Gender in everyday speech and language: a corpus-based study",
2213-2216.
Elenius, Daniel / Blomberg, Mats:
"Adaptation and normalization experiments in speech recognition for 4 to 8 year old children",
2749-2752.
Jansen, Wim / Hamme, Hugo Van:
"PROSPECT features and their application to missing data techniques for vocal tract length normalization",
2753-2756.
Hagen, Andreas / Pellom, Bryan L.:
"Data driven subword unit modeling for speech recognition and its application to interactive reading tutors",
2757-2760.
Batliner, Anton / Blomberg, Mats / D'Arcy, Shona / Elenius, Daniel / Giuliani, Diego / Gerosa, Matteo / Hacker, Christian / Russell, Martin / Steidl, Stefan / Wong, Michael:
"The PF_STAR children's speech corpus",
2761-2764.
Bell, Linda / Boye, Johan / Gustafson, Joakim / Heldner, Mattias / Lindström, Anders / Wirén, Mats:
"The Swedish NICE corpus - spoken dialogues between children and embodied characters in a computer game scenario",
2765-2768.
Miyauchi, Yusuke / Hodoshima, Nao / Yasu, Keiichi / Hayashi, Nahoko / Arai, Takayuki / Shindo, Mitsuko:
"A preprocessing technique for improving speech intelligibility in reverberant environments: the effect of steady-state suppression on elderly people",
2769-2772.
Language and Dialect Identification I, II
Matejka, Pavel / Schwarz, Petr / Cernocký, Jan / Chytil, Pavel:
"Phonotactic language identification using high quality phoneme recognition",
2237-2240.
Huang, Rongqing / Hansen, John H. L.:
"Advances in word based dialect/accent classification",
2241-2244.
Hamdi, Rym / Ghazali, Salem / Barkat-Defradas, Melissa:
"Syllable structure in spoken Arabic: a comparative investigation",
2245-2248.
Marcadet, J. C. / Fischer, V. / Waast-Richard, C.:
"A transformation-based learning approach to language identification for mixed-lingual text-to-speech synthesis",
2249-2252.
Itahashi, Shuichi / Zhu, Shiwei / Yamamoto, Mikio:
"Constructing family trees of multilingual speech using Gaussian mixture models",
2253-2256.
Rouas, Jean-Luc:
"Modeling long and short-term prosody for language identification",
2257-2260.
Wu, Tingyao / Compernolle, Dirk Van / Duchateau, Jacques / Yang, Qian / Martens, Jean-Pierre:
"Improving the discrimination between native accents when recorded over different channels",
2821-2824.
Trancoso, Isabel / Serralheiro, António / Viana, Céu / Caseiro, Diamantino:
"Aligning and recognizing spoken books in different varieties of Portuguese",
2825-2828.
Ma, Bin / Li, Haizhou / Lee, Chin-Hui:
"An acoustic segment modeling approach to automatic language identification",
2829-2832.
Zhu, Dong / Adda-Decker, Martine / Antoine, Fabien:
"Different size multilingual phone inventories and context-dependent acoustic models for language identification",
2833-2836.
Gao, Sheng / Ma, Bin / Li, Haizhou / Lee, Chin-Hui:
"A text categorization approach to automatic language identification",
2837-2840.
Salvi, Giampiero:
"Advances in regional accent clustering in Swedish",
2841-2844.
Spoken Language Translation I, II
Paulik, M. / Fügen, Christian / Stüker, Sebastian / Schultz, Tanja / Schaaf, Thomas / Waibel, Alex:
"Document driven machine translation enhanced ASR",
2261-2264.
Khadivi, Shahram / Zolnay, András / Ney, Hermann:
"Automatic text dictation in computer-assisted translation",
2265-2268.
Rodríguez, L. / Civera, J. / Vidal, E. / Casacuberta, Francisco / Martínez, C.:
"On the use of speech recognition in computer assisted translation",
2269-2272.
Kathol, Andreas / Precoda, Kristin / Vergyri, Dimitra / Wang, Wen / Riehemann, Susanne:
"Speech translation for low-resource languages: the case of Pashto",
2273-2276.
Picó, David / González, Jorge / Casacuberta, Francisco / Caseiro, Diamantino / Trancoso, Isabel:
"Finite-state transducer inference for a speech-input Portuguese-to-English machine translation system",
2277-2280.
Ohta, Kenko / Yasuda, Keiji / Kikui, Genichiro / Yanagida, Masuzo:
"Quantitative evaluation of effects of speech recognition errors on speech translation quality",
2281-2284.
Matusov, E. / Kanthak, S. / Ney, Hermann:
"On the integration of speech recognition and statistical machine translation",
3177-3180.
Quan, V. H. / Federico, M. / Cettolo, M.:
"Integrated n-best re-ranking for spoken language translation",
3181-3184.
Crego, Josep M. / Mariño, José B. / Gispert, Adrià de:
"An n-gram-based statistical machine translation decoder",
3185-3188.
Gu, Liang / Gao, Yuqing:
"Use of maximum entropy in natural word generation for statistical concept-based speech-to-speech translation",
3189-3192.
Gispert, Adrià de / Mariño, José B. / Crego, Josep M.:
"Improving statistical machine translation by classifying and generalizing inflected verb forms",
3193-3196.
Bozarov, Abdulvohid / Sagisaka, Yoshinori / Zhang, Ruiqiang / Kikui, Genichiro:
"Improved speech recognition word lattice translation by confidence measure",
3197-3200.
Multi-channel Speech Enhancement
Lotter, Thomas / Sauert, Bastian / Vary, Peter:
"A stereo input-output superdirective beamformer for dual channel noise reduction",
2285-2288.
Klee, Ulrich / Gehrig, Tobias / McDonough, John:
"Kalman filters for time delay of arrival-based source localization",
2289-2292.
Ichikawa, Osamu / Nishimura, Masafumi:
"Simultaneous adaptation of echo cancellation and spectral subtraction for in-car speech recognition",
2293-2296.
Hu, Rong / Zhao, Yunxin:
"Variable step size adaptive decorrelation filtering for competing speech separation",
2297-2300.
Saitoh, Daisuke / Kaminuma, Atsunobu / Saruwatari, Hiroshi / Nishikawa, Tsuyoki / Lee, Akinobu:
"Speech extraction in a car interior using frequency-domain ICA with rapid filter adaptations",
2301-2304.
Hu, Rongqiang / Kamath, Sunil D. / Anderson, David V.:
"Speech enhancement using non-acoustic sensors",
2305-2308.
Delcroix, Marc / Hikichi, Takafumi / Miyoshi, Masato:
"Improved blind dereverberation performance by using spatial information",
2309-2312.
Li, Junfeng / Akagi, Masato:
"A hybrid microphone array post-filter in a diffuse noise field",
2313-2316.
Krishnan, Venkatesh / Whitehead, Phil S. / Anderson, David V. / Clements, Mark A.:
"A framework for estimation of clean speech by fusion of outputs from multiple speech enhancement systems",
2317-2320.
Denda, Yuki / Nishiura, Takanobu / Yamashita, Yoichi:
"A study of weighted CSP analysis with average speech spectrum for noise robust talker localization",
2321-2324.
Kim, Young-Ik / An, Sung Jun / Kil, Rhee Man / Park, Hyung-Min:
"Sound segregation based on binaural zero-crossings",
2325-2328.
Freudenberger, Jürgen / Linhard, Klaus:
"A two-microphone diversity system and its application for hands-free car kits",
2329-2332.
Murakami, Takahiro / Kurihara, Kiyoshi / Ishida, Yoshihisa:
"Directionally constrained minimization of power algorithm for speech signals",
2333-2336.
Brutti, Alessio / Omologo, Maurizio / Svaizer, Piergiorgio:
"Oriented global coherence field for the estimation of the head orientation in smart rooms equipped with distributed microphone arrays",
2337-2340.
Madhu, Nilesh / Martin, Rainer:
"Robust speaker localization through adaptive weighted pair TDOA (AWEPAT) estimation",
2341-2344.
Lathoud, Guillaume / Magimai-Doss, Mathew / Mesot, Bertrand:
"A spectrogram model for enhanced source localization and noise-robust ASR",
2345-2348.
Srinivasan, Sriram / Nilsson, Mattias / Kleijn, W. Bastiaan:
"Denoising through source separation and minimum tracking",
2349-2352.
Grisoni, Louisa Busca / Hansen, John H. L.:
"Collaborative voice activity detection for hearing aids",
2353-2356.
Robledo-Arnuncio, Enrique / Juang, Biing-Hwang:
"Using inter-frequency decorrelation to reduce the permutation inconsistency problem in blind source separation",
2357-2360.
Subramanya, Amarnag / Zhang, Zhengyou / Liu, Zicheng / Droppo, Jasha / Acero, Alex:
"A graphical model for multi-sensory speech processing in air-and-bone conductive microphones",
2361-2364.
Phonetics and Phonology I, II
Dusan, Sorin:
"On the nature of acoustic information in identification of coarticulated vowels",
2449-2452.
Gendrot, Cédric / Adda-Decker, Martine:
"Impact of duration on F1/F2 formant values of oral vowels: an automatic analysis of large broadcast news corpora in French and German",
2453-2456.
Quene, Hugo:
"Modeling of between-speaker and within-speaker variation in spontaneous speech tempo",
2457-2460.
Komatsu, Masahiko / Aoyagi, Makiko:
"Vowel devoicing vs. mora-timed rhythm in spontaneous Japanese - inspection of phonetic labels of OGI_TS",
2461-2464.
Al-Tamimi, Jalal-Eddin / Ferragne, Emmanuel:
"Does vowel space size depend on language vowel inventories? evidence from two Arabic dialects and French",
2465-2468.
Shih, Chilin:
"Understanding phonology by phonetic implementation",
2469-2472.
Moates, Danny R. / Bond, Zinny S. / Fox, Russell / Stockmal, Verna:
"The feature [sonorant] in lexical access",
2869-2872.
Mikuteit, Simone:
"Voice and aspiration in German and east bengali stops: a cross-language study",
2873-2876.
Jacobi, Irene / Pols, Louis C. W. / Stroop, Jan:
"Polder dutch: aspects of the /ei/-lowering in standard dutch",
2877-2880.
Castelli, Eric / Carré, René:
"Production and perception of Vietnamese vowels",
2881-2884.
Ngoc, Tuan Vu / d'Alessandro, Christophe / Michaud, Alexis:
"Using open quotient for the characterisation of vietnamese glottalised tones",
2885-2888.
Hajek, John / Stevens, Mary:
"On the acoustic characterization of ejective stops in Waima'a",
2889-2892.
Stevens, Mary / Hajek, John:
"Spirantization of /p t k/ in Sienese Italian and so-called semi-fricatives",
2893-2896.
Gili Fivela, Barbara / Zmarich, Claudio:
"Italian geminates under speech rate and focalization changes: kinematic, acoustic, and perception data",
2897-2900.
Kim, Sunhee:
"Durational characteristics of Korean Lombard speech",
2901-2904.
Isei-Jaakkola, Toshiko / Asakawa, Satoshi:
"A cross-linguistic study of vowel quantity in different word structures: Japanese, Finnish and Czech",
2905-2908.
Mori, Laura / Barkat-Defradas, Melissa:
"Acoustic properties of foreign accent: VOT variations in Moroccan-accented Italian",
2909-2912.
Rauber, Andréia S. / Escudero, Paola / Bion, Ricardo A. H. / Baptista, Barbara O.:
"The interrelation between the perception and production of English vowels by native speakers of Brazilian Portuguese",
2913-2916.
Hoelterhoff, Julia:
"Recognition of German obstruents",
2917-2920.
Skarnitzl, Radek / Volín, Jan:
"Czech voiced labiodental continuant discrimination from basic acoustic data",
2921-2924.
Maj, Jean-Baptiste / Bonneau, Anne / Fohr, Dominique / Laprie, Yves:
"An elitist approach for extracting automatically well-realized speech sounds with high confidence",
2925-2928.
Tyson, Na'im R.:
"Applying multiple regression models for predicting word duration in a corpus of spontaneous speech",
2929-2932.
Oliveira, Catarina / Moutinho, Lurdes Castro / Teixeira, António J. S.:
"On european Portuguese automatic syllabification",
2933-2936.
Chalamandaris, A. / Raptis, S. / Tsiakoulis, Pirros:
"Rule-based grapheme-to-phoneme method for the Greek",
2937-2940.
Kalimeris, Constandinos / Mikros, George / Bakamidis, Stelios:
"Assimilation and deletion phenomena involving word-final /n/ and word-initial /p, t, k/ in modern Greek: a codification of the observed variation intended for use in TTS synthesis",
2941-2944.
Weiss, Christian / Aschenberner, Bianca:
"A German viseme-set for automatic transcription of input text used for audio-visual speech synthesis",
2945-2948.
Roy, Johanna-Pascale:
"Visual perception of anticipatory rounding gestures in French",
2949-2952.
Human factors, User Experience and Natural Language Application Design
Levin, Esther / Levin, Alex:
"Spoken dialog system for real-time data capture",
2497-2500.
Pucher, Michael / Fröhlich, Peter:
"A user study on the influence of mobile device class, synthesis method, data rate and lexicon on speech synthesis quality",
2501-2504.
Chen, Fang / Katzenellenbogen, Yael:
"User's experience of a commercial speech dialogue system",
2505-2508.
Levin, Esther / Mané, Amir M.:
"Voice user interface design for automated directory assistance",
2509-2512.
Alvarez-Ryan, Maria Gabriela / Gupta, Narendra / Hollister, Barbara / Alonso, Tirso:
"Optimizing user experience through design of the spoken language understanding (SLU) module",
2513-2516.
Wright, Jeremy / Kapilow, David / Abella, Alicia:
"Interactive visualization of human-machine dialogs",
2517-2520.
TTS Inventory
Aylett, Matthew P.:
"Synthesising hyperarticulation in unit selection TTS",
2521-2524.
Tihelka, Daniel:
"Symbolic prosody driven unit selection for highly natural synthetic speech",
2525-2528.
Matousek, Jindrich / Hanzlícek, Zdenek / Tihelka, Daniel:
"Hybrid syllable/triphone speech synthesis",
2529-2532.
Campillo Díaz, Francisco / Alba, José Luis / Rodríguez Banga, Eduardo:
"A neural network approach for the design of the target cost function in unit-selection speech synthesis",
2533-2536.
Weiss, Christian:
"FSM and k-nearest-neighbor for corpus based video-realistic audio-visual synthesis",
2537-2540.
Chen, Gui-Lin / Han, Ke-Song / Yu, Zhen-Li / Yue, Dong-Jian / Zu, Yi-Qing:
"An embedded and concatenative approach to TTS of multiple languages",
2541-2544.
Ezzat, Tony / Meyers, Ethan / Glass, James / Poggio, Tomaso:
"Morphing spectral envelopes using audio flow",
2545-2548.
Colotte, Vincent / Beaufort, Richard:
"Linguistic features weighting for a text-to-speech system without prosody model",
2549-2552.
Amdal, Ingunn / Svendsen, Torbjørn:
"Unit selection synthesis database development using utterance verification",
2553-2556.
Zhao, Yong / Wang, Lijuan / Chu, Min / Soong, Frank K. / Cao, Zhigang:
"Refining phoneme segmentations using speaker-adaptive context dependent boundary models",
2557-2560.
Chen, Yining / Zhao, Yong / Chu, Min:
"Customizing base unit set with speech database in TTS systems",
2561-2564.
Rouibia, Soufiane / Rosec, Olivier:
"Unit selection for speech synthesis based on a new acoustic target cost",
2565-2568.
Chazan, Dan / Hoory, Ron / Kons, Zvi / Sagi, Ariel / Shechtman, Slava / Sorin, Alexander:
"Small footprint concatenative text-to-speech synthesis system using complex spectral envelope modeling",
2569-2572.
Alías, Francesc / Iriondo, Ignasi / Formiga, Lluís / Gonzalvo, Xavier / Monzo, Carlos / Sevillano, Xavier:
"High quality Spanish restricted-domain TTS oriented to a weather forecast application",
2573-2576.
Bjørkan, Ingmund / Svendsen, Torbjørn / Farner, Snorre:
"Comparing spectral distance measures for join cost optimization in concatenative speech synthesis",
2577-2580.
Barros, Maria João / Maia, Ranniery / Tokuda, Keiichi / Resende, Fernando Gil / Freitas, Diamantino:
"HMM-based european Portuguese TTS system",
2581-2584.
Hamza, Wael / Pitrelli, John F.:
"Combining the flexibility of speech synthesis with the naturalness of pre-recorded audio: a comparison of two approaches to phrase-splicing TTS",
2585-2588.
Strecha, Guntram / Jokisch, Oliver / Eichner, Matthias / Hoffmann, Rüdiger:
"Codec integrated voice conversion for embedded speech synthesis",
2589-2592.
Sundermann, David / Strecha, Guntram / Bonafonte, Antonio / Höge, Harald / Ney, Hermann:
"Evaluation of VTLN-based voice conversion for embedded speech synthesis",
2593-2596.
Isogai, Juri / Yamagishi, Junichi / Kobayashi, Takao:
"Model adaptation and adaptive training using ESAT algorithm for HMM-based speech synthesis",
2597-2600.
Fung, Tien-Ying / Li, Yuk-Chi / Sio, Eddie / Lee, Icarus / Meng, Helen / Ching, P. C.:
"Embedded Cantonese TTS for multi-device access to web content",
2601-2604.
Schnell, Karl / Lacroix, Arild:
"Model based analysis of a diphone database for improved unit concatenation",
2605-2608.
Speech Coding
So, Stephen / Paliwal, Kuldip K.:
"Switched split vector quantisation of line spectral frequencies for wideband speech coding",
2705-2708.
Bao, Changchun / Lukasiak, Jason / Ritz, Christian:
"A novel voicing cut-off determination for low bit-rate harmonic speech coding",
2709-2712.
Krüger, Hauke / Vary, Peter:
"A partial decorrelation scheme for improved predictive open loop quantization with noise shaping",
2713-2716.
Krishnan, Venkatesh / Barnwell III, Thomas P. / Anderson, David V.:
"Using dynamic codebook re-ordering to exploit inter-frame correlation in MELP coders",
2717-2720.
Durey, Adriane Swalm / Krishnan, Venkatesh / Barnwell III, Thomas P.:
"Enhanced speech coding based on phonetic class segmentation",
2721-2724.
Ertan, Ali Erdem / Barnwell III, Thomas P.:
"A pitch-synchronous pitch-cycle modification method for designing a hybrid i-MELP/waveform-matching speech coder",
2725-2728.
Chang, Joon-Hyuk / Shin, Jong-Won / Lee, Seung Yeol / Kim, Nam Soo:
"A new structural preprocessor for low-bit rate speech coding",
2729-2732.
Falk, Tiago H. / Chan, Wai-Yip / Kabal, Peter:
"An improved GMM-based voice quality predictor",
2733-2736.
Erkelens, Jan:
"High-quality memoryless subband coding of impulse responses at 22 bits per frame",
2737-2740.
Chen, Shi-Han / Wu, Kuo-Guan / Kuo, Chih-Chung:
"A study of variable pulse allocation for MPE and CELP coders based on PESQ analysis",
2741-2744.
Pérez-Córdoba, José L. / Peinado, Antonio M. / Gómez, Angel M. / Rubio, Antonio J.:
"Joint source-channel coding of LSP parameters for bursty channels",
2745-2748.
Discourse and Dialogue I, II
Pfleger, Norbert / Löckelt, Markus:
"Synchronizing dialogue contributions of human users and virtual characters in a virtual reality environment",
2773-2776.
Venkataraman, Anand / Liu, Yang / Shriberg, Elizabeth / Stolcke, Andreas:
"Does active learning help automatic dialog act tagging in meeting data?",
2777-2780.
Bohus, Dan / Rudnicky, Alexander I.:
"A principled approach for rejection threshold optimization in spoken dialog systems",
2781-2784.
Pérez-Piñar López, David / García Mateo, Carmen:
"Application of confidence measures for dialogue systems through the use of parallel speech recognizers",
2785-2788.
Rosset, Sophie / Tribout, Delphine:
"Multi-level information and automatic dialog acts detection in human-human spoken dialogs",
2789-2792.
Akker, Rieks op den / Bunt, Harry / Keizer, Simon / Schooten, Boris van:
"From question answering to spoken dialogue: towards an information search assistant for interactive multimodal information extraction",
2793-2796.
Wesseling, Wieneke / Son, Rob J. J. H. van:
"Timing of experimentally elicited minimal responses as quantitative evidence for the use of intonation in projecting TRPs",
3389-3392.
Yamada, Shinya / Itoh, Toshihiko / Araki, Kenji:
"Linguistic and acoustic features depending on different situations - the experiments considering speech recognition rate",
3393-3396.
Bühler, Dirk / Hamerich, Stefan W.:
"Towards voiceXML compilation for portable embedded applications in ubiquitous environments",
3397-3400.
Strangert, Eva:
"Prosody in public speech: analyses of a news announcement and a Political interview",
3401-3404.
Nanavati, Amit Anil / Rajput, Nitendra:
"Characterising dialogue call-flows for pervasive environments",
3405-3408.
Faruquie, Tanveer / Kankar, Pankaj / Rajput, Nitendra / Verma, Abhishek:
"An architecture for pluggable disambiguation mechanism for RDC based voice applications",
3409-3412.
Rajput, Nitendra / Nanavati, Amit Anil / Kumar, Abhishek / Chaudhary, Neeraj:
"Adapting dialog call-flows for pervasive devices",
3413-3416.
Krum, Ulf / Holzapfel, Hartwig / Waibel, Alex:
"Clarification questions to improve dialogue flow and speech recognition in spoken dialogue systems",
3417-3420.
Fernández, Fernando / Ferreiros, Javier / Sama, Valentín / Montero, Juan Manuel / Segundo, Rubén San / Macías-Guarasa, Javier / García, Rafael:
"Speech interface for controlling an hi-fi audio system based on a Bayesian belief networks approach for dialog modeling",
3421-3424.
Speech Recognition in Ubiquitous Networking and Context-Aware Computing
Pearce, David / Engelsma, Jonathan / Ferrans, James / Johnson, John:
"An architecture for seamless access to distributed multimodal services",
2845-2848.
Tan, Zheng-Hua / Dalsgaard, Paul / Lindberg, Børge / Xu, Haitian:
"Robust speech recognition in ubiquitous networking and context-aware computing",
2849-2852.
Ion, Valentin / Haeb-Umbach, Reinhold:
"Unified probabilistic approach to error concealment for distributed speech recognition",
2853-2856.
James, Alastair / Milner, Ben:
"Combining packet loss compensation methods for robust distributed speech recognition",
2857-2860.
Skogstad, Trond / Svendsen, Torbjørn:
"Distributed ASR using speech coder data for efficient feature vector representation",
2861-2864.
Furui, Sadaoki / Ichiba, Tomohisa / Shinozaki, Takahiro / Whittaker, Edward W. D. / Iwano, Koji:
"Cluster-based modeling for ubiquitous speech recognition",
2865-2868.
Speech Coding and Quality Assessment
Takahashi, Akira / Kurashima, Atsuko / Morioka, Chiharu / Yoshino, Hideaki:
"Objective quality assessment of wideband speech by an extension of ITU-t recommendation p.862",
3153-3156.
Werner, Marc / Vary, Peter:
"Quality control for UMTS-AMR speech channels",
3157-3160.
Chen, Wei / Kabal, Peter / Shabestary, Turaj Z.:
"Perceptual postfilter estimation for low bit rate speech coders using Gaussian mixture models",
3161-3164.
Fujita, Kengo / Kato, Tsuneo / Yamada, Hideaki / Kawai, Hisashi:
"SNR-dependent background noise compensation of PESQ values for cellular phone speech",
3165.
Lee, Gil Ho / Yoon, Jae Sam / Kim, Hong Kook:
"A MFCC-based CELP speech coder for server-based speech recognition in network environments",
3169-3172.
Grancharov, Volodya / Samuelsson, Jonas / Kleijn, W. Bastiaan:
"Distortion measures for vector quantization of noisy spectrum",
3173-3176.
Speech Inversion
Mokhtari, Parham / Kitamura, Tatsuya / Takemoto, Hironori / Honda, Kiyoshi:
"Vocal tract area function inversion by linear regression of cepstrum",
3201-3204.
Engwall, Olov:
"Introducing visual cues in acoustic-to-articulatory inversion",
3205-3208.
Sorokin, Viktor N. / Leonov, A. S. / Makarov, I. S. / Tsyplikhin, A. I.:
"Speech inversion and re-synthesis",
3209-3212.
Huckvale, Mark / Howard, Ian:
"Teaching a vocal tract simulation to imitate stop consonants",
3213-3216.
Potard, Blaise / Laprie, Yves:
"Using phonetic constraints in acoustic-to-articulatory inversion",
3217-3220.
Toutios, Asterios / Margaritis, Konstantinos:
"A support vector approach to the acoustic-to-articulatory mapping",
3221-3224.
Topics in Speech Recognition
Liu, Yang / Shriberg, Elizabeth / Stolcke, Andreas / Harper, Mary:
"Comparing HMM, maximum entropy, and conditional random fields for disfluency detection",
3313-3316.
Raj, Bhiksha / Singh, Rita / Smaragdis, Paris:
"Recognizing speech from simultaneous speakers",
3317-3320.
Wan, Vincent / Carmichael, James:
"Polynomial dynamic time warping kernel support vector machines for dysarthric speech recognition with sparse training data",
3321-3324.
Lejeune, R. / Baude, J. / Tchong, C. / Crepy, H. / Waast-Richard, C.:
"Flavoured acoustic model and combined spelling to sound for asymmetrical bilingual environment",
3325-3328.
Bartels, Chris / Duh, Kevin / Bilmes, Jeff / Kirchhoff, Katrin / King, Simon:
"Genetic triangulation of graphical models for speech and language processing",
3329-3332.
Aradilla, Guillermo / Vepa, Jithendra / Bourlard, Hervé:
"Improving speech recognition using a data-driven approach",
3333-3336.
Matsuda, Shigeki / Herbordt, Wolfgang / Nakamura, Satoshi:
"Outlier detection for acoustic model training using robust statistics",
3337-3340.
Roux, Jonathan Le / McDermott, Erik:
"Optimization methods for discriminative training",
3341-3344.
Cardinal, Patrick / Boulianne, Gilles / Comeau, Michel:
"Segmentation of recordings based on partial transcriptions",
3345-3348.
Seid, Hussien / Gambäck, Björn:
"A speaker independent continuous speech recognizer for Amharic",
3349-3352.
Ogawa, Tetsuji / Kobayashi, Tetsunori:
"Optimizing the structure of partly-hidden Markov models using weighted likelihood-ratio maximization criterion",
3353-3356.
Kumar, C. Santhosh / Mohandas, V. P. / Li, Haizhou:
"Multilingual speech recognition: a unified approach",
3357-3360.
Bartos, Tomás / Müller, Ludek:
"Detection of recognition errors based on classifiers trained on artificially created data",
3361-3364.
Li, Jinyu / Lee, Chin-Hui:
"On designing and evaluating speech event detectors",
3365-3368.
Razik, Joseph / Mella, Odile / Fohr, Dominique / Haton, Jean-Paul:
"Local word confidence measure using word graph and n-best list",
3369-3372.
Ren, Xiaolin / He, Xin / Zhang, Yaxin:
"Mandarin/English mixed-lingual name recognition for mobile phone",
3373-3376.
Ferreiros, Javier / Segundo, Rubén San / Fernández, Fernando / D'Haro, Luis-Fernando / Sama, Valentín / Barra, Roberto / Mellén, Pedro:
"New word-level and sentence-level confidence scoring using graph theory calculus and its evaluation on speech understanding",
3377-3380.
Nakamura, Masanobu / Iwano, Koji / Furui, Sadaoki:
"Analysis of spectral space reduction in spontaneous speech and its effects on speech recognition performances",
3381-3384.
King, Simon / Bartels, Chris / Bilmes, Jeff:
"SVitchboard 1: small vocabulary tasks from Switchboard",
3385-3388.