Keynotes
Pols, Louis C. W.:
"Acquiring and implementing phonetic knowledge",
K3-6.
Neuvo, Yrjö:
"Mobile future",
K7-10.
Brennan, Susan E.:
"How visual co-presence and joint attention shape speaking",
K11-12.
What do Industry and Universities Expect from Each Other? (Special Session)
Greenberg, Steven:
"Whither speech technology? - a twenty-first century perspective",
P3-6.
Dobler, Stefan / Hermansson, Hans / Minde, Tor-Björn:
"3g mobile networks and mobile internet as a promotor for new applications - challenges to industry and universities",
P7-10.
Niiniluoto, Ilkka:
"Universities and industry: marriage or co-operation between independent partners?",
11-12.
Neuvo, Yrjö:
"Considerations on what industry expects from universities",
13-14.
Strong, Gary:
"A perspective on industry/university relationships in the US",
15-16.
Choukri, Khalid:
"ELRA contribution to bridge the gap between industry and academia",
17-18.
Linguistic Modelling: Language Model Compression
Maltese, G. / Bravetti, P. / Crépy, H. / Grainger, B. J. / Herzog, M. / Palou, F.:
"Combining word- and class-based language models: a comparative study in several languages using automatic and manual word-clustering techniques",
21-24.
Isogai, Shuntaro / Shirai, Katsuhiko / Yamamoto, Hirofumi / Sagisaka, Yoshinori:
"Multi-class composite n-gram language model using multiple word clusters and word successions",
25-28.
Zitouni, Imed / Smaili, Kamel / Haton, Jean-Paul:
"Statistical language model based on a hierarchical approach: MCnv",
29-32.
Whittaker, E. W. D. / Raj, Bhiksha:
"Quantization-based language model compression",
33-36.
Speech Production: Voice Source
Bloothooft, Gerrit / Wijck, Mieke van / Pabon, Peter:
"Relations between vocal registers in voice breaks",
39-42.
Ramsay, Gordon:
"A quasi-one-dimensional model of aerodynamic and acoustic flow in the time-varying vocal tract: source and excitation mechanisms",
43-46.
Henrich, Nathalie / d'Alessandro, Christophe / Doval, Boris:
"Spectral correlates of voice open quotient and glottal flow asymmetry : theory, limits and experimental data",
47-50.
Avanzini, Federico / Alku, Paavo / Karjalainen, Matti:
"One-delayed-mass model for efficient synthesis of glottal flow",
51-54.
Speech Recognition and Understanding: Pronunciation and Subword Units
Zheng, Fang / Song, Zhanjiang / Fung, Pascale / Byrne, William:
"Modeling pronunciation variation using context-dependent weighting and b/s refined acoustic modeling",
57-60.
Bazzi, Issam / Glass, James:
"Learning units for domain-independent out-of- vocabulary word modelling",
61-64.
Nakajima, Hideharu / Hirano, Izumi / Sagisaka, Yoshinori / Shirai, Katsuhiko:
"Pronunciation variant analysis using speaking style parallel corpus",
65-68.
Kneissler, Jan / Klakow, Dietrich:
"Speech recognition for huge vocabularies by using optimized sub-word units",
69-72.
Lee, Kyung-Tak / Wellekens, Christian J.:
"Dynamic lexicon using phonetic features",
1413-1416.
Ziegenhain, Ute / Bauer, Josef G.:
"Triphone tying techniques combining a-priori rules and data driven methods",
1417-1420.
Bosch, Louis F. M. ten / Cremelie, Nick:
"Pronunciation modeling and lexical adaptation in midsize vocabulary ASR",
1421-1424.
Yi, Liu / Fung, Pascale:
"Estimating pronunciation variations from acoustic likelihood score for HMM reconstruction",
1425-1428.
Bisani, M. / Ney, Hermann:
"Breadth-first search for finding the optimal phonetic transcription from multiple utterances",
1429-1432.
Wolff, Matthias / Eichner, Matthias / Hoffmann, Rüdiger:
"Improved data-driven generation of pronunciation dictionaries using an adapted word list",
1433-1436.
Livescu, Karen / Glass, James:
"Segment-based recognition on the phonebook task: initial results and observations on duration modeling",
1437-1440.
Riis, Sřren Kamaric / Pedersen, Morten With / Jensen, Kare Jean:
"Multilingual text-to-phoneme mapping",
1441-1444.
Tsai, Ming-yi / Chou, Fu-chiang / Lee, Lin-shan:
"Pronunciation variation analysis with respect to various linguistic levels and contextual conditions for Mandarin Chinese",
1445-1448.
Tomokiyo, Laura Mayfield:
"Hypothesis-driven accent discrimination",
1449-1452.
Ma, Changxue / Randolph, Mark A.:
"An approach to automatic phonetic baseform generation based on Bayesian networks",
1453-1457.
Schramm, Hauke / Beyerlein, Peter:
"Towards discriminative lexicon optimization",
1457-1460.
He, Xiaodong / Zhao, Yunxin:
"Model complexity optimization for nonnative English speakers",
1461-1464.
Fegyó, Tibor / Mihajlik, Péter / Tatai, Péter / Gordos, Géza:
"Pronunciation modeling in hungarian number recognition",
1465-1468.
Phonetics and Phonology: Prosody and Others
Swerts, Marc / Kloots, Hanne / Gillis, Steven / Schutter, Georges De:
"Factors affecting schwa-insertion in final consonant clusters in standard dutch",
75-78.
Hitchcock, Leah / Greenberg, Steven:
"Vowel height is intimately associated with stress accent in spontaneous american English discourse",
79-82.
Gibbon, Dafydd:
"Finite state prosodic analysis of african corpus resources",
83-86.
Schröder, Marc / Cowie, Roddy / Douglas-Cowie, Ellen / Westerdijk, Machiel / Gielen, Stan:
"Acoustic correlates of emotion dimensions in view of speech synthesis",
87-90.
Ouden, Hanny den / Terken, Jacques:
"Measuring pitch range",
91-94.
Gibbon, Dafydd / Gut, Ulrike:
"Measuring speech rhythm",
95-98.
D’Imperio, Mariapaola:
"Tonal alignment, scaling and slope in Italian question and statement tunes",
99-102.
Iivonen, Antti:
"Pragmatic temporal voice range profile as a tool in the research of speech styles",
103-106.
Kim, Wooil / Kim, Taeyun / Ahn, Sungjoo / Ko, Hanseok:
"Model based stress decision method",
107-110.
Nordgĺrd, Torbjřrn / Foldvik, Arne Kjell:
"Reduction of alternative pronunciations in the norwegian computational lexicon norkompleks",
111-114.
Elordieta, Gorka / Hualde, José Ignacio:
"The role of duration as a correlate of accent in lekeitio basque",
115-118.
Johansson, Victoria / Horne, Merle / Strömqvist, Sven:
"Word final aspiration as a phrase boundary cue: data from spontaneous Swedish discourse",
119-122.
Shen, Xipeng / Xu, Bo:
"Study and auto-detection of stress based on tonal pitch range in Mandarin",
123-126.
Amir, Noam / Kerret, Ori / Karlinski, Dimitry:
"Classifying emotions in speech: a comparison of methods",
127-130.
Speech Perception: First and Second Language Learning
Behne, Dawn M. / Czigler, Peter E. / Sullivan, Kirk P.H.:
"Development of vowel quantity perception in late childhood",
133-136.
Yang, Byunggon:
"A study on the production-perception link of English vowels produced by native and non-native speakers",
137-140.
Otake, Takashi / Yamaguchi, Yuka:
"Japanese can be aware of syllables and morae: evidence from Japanese-English bilingual children",
141-144.
Callan, Daniel / Tajima, Keiichi / Callan, Akiko / Akahane-Yamada, Reiko / Masaki, Shinobu:
"Neural processes underlying perceptual learning of a difficult second language phonetic contrast",
145-148.
Komatsu, Masahiko / Mori, Kazuya / Arai, Takayuki / Murahara, Yuji:
"Human language identification with reduced segmental information: comparison between monolinguals and bilinguals",
149-152.
Speech Perception: Miscellaneous
Fernández, Santiago / Feijóo, Sergio:
"Coarticulatory effects in perception",
155-158.
Harding, Sue / Meyer, Georg:
"A case for multi-resolution auditory scene analysis",
159-162.
Ménard, Lucie / Schwartz, Jean-Luc / Boë, Louis-Jean / Kandel, Sonia / Vallée, Nathalie:
"Perceptual identification and normalization of synthesized French vowels from birth to adulthood",
163-166.
Ménard, Lucie / Boë, Louis-Jean:
"Perceptual categorization of maximal vowel spaces from birth to adulthood simulated by an articulatory model",
167-170.
Eskenazi, Maxine / Black, Alan W.:
"A study on speech over the telephone and aging",
171-174.
Chen, Marcia / Alwan, Abeer:
"On the perception of voicing for plosives in noise",
175-178.
Jiang, Jintao / Alwan, Abeer / Auer, Edward T. / Bernstein, Lynne E.:
"Predicting visual consonant perception from physical measures",
179-182.
Ainsworth, William A. / Cervera, T.:
"Effects of noise adaptation on the perception of voiced plosives in isolated syllables",
371-374.
Hiroshige, Makoto / Araki, Kenji / Tochinai, Koji:
"On differential limen of word-based local speechrate variation in Japanese expressed by duration ratio",
375-378.
Tokuma, Wan:
"A multidimensional scaling study of fricatives; a comparison of perceptual and physical dimensions",
379-382.
Swerts, Marc / Krahmer, Emiel:
"Reconstructing dialogue history",
383-386.
House, David / Beskow, Jonas / Granström, Björn:
"Timing and interaction of visual cues for prominence in audiovisual speech perception",
387-390.
Komatsu, Masahiko / Tokuma, Shinichi / Tokuma, Won / Arai, Takayuki:
"Modelling the perceptual identification of Japanese consonants from LPC cepstral distances",
391-394.
Burnham, Denis / Ciocca, Valter / Stokes, Stephanie:
"Auditory-visual perception of lexical tone",
395-398.
Eriksson, Anders / Thunberg, Gunilla C. / Traunmüller, Hartmut:
"Syllable prominence: a matter of vocal effort, phonetic distinct-ness and top-down processing",
399-402.
Mixdorff, Hansjörg / Widera, Christina:
"Perceived prominence in terms of a linguistically motivated quantitative intonation model",
403-406.
Hawkins, Sarah / Nguyen, Noël:
"Perception of coda voicing from properties of the onset and nucleus of 'led' and 'let'",
407-410.
Lin, L. / Ambikairajah, E. / Holmes, W. H.:
"Auditory filter bank design using masking curves",
411-414.
Erdenebat, Dashtseren / Shigeyoshi, Kitazawa / Tatsuya, Kitamura:
"A new feature driven cochlear implant speech processing strategy",
415-418.
Noise Robust Recognition: Frontend and Compensation Algorithms (Special Session)
Zhu, Qifeng / Iseli, Markus / Cui, Xiaodong / Alwan, Abeer:
"Noise robust feature extraction for ASR using the Aurora 2 database",
185-188.
Ellis, Daniel P.W. / Gomez, Manuel J. Reyes:
"Investigations into tandem acoustic modeling for the Aurora task",
189-192.
Andrassy, Bernt / Vlaj, Damjan / Beaugeant, Christophe:
"Recognition performance of the siemens front-end with and without frame dropping on the Aurora 2 database",
193-196.
Kotnik, Bojan / Kacic, Zdravko / Horvat, Bogomir:
"A multiconditional robust front-end feature extraction with a noise reduction procedure based on improved spectral subtraction algorithm",
197-200.
Veth, Johan de / Mauuary, Laurent / Noe, Bernhard / Wet, Febe de / Sienel, Jürgen / Boves, Louis / Jouvet, Denis:
"Feature vector selection to improve ASR robustness in noisy conditions",
201-204.
Macho, Dusan / Nadeu, Climent:
"Comparison of spectral derivative parameters for robust speech recognition",
205-208.
Yapanel, Umit / Hansen, John H. L. / Sarikaya, Ruhi / Pellom, Bryan:
"Robust digit recognition in noise: an evaluation using the AURORA corpus",
209-212.
Barker, Jon / Cooke, Martin / Green, Phil:
"Robust ASR based on clean speech models: an evaluation of missing data techniques for connected digit recognition in noise",
213-217.
Droppo, Jasha / Deng, Li / Acero, Alex:
"Evaluation of the SPLICE algorithm on the Aurora2 database",
217-220.
Segura, José C. / Torre, Angel de la / Benitez, M. Carmen / Peinado, Antonio M.:
"Model-based compensation of the additive noise for continuous speech recognition. experiments using the Aurora II database and tasks",
221-224.
Morris, Andrew / Hagen, Astrid / Bourlard, Hervé:
"MAP combination of multi-stream HMM or HMM/ANN experts",
225-228.
Jarc, Bojan / Babic, Rudolf:
"Second order statistics spectrum estimation method for robust speech recognition",
229-232.
Yao, Kaisheng / Chen, Jingdong / Paliwal, Kuldip K. / Nakamura, Satoshi:
"Feature extraction and model-based noise compensation for noisy speech recognition evaluated on AURORA 2 task",
233-236.
Linguistic Modelling: Language Model Adaptation
Federico, Marcello / Bertoldi, Nicola:
"Broadcast news LM adaptation using contemporary texts",
239-242.
Maucec, Mirjam Sepesy / Kacic, Zdravko:
"Topic detection for language model adaptation of highly-inflected languages by using a fuzzy comparison function",
243-246.
Georgila, Kallirroi / Fakotakis, Nikos / Kokkinakis, George:
"Efficient stochastic finite-state networks for language modelling in spoken dialogue systems",
247-250.
Visweswariah, Karthik / Printz, Harry:
"Language models conditioned on dialog state",
251-254.
Chen, Langzhou / Gauvain, Jean-Luc / Lamel, Lori / Adda, Gilles / Adda, Martine:
"Using information retrieval methods for language model adaptation",
255-258.
Speech Production: Articulation
Engwall, Olov:
"Making the tongue model talk: merging MRI & EMA measurements",
261-264.
Moen, Inger / Simonsen, Hanne Gram / Huseby, Morten / Grue, John:
"The relationship between intraoral air pressure and tongue/palate contact during the articulation of norwegian /t/ and /d/",
265-268.
Elgendy, Ahmed M. / Pols, Louis C. W.:
"Mechanical versus perceptual constraints as determinants of articulatory strategy",
269-272.
Gick, Bryan / Wilson, Ian:
"Pre-liquid excrescent schwa: what happens when vocalic targets conflict",
273-276.
Ouni, Slim / Laprie, Yves:
"Exploring the null space of the acoustic-to- articulatory inversion using a hypercube codebook",
277-280.
Speech Recognition and Understanding: Topic Detection and Information Retrieval
Theunissen, M. W. / Scheffler, K. / Preez, J. A. du:
"Phoneme-based topic spotting on the switchboard corpus",
283-286.
Franz, Martin / McCarley, J. Scott / Ward, Todd / Zhu, Wei-Jing:
"Topic styles in IR and TDT: effect on system behavior",
287-290.
Zweig, Geoffrey / Huang, Jing / Padmanabhan, Mukund:
"Extracting caller information from voicemail",
291-294.
Kuo, Hong-Kwang Jeff / Lee, Chin-Hui:
"A portability study on natural language call steering",
295-298.
Chen, Berlin / Wang, Hsin-min / Lee, Lin-shan:
"Improved spoken document retrieval by exploring extra acoustic and linguistic cues",
299-302.
Phonetics and Phonology: Segmentals and Synthesis
Tsukada, Kimiko:
"Native vs non-native production of English vowels in spontaneous speech: an acoustic phonetic study",
305-308.
Goronzy, Silke / Sahakyan, Marina / Wokurek, Wolfgang:
"Is non-native pronunciation modelling necessary ?",
309-312.
Laprie, Yves / Bonneau, Anne:
"Burst segmentation and evaluation of acoustic cues",
313-316.
Granser, Theodor / Moosmüller, Sylvia:
"The schwa in albanian",
317-320.
Ashby, Simone / Carson-Berndsen, Julie / Joue, Gina:
"A testbed for developing multilingual phonotactic descriptions",
321-324.
Fung, Wing-Nga / Lau, Sze-Lok:
"A physiological analysis of nasals and nasalization in Chinese",
325-328.
Donovan, Robert E.:
"A component by component listening test analysis of the IBM trainable speech synthesis system",
329-332.
Pan, Shimei / McKeown, Kathleen / Hirschberg, Julia:
"Semantic abnormality and its realization in spoken language",
333-336.
Campbell, Nick:
"TALKING FOREIGN - concatenative speech synthesis and the language barrier",
337-340.
Jensen, Christian:
"Schwa-assimilation in danish synthetic speech",
341-344.
Tamura, Masatsune / Masuko, Takashi / Tokuda, Keiichi / Kobayashi, Takao:
"Text-to-speech synthesis with arbitrary speaker's voice from average voice",
345-348.
Toda, Tomoki / Saruwatari, Hiroshi / Shikano, Kiyohiro:
"High quality voice conversion based on Gaussian mixture model with dynamic frequency warping",
349-352.
Tang, Min / Wang, Chao / Seneff, Stephanie:
"Voice transformations: from speech synthesis to mammalian vocalizations",
353-356.
Gutiérrez-Arriola, J. M. / Montero, J. M. / Vallejo, J. A. / Córdoba, R. / San-Segundo, R. / Pardo, Juan M.:
"A new multi-speaker formant synthesizer that applies voice conversion techniques",
357-360.
Mashimo, Mikiko / Toda, Tomoki / Shikano, Kiyohiro / Campbell, Nick:
"Evaluation of cross-language voice conversion based on GMM and straight",
361-364.
Coulston, Rachel:
"Ejective reduction in chaha is conditioned by more than prosodic position",
365-368.
Noise Robust Recognition: Frontend (Special Session)
Kim, Hong Kook / Rose, Richard C. / Kang, Hong-Goo:
"Acoustic feature compensation based on decomposition of speech and noise for ASR in noisy environments",
421-424.
Cheng, Yan Ming / Macho, Dusan / Wei, Yuanjun / Ealey, Douglas / Kelleher, Holly / Pearce, David / Kushner, William / Ramabadran, Tenkasi:
"A robust front-end algorithm for distributed speech recognition",
425-428.
Benitez, M. Carmen / Burget, Lukas / Chen, Barry / Dupont, Stephane / Garudadri, Hari / Hermansky, Hynek / Jain, Pratibha / Kajarekar, Sachin / Morgan, Nelson / Sivadas, Sunil:
"Robust ASR front-end using spectral-based and discriminant features: experiments on the Aurora tasks",
429-432.
Noé, Bernhard / Sienel, Jürgen / Jouvet, Denis / Mauuary, Laurent / Veth, Johan de / Boves, Louis / Wet, Febe de:
"Noise reduction for noise robust feature extraction for distributed speech recognition",
433-436.
Ealey, Douglas / Kelleher, Holly / Pearce, David:
"Harmonic tunnelling: tracking non-stationary noises during speech",
437-440.
Linguistic Modelling: Semantic Modelling
Carter, David / Gransden, Ian:
"Resource-limited sentence boundary detection",
443-446.
Pargellis, Andrew / Fosler-Lussier, Eric / Potamianos, Alexandros / Lee, Chin-Hui:
"Metrics for measuring domain independence of semantic classes",
447-450.
Mou, Xiaolong / Seneff, Stephanie / Zue, Victor:
"Context-dependent probabilistic hierarchical sublexical modelling using finite state transducers",
451-454.
Bellegarda, Jerome R. / Silverman, Kim E. A.:
"Data-driven semantic inference for unconstrained desktop command and control",
455-458.
Jansche, Martin:
"Information extraction via heuristics for a movie showtime query system",
459-462.
Speech Perception: Recognition and Intelligibility
Otake, Takashi / Cutler, Anne:
"Recognition of (almost) spoken words: evidence from word play in Japanese",
465-468.
Colotte, Vincent / Laprie, Yves / Bonneau, Anne:
"Perceptual experiments on enhanced and slowed down speech sentences for second language acquisition",
469-473.
Greenberg, Steven / Arai, Takayuki:
"The relation between speech intelligibility and the complex modulation spectrum",
473-476.
Crouzet, Olivier / Ainsworth, William A.:
"Envelope information in speech processing: acoustic-phonetic analysis vs. auditory figure-ground segregation",
477-480.
Adank, Patti / Hout, Roeland van / Smits, Roel:
"A comparison between human vowel normalization strategies and acoustic vowel transformation techniques",
481-484.
Speech Recognition and Understanding: LVCSR
Ircing, P. / Krbec, P. / Hajic, J. / Psutka, J. / Khudanpur, S. / Jelinek, Frederick / Byrne, William:
"On large vocabulary continuous speech recognition of highly inflectional language - czech",
487-490.
Shinozaki, Takahiro / Hori, Chiori / Furui, Sadaoki:
"Towards automatic transcription of spontaneous presentations",
491-494.
Siohan, Olivier / Ando, Akio / Afify, Mohamed / Jiang, Hui / Lee, Chin-Hui / Li, Qi / Liu, Feng / Onoe, Kazuo / Soong, Frank K. / Zhou, Qiru:
"A real-time Japanese broadcast news closed-captioning system",
495-498.
Beyerlein, Peter / Aubert, X. / Harris, M. / Meyer, C. / Schramm, Hauke:
"Investigations on conversational speech recognition",
499-502.
Gao, Yuqing / Erdogan, Hakan / Li, Yongxin / Goel, Vaibhava / Picheny, Michael:
"Recent advances in speech recognition system for IBM DARPA communicator",
503-506.
Willett, Daniel / McDermott, Erik / Minami, Yasuhiro / Katagiri, Shigeru:
"Time and memory efficient viterbi decoding for LVCSR using a precompiled search network",
847-850.
Liu, Feng / Afify, Mohamed / Jiang, Hui / Siohan, Olivier:
"A new verification-based fast match approach to large vocabulary speech recognition",
851-854.
Nakagawa, Seiichi / Horibe, Yukihisa:
"A fast calculation method in LVCSRS by time-skipping and clustering of probability density distributions",
855-858.
Homma, Shinichi / Kobayashi, Akio / Sato, Shoei / Imai, Toru / Ando, Akio:
"Speech recognition of Japanese news commentary",
859-862.
Speech Synthesis: Systems and Prosody
Cosi, Piero / Tesser, Fabio / Gretter, Roberto / Avesani, Cinzia / Macon, Mike:
"Festival speaks Italian!",
509-512.
Monaghan, Alex / Kassaei, Mahmoud / Luckin, Mark / Amador-Hernandez, Mariscela / Lowry, Andrew / Faulkner, Dan / Sannier, Fred:
"Multilingual TTS for computer telephony: the aculab approach",
513-516.
Kiss, Géza / Németh, Géza / Olaszy, Gábor / Gordos, Géza:
"A flexible multilingual TTS development and speech research tool",
517-520.
Klabbers, Esther / Stöber, Karlheinz / Veldhuis, Raymond / Wagner, Petra / Breuer, Stefan:
"Speech synthesis development made easy: the bonn open synthesis system",
521-525.
Olaszy, Gábor / Németh, Géza / Olaszi, Péter:
"Automatic prosody generation - a model for hungarian",
525-528.
Herwijnen, Olga van / Terken, Jacques:
"Evaluation of PROS-3 for the assignment of prosodic structure, compared to assignment by human experts",
529-532.
Yamashita, Yoichi / Ishida, Tomoyoshi:
"Stochastic F0 contour model based on the clustering of F0 shapes of a syntactic unit",
533-536.
Sun, Xuejing / Applebaum, Ted H.:
"Intonational phrase break prediction using decision tree and n-gram model",
537-540.
Zaki, A. / Rajouani, A. / Najim, M.:
"Synthesizing intonation of standard arabic language",
541-545.
Xu, Dawei / Mori, Hiroki / Kasuya, Hideki:
"Invariance of relative F0 change field of Chinese disyllabic words",
545-548.
Müller, Achim F. / Hoffmann, Rüdiger:
"Accent label prediction by time delay neural networks using gating clusters",
549-553.
Henrichsen, Peter Juel:
"Transformation-based learning of danish stress assignment",
553-556.
Baumann, Stefan / Trouvain, Jürgen:
"On the prosody of German telephone numbers",
557-560.
Schröder, Marc:
"Emotional speech synthesis: a review",
561-564.
Gustafson, Kjell / House, David:
"Fun or boring? a web-based evaluation of expressive synthesis for children",
565-568.
Speech Recognition and Understanding: Articulatory and Perceptual Approaches to ASR
Chen, Jingdong / Paliwal, Kuldip K. / Nakamura, Satoshi:
"Sub-band based additive noise removal for robust speech recognition",
571-574.
Tam, Yik-Cheung / Mak, Brian:
"Development of an asynchronous multi-band system for continuous speech recognition",
575-578.
Jancovic, Peter / Ming, Ji:
"A multi-band approach based on the probabilistic union model and frequency-filtering features for robust speech recognition",
579-582.
Gu, Liang / Rose, Kenneth:
"Split-band perceptual harmonic cepstral coefficients as acoustic features for speech recognition",
583-586.
Hagen, Astrid / Bourlard, Herve:
"Error correcting posterior combination for robust multi-band speech recognition",
587-590.
Gajic, Bojana / Paliwal, Kuldip K.:
"Robust parameters for speech recognition based on subband spectral centroid histograms",
591-594.
Edmondson, William H. / Zhang, Li:
"Pseudo-articulatory representations and the recognition of syllable patterns in speech",
595-598.
Frankel, Joe / King, Simon:
"ASR - articulatory speech recognition",
599-602.
Ma, Jeff Z. / Deng, Li:
"Efficient decoding strategy for conversational speech recognition using state-space models for vocal-tract-resonance dynamics",
603-606.
Weber, Katrin / Bengio, Samy / Bourlard, Hervé:
"HMM2- extraction of formant structures and their use for robust ASR",
607-610.
Yu, Xiaoqing / Wan, Wanggen / Lun, Daniel P. K.:
"Auditory model based speech recognition in noisy environment",
611-614.
Wendt, Sascha / Fink, Gernot A. / Kummert, Franz:
"Forward masking for increased robustness in automatic speech recognition",
615-618.
Li, Qi / Soong, Frank K. / Siohan, Olivier:
"An auditory system-based feature for robust speech recognition",
619-622.
Noise Robust Recognition: Robust Systems - What Helps? (Special Session)
Lieb, Markus / Fischer, Alexander:
"Experiments with the philips continuous ASR system on the AURORA noisy digits database",
625-628.
Saon, George / Huerta, Juan M. / Jan, Ea-Ee:
"Robust digit recognition in noisy environments: the IBM Aurora 2 system",
629-632.
Afify, Mohamed / Jiang, Hui / Korkmazskiy, F. / Lee, Chin-Hui / Li, Qi / Siohan, Olivier / Soong, Frank K. / Surendran, Arun C.:
"Evaluating the Aurora connected digit recognition task -- a bell labs approach",
633-636.
Phonetics and Phonology: Segmentals
Fougeron, Cécile / Goldman, J. P. / Frauenfelder, U. H.:
"Liaison and schwa deletion in French: an effect of lexical frequency and competition?",
639-642.
Zee, Eric / Lee, Wai-Sum:
"An acoustical analysis of the vowels in beijing Mandarin",
643-646.
Delvaux, Veronique / Soquet, Alain:
"Discriminant analysis of nasal vs. oral vowels in French: comparison between different parametric representations",
647-650.
Demolin, Didier / Delvaux, Véronique:
"Whispery voiced nasal stops in rwanda",
651-654.
Speech Production: Prosody
Fant, Gunnar / Kruckenberg, Anita / Liljencrants, Johan / Botinis, Antonis:
"Prominence correlates. a study of Swedish",
657-660.
Ohno, Sumio / Fujisaki, Hiroya:
"Quantitative analysis of the effects of emphasis upon prosodic features of speech",
661-664.
Dogil, Grzegorz / Möbius, Bernd:
"Towards a model of target oriented production of prosody",
665-668.
Shih, Chilin / Kochanski, Greg:
"Prosody control for speaking and singing styles",
669-672.
Kochanski, Greg / Shih, Chilin:
"Automated modeling of Chinese intonation in continuous speech",
911-914.
Frid, Johan:
"Prediction of intonation patterns of accented words in a corpus of read Swedish news through pitch contour stylization",
915-918.
Alku, Paavo / Vintturi, Juha / Vilkma, Erkki:
"The use of fundamental frequency raising as a strategy for increasing vocal intensity in soft, normal, and loud phonation",
919-922.
Botinis, Antonis / Fourakis, Marios / Bannert, Robert:
"Prosodic interactions on segmental durations ingreek",
923-926.
Chu, Min / Feng, Yongqiang:
"Study on factors influencing durations of syllables in Mandarin",
927-930.
Gustafson-Capkova, Sofia / Megyesi, Beata:
"A comparative study of pauses in dialogues and read speech",
931-934.
Takamaru, Keiichi / Hiroshige, Makoto / Araki, Kenji / Tochinai, Koji:
"Detecting Japanese local speech rate deceleration in spontaneous conversational speech using a variable threshold",
935-938.
Petersen, Niels Reinholt:
"Modelling fundamental frequency in first post-tonic syllables in danish sentences",
939-942.
Savino, Michelina:
"Non-finality and pre-finality in bari Italian intonation: a preliminary account",
943-946.
Mixdorff, Hansjörg / Jokisch, Oliver:
"Building an integrated prosodic model of German",
947-950.
Ibrahim, Omar A. G. / El-Ramly, S.H. / Abdel-Kader, N.S.:
"A model of F0 contour for arabic affirmative and interrogative sentences",
951-954.
Smith, Caroline L. / Hogan, Lisa A.:
"Variation in final lengthening as a function of topic structure",
955-958.
Herwijnen, Olga van / Terken, Jacques:
"Do speakers realize the prosodic structure they say they do?",
959-962.
Tabain, Marija / Rolland, Guillaume / Savariaux, Christophe:
"Coarticulatory effects at prosodic boundaries: some acoustic results",
963-966.
Barbosa, Plínio A.:
"Generating duration from a cognitively plausible model of rhythm production",
967-970.
Speech Recognition and Understanding: Acoustic Modelling - I
Stuttle, M. N. / Gales, M. J. F.:
"A mixture of Gaussians front end for speech recognition",
675-678.
Zheng, Jing / Butzberger, John / Franco, Horacio / Stolcke, Andreas:
"Improved maximum mutual information estimation training of continuous density HMMs",
679-682.
Perronnin, Florent / Kuhn, Roland / Nguyen, Patrick / Junqua, Jean-Claude:
"Maximum-likelihood training of a bipartite acoustic model for speech recognition",
683-686.
Sarikaya, Ruhi / Hansen, John H. L.:
"Analysis of the root-cepstrum for acoustic modeling and fast decoding in speech recognition",
687-690.
Eide, Ellen:
"Distinctive features for use in an automatic speech recognition system",
1613-1616.
Zhang, Jiyong / Zheng, Fang / Li, Jing / Luo, Chunhua / Zhang, Guoliang:
"Improved context-dependent acoustic modeling for continuous Chinese speech recognition",
1617-1620.
Duchateau, Jacques / Demuynck, Kris / Compernolle, Dirk Van / Wambacq, Patrick:
"Class definition in discriminant feature analysis",
1621-1624.
Segura, Jose C. / Benitez, M. Carmen / Torre, Angel de la / Rubio, Antonio J.:
"Feature extraction from time-frequency matrices for robust speech recognition",
1625-1628.
Peng, Yu / Zuoying, Wang:
"Using spatial correlation information in speech recognition",
1629-1632.
Bauer, Josef G.:
"On the choice of classes in MCE based discriminative HMM-training for speech recognizers used in the telephone environment",
1633-1636.
Keshet, Joseph / Chazan, Dan / Bobrovsky, Ben-Zion:
"Plosive spotting with margin classifiers",
1637-1640.
Brugnara, Fabio:
"Model agglomeration for context-dependent acoustic modeling",
1641-1644.
Levit, M. / Gorin, A. L. / Wright, J. H.:
"Multipass algorithm for acquisition of salient acoustic morphemes",
1645-1648.
Emori, Tadashi / Shinoda, Koichi:
"Rapid vocal tract length normalization using maximum likelihood estimation",
1649-1652.
Okuda, Kozo / Matsui, Tomoko / Nakamura, Satoshi:
"Towards the creation of acoustic models for stressed Japanese speech",
1653-1656.
Baba, Akira / Yoshizawa, Shinichi / Yamada, Miichi / Lee, Akinobu / Shikano, Kiyohiro:
"Elderly acoustic model for large vocabulary continuous speech recognition",
1657-1660.
Zhang, Jin-Song / Zhang, Shu-Wu / Sagisaka, Yoshinori / Nakamura, Satoshi:
"A hybrid approach to enhance task portability of acoustic models in Chinese speech recognition",
1661-1664.
Rodriguez, L. J. / Torres, I. / Varona, A.:
"Evaluation of sublexical and lexical models of acoustic disfluencies for spontaneous speech recognition in Spanish",
1665-1668.
Deviren, Murat / Daoudi, Khalid:
"Structural learning of dynamic Bayesian networks in speech recognition",
1669-1672.
Linguistic Modelling: Language Models
Onishi, Shigehiko / Yamamoto, Hirofumi / Sagisaka, Yoshinori:
"Structured language model for class identification of out-of-vocabulary words arising from multiple wordclasses",
693-696.
Jitsuhiro, Takatoshi / Yamamoto, Hirofumi / Yamada, Setsuo / Sagisaka, Yoshinori:
"New language models using phrase structures extracted from parse trees",
697-700.
Sicilia-Garcia, E. I. / Ming, Ji / Smith, F. J.:
"Triggering individual word domains in n-gram language models",
701-704.
Akiba, Tomoyosi / Itou, Katunobu:
"A structured statistical language model conditioned by arbitrarily abstracted grammatical categories based on GLR parsing",
705-708.
Matsui, Atsushi / Segi, Hiroyuki / Kobayashi, Akio / Imai, Toru / Ando, Akio:
"Speech recognition of broadcast sports news",
709-712.
Mori, Shinsuke / Nishimura, Masafumi / Itoh, Nobuyasu:
"Improvement of a structured language model: arbori-context tree",
713-716.
Kim, Woosung / Khudanpur, Sanjeev / Wu, Jun:
"Smoothing issues in the structured language model",
717-720.
Shen, Xipeng / Xu, Bo:
"The study of the effect of training set on statistical language modeling",
721-724.
Esteve, Yannick / Bechet, Frédéric / Nasr, Alexis / Mori, Renato De:
"Stochastic finite state automata language model triggered by dialogue states",
725-728.
Rayner, Manny / Dowding, John / Hockey, Beth Ann:
"A baseline method for compiling typed unification grammars into context free language models",
729-732.
Whittaker, E. W. D. / Raj, Bhiksha:
"Comparison of width-wise and length-wise language model compression",
733-736.
Siivola, Vesa / Kurimo, Mikko / Lagus, Krista:
"Large vocabulary statistical language modeling for continuous speech recognition in finnish",
737-740.
López-Cózar, R. / Milone, D. H.:
"A new technique based on augmented language models to improve the performance of spoken dialogue systems",
741-744.
Takagi, Kazuyuki / Ozeki, Kazuhiko:
"Pause information for dependency analysis of read Japanese sentences",
1041-1044.
Chen, Berlin / Wang, Hsin-min / Lee, Lin-shan:
"An HMM/n-gram-based linguistic processing approach for Mandarin spoken document retrieval",
1045-1048.
Lin, Yi-Chung / Wang, Huei-Ming:
"Probabilistic concept verification for language understanding in spoken dialogue systems",
1049-1052.
Külekcý, M. Oguzhan / Özkan, Mehmed:
"Turkish word segmentation using morphological analyzer",
1053-1056.
Tarsaku, Pongthai / Sornlertlamvanich, Virach / Thongprasirt, Rachod:
"Thai grapheme-to-phoneme using probabilistic GLR parser",
1057-1060.
Blache, Philippe / Hirst, Daniel:
"Aligning prosody and syntax in property grammars",
1061-1064.
Barkat, Melissa / Vasilescu, Ioana:
"From perceptual designs to linguistic typology and automatic language identification : overview and perspectives",
1065-1068.
Fitt, Susan:
"Morphological approaches for an English pronunciation lexicon",
1069-1072.
Joue, Gina / Carson-Berndsen, Julie:
"An embodiment paradigm for speech recognition systems",
1073-1076.
Xu, Kui / Weng, Fuliang / Meng, Helen M. / Luk, Po Chui:
"Multi-parser architecture for query processing",
1077-1080.
Chen, Yi-Chia / Lin, Yi-Chung:
"Two-stage probabilistic approach to text segmentation",
1081-1084.
Ordelman, Roeland / Hessen, Arjan van / Jong, Franciska de:
"Lexicon optimization for dutch speech recognition in spoken document retrieval",
1085-1088.
Brřndsted, Tom:
"Evaluation of recent speech grammar standardization efforts",
1089-1092.
Speaker Recognition: Identification, Verification and Tracking. Speech Recognition and Understanding: Language Identification
Brungart, Douglas S. / Scott, Kimberly R. / Simpson, Brian D.:
"The influence of vocal effort on human speaker identification",
747-750.
Faltlhauser, Robert / Ruske, Günther:
"Improving speaker recognition using phonetically structured Gaussian mixture models",
751-754.
Sanderson, Conrad / Paliwal, Kuldip K.:
"Information fusion for robust speaker verification",
755-758.
Satoh, Takayuki / Masuko, Takashi / Kobayashi, Takao / Tokuda, Keiichi:
"A robust speaker verification system against imposture using an HMM-based speech synthesis system",
759-762.
Surendran, Arun C.:
"Sequential decisions for faster and more flexible verification",
763-766.
Tsai, Wei-Ho / Chu, Y. C. / Huang, Chao-Shih / Chang, Wen-Whei:
"Background learning of speaker voices for textindependent speaker identification",
767-771.
Tsai, Wei-Ho / Chang, Wen-Whei / Huang, Chao-Shih:
"Explicit exploitation of stochastic characteristics of test utterance for text-independent speaker identification",
771-774.
Wutiwiwatchai, Chai / Achariyakulporn, Varin / Kasuriya, Sawit:
"Improvement of speaker verification for Thai language",
775-778.
Rodríguez-Saeta, Javier / Koechling, Christian / Hernando, Javier:
"Speaker identification for car infotainment applications",
779-782.
Schalk, H. / Reininger, Herbert / Euler, Stephan:
"A system for text dependent speaker verification - field trial evaluation and simulation results",
783-786.
Martin, Alvin F. / Przybocki, Mark A.:
"Speaker recognition in a multi-speaker environment",
787-790.
Ou, Zhijian / Wang, Zuoying:
"A new DP-like speaker clustering algorithm",
791-794.
Sivakumaran, P. / Fortuna, J. / Ariyaeeinia, A. M.:
"On the use of the Bayesian information criterion in multiple speaker detection",
795-798.
Benarousse, Laurent / Geoffrois, Edouard:
"Preliminary experiments on language identification using broadcast news recordings",
799-802.
Kirchhoff, Katrin / Parandekar, Sonia:
"Multi-stream statistical n-gram modeling with application to automatic language identification",
803-806.
Phonetics and Phonology: Prominence and Timing
Streefkerk, Barbertje M. / Pols, Louis C. W. / Bosch, Louis F. M. ten:
"Up to what level can acoustical and textual features predict prominence",
811-814.
Chung, Hyunsong / Huckvale, Mark A.:
"Linguistic factors affecting timing in Korean with application to speech synthesis",
815-818.
Schaeffler, Felix:
"Measuring rhythmic deviation in second language speech",
819-822.
Maddieson, Ian:
"Good timing: place-dependent voice onset time in ejective stops",
823-826.
Speech Synthesis: Concatenation
Francois, Helene / Boeffard, Olivier:
"Design of an optimal continuous speech database for text-to-speech synthesis considered as a set covering problem",
829-832.
Vosnidis, Christos / Digalakis, Vassilis:
"Use of clustering information for coarticulation compensation in speech synthesis by word concatenation",
833-836.
Founda, Maria / Tambouratzis, George / Chalamandaris, Aimilios / Carayannis, George:
"Reducing spectral mismatches in concatenative speech synthesis via systematic database enrichment",
837-840.
Ferencz, Attila / Choi, Sung-Woo / Song, Ho-Eun / Koo, Myoung-Wan:
"Hansori 2001 - corpus-based implementation of the Korean hansori text-to-speech synthesizer",
841-844.
Barry, William / Nielsen, Claus / Andersen, Ove:
"Must diphone synthesis be so unnatural?",
975-978.
Syrdal, Ann K.:
"Phonetic effects on listener detection of vowel concatenation",
979-982.
Boeffard, Olivier:
"Variable-length acoustic units inference for text-to-speech synthesis",
983-986.
Bulyko, Ivan / Ostendorf, Mari:
"Unit selection for speech synthesis using splicing costs with weighted finite state transducers",
987-990.
Law, K. M. / Lee, Tan / Lau, Wai:
"Cantonese text-to-speech synthesis using sub-syllable units",
991-994.
Speech Recognition and Understanding: Noise Robustness
Wet, Febe de / Cranen, Bert / Veth, Johan de / Boves, Loe:
"A comparison of LPC and FFT-based acoustic features for noise robust ASR",
865-868.
Yamada, Miichi / Baba, Akira / Yoshizawa, Shinichi / Mera, Yuichiro / Lee, Akinobu / Saruwatari, Hiroshi / Shikano, Kiyohiro:
"Unsupervised noisy environment adaptation algorithm using MLLR and speaker selection",
869-872.
Tufekci, Zekeriya / Gowdy, John N. / Gurbuz, Sabri / Patterson, E.:
"Applying parallel model compensation with mel-frequency discrete wavelet coefficients for noise-robust speech recognition",
873-876.
Hwang, Tai-Hwei / Yuo, Kuo-Hwei / Wang, Hsiao-Chuan:
"Linear interpolation of cepstral variance for noisy speech recognition",
877-880.
Matsumoto, Hiroshi / Shimizu, Akihiko / Yamamoto, Kazumasa:
"Evaluation of a generalized dynamic cepstrum in distant speech recognition",
881-884.
Martin, Arnaud / Damnati, Géraldine / Mauuary, Laurent:
"Robust speech/non-speech detection using LDA applied to MFCC for continuous speech recognition",
885-888.
Trentin, Edmondo / Gori, Marco:
"Toward noise-tolerant acoustic models",
889-892.
Evans, Nicholas W. D. / Mason, John S.:
"Noise estimation without explicit speech, non-speech detection: a comparison of mean, modal and median based approaches",
893-896.
Chengalvarayan, Rathi:
"Evaluation of front-end features and noise compensation methods for robust Mandarin speech recognition",
897-900.
Frey, Brendan J. / Deng, Li / Acero, Alex / Kristjansson, Trausti:
"ALGONQUIN: iterating laplace's method to remove multiple types of acoustic distortion for robust speech recognition",
901-904.
Hansen, John H. L. / Sarikaya, Ruhi / Yapanel, Umit / Pellom, Bryan:
"Robust speech recognition in noise: an evaluation using the SPINE corpus",
905-908.
Siu, Manhung / Chan, Yu-Chung:
"Robust speech recognition against packet loss",
1095-1098.
Naito, Masaki / Kuroiwa, Shingo / Kato, Tsuneo / Shimizu, Tohru / Higuchi, Norio:
"Rapid CODEC adaptation for cellular phone speech recognition",
1099-1102.
Gallardo-Antolin, Ascension / Pelaez-Moreno, Carmen / Diaz-de-Maria, Fernando:
"A robust front-end for ASR over IP snd GSM networks: an integrated scenario",
1103-1106.
Renevey, Philippe / Vetter, Rolf / Krauss, Jens:
"Robust speech recognition using missing feature theory and vector quantization",
1107-1110.
Ming, Ji / Jancovic, Peter / Hanna, Philip / Stewart, Darryl:
"Modeling the mixtures of known noise and unknown unexpected noise for robust speech recognition",
1111-1114.
Kawamura, Takayoshi / Takeda, Kazuya / Itakura, Fumitada:
"Robust speech recognition based on selective use of missing frequency band HMMs",
1115-1118.
Masuda-Katsuse, Ikuyo:
"A new method for speech recognition in the presence of non-stationary, unpredictable and high-level noise",
1119-1122.
Kotnik, Bojan / Kacic, Zdravko / Horvat, Bogomir:
"A computational efficient real time noise robust speech recognition based on improved spectral subtraction method",
1123-1126.
Vlaj, Damjan / Kacic, Zdravko / Horvat, Bogomir:
"The use of noisy frame elimination and frequency spectrum magnitude reduction in noise robust speech recognition",
1127-1130.
Chien, Jen-Tzung:
"Combined linear regression adaptation and Bayesian predictive classification for robust speech recognition",
1131-1134.
Hilger, Florian / Ney, Hermann:
"Quantile based histogram equalization for noise robust speech recognition",
1135-1138.
Yao, Kaisheng / Paliwal, Kuldip K. / Nakamura, Satoshi:
"Sequential noise compensation by a sequential kullback proximal algorithm",
1139-1142.
Signal Analysis: Microphone Arrays & Source Localisation
Koutras, Athanasios / Dermatas, Evangelos / Kokkinakis, George:
"Blind speech separation of moving speakers using hybrid neural networks",
997-1000.
Herbordt, W. / Buchner, H. / Kellermann, W.:
"Computationally efficient frequency-domain combination of acoustic echo cancellation and robust adaptive beamforming",
1001-1004.
Seltzer, Michael L. / Raj, Bhiksha:
"Calibration of microphone arrays for improved speech recognition",
1005-1008.
Koutras, Athanasios / Dermatas, Evangelos / Kokkinakis, George:
"Improving simultaneous speech recognition in real room environments using overdetermined blind source separation",
1009-1012.
Asano, Futoshi / Goto, Masataka / Itou, Katunobu / Asoh, Hideki:
"Real-time sound source localization and separation system and its application to automatic speech recognition",
1013-1016.
Speech Recognition and Understanding: Audio-Visual Processing
Lee, Joohun / Kim, JinYoung:
"An efficient lipreading method using the symmetry of lip",
1019-1022.
Heckmann, Martin / Wild, Thorsten / Berthommier, Frédéric / Kroschel, Kristian:
"Comparing audio- and a-posteriori-probability-based stream confidence measures for audio-visual speech recognition",
1023-1026.
Potamianos, Gerasimos / Neti, Chalapathy / Iyengar, Giridharan / Helmuth, Eric:
"Large-vocabulary audio-visual speech recognition by machines and humans",
1027-1030.
Daubias, Philippe / Deleglise, Paul:
"Evaluation of an automatically obtained shape and appearance model for automatic audio visual speech recognition",
1031-1034.
Pelachaud, C. / Magno-Caldognetto, E. / Zmarich, C. / Cosi, Piero:
"An approach to an Italian talking head",
1035-1038.
SIGshow (Special Session)
Eriksson, Anders / Bloothooft, Gerrit:
"Education on the web: launch of three new websites".
Hirst, Daniel / Bel, Bernard / Campbell, Nick:
"SProSIG: a special interest group on speech prosody",
Bonastre, Jean-François / Magrin-Chagnolleau, Ivan / Euler, Stephan / Pellegrino, François / André-Obrecht, Régine / Mason, John S. / Bimbot, Frédéric:
"SPeaker and language characterization (spLC): a special interest group (SIG) of ISCA",
1145-1148.
Campbell, Nick / Hess, Wolfgang / Möbius, Bernd / Santen, Jan van:
"The ISCA special interest group on speech synthesis",
1149-1152.
Massaro, Dominic W.:
"Auditory visual speech processing",
1153-1156.
Bimbot, Frédéric / Bonastre, Jean-Francois:
"The specificity of French speech processing (no proceedings paper)",
paper 1344.
Dybkjćr, Laila:
"SIGdial - special interest group on discourse and dialogue",
1345-1348.
Delcloque, Philippe:
"Integrating speech technology in language learning: an overview of the activities of inSTIL",
1349-1352.
Nadeu, Climent / Ó’Cróinín, Donncha / Petek, Bojan / Sarasola, Kepa / Williams, Briony:
"ISCA SALTMIL SIG: speech and language technology for minority languages",
1353-1556.
Speech Synthesis: Prosody
Chen, Weijun / Lin, Fuzong / Li, Jianmin / Zhang, Bo:
"Training prosodic phrasing rules for Chinese TTS systems",
1159-1162.
Heggtveit, Per Olav / Natvig, Jon Emil:
"Intonation modelling with a lexicon of natural F0 contours",
1163-1166.
Silverman, Kim E. A. / Bellegarda, Jerime R. / Lenzo, Kevin A.:
"Smooth contour estimation in data-driven pitch modelling",
1167-1170.
Saito, Takashi / Sakamoto, Masaharu:
"Generating F0 contours by statistical manipulation of natural F0 shapes",
1171-1174.
Hirschberg, Julia / Rambow, Owen:
"Learning prosodic features using a tree representation",
1175-1178.
Applications: Multimodal Applications
Gurbuz, Sabri / Patterson, Eric K. / Tufekci, Zekeriya / Gowdy, John N.:
"Lip-reading from parametric lip contours for audio- visual speech recognition",
1181-1184.
Lucey, Simon / Sridharan, Sridha / Chandran, Vinod:
"An investigation of HMM classifier combination strategies for improved audio-visual speech recognition",
1185-1188.
Bernsen, Niels Ole / Dybkjćr, Laila:
"Combining multi-party speech and text exchanges over the internet",
1189-1192.
Nakadai, Kazuhiro / Hidai, Ken-ichi / Okuno, Hiroshi G. / Kitano, Hiroaki:
"Real-time multiple speaker tracking by multi-modal integration for mobile robots",
1193-1196.
Nitta, Tsuneo / Katsurada, Kouichi / Yamada, Hirobumi / Nakamura, Yusaku / Kobayashi, Satoshi:
"XISL: an attempt to separate multimodal interactions from XML contents",
1197-1200.
Speech Recognition and Understanding: Speaker Adaptation
Gunawardana, Asela / Byrne, William:
"Discriminative speaker adaptation with conditional maximum likelihood linear regression",
1203-1206.
Kenny, Patrick / Boulianne, Gilles / Dumouchel, Pierre:
"What is the best type of prior distribution for EMAP speaker adaptation?",
1207-1210.
Kim, Yoon:
"Maximum-likelihood affine cepstral filtering (MLACF) technique for speaker normalization",
1211-1214.
Zhou, Bowen / Hansen, John H. L.:
"A novel algorithm for rapid speaker adaptation based on structural maximum likelihood eigenspace mapping",
1215-1218.
Yoshizawa, Shinichi / Baba, Akira / Matsunami, Kanako / Mera, Yuichirou / Yamada, Miichi / Lee, Akinobu / Shikano, Kiyohiro:
"Evaluation on unsupervised speaker adaptation based on sufficient HMM statictics of selected speakers",
1219-1222.
Speech Recognition and Understanding: Adaptation
Lei, Jia / Bo, Xu:
"A novel target-driven MLLR adaptation algorithm with multi-layer structure",
1225-1228.
Wallhoff, Frank / Willett, Daniel / Rigoll, Gerhard:
"Scaled likelihood linear regression for hidden Markov model adaptation",
1229-1232.
Myrvoll, Tor Andre / Paliwal, Kuldip K. / Svendsen, Torbjřrn:
"Fast adaptation using constrained affine transformations with hierarchical priors",
1233-1236.
Liu, Xiaoxing / Yuan, Baosheng / Yan, Yonghong:
"A context adaptation approach for building context dependent models in LVCSR",
1237-1240.
Lefevre, Fabrice / Gauvain, Jean-Luc / Lamel, Lori:
"Improving genericity for task-independent speech recognition",
1241-1244.
Matrouf, Driss / Bellot, Olivier / Nocera, Pascal / Linares, Georges / Bonastre, Jean-Francois:
"A posteriori and a priori transformations for speaker adaptation in large vocabulary speech recognition systems",
1245-1248.
Purnell, Darryl W. / Botha, Elizabeth C.:
"Bayesian methods for HMM speech recognition with limited training data",
1249-1252.
Wong, Kwok-Man / Mak, Brian:
"Rapid speaker adaptation using MLLR and subspace regression classes",
1253-1256.
Yoma, Nestor Becerra / Silva, Jorge:
"Speaker adaptation of output probabilities and state duration distributions for speech recognition",
1257-1260.
Wu, Jian / Chang, Eric:
"Cohorts based custom models for rapid speaker and dialect adaptation",
1261-1264.
Vasilache, Marcel / Viikki, Olli:
"Speaker adaptation of quantized parameter HMMs",
1265-1268.
Tsao, Yu / Lee, Shang-Ming / Chou, Fu-Chiang / Lee, Lin-Shan:
"Segmental eigenvoice for rapid speaker adaptation",
1269-1272.
Warakagoda, Narada D. / Johnsen, Magne H.:
"Speaker adaptation in an ASR system based on nonlinear dynamical systems",
1273-1276.
Dialogue Systems: Project Descriptions
Córdoba, R. / San-Segundo, R. / Montero, J. M. / Colás, J. / Ferreiros, J. / Macías-Guarasa, J. / Pardo, Juan M.:
"An interactive directory assistance service for Spanish with large-vocabulary recognition",
1279-1282.
Xu, Yunbiao / Araki, Masahiro / Niimi, Yasuhisa:
"A multilingual-supporting dialog system using a common dialog controller",
1283-1286.
Nouza, Tomas / Nouza, Jan:
"Graphic platform for designing and developing practical voice interaction systems",
1287-1290.
Besacier, Laurent / Blanchon, H. / Fouquet, Y. / Guilbaud, J. P. / Helme, S. / Mazenot, S. / Moraru, D. / Vaufreydaz, D.:
"Speech translation for French in the NESPOLE! european project",
1291-1294.
Hickey, Marianne / Brittan, Paul St John:
"Lessons from the development of a conversational interface",
1295-1298.
Hirschberg, Julia / Bacchiani, Michiel / Hindle, Don / Isenhour, Phil / Rosenberg, Aaron / Stark, Litza / Stead, Larry / Whittaker, Steve / Zamchick, Gary:
"SCANMail: browsing and searching speech data by content",
1299-1302.
Lo, Wai-Kit / Schone, Patrick / Meng, Helen M.:
"Multi-scale retrieval in MEI: an English-Chinese translingual speech retrieval system",
1303-1306.
Chien, Shih-Chieh / Chang, Sen-Chia:
"Compact word graph in spoken dialogue system",
1307-1310.
Sasajima, Munehiko / Yano, Takebhide / Shimomori, Taishi / Uehara, Tatsuya:
"MINOS-II: a prototype car navigation system with mixed initiative turn taking dialogue",
1311-1314.
Kiriyama, Shinya / Hirose, Keikichi / Minematsu, Nobuaki:
"Use of topic knowledge in spoken dialogue information retrieval system for academic documents",
1315-1318.
Komatani, Kazunori / Tanaka, Katsuaki / Kashima, Hiroaki / Kawahara, Tatsuya:
"Domain-independent spoken dialogue platform using key-phrase spotting based on combined language model",
1319-1322.
Durston, Peter J. / Farrell, Mark / Attwater, David / Allen, James / Kuo, Hong-Kwang Jeff / Afify, Mohamed / Fosler-Lussier, Eric / Lee, Chin-Hui:
"OASIS natural language call steering trial",
1323-1326.
Azzini, Ivano / Falavigna, Daniele / Gretter, Roberto / Lanzola, Giordano / Orlandi, Marco:
"First steps toward an adaptive spoken dialogue system in medical domain",
1327-1330.
Nakano, Mikio / Minami, Yasuhiro / Seneff, Stephanie / Hazen, Timothy J. / Cyphers, D. Scott / Glass, James / Polifroni, Joseph / Zue, Victor:
"Mokusei: a telephone-based Japanese conversational system in the weather domain",
1331-1334.
Glass, James / Weinstein, Eugene:
"Speechbuilder: facilitating spoken dialogue system development",
1335-1338.
Rahim, M. / Fabbrizio, Giuseppe Di / Kamm, C. / Walker, Marilyn / Pokrovsky, A. / Ruscitti, P. / Levin, E. / Lee, S. / Syrdal, Ann K. / Schlosser, K.:
"Voice-IF: a mixed-initiative spoken dialogue system for AT&t conference services",
1339-1342.
Wahlster, Wolfgang / Reithinger, Norbert / Blocher, Anselm:
"Smartkom: multimodal communication with a life- like character",
1547-1550.
Meng, Helen M. / Chan, Shuk Fong / Wong, Yee Fong / Chan, Cheong Chat / Wong, Yiu Wing / Fung, Tien Ying / Tsui, Wai Ching / Chen, Ke / Wang, Lan / Wu, Ting Yao / Li, Xiaolong / Lee, Tan / Choi, Wing Nin / Ching, P. C. / Chi, Huisheng:
"ISIS: a learning system with combined interaction and delegation dialogs",
1551-1554.
Wang, Ye-Yi:
"Robust language understanding in mipad",
1555-1558.
Lemon, Oliver / Bracy, Anne / Gruenstein, Alexander / Peters, Stanley:
"The WITAS multi-modal dialogue system I",
1559-1562.
Shriver, Stefanie / Rosenfeld, Roni / Zhu, Xiaojin / Toth, Arthur / Rudnicky, Alexander I. / Flueckiger, Markus:
"Universalizing speech: notes from the USI project",
1563-1566.
Dialogue Systems: Resources
Shriberg, Elizabeth / Stolcke, Andreas / Baron, Don:
"Observations on overlap: findings and implications for automatic processing of multi-party conversation",
1359-1362.
Beckham, Jennifer L. / Fabbrizio, Giuseppe Di / Klarlund, Nils:
"Towards SMIL as a foundation for multimodal, multimedia applications",
1363-1366.
Kipp, Michael:
"ANVIL - a generic annotation tool for multimodal dialogue",
1367-1370.
Walker, Marilyn / Aberdeen, J. / Boland, J. / Bratt, E. / Garofolo, J. / Hirschman, Lynette / Le, A. / Lee, S. / Narayanan, Shrikanth / Papineni, K. / Pellom, Bryan / Polifroni, Joseph / Potamianos, Alexandros / Prabhu, P. / Rudnicky, Alexander I. / Sanders, G. / Seneff, Stephanie / Stallard, D. / Whittaker, Steve:
"DARPA communicator dialog travel planning systems: the june 2000 data collection",
1371-1374.
Speaker Recognition: Features and Transforms
Huang, Chao / Chen, Tao / Li, Stan / Chang, Eric / Zhou, Jianlai:
"Analysis of speaker variability",
1377-1380.
Nishida, M. / Ariki, Y.:
"Speaker recognition by separating phonetic space and speaker space",
1381-1384.
Wang, Nick J.-C. / Tsai, Wei-Ho / Lee, Lin-Shan:
"Eigen-MLLR coefficients as new feature parameters for speaker identification",
1385-1388.
Navratil, Jiri / Chaudhari, Upendra V. / Ramaswamy, Ganesh N.:
"Speaker verification using target and background dependent linear transforms and multi-system fusion",
1389-1392.
Speech Perception: Prosody
Caspers, Johanneke:
"Testing the perceptual relevance of syntactic completion and melodic configuration for turn-taking in dutch",
1395-1398.
Rietveld, Toni / Vermillion, Patricia:
"Cues for perceived pitch register",
1399-1402.
Chen, Aoju / Rietveld, Toni / Gussenhoven, Carlos:
"Language-specific effects of pitch range on the perception of universal intonational meaning",
1403-1406.
Janse, Esther:
"Comparing word-level intelligibility after linear vs. non-linear time-compression",
1407-1410.
Speech Production: Miscellaneous
Koopmans-van Beinum, Florien J. / Clement, Chris J. / Dikkenberg-Pot, Ineke Van den:
"AMSTIVOC (AMsterdam system for transcription of infant VOCalizations) applied to utterances of deaf and normally hearing infants",
1471-1474.
Engwall, Olov:
"Using linguopalatal contact patterns to tune a 3d tongue model",
1475-1478.
Kaburagi, Tokihiko / Honda, Masaaki:
"Electromagnetic articulograph (EMA) based on a nonparametric representation of tthe magnetic field",
1479-1482.
Teixeira, A. / Vaz, F.:
"European portuguese nasal vowels: an EMMA study",
1483-1486.
Fuchs, Susanne / Perrier, Pascal / Mooshammer, Christine:
"The role of the palate in tongue kinematics: an experimental assessment in v sequences from EPG and EMMA data",
1487-1490.
Aylett, Matthew P.:
"Modelling care of articulation with HMMs is dangerous",
1491-1494.
Murphy, Peter J.:
"Spectral tilt as a perturbation-free measurement of noise levels in voice signals",
1495-1498.
Schoentgen, Jean:
"Estimation of the modulation frequency and modulation depth of the fundamental frequency owing to vocal micro-tremor of the voice source signal",
1499-1502.
Dinther, Ralph van / Veldhuis, Raymond N.J. / Kohlrausch, Armin:
"The perceptual relevance of glottal-pulse parameter variations",
1503-1506.
Ogner, Marcel / Kacic, Zdravko:
"Speaker normalization based on test to reference speaker mapping",
1507-1510.
Pitermann, Michel / Munhall, Kevin G.:
"A face-to-muscle inversion of a biomechanical face model for audiovisual and motor control research",
1511-1514.
South, Allan J:
"A model of vowel production under positive pressure breathing",
1515-1518.
Podhorski, Adam / Czepulonis, Marek:
"Helium speech normalisation by codebook mapping",
1519-1523.
Existing and Future Corpora: Next Generation Speech Resources (Special Session)
Campbell, Nick:
"Building a corpus of natural speech - and tools for the processing of expressive speech",
1525-1528.
Broeder, Daan / Brugman, Hennie / Wittenburg, Peter:
"Aspects of modern multi-modal/multi-media corpora exploitation environments",
1529-1532.
Bigbee, Tony / Loehr, Dan / Harper, Lisa:
"Emerging requirements for multi-modal annotation and analysis tools",
1533-1536.
Altosaar, Toomas / Karjalainen, Matti / Vainio, Martti:
"Three-dimensional modelling of speech corpora: added value through visualisation",
1537-1540.
Türk, Ulrich:
"The technical processing in smartkom data collection: a case study",
1541-1544.
Signal Analysis: Speech Processing in Car Environments
Matassoni, M. / Omologo, M. / Svaizer, P.:
"Use of real and contaminated speech for training of a hands-free in-car speech recognizer",
1569-1572.
Plucienkowski, Jay P. / Hansen, John H. L. / Angkititrakul, Pongtep:
"Combined front-end signal processing for in-vehicle speech systems",
1573-1576.
Selouani, Sid-Ahmed / Tolba, Hesham / O’Shaughnessy, Douglas:
"Robust automatic speech recognition in low-SNR car environments by the application of a connectionist subspace-based approach to the melbased cepstral coefficients",
1577-1580.
Korthauer, Andreas:
"Recognition of spelled city names in automotive environments",
1581-1584.
Lleida, Eduardo / Masgrau, Enrique / Ortega, Alfonso:
"Acoustic echo control and noise reduction for cabin car communication",
1585-1588.
Speech Recognition and Understanding: Finite State Transducers for ASR
Hazen, Timothy J. / Hetherington, I. Lee / Park, Alex:
"FST-based recognition techniques for multi-lingual and multi-domain spontaneous speech",
1591-1594.
Boulianne, Gilles / Ouellet, Pierre / Dumouchel, Pierre:
"A transducer approach to word graph generation",
1595-1598.
Hetherington, I. Lee:
"An efficient implementation of phonological rules using finite-state transducers",
1599-1602.
Mohri, Mehryar / Riley, Michael:
"A weight pushing algorithm for large vocabulary speech recognition",
1603-1606.
Seward, Alexander:
"Transducer optimizations for tight-coupled decoding",
1607-1612.
Resources, Assessment and Standards: Assessment Tools & Methodology
Wijngaarden, Sander J. van / Smeele, Paula M.T. / Steeneken, Herman J.M.:
"A new method for testing communication efficiency and user acceptability of speech communication channels",
1675-1678.
Cucchiarini, Catia / Binnenpoorte, Diana / Goddijn, Simo:
"Phonetic transcriptions in the spoken dutch corpus: how to combine efficiency and good transcription quality",
1679-1682.
Hutchinson, Ben:
"A functional approach to speech recognition evaluation",
1683-1686.
Möller, Sebastian / Berger, Jens:
"Instrumental derivation of equipment impairment factors for describing telephone speech codec degradations",
1687-1690.
Lee, Akinobu / Kawahara, Tatsuya / Shikano, Kiyohiro:
"Julius --- an open source real-time large vocabulary recognition engine",
1691-1694.
Toledano, Doroteo Torre / Gómez, Luis A. Hernández:
"Local refinement of phonetic boundaries: a general framework and its application using different transition models",
1695-1698.
Ludwig, Thorsten / Heute, Ulrich:
"Detection of digital transmission systems for voice quality measurements",
1699-1702.
Lewis, Eric / Tatham, Mark:
"Automatic segmentation of recorded speech into syllables for speech synthesis",
1703-1706.
Teixeira, Joăo Paulo / Freitas, Diamantino / Braga, Daniela / Barros, Maria Joăo / Latsch, Vagner:
"Phonetic events from the labeling the european portuguese database for speech synthesis, FEUP/IPBDB",
1707-1710.
Nefti, Samir / Boeffard, Olivier:
"Acoustical and topological experiments for an HMM-based speech segmentation system",
1711-1714.
Zhou, Qiru / Zheng, Jinsong / Lee, Chin-Hui:
"TclBLASR: an automatic speech recognition extension for tcl",
1715-1718.
Existing and Future Corpora: Automated Analysis of Speech Resources (Special Session)
Kessens, Judith M. / Strik, Helmer:
"Lower WERs do not guarantee better transcriptions",
1721-1724.
Chang, Shuangyu / Greenberg, Steven / Wester, Mirjam:
"An elitist approach to articulatory-acoustic feature classification",
1725-1728.
Wester, Mirjam / Greenberg, Steven / Chang, Shuangyu:
"A dutch treatment of an elitist approach to articulatory-acoustic feature classification",
1729-1732.
Dialogue Systems: Dialogue Systems and Generation
Galley, Michel / Fosler-Lussier, Eric / Potamianos, Alexandros:
"Hybrid natural language generation for spoken dialogue systems",
1735-1738.
Cook, Nicholas J. / Benest, Ian D.:
"The generation of speech for a search guide",
1739-1742.
Araki, Masahiro / Ono, Tasuku / Ueda, Kiyoshi / Nishimoto, Takuya / Niimi, Yasuhisa:
"An automatic dialogue system generator from the internet information contents",
1743-1746.
Rogati, Monica / Walker, Marilyn / Rambow, Owen:
"Training a sentence planner for spoken dialog: the impact of syntactic and planning features",
1747-1750.
Speaker Recognition: Alternative Trends in Verification
Vivaracho, Carlos E. / Ortega-García, Javier / Alonso, Luis / Moro, Quiliano I.:
"A comparative study of MLP-based artificial neural networks in text-independent speaker verification against GMM-based systems",
1753-1757.
Fine, Shai / Navratil, Jiri / Gopinath, Ramesh A.:
"Enhancing GMM scores using SVM "hints"",
1757-1760.
Kharroubi, Jamal / Petrovska-Delacretaz, Dijana / Chollet, Gerard:
"Combining GMM's with suport vector machines for text-independent speaker verification",
1761-1764.
Gu, Yong / Thomas, Trevor:
"A text-independent speaker verification system using support vector machines classifier",
1765-1768.
Stapert, Robert P. / Mason, John S.:
"A segmental mixture model for speaker recognition",
2509-2512.
Blouet, Raphael / Bimbot, Frédéric:
"Tree based score computation for speaker verification",
2513-2516.
Andrews, Walter D. / Kohler, Mary A. / Campbell, Joseph P.:
"Phonetic speaker recognition",
2517-2520.
Doddington, George:
"Speaker recognition based on idiolectal differences between speakers",
2521-2524.
Speech Recognition and Understanding: Speech Understanding
Hori, Chiori / Furui, Sadaoki:
"Advances in automatic speech summarization",
1771-1774.
Hacioglu, Kadri / Ward, Wayne:
"A word graph interface for a flexible concept based speech understanding framework",
1775-1778.
Knight, Sylvia / Gorrell, Genevieve / Rayner, Manny / Milward, David / Koeling, Rob / Lewin, Ian:
"Comparing grammar-based and robust approaches to speech understanding: a case study",
1779-1782.
Abdou, Sherif / Scordilis, Michael:
"Integrating multiple knowledge sources for improved speech understanding",
1783-1786.
Speech Recognition and Understanding: Algorithms and Architectures
Litichever, Zeev / Chazan, Dan:
"Classification of transition sounds with application to automatic speech recognition",
1789-1792.
Faizakov, Avi / Cohen, Arnon / Vaich, Tzur:
"Gaussian subtraction (GS) algorithms for word spotting in continuous speech",
1793-1796.
Shire, Michael L.:
"Relating frame accuracy with word error in hybrid ANN-HMM ASR",
1797-1800.
Zhang, Guoliang / Zheng, Fang / Wu, Wenhu:
"A two-layer lexical tree based beam search in continuous Chinese speech recognition",
1801-1804.
Itoh, Yoshiaki / Tanaka, Kazuyo:
"Automatic labeling and digesting for lecture speech utilizing repeated speech by shift CDP",
1805-1808.
Hori, Takaaki / Noda, Yoshiaki / Matsunaga, Shoichi:
"Improved phoneme-history-dependent search for large-vocabulary continuous-speech recognition",
1809-1813.
Psutka, Josef / Müller, Ludek / Psutka, Josef V.:
"Comparison of MFCC and PLP parameterizations in the speaker independent continuous speech recognition task",
1813-1816.
Pusateri, Ernest / Thong, J.M. Van:
"N-best list generation using word and phoneme recognition fusion",
1817-1820.
Ahn, Dong-Hoon / Chung, Minhwa:
"A one pass semi-dynamic network decoder based on language model network",
1821-1824.
Macherey, W. / Keysers, D. / Dahmen, J. / Ney, Hermann:
"Improving automatic speech recognition using tangent distance",
1825-1828.
Chotimongkol, Ananlada / Rudnicky, Alexander I.:
"N-best speech hypotheses reordering using linear regression",
1829-1832.
Deligne, Sabine / Eide, Ellen / Gopinath, Ramesh / Kanevsky, Dimitri / Maison, Benoit / Olsen, Peder / Printz, Harry / Sedivy, Jan:
"Low-resource hidden Markov model speech recognition",
1833-1836.
Hirsch, H. G. / Hellwig, K. / Dobler, S.:
"Speech recognition at multiple sampling rates",
1837-1840.
Shimodaira, Hiroshi / Noma, Ken-ichi / Nakai, Mitsuru / Sagayama, Shigeki:
"Support vector machine with dynamic time-alignment kernel for speech recognition",
1841-1844.
Srinivasamurthy, Naveen / Ortega, Antonio / Narayanan, Shrikanth:
"Efficient scalable speech compression for scalable speech recognition",
1845-1848.
Signal Analysis: Speech Enhancement and Noise Processing
Stadermann, J. / Stahl, V. / Rose, G.:
"Voice activity detection in noisy environments",
1851-1854.
Sheikhzadeh, Hamid / Abutalebi, Hamid Reza:
"An improved wavelet-based speech enhancement system",
1855-1858.
Ramabadran, Tenkasi / Meunier, Jeff / Jasiuk, Mark / Kushner, Bill:
"Enhancing distributed speech recognition with back- end speech reconstruction",
1859-1862.
Tihelka, Jiri / Sovka, Pavel:
"Implementation effective one-channel noise reduction system",
1863-1866.
Kim, Hyoung-Gook / Obermayer, Klaus / Bode, Mathias / Ruwisch, Dietmar:
"Efficient speech enhancement by diffusive gain factors (DGF)",
1867-1870.
Mahé, Gaël / Gilloire, André:
"Correction of the voice timbre distortions on telephone network",
1871-1874.
Lee, Yunjung / Lee, Joohun / Lee, Ki Yong / Shirai, Katsuhiko:
"Speech enhancement based on IMM with NPHMM",
1875-1878.
Fujimoto, M. / Ariki, Y.:
"Speech recognition under musical environments using kalman filter and iterative MLLR adaptation",
1879-1882.
Vetter, Rolf / Renevey, Philippe / Krauss, Jens:
"Dual channel speech enhancement using coherence function and MDL-based subspace approach in bark domain",
1883-1886.
Renevey, Philippe / Drygajlo, Andrzej:
"Entropy based voice activity detection in very noisy conditions",
1887-1890.
Karnebäck, Stefan:
"Discrimination between speech and music based on a low frequency modulation feature",
1891-1894.
Cheng, Yiou-Wen / Lee, Lin-Shan:
"Credibility proof for speech content and speaker verification by fragile watermarking with consecutive frame-based processing",
1895-1898.
Potamitis, I. / Fakotakis, Nikos / Kokkinakis, George:
"Map estimation for on-line noise compensation of time trajectories of spectral coefficients",
1899-1902.
Attias, Hagai / Deng, Li / Acero, Alex / Platt, John C.:
"A new method for speech denoising and robust speech recognition using probabilistic models for clean speech and for noise",
1903-1906.
Speech Synthesis: Grapheme-to-Phoneme Conversion
Kienappel, Anne K. / Kneser, Reinhard:
"Designing very compact decision trees for grapheme-to-phoneme transcription",
1911-1914.
Mana, Franco / Massimino, Paolo / Pacchiotti, Alberto:
"Using machine learning techniques for grapheme to phoneme transcription",
1915-1918.
Llitjos, Ariadna Font / Black, Alan W.:
"Knowledge of language origin improves pronunciation accuracy of proper names",
1919-1922.
Boula de Mareüil, Philippe / Floricic, Franck:
"On the pronunciation of acronyms in French and in Italian",
1923-1926.
Signal Analysis: Speech Enhancement
Shin, Vladimir I. / Kim, Doh-Suk / Kim, Moo Young / Kim, Jeongsu:
"Enhancement of noisy speech by using improved global soft decision",
1929-1932.
Cohen, Israel:
"Enhancement of speech using bark-scaled wavelet packet decomposition",
1933-1936.
Bahoura, Mohammed / Rouat, Jean:
"A new approach for wavelet speech enhancement",
1937-1940.
Yoon, Sukhyun / Yoo, Chang D.:
"Speech/noise-dominant decision for speech enhancement",
1941-1944.
Speech Recognition and Understanding: Discriminative Training
Wang, Fan / Zheng, Fang / Wu, Wenhu:
"An MCE based classification tree using hierarchical feature-weighting in speech recognition",
1947-1950.
Zhou, Jianlai / Chang, Eric / Huang, Chao:
"Selective MCE training strategy in Mandarin speech recognition",
1951-1954.
Wu, Chung-Hsien / Yan, Gwo-Lang:
"Discriminative disfluency modeling for spontaneous speech recognition",
1955-1958.
Hung, Jeih-weih / Wang, Hsin-min / Lee, Lin-shan:
"Comparative analysis for data-driven temporal filters obtained via principal component analysis (PCA) and linear discriminant analysis (LDA) in speech recognition",
1959-1962.
Speech Coding: Advances in Speech Coding
Heikkinen, Ari / Ruoppila, Vesa T. / Pietilä, Samuli:
"Coding method for successive pitch periods",
1965-1968.
Nurminen, Jani / Heikkinen, Ari / Saarinen, Jukka:
"Objective evaluation of methods for quantization of variable-dimension spectral vectors in WI speech coding",
1969-1972.
Pobloth, Harald / Kleijn, W. Bastiaan:
"Squared error as a measure of phase distortion",
1973-1976.
Faundez-Zanuy, Marcos:
"Non-linear predictive vector quantization of speech",
1977-1980.
Katugampala, Nilantha / Kondoz, Ahmet M.:
"A variable rate hybrid coder based on a synchronized harmonic excitation",
1981-1984.
Ho, M. S. / Molyneux, D. J. / Cheetham, B. M. G.:
"A hybrid sub-band sinusoidal coding scheme",
1985-1988.
Lukasiak, J. / Burnett, I. S. / Ritz, C. H.:
"Low rate speech coding incorporating simultaneously masked spectrally weighted linear prediction",
1989-1992.
Najaf-Zadeh, Hossein / Kabal, Peter:
"Narrowband perceptual audio coding: enhancements for speech",
1993-1996.
Bessette, B. / Lefebvre, Roch / Salami, R. / Jelinek, M. / Vainio, J. / Rotola-Pukkila, J. / Mikkola, H. / Jarvinen, K.:
"Techniques for high-quality ACELP coding of wideband speech",
1997-2000.
Pujalte, Sílvia / Moreno, Asunción:
"Wideband ACELP at 16 kb/s with multi-band excitation",
2001-2004.
Lee, Seung Won / Bae, Keun Sung:
"Wideband speech coding algorithm with application of discrete wavelet transform to upper band",
2005-2008.
Satheesh, S. / Sreenivas, T. V.:
"A switched DPCM/subband coder for pre-echo reduction",
2009-2012.
Etemoglu, Cagri Özgenc / Cuperman, Vladimir:
"A generalized multistage VQ approach for spectral magnitude quantization",
2013-2016.
Jung, Sung-Kyo / Park, Young-Cheol / Youn, Sung-Wan / Kim, Kyoung-Tae / Youn, Dae-Hee:
"Efficient implementation of ITU-t g.723.1 speech coder for multichannel voice transmission and storage",
2017-2020.
Resources, Assessment and Standards: Corpora
Hansen, John H. L. / Angkititrakul, Pongtep / Plucienkowski, Jay / Gallant, Stephen / Yapanel, Umit / Pellom, Bryan / Ward, Wayne / Cole, Ron:
""CU-move" : analysis & corpus development for interactive in-vehicle speech systems",
2023-2026.
Kawaguchi, Nobuo / Matsubara, Shigeki / Takeda, Kazuya / Itakura, Fumitada:
"Multimedia data collection of in-car speech communication",
2027-2030.
Heeman, Peter A. / Cole, David / Cronk, Andrew:
"The u.s. speechdat-car data collection",
2031-2034.
Németh, Géza / Zainkó, Csaba:
"Word unit based multilingual comparative analysis of text corpora",
2035-2038.
Backfried, Gerhard / Hecht, Robert / Loots, Sabine / Pfannerer, Norbert / Riedler, Jürgen / Schiefer, Christian:
"Creating a european English broadcast news transcription corpus and system",
2039-2042.
Burger, Susanne / Besacier, Laurent / Coletti, Paolo / Metze, Florian / Morel, Céline:
"The nespole! voIP dialogue database",
2043-2046.
Matousek, Jindrich / Psutka, Josef / Kruta, Jiri:
"Design of speech corpus for text-to-speech synthesis",
2047-2050.
Son, Rob J. J. H. van / Binnenpoorte, Diana / Heuvel, Henk van den / Pols, Louis C. W.:
"The IFA corpus: a phonemically segmented dutch "open source" speech database",
2051-2054.
Louw, Philippa H. / Roux, Justus C. / Botha, Elizabeth C.:
"African speech technology (AST) telephone speech databases: corpus design and contents",
2055-2058.
Heuvel, Henk van den / Boudy, Jerome / Bakcsi, Zsolt / Cernocky, Jan / Galunov, Valery / Kochanina, Julia / Majewski, Wojciech / Pollak, Petr / Rusko, Milan / Sadowski, Jerzy / Staroniewicz, Piotr / Tropf, Herbert S.:
"Speechdat-e: five eastern european speech databases for voice-operated teleservices completed",
2059-2062.
Gibbon, Dafydd / Trippel, Thorsten / Sharoff, Serge:
"Concordancing for parallel spoken language corpora",
2063-2066.
Psutka, Josef / Radova, Vlasta / Müller, Ludek / Matousek, Jindrich / Ircing, Pavel / Graff, David:
"Large broadcast news and read speech corpora of spoken czech",
2067-2070.
Yablonsky, Serge A.:
"Development of Russian lexical databases, corpora and supporting tools for speech products",
2071-2074.
Fotinea, Stavroula-Evita F. / Tambouratzis, George D. / Carayannis, George V.:
"Constructing a segment database for greek time domain speech synthesis",
2075-2078.
Resources, Assessment and Standards: Assessment Methodology
Hone, Kate S. / Graham, Robert:
"Subjective assessment of speech-system interface usability",
2083-2086.
Chu, Min / Peng, Hu:
"An objective measure for estimating MOS of synthesized speech",
2087-2090.
Strik, Helmer / Cucchiarini, Catia / Kessens, Judith M.:
"Comparing the performance of two CSRs: how to determine the significance level of the differences",
2091-2094.
Terashima, Ryuta / Hoshino, Hiroyuki / Wakita, Toshihiro:
"Prediction of low recognition rate words for isolated word recognition system",
2095-2098.
Batusek, Robert:
"An objective measure for assessment of the concatenative TTS segment inventories",
2099-2102.
Speech Recognition and Understanding: Confidence Measures
Zhang, Rong / Rudnicky, Alexander I.:
"Word level confidence annotation using combinations of features",
2105-2108.
Moreno, Pedro J. / Logan, Beth / Raj, Bhiksha:
"A boosting approach for confidence scoring",
2109-2112.
Charlet, Delphine / Mercier, Guy / Jouvet, Denis:
"On combining confidence measures for improved rejection of incorrect data",
2113-2116.
Palmer, David D. / Ostendorf, Mari:
"Improved word confidence estimation using long range features",
2117-2120.
Carpenter, Paul / Jin, Chun / Wilson, Daniel / Zhang, Rong / Bohus, Dan / Rudnicky, Alexander I.:
"Is this conversation on track?",
2121.
Speech Recognition and Understanding: Language Modelling
Nisimura, Ryuichi / Komatsu, Kumiko / Kuroda, Yuka / Nagatomo, Kentaro / Lee, Akinobu / Saruwatari, Hiroshi / Shikano, Kiyohiro:
"Automatic n-gram language model creation from web resources",
2127-2130.
Caseiro, Diamantino / Trancoso, Isabel:
"On integrating the lexicon with the language model",
2131-2134.
Varona, A. / Torres, I.:
"Back-off smoothing evaluation over syntactic language models",
2135-2138.
Wu, Genqing / Zheng, Fang / Jin, Ling / Wu, Wenhu:
"An online incremental language model adaptation method",
2139-2142.
Samuelsson, Christer / Hieronymus, James L.:
"Using boosting and POS word graph tagging to improve speech recognition",
2143-2146.
Dialogue Systems: Techniques and Strategies
Yan, Pengju / Zheng, Fang / Xu, Mingxing:
"Robust parsing in spoken dialogue systems",
2149-2152.
Huang, Yinfei / Zheng, Fang / Su, Yi / Li, Fang / Wu, Wenhu:
"A theme structure method for the ellipsis resolution",
2153-2156.
Haase, Martin / Kriechbaum, Werner / Möhler, Gregor / Stenzel, Gerhard:
"Deriving document structure from prosodic cues",
2157-2160.
Su, Yi / Zheng, Fang / Huang, Yinfei:
"Design of a semantic parser with support to ellipsis resolution in a Chinese spoken language dialogue system",
2161-2164.
San-Segundo, R. / Montero, J. M. / Colás, J. / Gutiérrez, J. / Ramos, J. M. / Pardo, Juan M.:
"Methodology for dialogue design in telephone-based spoken dialogue systems: a Spanish train information system",
2165-2168.
Zhang, Bo / Cai, Qingsheng / Mao, Jianfeng / Chang, Eric / Guo, Baining:
"Spoken dialogue management as planning and acting under uncertainty",
2169-2172.
Matsusaka, Yosuke / Fujie, Shinya / Kobayashi, Tetsunori:
"Modeling of conversational strategy for the robot participating in the group conversation",
2173-2176.
Terken, Jacques / Riele, Saskia te:
"Supporting the construction of a user model in speech-only interfaces by adding multi-modality",
2177-2180.
Tseng, Shu-Chuan:
"A word- and turn-oriented approach to exploring the structure of Mandarin dialogues",
2181-2184.
Niimi, Yasuhisa / Oku, Tomoki / Nishimoto, Takuya / Araki, Masahiro:
"A rule based approach to extraction of topics and dialog acts in a spoken dialog system",
2185-2188.
Turunen, Markku / Hakulinen, Jaakko:
"Agent-based error handling in spoken dialogue systems",
2189-2192.
Degerstedt, Lars / Jönsson, Arne:
"Iterative implementation of dialogue system modules",
2193-2196.
Oppermann, Daniela / Schiel, Florian / Steininger, Silke / Beringer, Nicole:
"Off-talk - a problem for human-machine-interaction?",
2197-2200.
Schwarz, Jana / Matousek, Vaclav:
"Automatic analysis of real dialogues and generating of training corpora",
2201-2204.
Macherey, Klaus / Och, Franz Josef / Ney, Hermann:
"Natural language understanding using statistical machine translation",
2205-2208.
Zhang, Jianping / Ward, Wayne / Pellom, Bryan / Yu, Xiuyang / Hacioglu, Kadri:
"Improvements in audio processing and language modeling in the CU communicator",
2209-2212.
Tsai, Augustine / Pargellis, Andrew N. / Lee, Chin-Hui / Olive, Joseph P.:
"Dialogue session: management using voiceXML",
2213-2216.
Ammicht, Egbert / Potamianos, Alexandros / Fosler-Lussier, Eric:
"Ambiguity representation and resolution in spoken dialogue systems",
2217-2220.
Popovici, C. / Andorno, M. / Laface, P. / Fissore, L. / Nigra, M. / Vair, C.:
"Learning of user formulations for business listings in automatic directory assistance",
2325-2328.
Louloudis, D. / Tsopanoglou, A. / Fakotakis, Nikos / Kokkinakis, George:
"Mathematical modeling of spoken human - machine dialogues including erroneous confirmations",
2329-2332.
Lewin, Ian:
"Limited enquiry negotiation dialogues",
2333-2336.
Cox, Stephen / Shahshahani, Ben:
"A comparison of some different techniques for vector based call-routing",
2337-2340.
Niklfeld, Georg / Finan, Robert / Pucher, Michael:
"Architecture for adaptive multimodal dialog systems based on voiceXML",
2341-2344.
Speech Synthesis: Miscellaneous
Tsuzaki, Minoru:
"Feature extraction by auditory modeling for unit selection in concatenative speech synthesis",
2223-2226.
Lee, Minkyu:
"Perceptual cost functions for unit searching in large corpus-based text-to-speech",
2227-2230.
Kim, Sanghun / Lee, Youngjik / Hirose, Keikichi:
"Pruning of redundant synthesis instances based on weighted vector quantization",
2231-2234.
Fitt, Susan:
"Using real words for recording diphones",
2235-2238.
Dines, John / Sridharan, Sridha / Moody, Miles:
"Application of the trended hidden Markov model to speech synthesis",
2239-2242.
Sandri, Stefano / Zovato, Enrico:
"Two features to check phonetic transcriptions in text to speech systems",
2243-2246.
Xydas, Gerasimos / Kouroupetroglou, Georgios:
"Text-to-speech scripting interface for appropriate vocalisation of e-texts",
2247-2250.
Rojc, Matej / Kacic, Zdravko:
"Representation of large lexica using finite-state transducers for the multilingual text-to-speech synthesis systems",
2251-2254.
Hirose, Keikichi / Eto, Masaya / Minematsu, Nobuaki / Sakurai, Atsuhiro:
"Corpus-based synthesis of fundamental frequency contours based on a generation process model",
2255-2258.
Tychtl, Zbyn.ek / Psutka, Josef:
"Corpus-based database of residual excitations used for speech reconstruction from MFCCs",
2259-2262.
Yoshimura, Takayoshi / Tokuda, Keiichi / Masuko, Takashi / Kobayashi, Takao / Kitamura, Tadashi:
"Mixed excitation for HMM-based speech synthesis",
2263-2266.
Ohtsuka, Takahiro / Kasuya, Hideki:
"Aperiodicity control in ARX-based speech analysis-synthesis method",
2267-2270.
Karjalainen, Matti / Paatero, Tuomas:
"Generalized source-filter structures for speech synthesis",
2271-2274.
Wypych, Mikolaj:
"The speech synthesis environment and parametric modeling of coarticulation",
2275-2278.
Integration of Phonetic Knowledge in Speech Technology: Experiments and Experiences (Special Session)
Carson-Berndsen, Julie / Walsh, Michael:
"Defining constraints for multilinear speech processing",
2281-2284.
Batliner, Anton / Möbius, Bernd / Möhler, Gregor / Schweitzer, Antje / Nöth, Elmar:
"Prosodic models, automatic speech understanding, and speech synthesis: towards the common ground",
2285-2288.
Christensenyz, Heidi / Lindbergy, Břrge / Anderseny, Ove:
"Introducing phonetically motivated information into ASR",
2289-2292.
Gravier, Guillaume / Yvon, Francois / Jacob, Bruno / Bimbot, Frédéric:
"Integrating contextual phonological rules in a large vocabulary decoder",
2293-2296.
Pastor-i-Gadea, M. / Casacuberta, F.:
"Automatic learning of finite state automata for pronunciation modeling",
2297-2300.
Speech Coding: Wideband Speech Coding
Rotola-Pukkila, J. / Vainio, J. / Mikkola, H. / Järvinen, K. / Bessette, B. / Lefebvre, Roch / Salami, R. / Jeline, M.:
"AMR wideband codec - leap in mobile communication voice quality",
2303-2306.
Farrugia, Maria / Kondoz, Ahmet M.:
"Combined speech and audio coding with bit rate and bandwidth scalability",
2307-2310.
Fék, Mark / Várkonyi-Kóczy, Annamária R. / Boucher, Jean-Marc:
"Joint speech and audio coding combining sinusoidal modeling and wavelet packets",
2311-2314.
Ritz, C. H. / Burnett, I. S.:
"Temporal decomposition: a promising approach to low rate wideband speech compression",
2315-2318.
Ragot, Stephane / Lahdili, Hassan / Lefebvre, Roch:
"Wideband LSF quantization by generalized voronoi codes",
2319-2322.
Speech Recognition and Understanding: Robust ASR
Rigazio, Luca / Nguyen, Patrick / Kryze, David / Junqua, Jean-Claude:
"Separating speaker and environment variabilities for improved recognition in non-stationary conditions",
2347-2350.
Rose, Richard C. / Kim, Hong Kook / Hindle, Don:
"Robust speech recognition techniques applied to a speech in noise task",
2351-2355.
Afify, Mohamed / Siohan, Olivier / Lee, Chin-Hui:
"Minimax classification with parametric neighborhoods for noisy speech recognition",
2355-2358.
Padmanabhan, M. / Dharanipragada, S.:
"Maximum likelihood non-linear transformation for environment adaptation in speech recognition",
2359-2362.
Turunen, Jari / Vlaj, Damjan:
"A study of speech coding parameters in speech recognition",
2363-2366.
Applications: Miscellaneous Applications
Garcia-Mateo, Carmen / Docio-Fernandez, Laura / Cardenal-Lopez, Antonio:
"Some practical considerations in the deployment of a wireless-communication interactive voice response system",
2369-2372.
Rosenberg, Aaron / Hirschberg, Julia / Bacchiani, Michiel / Parthasarathy, S. / Isenhour, Philip / Stead, Larry:
"Caller identification for the SCANMail voicemail browser",
2373-2376.
Koumpis, Konstantinos / Renals, Steve / Niranjan, Mahesan:
"Extractive summarization of voicemail using lexical and prosodic feature subset selection",
2377-2380.
Scharenborg, Odette / Sturm, Janienke / Boves, Lou:
"Business listings in automatic directory assistance",
2381-2384.
Pastor-i-Gadea, M. / Sanchis, A. / Casacuberta, F. / Vidal, E.:
"Eutrans: a speech-to-speech translator prototype",
2385-2388.
Metze, Florian / McDonough, John / Soltau, Hagen:
"Speech recognition over netmeeting connections",
2389-2392.
Martín, Juan C. Díaz / Zapata, Juan L. García / García, José M. Rodríguez / Salgado, José F. Álvarez / Bueno, Pablo Espada / Vilda, Pedro Gómez:
"DIARCA: a component approach to voice recognition",
2393-2396.
Kyung, Y. J. / Jung, J. O. / Sohn, S. M. / Chun, H. J. / Moon, S. Y. / Kim, M. H. / Sull, W. H.:
"The mvprotek : m-commerce voice verification system",
2397-2400.
Alm, Norman / Iwabuchi, Mamoru / Andreasen, Peter N. / Nakamura, Kenryu / Murray, Iain R.:
"Real-time multilingual communication by means of prestored conversational units",
2401-2404.
Murray, Iain R. / Arnott, John L. / Alm, Norman / Dye, Richard / Harper, Gillian:
"Writing script-based dialogues for AAC",
2405-2409.
Iida, Akemi / Sakurada, Yosuke / Campbell, Nick / Yasumura, Michiaki:
"Communication aid for non-vocal people using corpusbased concatenative speech synthesis",
2409-2412.
Suzuki, Noriko / Kakehi, Kazuhiko / Takeuchi, Yugo / Okada, Michio:
"Social effects on vocal rate with echoic mimicry using prosody-only voice",
2413-2416.
Castelli, Eric / Istrate, Dan:
"Everyday life sounds and speech analysis for a medical telemonitoring system",
2417-2420.
Draxler, Christoph / Bengler, Klaus / Olaverri-Monreal, Christina:
"Speaking while driving - preliminary results on spellings in the German speechdat-car database",
2421-2424.
Signal Analysis: Pitch and Speech Analysis
Chazan, Dan / Tzur, Meir (Zibulski) / Hoory, Ron / Cohen, Gilad:
"Efficient periodicity extraction based on sine-wave representation and its application to pitch determination of speech signals",
2427-2430.
Shdaifat, I. / Grigat, R. / Lütgert, Stefan:
"Viseme recognition using multiple feature matching",
2431-2434.
Hirtum, A. Van / Berckmans, D.:
"The fundamental frequency of cough by autocorrelation analysis",
2435-2438.
Ishimoto, Yuichi / Unoki, Masashi / Akagi, Masato:
"A fundamental frequency estimation method for noisy speech based on instantaneous amplitude and frequency",
2439-2442.
Sasou, Akira / Tanaka, Kazuyo:
"Robust LP analysis using glottal source HMM with application to high-pitched and noise corrupted speech",
2443-2446.
Choi, Yong-Soo / Youn, Dae-Hee:
"Fast harmonic estimation using a low resolution pitch for low bit rate harmonic coding",
2447-2450.
Cheveigné, Alain de / Kawahara, Hideki:
"Comparative evaluation of F0 estimation algorithms",
2451-2454.
Ishi, Carlos Toshinori / Minematsu, Nobuaki / Nishide, Ryuji / Hirose, Keikichi:
"Identification of accent and intonation in sentences for CALL systems",
2455-2458.
Kawahara, Hideki / Zolfaghari, Parham:
"Systematic F0 glitches around nasal-vowel transitions",
2459-2462.
Wojdel, Jacek C. / Rothkrantz, Leon J. M.:
"Using aerial and geometric features in automatic lip-reading",
2463-2466.
Schnell, Karl / Lacroix, Arild:
"Inverse filtering of tube models with frequency dependent tube terminations",
2467-2470.
Ouni, Kaďs / Lachiri, Zied / Ellouze, Noureddine:
"Formant estimation using gammachirp filterbank",
2471-2474.
Potamitis, I. / Fakotakis, Nikos:
"Autoregressive time-frequency interpolation in the context of missing data theory for impulsive noise compensation",
2475-2478.
Petrinovic, D. / Cuperman, Vladimir:
"Analysis of the voiced speech using the generalized fourier transform with quadratic phase",
2479-2482.
Integration of Phonetic Knowledge in Speech Technology: Is Phonetic Knowledge any use? Panel discussion (Special Session)
Greenberg, Steven:
"From here to utility - melding phonetic insight with speech technology",
2485-2488.
Speech Coding: Speech Transmission Systems
Park, Sang-Wook / Park, Young-Cheol / Youn, Dae-Hee:
"Speech quality measure for voIP using wavelet based bark coherence function",
2491-2494.
Wijngaarden, Sander J. van / Steeneken, Herman J. M.:
"A proposed method for measuring language dependency of narrow band voice coders",
2495-2498.
Yoon, Sung Wan / Jung, Sung Kyo / Park, Young Cheol / Youn, Dae Hee:
"An efficient transcoding algorithm for g.723.1 and g.729a speech coders",
2499-2502.
Perez-Cordoba, Jose L. / Rubio, Antonio J. / Peinado, Antonio M. / Torre, Angel de la:
"Joint source-channel coding for low bit-rate coding of LSP parameters",
2503-2506.
Speech Recognition and Understanding: Rhythm and Timing in ASR
Wrede, Britta / Fink, Gernot A. / Sagerer, Gerhard:
"An investigation of modelling aspects for ratedependent speech recognition",
2527-2530.
Nanjo, Hiroaki / Kato, Kazuomi / Kawahara, Tatsuya:
"Speaking rate dependent acoustic modeling for spontaneous lecture speech recognition",
2531-2534.
Fábián, Tibor / Pfau, Thilo / Ruske, Günther:
"Analysis of n-best output hypotheses for fast speech in large vocabulary continuous speech recognition",
2535-2538.
Farinas, Jérôme / Pellegrino, François:
"Automatic rhythm modeling for language identification",
2539-2542.
Speech Recognition and Understanding: Confidence Measures and OOV
Zhang, Yaxin / Lee, Raymond / Madievski, Anton:
"Confidence measure (CM) estimation for large vocabulary speaker-independent continuous speech recognition system",
2545-2548.
Kodama, Yasuhiro / Utsuro, Takehito / Nishizaki, Hiromitsu / Nakagawa, Seiichi:
"Experimental evaluation on confidence of agreement among multiple Japanese LVCSR models",
2549-2552.
San-Segundo, R. / Macías-Guarasa, J. / Ferreiros, J. / Martín, P. / Pardo, Juan M.:
"Detection of recognition errors and out of the spelling dictionary names in a spelled name recognizer for Spanish",
2553-2556.
Mengusoglu, Erhan / Ris, Christophe:
"Use of acoustic prior information for confidence measure in ASR applications",
2557-2560.
Ferrer, Luciana / Estienne, Claudio:
"Improving performance of a keyword spotting system by using a new confidence measure",
2561-2564.
Tan, Beng T. / Gu, Yong / Thomas, Trevor:
"Word level confidence measures using n-best sub-hypotheses likelihood ratio",
2565-2568.
Goel, Vaibhava / Kumar, Shankar / Byrne, William:
"Confidence based lattice segmentation and minimum Bayes-risk decoding",
2569-2572.
Jiang, Hui / Soong, Frank K. / Lee, Chin-Hui:
"A data selection strategy for utterance verification in continuous speech recognition",
2573-2576.
Ogata, J. / Ariki, Y.:
"Improved speech recognition using iterative decoding based on confidence measures",
2577-2580.
Schaaf, Thomas:
"Detection of OOV words using generalized word models and a semantic class language model",
2581-2584.
Bouwman, Gies / Sturm, Janienke / Boves, Lou:
"Effects of OOV rates on keyphrase rejection schemes",
2585-2588.
Signal Analysis: Source Localisation and Beam Forming
Sánchez-Bote, J. L. / González-Rodríguez, J. / Simón-Zorita, D.:
"A new auditory based microphone array and objective evaluation using e-RASTI",
2591-2594.
Araki, Shoko / Makino, Shoji / Mukai, Ryo / Saruwatari, Hiroshi:
"Equivalence between frequency domain blind source separation and frequency domain adaptive null beamformers",
2595-2598.
Mukai, Ryo / Araki, Shoko / Makino, Shoji:
"Separation and dereverberation performance of frequency domain blind source separation for speech in a reverberant environment",
2599-2602.
Saruwatari, Hiroshi / Kawamura, Toshiya / Shikano, Kiyohiro:
"Blind source separation for speech based on fast-convergence algorithm with ICA and beamforming",
2603-2606.
Mizumachi, Mitsunori / Nakamura, Satoshi:
"Noise reduction using paired-microphones for both far-field and near-field sound sources",
2607-2610.
Nishiura, Takanobu / Nakamura, Satoshi / Shikano, Kiyohiro:
"Statistical sound source identification in a real acoustic environment for robust speech recognition using a microphone array",
2611-2614.
Álvarez-Marquina, A. / Gómez-Vilda, P. / Martínez-Olalla, R. / Nieto-Lluís, V. / Rodellar-Biarge, V.:
"Speech enhancement and source separation based on binaural negative beamforming",
2615-2618.
Gómez-Vilda, P. / Álvarez-Marquina, A. / Nieto-Lluís, V. / Rodellar-Biarge, V. / Martínez-Olalla, R.:
"Multiple source separation in the frequency domain using negative beamforming",
2619-2622.
Martin, Rainer / Petrovsky, Alexey / Lotter, Thomas:
"Planar superdirective microphone arrays for speech acquisition in the car",
2623-2626.
Kinnunen, Tomi / Kärkkäinen, Ismo / Fränti, Pasi:
"Is speech data clustered? - statistical analysis of cepstral features",
2627-2630.
Nokas, George / Dermatas, Evangelos / Kokkinakis, George:
"Maximum likelihood adaptation for distant speech recognition of stationary and moving speakers in reverberant environments",
2631-2634.
Couvreur, Laurent / Ris, Christophe / Couvreur, Christophe:
"Model-based blind estimation of reverberation time: application to robust ASR in reverberant environments",
2635-2638.
Momomura, Yasunori / Okada, Kenji / Arai, Takayuki / Kanedera, Noboru / Murahara, Yuji:
"Using the modulation complex wavelet transform for feature extraction in automatic speech recognition",
2639-2642.
Okuno, Hiroshi G. / Nakadai, Kazuhiro / Lourens, Tino / Kitano, Hiroaki:
"Separating three simultaneous speeches with two microphones by integrating auditory and visual processing",
2643-2646.
Signal Analysis: Speech Features and Modelling
Funaki, Keiichi:
"A time-varying complex AR speech analysis based on GLS and ELS method",
2649-2652.
Pitz, Michael / Molau, Sirko / Schlüter, Ralf / Ney, Hermann:
"Vocal tract normalization equals linear transformation in cepstral space",
2653-2656.
Yu, An-Tze / Wang, Hsiao-Chuan:
"An algorithm for finding line spectrum frequencies of added speech signals and its application to robust speech recognition",
2657-2660.
Gonon, G. / Montrésor, S. / Baudry, M.:
"Improved entropic gain for speech signals analysis/synthesis based on an adaptive time-frequency segmentation scheme",
2661-2664.
Speech Recognition and Understanding: Kids, Toys and Emotions
Lucke, Helmut / Omote, Masanori:
"Automatic word acquisition from continuous speech",
2667-2670.
Li, Qun / Russell, Martin J.:
"Why is automatic recognition of children's speech difficult?",
2671-2674.
Arunachalam, Sudha / Gould, Dylan / Andersen, Elaine / Byrd, Dani / Narayanan, Shrikanth:
"Politeness and frustration language in child-machine interactions",
2675-2678.
Nogueiras, Albino / Moreno, Asunción / Bonafonte, Antonio / Marińo, José B.:
"Speech emotion recognition using hidden Markov models",
2679-2682.
Applications: Media Applications
Ibrahim, Aseel / Lundberg, Jonas / Johansson, Jenny:
"Speech enhanced remote control for media terminal",
2685-2688.
Amaral, Rui / Langlois, Thibault / Meinedo, Hugo / Neto, Joao / Souto, Nuno / Trancoso, Isabel:
"The development of a portuguese version of a media watch system",
2689-2692.
Roach, Matthew / Mason, John S.:
"Classification of video genre using audio",
2693-2696.
Horiuchi, Yasuo / Ichikawa, Akira:
"Prosody in finger braille and teletext receiver for finger braille",
2697-2702.
Speech Recognition and Understanding: Distributed Speech Recognition
Bernard, Alexis / Alwan, Abeer:
"Joint channel decoding - Viterbi recognition for wireless applications",
2703-2706.
Peinado, Antonio M. / Sanchez, Victoria / Segura, José C. / Perez-Cordoba, José L.:
"MMSE-based channel error mitigation for distributed speech recognition",
2707-2710.
Stadermann, J. / Meermeier, R. / Rigoll, Gerhard:
"Distributed speech recognition using traditional and hybrid modeling techniques",
2711-2714.
Riskin, Eve A. / Boulis, Constantinos / Otterson, Scott / Ostendorf, Mari:
"Graceful degradation of speech recognition performance over lossy packet networks",
2715-2718.
Speech Recognition and Understanding: Prosody and Cross-Language in ASR
Schultz, Tanja / Waibel, Alex:
"Experiments on cross-language acoustic modeling",
2721-2724.
Zgank, Andrej / Imperl, Bojan / Johansen, Finn Tore / Kacic, Zdravko / Horvat, Bogomir:
"Crosslingual speech recognition with multilingual acoustic models based on agglomerative and tree-based triphone clustering",
2725-2729.
Harju, Mikko / Salmela, Petri / Leppänen, Jussi / Viikki, Olli / Saarinen, Jukka:
"Comparing parameter tying methods for multilingual acoustic modelling",
2729-2732.
Chengalvarayan, Rathi:
"Accent-independent universal HMM-based speech recognizer for american, australian and british English",
2733-2736.
Chen, Fang / Sääv, Jonas:
"The effect of time stress on automatic speech recognition accuracy when using second language",
2737-2740.
Wong, Yiu Wing / Chang, Eric:
"The effect of pitch and lexical tone on different Mandarin speech recognition tasks",
2741-2744.
Stemmer, Georg / Nöth, Elmar / Niemann, Heinrich:
"Acoustic modeling of foreign words in a German speech recognition system",
2745-2748.
Siu, K. C. / Meng, Helen M.:
"Semi-automatic grammar induction for bi-directional English-Chinese machine translation",
2749-2752.
Charnvivit, Patavee / Jitapunkul, Somchai / Ahkuputra, Visarut / Maneenoi, Ekkarit / Thathong, Umavasee / Thampanitchawong, Boonchai:
"F0 feature extraction by polynomial regression function for monosyllabic Thai tone recognition",
2753-2756.
Kim, Ji-Hwan / Woodland, P. C.:
"The use of prosody in a combined system for punctuation generation and speech recognition",
2757-2760.
Wang, Chao / Seneff, Stephanie:
"Lexical stress modeling for improved speech recognition of spontaneous telephone speech in the jupiter domain",
2761-2765.
Stephenson, Todd A. / Mathew, M. / Bourlard, Herve:
"Modeling auxiliary information in Bayesian network based ASR",
2765-2768.
Chen, Feili / Chang, Eric:
"A new dynamic HMM model for speech recognition",
2769-2772.
Wang, Wern-Jun / Lee, Chun-Jen / Huang, Eng-Fong / Chen, Sin-Horng:
"Multi-keyword spotting of telephone speech using orthogonal transform-based SBR and RNN prosodic model",
2773-2776.
Iskra, Andrej / Petek, Bojan / Brřndsted, Tom:
"Recognition of slovenian speech: within and cross-language experiments on monophones using the speechdat(II)",
2777-2780.
Batliner, Anton / Buckow, Jan / Huber, Richard / Warnke, Volker / Nöth, Elmar / Niemann, Heinrich:
"Boiling down prosody for the classification of boundaries and accents in German and English",
2781-2784.
Education: Education and Training
Drygajlo, Andrzej / Garcia Molina, Gary:
"Javaspeakerrecognition - interactive workbench for visualizing speaker recognition concepts on the WWW",
2787-2790.
Arai, Takayuki / Usuki, Nobuyuki / Murahara, Yuji:
"Prototype of a vocal-tract model for vowel production designed for education in speech science",
2791-2794.
Cooke, Martin / Garcia-Lecumberri, Maria Luisa / Maidment, John:
"A tool for automatic feedback on phonemic transcription",
2795-2798.
Chang, Eric / Shi, Yu / Zhou, Jianlai / Huang, Chao:
"Speech lab in a box: a Mandarin speech toolbox to jumpstart speech related research",
2799-2802.
Jong, John H. A. L. de / Bernstein, Jared:
"Relating phonepass scores overall scores to the council of europe framework level descriptors",
2803-2806.
Vicsi, Klára / Roach, Peter / Öster, Anne-Marie / Kacic, Zdravko / Csatári, F. / Sfakianaki, A. / Veronik, R. / Gordos, Géza:
"A multilingual, multimodal, speech training system, SPECO",
2807-2810.
Nakamura, Naoki / Minematsu, Nobuaki / Nakagawa, Seiichi:
"Instantaneous estimation of accentuation habits for Japanese students to learn English pronunciation",
2811-2814.
Tanaka, Takashi / Mori, Kazumasa / Kobayashi, Satoshi / Nakagawa, Seiichi:
"Automatic construction of CALL system from TV news program with captions",
2815-2818.
Speaker Recognition: Features and Robustness
Arcienega, Mijail / Drygajlo, Andrzej:
"Pitch-dependent GMMs for text-independent speaker recognition systems",
2821-2825.
Ezzaidi, Hassan / Rouat, Jean / O’Shaughnessy, Douglas:
"Towards combining pitch and MFCC for speaker recognition systems",
2825-2828.
Kim, Yu-Jin / Jung, Hea-Kyoung / Chung, Jae-Ho:
"Formant-broadened CMS using peak-picking in LOG spectrum",
2829-2832.
Mashao, Daniel J. / Baloyi, N. Tinyiko:
"Improvements in the speaker identification rate using feature-sets",
2833-2836.
Miyajima, Chiyomi / Tokuda, Keiichi / Kitamura, Tadashi:
"Minimum classification error training for speaker identification using Gaussian mixture models based on multi-space probability distribution",
2837-2840.
Yadong, Wu / Zhizhu, Li:
"Speaker recognition based on feature space trace",
2841-2844.
Yoma, Nestor Becerra / Fernandez, Miguel Villar:
"Additive and convolutional noise canceling in speaker verification using a stochastic weighted viterbi algorithm",
2845-2848.
Yoshida, Kenichi / Takagi, Kazuyuki / Ozeki, Kazuhiko:
"A multi-SNR subband model for speaker identification under noisy environments",
2849-2852.