Table of Contents and Access to Abstracts - Ordered by Sessions
Keynotes
Articulatory Measurements and Modelling
Assessment
Audio-Visual Speech
Corpora (Oral)
Corpora (Poster)
Dialogue (Oral)
Dialogue 1 (Poster)
Dialogue 2 (Poster)
Disorders in Speech Production and/or Speech Perception
Enhancements, Echo Cancellation, and Quality Measures
First and Second Language Learning
Joint Source-Channel Coding
Language Identification
Multimodal Interaction
Prosody - Prosodic Features in Dialogues
Prosody - Prosodic Phrasing and Interruptions
Prosody - Stress, Accent and Prominence Phrasing
Prosody - Study of Prosody for Speech Synthesis
Prosody - Temporal and/or Intonational Features
Speaker Recognition - Acoustic Features and Robustness
Speaker Recognition - Scoring and Decision
Speaker Recognition and Topic Detection
Speech Acoustics
Speech Analysis and Segmentation
Speech Analysis and Tools
Speech and Noise (Oral)
Speech and Noise 1 (Poster)
Speech and Noise 2 (Poster)
Speech and Noise 3 (Poster)
Speech and the Internet (Oral)
Speech and the Internet (Poster)
Speech Coding
Speech Communication Education
Speech Disorders & Speech for Disabled
Speech Generation and Synthesis - Acoustic Synthesis
Speech Generation and Synthesis - Acoustic Synthesis and Units
Speech Generation and Synthesis - Concatenation
Speech Generation and Synthesis - Prosody
Speech Generation and Synthesis - Systems and Evaluation
Speech Generation and Synthesis - Systems, Linguistic Processing
Speech Perception (Oral)
Speech Perception 1
Speech Perception 2
Speech Recognition - Acoustic Modelling 1 (Oral)
Speech Recognition - Acoustic Modelling 2 (Oral)
Speech Recognition - Acoustic Modelling 1 (Poster)
Speech Recognition - Acoustic Modelling 2 (Poster)
Speech Recognition - Acoustic Processing
Speech Recognition - Adaptation 1
Speech Recognition - Adaptation 2
Speech Recognition - Adaptation (Poster)
Speech Recognition - Broadcast News (Oral)
Speech Recognition - Broadcast News (Poster)
Speech Recognition - Confidence Measures 1
Speech Recognition - Confidence Measures 2
Speech Recognition - Language Modelling 1 (Oral)
Speech Recognition - Language Modelling 2 (Oral)
Speech Recognition - Language Modelling 1 (Poster)
Speech Recognition - Language Modelling 2 (Poster)
Speech Recognition - Large Vocabulary Continuous Speech Recognition (LVCSR)
Speech Recognition - Multilinguality
Speech Recognition - Multi-stream ASR
Speech Recognition - Search
Speech Recognition - Search and Pronunciation Modelling
Speech Recognition - Speaking Rate
Speech Recognition - Training
Speech Signal Processing
Speech Technology for Language Learning
Speech Translation
Speech Understanding - Miscellaneous Topics
Spoken Dialogue Systems
Systems, Architectures
Systems, Architectures, Interfaces
Text-Dependent Speaker Verification
Text-Independent Speaker Verification and Tracking
Topic Detection and Tracking
Wideband and Perceptually Based Coding
Keynotes
Jelinek, Frederick / Chelba, Ciprian:
"Putting language into language modeling",
keynote paper 1.
Gósy, Mária:
"The controversial connection between speech production and perception: theories vs. facts",
keynote paper 2.
Maybury, Mark T.:
"Multimedia interaction for the new millennium",
keynote paper 3.
Lindblom, Björn:
"How speech works - questions and preliminary answers",
keynote paper 4.
Speech Recognition - Adaptation 1
Chou, Wu:
"Maximum a posterior linear regression with elliptically symmetric matrix variate priors",
1-4.
Goronzy, Silke / Kompe, Ralf:
"A MAP-like weighting scheme for MLLR speaker adaptation",
5-8.
Hirsch, Hans-Günter:
"HMM adaptation for telephone applications",
9-12.
Huang, Jing / Padmanabhan, Mukund:
"A study of adaptation techniques on a voicemail transcription task",
13-16.
Logan, Beth:
"Maximum likelihood sequential adaptation",
17-20.
Prosody - Prosodic Features in Dialogues
Brinckmann, Caren / Benzmüller, Ralf:
"The relationship between utterance type and F0 contour in German",
21-24.
Grigorova, Evelina / Filipov, Vladimir / Andreeva, Bistra:
"A contrastive investigation of discourse intonational characteristic features of Sofia Bulgarian and Hamburg German in Map Task dialogues",
25-28.
Horne, Merle / Hansson, Petra / Bruce, Gösta / Frid, Johan:
"Prosodic correlates of information structure in Swedish human-human dialogues",
29-32.
Kitazawa, S. / Kobayashi, S.:
"Paralinguistic features as suprasegmental acoustics observed in natural Japanese dialogue",
33-36.
Tamoto, Masafumi / Kawamori, Masahito / Kawabata, Takeshi:
"Integrating prosodic features in dialogue understanding",
37-40.
Speech Recognition - Confidence Measures 1
Cox, Stephen / Dasmahapatra, Srinandan:
"A high-level approach to confidence estimation in speech recognition",
41-44.
Jia, Bin / Zhu, Xiaoyan / Luo, Yupin / Hu, Dongcheng:
"Utterance verification using modified segmental probability model",
45-48.
Klakow, Dietrich / Rose, Georg / Aubert, Xavier:
"OOV-detection in large vocabulary system using automatically defined word-fragments as fillers",
49-52.
Lin, Qiguang / Lubensky, David / Roukos, Salim:
"Use of recursive mumble models for confidence measuring",
53-56.
Rahim, Mazin:
"Utterance verification for the numeric language in a natural spoken dialogue",
57-60.
Speech Recognition - Acoustic Processing
Chengalvarayan, Rathinavelu:
"Robust energy normalization using speech/nonspeech discriminator for German connected digit recognition",
61-64.
Veth, Johan de / Cranen, Bert / Wet, Febe de / Boves, Louis:
"Acoustic pre-processing for optimal effectivity of missing feature theory",
65-68.
Heracleous, Panikos / Yamada, Takeshi / Nakamura, Satoshi / Shikano, Kiyohiro:
"Simultaneous recognition of multiple sound sources based on 3-d n-best search using microphone array",
69-72.
Hermansky, Hynek / Jain, Pratibha:
"Down-sampling speech representation in ASR",
73-76.
Macho, Dusan / Nadeu, Climent / Jancovic, Peter / Rozinaj, Gregor / Hernando, Javier:
"Comparison of time & frequency filtering and cepstral-time matrix approaches in ASR",
77-80.
Meinedo, Hugo / Neto, Joao P. / Almeida, Luis B.:
"Syllable onset detection applied to the Portuguese language",
81-84.
Paliwal, Kuldip K.:
"Decorrelated and liftered filter-bank energies for robust speech recognition",
85-88.
Paches-Leal, Pau / Rose, Richard C. / Nadeu, Climent:
"Optimization algorithms for estimating modulation spectrum domain filters",
89-92.
San-Segundo, R. / Córdoba, R. / Ferreiros, J. / Gallardo, A. / Colás, J. / Pastor, J. / López, Y.:
"Efficient vector quantization using an n-path binary tree search algorithm",
93-96.
Warakagoda, Narada D. / Johnsen, Magne H.:
"Neural network based optimal feature extraction for ASR",
97-100.
Yato, Fumihiro / Inoue, Naomi / Hashimoto, Kazuo:
"A study of speech recognition for the elderly",
101-104.
Zhu, Jie / Chen, Fei-li:
"The analysis and application of a new endpoint detection method based on distance of autocorrelated similarity",
105-108.
Articulatory Measurements and Modelling
Beautemps, Denis / Borel, Pascal / Manolios, Sébastien:
"Hyper-articulated speech: auditory and visual intelligibility",
109-112.
Engwall, Olov:
"Modeling of the vocal tract in three dimensions",
113-116.
Kienast, Miriam / Paeschke, Astrid / Sendlmeier, Walter:
"Articulatory reduction in emotional speech",
117-120.
Kaburagi, Tokihiko / Honda, Masaaki / Okadome, Takeshi:
"A trajectory formation model of articulatory movements using a multidimensional phonemic task",
121-124.
Krstulovic, Sacha:
"LPC-based inversion of the DRM articulatory model",
125-128.
Miki, Nobuhiro / Yokoyama, Thoru / Ohtani, Takeshi / Masaki, Shinobu / Shimada, Ikuhiro / Fujimoto, Ichiro / Nakamura, Yuji:
"A vocal tract model using multi-line equivalent circuits",
129-132.
Matsuda, Masahiro / Kasuya, Hideki:
"Acoustic nature of the whisper",
133-136.
Okadome, Takeshi / Kaburagi, Tokihiko / Honda, Masaaki:
"Relations between utterance speed and articulatory movements",
137-140.
Ouni, Slim / Laprie, Yves:
"Design of hypercube codebooks for the acoustic-to-articulatory inversion respecting the non-linearities of the articulatory-to-acoustic mapping",
141-144.
Owens, Marie / Krüger, Anja / Donnelly, Paul / Smith, F J / Ming, Ji:
"A missing-word test comparison of human and statistical language model performance",
145-148.
Richmond, Korin:
"Estimating velum height from acoustics during continuous speech",
149-152.
Silva, C. / Chennoukh, S. / Trancoso, Isabel:
"On improving the decision algorithm for articulatory codebook search",
153-156.
Thimm, G. / Luettin, J.:
"Extraction of articulators in x-ray image sequences",
157-160.
Teixeira, António / Vaz, Francisco / Príncipe, José Carlos:
"Effects of source-tract interaction in perception of nasality",
161-164.
Vaxelaire, Béatrice / Sock, Rudolph / Hecker, Véronique:
"Perceiving anticipatory phonetic gestures in French",
165-168.
Vilain, Anne / Abry, Christian / Badin, Pierre:
"Motor equivalence evidenced by articulatory modelling",
169-172.
First and Second Language Learning
Wet, Febe de / Cucchiarini, Catia / Strik, Helmer / Boves, Lou:
"Using likelihood ratios to perform utterance verification in automatic pronunciation assessment",
173-176.
Kawai, Goh / Ishi, Carlos Toshinori:
"A system for learning the pronunciation of Japanese pitch accent",
177-182.
Nouza, Jan:
"Computer-aided spoken-language training with enhanced visual and auditory feedback",
183-186.
Song, Zhanjiang / Zheng, Fang / Xu, Mingxing / Wu, Wenhu:
"An effective scoring method for speaking skill evaluation system",
187-190.
Santiago-Oriola, Conception:
"Vocal synthesis in a computerized dictation exercise",
191-194.
Trancoso, Isabel / Viana, Céu / Mascarenhas, Isabel / Teixeira, Carlos:
"On deriving rules for nativised pronunciation in navigation queries",
195-198.
Yvon, Francois:
"Pronouncing unknown words using multi-dimensional analogies",
199-202.
Laczko, Maria:
"Characteristics features of planning of speech and production of secondary schoolchildren's spontaneous speech",
202.
Speech Recognition - Adaptation 2
Byrne, William / Gunawardana, Asela:
"Discounted likelihood linear regression for rapid adaptation",
203-206.
Chien, Jen-Tzung / Junqua, Jean-Claude / Gelin, Philippe:
"Extraction of reliable transformation parameters for unsupervised speaker adaptation",
207-210.
Chesta, Cristina / Siohan, Olivier / Lee, Chin-Hui:
"Maximum a posteriori linear regression for hidden Markov model adaptation",
211-214.
Hariharan, Ramalingam / Viikki, Olli:
"On combining vocal tract length normalisation and speaker adaptation for noise robust speech recognition",
215-218.
Rottland, Jörg / Neukirchen, Christoph / Willett, Daniel / Rigoll, Gerhard:
"Speaker adaptation using regularization and network adaptation for hybrid MMI-NN/HMM speech recognition",
219-222.
Prosody - Prosodic Phrasing and Interruptions
Alter, Kai / Schirmer, Annett / Kotz, Sonja A. / Friederici, Angela D.:
"Prosodic phrasing and accentuation in speech production of patients with right hemisphere lesions",
223-226.
Goto, Masataka / Itou, Katunobu / Hayamizu, Satoru:
"A real-time filled pause detection system for spontaneous speech recognition",
227-230.
Iwano, Koji:
"Prosodic word boundary detection using mora transition modeling of fundamental frequency contours -speaker independent experiments-",
231-234.
Warnke, Volker / Gallwitz, Florian / Batliner, Anton / Buckow, Jan / Huber, R. / Nöth, Elmar / Höthker, A.:
"Integrating multiple knowledge sources for word hypotheses graph interpretation",
235-238.
Yang, Li-chiung:
"Prosodic correlates of interruptions in spoken dialogue",
239-242.
Assessment
Constantinides, Paul C. / Rudnicky, Alexander I.:
"Dialog analysis in the Carnegie Mellon Communicator",
243-246.
Fiscus, Jon / Doddington, George / Garofolo, John / Martin, Alvin:
"NIST's 1998 topic detection and tracking evaluation (TDT2)",
247-250.
Sonntag, Gerit P. / Portele, Thomas / Haas, Felicitas / Köhler, Joachim:
"Comparative evaluation of six German TTS systems",
251-254.
Steeneken, Herman J. M.:
"Standardisation of ergonomic assessment of speech communication",
255-258.
Suzuki, Noriko / Takeuchi, Yugo / Ishii, Kazuo / Okada, Michio:
"Evaluation of affiliation in interaction with autonomous creatures",
259-262.
Speech Recognition - Confidence Measures 2
Bauer, Josef G. / Junkawitsch, Jochen:
"Accurate recognition of city names with spelling as a fall back strategy",
263-266.
Bartkova, Katarina / Jouvet, Denis:
"Selective prosodic post-processing for improving recognition of French telephone numbers",
267-270.
Chang, Eric I.:
"Improving rejection with semantic slot-based confidence scores",
271-274.
Davies, K. / Donovan, R. / Epstein, M. / Franz, Martin / Ittycheriah, Abraham / Jan, E. E. / LeRoux, J. M. / Lubensky, David / Neti, Chalapathy / Padmanabhan, Mukund / Papineni, K. / Roukos, Salim / Sakrajda, A. / Sorensen, Jeffrey S. / Tydlitat, B. / Ward, T.:
"The IBM conversational telephony system for financial applications",
275-278.
El Méliani, Rachida / O'Shaughnessy, Douglas:
"Error spotting using syllabic fillers in spontaneous conversational speech recognition",
279-282.
Jouvet, Denis / Monné, Jean:
"Recognition of spelled names over the telephone and rejection of data out of the spelling lexicon",
283-286.
Koo, Myoung-Wan / Lee, Sun-Jeong:
"An utterance verification system based on subword modeling for a vocabulary independent speech recognition system",
287-290.
Moreau, Nicolas / Jouvet, Denis:
"Use of a confidence measure based on frame level likelihood ratios for the rejection of incorrect data",
291-294.
Macías-Guarasa, J. / Ferreiros, J. / Gallardo, A. / San-Segundo, R. / Pardo, Juan Manuel / Villarrubia, L.:
"Variable preselection list length estimation using neural networks in a telephone speech hypothesis-verification system",
295-298.
Pfau, Thilo / Faltlhauser, Robert / Ruske, Günther:
"Speaker normalization and pronunciation variant modeling: helpful methods for improving recognition of fast speech",
299-302.
Rose, Richard C. / Riccardi, Giuseppe:
"Automatic speech recognition using acoustic confidence conditioned language models",
303-306.
Strom, Volker / Heine, Henrik:
"Utilizing prosody for unconstrained morpheme recognition",
307-310.
Stolcke, Andreas / Shriberg, Elizabeth / Hakkani-Tür, Dilek / Tür, Gökhan:
"Modeling the prosody of hidden events for improved word recognition",
311-314.
Wessel, Frank / Macherey, Klaus / Ney, Hermann:
"A comparison of word graph and n-best list based confidence measures",
315-318.
Speech Analysis and Tools
Prätzas, Marcus M. / Balss, Ulrich / Reininger, Herbert / Wüst, Harald:
"C++ software environment for speech signal processing",
319-322.
Ma, Kun / Demirel, Pelin / Espy-Wilson, Carol / MacAuslan, Joel:
"Improvement of electrolaryngeal speech by introducing normal excitation information",
323-326.
Ittycheriah, Abraham / Mammone, Richard J.:
"Detecting user speech in barge-in over prompts using speaker identification methods",
327-330.
Lobanov, Boris / Levkovskaya, T. / Kheidorov, Igor E.:
"Speaker and channel-normalized set of formant parameters for telephone speech recognition",
331-334.
Liew, Alan W.C. / Sum, K. L. / Leung, S. H. / Lau, Wai H.:
"Fuzzy segmentation of lip image using cluster analysis",
335-338.
McTear, Michael F.:
"Software to support research and development of spoken dialogue systems",
339-342.
Kajarekar, Sachin / Malayath, Narendranath / Hermansky, Hynek:
"Analysis of sources of variability in speech",
343-346.
Shimamura, Tetsuya / Hayakawa, Haruko:
"Adaptive nonlinear prediction based on order statistics for speech signals",
347-350.
Souza, M. N. / Caprini, E. J. / Machado, C. G. / Ludolf, M. V. / Calôba, L. P. / Seixas, J. M. / Resende, F. G. / Netto, S. L. / Freitas, Diamantino R. / Teixeira, Joao Paulo / Espain, C. / Pera, V. / Moreira, F.:
"Developing a voiced information retrieval system for the Portuguese language capable to handle both Brazilian and Portuguese spoken versions",
351-354.
Soraghan, John J. / Hussain, Amir / Shim, Ivy:
"Real-time speech modeling using computationally efficient locally recurrent neural networks (CERNs)",
355-358.
Tokuhira, M. / Ariki, Y.:
"Effectiveness of KL-transformation in spectral delta expansion",
359-362.
Language Identification
Berkling, Kay / Reynolds, Douglas A. / Zissman, Marc:
"Evaluation of confidence measures for language identification",
363-366.
Tsai, Wuei-He / Chang, Wen-Whei:
"Chinese dialect identification using an acoustic-phonotactic model",
367-370.
Cummins, Fred / Gers, Felix / Schmidhuber, Jürgen:
"Language identification from prosody without explicit features",
371-374.
Harbeck, Stefan / Ohler, Uwe:
"Multigrams for language identification",
375-378.
Hombert, Jean-Marie / Maddieson, Ian:
"The use of 'rare' segments for language identification",
379-382.
Itahashi, Shuichi / Kiuchi, Toshikazu / Yamamoto, Mikio:
"Spoken language identification utilizing fundamental frequency and cepstra",
383-386.
Matrouf, D. / Adda-Decker, Martine / Gauvain, Jean-Luc / Lamel, Lori:
"Comparing different model configurations for language identification using a phonotactic approach",
387-390.
Mori, K. / Toba, N. / Harada, T. / Arai, T. / Komatsu, M. / Aoyagi, M. / Murahara, Y.:
"Human language identification with reduced spectral information",
391-394.
Barkat, Melissa / Ohala, John / Pellegrino, François:
"Prosody as a distinctive feature for the discrimination of Arabic dialects",
395-398.
Pellegrino, François / Farinas, Jérôme / André-Obrecht, Régine:
"Comparison of two phonetic approaches to language identification",
399-402.
Speech Recognition - Speaking Rate
Anderson, Stephen / Liberman, Natalie / Gillick, Larry / Foster, Stephen / Hama, Sahoko:
"The effects of speaker training on ASR accuracy",
403-406.
Faltlhauser, Robert / Pfau, Thilo / Ruske, Günther:
"Creating hidden Markov models for fast speech by optimized clustering",
407-410.
Richardson, M. / Hwang, M. / Acero, Alex / Huang, Xuedong:
"Improvements on speech recognition for fast talkers",
411-414.
Saul, Lawrence / Rahim, Mazin:
"Modeling the rate of speech by Markov processes on curves",
415-418.
Tuerk, Andreas / Young, Steve:
"Modelling speaking rate using a between frame distance metric",
419-422.
Speech Acoustics
Bloothooft, Gerrit / Pabon, Peter:
"Vocal registers revisited",
423-426.
Bouteille, Franck / Scalart, Pascal / Corazza, Michel:
"Pseudo affine projection algorithm new solution for adaptive identification",
427-430.
Jesus, Luis M. T. / Shadle, Christine H.:
"Acoustic analysis of a speech corpus of European Portuguese fricative consonants",
431-434.
Petlyuchenko, Natalia:
"Acoustic characteristics of plosives in consonant-consonant sequences at word boundaries",
435-438.
Son, Rob J. J. H. van / Pols, Louis C. W.:
"Effects of stress and lexical structure on speech efficiency",
439-442.
Speech Recognition - Search and Pronunciation Modelling
Abe, Yoshiharu / Itsui, Hiroyasu / Maruta, Yuzo / Nakajima, Kunio:
"A two-stage speech recognition method with an error correction model",
443-446.
Chen, C. Julian:
"Speech recognition with automatic punctuation",
447-450.
Eide, Ellen:
"Automatic modeling of pronunciation variations",
451-454.
Franz, Martin / Novak, Miroslav:
"Reducing search complexity in low perplexity tasks",
455-458.
Coletti, Paolo / Federico, Marcello:
"A two-stage speech recognition method for information retrieval applications",
459-462.
Fosler-Lussier, Eric:
"Multi-level decision trees for static and dynamic pronunciation models",
463-466.
Finke, Michael / Fritsch, Jürgen / Koll, Detlef / Waibel, Alex:
"Modeling and efficient decoding of large vocabulary conversational speech",
467-470.
Husson, Jean-Luc:
"Evaluation of a segmentation system based on multi-level lattices",
471-474.
Hanna, Philip / Stewart, Darryl / Ming, Ji:
"The application of an improved DP match for automatic lexicon generation",
475-478.
Iyer, Rukmini / Kimball, Owen / Gish, Herbert:
"Modeling trajectories in the HMM framework",
479-482.
Kwon, Oh-Wook / Hwang, Kyuwoong / Park, Jun:
"Korean large vocabulary continuous speech recognition using pseudomorpheme units",
483-486.
Kabré, Harouna / Waibel, Alexander:
"Navigating German cities by spontaneous French queries",
487-490.
Korkmazskiy, Filipp / Lee, Chin-Hui:
"Generating alternative pronunciations from a dictionary",
491-494.
Mangu, Lidia / Brill, Eric / Stolcke, Andreas:
"Finding consensus among words: lattice-based word error minimization",
495-498.
Ortmanns, Stefan / Reichl, Wolfgang / Chou, Wu:
"An efficient decoding method for real time speech recognition",
499-502.
Padmanabhan, Mukund / Saon, G. / Basu, S. / Huang, Jing / Zweig, Geoffrey:
"Recent improvements in voicemail transcription",
503-506.
Ramabhadran, Bhuvana / Deligne, Sabine / Ittycheriah, Abraham:
"Acoustics-based baseform generation with pronunciation and/or phonotactic models",
507-510.
Shirosaki, Yasuo / Kikuchi, Hideaki / Shirai, Katsuhiko:
"Improving recognition correct rate of important words in large vocabulary speech recognition",
511-514.
Saraclar, Murat / Nock, Harriet / Khudanpur, Sanjeev:
"Pronunciation modeling by sharing Gaussian densities across phonetic models",
515-518.
Aubert, Xavier L.:
"One pass cross word decoding for large vocabularies based on a lexical tree search organization",
1559-1562.
Prosody - Stress, Accent and Prominence Phrasing
Batliner, Anton / Nutt, M. / Warnke, Volker / Nöth, Elmar / Buckow, Jan / Huber, R. / Niemann, Heinrich:
"Automatic annotation and classification of phrase accents in spontaneous speech",
519-522.
Conkie, Alistair / Riccardi, Giuseppe / Rose, Richard C.:
"Prosody recognition from speech utterances using acoustic and linguistic based models of prosodic events",
523-526.
Fach, Marcus L.:
"A comparison between syntactic and prosodic phrasing",
527-530.
Fivela, Barbara Gili:
"The prosody of left-dislocated topic constituents in Italian read speech",
531-534.
Haas, Jürgen / Warnke, Volker / Niemann, Heinrich / Cettolo, M. / Corazza, A. / Falavigna, D. / Lazzari, G.:
"Semantic boundaries in multiple languages",
535-538.
Kim, Yeon-Jun / Byeon, Heo-Jin / Oh, Yung-Hwan:
"Prosodic phrasing in Korean, determine governor, and then split or not",
539-542.
Mersdorf, Joachim J. / Schmidt, Kai U. / Köster, Stefanie:
"Linear prediction coding of individual pitch accent shapes",
543-546.
Nakatani, Christine H.:
"Prominence variation beyond given/new",
547-550.
Streefkerk, Barbertje M. / Pols, Louis C. W. / Bosch, Louis F. M. ten:
"Acoustical features as predictors for prominence in read aloud Dutch sentences used in ANN's",
551-554.
Theune, Mariet:
"Parallelism, coherence, and contrastive accent",
555-558.
Speech Disorders & Speech for Disabled
Bonneau, Anne / Mokhtari, Parham:
"A phonetically-guided diagnosis of auditory deficiency based on synthetic speech stimuli",
559-562.
Godino-Llorente, Juan I. / Aguilera-Navarro, Santiago / Hernández-Espinosa, Carlos / Fernández-Redondo, Mercedes / Gómez-Vilda, Pedro:
"On the selection of meaningful speech parameters used by a pathologic/non pathologic voice register classifier",
563-566.
Harborg, Erik / Holter, Trym / Johnsen, Magne Hallstein / Svendsen, Torbjørn:
"On-line captioning of TV-programs for the hearing impaired",
567-570.
Jo, Cheol-Woo / Kim, Dae-Hyun:
"Classification of pathological voice into normal/benign/malignant state",
571-574.
Maruyama, Ichiro / Abe, Yoshiharu / Sawamura, Eiji / Mitsuhashi, Tetsuo / Ehara, Terumasa / Shirai, Katsuhiko:
"Cognitive experiments on timing lag for superimposing closed captions",
575-578.
Ogner, Marcel / Kacic, Zdravko:
"Speaker normalization for audio-visual articulation training",
579-582.
Prizl-Jakovac, Tatjana:
"Vowel production in aphasia",
583-586.
Speech Recognition - Multi-stream ASR
Cerisara, Christophe / Haton, Jean-Paul / Fohr, Dominique:
"Towards a global optimization scheme for multi-band speech recognition",
587-590.
Janin, Adam / Ellis, Dan / Morgan, Nelson:
"Multi-stream speech recognition: ready for prime time?",
591-594.
Mirghafori, Nikki / Morgan, Nelson:
"Sooner or later: exploring asynchrony in multi-band speech recognition",
595-598.
Morris, Andrew / Hagen, Astrid / Bourlard, Hervé:
"The full combination sub-bands approach to noise robust HMM/ANN based ASR",
599-602.
Okawa, Shigeki / Nakajima, Takehiro / Shirai, Katsuhiko:
"A recombination strategy for multi-band speech recognition based on mutual information criterion",
603-606.
Speech Generation and Synthesis - Concatenation
Beutnagel, Mark / Mohri, Mehryar / Riley, Michael:
"Rapid unit selection from a large speech corpus for concatenative speech synthesis",
607-610.
Chen, Jing-Dong / Campbell, Nick:
"Objective distance measures for assessing concatenative speech synthesis",
611-614.
Lewis, Eric / Tatham, Mark:
"Word and syllable concatenation in text-to-speech synthesis",
615-618.
Stöber, Karlheinz / Portele, Thomas / Wagner, Petra / Hess, Wolfgang:
"Synthesis by word concatenation",
619-622.
Taylor, Paul / Black, Alan W.:
"Speech synthesis by phonological structure matching",
623-626.
Speech Communication Education
Bloothooft, Gerrit:
"The implementation of a European Masters in language and speech",
627-630.
Cooke, Martin / Parker, Helen / Brown, Guy J. / Wrigley, Stuart N.:
"The interactive auditory demonstrations project",
631-634.
McTear, Michael F.:
"Curricula and courseware in spoken language engineering in Europe: a critical appraisal",
635-638.
Hoffmann, Rüdiger / Ketzmerick, Bettina / Kordon, Ulrich / Kürbis, Steffen:
"An interactive tutorial on text-to-speech synthesis from diphones in time domain",
639-642.
Qvarfordt, Pernilla / Jönsson, Arne:
"Evaluating the dialogue component in the GULAN educational system",
643-646.
Speech Recognition - Broadcast News
Beyerlein, Peter / Aubert, Xavier / Haeb-Umbach, Reinhold / Harris, Matthew / Klakow, Dietrich / Wendemuth, A. / Molau, Sirko / Pitz, Michael / Sixtus, A.:
"The Philips/RWTH system for transcription of broadcast news",
647-650.
Davenport, Jason / Nguyen, Long / Matsoukas, Spyros / Schwartz, Richard / Makhoul, John:
"Toward realtime transcription of broadcast news",
651-654.
Gauvain, Jean-Luc / Lamel, Lori / Adda, Gilles / Jardino, Michèle:
"Recent advances in transcribing television and radio broadcasts",
655-658.
Jang, Photina Jaeyun / Hauptmann, Alexander G.:
"Selection for acoustic coverage from unlimited speech extracted from closed-captioned TV",
659-662.
Kennedy, Paul E. / Hauptmann, Alexander G.:
"Laughter extracted from television closed captions as speech recognizer training data",
663-666.
Nguyen, Long / Matsoukas, Spyros / Davenport, Jason / Liu, Daben / Billa, Jay / Kubala, Francis / Makhoul, John:
"Further advances in transcription of broadcast news",
667-670.
Ohtsuki, Katsutoshi / Furui, Sadaoki / Sakurai, Naoyuki / Iwasaki, Atsushi / Zhang, Zhi-Peng:
"Recent advances in Japanese broadcast news transcription",
671-674.
Pitz, Michael / Molau, Sirko:
"Automatic verification of broadcast news transcriptions",
675-678.
Tritschler, Alain / Gopinath, Ramesh A.:
"Improved speaker segmentation and segments clustering using the Bayesian information criterion",
679-682.
Wu, Xintian / Yan, Yonghong:
"Development of the 1998 OGI-FONIX broadcast news transcription system",
683-686.
Williams, Gethin / Ellis, Daniel P.W.:
"Speech/music discrimination based on posterior probability features",
687-690.
Wegmann, Steven / Zhan, Puming / Carp, Ira / Newman, Michael / Yamron, Jon / Gillick, Larry:
"Dragon Systems' 1998 broadcast news transcription system",
691-694.
Yu, Hua / Finke, Michael / Waibel, Alex:
"Progress in automatic meeting transcription",
695-698.
Prosody - Temporal and/or Intonational Features
Brindöpke, Christel / Fink, Gernot A. / Kummert, Franz:
"A comparative study of HMM-based approaches for the automatic recognition of perceptually relevant aspects of spontaneous German speech melody",
699-710.
Demenko, Grazyna / Jassem, Wiktor:
"Modelling intonational phrase structure with artificial neural networks",
711-714.
Duez, Danielle:
"Effects of articulation rate on duration in read French speech",
715-718.
Gurlekian, Jorge A. / Riccillo, Marcela Leticia / Renato, Alejandro / Alvarez, Jose:
"A semi automatic method for the characterization of Spanish intonation contours",
719-722.
Tolba, Hesham / O'Shaughnessy, Douglas:
"Towards recognizing 'non-lexical' words in spontaneous conversational speech",
723-726.
Isogai, Mitsuaki / Mizuno, Hideyuki:
"A new F0 contour control method based on vector representation of F0 contour",
727-730.
Kleckova, Jana:
"Developing the database of the spontaneous speech prosody characteristics",
731-734.
Möhler, Gregor / Mayer, Jörg:
"A method for the analysis of prosodic registers",
735-738.
Smirnova, Natalia:
"Whole tunes, nuclear and pre-nuclear patterns and prosodic features in the perception of interrogativity and non-finality in Dutch",
739-742.
Wang, Wern-Jun / Liao, Yuan-Fu / Chen, Sin-Horng:
"Prosodic modeling of Mandarin speech and its application to lexical decoding",
743-746.
Zhang, Jin-Song / Kawanami, Hiromichi:
"Modeling carryover and anticipation effects for Chinese tone recognition",
747-750.
Speaker Recognition - Acoustic Features and Robustness
Besacier, Laurent / Luettin, J. / Maitre, G. / Meurville, E.:
"Experimental evaluation of text-independent speaker verification on laboratory and field test databases in the M2VTS project",
751-754.
Balchandran, Rajesh / Ramanujam, Vidhya / Mammone, Richard J.:
"Channel estimation and normalization by coherent spectral averaging for robust speaker verification",
755-758.
Magrin-Chagnolleau, Ivan / Durou, Geoffrey:
"Time-frequency principal components of speech: application to speaker identification",
759-762.
Faúndez-Zanuy, Marcos:
"Speaker recognition by means of a combination of linear and nonlinear predictive models",
763-766.
Jang, Gil-Jin / Yun, Seong-Jin / Oh, Yung-Hwan:
"Feature vector transformation using independent component analysis and its application to speaker identification",
767-770.
Lavner, Yizhar / Rosenhouse, Judith / Gath, Isak:
"The prototype model in speaker identification",
771-774.
Lo, T. F. / Mak, M. W. / Yiu, K. K.:
"A new cepstrum-based channel compensation method for speaker verification",
775-778.
Miyajima, Chiyomi / Watanabe, Hideyuki / Kitamura, Tadashi / Katagiri, Shigeru:
"Speaker recognition based on discriminative feature extraction - optimization of mel-cepstral features using second-order all-pass warping function",
779-782.
Ortega-Garcia, Javier / Cruz-Llanas, Santiago / Gonzalez-Rodriguez, Joaquin:
"Facing severe channel variability in forensic speaker verification conditions",
783-786.
Quatieri, Thomas F. / Singer, E. / Dunn, R. B. / Reynolds, Douglas A. / Campbell, J. P.:
"Speaker and language recognition using speech codec parameters",
787-790.
Ramanujam, Vidhya / Balchandran, Rajesh / Mammone, Richard J.:
"Robust speaker verification in noisy conditions by modification of spectral time trajectories",
791-794.
Vergin, Rivarol / O'Shaughnessy, Douglas / Dumouchel, Pierre:
"Toward parametric representation of speech for speaker recognition systems",
795-798.
Zilca, R. D. / Bistritz, Y.:
"Text independent speaker identification using LSP codebook speaker models and linear discriminant functions",
799-802.
Speech Recognition - Large Vocabulary Continuous Speech Recognition (LVCSR)
Che, Chiwei / Wang, Nick / Huang, Max / Huang, Hank / Seide, Frank:
"Development of the Philips 1999 Taiwan Mandarin benchmark system",
803-806.
Ljolje, Andrej / Riley, Michael D. / Hindle, Donald M.:
"The AT&T large vocabulary conversational speech recognition system",
807-810.
Mohri, Mehryar / Riley, Michael:
"Integrated context-dependent networks in very large vocabulary speech recognition",
811-814.
Reichert, J. / Schultz, Tanja / Waibel, Alex:
"Mandarin large vocabulary speech recognition using the GlobalPhone database",
815-818.
Zheng, Fang / Song, Zhanjiang / Xu, Mingxing / Wu, Jian / Huang, Yinfei / Wu, Wenhu / Bi, Cheng:
"Easytalk: a large-vocabulary speaker-independent Chinese dictation machine",
819-822.
Speech Generation and Synthesis - Systems and Evaluation
Fitt, Susan / Isard, Stephen:
"Synthesis of regional English using a keyword lexicon",
823-826.
Maeda, Noriyasu / Banno, Hideki / Kajita, Shoji / Takeda, Kazuya / Itakura, Fumitada:
"Speaker conversion through non-linear frequency warping of STRAIGHT spectrum",
827-830.
McInnes, F. R. / Attwater, D. J. / Edgington, Michael D. / Schmidt, Mark S. / Jack, Mervyn A.:
"User attitudes to concatenated natural speech and text-to-speech synthesis in an automated information service",
831-834.
Traber, Christof / Huber, Karl / Nedir, Karim / Pfister, Beat / Keller, Eric / Zellner, Brigitte:
"From multilingual to polyglot speech synthesis",
835-838.
Tanaka, Kimihito / Mizuno, Hideyuki / Abe, Masanobu / Nakajima, Shin'ya:
"A Japanese text-to-speech system based on multi-form units with consideration of frequency distribution in Japanese",
839-842.
Speech Technology for Language Learning
Deville, G. / Deroo, O. / Leich, Henri / Gielen, S. / Vanparys, J.:
"Automatic detection and correction of pronunciation errors for foreign language learners: the Demosthenes application",
843-846.
Eskenazi, Maxine / Hansma, Scott / Corwin, John / Albornoz, Jordi:
"User adaptation in the Fluency pronunciation trainer",
847-850.
Franco, Horacio / Neumeyer, Leonardo / Ramos, María / Bratt, Harry:
"Automatic detection of phone-level mispronunciation for language learning",
851-854.
Herron, Daniel / Menzel, Wolfgang / Atwell, Eric / Bisiani, Roberto / Daneluzzi, Fabio / Morton, Rachel / Schmidt, Juergen A.:
"Automatic localization and diagnosis of pronunciation errors for second-language learners of English",
855-858.
Vicsi, Klára / Roach, Peter / Öster, Annemarie / Kacic, Zdravko / Barczikay, P. / Sinka, I.:
"SPECO - a multimedia multilingual teaching and training system for speech handicapped children",
859-862.
Speech Recognition - Multilinguality
Ahadi, S. M.:
"Recognition of continuous Persian speech using a medium-sized vocabulary speech corpus",
863-866.
Fegyó, Tibor / Tatai, Péter:
"Multi-lingual speech recognition based on demi-syllable subword units",
867-870.
Fung, Pascale / Ma, Chi Yuen / Liu, Wai Kat:
"MAP-based cross-language adaptation augmented by linguistic knowledge: from English to Chinese",
871-874.
Grocholewski, Stefan:
"Analysis of HMM models in alphabet letters recognition",
875-878.
Hirose, Keikichi / Zhang, Jin-song:
"Tone recognition of Chinese continuous speech using tone critical segments",
879-882.
Ho, Tai-Hsuan / Liu, Chin-Jung / Sun, Herman / Tsai, Ming-Yi / Lee, Lin-Shan:
"Phonetic state tied-mixture tone modeling for large vocabulary continuous Mandarin speech recognition",
883-886.
Imperl, Bojan / Horvat, Bogomir:
"The clustering algorithm for the definition of multilingual set of context dependent speech models",
887-890.
Liu, Jian / He, Xiaodong / Mo, Fuyuan / Yu, Tiecheng:
"Study on tone classification of Chinese continuous speech in speech recognition system",
891-894.
Liu, Yi / Fung, Pascale:
"Decision tree-based triphones are robust and practical for Mandarin speech recognition",
895-898.
López de Ipiña, K. / Varona, A. / Torres, I. / Rodríguez, L. J.:
"Decision trees for inter-word context dependencies in Spanish continuous speech recognition tasks",
899-902.
Nassar, Amin M. / Abdel Kader, Nemat S. / Refat, Amr M.:
"End points detection for noisy speech using a wavelet based algorithm",
903-906.
Nieuwoudt, C. / Botha, E. C.:
"Adaptation of acoustic models for multilingual recognition",
907-910.
Uebler, Ulla / Boros, Manuela:
"Recognition of non-native German speech with multilingual recognizers",
911-914.
Systems, Architectures, Interfaces
Altosaar, Toomas / Millar, Bruce / Vainio, Martti:
"Relational vs. object-oriented models for representing speech: a comparison using ANDOSL data",
915-918.
Draxler, Christoph / Grudszus, Robert / Euler, Stephan / Bengler, Klaus:
"First experiences of the German SpeechDat-Car database collection in mobile environments",
919-922.
Edgington, Mike / Attwater, David / Durston, Peter:
"OASIS - a framework for spoken language call steering",
923-926.
Gegenmantel, Eike:
"VOCAPI - small standard API for command & control",
927-930.
Müller, Christel / Schröder, Karsten:
"Standardised speech interfaces - key for objective evaluation of recognition accuracy",
931-934.
Matsunaga, Shoichi / Noda, Yoshiaki / Ohtsuki, Katsutoshi / Doi, Eiji / Itoh, Tomio:
"A medical rehabilitation diagnoses transcription method that integrates continuous and isolated word recognition",
935-938.
Németh, Géza / Zainkó, Csaba / Olaszy, Gábor / Prószéky, Gábor:
"Problems of creating a flexible e-mail reader for Hungarian",
939-942.
Olaszy, Gábor / Németh, Géza / Olaszi, Péter / Gordos, Géza:
"Interactive, TTS supported speech message composer for large, limited vocabulary, but open information systems",
943-946.
Penn, Gerald / Carpenter, Bob:
"ALE for speech: a translation prototype",
947-950.
Rodríguez, L. J. / Torres, M. I. / Alcaide, J. M. / Varona, A. / López de Ipiña, K. / Penagarikano, M. / Bordel, G.:
"An integrated system for Spanish CSR tasks",
951-954.
Sanderman, Angelien / Bosgoed, Ellen / Graaff, Hans de / Splunder, Peter van:
"Use of speech synthesis in an application",
955-958.
Tamura, Masatsune / Kondo, Shigekazu / Masuko, Takashi / Kobayashi, Takao:
"Text-to-audio-visual speech synthesis based on parameter generation from HMM",
959-962.
Wouters, Johan / Rundle, Brian / Macon, Michael W.:
"Authoring tools for speech synthesis using the SABLE markup standard",
963-966.
Speaker Recognition - Scoring and Decision
Ariyaeeinia, A. M. / Sivakumaran, P. / Pawlewski, M. / Loomes, M. J.:
"Dynamic weighting of the distortion sequence in text-dependent speaker verification",
967-970.
Altincay, Hakan / Demirekler, Mübeccel:
"On the use of supra model information from multiple classifiers for robust speaker identification",
971-974.
El-Maliki, Mounir / Drygajlo, Andrzej:
"Missing features detection and handling for robust speaker verification",
975-978.
Fakotakis, Nikos / Sirigos, John / Kokkinakis, George:
"High performance text-independent speaker recognition system based on voiced/unvoiced segmentation and multiple neural nets",
979-982.
Fredouille, Corinne / Bonastre, Jean-François / Merlin, Teva:
"Similarity normalization method based on world model and a posteriori probability for speaker verification",
983-986.
Isobe, Toshihiro / Takahashi, Jun-ichi:
"Text-independent speaker verification using virtual speaker based cohort normalization",
987-990.
Luettin, J. / Ben-Yacoub, S.:
"Robust person verification based on speech and facial images",
991-994.
Mathew, M. / Yegnanarayana, B. / Sundar, R.:
"A neural network-based text-dependent speaker verification system using suprasegmental features",
995-998.
Pelecanos, Jason / Sridharan, Sridha:
"Modelling output probability distributions for enhancing speaker recognition",
999-1002.
Rodríguez-Linares, L. / García-Mateo, C. / Alba-Castro, J. L.:
"On the use of neural networks to combine utterance and speaker verification systems in a text-dependent speaker verification task",
1003-1006.
Ruiz-Mezcua, B. / Rodríguez-Galán, R. / Hernández-Gómez, Luis A. / Domingo-García, Paloma / Bailly-Baillière Gutiérrez, Enrique:
"Genesys: a neural network model for speaker identification",
1007-1010.
Sabac, Bogdan / Gavat, Inge:
"Speaker verification with growing cell structures",
1011-1014.
Tadj, Chakib / Dumouchel, Pierre / Mihoubi, Mohamed / Ouellet, Pierre:
"Environment adaptation and long term parameters in speaker identification",
1015-1018.
Yoshida, K. / Takagi, K. / Ozeki, K.:
"Speaker identification using subband HMMs",
1019-1022.
Zhang, W. D. / Yiu, K. K. / Mak, M. W. / Li, C. K. / He, M. X.:
"A priori threshold determination for phrase-prompted speaker verification",
1023-1026.
Speech Recognition - Broadcast News
Harris, Matthew / Aubert, Xavier / Haeb-Umbach, Reinhold / Beyerlein, Peter:
"A study of broadcast news audio stream segmentation and segment clustering",
1027-1030.
Liu, Daben / Kubala, Francis:
"Fast speaker change detection for broadcast news transcription and indexing",
1031-1034.
Palmer, David D. / Ostendorf, Mari / Burger, John D.:
"Robust information extraction from spoken language data",
1035-1038.
Renals, Steve / Gotoh, Yoshihiko:
"Integrated transcription and identification of named entities in broadcast speech",
1039-1042.
Woodland, P. C. / Odell, J. J. / Hain, T. / Moore, G. L. / Niesler, T. R. / Tuerk, Andreas / Whittaker, E. W. D.:
"Improvements in accuracy and speed in the HTK broadcast news transcription system",
1043-1046.
Speech Generation and Synthesis - Acoustic Synthesis
Acero, Alex:
"Formant analysis and synthesis using hidden Markov models",
1047-1050.
Bailly, Gérard:
"Accurate estimation of sinusoidal parameters in an harmonic+noise model for speech synthesis",
1051-1054.
Laine, Unto K.:
"Modal synthesis and modeling of vowels",
1055-1058.
O'Brien, Darragh / Monaghan, Alex I. C.:
"Shape invariant pitch modification of speech using a harmonic model",
1059-1062.
Beutnagel, Mark / Conkie, Alistair:
"Interaction of units in a unit selection database",
1063-1066.
Disorders in Speech Production and/or Speech Perception
García Gómez, Ramón / López Barquilla, Ricardo / Puertas Tera, José Ignacio / Parera Bermudez, José / Haton, Marie-Christine / Haton, Jean-Paul / Alinat, Pierre / Moreno, Sofia / Hess, Wolfgang / Sanchez Raya, Ma Araceli / Martínez Gual, Eduardo Alberto / Navas-Chaveli Daza, Juan Luis / Antoine, Christophe / Durel, Marie-Madeleine / Maugin, Genevieve / Hohmann, Silke:
"Speech training for deaf and hearing-impaired people",
1067-1070.
Hoshino, Shinichi / Kaneko, Itaru / Kikuchi, Hideaki / Shirai, Katsuhiko:
"A post-processing of speech for hearing impaired integrate into standard digital audio decoders",
1071-1074.
Imatomi, Setsuko / Arai, Takayuki / Mimura, Yuko / Kato, Masako:
"Effects of hoarseness on hypernasality ratings",
1075-1078.
Rezaei-Aghbash, N. / Whiteside, S. P. / Cudd, P. A.:
"Cross-language analysis of voice onset time in stuttered speech",
1079-1082.
Speech Recognition - Acoustic Modelling 1
Boulianne, Gilles / Brousseau, Julie / Talbot, Nathalie / Dumouchel, Pierre:
"Experiments in constrained maximum likelihood extraction of temporal features for speech recognition",
1083-1086.
Chen, S. S. / Gopinath, Ramesh A.:
"Model selection in acoustic modeling",
1087-1090.
Wong, Y. W. / Chow, K. F. / Lau, Wai H. / Lo, W. K. / Lee, Tan / Ching, P. C.:
"Acoustic modeling and language modeling for Cantonese LVCSR",
1091-1094.
Deroo, O. / Ris, C. / Dupont, S.:
"Context dependent hybrid HMM/ANN systems for large vocabulary continuous speech recognition system",
1095-1098.
Fischer, V. / Ross, T.:
"Reduced Gaussian mixture models in a large vocabulary continuous speech recognizer",
1099-1102.
Fritsch, J.:
"Mixture trees - hierarchically tied mixture densities for modeling HMM emission probabilities",
1103-1106.
Ichikawa, Akira / Shimizu, Tomoyuki / Horiuchi, Yasuo:
"Reinforcement learning for phoneme recognition",
1107-1110.
McCourt, Paul / Harte, Naomi / Vaseghi, Saeed:
"Combined temporal and spectral multi-resolution phonetic modelling",
1111-1114.
Novak, Miroslav / Picheny, Michael:
"Speed improvement of the time-asynchronous acoustic fast match",
1115-1118.
Sirigos, John / Fakotakis, Nikos / Kokkinakis, George:
"A hybrid ANN/HMM syllable recognition module based on vowel spotting",
1119-1122.
Shire, Michael L.:
"Data-driven modulation filter design under adverse acoustic conditions and using phonetic and syllabic units",
1123-1126.
Xu, Wei / Duchateau, Jacques / Demuynck, Kris / Dologlou, Ioannis / Wambacq, Patrick / Compernolle, Dirk van / Hamme, Hugo van:
"Accuracy versus complexity in context dependent phone modeling",
1127-1130.
Zhou, Jianlai / He, Xiaodong / Yu, Tiecheng / Mo, Fuyuan:
"A new hybrid structure of speech recognizer based on HMM and neural network",
1131-1134.
Zweig, Geoffrey / Padmanabhan, Mukund:
"Dependency modeling with Bayesian networks in a voicemail transcription system",
1135-1138.
Dialogue 1
Asoh, Hideki / Matsui, Toshihiro / Fry, John / Asano, Futoshi / Hayamizu, Satoru:
"A spoken dialog system for a mobile office robot",
1139-1142.
Bell, Linda / Gustafson, Joakim:
"Interaction with an animated agent in a spoken dialogue system",
1143-1146.
Bernsen, Niels Ole / Dybkjaer, Laila / Heid, Ulrich:
"Current practice in the development and evaluation of spoken language dialogue systems",
1147-1150.
Gustafson, Joakim / Lindberg, Nikolaj / Lundeberg, Magnus:
"The August spoken dialogue system",
1151-1154.
Grisvard, Olivier / Gaiffe, Bertrand:
"An event-based dialogue model and its implementation in multidial2",
1155-1158.
Huang, Chao / Xu, Peng / Zhang, Xin / Zhao, Shubin / Huang, Taiyi / Xu, Bo:
"LODESTAR: a Mandarin spoken dialogue system for travel information retrieval",
1159-1162.
Sasajima, Munehiko / Yano, Takehide / Kono, Yasuyuki:
"EUROPA: a generic framework for developing spoken dialogue systems",
1163-1166.
Nakano, Mikio / Dohsaka, Kohji / Miyazaki, Noboru / Hirasawa, Jun-ichi / Tamoto, Masafumi / Kawamori, Masahito / Sugiyama, Akira / Kawabata, Takeshi:
"Handling rich turn-taking in spoken dialogue systems",
1167-1170.
Pirker, Hannes / Loderer, Georg / Trost, Harald:
"Thus spoke the user to the wizard",
1171-1174.
Pargellis, Andrew / Kuo, Hon-Kwang Jeff / Lee, Chin-Hui:
"Automatic dialogue generator creates user defined applications",
1175-1178.
Relaño Gil, José / Tapias, Daniel / Villar-Navarro, Juan Manuel / Gancedo, Maria C. / Hernández-Gómez, Luis A.:
"Flexible mixed-initiative dialogue for telephone services",
1179-1182.
Veldhuijzen van Zanten, Gert:
"User modelling in adaptive dialogue management",
1183-1186.
Speaker Recognition and Topic Detection
Ashour, Gal / Gath, Isak:
"Characterization of speech during imitation",
1187-1190.
Bovbel, Evgeny I. / Tkachova, Polina P. / Kheidorov, Igor E.:
"The analysis of speaker individual features based on autoregressive hidden Markov models",
1191-1194.
Delacourt, Perrine / Kryze, David / Wellekens, Christian J.:
"Detection of speaker changes in an audio document",
1195-1198.
Glaeser, Axel:
"Dynamic test durations for text-independent speaker verification systems",
1199-1202.
Kolano, Guido / Regel-Brietzmann, Peter:
"Combination of vector quantization and Gaussian mixture models for speaker verification with sparse training data",
1203-1206.
Li, Qi / Tsai, Augustine / Kim, Weon-Goo:
"A language-independent personal voice controller with embedded speaker verification",
1207-1210.
Lindberg, Johan / Blomberg, Mats:
"Vulnerability in speaker verification - a study of technical impostor techniques",
1211-1214.
McLaughlin, Jack / Reynolds, Douglas A. / Gleason, Terry:
"A study of computation speed-ups of the GMM-UBM speaker recognition system",
1215-1218.
Maes, Stéphane H.:
"Conversational biometrics",
1219-1222.
Masuko, Takashi / Hitotsumatsu, Takafumi / Tokuda, Keiichi / Kobayashi, Takao:
"On the security of HMM-based speaker verification systems against imposture using synthetic speech",
1223-1226.
Majewski, Wojciech / Mazur-Majewska, Grazyna:
"Speech signal parametrization for speaker recognition under voice disguise conditions",
1227-1230.
Satué-Villar, Antonio / Faúndez-Zanuy, Marcos:
"On the relevance of language in speaker recognition",
1231-1234.
Yamashita, Yoichi:
"Prediction of keyword spotting accuracy based on simulation",
1235-1238.
Speech Recognition - Search
Castro, M. J. / Llorens, D. / Sánchez, Joan-Andreu / Casacuberta, F. / Aibar, P. / Segarra, E.:
"A fast version of the ATROS system",
1239-1242.
Goel, Vaibhava / Byrne, William:
"Task dependent loss functions in speech recognition: A* search over recognition lattices",
1243-1246.
Hanzl, Václav:
"Theory of structured cogitation in speech recognition",
1247-1250.
Ljolje, Andrej / Pereira, Fernando / Riley, Michael:
"Efficient general lattice generation and rescoring",
1251-1254.
Xu, Mingxing / Zheng, Fang / Wu, Wenhu:
"A fast and effective state decoding algorithm",
1255-1258.
Systems, Architectures
Jeanrenaud, Philippe / Cockroft, Greg / VanderHeidjen, Allard:
"A multimodal, multilingual telephone application: the Wildfire electronic assistant",
1259-1262.
Os, Els den / Jongebloed, Hans / Stijsiger, Alice / Boves, Lou:
"Speaker verification as a user-friendly access for the visually impaired",
1263-1266.
Robinson, Tony / Abberley, Dave / Kirby, David / Renals, Steve:
"Recognition, indexing and retrieval of British broadcast news with the THISL system",
1267-1270.
Seneff, Stephanie / Lau, Raymond / Polifroni, Joseph:
"Organization, communication, and control in the GALAXY-II conversational system",
1271-1274.
Pateras, Claudia / Chapados, Nicolas / Kwan, Remi / Lavoie, Dominic / Tremblay, Réal:
"A mixed-initiative natural dialogue system for conference room reservation",
1275-1278.
Audio-Visual Speech
Kuratate, Takaaki / Munhall, Kevin G. / Rubin, Philip E. / Vatikiotis-Bateson, Eric / Yehia, Hani:
"Audio-visual synthesis of talking faces from speech production correlates",
1279-1282.
MacDonald, John / Andersen, Soren / Bachmann, Talis:
"Hearing by eye: visual spatial degradation and the McGurk effect",
1283-1286.
Nankaku, Yoshihiko / Tokuda, Keiichi / Kitamura, Tadashi:
"Intensity- and location-normalized training for HMM-based visual speech recognition",
1287-1290.
Potamianos, Gerasimos / Potamianos, Alexandros:
"Speaker adaptation for audio-visual speech recognition",
1291-1294.
Radeau, M. / Colin, C.:
"The role of spatial separation on ventriloquism and McGurk illusions",
1295-1298.
Speech Recognition - Acoustic Modelling 2
Castro, M. J. / Casacuberta, F.:
"Hybrid connectionist-structural acoustical modeling in the ATROS system",
1299-1302.
Delphin-Poulat, Lionel / Idier, Jérôme:
"Path-dependent Kalman estimation of a cepstral bias",
1303-1306.
Dobrisek, S. / Mihelic, F. / Pavesic, N.:
"Acoustical modelling of phone transitions: biphones and diphones - what are the differences?",
1307-1310.
Demuynck, Kris / Duchateau, Jacques / Compernolle, Dirk Van:
"Optimal feature sub-space selection based on discriminant analysis",
1311-1314.
Debyeche, Mohamed / Afify, Mohamed / Haton, Jean-Paul:
"Phoneme recognition system based on HMM with distributed VQ codebook",
1315-1318.
He, Xiaodong / Liu, Jian / Zhou, Jianlai / Yu, Tiecheng:
"Research on speech units modeling in continuous speech recognition",
1319-1322.
Haeb-Umbach, Reinhold / Loog, Marco:
"An investigation of cepstral parameterisations for large vocabulary speech recognition",
1323-1326.
Hain, T. / Woodland, P. C.:
"Dynamic HMM selection for continuous speech recognition",
1327-1330.
Jiang, Li / Huang, Xuedong:
"Unified decoding and feature representation for improved speech recognition",
1331-1334.
Kim, DongHwa / Liu, Chaojun / Wu, Xintian / Yan, Yonghong:
"High accuracy acoustic modeling based on multi-stage decision tree",
1335-1338.
Ma, Jeff / Deng, Li:
"Optimization of dynamic regimes in a statistical hidden dynamic model for conversational speech recognition",
1339-1342.
Marino, José B. / Nogueiras-Rodríguez, Albino:
"Top-down bottom-up hybrid clustering algorithm for acoustic-phonetic modeling of speech",
1343-1346.
Nakamura, Atsushi / Matsui, Tomoko:
"Acoustic modeling based on a generalized Laplacian distribution",
1347-1350.
Reinhard, Klaus / Niranjan, Mahesan:
"Diphone subspace models for phone-based HMM complementation",
1351-1354.
Singer, Harald / Nakamura, Atsushi:
"Unified framework for acoustic topology modelling: ML-SSS and question-based decision trees",
1355-1358.
Doherty, B. / Vaseghi, Saeed / McCourt, Paul:
"Linear transformations in sub-band groups for speech recognition",
1359-1366.
Lu, Peng-Ren / Hong, Wei-Tyng / Chiang, Sheng-Lun / Wang, Yih-Ru / Chen, Sin-Horng:
"A prototype of Mandarin speech telephone number inquiry system",
Abstract.
Witt, Silke / Young, Steve:
"Off-line acoustic modelling of non-native accents",
1367-1370.
Wightman, Colin W. / Harder, Ted A.:
"Semi-supervised adaptation of acoustic models for large-volume dictation",
1371-1374.
Dialogue 2
Ammicht, Egbert / Gorin, Allen / Alonso, Tirso:
"Knowledge collection for natural language spoken dialog systems",
1375-1378.
Byron, Donna K.:
"Improving discourse management in TRIPS-98",
1379-1382.
Wu, Chung-Hsien / Yan, Gwo-Lang / Lin, Chien-Liang:
"Speech act modeling in a spoken dialogue system using fuzzy hidden Markov model and Bayes' decision criterion",
1383-1386.
Ehrlich, Ute:
"Task hierarchies representing sub-dialogs in speech dialog systems",
1387-1390.
Hirasawa, Jun-Ichi / Nakano, Mikio / Kawabata, Takeshi / Aikawa, Kiyoaki:
"Effects of system barge-in responses on user impressions",
1391-1394.
López-Cózar, R. / Rubio, Antonio J. / García, P. / Segura, J. C.:
"A new word-confidence threshold technique to enhance the performance of spoken dialogue systems",
1395-1398.
Lavelle, C. Alexia / Calmés, Martine de / Pérennou, Guy:
"Confirmation strategies to improve correction rates in a telephonic inquiry dialogue system",
1399-1402.
Niimi, Yasuhisa / Nishimoto, Takuya:
"Mathematical analysis of dialogue control strategies",
1403-1406.
Ocelikova, Jana / Matousek, Vaclav:
"Processing of anaphoric and elliptic sentences in a spoken dialog system",
1407-1410.
Papineni, K. A. / Roukos, Salim / Ward, T.:
"Free-flow dialog management using forms",
1411-1414.
Ries, Klaus:
"Towards the detection and description of textual meaning indicators in spontaneous conversations",
1415-1418.
Sturm, Janienke / Os, Els den / Boves, Lou:
"Dialogue management in the Dutch ARISE train timetable information system",
1419-1422.
Krahmer, Emiel / Swerts, Marc / Theune, Mariet / Weegels, Mieke:
"Problem spotting in human-machine interaction",
1423-1426.
Lin, Bor-shen / Wang, Hsin-min / Lee, Lin-shan:
"Consistent dialogue across concurrent topics based on an expert system model",
1427-1430.
Speech Coding
Chapman, Thomas M. / Xydeas, C. S.:
"Secondary codebook storage quantisation",
1431-1434.
Edmondson, W. H. / Iskra, D. J. / Kienzle, P.:
"Pseudo-articulatory representations: promise, progress and problems",
1435-1438.
Gao, Ge / Ching, P. C.:
"A 1.7 kbps waveform interpolation speech coder using decomposition of pitch cycle waveform",
1439-1442.
Gottesman, Oded / Gersho, Allen:
"Enhanced analysis-by-synthesis waveform interpolative coding at 4 kbps",
1443-1446.
Görtz, Norbert:
"Joint source-channel decoding by channel-coded optimal estimation (CCOE) for a CELP speech codec",
1447-1450.
Li, Chunyan / Gersho, Allen / Cuperman, Vladimir:
"Analysis-by-synthesis low-rate multimode harmonic speech coding",
1451-1454.
Lois, László:
"Variable length coding of transformed LSF coefficients",
1455-1458.
Mayrench, R. / Malah, D.:
"Low bit-rate speech coding using quantization of variable length segments",
1459-1462.
Martin, Rainer / Kang, Hong-Goo / Cox, Richard V.:
"Low delay analysis/synthesis schemes for joint speech enhancement and low bit rate speech coding",
1463-1466.
Oliva, Oscar / Faúndez-Zanuy, Marcos:
"A comparative study of several ADPCM schemes with linear and nonlinear prediction",
1467-1470.
Ohmura, H. / Tanaka, K.:
"Segmental feature extraction and coding for speech synthesis",
1471-1474.
Peláez-Moreno, C. / Díaz-de-María, F.:
"Backward adaptive RBF-based hybrid predictors for CELP-type coders at medium bit-rates",
1475-1478.
Sercov, Valentin V. / Petrovsky, Alexander A.:
"An improved speech model with allowance for time-varying pitch harmonic amplitudes and frequencies in low bit-rate MBE coders",
1479-1482.
Petrinovic, Davor / Petrinovic, Davorka:
"Sparse vector linear prediction matrices with multidiagonal structure",
1483-1486.
Stefanovic, M. / Kondoz, A.:
"Source-dependent variable rate speech coding below 3 kbps",
1487-1490.
Chen, Xiaoping / Song, Yantao / Yu, Tiecheng:
"A novel speech coding approach based on half-wave vector quantization",
1491-1494.
Zolfaghari, Parham / Robinson, Tony:
"Speech coding using mixture of Gaussians polynomial model",
1495-1498.
Speech Recognition - Acoustic Modelling 1
Deng, Li / Ma, Jeff:
"A statistical coarticulatory model for the hidden vocal-tract-resonance dynamics",
1499-1502.
Albesano, D. / Mori, R. De / Gemello, R. / Mana, F.:
"A study on the effect of adding new dimensions to trajectories in the acoustic space",
1503-1506.
Gales, M. J. F. / Olsen, P. A.:
"Tail distribution modelling using the Richter and power exponential distributions",
1507-1510.
Zhao, Qingwei / Wang, Zuoying / Lu, Dajin:
"A study of duration in continuous speech recognition based on DDBHMM",
1511-1514.
Vaich, T. / Cohen, A.:
"Comparison of continuous-density and semi-continuous HMM in isolated words recognition systems",
1515-1518.
Dialogue
Chu-Carroll, Jennifer:
"Form-based reasoning for mixed-initiative dialogue management in information-query systems",
1519-1522.
Dahlbäck, Nils / Jönsson, Arne:
"Knowledge sources in spoken dialogue systems",
1523-1526.
Os, Els den / Boves, Lou / Lamel, Lori / Baggia, Paolo:
"Overview of the ARISE project",
1527-1530.
Rudnicky, Alexander I. / Thayer, E. / Constantinides, Paul / Tchou, C. / Shern, R. / Lenzo, Kevin / Xu, W. / Oh, A.:
"Creating natural dialogs in the Carnegie Mellon Communicator system",
1531-1534.
Rosset, Sophie / Bennacef, Samir / Lamel, Lori:
"Design strategies for spoken language dialog systems",
1535-1538.
Wideband and Perceptually Based Coding
Amodio, A. / Feng, Gang:
"A wideband speech coder based on harmonic coding at 16 kbps",
1539-1542.
Bernard, Alexis / Alwan, Abeer:
"Perceptually based and embedded wideband CELP coding of speech",
1543-1546.
Földvári, Rudolf / Gyimesi, László:
"Very low bit rate voice coder based on a nonlinear hearing model",
1547-1550.
Perreau-Guimaraes, Marcos / Bonnet, Madeleine / Moreau, Nicolas:
"Low complexity bit allocation algorithm with psychoacoustical optimisation",
1551-1554.
Wan, Wanggen / Au, Oscar C. / Keung, Cyan L. / Yim, Chi H.:
"A novel approach of low bit-rate speech coding based on sinusoidal representation and auditory model",
1555-1558.
Speech Recognition - Language Modelling
Beaujard, Christel / Jardino, Michèle:
"Language modeling based on automatic word concatenations",
1563-1566.
Chelba, Ciprian / Jelinek, Frederick:
"Recognition performance of a structured language model",
1567-1570.
Chow, Vincent / Wu, Dekai:
"On the use of right context in sense-disambiguating language models",
1571-1574.
Donnelly, Paul G. / Smith, F. J. / Sicilia, E. / Ming, Ji:
"Language modelling with hierarchical domains",
1575-1578.
Damnati, Géraldine:
"Integration of several information sources for robust class-based statistical language modelling",
1579-1582.
Federico, Marcello:
"Efficient language model adaptation through MDI estimation",
1583-1586.
Gaudinat, Arnaud / Goldman, Jean-Philippe / Wehrli, Eric:
"Syntax-based speech recognition: how a syntactic parser can help a recognition system",
1587-1590.
Ito, Akinori / Kohda, Masaki / Ostendorf, Mari:
"A new metric for stochastic language model evaluation",
1591-1594.
Kuo, Hong-Kwang Jeff / Reichl, Wolfgang:
"Phrase-based language models for speech recognition",
1595-1598.
Kobayashi, Norihiko / Kobayashi, Tetsunori:
"Class-combined word n-gram for robust language modeling",
1599-1602.
Martins, Ciro / Neto, Joao P. / Almeida, Luís B.:
"Using partial morphological analysis in language modeling estimation for large vocabulary Portuguese speech recognition",
1603-1606.
Ohler, Uwe / Harbeck, Stefan / Niemann, Heinrich:
"Discriminative training of language model classifiers",
1607-1610.
Zhang, Shuwu / Singer, Harald / Wu, Dekai / Sagisaka, Yoshinori:
"Improving n-gram modeling using distance-related unit association maximum entropy language modeling",
1611-1614.
Prosody - Study of Prosody for Speech Synthesis
Chen, Aimin / Wong, Shu Lian / Vaseghi, Saeed / Ho, Charles:
"Decision tree micro-prosody structures for text to speech synthesis",
1615-1618.
Córdoba, R. / Vallejo, J. A. / Montero, J. M. / Gutierrez-Arriola, J. / López, M. A. / Pardo, Juan Manuel:
"Automatic modeling of duration in a Spanish text-to-speech system using neural networks",
1619-1622.
Clark, Robert A.J. / Dusterhoff, Kurt E.:
"Objective methods for evaluating synthetic intonation",
1623-1626.
Dusterhoff, Kurt E. / Black, Alan W. / Taylor, Paul:
"Using decision trees within the Tilt intonation model to predict F0 contours",
1627-1630.
Esposito, Richard / Yang, Li-chiung:
"Levels of prosodic representation in spoken discourse: an empirical approach",
1631-1634.
Fernández-Salgado, Xavier / Banga, Eduardo R.:
"Segmental duration modelling in a text-to-speech system for the Galician language",
1635-1638.
Hirst, Daniel:
"The symbolic coding of segmental duration and tonal alignment: an extension to the INTSINT system",
1639-1642.
Morlec, Yann / Bailly, Gérard / Aubergé, Véronique:
"Training an application-dependent prosodic model: corpus, model and evaluation",
1643-1646.
Sheikhzadeh, H. / Eshkevari, A. / Khayatian, M. / Sadigh, R. / Ahadi, S. M.:
"Farsi language prosodic structure, research and implementation using a speech synthesizer",
1647-1650.
Teixeira, Joao Paulo / Paulo, Elisabete Rosa / Freitas, Diamantino / Pinto, Maria da Graca:
"Acoustical characterisation of the accented syllable in Portuguese, a contribution to the naturalness of speech synthesis",
1651-1654.
Wang, Changfu / Fujisaki, Hiroya / Ohno, Sumio / Kodama, Tomohiro:
"Analysis and synthesis of the four tones in connected speech of the standard Chinese based on a command-response model",
1655-1658.
Williams, Sandra / Watson, Catherine I.:
"A profile of the discourse and intonational structures of route descriptions",
1659-1662.
Speech Perception 1
Amano, Shigeaki / Kondo, Tadahisa:
"Neighborhood effects on spoken word recognition in Japanese",
1663-1666.
Chéreau, C. / Hallé, P. A. / Segui, J.:
"Interference between surface form and abstract representation in spoken word perception",
1667-1670.
Colin, C. / Radeau, M.:
"Are the McGurk illusions affected by left or right presentation of the speaker face?",
1671-1674.
Dupoux, Emmanuel / Fushimi, Takao / Kakehi, Kazuhiko / Mehler, Jacques:
"Prelexical locus of an illusory vowel effect in Japanese",
1675-1678.
Feijóo, Sergio / Fernández, Santiago / Barros, Nieves / Balsa, Ramón:
"Acoustic and perceptual characteristics of the Spanish fricatives",
1679-1686.
Gelin, Philippe / Junqua, Jean-Claude:
"Techniques for robust speech recognition in the car environment",
Abstract.
Karlsson, Fredrik / Eriksson, Anders:
"Difference limen for formant frequency discrimination at high fundamental frequencies",
1687-1690.
Sá Marta, Eduardo / Vieira de Sá, Luis:
"Auditory features for human communication of stop consonants under full-band and low-pass conditions",
1691-1694.
Widera, Christina / Portele, Thomas:
"Levels of reduction for German tense vowels",
1695-1698.
Speech Recognition - Acoustic Modelling 2
Souza, Peter de / Ramabhadran, Bhuvana / Gao, Yuqing / Picheny, Michael:
"Enhanced likelihood computation using regression",
1699-1702.
Liu, Chaojun / Wu, Xintian / Yan, Yonghong:
"High accuracy acoustic modeling using two-level decision-tree based state-tying",
1703-1706.
Singh, R. / Raj, B. / Stern, Richard M.:
"Domain adduced state tying for cross-domain acoustic modelling",
1707-1710.
Sankar, Ananth / Rao Gadde, Venkata Ramana:
"Parameter tying and Gaussian clustering for faster, better, and smaller speech recognition",
1711-1714.
Schlüter, Ralf / Macherey, Wolfgang / Müller, Boris / Ney, Hermann:
"A combined maximum mutual information and maximum likelihood approach for mixture density splitting",
1715-1718.
Multimodal Interaction
Julia, Luc / Cheyer, Adam:
"Is talking to virtual more realistic?",
1719-1722.
Matsusaka, Yosuke / Tojo, Tsuyoshi / Kubota, Sentaro / Furukawa, Kenji / Tamiya, Daisuke / Hayata, Keisuke / Nakano, Yuichiro / Kobayashi, Tetsunori:
"Multi-person conversation via multi-modal interface - a robot who communicate with multi-user -",
1723-1726.
Narayanan, Shrikanth / Potamianos, Alexandros / Wang, Haohong:
"Multimodal systems for children: building a prototype",
1727-1730.
Okada, Michio / Suzuki, Noriko / Date, Masaaki:
"Social bonding in talking with social autonomous creatures",
1731-1734.
Purson, A. / Santi, S. / Bertrand, R. / Guaitella, Isabelle / Boyer, J. / Cavé, C.:
"The relationships between voice and gesture: eyebrow movements and questioning.",
1735-1738.
Joint Source-Channel Coding
Chang, Wen-Whei / Hsu, Heng-Iang / Wang, De-Yu:
"Robust vector quantization for channels with memory",
1739-1742.
Kövesi, Balázs / Lamblin, Claude / Quinquis, Catherine / Thiérion, Philippe / Navarro, William:
"A multi-rate codec family based on GSM EFR and ITU-T G.729",
1743-1746.
Pan, J. S. / Shieh, C. S. / Chiang, T. F.:
"A novel channel distortion measure for vector quantization and a fuzzy model for codebook index assignment",
1747-1750.
Sriratanaban, C. / Kondoz, A.:
"A full-rate GSM-AMR candidate",
1751-1754.
Villette, S. / Stefanovic, M. / Kondoz, A.:
"A multi-rate speech and channel codec: a GSM AMR half-rate candidate",
1755-1758.
Speech Recognition - Language Modelling
Adda, Gilles / Jardino, Michèle / Gauvain, Jean-Luc:
"Language modeling for broadcast news transcription",
1759-1762.
Béchet, Frédéric / Nasr, Alexis / Spriet, Thierry / Mori, Renato de:
"Large span statistical language models: application to homophone disambiguation for large vocabulary speech recognition in French",
1763-1766.
Baggia, P. / Kellner, A. / Pérennou, Guy / Popovici, C. / Sturm, Janienke / Wessel, Frank:
"Language modelling and spoken dialogue systems - the ARISE experience",
1767-1770.
Brieussel-Pousse, Laure / Perennou, Guy:
"Language model level vs. lexical level for modeling pronunciation variation in a French CSR",
1771-1774.
Leung, Roger H.Y. / Choy, Chi-Yan / Leung, Hong C.:
"Characteristics of Chinese language models for large vocabulary telephone speech",
1775-1778.
Langlois, D. / Smaïli, K.:
"A new based distance language model for a dictation machine: application to MAUD",
1779-1782.
Müller, Ludek / Psutka, Josef:
"Using various language model smoothing techniques for the transcription of a weather forecast broadcasted by the Czech radio",
1783-1786.
McAllaster, Don / Gillick, Larry:
"Studies in acoustic training and language modeling using simulated speech data",
1787-1790.
Reichl, Wolfgang:
"Language model adaptation using minimum discrimination information",
1791-1794.
Smaïli, K. / Brun, A. / Zitouni, I. / Haton, Jean-Paul:
"Automatic and manual clustering for large vocabulary speech recognition: a comparative study",
1795-1798.
Sánchez, Joan-Andreu / Benedí, José-Miguel:
"Learning of stochastic context-free grammars by means of estimation algorithms",
1799-1802.
Yamamoto, Hirofumi / Sagisaka, Yoshinori:
"Part-of-speech n-gram and word n-gram fused language model",
1803-1806.
Zhu, Xiaojin / Chen, Stanley F. / Rosenfeld, Ronald:
"Linguistic features for whole sentence maximum entropy language models",
1807-1810.
Zitouni, I. / Mari, J. F. / Smaïli, K. / Haton, Jean-Paul:
"Variable-length sequence language model for large vocabulary continuous dictation machine",
1811-1814.
Zhang, Ruiqiang / Black, Ezra / Finch, Andrew:
"Using detailed linguistic structure in language modelling",
1815-1818.
Speech Generation and Synthesis - Prosody
Bulyko, Ivan / Ostendorf, Mari:
"Predicting gradient F0 variation: pitch range and accent prominence",
1819-1822.
Deans, Paul / Breen, Andrew / Jackson, Peter:
"CART-based duration modeling using a novel method of extracting prosodic features",
1823-1826.
Eom, Ki-Wan / Kim, Jin-Young / Kim, Sun-Mi:
"A primary study on the randomness control of the prosodic boundary index for natural synthetic speech",
1827-1830.
Ferencz, Attila / Nagy, István / Kovács, Tünde-Csilla / Ratiu, Teodora / Ferencz, Maria:
"On a hybrid time domain-LPC technique for prosody superimposing used for speech synthesis",
1831-1834.
Fackrell, J. W. A. / Vereecken, H. / Martens, J.-P. / Coile, Bert Van:
"Multilingual prosody modelling using cascades of regression trees and neural networks",
1835-1838.
Gu, Wentao / Shih, Chilin / Santen, Jan P.H. van:
"An efficient speaker adaptation method for TTS duration model",
1839-1842.
House, David / Bell, Linda / Gustafson, Kjell / Johansson, Linn:
"Child-directed speech synthesis: evaluation of prosodic variation for an educational computer program",
1843-1846.
Huckvale, Mark:
"Representation and processing of linguistic structures for an all-prosodic synthesis system using XML",
1847-1850.
Park, Won / Park, Hyung-Bin / Bae, Myung-Jin:
"A study on a pitch alteration by using the formant and phase compensation technique",
1851-1854.
Lee, Tan / Meng, Helen M. / Lau, Wai H. / Lo, W. K. / Ching, P. C.:
"Micro-prosodic control in Cantonese text-to-speech synthesis",
1855-1858.
Mixdorff, Hansjörg / Mehnert, Dieter:
"Exploring the naturalness of several German high-quality-text-to-speech systems",
1859-1862.
Sakurai, A. / Kawanami, Hiromichi / Hirose, Keikichi:
"Detecting accent sandhi in Japanese using a superpositional F0 model",
1863-1866.
Kitagawa, Satoshi / Campbell, Nick:
"Focus detection by comparison of speech waveforms",
1867-1870.
Tatham, Mark / Lewis, Eric / Morton, Katherine:
"An advanced intonation model for synthesis",
1871-1874.
Takano, Satoshi / Abe, Masanobu:
"A new F0 modification algorithm by manipulating harmonics of magnitude spectrum",
1875-1878.
Villar Navarro, Juan Manuel / López Gonzalo, Eduardo / Relaño Gil, José:
"A mixed strategy approach to Spanish prosody",
1879-1882.
Speech Perception 2
Ainsworth, William A.:
"Perception of overlapping syllables",
1883-1886.
Cerrato, Loredana / Paoloni, Andrea:
"Are transcriptions of speech material recorded by means of bugs reliable?",
1887-1890.
Erdeljac, Vlasta / Horga, Damir:
"Influence of morphology on phoneme identification in spoken Croatian",
1891-1894.
Hant, James J. / Alwan, Abeer:
"Modeling the masking of formant transitions in noise",
1895-1898.
Irino, Toshio / Patterson, Roy D.:
"Stabilised wavelet Mellin transform: an auditory strategy for normalising sound-source size",
1899-1902.
Palková, Zdena / Janíková, Jitka:
"Unintended preferences in the perceptive evaluation of rhythmical units in Czech",
1903-1906.
Pallier, Christophe / Sebastián Gallés, Nuria / Colomé, Angels:
"Phonological representations and repetition priming",
1907-1910.
Vicsi, Klára / Csatari, F. / Bakcsi, Zs. / Tantos, A.:
"Distance score evaluation of the visualised speech spectra at audio-visual articulation training",
1911-1914.
Leeuwen, David A. van / Louwere, Michael de:
"Objective and subjective evaluation of the acoustic models of a continuous speech recognition system",
1915-1918.
Whiteside, S. P. / Varley, R. A.:
"Verbo-motor priming in the phonetic encoding of real and non-words",
1919-1922.
Speech Recognition - Language Modelling 1
Chen, Langzhou / Huang, Taiyi:
"An improved MAP method for language model adaptation",
1923-1926.
Clarkson, Philip / Robinson, Tony:
"Towards improved language model evaluation measures",
1927-1930.
Huang, Taiyi / Chen, Langzhou:
"A novel language model based on self-organized learning",
1931-1934.
Kilian, Ute / Class, Fritz:
"Combining syntactical and statistical language constraints in context-dependent language models for interactive speech applications",
1935-1938.
Martin, Sven / Hamacher, Christoph / Liermann, Jorg / Wessel, Frank / Ney, Hermann:
"Assessment of smoothing methods and complex stochastic language modeling",
1939-1942.
Speech and Noise
Bippus, Rolf / Fischer, Alexander / Stahl, Volker:
"Domain adaptation for robust automatic speech recognition in car environments",
1943-1946.
Huang, Jun / Zhao, Yunxin / Levinson, Stephen:
"A DCT-based fast enhancement technique for robust speech recognition in automobile usage",
1947-1950.
Hermus, Kris / Dologlou, Ioannis / Wambacq, Patrick / Compernolle, Dirk Van:
"Fully adaptive SVD-based noise removal for robust speech recognition",
1951-1954.
Westphal, Martin / Waibel, Alex:
"Towards spontaneous speech recognition for on-board car navigation and information systems",
1955-1958.
Das, Subrata / Lubensky, David / Wu, Cheng:
"Towards robust speech recognition in the telephony network environment - cellular and landline conditions",
1959-1962.
Text-Dependent Speaker Verification
Bimbot, Frédéric / Blomberg, Mats / Boves, Louis / Chollet, Gérard / Jaboulet, Cédric / Jacob, Bruno / Kharroubi, Jamal / Koolwaaij, Johan / Lindberg, Johan / Mariethoz, Johnny / Mokbel, Chafic / Mokbel, Houda:
"An overview of the PICASSO project research activities in speaker verification for telephone applications",
1963-1966.
Charlet, D.:
"Integrating time-alignment information into the decision making for text-dependent HMM-based speaker verification",
1967-1970.
Genoud, Dominique / Chollet, Gérard:
"Deliberate imposture: a challenge for automatic speaker verification systems.",
1971-1974.
Melin, H. / Lindberg, Johan:
"Variance flooring, scaling and tying for text-dependent speaker verification",
1975-1978.
Mariethoz, Johnny / Genoud, Dominique / Bimbot, Frédéric / Mokbel, Chafic:
"Client / world model synchronous alignement for speaker verification",
1979-1982.
Speech Understanding - Miscellaneous Topics
Boros, Manuela / Heisterkamp, Paul:
"Linguistic phrase spotting in a simple application spoken dialogue system",
1983-1986.
Deinzer, F. / Fischer, J. / Ahlrichs, U. / Nöth, Elmar:
"Learning of domain dependent knowledge in semantic networks",
1987-1990.
Hakkani-Tür, Dilek / Tür, Gökhan / Stolcke, Andreas / Shriberg, Elizabeth:
"Combining words and prosody for information extraction from speech",
1991-1994.
Ishikawa, Kai / Sumita, Eiichiro:
"Error correction translation using text corpora",
1995-1998.
Kronenberg, S. / Skuplik, K.:
"Efficient sentence disambiguation by preferred constituent order",
1999-2002.
Lee, Yue-Shi / Chen, Hsin-Hsi:
"Identifying linguistic segmentations in Chinese spoken dialogue",
2003-2006.
Chiang, Tung-Hui / Lin, Yi-Chung:
"Error recovery for robust language understanding in spoken dialogue systems",
2007-2010.
Liu, Xiaohu / Fung, Pascale / Cheung, Chi Shun:
"A monolingual semantic decoder based on word sense disambiguation for mixed language understanding",
2011-2014.
Meng, Helen M. / Lam, Wai / Wai, Carmen:
"To believe is to understand",
2015-2018.
Nöth, Elmar / Haas, Jürgen / Warnke, Volker / Gallwitz, Florian / Boros, Manuela:
"A hybrid approach to spoken dialogue understanding: prosody, statistics and partial parsing",
2019-2022.
Obuchi, Yasunari / Koizumi, Atsuko / Kitahara, Yoshinori / Matsuda, Jun'ichi / Tsukada, Toshihisa:
"Portable speech interpreter which has voice input and sophisticated correction functions",
2023-2026.
Potamianos, Alexandros / Riccardi, Giuseppe / Narayanan, Shrikanth:
"Categorical understanding using statistical ngram models",
2027-2030.
Spilker, Jörg / Weber, Hans / Görz, Günther:
"Detection and correction of speech repairs in word lattices",
2031-2034.
Schadle, Igor / Antoine, Jean-Yves / Memmi, Daniel:
"Connectionist language models for speech understanding: the problem of word order variation",
2035-2038.
Siu, Kai-Chung / Meng, Helen M.:
"Semi-automatic acquisition of domain-specific semantic structures",
2039-2042.
Takezawa, Toshiyuki:
"Transformation into language processing units by dividing and connecting utterance units",
2043-2046.
Wong, Aboy / Wu, Dekai:
"Learning a lightweight robust deterministic parser",
2047-2050.
Wu, Dekai / Sui, Zhifang / Zhao, Jun:
"An information-based method for selecting feature types for word prediction",
2051-2054.
Wang, Ye-Yi:
"A robust parser for spoken language understanding",
2055-2058.
Speech Generation and Synthesis - Systems, Linguistic Processing
Barbosa, Plínio A. / Violaro, Fábio / Albano, Eleonora C. / Simoes, Flávio / Aquino, Patrícia / Madureira, Sandra / Francozo, Edson:
"Aiuruete: a high-quality concatenative text-to-speech system for Brazilian Portuguese with demisyllabic analysis-based units and a hierarchical model of rhythm production",
2059-2062.
Burileanu, Dragos / Dan, Claudius / Sima, Mihai / Burileanu, Corneliu:
"A parser-based text preprocessor for Romanian language TTS synthesis",
2063-2066.
Carlberger, Alice:
"Nparse - a shallow n-gram-based grammatical-phrase parser",
2067-2070.
Dermatas, Evangelos / Kokkinakis, George:
"A language-independent probabilistic model for automatic conversion between graphemic and phonemic transcription of words",
2071-2074.
Gros, Jerneja / Mihelic, F.:
"Acquisition of an extensive rule set for Slovene grapheme-to-allophone transcription",
2075-2078.
Ho, Ching-Hsiang / Vaseghi, Saeed / Chen, Aimin:
"Voice conversion between UK and US accented English",
2079-2082.
Mizuno, Hideyuki / Abe, Masanobu / Nakajima, Shin'ya:
"Development of speech design tool "SESIGN99" to enhance synthesized speech",
2083-2086.
Hain, Horst-Udo:
"Automation of the training procedures for neural networks performing multi-lingual grapheme to phoneme conversion",
2087-2090.
Koutny, Ilona:
"Parsing Hungarian sentences in order to determine their prosodic structures in a multilingual TTS system",
2091-2094.
Mihkla, Meelis / Eek, Arvo / Meister, Einar:
"Text-to-speech synthesis of Estonian",
2095-2098.
Montero, J. M. / Gutiérrez-Arriola, J. / Colás, J. / Macías-Guarasa, J. / Enríquez, E. / Pardo, Juan Manuel:
"Development of an emotional speech synthesiser in Spanish",
2099-2102.
Pavesic, N. / Gros, Jerneja:
"S5: the SQEL Slovene speech synthesis system",
2103-2106.
Rojc, Matej / Stergar, Janez / Wilhelm, Ralph / Hain, Horst-Udo / Holzapfel, Martin / Horvat, Bogomir:
"A multilingual text processing engine for the PAPAGENO text-to-speech synthesis system",
2107-2110.
Suh, Chang K. / Kagoshima, Takehiko / Morita, Masahiro / Seto, Shigenobu / Akamine, Masami:
"Toshiba English text-to-speech synthesizer (TESS)",
2111-2114.
Sannier, Frédérique / Aubergé, Véronique:
"Towards the generation of French phonetic inflected forms",
2115-2118.
Tzoukermann, Evelyne / Ménard, Lucie / Ouellet, Marise:
"Canadian French text-to-speech synthesis: modeling an optimal set of realizations for dialect markers",
2119-2122.
Busser, Bertjan / Daelemans, Walter / Bosch, Antal van den:
"Machine learning of word pronunciation: the case against abstraction",
2123-2126.
Speech & the Internet
Ordowski, M. / Deshmukh, N. / Ganapathiraju, A. / Hamaker, J. / Picone, Joseph:
"A public domain speech-to-text system",
2127-2130.
Fellbaum, Klaus / Richter, Joerg:
"Human speech production - an internet-based interactive multimodal tutorial",
2131-2134.
Gibbon, Dafydd / Kölsch, Silke / Mertins, Inge / Schulte, Michaela / Trippel, Thorsten:
"Terminology principles and support for spoken language system development",
2135-2138.
Horiuchi, Yasuo / Fujiwara, Atsushi / Ichikawa, Akira:
"New WWW browser for visually impaired people using interactive voice technology",
2139-2142.
Hanika, Jirí / Horák, Petr:
"Text to speech control protocol",
2143-2146.
Petek, Bojan:
"Multilinguality and human language technology courseware",
2147-2150.
Rouillard, José / Caelen, Jean:
"Multimodal information seeking dialogues on the world wide web",
2151-2154.
Tucker, Roger / Robinson, Tony / Christie, James:
"Compression of acoustic features - are perceptual quality and recognition performance incompatible goals?",
2155-2158.
Vaufreydaz, Dominique / Rouillard, José / Akbar, Mohammad:
"A network architecture for building applications that use speech recognition and/or synthesis",
2159-2162.
Speech Recognition - Language Modelling 2
Bellegarda, Jerome R.:
"Context scope selection in multi-span statistical language modeling",
2163-2166.
Gildea, Daniel / Hofmann, Thomas:
"Topic-based language models using EM",
2167-2170.
Galescu, Lucian / Ringger, Eric K.:
"Augmenting words with linguistic information for n-gram language models",
2171-2174.
Nasr, Alexis / Estève, Yannick / Béchet, Frédéric / Spriet, Thierry / Mori, Renato de:
"A language model combining n-grams and stochastic finite state automata",
2175-2178.
Wu, Jun / Khudanpur, Sanjeev:
"Combining nonlocal, syntactic and n-gram dependencies in language modeling",
2179-2182.
Speech Signal Processing
Kiss, Imre / Kapanen, Pekka:
"Robust feature vector compression algorithm for distributed speech recognition",
2183-2186.
Karjalainen, Matti / Tolonen, Tero:
"Separation of speech signals using iterative multi-pitch analysis and prediction",
2187-2190.
Parris, Eluned S. / Carey, Michael J. / Lloyd-Thomas, Harvey:
"Feature fusion for music detection",
2191-2194.
Vuuren, Sarel van / Hermansky, Hynek:
"Speech variability in the modulation spectral domain - SANOVA technique -",
2195-2198.
Yang, Dekun / Meyer, Georg F. / Ainsworth, William A.:
"Improving harmonic selection for speech intelligibility enhancement by the reassignment method",
2199-2202.
Text-Independent Speaker Verification and Tracking
Beigi, Homayoon S. M. / Maes, Stéphane H. / Chaudhari, Upendra V. / Sorensen, Jeffrey S.:
"A hierarchical approach to large-scale speaker recognition",
2203-2206.
Cernocky, J. / Petrovska-Delacrétaz, D. / Pigeon, S. / Verlinde, P. / Chollet, Gérard:
"A segmental approach to text-independent speaker verification",
2207-2210.
Johnson, S. E.:
"Who spoke when? - automatic segmentation and clustering for determining speaker turns",
2211-2214.
Przybocki, Mark A. / Martin, Alvin F.:
"The 1999 NIST speaker recognition evaluation, using summed two-channel telephone data for speaker detection and speaker tracking",
2215-2218.
Sönmez, Kemal / Heck, Larry / Weintraub, Mitchel:
"Speaker tracking and detection with multiple speakers",
2219-2222.
Corpora
Aiello, Demetrio / Cerrato, Loredana / Delogu, Cristina / Carlo, Andrea Di:
"The acquisition of a speech corpus for limited domain translation",
2223-2226.
Lee, Yue-Shi / Chen, Hsin-Hsi:
"Tagging spoken corpus",
2227-2230.
Csatári, F. / Bakcsi, Zs. / Vicsi, Klára:
"A Hungarian child database for speech processing applications",
2231-2234.
Carson-Berndsen, Julie:
"A generic lexicon tool for word model definition in multimodal applications",
2235-2238.
Cassidy, Steve:
"Compiling multi-tiered speech databases into the relational model: experiments with the emu system",
2239-2242.
Elenius, Kjell:
"Two Swedish SpeechDat databases - some experiences and results",
2243-2246.
Hayamizu, Satoru / Nagaya, Shigeki / Watanuki, Keiko / Nakazawa, Masayuki / Nobe, Shuichi / Yoshimura, Takashi:
"A multimodal database of gestures and speech",
2247-2250.
Matsui, Tomoko / Naito, Masaki / Singer, Harald / Nakamura, Atsushi / Sagisaka, Yoshinori:
"Japanese spontaneous speech database with wide regional and age distribution",
2251-2254.
Nakamura, Satoshi / Hiyane, Kazuo / Asano, Futoshi / Yamada, Takeshi / Endo, Takashi:
"Data collection in real acoustical environments for sound scene understanding and hands-free speech recognition",
2255-2258.
Noguchi, Hiroaki / Kiriyama, Kazuhisa / Matsuda, Hiroshi / Taniguchi, Miki / Den, Yasuharu / Katagiri, Yasuhiro:
"Automatic labeling of Japanese prosody using J-ToBI style description",
2259-2262.
Pollák, Petr / Vopička, Josef / Sovka, Pavel:
"Czech language database of car speech and environmental noise",
2263-2266.
Kurematsu, Akira / Sukenori, Atsushi:
"Language model selection based on the analysis of Japanese spontaneous speech on travel arrangement task",
2267-2270.
Schiel, Florian / Draxler, Christoph / Hoole, Phil / Tillmann, Hans G.:
"New resources at BAS: acoustic, multimodal, linguistic",
2271-2274.
Sanders, Eric / Heuvel, Henk van den / Choukri, Khalid:
"Building speech databases for cellular networks",
2275-2278.
Heuvel, Henk van den / Boudy, Jérôme / Comeyne, Robrecht / Euler, Stephan / Moreno, Asuncion / Richard, Gael:
"The SpeechDat-Car multilingual speech databases for in-car applications: some first validation results",
2279-2282.
Williams, Briony:
"A Welsh speech database: preliminary results",
2283-2286.
Speech Generation and Synthesis - Acoustic Synthesis and Units
Au, Oscar C. / Wan, Wanggen / Keung, Cyan L. / Yim, Chi H.:
"Sinusoidal representation and auditory model-based parametric matching and smoothing and its application in speech analysis/synthesis",
2287-2290.
Balestri, Marcello / Pacchiotti, Alberto / Quazza, Silvia / Salza, Pier Luigi / Sandri, Stefano:
"Choose the best to modify the least: a new generation concatenative synthesis system",
2291-2294.
Chou, Fu-chiang / Tseng, Chiu-yu / Lee, Lin-shan:
"Selection of waveform units for corpus-based Mandarin speech synthesis based on decision trees and prosodic modification costs",
2295-2298.
Etxebarria, B. / Hernáez, I. / Madariaga, I. / Navas, E. / Rodríguez, J. C. / Gándara, R.:
"Improving quality in a speech synthesizer based on the MBROLA algorithm",
2299-2302.
Huang, Yan / Xu, Bo:
"A novel model TD-PSPTP for speech synthesis",
2303-2306.
Kapilow, David / Stylianou, Yannis / Schroeter, Juergen:
"Detection of non-stationarity in speech signals and its application to time-scaling",
2307-2310.
Koyama, Takao / Takahashi, Jun-ichi:
"A v-CV waveform based speech synthesis using global minimization of pitch conversion and concatenation distortion in v-CV unit sequence",
2311-2314.
Mann, Iain / McLaughlin, Steve:
"Stable speech synthesis using recurrent radial basis functions",
2315-2318.
Meron, Yoram / Hirose, Keikichi:
"Efficient weight training for selection based synthesis",
2319-2322.
Matousek, Jindrich:
"Speech synthesis using HMM-based acoustic unit inventory",
2323-2326.
Macon, Michael W. / Clements, Mark A.:
"An enhanced ABS/OLA sinusoidal model for waveform synthesis in TTS",
2327-2330.
Ouellet, Marise / Tzoukermann, Evelyne / Ménard, Lucie:
"High vowel /i y u/ in Canadian and continental French: an analysis for a TTS system",
2331-2334.
Tychtl, Zbyněk / Psutka, Josef:
"Speech production based on the mel-frequency cepstral coefficients",
2335-2338.
Rank, Erhard:
"Exploiting improved parameter smoothing within a hybrid concatenative/LPC speech synthesizer",
2339-2342.
Stylianou, Yannis:
"Synchronization of speech frames based on phase data with application to concatenative speech synthesis",
2343-2346.
Yoshimura, Takayoshi / Tokuda, Keiichi / Masuko, Takashi / Kobayashi, Takao / Kitamura, Tadashi:
"Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis",
2347-2350.
Speech and Noise 1
Glotin, Hervé / Berthommier, Frédéric / Tessier, Emmanuel:
"A CASA-labelling model using the localisation cue for robust cocktail-party speech recognition",
2351-2354.
Bayya, Aruna / Yegnanarayana, B.:
"Noise-invariant representation for speech signals",
2355-2358.
El-Maleh, Khaled / Kabal, Peter:
"Natural-quality background noise coding using residual substitution",
2359-2362.
Fernández, Julian / Lleida, Eduardo / Masgrau, Enrique:
"Microphone array design for robust speech acquisition and recognition",
2363-2366.
Guilmin, Gwénaël / Bouquin-Jeannès, Régine Le / Gournay, Philippe:
"Study of the influence of noise pre-processing on the performance of a low bit rate parametric speech coder",
2367-2370.
Haverinen, Hemmo / Salmela, Petri / Häkkinen, Juha / Lehtokangas, Mikko / Saarinen, Jukka:
"MLP network for enhancement of noisy MFCC vectors",
2371-2374.
Iso-Sipilä, J. / Laurila, K. / Hariharan, Ramalingam / Viikki, Olli:
"Hands-free voice activation in noisy car environment",
2375-2378.
Karray, Lamia / Polard, Emmanuel:
"A wavelet denoising technique to improve endpoint detection in adverse conditions",
2379-2382.
Kuropatwinski, Marcin / Leckschat, Dieter / Kroschel, Kristian / Czyzewski, Andrzej / Hales, Chaz:
"Speech enhancement for linear-predictive-analysis-by-synthesis coders",
2383-2386.
Matsumoto, Hiroshi / Ubukata, Hiroaki:
"Robust HMM to variation of noisy environments based on variance extension of noise models",
2387-2390.
Nemer, Elias / Goubran, Rafik / Mahmoud, Samy:
"The fourth-order cumulant of speech signals with application to voice activity detection",
2391-2394.
Shieh, Woei-Chyang / Chang, Sen-Chia:
"The dependence of feature vectors under adverse noise",
2395-2398.
Tchorz, Jürgen / Kollmeier, Birger:
"Speech detection and SNR prediction basing on amplitude modulation pattern recognition",
2399-2402.
Vicente, Luis / Elliott, Stephen J. / Masgrau, Enrique:
"Fast active noise control for robust speech acquisition",
2403-2406.
Vizinho, Ascension / Green, Phil / Cooke, M. / Josifovski, Ljubomir:
"Missing data theory, spectral subtraction and signal-to-noise estimation for robust ASR: an integrated study",
2407-2410.
Vetter, Rolf / Virag, Nathalie / Renevey, Philippe / Vesin, Jean-Marc:
"Single channel speech enhancement using principal component analysis and MDL subspace selection",
2411-2414.
Speech Translation
Barrachina, Sergio / Vilar, Juan Miguel:
"Automatically deriving categories for translation",
2415-2418.
Corazza, A.:
"An inter-domain portable approach to interchange format construction",
2419-2422.
Casañ, G. A. / Castaño, M. A.:
"Distributed representation of vocabularies in the RECONTRA neural translator",
2423-2426.
Reithinger, Norbert:
"Robust information extraction in a speech translation system",
2427-2430.
Sugaya, Fumiaki / Takezawa, Toshiyuki / Yokoo, Akio / Yamamoto, Seiichi:
"End-to-end evaluation in ATR-MATRIX: speech translation system between English and Japanese",
2431-2434.
Topic Detection and Tracking
Dharanipragada, S. / Franz, Martin / McCarley, J. S. / Roukos, Salim / Ward, T.:
"Story segmentation and topic detection for recognized speech",
2435-2438.
Jin, Hubert / Schwartz, Richard / Sista, Sreenivasa / Walls, Frederick:
"Topic tracking for radio, TV broadcast, and newswire",
2439-2442.
Lowe, Stephen A.:
"The beta-binomial mixture model for word frequencies in documents with applications to information retrieval",
2443-2446.
Nakazawa, Masayuki / Zhang, Jianxin / Oka, Ryuichi:
"Topic spotting and its description of summary from spontaneous speech",
2447-2450.
Walls, Frederick / Jin, Hubert / Sista, Sreenivasa / Schwartz, Richard:
"Topic detection in broadcast news",
2451-2454.
Speech & the Internet
Bowerman, Chris / Eriksson, Anders / Huckvale, Mark / Rosner, Mike / Tatham, Mark / Wolters, Maria:
"Criteria for evaluating internet tutorials in speech communication sciences",
2455-2458.
Drygajlo, Andrzej / Delafontaine, Guy:
"Javaspeechlab - interactive speech analysis laboratory on the world-wide web",
2459-2462.
Digalakis, Vassilis / Tsakalidis, Stavros / Neumeyer, Leonardo:
"Reviving discrete HMMs: the myth about the superiority of continuous HMMs",
2463-2466.
Fujisaki, Hiroya / Kameda, Hiroyuki / Ohno, Sumio / Abe, Kenji / Iijima, Michio / Suzuki, Masayoshi / Taketa, Kazunari:
"Principles and design of an intelligent system for information retrieval over the internet with a multimodal dialogue interface",
2467-2470.
Nishimoto, Takuya / Yuki, Hidehiro / Kawahara, Takehiko / Niimi, Yasuhisa:
"An asynchronous virtual meeting system for bi-directional speech dialog",
2471-2474.
Speech Recognition - Adaptation
Botinis, Antonis / Fourakis, Marios / Prinou, Irini:
"Prosodic effects on segmental durations in Greek",
2475-2478.
Blomberg, Mats:
"Within-utterance correlation for speech recognition",
2479-2482.
Gelin, Philippe / Junqua, Jean-Claude:
"Techniques for robust speech recognition in the car environment",
2483-2486.
Giuliani, Diego:
"An on-line acoustic compensation technique for robust speech recognition",
2487-2490.
Hung, Wei-Wen / Wang, Hsiao-Chuan:
"Using adaptive signal limiter together with noise-robust techniques for noisy speech recognition",
2491-2494.
Hong, Wei-Tyng / Chen, Sin-Horng:
"A robust environment-effects suppression training algorithm for adverse Mandarin speech recognition",
2495-2498.
Harju, Mikko / Salmela, Petri / Viikki, Olli / Lehtokangas, Mikko / Saarinen, Jukka:
"Robust speaker adaptation of continuous density HMMs using multilayer perceptron network",
2499-2502.
Li, Chengrong / Chen, Jingdong / Xu, Bo:
"Regression class selection and speaker adaptation with MLLR in Mandarin continuous speech recognition",
2503-2506.
Li, Guoqiang / Du, Limin / Hou, Ziqiang:
"Regression transformation of prior means for speaker adaptation",
2507-2510.
Feng, Liu / Che, Chi-wei / Yu, Peng / Wang, Zuoying:
"Linguistic tree based maximum likelihood model interpolation",
2511-2514.
Naito, Masaki / Deng, Li / Sagisaka, Yoshinori:
"Model-based speaker normalization methods for speech recognition",
2515-2518.
Nguyen, Patrick / Wellekens, Christian / Junqua, Jean-Claude:
"Maximum likelihood eigenspace and MLLR for speech recognition in noisy environments",
2519-2522.
Ono, Yoshio / Yamada, Maki / Hoshimi, Masakatsu:
"A study of speaker adaptation for speaker independent speech recognition method using phoneme similarity vector",
2523-2526.
Uebel, L. F. / Woodland, P. C.:
"An investigation into vocal tract length normalisation",
2527-2530.
Yuk, Zong Suk / Flanagan, James / Krishnamoorthy, Mahesh / Dayanidhi, Krishna:
"Adaptation to environment and speaker using maximum likelihood neural networks",
2531-2534.
Yu, Xiuyang / Ward, Wayne:
"Corrective training for speaker adaptation",
2535-2538.
Obradovic, R. / Pekar, D. / Krco, S. / Delic, V. / Senk, V.:
"A robust speaker-independent CPU-based ASR system",
2881-2884.
Enhancements, Echo Cancellation, and Quality Measures
Abouchakra, Rabih / Kabal, Peter:
"Delay estimation for transform domain acoustical echo cancellation",
2539-2542.
Beaugeant, C. / Scalart, Pascal:
"Noise reduction using perceptual spectral change",
2543-2546.
Hussain, Amir / Campbell, Douglas R.:
"Intelligibility improvements using diverse sub-band processing applied to noisy speech",
2547-2550.
Koutras, Athanasios / Dermatas, Evangelos / Kokkinakis, George:
"Recognizing simultaneous speech: a genetic algorithm approach",
2551-2554.
Bielawski, Krzysztof / Petrovsky, Alexander A.:
"Speech enhancement system for hands-free telephone based on the psychoacoustically motivated filter bank with allpass frequency transformation",
2555-2558.
Shields, P. W. / Campbell, Douglas R.:
"Speech enhancement using a multi-microphone sub-band adaptive Griffiths-Jim noise canceller",
2559-2562.
Szarvas, M. / Fegyó, T. / Tatai, P. / Gordos, Géza:
"Qualiphone-a: a perceptual speech quality evaluation system for analog mobile networks",
2563-2566.
Saruwatari, Hiroshi / Kajita, Shoji / Takeda, Kazuya / Itakura, Fumitada:
"Speech enhancement using nonlinear microphone array under nonstationary noise conditions",
2567-2570.
Sarikaya, Ruhi / Hansen, John H. L.:
"Auditory masking threshold estimation for broadband noise sources with application to speech enhancement",
2571-2574.
Unoki, Masashi / Akagi, Masato:
"Segregation of vowel in background noise using the model of segregating two acoustic sources based on auditory scene analysis",
2575-2578.
Veaux, Christophe / Scalart, Pascal / Gilloire, André:
"Analysis and on-line detection of audible distortions in GSM telephony",
2579-2582.
Ru, Wen Rong / Lin, Shih-Chen / Chen, Po-Cheng / Kuo, Chun-Hung:
"A parameter-based 2-talker detection apparatus for echo cancellation",
2583-2586.
Yen, Kuan-Chieh / Huang, Jun / Zhao, Yunxin:
"Co-channel speech separation in the presence of correlated and uncorrelated noises",
2587-2590.
Speech and Noise 2
Burshtein, David / Gannot, Sharon:
"Speech enhancement using a mixture-maximum model",
2591-2594.
Gonzalez-Rodriguez, Joaquin / Cruz-Llanas, Santiago / Ortega-Garcia, Javier:
"Concurrent speakers separation through binaural processing of stereo recordings",
2595-2598.
Gustafsson, Harald / Nordholm, Sven / Claesson, Ingvar:
"Spectral subtraction with adaptive averaging of the gain function",
2599-2602.
Gaillard, François / Berthommier, Frédéric / Feng, Gang / Schwartz, Jean-Luc:
"A reliability criterion for time-frequency labeling based on periodicity in an auditory scene",
2603-2606.
Koval, Serguei / Stolbov, Mikhail / Khitrov, Mikhail:
"Broadband noise cancellation systems: new approach to working performance optimization",
2607-2610.
Linhard, Klaus / Haulick, Tim:
"Noise subtraction with parametric recursive gain curves",
2611-2614.
Masgrau, Enrique / Aguilar, Luis / Lleida, Eduardo:
"Performance comparison of several adaptive schemes for microphone array beamforming",
2615-2618.
Mizumachi, Mitsunori / Akagi, Masato:
"An objective distortion estimator for hearing aids and its application to noise reduction",
2619-2622.
Nemer, Elias / Goubran, Rafik / Mahmoud, Samy:
"Speech enhancement using fourth-order cumulants and time-domain optimal filters",
2623-2626.
Renevey, Philippe / Drygajlo, Andrzej:
"Missing feature theory and probabilistic estimation of clean speech components for robust speech recognition",
2627-2630.
Salavedra, Josep M. / Bou, Xavier:
"Distortion effects of several cumulant-based Wiener filtering algorithms",
2631-2634.
Svoboda, Milan / Sovka, Pavel / Pollák, Petr:
"Combined noise suppression system for monaural cochlear implants",
2635-2638.
Wijngaarden, Sander J. van / Steeneken, Herman J. M.:
"Objective prediction of speech intelligibility at high ambient noise levels using the speech transmission index",
2639-2642.
Wan, Eric A. / Merwe, Rudolph van der:
"Noise-regularized adaptive filtering for speech enhancement",
2643-2646.
Zarubin, F. / Kovtonyuk, A. / Zadiraka, K.:
"Speech enhancement using Karhunen-Loève transformation and Wiener filtering in critical bands",
2647-2650.
Spoken Dialogue Systems
Brondsted, Tom:
"The CPK NLP suite for spoken language understanding",
2651-2654.
Chung, Grace / Seneff, Stephanie / Hetherington, Lee:
"Towards multi-domain speech understanding using a two-stage recognizer",
2655-2658.
Ipsic, I. / Mihelic, F. / Dobrisek, S. / Gros, Jerneja / Pavesic, N.:
"A Slovenian spoken dialog system for air flight inquiries",
2659-2662.
Ramaswamy, Ganesh / Kleindienst, Jan / Coffman, Daniel / Gopalakrishnan, Ponani / Neti, Chalapathy:
"A pervasive conversational interface for information interaction",
2663-2666.
Vromans, B. / Vark, R.J. van / Rueber, B. / Kellner, A.:
"Extending the SUSI system with negative knowledge",
2667-2670.
Speech Perception
Crouzet, Olivier / Bacri, Nicole:
"Phonological constraints in speech segmentation processes: investigating levels of implementation",
2671-2674.
Damper, Robert I. / Gunn, Steve R.:
"Learning phonetic distinctions from speech signals",
2675-2678.
Moreton, Elliott / Amano, Shigeaki:
"Phonotactics in the perception of Japanese vowel length: evidence for long-distance dependencies",
2679-2682.
Peperkamp, Sharon / Dupoux, Emmanuel / Sebastián-Gallés, Núria:
"Perception of stress by French, Spanish, and bilingual subjects",
2683-2686.
Silipo, Rosaria / Greenberg, Steven / Arai, Takayuki:
"Temporal constraints on speech intelligibility as deduced from exceedingly sparse spectral representations",
2687-2690.
Corpora
Choukri, Khalid / Mapelli, Valérie / Allen, Jeff:
"New developments within the European Language Resources Association (ELRA)",
2691-2694.
Eskenazi, Maxine / Rudnicky, Alexander I. / Gregory, Karin / Constantinides, Paul / Brennan, Robert / Bennett, Christina / Allen, Jwan:
"Data collection and processing in the Carnegie Mellon Communicator",
2695-2698.
Höge, Harald / Draxler, Christoph / Heuvel, Henk van den / Johansen, Finn Tore / Sanders, Eric / Tropf, Herbert S.:
"SpeechDat multilingual speech databases for teleservices: across the finish line",
2699-2702.
Mengel, Andreas / Heid, Ulrich:
"Enhancing reusability of speech corpora by hyperlinked query output",
2703-2706.
Silverman, Kim / Anderson, Victoria / Bellegarda, Jerome / Lenzo, Kevin / Naik, Devang:
"Design and collection of a corpus of polyphones and prosodic contexts for speech synthesis research and development",
2707-2708.
Speech Recognition - Training
Chang, Sen-Chia / Chien, Shih-Chieh / Shieh, Woei-Chyang:
"Mandarin telephone speech recognition using MCE/GPD-based speaker cluster HMM",
2709-2712.
Fonollosa, A. R. / Batlle, Eloi:
"Combining length restrictions and n-best techniques in multiple-pass search strategies",
2713-2716.
Gelin-Huet, Cecile / Rose, Kenneth / Rao, Ajit:
"The deterministic annealing approach for discriminative continuous HMM design",
2717-2720.
Huo, Qiang / Ma, Bin:
"On-line adaptive learning of CDHMM parameters based on multiple-stream prior evolution and posterior pooling",
2721-2724.
Kemp, Thomas / Waibel, Alex:
"Unsupervised training of a speech recognizer: recent experiments",
2725-2728.
Chesta, C. / Laface, Pietro / Nigra, M.:
"Piecewise HMM discriminative training",
2729-2732.
Lefèvre, Fabrice / Montacié, Claude / Caraty, Marie-José:
"A MLE algorithm for the k-NN HMM system",
2733-2736.
McDonough, John / Byrne, William:
"Single-pass adapted training with all-pass transforms",
2737-2740.
Nogueiras-Rodríguez, Albino / Marino, José B.:
"Minimum confusibility training of context dependent demiphones",
2741-2744.
Rudzionis, A. / Rudzionis, V.:
"Phoneme recognition in fixed context using regularized discriminant analysis",
2745-2748.
Tran, Dat / Wagner, Michael:
"Hidden Markov models using fuzzy estimation",
2749-2752.
Vair, Claudio / Mercogliano, Massimiliano / Fissore, Luciano:
"Incremental training of CDHMMs using Bayesian learning",
2753-2756.
Willett, Daniel / Müller, Stefan / Rigoll, Gerhard:
"A discriminative training procedure based on language model and dictionary for LVCSR",
2757-2760.
Wu, Jian / Guo, Qing:
"A novel discriminative method for HMM in automatic speech recognition",
2761-2764.
Speech Analysis and Segmentation
Avadhanulu, J. V. / Mathew, M. / Sreenivas, T. V.:
"EARLYZER: perceptually motivated robust TFR of speech",
2765-2768.
Aguilera, C. M. / Navas, A. / Urquiza, R. / Gago, A.:
"Frequency lowering using a discrete exponential transform",
2769-2772.
Martino, Joseph Di / Laprie, Yves:
"An efficient F0 determination algorithm based on the implicit calculation of the autocorrelation of the temporal excitation signal",
2773-2776.
Howitt, Andrew Wilson:
"Vowel landmark detection",
2777-2780.
Kawahara, Hideki / Katayose, Haruhiro / Cheveigné, Alain de / Patterson, Roy D.:
"Fixed point analysis of frequency to instantaneous frequency mapping for accurate estimation of F0 and periodicity",
2781-2784.
Lawlor, B. / Fagan, A. D.:
"A novel high quality efficient algorithm for time-scale modification of speech",
2785-2788.
Lee, Minkyu / Santen, Jan van / Möbius, Bernd / Olive, Joseph:
"Formant tracking using segmental phonemic information",
2789-2792.
McKenna, John / Isard, Stephen:
"Tailoring Kalman filtering towards speaker characterisation",
2793-2796.
Salomon, Ariel / Espy-Wilson, Carol:
"Automatic detection of manner events based on temporal parameters",
2797-2800.
Seck, Mouhamadou / Bimbot, Frédéric / Zugaj, Didier / Delyon, Bernard:
"Two-class signal segmentation for speech/music detection in audio tracks",
2801-2804.
Tuan, Vu Ngoc / d'Alessandro, Christophe:
"Robust glottal closure detection using the wavelet transform",
2805-2808.
Santen, Jan P. H. van / Sproat, Richard W.:
"High-accuracy automatic segmentation",
2809-2812.
Zeljkovic, Ilija / Stylianou, Yannis:
"Single complex sinusoid and ARHE model based pitch extractors",
2813-2816.
Speech and Noise 3
Álvarez, A. / Martínez, R. / Gómez, P. / Nieto, V. / Pérez, M. M.:
"A robust isolated word recognizer for highly non-stationary environments. recognition results",
2817-2820.
Afify, Mohamed:
"Sequential bias compensation for robust speech recognition",
2821-2824.
Coianiz, Tarcisio / Falavigna, Daniele / Gretter, Roberto / Orlandi, Marco:
"Use of simulated data for robust telephone speech recognition",
2825-2828.
Hauptman, Y. / Bistritz, Y.:
"On the use of time alignments for noisy speech recognition",
2829-2832.
Häkkinen, Juha / Suontausta, J. / Hariharan, Ramalingam / Vasilache, M. / Laurila, K.:
"Improved feature vector normalization for noise robust connected speech recognition",
2833-2836.
Josifovski, Ljubomir / Cooke, Martin / Green, Phil / Vizinho, Ascension:
"State based imputation of missing data for robust speech recognition and speech enhancement",
2837-2840.
Kermorvant, Christopher / Morris, Andrew:
"A comparison of two strategies for ASR in additive noise: missing data and spectral subtraction",
2841-2844.
Milner, Ben / Farrell, Mark:
"A comparison of techniques for tone compensation in payphone-based speech recognition",
2845-2848.
Menéndez-Pidal, Xavier / Chen, Ruxin / Wu, Duanpei / Tanaka, Mick:
"Front-end improvements to reduce stationary & variable channel and noise distortions in continuous speech recognition tasks",
2849-2852.
Nokas, G. / Dermatas, E.:
"Speech recognition in noisy reverberant rooms using a frequency domain blind deconvolution method",
2853-2856.
Schless, Volker / Class, Fritz / Sandl, Peter:
"Optimization of a speech recognizer for aircraft environments",
2857-2860.
Yoma, Nestor Becerra / Ling, Lee Luan / Stump, Sandra Dotto:
"Temporal constraints in Viterbi alignment for speech recognition in noise",
2861-2864.
Yamamoto, Kazumasa / Nakagawa, Seiichi:
"HMM composition of segmental unit input HMM for noisy speech recognition",
2865-2868.
Yoma, Nestor Becerra / Ling, Lee Luan / Stump, Sandra Dotto:
"Robust connected word speech recognition using weighted Viterbi algorithm and context-dependent temporal constraints",
2869-2872.
Yao, Kaisheng / Shi, Bertram / Fung, Pascale / Cao, Zhigang:
"Liftered forward masking procedure for robust digits recognition",
2873-2876.
Zhao, Yunxin:
"Channel identification and spectrum estimation for robust automatic speech recognition",
2877-2880.