Plenary Lectures
Acoustic Analysis
Acoustic Modeling
Acoustic Phonetics
Acoustics in Synthesis
Acquisition and Learning by Machine
Acquisition/Learning/Training L2 Learners
Adverse Environments and Multiple Microphones
Data-based Synthesis
Databases and Tools
Dialects and Speaking Styles
Dialogue Events
Dialogue Special Sessions
Dialogue Systems
Duration and Rhythm
Emotion in Recognition and Synthesis
Feature Extraction for Speech Recognition
Focus, Stress and Accent
General ASR Posters
Instructional Technology for Spoken Language
Language Acquisition
Language Modeling
Large Vocabulary
Multilingual Speech Processing
Multimodal ASR (Face and Lips)
Multimodal Dialogue/HCI
Multimodal Spoken Language Processing
Neural Models of Speech Processing
NNs and Stochastic Modeling
Perception of Vowels and Consonants
Perception of Words
Phonetics and Perception
Phonetics, Transcription, and Analysis
Physics and Simulation of the Vocal Tract
Pitch and Rate
Production and Perception of Prosody
Production and Prosody Posters
Prosodic Synthesis in Dialogue
Prosodic Synthesis in Text to Speech
Prosody - Phonological/Phonetic Measures
Prosody and Labeling
Prosody in ASR and Segmentation
Robust Speech Processing
Speaker Adaptation and Normalization
Speaker Identification and Verification
Speaker/Language Identification and Verification
Speech Coding / HMMs and NNs in ASR
Speech Disorders
Speech Enhancement and Robust Processing
Speech Production - Measurement and Modeling
Speech Recognition Using HMMs and NNs
Speech Synthesis
Spoken Discourse Analysis/Synthesis
Spoken Language and NLP
Spoken Language Dialogue and Conversation
Spoken Language Processing for Special Populations
Stochastic Techniques in Robust Speech Recognition
Topics in ASR and Search
TTS Systems and Rules
User-Machine Interfaces
Utterance Verification and Word Spotting
Vocal Tract Geometry
Vowels
Plenary Lectures
Cutler, Anne:
"The comparative study of spoken-language processing",
1.
Flanagan, James L.:
"Natural communication with machines - progress and challenge",
2522.
Large Vocabulary
Li, Z. / Heon, M. / O'Shaughnessy, Douglas:
"New developments in the INRS continuous speech recognition system",
2-5.
Lamel, Lori / Adda, Gilles:
"On designing pronunciation lexicons for large vocabulary, continuous speech recognition",
6-9.
Fetter, Pablo / Dandurand, Frédéric / Regel-Brietzmann, Peter:
"Word graph rescoring using confidence measures",
10-13.
Aubert, X. L. / Beyerlein, Peter / Ullrich, Meinhard:
"A bottom-up approach for handling unseen triphones in large vocabulary continuous speech recognition",
14-17.
Valtchev, V. / Woodland, P. C. / Young, S. J.:
"Discriminative optimisation of large vocabulary recognition systems",
18-21.
Matsuoka, Tatsuo / Ohtsuki, Katsutoshi / Mori, Takeshi / Furui, Sadaoki / Shirai, Katsuhiko:
"Japanese large-vocabulary continuous-speech recognition using a business-newspaper corpus",
22-25.
Carter, David / Kaja, Jaan / Neumeyer, Leonardo / Rayner, Manny / Weng, Fuliang / Wiren, Mats:
"Handling compound nouns in a Swedish speech-understanding system",
26-29.
Macias-Guarasa, J. / Gallardo, A. / Ferreiros, J. / Pardo, José M. / Villarrubia, L.:
"Initial evaluation of a preselection module for a flexible large vocabulary speech recognition system in",
30-33.
Multimodal ASR (Face and Lips)
Alissali, Mamoun / Deleglise, Paul / Rogozan, Alexandrina:
"Asynchronous integration of visual information in an automatic speech recognition system",
34-37.
Matthews, I. A. / Bangham, J. / Cox, S. J.:
"Audiovisual speech recognition using multiscale nonlinear image decomposition.",
38-41.
Su, Qin / Silsbee, Peter L.:
"Robust audiovisual integration using semicontinuous hidden Markov models",
42-45.
Schumeyer, Richard P. / Barner, Kenneth E.:
"The effect of visual information on word initial consonant perception of dysarthric speech",
46-49.
Chandramohan, Devi / Silsbee, Peter L.:
"A multiple deformable template approach for visual speech recognition",
50-53.
Cosi, Piero / Caldognetto, E. Magno / Ferrero, Franco / Dugatto, M. / Vagges, K.:
"Speaker independent bimodal phonetic recognition experiments",
54-57.
Luettin, Juergen / Thacker, Neil A. / Beet, Steve W.:
"Speechreading using shape and intensity information",
58-61.
Luettin, Juergen / Thacker, Neil A. / Beet, Steve W.:
"Speaker identification by lipreading",
62-65.
Perception of Words
Gow Jr., David W. / Melvold, Janis / Manuel, Sharon:
"How word onsets drive lexical access and segmentation: evidence from acoustics, phonology and processing",
66-69.
Kuijk, David van / Wittenburg, Peter / Dijkstra, Ton:
"RAW: a real-speech model for human word recognition",
70-73.
Meftah, Mehdi / Boudelaa, Sami:
"How facilitatory can lexical information be during word recognition? evidence from moroccan arabic",
74-77.
Haveman, Alette P.:
"Effects of frequency on the auditory perception of open- versus closed-class words",
78-81.
Vitevitch, Michael S. / Luce, Paul A. / Charles-Luce, Jan / Kemmerer, David:
"Phonotactic and metrical influences on adult ratings of spoken nonsense words",
82-85.
Auer Jr., Edward T. / Bernstein, Lynne E.:
"Lipreading supplemented by voice fundamental frequency: to what extent does the addition of voicing increase lexical uniqueness for the lipreader?",
86-89.
Riele, S. te / Nooteboom, Sieb G. / Quené, H.:
"Strategies used in rhyme-monitoring",
90-93.
Donselaar, Wilma van / Kuijpers, Cecile / Cutler, Anne:
"How do dutch listeners process words with epenthetic schwa?",
94-97.
Phonetics, Transcription, and Analysis
Juola, Patrick / Zimmermann, Philip:
"Whole-word phonetic distances and the PGPfone alphabet",
98-101.
Ran, Shuping / Millar, J. Bruce / Rose, Phil:
"Automatic vowel quality description using a variable mapping to an eight cardinal vowel reference set",
102-105.
Kipp, Andreas / Wesenick, Maria-Barbara / Schiel, Florian:
"Automatic detection and segmentation of pronunciation variants in German speech corpora",
106-109.
Seneff, Stephanie / Lau, Raymond / Meng, Helen:
"ANGIE: a new framework for speech analysis based on morpho-phonological modelling",
110-113.
Yang, Byunggon:
"Perceptual contrast in the Korean and English vowel system normalized",
114-117.
Lee, Yong-Ju / Lee, Sook-Hyang:
"On phonetic characteristics of pause in the Korean read speech",
118-120.
Boudelaa, Sami / Meftah, Mehdi:
"Cross-language effects of lexical stress in word recognition: the case of Arabic English bilinguals",
121-124.
Wesenick, Maria-Barbara:
"Automatic generation of German pronunciation variants",
125-128.
Wesenick, Maria-Barbara / Kipp, Andreas:
"Estimating the quality of phonetic transcriptions and segmentations of speech signals",
129-132.
Petek, Bojan / Sustarsic, Rastislav / Komar, Smiljana:
"An acoustic analysis of contemporary vowels of the standard slovenian language",
133-136.
Robbe, Sandrine / Bonneau, Anne / Coste, Sylvie / Laprie, Yves:
"Using decision trees to construct optimal acoustic cues",
137-140.
Erickson, Donna / Fujimura, Osamu:
"Maximum jaw displacement in contrastive emphasis",
141-144.
Herman, Rebecca / Beckman, Mary / Honda, Kiyoshi:
"Subglottal pressure and final lowering in English",
145-148.
Kuijpers, Cecile / Donselaar, Wilma van / Cutler, Anne:
"Phonological variation: epenthesis and deletion of schwa in Dutch",
149-152.
Spoken Language Processing for Special Populations
Mahshie, James J.:
"Feedback considerations for speech training systems",
153-156.
Öster, Anne-Marie:
"Clinical applications of computer-based speech training for children with hearing impairment",
157-160.
Hazan, Valerie / Simpson, Andrew:
"Enhancing information-rich regions of natural VCV and sentence materials presented in noise",
161-164.
Hazan, Valerie / Adlard, Alan:
"Speech perceptual abilities of children with specific reading difficulty (dyslexia)",
165-168.
Paarmann, Larry D. / Wynne, Michael K.:
"Bimodal perception of spectrum compressed speech",
169-172.
Barac-Cikoja, Dragana / Revoile, Sally:
"Effect of sentential context on syllabic stress perception by hearing-impaired listeners",
173-175.
Russell, Martin / Brown, Catherine / Skilling, Adrian / Series, Rob / Wallace, Julie / Bohnam, Bill / Barker, Paul:
"Applications of automatic speech recognition to speech and language development in young children",
176-179.
Campbell, D. R.:
"Sub-band adaptive speech enhancement for hearing aids",
180-183.
Portele, Thomas / Krämer, Jürgen:
"Adapting a TTS system to a reading machine for the blind",
184-187.
Dialogue Special Sessions
Shirai, Katsuhiko:
"Modeling of spoken dialogue with and without visual information",
188-191.
Seneff, Stephanie / Goddeau, David / Pao, Christine / Polifroni, Joseph:
"Multimodal discourse modelling in a multi-user multi-domain environment",
192-195.
Kita, Kenji / Fukui, Yoshikazu / Nagata, Masaaki / Morimoto, Tsuyoshi:
"Automatic acquisition of probabilistic dialogue models",
196-199.
Paul Heisterkamp / McGlashan, Scott:
"Units of dialogue management: an example",
200-203.
Oviatt, Sharon / VanGent, Robert:
"Error resolution during multimodal human-computer interaction",
204-207.
Sarukkai, Ramesh R. / Ballard, Dana H.:
"Improved spontaneous dialogue recognition using dialogue and utterance triggers by adaptive probability boosting",
208-211.
Hübener, Kai / Jost, Uwe / Heine, Henrik:
"Speech recognition for spontaneously spoken German dialogues",
212-215.
Taylor, Paul / Shimodaira, Hiroshi / Isard, Stephen / King, Simon / Kowtko, Jacqueline:
"Using prosodic information to constrain language models for spoken dialogue",
216-219.
Heeman, Peter A. / Loken-Kim, Kyung-ho / Allen, James F.:
"Combining the detection and correction of speech repairs",
362-365.
Sagawa, Yuji / Sugimoto, Wataru / Ohnishi, Noboru:
"Generating spontaneous elliptical utterance",
366-369.
Bruce, Gösta / Filipsson, Marcus / Frid, Johan / Granström, Björn / Gustafson, Kjell / Horne, Merle / House, David / Lastow, Birgitta / Touati, Paul:
"Developing the modelling of Swedish prosody in spontaneous dialogue",
370-373.
Pan, Shimei / McKeown, Kathleen R.:
"Spoken language generation in a multimedia system",
374-377.
Hirose, Keikichi / Sakata, Mayumi / Kawanami, Hiromichi:
"Synthesizing dialogue speech of Japanese based on the quantitative analysis of prosodic features",
378-381.
Tanaka, Shuichi / Nakazato, Shu / Hoashi, Keiichiro / Shirai, Katsuhiko:
"Spoken dialogue interface in a dual task situation",
382-385.
Niimi, Yasuhisa / Kobayashi, Yutaka:
"A dialogue control strategy based on the reliability of speech recognition",
534-537.
Rudnicky, Alexander I. / Reed, Stephen / Thayer, Eric H.:
"Speechwear: a mobile speech system",
538-541.
Meng, Helen / Busayapongchai, Senis / Glass, James / Goddeau, David / Hetherington, Lee / Hurley, Edward / Pao, Christine / Polifroni, Joseph / Seneff, Stephanie / Zue, Victor:
"WHEELS: a conversational system in the automobile classifieds domain",
542-545.
Sadek, M. D. / Ferrieux, A. / Cozannet, A. / Bretier, P. / Panaget, F. / Simonin, J.:
"Effective human-computer cooperative spoken dialogue: the AGS demonstrator",
546-549.
Bennacef, S. K. / Devillers, L. / Rosset, S. / Lamel, Lori:
"Dialog in the RAILTEL telephone-based system",
550-553.
Lavie, Alon / Levin, Lori / Qu, Yan / Waibel, Alex / Gates, Donna / Gavaldŕ, Marsal / Mayfield, Laura / Taboada, Maite:
"Dialogue processing in a conversational speech translation system",
554-557.
Language Modeling
Niesler, T. R. / Woodland, P. C.:
"Combination of word-based and category-based language models",
220-223.
Valverde-Albacete, Francisco J. / Pardo, José M.:
"A multi-level lexical-semantics based language model design for guided integrated continuous speech recognition",
224-227.
Gallwitz, Florian / Nöth, Elmar / Niemann, Heinrich:
"A category based approach for recognition of out-of-vocabulary words",
228-231.
Seymore, Kristie / Rosenfeld, Ronald:
"Scalable backoff language models",
232-235.
Iyer, R. / Ostendorf, Mari:
"Modeling long distance dependence in language: topic mixtures vs. dynamic cache models",
236-239.
Federico, Marcello:
"Bayesian estimation methods for n-gram language model adaptation",
240-243.
Siu, Man-hung / Ostendorf, Mari:
"Modeling disfluencies in conversational speech",
386-389.
Miller, John / Alleva, Fil:
"Evaluation of a language model using a clustered model backoff",
390-393.
Bonafonte, Antonio / Marińo, José B.:
"Language modeling using x-grams",
394-397.
Ries, Klaus / Buo, Finn Dag / Waibel, Alex:
"Class phrase models for language modelling",
398-401.
Geutner, Petra:
"Introducing linguistic constraints into statistical language modeling",
402-405.
Hu, Jianying / Turin, William / Brown, Michael K.:
"Language modeling with stochastic automata",
406-409.
Feature Extraction for Speech Recognition
Sun, Don X.:
"Feature dimension reduction using reduced-rank maximum likelihood estimation for hidden Markov models",
244-247.
Hübener, Kai:
"Using multi-level segmentation coefficients to improve HMM speech recognition",
248-251.
Eisele, T. / Haeb-Umbach, Reinhold / Langmann, D.:
"A comparative study of linear feature transformation techniques for automatic speech recognition",
252-255.
Milner, Ben:
"Inclusion of temporal information into features for speech recognition",
256-259.
Wassner, Hubert / Chollet, Gérard:
"New cepstral representation using wavelet analysis and spectral transformation for robust speech recognition",
260-263.
Long, C. J. / Datta, S.:
"Wavelet based feature extraction for phoneme recognition",
264-267.
Drygajlo, Andrzej:
"New fast wavelet packet transform algorithms for frame synchronized speech processing",
410-413.
Umesh, S. / Cohen, L. / Marinovic, N. / Nelson, D.:
"Frequency-warping in speech",
414-417.
Kobayashi, Daisuke / Kajita, Shoji / Takeda, Kazuya / Itakura, Fumitada:
"Extracting speech features from human speech-like noise",
418-421.
Kajita, Shoji / Takeda, Kazuya / Itakura, Fumitada:
"Subband-crosscorrelation analysis for robust speech recognition",
422-425.
Bourlard, Hervé / Dupont, Stéphane:
"A new ASR approach based on independent processing and recombination of partial frequency bands",
426-429.
Nadeu, Climent / Marińo, José B. / Hernando, Javier / Nogueiras, Albino:
"Frequency and time filtering of filter-bank energies for HMM speech recognition",
430-433.
Speech Production - Measurement and Modeling
Laprie, Yves / Berger, Marie-Odile:
"Extraction of tongue contours in x-ray images with minimal user interaction",
268-271.
Demolin, Didier / Metens, Thierry / Soquet, Alain:
"Three-dimensional measurement of the vocal tract by MRI",
272-275.
Gleason, Philip / Tuller, Betty / Kelso, J. A. Scott:
"Syllable affiliation of final consonant clusters undergoes a phase transition over speaking rates",
276-278.
Lobo, Arthur / O'Malley, Michael:
"Towards a biomechanical model of the larynx",
279-282.
Morlec, Yann / Bailly, Gérard / Aubergé, Včronique:
"Generating intonation by superposing gestures",
283-286.
Kawahara, Hideki / Kato, Hiroko / Williams, J. C.:
"Effects of auditory feedback on F0 trajectory generation",
287-290.
Speech Coding / HMMs and NNs in ASR
Burnett, I. S. / Parry, J. J.:
"On the effects of accent and language on low rate speech coders",
291-294.
Pan, J. S. / McInnes, Fergus R. / Jack, Mervyn A.:
"VQ codevector index assignment using genetic algorithms for noisy channels",
295-298.
Cawley, Gavin C.:
"An improved vector quantization algorithm for speech transmission over noisy channels",
299-301.
Murgia, C. / Feng, G. / Guyader, A. Le / Quinquis, C.:
"Very low delay and high quality coding of 20 hz-15 khz speech signals at 64 kbit/s",
302-305.
Ribeiro, Carlos M. / Trancoso, Isabel M.:
"Application of speaker modification techniques to phonetic vocoding",
306-309.
Yonezaki, Tadashi / Shikano, Kiyohiro:
"Entropy coded vector quantization with hidden Markov models",
310-313.
Kohata, Minoru:
"An application of recurrent neural networks to low bit rate speech coding",
314-317.
Koishida, Kazuhito / Tokuda, Keiichi / Kobayashi, Takao / Imai, Satoshi:
"CELP coding system based on mel-generalized cepstral analysis",
318-321.
Chan, Cheung-Fat / Hui, Wai-Kwong:
"Wideband re-synthesis of narrowband CELP-coded speech using multiband excitation model",
322-325.
Koizumi, Takuya / Mori, Mikio / Taniguchi, Shuji / Maruya, Mitsutoshi:
"Recurrent neural networks for phoneme recognition",
326-329.
Mokhtar, M. A. / Zein-el-Abddin, A.:
"A model for the acoustic phonetic structure of arabic language using a single ergodic hidden Markov model",
330-333.
Gong, Yifan / Illina, Irina / Haton, Jean-Paul:
"Modelling long term variability information in mixture stochastic trajectory framework",
334-337.
Moudenc, T. / Sokol, R. / Mercier, Guy:
"Segmental phonetic features recognition by means of neural-fuzzy networks and integration in an n-best solutions post-processing",
338-341.
Illina, Irina / Gong, Yifan:
"Stochastic trajectory model with state-mixture for continuous speech recognition",
342-345.
Hild, Hermann / Waibel, Alex:
"Recognition of spelled names over the telephone",
346-349.
Boulianne, Gilles / Kenny, Patrick:
"Optimal tying of HMM mixture densities using decision trees",
350-353.
Choi, Hwan Jin / Oh, Yung Hwan:
"Speech recognition using an enhanced FVQ based on a codeword dependent distribution normalization and codeword
weighting by fuzzy objective function",
354-357.
Kurimo, Mikko / Somervuo, Panu:
"Using the self-organizing map to speed up the probability density estimation for speech recognition with mixture density HMMs",
358-361.
Vowels
Lang, Carrie E. / Ohala, John J.:
"Temporal cues for vowels and universals of vowel inventories",
434-437.
Syrdal, Ann K.:
"Acoustic variability in spontaneous conversational speech of american English talkers",
438-441.
Willerman, Raquel / Kuhl, Patricia K.:
"Cross-language speech perception: Swedish, English, and Spanish speakers' perception of front rounded vowels",
442-445.
Ingram, John C. L. / Park, See-Gyoon:
"Inter-language vowel perception and production by Korean and Japanese listeners",
446-449.
Kewley-Port, Diane / Akahane-Yamada, Reiko / Aikawa, Kiyoaki:
"Intelligibility and acoustic correlates of Japanese accented English vowels",
450-453.
Yoneyama, Kiyoko:
"Segmentation strategies for spoken language recognition: evidence from semi-bilingual Japanese speakers of English",
454-457.
NNs and Stochastic Modeling
Lee, Geunbae / Lee, Jong-Hyeok / Park, Kyubong / Kim, Byung-Chang:
"Integrating connectionist, statistical and symbolic approaches for continuous spoken Korean processing",
458-461.
Hermansky, Hynek / Timberwala, Sangita / Pavel, Misha:
"Towards ASR on partially corrupted speech",
462-465.
Gish, Herbert / Ng, Kenney:
"Parametric trajectory models for speech recognition",
466-469.
Knill, K. M. / Gales, M. J. F. / Young, S. J.:
"Use of Gaussian selection in large vocabulary continuous speech recognition using HMMs",
470-473.
Hogberg, J. / Sjölander, Kare:
"Cross phone state clustering using lexical stress and context",
474-477.
Lleida-Solano, Eduardo / Rose, Richard C.:
"Likelihood ratio decoding and confidence measures for continuous speech recognition",
478-481.
Ma, Xiaohui / Gong, Yifan / Fu, Yuqing / Lu, Jiren / Haton, Jean-Paul:
"A study on continuous Chinese speech recognition based on stochastic trajectory models",
482-485.
Itoh, Yoshiaki / Kiyama, Jiro / Kojima, Hiroshi / Seki, Susumu / Oka, Ryuichi:
"A proposal for a new algorithm of reference interval-free continuous DP for real-time speech or text retrieval",
486-489.
Ito, Akinori / Kohda, Masaki:
"Language modeling by string pattern n-gram for Japanese speech recognition",
490-493.
Kneser, Reinhard:
"Statistical language modeling using a variable context length",
494-497.
Johansen, Finn Tore:
"A comparison of hybrid HMM architectures using global discriminative training",
498-501.
Wei, Wei / Barnard, Etienne / Fanty, Mark:
"Improved probability estimation with neural network models",
502-505.
Yu, Ha-Jin / Oh, Yung-Hwan:
"A neural network using acoustic sub-word units for continuous speech recognition",
506-509.
Bosch, Louis F. M. ten / Smits, Roel:
"On the error criteria in neural networks as a tool for human classification modelling",
510-513.
Ramsay, Gordon:
"A non-linear filtering approach to stochastic training of the articulatory-acoustic mapping using the EM algorithm",
514-517.
Yang, Y. P. / Deller Jr., J. R.:
"A tool for automated design of language models",
518-521.
Freitag, F. / Monte, E.:
"Acoustic-phonetic decoding based on elman predictive neural networks",
522-525.
Lee, Tan / Ching, P. C.:
"On improving discrimination capability of an RNN based recognizer",
526-529.
Wakita, Yumi / Kawai, Jun / Iida, Hitoshi:
"An evaluation of statistical language modeling for speech recognition using a mixed category of both words and parts-of-speech",
530-533.
Neural Models of Speech Processing
Aleksandrovsky, Boris / Whitson, James / Andes, Gretchen / Lynch, Gary / Granger, Richard:
"Novel speech processing mechanism derived from auditory neocortical circuit analysis",
558-561.
Tang, Ping / Rouat, Jean:
"Modeling neurons in the anteroventral cochlear nucleus for amplitude modulation (AM) processing: application to speech sound",
562-565.
Vereecken, Halewijn / Martens, Jean-Pierre:
"Noise suppression and loudness normalization in an auditory model-based acoustic front-end",
566-569.
Hant, Jim / Strope, Brian / Alwan, Abeer:
"A psychoacoustic model for the noise masking of voiceless plosive bursts",
570-573.
Hunke, Martin / Holton, Thomas:
"Training machine classifiers to match the performance of human listeners in a natural vowel classification task",
574-577.
Aikawa, Kiyoaki / Kawahara, Hideki / Tsuzaki, Minoru:
"A neural matrix model for active tracking of frequency-modulated tones",
578-581.
Utterance Verification and Word Spotting
Rose, Richard C. / Lleida-Solano, Eduardo / Erhart, G. W. / Grubbe, R. V.:
"A user-configurable system for voice label recognition",
582-585.
Gelin, Philippe / Wellekens, Chris. J.:
"Keyword spotting enhancement for video soundtrack indexing",
586-589.
Méliani, Rachida El / O'Shaughnessy, Douglas:
"New efficient fillers for unlimited word recognition and keyword spotting",
590-593.
Spina, Michelle S. / Zue, Victor:
"Automatic transcription of general audio data: preliminary analyses",
594-597.
Kubala, Francis / Anastasakos, Tasos / Jin, Hubert / Nguyen, Long / Schwartz, Richard:
"Transcribing radio news",
598-601.
Setlur, Anand R. / Sukkar, Rafid A. / Jacob, John:
"Correcting recognition errors via discriminative utterance verification",
602-605.
Acquisition/Learning Training L2 Learners
Akahane-Yamada, Reiko / Tohkura, Yoh'ichi / Bradlow, Ann R. / Pisoni, David B.:
"Does training in speech perception modify speech production?",
606-609.
Ueyama, Motoko:
"Phrase-final lengthening and stress-timed shortening in the speech of native speakers and Japanese learners of English",
610-613.
Yamada, Nobuko:
"Japanese accentuations by foreign students and Japanese speakers of non-tokyo dialect",
614-617.
Varden, J. Kevin / Sato, Tsutomu:
"Devoicing of Japanese vowels by taiwanese learners of Japanese",
618-621.
Archambault, Daničle / Foucher, Catherine / Maneva, Blagovesta:
"Fluency and use of segmental dialect features in the acquisition of a second language (French) by English speakers",
622-625.
Martland, P. / Whiteside, Sandra P. / Beet, Steve W. / Baghai-Ravary, L.:
"Estimating child and adolescent formant frequency values from adult data",
626-629.
Focus, Stress and Accent
Sluijter, Agaath M. C. / Heuven, Vincent J. van:
"Acoustic correlates of linguistic stress and accent in dutch and american English",
630-633.
Fujisaki, Hiroya / Ohno, Sumio / Tomita, Osamu:
"On the levels of accentuation in spoken Japanese",
634-637.
Thibault, Linda / Ouellet, Marise:
"Tonal distinctions between emphatic stress and pretonic lengthening in quebec French",
638-641.
Elsner, Anja (Petzold):
"Distinction between 'normal' focus and 'contrastive/emphatic' focus",
642-645.
Nishinuma, Yukihiro / Arai, Masako / Ayusawa, Takako:
"Perception of tonal accent by americans learning Japanese",
646-649.
Shriberg, Elizabeth / Ladd, D. Robert / Terken, Jacques:
"Modeling intra-speaker pitch range variation: predicting F0 targets when "speaking up"",
650-653.
Spoken Language Dialogue and Conversation
Reithinger, Norbert / Engel, Ralf / Kipp, Michael / Klesen, Martin:
"Predicting dialogue acts for a speech-to-speech translation system",
654-657.
Müller, Johannes / Stahl, Holger / Lang, Manfred:
"Automatic speech translation based on the semantic structure",
658-661.
Norton, Lewis M. / Weir, Carl E. / Scholz, K. W. / Dahl, Deborah A. / Bouzid, Ahmed:
"A methodology for application development for spoken language systems",
662-664.
Seneff, Stephanie / Polifroni, Joseph:
"A new restaurant guide conversational system: issues in rapid prototyping for specialized domains",
665-668.
Kumamoto, Tadahiko / Ito, Akira:
"Semantic interpretation of a Japanese complex sentence in an advisory dialogue - focused on the postpositional word "KEDO," which works as a conjunction between clauses",
669-672.
Hong, Youngkuk / Koo, Myoung-Wan / Yang, Gijoo:
"A Korean morphological analyzer for speech translation system",
673-676.
Carlson, Rolf / Hunnicutt, Sheri:
"Generic and domain-specific aspects of the waxholm NLP and dialog modules",
677-680.
Kameyama, Megumi / Kawai, Goh / Arima, Isao:
"A real-time system for summarizing human-human spontaneous spoken dialogues",
681-684.
Hildebrandt, Bernd / Rautenstrauch, Heike / Sagerer, Gerhard:
"Evaluation of spoken language understanding and dialogue systems",
685-688.
Kakita, Kuniko:
"Inter-speaker interaction of F0 in dialogs",
689-692.
Brandt-Pook, Hans / Fink, Gernot A. / Hildebrandt, Bernd / Kummert, Franz / Sagerer, Gerhard:
"A robust dialogue system for making an appointment",
693-696.
Takagi, Kazuyuki / Itahashi, Shuichi:
"Segmentation of spoken dialogue by interjections, disfluent utterances and pauses",
697-700.
Goddeau, David / Meng, Helen / Polifroni, Joseph / Seneff, Stephanie / Busayapongchai, Senis:
"A form-based dialogue manager for spoken language applications",
701-704.
Whittaker, S. J. / Attwater, D. J.:
"The design of complex telephony applications using large vocabulary speech technology",
705-708.
Sutton, Stephen / Novick, David G. / Cole, Ronald A. / Vermeulen, Pieter / Villiers, Jacques de / Schalkwyk, Johan / , Mark Fanty / Fanty, Mark:
"Building 10,000 spoken dialogue systems",
709-712.
Yang, Yen-Ju / Chien, Lee-Feng / Lee, Lin-Shan:
"Speaker intention modeling for large vocabulary Mandarin spoken dialogues",
713-716.
Kenne, P. E. / O'Kane, Mary:
"Hybrid language models and spontaneous legal discourse",
717-720.
Kenne, P. E. / O'Kane, Mary:
"Topic change and local perplexity in spoken legal dialogue",
721-724.
Venditti, Jennifer J. / Swerts, Marc:
"Intonational cues to discourse structure in Japanese",
725-728.
Bernsen, Niels Ole / Dybkjćr, Hans / Dybkjćr, Laila:
"Principles for the design of cooperative spoken human-machine dialogue",
729-732.
Jenkin, Karen L. / Scordilis, Michael S.:
"Development and comparison of three syllable stress classifiers",
733-736.
Speech Disorders
Jamieson, D. G. / Deng, Li / Price, M. / Parsa, Vijay / Till, J.:
"Interaction of speech disorders with speech coders: effects on speech intelligibility",
737-740.
Vieira, Maurílio N. / Maran, Arnold G. D. / McInnes, Fergus R. / Jack, Mervyn A.:
"Detecting arytenoid cartilage misplacement through acoustic and electroglottographic jitter analysis",
741-744.
Vieira, Maurílio N. / McInnes, Fergus R. / Jack, Mervyn A.:
"Robust F0 and jitter estimation in pathological voices",
745-748.
Plante, F. / Kessler, H. / Cheetham, B. M. G. / Earis, J.:
"Speech monitoring of infective laryngitis",
749-752.
Schoentgen, Jean / Guchteneere, Raoul de:
"Searching for nonlinear relations in whitened jitter time series",
753-756.
Gavidia-Ceballos, Liliana / Hansen, John H. L. / Kaiser, James F.:
"Vocal fold pathology assessment using AM autocorrelation analysis of the teager energy operator",
757-760.
Kuehn, David P.:
"Continuous positive airway pressure (CPAP) in the treatment of hypernasality",
761-763.
Espy-Wilson, Carol Y. / Chari, Venkatesh R. / Huang, Caroline B.:
"Enhancement of alaryngeal speech by adaptive filtering",
764-767.
Deng, Li / Shen, Xuemin / Jamieson, D. G. / Till, J.:
"Simulation of disordered speech using a frequency-domain vocal tract model",
768-771.
Endo, Yasuo / Kasuya, Hideki:
"A stochastic model of fundamental period perturbation and its application to perception of pathological voice quality",
772-775.
Wallen, Eric J. / Hansen, John H. L.:
"A screening test for speech pathology assessment using objective quality measures",
776-779.
Cairns, Douglas A. / Hansen, John H. L. / Kaiser, James F.:
"Recent advances in hypernasal speech detection using the nonlinear teager energy operator",
780-783.
Vocal Tract Geometry
Honda, Kiyoshi / Maeda, Shinji / Hashi, Michiko / Dembowski, Jim / Westbury, John R.:
"Human palate and related structures: their articulatory consequences",
784-787.
Davis, Edward P. / Douglas, Andrew / Stone, Maureen:
"A continuum mechanics representation of tongue deformation",
788-792.
Bangayan, Philbert / Alwan, Abeer / Narayanan, Shrikanth:
"From MRI and acoustic data to articulatory synthesis: a case study of the lateral approximants in american English",
793-796.
Narayanan, Shrikanth / Kaun, Abigail / Byrd, Dani / Ladefoged, Peter / Alwan, Abeer:
"Liquids in tamil",
797-800.
Yang, Chang-Sheng / Kasuya, Hideki:
"Speaker individualities of vocal tract shapes of Japanese vowels measured by magnetic resonance images",
949-952.
El-Masri, S. / Pelorson, X. / Saguet, P. / Badin, Pierre:
"Vocal tract acoustics using the transmission line matrix (TLM) method",
953-956.
Bailly, Gérard:
"Building sensori-motor prototypes from audiovisual exemplars",
957-960.
Bĺvegĺrd, Mats / Fant, Gunnar:
"Parameterized VT area function inversion",
961-964.
Dang, Jianwu / Honda, Kiyoshi:
"An improved vocal tract model of vowel production implementing piriform resonance and transvelar nasal coupling",
965-968.
Blackburn, C. S. / Young, S. J.:
"Pseudo-articulatory speech synthesis for recognition using automatic feature extraction from x-ray data",
969-972.
Prosody in ASR and Segmentation
Oviatt, Sharon / Levow, Gina-Anne / MacEachern, Margaret / Kuhn, Karen:
"Modeling hyperarticulate speech during human-computer error resolution",
801-804.
Potisuk, Siripong / Harper, Mary P. / Gandour, Jackson T.:
"Using stress to disambiguate spoken Thai sentences containing syntactic ambiguity",
805-808.
Hsieh, Hung-yun / Lyu, Ren-yuan / Lee, Lin-shan:
"Use of prosodic information to integrate acoustic and linguistic knowledge in continuous Mandarin speech recognition with very large vocabulary",
809-812.
Rao, G. V. Ramana / Srichand, J.:
"Word boundary detection using pitch variations",
813-816.
Sakurai, Atsuhiro / Hirose, Keikichi:
"Detection of phrase boundaries in Japanese by low-pass filtering of fundamental frequency contours",
817-820.
Pagel, Vincent / Carbonell, Noelle / Laprie, Yves:
"A new method for speech delexicalization, and its application to the perception of French prosody",
821-824.
Acquisition and Learning by Machine
Bub, Udo:
"Task adaptation for dialogues via telephone lines",
825-828.
Cole, Ronald A. / Yan, Yonghong / Bailey, Troy:
"The influence of bigram constraints on word recognition by humans: implications for computer speech recognition",
829-832.
Kobayashi, Tetsunori:
"ALICE: acquisition of language in conversational environment - an approach to weakly supervised training of spoken language system for language porting",
833-836.
Yoshimura, Takashi / Hayamizu, Satoru / Ohmura, Hiroshi / Tanaka, Kazuyo:
"Pitch pattern clustering of user utterances in human-machine dialogue",
837-840.
Amengual, J. C. / Vidal, Enrique / Benedí, J. M.:
"Simplifying language through error-correcting decoding",
841-844.
Cettolo, Mauro / Corazza, Anna / Mori, Renato De:
"A mixed approach to speech understanding",
845-848.
Dialogue Systems
Gauvain, Jean-Luc / Gangolf, J. J. / Lamel, Lori:
"Speech recognition for an information kiosk",
849-852.
Strik, Helmer / Russel, Albert / Heuvel, Henk van den / Cucchiarini, Catia / Boves, Louis:
"Localizing an automatic inquiry system for public transport information",
853-856.
Marcus, Stephen M. / Brown, Deborah W. / Goldberg, Randy G. / Schoeffler, Max S. / Wetzel, William R. / Rosinski, Richard R.:
"Prompt constrained natural language - evolving the next generation of telephony services",
857-860.
Kawahara, Tatsuya / Lee, Chin-Hui / Juang, Biing-Hwang:
"Key-phrase detection and verification for flexible speech understanding",
861-864.
Suhm, Bernhard / Myers, Brad / Waibel, Alex:
"Interactive recovery from speech recognition errors in speech user interfaces",
865-868.
Issar, Sunil:
"Estimation of language models for new spoken language applications",
869-872.
Speech Enhancement and Robust Processing
Shen, Xuemin / Deng, Li / Yasmin, Anisa:
"H-infinity filtering for speech enhancement",
873-876.
Vaseghi, Saeed V. / Milner, Ben:
"A comparitive analysis of channel-robust features and channel equalization methods for speech recognition",
877-880.
Shen, Jia-lin / Hwang, Wen-liang / Lee, Lin-shan:
"Robust speech recognition features based on temporal trajectory filtering of frequency band spectrum",
881-884.
Power, Kevin:
"Durational modelling for improved connected digit recognition",
885-888.
Avendano, Carlos / Hermansky, Hynek:
"Study on the dereverberation of speech based on temporal envelope filtering",
889-892.
Brants, Thorsten:
"Estimating Markov model structures",
893-896.
Ringger, Eric K. / Allen, James F.:
"A fertility channel model for post-correction of continuous speech recognition",
897-900.
Yasukawa, Hiroshi:
"Restoration of wide band signal from telephone speech using linear prediction error processing",
901-904.
Matsumoto, Hiroshi / Naitoh, Noboru:
"Smoothed spectral subtraction for a frequency-weighted HMM in noisy speech recognition",
905-908.
Woods, William S. / Hansen, Martin / Wittkop, Thomas / Kollmeier, Birger:
"A simple architecture for using multiple cues in sound separation",
909-912.
Petek, Bojan / Andersen, Ove / Dalsgaard, Paul:
"On the robust automatic segmentation of spontaneous speech",
913-916.
Miglietta, C. G. / Mokbel, C. / Jouvet, D. / Monné, J.:
"Bayesian adaptation of speech recognizers to field speech data",
917-920.
Darlington, A. J. / Campbell, D. J.:
"Sub-band adaptive filtering applied to speech enhancement",
921-924.
Openshaw, J. P. / Mason, John S.:
"Noise robust estimate of speech dynamics for speaker recognition",
925-928.
Ortega-García, Javier / González-Rodríguez, Joaquín:
"Overview of speech enhancement techniques for automatic speaker recognition",
929-932.
Harte, Naomi / Vaseghi, Saeed V. / Milner, Ben:
"Dynamic features for segmental speech recognition",
933-936.
Koizumi, Takuya / Mori, Mikio / Taniguchi, Shuji:
"Speech recognition based on a model of human auditory system",
937-940.
Salavedra, J. M. / Masgrau, E.:
"APVQ encoder applied to wideband speech coding",
941-944.
Zhou, Jin / Shoham, Yair / Akansu, Ali:
"Simple fast vector quantization of the line spectral frequencies",
945-948.
Speaker Adaptation and Normalization I
Matsui, Tomoko / Furui, Sadaoki:
"N-best-based instantaneous speaker adaptation method for speech recognition",
973-976.
Montacié, C. / Caraty, M.-J. / Barras, C.:
"Mixture splitting technic and temporal control in a HMM-based recognition system",
977-980.
Yao, Lei / Yu, Dong / Huang, Taiyi:
"A unified spectral transformation adaptation approach for robust speech recognition",
981-984.
Huo, Qiang / Lee, Chin-Hui:
"On-line adaptive learning of the correlated continuous density hidden Markov models for speech recognition",
985-988.
Ström, Nikko:
"Speaker adaptation by modeling the speaker variation in a continuous speech recognition system",
989-992.
Ariki, Yasuo / Tagashira, Shigeaki:
"An enquiring system of unknown words in TV news by spontaneous repetition (application of speaker normalization by speaker subspace projection)",
993-996.
Zhang, Jin-Song / Dai, Beiqian / Wang, Changfu / Kwan, Hingkeung / Hirose, Keikichi:
"Adaptive recognition method based on posterior use of distribution pattern of output probabilities",
1129-1132.
Woodland, P. C. / Pye, D. / Gales, M. J. F.:
"Iterative unsupervised adaptation using maximum likelihood linear regression",
1133-1136.
Anastasakos, Tasos / McDonough, John / Schwartz, Richard / Makhoul, John:
"A compact model for speaker-adaptive training",
1137-1140.
Homma, Shigeru / Takahashi, Jun-ichi / Sagayama, Shigeki:
"Iterative unsupervised speaker adaptation for batch dictation",
1141-1144.
Burnett, Daniel C. / Fanty, Mark:
"Rapid unsupervised adaptation to children's speech on a connected-digit task",
1145-1148.
Ishii, Jun / Tonomura, Masahiro / Matsunaga, Shoichi:
"Speaker adaptation using tree structured shared-state HMMs",
1149-1152.
Spoken Language and NLP
Schwartz, Richard / Miller, Scott / Stallard, David / Makhoul, John:
"Language understanding using hidden understanding models",
997-1000.
Gorin, Allen L.:
"Processing of semantic information in fluently spoken language",
1001-1004.
Stolcke, Andreas / Shriberg, Elizabeth:
"Automatic linguistic segmentation of conversational speech",
1005-1008.
Boros, M. / Eckert, W. / Gallwitz, Florian / Görz, Günther / Hanrieder, G. / Niemann, Heinrich:
"Towards understanding spontaneous speech: word accuracy vs. concept accuracy",
1009-1012.
Minker, Wolfgang / Bennacef, S. K. / Gauvain, Jean-Luc:
"A stochastic case frame approach for natural language understanding",
1013-1016.
Seide, Frank / Rüber, Bernhard / Kellner, Andreas:
"Improving speech understanding by incorporating database constraints and dialogue history",
1017-1020.
Buo, Finn Dag / Waibel, Alex:
"Learning to parse spontaneous speech",
1153-1156.
Antoine, Jean-Yves:
"Spontaneous speech and natural language processing ALPES: a robust semantic-led parser",
1157-1160.
Alvarez-Cercadillo, J. / Caminero-Gil, J. / Crespo-Casas, C. / Tapias-Merino, D.:
"The natural language processing module for a voice assisted operator at telefnica i+D",
1161-1164.
Berton, André / Fetter, Pablo / Regel-Brietzmann, Peter:
"Compound words in large-vocabulary German speech recognition systems",
1165-1168.
Batliner, Anton / Feldhaus, A. / Geissler, S. / Kiss, T. / Kompe, Ralf / Nöth, Elmar:
"Prosody, empty categories and parsing - a success story",
1169-1172.
Srinivas, B.:
""almost parsing" technique for language modeling",
1173-1176.
Spoken Discourse Analysis/Synthesis
Chino, Tetsuro / Tsuboi, Hiroyuki:
"A new discourse structure model for spontaneous spoken dialogue",
1021-1024.
Duff, David / Gates, Barbara / LuperFoy, Susann:
"An architecture for spoken dialogue management",
1025-1028.
Donzel, Monique E. van / Koopmans-van Beinum, Florien J.:
"Pausing strategies in discourse in dutch",
1029-1032.
Swerts, Marc / Wichmann, Anne / Beun, Robbert-Jan:
"Filled pauses as markers of discourse structure",
1033-1036.
Seong, Cheol-jae / Hahn, Minsoo:
"The prosodic analysis of Korean dialogue speech - through a comparative study with read speech",
1037-1040.
O'Kane, Mary / Kenne, P. E.:
"Changing the topic: how long does it take?",
1041-1044.
Acoustic Modeling
Westendorf, Christian-Michael / Jelitto, Jens:
"Learning pronunciation dictionary from speech data",
1045-1048.
Rathinavelu, C. / Deng, Li:
"The trended HMM with discriminative training for phonetic classification",
1049-1052.
Lazaridčs, Ariane / Normandin, Yves / Kuhn, Roland:
"Improving decision trees for acoustic modeling",
1053-1056.
Li, Gongjun / Huang, Taiyi:
"An improved training algorithm in HMM-based speech recognition",
1057-1060.
Ming, J. / O'Boyle, P. / McMahon, J. / Smith, F. J.:
"Speech recognition using a strong correlation assumption for the instantaneous spectra",
1061-1064.
Pachčs-Leal, Pau / Nadeu, Climent:
"On parameter filtering in continuous subword-unit-based speech recognition",
1065-1068.
Okawa, Shigeki / Shirai, Katsuhiko:
"Estimation of statistical phoneme center considering phonemic environments",
1069-1072.
Wang, Xue / Bosch, Louis F. M. ten / Pols, Louis C. W.:
"Integration of context-dependent durational knowledge into HMM-based speech recognition",
1073-1076.
Fukada, T. / Bacchiani, M. / Paliwal, Kuldip K. / Sagisaka, Yoshinori:
"Speech recognition based on acoustically derived segment units",
1077-1080.
Vergin, Rivarol / Farhat, Azarshid / O'Shaughnessy, Douglas:
"Robust gender-dependent acoustic-phonetic modelling in continuous speech recognition based on a new automatic male/female classification",
1081-1084.
Yang, Tae Young / Shin, Won Ho / Kim, Weon Goo / Youn, Dae Hee:
"A codebook adaptation algorithm for SCHMM using formant distribution",
1085-1088.
Simonin, J. / Bodin, S. / Jouvet, D. / Bartkova, K.:
"Parameter tying for flexible speech recognition",
1089-1092.
Nitta, Tsuneo / Tanaka, Shin'ichi / Masai, Yasuyuki / Matsu'ura, Hiroshi:
"Word-spotting based on inter-word and intra-word diphone models",
1093-1096.
Bonafonte, Antonio / Vidal, Josep / Nogueiras, Albino:
"Duration modeling with expanded HMM applied to speech recognition",
1097-1100.
Córdoba, Ricardo de / Pardo, José M.:
"Different strategies for distribution clustering using discrete, semicontinuous and continuous HMMs in CSR",
1101-1104.
Zeljkovic, Ilija / Narayanan, Shrikanth:
"Improved HMM phone and triphone models for realtime ASR telephony applications",
1105-1108.
Minami, Yasuhiro / Furui, Sadaoki:
"Improved extended HMM composition by incorporating power variance",
1109-1112.
Ramsay, Gordon / Deng, Li:
"Optimal filtering and smoothing for speech recognition using a stochastic target model",
1113-1116.
Hu, Zhihong / Schalkwyk, Johan / Barnard, Etienne / Cole, Ronald A.:
"Speech recognition using syllable-like units",
1117-1120.
Junqua, Jean-Claude / Vassallo, Lorenzo:
"Context modeling and clustering in continuous speech recognition",
2262-2265.
Deng, Li / Wu, Jim Jian-Xiong:
"Hierarchical partition of the articulatory state space for overlapping-feature based speech recognition",
2266-2269.
Oppizzi, Olivier / Fournier, David / Gilles, Philippe / Méloni, Henri:
"A fuzzy acoustic-phonetic decoder for speech recognition",
2270-2273.
Kirchhoff, Katrin:
"Syllable-level desynchronisation of phonetic features for speech recognition",
2274-2276.
Glass, James / Chang, Jane / McCandless, Michael:
"A probabilistic framework for feature-based speech recognition",
2277-2280.
Wu, Jim Jian-Xiong / Deng, Li / Chan, Jacky:
"Modeling context-dependent phonetic units in a continuous speech recognition system for Mandarin Chinese",
2281-2284.
Physics and Simulation of the Vocal Tract
Coker, Cecil H. / Krane, M. H. / Reis, B. Y. / Kubli, R. A.:
"Search for unexplored effects in speech production",
1121-1124.
Badin, Pierre / Abry, Christian:
"Articulatory synthesis from x-rays and inversion for an adaptive speech robot",
1125-1128.
Suzuki, Hisayoshi / Nakai, Takayoshi / Sakakibara, Hirosi:
"Analysis of acoustic properties of the nasal tract using 3-d FEM",
1285-1288.
Liljencrants, Johan:
"Experiments with analysis by synthesis of glottal airflow",
1289-1292.
Duration and Rhythm
Ouellet, Marise / Tardif, Benoît:
"From segmental duration properties to rhythmic structure: a study of interactions between high and low level",
1177-1180.
Wang, Xue / Pols, Louis C. W. / Bosch, Louis F. M. ten:
"Analysis of context-dependent segmental duration for automatic speech recognition",
1181-1184.
Dahan, Delphine:
"The role of the rhythmic groups in the segmentation of continuous French speech",
1185-1188.
McRobbie-Utasi, Zita:
"The implications of temporal patterns for the prosody of boundary signaling in connected speech",
1189-1192.
Lee, Hyunbok / Seong, Cheol-jae:
"Experimental phonetic study of the syllable duration of Korean with respect to the positional effect",
1193-1196.
Hermes, Dik J.:
"Timing of pitch movements and accentuation of syllables",
1197-1200.
Acoustic Analysis
Ying, Goangshiuan S. / Jamieson, Leah H. / Michell, Carl D.:
"A probabilistic approach to AMDF pitch detection",
1201-1204.
Soquet, Alain / Lecuit, Véronique / Metens, Thierry / Demolin, Didier:
"From sagittal cut to area function: an RMI investigation",
1205-1208.
Janer, Léonard / Bonet, Juan José / Lleida-Solano, Eduardo:
"Pitch detection and voiced/unvoiced decision algorithm based on wavelet transforms",
1209-1212.
Stylianou, Yannis:
"Decomposition of speech signals into a deterministic and a stochastic part",
1213-1216.
Jo, Cheol-Woo / Bang, Ho-Gyun / Ainsworth, William A.:
"Improved glottal closure instant detector based on linear prediction and standard pitch concept",
1217-1220.
Wang, Xihong / Zahorian, Stephen A. / Auberg, Stefan:
"Analysis of speech segments using variable spectral/temporal resolution",
1221-1224.
Eberman, Brian / Goldenthal, William:
"Time-based clustering for phonetic segmentation",
1225-1228.
Zolfaghari, Parham / Robinson, Tony:
"Formant analysis using mixtures of Gaussians",
1229-1232.
Richards, Hywel B. / Mason, John S. / Hunt, Melvyn J. / Bridle, John S.:
"Deriving articulatory representations from speech with various excitation modes",
1233-1236.
Sharma, Manish / Mammone, Richard J.:
""blind" speech segmentation: automatic segmentation of speech without linguistic knowledge",
1237-1240.
Ohmura, Hiroshi / Tanaka, Kazuyo:
"Speech synthesis using a nonlinear energy damping model for the vocal folds vibration effect",
1241-1244.
Namba, Munehiro / Kamata, Hiroyuki / Ishida, Yoshihisa:
"Neural networks learning with L1 criteria and its efficiency in linear prediction of speech signals",
1245-1248.
Esposito, Anna / Ezin, C. E. / Ceccarelli, M.:
"Preprocessing and neural classification of English stop consonants [b,d,g,p,t,k]",
1249-1252.
Ananthakrishnan, K. S.:
"A comparison of modified k-means(MKM) and NN based real time adaptive clustering algorithms for articulatory space codebook formation",
1253-1256.
Ding, Wen / Kasuya, Hideki:
"A novel approach to the estimation of voice source and vocal tract parameters from speech signals",
1257-1260.
Pfitzinger, Hartmut R. / Burger, Susanne / Heid, Sebastian:
"Syllable detection in read and spontaneous speech",
1261-1264.
Wang, Kuansan / Lee, Chin-Hui / Juang, Biing-Hwang:
"Maximum likelihood learning of auditory feature maps for stationary vowels",
1265-1268.
Bonafonte, Antonio / Nogueiras, Albino / Rodriguez-Garrido, Antonio:
"Explicit segmentation of speech using Gaussian models",
1269-1272.
Mousset, E. / Ainsworth, William A. / Fonollosa, José A. R.:
"A comparison of several recent methods of fundamental frequency and voicing decision estimation",
1273-1276.
Abe, Toshihiko / Kobayashi, Takao / Imai, Satoshi:
"Robust pitch estimation with harmonics enhancement in noisy environments based on instantaneous frequency",
1277-1280.
Moreno, Asunción / Rutllán, Miquel:
"Integrated polispectrum on speech recognition",
1281-1284.
Speech Recognition Using HMMs and NNs
Neto, Joao P. / Martins, Ciro A. / Almeida, Luís B.:
"An incremental speaker-adaptation technique for hybrid HMM-MLP recognizer",
1293-1296.
Suh, Youngjoo / Lee, Youngjik:
"Phoneme segmentation of continuous speech using multi-layer perceptron",
1297-1300.
Bilmes, Jeff / Morgan, Nelson / Wu, Su-Lin / Bourlard, Hervé:
"Stochastic perceptual speech models with durational dependence",
1301-1304.
Cook, G. D. / Robinson, A. J.:
"Boosting the performance of connectionist large vocabulary speech recognition",
1305-1308.
Pican, Nicolas / Fohr, Dominique / Mari, Jean-François:
"HMMs and OWE neural network for continuous speech recognition",
1309-1312.
Waterhouse, Steve / Kershaw, Dan / Robinson, Tony:
"Smoothed local adaptation of connectionist systems",
1313-1316.
Adverse Environments and Multiple Microphones
Yamada, Takeshi / Nakamura, Satoshi / Shikano, Kiyohiro:
"Robust speech recognition with speaker localization by a microphone array",
1317-1320.
Jan, Ea-Ee / Flanagan, James L.:
"Sound source localization in reverberant environments using an outlier elimination algorithm",
1321-1324.
Kershaw, Dan / Robinson, Tony / Renals, Steve:
"The 1995 abbot LVCSR system for multiple unknown microphones",
1325-1328.
Giuliani, D. / Omologo, Maurizio / Svaizer, P.:
"Experiments of speech recognition in a noisy and reverberant environment using a microphone array and HMM",
1329-1332.
González-Rodríguez, Joaquín / Ortega-García, Javier / Martin, César / Hernández, Luis:
"Increasing robustness in GMM speaker recognition systems for noisy and reverberant speech with low complexity microphone arrays",
1333-1336.
Yen, Kuan-Chieh / Zhao, Yunxin:
"Robust automatic speech recognition using a multi-channel signal separation front-end",
1337-1340.
Prosodic Synthesis in Dialogue
Lindström, Anders / Bretan, Ivan / Ljungqvist, Mats:
"Prosody generation in text-to-speech conversion using dependency graphs",
1341-1344.
Asano, Hisako / Ohara, Hisashi / Ooyama, Yoshifumi:
"Extraction method of non-restrictive modification in Japanese as a marked factor of prosody",
1345-1348.
Prevost, Scott:
"Modeling contrast in the generation and synthesis of spoken language",
1349-1352.
Tsukada, Hajime:
"A left-to-right processing model of pausing in Japanese based on limited syntactic information",
1353-1356.
Galanis, D. / Darsinos, V. / Kokkinakis, George:
"Modeling of intonation bearing emphasis for TTS-synthesis of greek dialogues",
1357-1360.
Heuft, Barbara / Portele, Thomas:
"Synthesizing prosody: a prominence-based approach",
1361-1364.
Speech Synthesis
Sproat, Richard:
"Multilingual text analysis for text-to-speech synthesis",
1365-1368.
Ooyama, Yoshifumi / Asano, Hisako / Matsuoka, Koji:
"Spoken-style explanation generator for Japanese kanji using a text-to-speech system",
1369-1372.
Magata, Ken-ichi / Hamagami, Tomoki / Komura, Mitsuo:
"A method for estimating prosodic symbol from text for Japanese text-to-speech synthesis",
1373-1376.
López-Gonzalo, E. / Rodríguez-García, J. M.:
"Statistical methods in data-driven modeling of Spanish prosody for text to speech",
1377-1380.
Lee, Jung-Chul / Lee, Youngjik / Kim, Sang-Hun / Hahn, Minsoo:
"Intonation processing for TTS using stylization and neural network learning method",
1381-1384.
Black, Alan W. / Hunt, Andrew J.:
"Generating F0 contours from toBI labels using linear regression",
1385-1388.
Wang, Wern-Jun / Hwang, Shaw-Hwa / Chen, Sin-Horng:
"The broad study of homograph disambiguity for Mandarin speech synthesis",
1389-1392.
Dutoit, Thierry / Pagel, Vincent / Pierret, N. / Bataille, F. / Vrecken, O. Van der:
"The MBROLA project: towards a set of high quality speech synthesizers free of use for non commercial purposes",
1393-1396.
Hashimoto, Makoto / Higuchi, Norio:
"Training data selection for voice conversion using speaker selection and vector field smoothing",
1397-1400.
Lee, Ki Seung / Youn, Dae Hee / Cha, Il Whan:
"A new voice transformation method based on both linear and nonlinear prediction analysis",
1401-1404.
Baudoin, G. / Stylianou, Yannis:
"On the transformation of the speech spectrum for voice conversion",
1405-1408.
Delogu, Cristina / Paoloni, Andrea / Ragazzini, Susanna / Ridolfi, Paola:
"Spectral analysis of synthetic speech and natural speech with noise over the telephone line",
1409-1412.
Zhu, Weizhong / Kasuya, Hideki:
"A new speech synthesis system based on the ARX speech production model",
1413-1416.
Campos, Geraldo Lino de / Gouvęa, Evandro Bacci:
"Speech synthesis using the CELP algorithm",
1417-1420.
Hwang, Shaw-Hwa / Chen, Sin-Horng / Wang, Yih-Ru:
"A Mandarin text-to-speech system",
1421-1424.
Edgington, Mike D. / Lowry, A.:
"Residual-based speech modification algorithms for text-to-speech synthesis",
1425-1428.
Heggtveit, Per Olav:
"A generalized LR parser for text-to-speech synthesis",
1429-1432.
Pollard, M. P. / Cheetham, B. M. G. / Goodyear, C. C. / Edgington, Mike D. / Lowry, A.:
"Enhanced shape-invariant pitch and time-scale modification for concatenative speech synthesis",
1433-1436.
Arai, Yasuhiko / Mochizuki, Ryo / Nishimura, Hirofumi / Honda, Takashi:
"An excitation synchronous pitch waveform extraction method and its application to the VCV-concatenation synthesis of Japanese spoken words",
1437-1440.
Wang, Ren-Hua / Liu, Qinfeng / Tang, Difei:
"A new Chinese text-to-speech system with high naturalness",
1441-1444.
Rinscheid, Ansgar:
"Voice conversion based on topological feature maps and time-variant filtering",
1445-1448.
Instructional Technology for Spoken Language
Yoram, Meron / Hirose, Keikichi:
"Language training system utilizing speech modification",
1449-1452.
Jamieson, D. G. / Yu, K.:
"Perception of English /r/ and /l/ speech contrasts by native Korean listeners with extensive English-language experience",
1453-1456.
Neumeyer, Leonardo / Franco, Horacio / Weintraub, Mitchel / Price, Patti:
"Automatic text-independent pronunciation scoring of foreign language student speech",
1457-1460.
Simoes, Antônio:
"Assessing the contribution of instructional technology in the teaching of pronunciation",
1461-1464.
Eskenazi, Maxine:
"Detection of foreign speakers' pronunciation errors for second language training - preliminary results",
1465-1468.
Mixdorff, Hansjörg:
"Foreign accent in intonation patterns - a contrastive study applying a quantitative model of the F0 contour",
1469-1472.
Markham, Duncan J. / Nagano-Madsen, Yasuko:
"Input modality effects in foreign accent",
1473-1476.
Multimodal Spoken Language Processing
Bernstein, Lynne E. / Benoît, Christian:
"For speech perception by humans or machines, three senses are better than one",
1477-1480.
Sekiyama, Kaoru / Tohkura, Yoh'ichi / Umeda, Michio:
"A few factors which affect the degree of incorporating lip-read information into speech perception",
1481-1484.
Vatikiotis-Bateson, E. / Munhall, K. G. / Kasahara, Y. / Garcia, F. / Yehia, H.:
"Characterizing audiovisual information during speech",
1485-1488.
Reed, Charlotte M.:
"The implications of the tadoma method of speechreading for spoken language processing",
1489-1492.
Campbell, Ruth:
"Seeing speech in space and time: psychological and neurological findings",
1493-1496.
Green, Kerry P.:
"Studies of the mcgurk effect: implications for theories of speech perception",
1652-1655.
Brooke, N. M.:
"Using the visual component in automatic speech recognition",
1656-1659.
Remez, Robert E.:
"Perceptual organization of speech in one and several modalities: common functions, common resources",
1660-1663.
Pisoni, David B. / Saldańa, Helena M. / Sheffert, Sonya M.:
"Multi-modal encoding of speech in memory: a first report",
1664-1667.
Prosody - Phonological/Phonetic Measures
Strom, Volker / Widera, Christina:
"What's in the "pure" prosody?",
1497-1500.
Swerts, Marc / Strangert, Eva / Heldner, Mattias:
"F0 declination in read-aloud and spontaneous speech",
1501-1504.
Kim, Yeon-jun / Oh, Yung-hwan:
"Prediction of prosodic phrase boundaries considering variable speaking rate",
1505-1508.
Yamashita, Yoichi / Mizoguchi, Riichiro:
"Prediction of F0 parameter of contextualized utterances in dialogue",
1509-1512.
Makarova, V. / Matsui, J.:
"The production and perception of potentially ambiguous intonation contours by speakers of Russian and Japanese",
1513-1516.
Eklund, Robert:
"What is invariant and what is optional in the realization of a FOCUSED word? a cross-dialectal study of Swedish sentences with moving focus",
1517-1520.
Phonetics and Perception
Shadle, Christine H. / Mair, Sheila J.:
"Quantifying spectral characteristics of fricatives",
1521-1524.
Warner, Natasha:
"Acoustic characteristics of ejectives in ingush",
1525-1528.
Son, Rob J. J. H. van / Pols, Louis C. W.:
"An acoustic profile of consonant reduction",
1529-1532.
Archambault, Daničle / Maneva, Blagovesta:
"Devoicing in post-vocalic canadian-French obstruants",
1533-1536.
Francis, Alexander L. / Nusbaum, Howard C.:
"Paying attention to speaking rate",
1537-1540.
Appelbaum, Irene:
"The lack of invariance problem and the goal of speech perception",
1541-1544.
Language Acquisition
Andruski, Jean E. / Kuhl, Patricia K.:
"The acoustic structure of vowels in mothers' speech to infants and adults",
1545-1548.
Clement, Chris J. / Koopmans-van Beinum, Florien J. / Pols, Louis C. W.:
"Acoustical characteristics of sound production of deaf and normally hearing infants",
1549-1552.
Kingston, John / Bartels, Christine / Benkí, José / Moore, Deanna / Rice, Jeremy / Thorburn, Rachel / Macmillan, Neil:
"Learning non-native vowel categories",
1553-1556.
Halle, P. A. / Deguchi, Toshisada / Tamekawa, Yuji / Boysson-Bardies, B. / Kiritani, Shigeru:
"Word recognition by Japanese infants",
1557-1560.
Jusczyk, Peter W.:
"Investigations of the word segmentation abilities of infants",
1561-1564.
Hayashi, Akiko / Tamekawa, Yuji / Deguchi, Toshisada / Kiritani, Shigeru:
"Developmental change in perception of clause boundaries by 6- and 10-month-old Japanese infants",
1565-1568.
Production and Prosody Posters
Alku, Paavo / Vilkman, Erkki:
"A frequency domain method for parametrization of the voice source",
1569-1572.
Marasek, Krzysztof:
"Glottal correlates of the word stress and the tense/lax opposition in German",
1573-1576.
Boyce, Suzanne / Espy-Wilson, Carol Y.:
"Coarticulatory stability in american English /r/",
1577-1580.
Masaki, Shinobu / Akahane-Yamada, Reiko / Tiede, Mark K. / Shimada, Yasuhiro / Fujimoto, Ichiro:
"An MRI-based analysis of the English /r/ and /l/ articulations",
1581-1584.
Kuijk, David van:
"Does lexical stress or metrical stress better predict word boundaries in Dutch?",
1585-1588.
Wrench, Alan A. / McIntosh, A. D. / Hardcastle, William J.:
"Optopalatograph (OPG): a new apparatus for speech production analysis",
1589-1592.
Carré, René:
"Prediction of vowel systems using a deductive approach",
1593-1596.
Mair, Sheila J. / Scully, Celia / Shadle, Christine H.:
"Distinctions between [t] and [tch] using electropalatography data",
1597-1600.
Hashi, Michiko / Kent, Raymond D. / Westbury, John R. / Lindstrom, Mary J.:
"Relating formants and articulation in intelligibility test words",
1601-1604.
Znagui, Imad / Yeou, Mohamed:
"The role of coarticulation in the perception of vowel quality in modern standard Arabic",
1605-1608.
Arnfield, Simon / Jones, Wilf:
"Updating the reading EPG",
1609-1611.
Ying, Goangshiuan S. / Jamieson, Leah H. / Chen, Ruxin / Mitchell, Carl D.:
"Lexical stress detection on stress-minimal word pairs",
1612-1615.
Wang, Jing:
"An acoustic study of the interaction between stressed and unstressed syllables in spoken Mandarin",
1616-1619.
Minematsu, Nobuaki / Nakagawa, Seiichi:
"Automatic detection of accent nuclei at the head of words for speech recognition",
1620-1623.
Chou, Fu-chiang / Tseng, Chiu-yu / Lee, Lin-shan:
"Automatic generation of prosodic structure for high quality Mandarin speech synthesis",
1624-1627.
Hamagami, Tomoki / Magata, Ken-ichi / Komura, Mitsuo:
"A study on Japanese prosodic pattern and its modeling in restricted speech",
1628-1631.
Hoskins, Steve:
"A phonetic study of focus in intransitive verb sentences",
1632-1635.
Rapp, Stefan:
"Goethe for prosody",
1636-1639.
Straub, K. A.:
"Prosodic cues in syntactically ambiguous strings; an interactive speech planning mechanism",
1640-1643.
Ni, Jinfu / Wang, Ren-Hua / Xia, Deyu:
"A functional model for generation of the local components of F0 contours in Chinese",
1644-1647.
Fellbaum, Marie:
"The acquisition of voiceless stops in the interlanguage of second language learners of English and Spanish",
1648-1651.
User-Machine Interfaces
Mellor, B. A. / Baber, C. / Tunley, C.:
"Evaluating automatic speech recognition as a component of a multi-input device human-computer interface",
1668-1671.
Life, A. / Salter, I. / Temem, J. N. / Bernard, F. / Rosset, S. / Bennacef, S. K. / Lamel, Lori:
"Data collection for the MASK kiosk: WOz vs prototype system",
1672-1675.
Karaorman, M. / Applebaum, T. H. / Itoh, T. / Endo, M. / Ohno, Y. / Hoshimi, M. / Kamai, T. / Matsui, K. / Hata, K. / Pearson, S. / Junqua, Jean-Claude:
"An experimental Japanese/English interpreting video phone system",
1676-1679.
Basson, Sara / Springer, Stephen / Fong, Cynthia / Leung, Hong / Man, Ed / Olson, Michele / Pitrelli, John / Singh, Ranvir / Wong, Suk:
"User participation and compliance in speech automated telecommunications applications",
1680-1683.
Bayer, Samuel:
"Embedding speech in web interfaces",
1684-1687.
Isobe, Toshihiro / Morishima, Masatoshi / Yoshitani, Fuminori / Koizumi, Nobuo / Murakami, Ken'ya:
"Voice-activated home banking system and its field trial",
1688-1691.
TTS Systems and Rules
Lee, Sangho / Oh, Yung-Hwan:
"A text analyzer for Korean text-to-speech systems",
1692-1695.
Karn, Helen E.:
"Design and evaluation of a phonological phrase parser for Spanish text-to-speech",
1696-1699.
Andersen, Ove / Kuhn, Roland / Lazaridčs, Ariane / Dalsgaard, Paul / Haas, Jürgen / Nöth, Elmar:
"Comparison of two tree-structured approaches for grapheme-to-phoneme conversion",
1700-1703.
Adamson, M. J. / Damper, Robert I.:
"A recurrent network that learns to pronounce English text",
1704-1707.
Albano, Eleonora Cavalcante / Moreira, Agnaldo Antonio:
"Archisegment-based letter-to-phone conversion for concatenative speech synthesis in Portuguese",
1708-1711.
Yoshida, Yuki / Nakajima, Shin'ya / Hakoda, Kazuo / Hirokawa, Tomohisa:
"A new method of generating speech synthesis units based on phonological knowledge and clustering technique",
1712-1715.
Prosody and Labeling
Grice, Martine / Reyelt, Matthias / Benzmüller, Ralf / Mayer, Jörg / Batliner, Anton:
"Consistency in transcription and labelling of German intonation with GToBI",
1716-1719.
Batliner, Anton / Kompe, Ralf / Kiessling, Andreas / Niemann, Heinrich / Nöth, Elmar:
"Syntactic-prosodic labeling of large spontaneous speech data-bases",
1720-1723.
Koopmans-van Beinum, Florien J. / Donzel, Monique E. van:
"Relationship between discourse structure and dynamic speech rate",
1724-1727.
Ward, Nigel:
"Using prosodic clues to decide when to produce back-channel utterances",
1728-1731.
Mast, Marion / Kompe, Ralf / Harbeck, Stefan / Kiessling, Andreas / Niemann, Heinrich / Nöth, Elmar / Schukat-Talamazzini, Ernst G. / Warnke, Volker:
"Dialog act classification with the help of prosody",
1732-1735.
Kuijk, David van / Heuvel, Henk van den / Boves, Louis:
"Using lexical stress in continuous speech recognition for dutch",
1736-1739.
Speaker/Language Identification and Verification
Kumpf, Karsten / King, Robin W.:
"Automatic accent classification of foreign accented australian English speech",
1740-1743.
Korkmazskiy, F. / Juang, Biing-Hwang:
"Discriminative adaptation for speaker verification",
1744-1747.
Stockmal, V. / Muljani, D. / Bond, Z. S.:
"Perceptual features of unknown foreign languages as revealed by multi-dimensional scaling",
1748-1751.
Yu, Kin / Mason, John S.:
"On-line incremental adaptation for speaker verification using maximum likelihood estimates of CDHMM parameters",
1752-1755.
Genoud, Dominique / Bimbot, Frédéric / Gravier, Guillaume / Chollet, Gérard:
"Combining methods to improve speaker verification decision",
1756-1759.
Martín del Alamo, Cesar / Alvarez, J. / Torre, C. de la / Poyatos, F. J. / Hernández, Lúis:
"Incremental speaker adaptation with minimum error discriminative training for speaker identification",
1760-1763.
Markov, Konstantin P. / Nakagawa, Seiichi:
"Frame level likelihood normalization for text-independent speaker identification using Gaussian mixture models",
1764-1767.
Thymé-Gobbel, Ann E. / Hutchins, Sandra E.:
"On using prosodic cues in automatic language identification",
1768-1771.
Kitamura, Tadashi / Takei, Shinsai:
"Speaker recognition model using two-dimensional mel-cepstrum and predictive neural network",
1772-1775.
Kwan, Hingkeung / Hirose, Keikichi:
"Unknown language rejection in language identification system",
1776-1779.
Hieronymus, James L. / Kadambe, Shubha:
"Spoken language identification using large vocabulary speech recognition",
1780-1783.
Teixeira, Carlos / Trancoso, Isabel M. / Serralheiro, António:
"Accent identification",
1784-1787.
Vuuren, Sarel van:
"Comparison of text-independent speaker recognition methods on telephone speech with acoustic mismatch",
1788-1791.
Yang, Xue / Millar, J. Bruce / Macleod, Iain:
"On the sources of inter- and intra-speaker variability in the acoustic dynamics of speech",
1792-1795.
Berkling, Kay M. / Barnard, Etienne:
"Language identification with inaccurate string matching",
1796-1799.
Carey, M. J. / Parris, E. S. / Lloyd-Thomas, H. / Bennett, S. J.:
"Robust prosodic features for speaker identification",
1800-1803.
Monte, E. / Hernando, J. / Miró, X. / Adolf, A.:
"Text independent speaker identification on noisy environments by means of self organizing maps",
1804-1807.
Dalsgaard, Paul / Andersen, Ove / Hesselager, Hanne / Petek, Bojan:
"Language identification using language-dependent phonemes and language-independent speech units",
1808-1811.
Emotion in Recognition and Synthesis
Scherer, Klaus R.:
"Adding the affective dimension: a new look in speech analysis and synthesis",
1811.
Ohala, John J.:
"Ethological theory and the expression of emotion in the voice",
1812-1815.
Murray, Iain R. / Arnott, John L.:
"Synthesizing emotions in speech: is it time to get excited?",
1816-1819.
Dellaert, Frank / Polzin, Thomas / Waibel, Alex:
"Recognizing emotion in speech",
1970-1973.
Heuft, Barbara / Portele, Thomas / Rauth, Monika:
"Emotions in time domain synthesis",
1974-1977.
Arnfield, Simon:
"Word class driven synthesis of prosodic annotations",
1978-1980.
Banbrook, M. / McLaughlin, S.:
"Dynamical modelling of vowel sounds as a synthesis tool",
1981-1984.
Johnstone, Tom:
"Emotional speech elicited using computer games",
1985-1988.
Cowie, Roddy / Douglas-Cowie, Ellen:
"Automatic statistical analysis of the signal and prosodic signs of emotion in speech",
1989-1992.
Stochastic Techniques in Robust Speech Recognition
Lee, Chin-Hui / Juang, Biing-Hwang / Chou, Wu / Molina-Perez, J. J.:
"A study on task-independent subword selection and modeling for speech recognition",
1820-1823.
Rahim, Mazin G. / Lee, Chin-Hui:
"Simultaneous ANN feature and HMM recognizer design using string-based minimum classification error (MCE) training",
1824-1827.
Gupta, Sunil K. / Soong, Frank K. / Haimi-Cohen, Raziel:
"Quantizing mixture-weights in a tied-mixture HMM",
1828-1831.
Gales, M. J. F. / Pye, D. / Woodland, P. C.:
"Variance compensation within the MLLR framework for robust speech recognition and speaker adaptation",
1832-1835.
Surendran, A. C. / Lee, Chin-Hui / Rahim, Mazin G.:
"Maximum-likelihood stochastic matching approach to non-linear equalization for robust speech recognition",
1836-1839.
Chien, Jen-Tzung / Wang, Hsiao-Chuan / Lee, Lee-Min:
"Estimation of channel bias for telephone speech recognition",
1840-1843.
Prosodic Synthesis in Text to Speech
Johnson, M. E.:
"Synthesis of English intonation using explicit models of reading and spontaneous speech",
1844-1847.
Horne, Merle / Filipsson, Marcus:
"Implementation and evaluation of a model for synthesis of Swedish intonation",
1848-1851.
Katae, Nobuyuki / Kimura, Shinta:
"Natural prosody generation for domain specific text-to-speech systems",
1852-1855.
Tatham, Mark / Lewis, Eric:
"Improving text-to-speech synthesis",
1856-1859.
Bou-Ghazale, Sahar E. / Hansen, John H. L.:
"Synthesis of stressed speech from isolated neutral speech using HMM-based models",
1860-1863.
Dobnikar, Ales:
"Modeling segment intonation for Slovene TTS system",
1864-1867.
Dialogue Events
Shriberg, Elizabeth / Stolcke, Andreas:
"Word predictability after hesitations: a corpus-based study",
1868-1871.
Yang, Li-chiung:
"Interruptions and intonation",
1872-1875.
Lickley, Robin J. / Bard, Ellen Gurman:
"On not recognizing disfluencies in dialogue",
1876-1879.
Garner, Phil / Browning, Sue / Moore, Roger / Russell, Martin:
"A theory of word frequencies and its application to dialogue move recognition",
1880-1883.
Traum, David R. / Heeman, Peter A.:
"Utterance units and grounding in spoken dialogue",
1884-1887.
Novick, David G. / Hansen, Brian / Ward, Karen:
"Coordinating turn-taking with gaze",
1888-1891.
Databases and Tools
Roach, Peter / Arnfield, Simon / Barry, William J. / Baltova, J. / Boldea, Marian / Fourcin, Adrian / Gonet, W. / Gubrynowicz, Ryszard / Hallum, E. / Lamel, Lori / Marasek, Krzysztof / Marchal, Alain / Meister, E. / Vicsi, Klára:
"BABEL: an eastern european multi-language database",
1892-1893.
Wang, Ren-Hua / Xia, Deyu / Ni, Jinfu / Liu, Bicheng:
"USTC95---a putonghua corpus",
1894-1897.
Hurley, Edward / Polifroni, Joseph / Glass, James:
"Telephone data collection using the world wide web",
1898-1901.
Falcone, M. / Gallo, A.:
"The "SIVA" speech database for speaker verification: description and evaluation",
1902-1905.
Draxler, Christoph:
"A multi-level description of date expressions in German telephone speech",
1906-1909.
Halstead, Robert H. Jr. / Serridge, Ben / Thong, Jean-Manuel Van / Goldenthal, William:
"Viterbi search visualization using vista: a generic performance visualization tool",
1910-1913.
Altosaar, Toomas / Karjalainen, Matti / Vainio, Martti:
"A multilingual phonetic representation and analysis system for different speech databases",
1914-1917.
Langmann, D. / Haeb-Umbach, Reinhold / Boves, Louis / Os, E. den:
"FRESCO: the French telephone speech data collection - part of the european Speechdat(m) project",
1918-1921.
Müller, Johannes / Stahl, Holger / Lang, Manfred:
"Predicting the out-of-vocabulary rate and the required vocabulary size for speech processing applications",
1922-1925.
Parlangeau, Nathalie / Marchal, Alain:
"AMULET: automatic MUltisensor speech labelling and event tracking: study of the spatio-temporal correlations in voiceless plosive production",
1926-1929.
Hahn, Minsoo / Kim, Sanghun / Lee, Jung-Chul / Lee, Yong-Ju:
"Constructing multi-level speech database for spontaneous speech processing",
1930-1933.
Boldea, Marian / Doroga, Alin / Dumitrescu, Tiberiu / Pescaru, Maria:
"Preliminaries to a romanian speech database",
1934-1937.
Kohler, Klaus J.:
"Labelled data bank of spoken standard German the kiel corpus of read/spontaneous speech",
1938-1941.
Hetherington, Lee / McCandless, Michael:
"SAPPHIRE: an extensible speech analysis and recognition tool based on tcl/tk",
1942-1945.
Kiyama, Jiro / Itoh, Yoshiaki / Oka, Ryuichi:
"Automatic detection of topic boundaries and keywords in arbitrary speech using incremental reference interval-free continuous DP",
1946-1949.
Bai, Bo-Ren / Chien, Lee-Feng / Lee, Lin-Shan:
"Very-large-vocabulary Mandarin voice message file retrieval using speech queries",
1950-1953.
Melin, H.:
"Gandalf - a Swedish telephone speaker verification database",
1954-1957.
Bard, Ellen Gurman / Sotillo, C. / Anderson, A. H. / Taylor, M. M.:
"The DCIEM map task corpus: spontaneous dialogue under sleep deprivation and drug treatment",
1958-1961.
Menéndez-Pidal, Xavier / Polikoff, James B. / Peters, Shirley M. / Leonzio, Jennie E. / Bunnell, H. T.:
"The nemours database of dysarthric speech",
1962-1965.
Hennebert, Jean / Delacrétaz, Dijana Petrovska:
"POST: parallel object-oriented speech toolkit",
1966-1969.
Robust Speech Processing
Zhang, Xiaoyu / Mammone, Richard J.:
"Channel and noise normalization using affine transformed cepstrum",
1993-1996.
Claes, Tom / Xie, Fei / Compernolle, Dirk van:
"Spectral estimation and normalisation for robust speech recognition",
1997-2000.
Chou, Wu / Seshadri, Nambi / Rahim, Mazin G.:
"Trellis encoded vector quantization for robust speech recognition",
2001-2004.
Mak, Brian / Barnard, Etienne:
"Phone clustering using the bhattacharyya distance",
2005-2008.
Wakao, Atsushi / Takeda, Kazuya / Itakura, Fumitada:
"Variability of lombard effects under different noise conditions",
2009-2012.
Chi, Sang-mun / Oh, Yung-Hwan:
"Lombard effect compensation and noise suppression for noisy Lombard speech recognition",
2013-2016.
Dialects and Speaking Styles
Huggins, A. W. F. / Patel, Yogen:
"The use of shibboleth words for automatically classifying speakers by dialect",
2017-2020.
Kudo, Ikuo / Nakama, Takao / Watanabe, Tomoko / Kameyama, Reiko:
"Data collection of Japanese dialects and its influence into speech recognition",
2021-2024.
Miller, David R. / Trischitta, James:
"Statistical dialect classification based on mean phonetic features",
2025-2027.
Kvale, Knut:
"Norwegian numerals: a challenge to automatic speech recognition",
2028-2031.
Torre, C. de la / Caminero-Gil, J. / Alvarez, J. / Martín del Alamo, Cesar / Hernández-Gómez, Lúis:
"Evaluation of the telefnica i+d natural numbers recognizer over different dialects of Spanish from Spain and America",
2032-2035.
Production and Perception of Prosody
Cummins, Fred / Port, Robert F.:
"Rhythmic constraints on English stress timing",
2036-2039.
Vogel, Irene / Hoskins, Steve:
"On the interaction of clash, focus and phonological phrasing",
2040-2043.
Fant, Gunnar / Kruckenberg, Anita:
"On the quantal nature of speech timing",
2044-2047.
House, David:
"Differential perception of tonal contours through the syllable",
2048-2051.
Vainio, Martti / Altosaar, Toomas:
"Pitch, loudness, and segmental duration correlates: towards a model for the phonetic aspects of finnish prosody",
2052-2055.
Minematsu, Nobuaki / Nakagawa, Seiichi / Hirose, Keikichi:
"Prosodic manipulation system of speech material for perceptual experiments",
2056-2059.
Topics in ASR and Search
Ueberla, J. P. / Gransden, I. R.:
"Clustered language models with context-equivalent states",
2060-2062.
Yonezawa, Yuji / Akagi, Masato:
"Modeling of contextual effects and its application to word spotting",
2063-2066.
Junkawitsch, J. / Neubauer, L. / Höge, Harald / Ruske, Günther:
"A new keyword spotting algorithm with pre-calculated optimal thresholds",
2067-2070.
Lacouture, Roxane / Normandin, Yves:
"Detection of ambiguous portions of signal corresponding to OOV words or misrecognized portions of input",
2071-2074.
Brugnara, Fabio / Federico, Marcello:
"Techniques for approximating a trigram language model",
2075-2078.
Takagi, Keizaburo / Shinoda, Koichi / Hattori, Hiroaki / Watanabe, Takao:
"Unsupervised and incremental speaker adaptation under adverse environmental conditions",
2079-2082.
hamme, Hugo Van / Aelten, Filip Van:
"An adaptive-beam pruning technique for continuous speech recognition",
2083-2086.
Avendano, Carlos / Vuuren, Sarel van / Hermansky, Hynek:
"Data based filter design for RASTA-like channel normalization in ASR",
2087-2090.
Ortmanns, S. / Ney, Hermann / Seide, Frank / Lindam, I.:
"A comparison of time conditioned and word conditioned search techniques for large vocabulary speech recognition",
2091-2094.
Ortmanns, S. / Ney, Hermann / Eiden, A.:
"Language-model look-ahead for large vocabulary speech recognition",
2095-2098.
Husson, Jean-Luc / Laprie, Yves:
"A new search algorithm in segmentation lattices of speech signals",
2099-2102.
Yamada, Tomokazu / Sagayama, Shigeki:
"LR-parser-driven viterbi search with hypotheses merging mechanism using context-dependent phone models",
2103-2106.
Nouza, Jan:
"Discrete-utterance recognition with a fast match based on total data reduction",
2107-2110.
Caminero-Gil, J. / Torre, C. de la / Villarrubia, L. / Martín del Alamo, Cesar / Hernández, Lúis:
"On-line garbage modeling with discriminant analysis for utterance verification",
2111-2114.
Placeway, Paul / Lafferty, John:
"Cheating with imperfect transcripts",
2115-2118.
Iwahashi, Naoto:
"Novel training method for classifiers used in speaker adaptation",
2119-2122.
Minamino, Katsuki:
"Large vocabulary word recognition based on a graph-structured dictionary",
2123-2126.
Tran, Bach-Hiep / Seide, Frank / Steinbiss, Volker:
"A word graph based n-best search in continuous speech recognition",
2127-2130.
Goblirsch, David M.:
"Viterbi beam search with layered bigrams",
2131-2134.
Burhke, Eric / Chou, Wu / Zhou, Qiru:
"A wave decoder for continuous speech recognition",
2135-2138.
Thelen, Eric:
"Long term on-line speaker adaptation for large vocabulary dictation",
2139-2142.
Sagerer, Gerhard / Rautenstrauch, Heike / Fink, Gernot A. / Hildebrandt, Bernd / Jusek, A. / Kummert, Franz:
"Incremental generation of word graphs",
2143-2146.
Illina, Irina / Gong, Yifan:
"Improvement in n-best search for continuous speech recognition",
2147-2150.
Bonafonte, Antonio / Marińo, José B. / Nogueiras, Albino:
"Sethos: the UPC speech understanding system",
2151-2154.
Laface, Pietro / Fissore, Luciano / Maro, A. / Ravera, Franco:
"Segmental search for continuous speech recognition",
2155-2158.
Multimodal Dialogue/HCI
Breen, A. P. / Bowers, E. / Welsh, W.:
"An investigation into the generation of mouth shapes for a talking head",
2159-2162.
Goff, Bertrand Le / Benoît, Christian:
"A text-to-audiovisual-speech synthesizer for French",
2163-2166.
Iwano, Yuri / Kageyama, Shioya / Morikawa, Emi / Nakazato, Shu / Shirai, Katsuhiko:
"Analysis of head movements and its role in spoken dialogue",
2167-2170.
Hayamizu, Satoru / Hasegawa, Osamu / Itou, Katunobu / Sakaue, Katuhiko / Tanaka, Kazuyo / Nagaya, Shigeki / Nakazawa, Masayuki / Endoh, T. / Togawa, Fumio / Sakamoto, Kenji / Yamamoto, Kazuhiko:
"RWC multimodal database for interactions by integration of spoken language and visual information",
2171-2174.
Cavé, Christian / Guaďtella, Isabelle / Bertrand, Roxane / Santi, Serge / Harlay, Françoise / Espesser, Robert:
"About the relationship between eyebrow movements and F0 variations",
2175-2178.
Fais, Laurel / Loken-Kim, Kyung-ho / Morimoto, Tsuyoshi:
"How many words is a picture really worth?",
2179-2182.
Lagana, A. / Lavagetto, F. / Storace, A.:
"Visual synthesis of source acoustic speech through kohonen neural networks",
2183-2186.
Saldańa, Helena M. / Pisoni, David B. / Fellowes, Jennifer M. / Remez, Robert E.:
"Audio-visual speech perception without speech cues",
2187-2190.
Multilingual Speech Processing
Barnett, Jim / Corrada, A. / Gao, G. / Gillick, Larry / Ito, Yoshiko / Lowe, S. / Manganaro, L. / Peskin, Barbara:
"Multilingual speech recognition at dragon systems",
2191-2194.
Köhler, Joachim:
"Multi-lingual phoneme recognition exploiting acoustic-phonetic similarities of sounds",
2195-2198.
Nakamura, Atsushi / Matsunaga, Shoichi / Shimizu, Tohru / Tonomura, Masahiro / Sagisaka, Yoshinori:
"Japanese speech databases for robust speech recognition",
2199-2202.
Lamel, Lori / Adda-Decker, Maqrtine / Gauvain, Jean-Luc / Adda, Gilles:
"Spoken language processing in a multilingual context",
2203-2206.
Zue, Victor / Seneff, Stephanie / Polifroni, Joseph / Meng, Helen / Glass, James:
"Multilingual human-computer interactions: from information access to language learning",
2207-2210.
Ackermann, U. / Angelini, B. / Brugnara, Fabio / Federico, Marcello / Giuliani, D. / Gretter, R. / Lazzari, G. / Niemann, H.:
"Speedata: multilingual spoken data entry",
2211-2214.
Alshawi, Hiyan:
"Head automata for speech translation",
2360-2363.
Wang, Ye-Yi / Lafferty, John / Waibel, Alex:
"Word clustering with parallel spoken language corpora",
2364-2367.
Yang, Jae-Woo / Lee, Youngjik:
"Toward translating Korean speech into other languages",
2368-2370.
Bub, Thomas / Schwinn, Johannes:
"VERBMOBIL: the evolution of a complex large speech-to-speech translation system",
2371-2374.
Lavie, Alon / Waibel, Alex / Levin, Lori / Gates, Donna / Gavaldŕ, Marsal / Zeppenfeld, Torsten / Zhan, Puming / Glickman, Oren:
"Translation of conversational speech with JANUS-II",
2375-2378.
Acoustics in Synthesis
Edmondson, William H. / Iles, Jon P. / Iskra, Dorota J.:
"Pseudo-articulatory representations in speech synthesis and recognition",
2215-2218.
Williams, David R.:
"Synthesis of initial (/s/-) stop-liquid clusters using HLsyn",
2219-2222.
Shih, Chilin:
"Synthesis of trill",
2223-2226.
Lo, W. K. / Ching, P. C.:
"Phone-based speech synthesis with neural network and articulatory control",
2227-2230.
Martland, P. / Whiteside, Sandra P. / Beet, Steve W. / Baghai-Ravary, L.:
"Analysis of ten vowel sounds across gender and regional/cultural accent",
2231-2234.
Abe, Masanobu:
"Speech morphing by gradually changing spectrum parameter and fundamental frequency",
2235-2238.
Pitch and Rate
Geoffrois, Edouard:
"The multi-lag-window method for robust extended-range F0 determination",
2239-2242.
Barner, Kenneth E.:
"Nonlinear estimation of DEGG signals with applications to speech pitch detection",
2243-2246.
Maidment, John A. / Garcia-Lecumberri, M. Luisa:
"Pitch analysis methods for cross-speaker comparison",
2247-2249.
Beet, Steve W. / Baghai-Ravary, L.:
"Continuous adaptation of linear models with impulsive excitation",
2250-2253.
Ohno, Sumio / Fukumiya, Masamichi / Fujisaki, Hiroya:
"Quantitative analysis of the local speech rate and its application to speech synthesis",
2254-2257.
Verhasselt, Jan P. / Martens, Jean-Pierre:
"A fast and reliable rate of speech detector",
2258-2261.
General ASR Posters
Zhan, Puming / Ries, Klaus / Gavaldŕ, Marsal / Gates, Donna / Lavie, Alon / Waibel, Alex:
"JANUS-II: towards spontaneous Spanish speech recognition",
2285-2288.
Demuynck, Kris / Duchateau, Jacques / Compernolle, Dirk van:
"Reduced semi-continuous models for large vocabulary continuous speech recognition in Dutch",
2289-2292.
Constantinescu, Andrei / Bornet, Olivier / Caloz, Gilles / Chollet, Gérard:
"Validating different flexible vocabulary approaches on the Swiss French Polyphone and Polyvar databases",
2293-2296.
Yoma, Nestor Becérra / McInnes, Fergus R. / Jack, Mervyn A.:
"Use of a reliability coefficient in noise cancelling by neural net and weighted matching algorithms",
2297-2300.
Ozeki, Kazuhiko:
"Likelihood normalization using an ergodic HMM for continuous speech recognition",
2301-2304.
Candille, Laurence / Méloni, Henri:
"Dynamic control of a production model",
2305-2308.
Hattori, Hiroaki / Yamada, Eiko:
"Speech recognition using sub-word units dependent on phonetic contexts of both training and recognition vocabularies",
2309-2312.
Jacob, Bruno / Senac, Christine:
"Hidden Markov models merging acoustic and articulatory information to automatic speech recognition",
2313-2315.
Blomberg, Mats / Elenius, Kjell:
"Creation of unseen triphones from diphones and monophones using a speech production approach",
2316-2319.
Xu, Bo / Ma, Bing / Zhang, Shuwu / Qu, Fei / Huang, Taiyi:
"Speaker-independent dictation of Chinese speech with 32k vocabulary",
2320-2323.
Humphries, J. J. / Woodland, P. C. / Pearce, D.:
"Using accent-specific pronunciation modelling for robust speech recognition",
2324-2327.
Sloboda, Tilo / Waibel, Alex:
"Dictionary learning for spontaneous speech recognition",
2328-2331.
Veth, Johan de / Boves, Louis:
"Comparison of channel normalisation techniques for automatic speech recognition over the phone",
2332-2335.
Leandro, Manuel A. / Pardo, José M.:
"Anchor point detection for continuous speech recognition in Spanish: the spotting of phonetic events",
2336-2339.
Raj, Bhiksha / Gouvęa, Evandro Bacci / Moreno, Pedro J. / Stern, Richard M.:
"Cepstral compensation by polynomial approximation for environment-independent speech recognition",
2340-2343.
Lilly, B. T. / Paliwal, Kuldip K.:
"Effect of speech coders on speech recognition performance",
2344-2347.
Janer, Léonard / Martí, Josep / Nadeu, Climent / Lleida-Solano, Eduardo:
"Wavelet transforms for non-uniform speech recogntion systems",
2348-2351.
Usagawa, Tsuyoshi / Bodden, Markus / Rateitschek, Klaus:
"A binaural model as a front-end for isolated word recognition",
2352-2355.
Okuno, Hiroshi G. / Nakatani, Tomohiro / Kawabata, Takeshi:
"A new speech enhancement: speech stream segregation",
2356-2359.
Data-based Synthesis
Slater, Andrew / Coleman, John:
"Non-segmental analysis and synthesis based on a speech database",
2379-2382.
Benzmüller, Ralf / Barry, William J.:
"Microsegment synthesis - economic principles in a low-cost solution",
2383-2386.
Huang, X. D. / Acero, Alex / Adcock, J. / Hon, H. W. / Goldsmith, J. / Liu, J. / Plumpe, Mike:
"Whistler: a trainable text-to-speech system",
2387-2390.
Portele, Thomas / Stöber, Karl-Heinz / Meyer, Horst / Hess, Wolfgang:
"Generation of multiple synthesis inventories by a bootstrapping procedure",
2391-2394.
Möbius, Bernd / Santen, Jan P. H. van:
"Modeling segmental duration in German text-to-speech synthesis",
2395-2398.
Campbell, Nick:
"Autolabelling Japanese ToBI",
2399-2402.
Speaker Identification and Verification
Parthasarathy, S. / Rosenberg, Aaron E.:
"General phrase speaker verification using sub-word background models and likelihood-ratio scoring",
2403-2406.
Murakami, J. / Sugiyama, M. / Watanabe, H.:
"Unknown-multiple signal source clustering problem using ergodic HMM and applied to speaker classification",
2407-2410.
Floch, J.-L. Le / Montacié, C. / Caraty, M.-J.:
"GMM and ARVM cooperation and competition for text-independent speaker recognition on telephone speech",
2411-2414.
Lin, Qiguang / Jan, Ea-Ee / Che, ChiWei / Yuk, Dong-Suk / Flanagan, James L.:
"Selective use of the speech spectrum and a VQGMM method for speaker identification",
2415-2418.
Newman, Michael / Gillick, Larry / Ito, Yoshiko / McAllaster, Don / Peskin, Barbara:
"Speaker verification through large vocabulary continuous speech recognition",
2419-2422.
Paoloni, Andrea / Ragazzini, Susanna / Ravaioli, G.:
"Predictive neural networks in text independent speaker verification: an evaluation on the SIVA database",
2423-2426.
Acoustic Phonetics
Shrotriya, Nisheeth / Verma, Rajesh / Gupta, Sunil K. / Agrawal, S. S.:
"Durational characterstics of hindi consonant clusters",
2427-2430.
Tan, Beng T. / Fu, Minyue / Spray, Andrew / Dermody, Phillip:
"The use of wavelet transforms in phoneme recognition",
2431-2434.
Kuwabara, Hisao:
"Acoustic properties of phonemes in continuous speech for different speaking rate",
2435-2438.
Fujisaki, Hiroya / Ohno, Sumio:
"Prosodic parameterization of spoken Japanese based on a model of the generation process of F0 contours",
2439-2442.
Maghbouleh, Arman:
"A logistic regression model for detecting prominences",
2443-2445.
Pfister, Beat:
"High-quality prosodic modification of speech signals",
2446-2449.
Perception of Vowels and Consonants
Zhang, Jialu:
"On the syllable structures of Chinese relating to speech recognition",
2450-2453.
Otake, Takashi / Yoneyama, Kiyoko:
"Can a moraic nasal occur word-initially in Japanese?",
2454-2457.
Strange, Winifred / Akahane-Yamada, Reiko / Fitzgerald, B. H. / Kubo, R.:
"Perceptual assimilation of american English vowels by Japanese listeners",
2458-2461.
Strange, Winifred / Bohn, Ocke-Schwen / Trent, S. A. / McNair, M. C. / Bielec, K. C.:
"Context and speaker effects in the perceptual assimilation of German vowels by american listeners",
2462-2465.
Zahid, Mohamed:
"Examination of a perceptual non-native speech contrast: pharyngealized/non-pharyngealized discrimination by French-speaking adults",
2466-2469.
Smits, Roel:
"Context-dependent relevance of burst and transitions for perceived place in stops: it's in production, not perception",
2470-2473.
Baba, Ryoji / Omuro, Kaori / Miyazono, Hiromitsu / Usagawa, Tsuyoshi / Higuchi, Masahiko:
"The perception of morae in long vowels comparison among Japanese, Korean and English speakers",
2474-2477.
Lickley, Robin J.:
"Juncture cues to disfluency",
2478-2481.
Sawusch, James R.:
"Effects of duration and formant movement on vowel perception",
2482-2485.
Deshmukh, N. / Duncan, R. J. / Ganapathiraju, A. / Picone, J.:
"Benchmarking human performance for continuous speech recognition",
2486-2489.
Arai, Takayuki / Pavel, Misha / Hermansky, Hynek / Avendano, Carlos:
"Intelligibility of speech with filtered time trajectories of spectral envelopes",
2490-2493.
Whalen, Douglas H. / Sheffert, Sonya M.:
"Perceptual use of vowel and speaker information in breath sounds",
2494-2497.
Mousty, Philippe / Radeau, Monique / Peereman, Ronald / Bertelson, Paul:
"The role of neighborhood relative frequency in spoken word recognition",
2498-2501.
McQueen, James M. / Pitt, Mark A.:
"Transitional probability and phoneme monitoring",
2502-2505.
Bonneau, Anne:
"Identification of vowel features from French stop bursts",
2506-2509.
Bond, Z. S. / Moore, Thomas J. / Gable, Beverley:
"Listening in a second language",
2510-2513.
Burnham, Denis / Francis, Elizabeth / Webster, Di / Luksaneeyanawin, Sudaporn / Attapaiboon, Chayada / Lacerda, Francisco / Keller, Peter:
"Perception of lexical tone across languages: evidence for a linguistic mode of processing",
2514-2517.
Magnuson, James S. / Akahane-Yamada, Reiko:
"Acoustic correlates to the effects of talker variability on the perception of English /r/ and /l/ by Japanese listeners",
2518-2521.