Table of Contents and Access to Abstracts
Plenary
Furui, Sadaoki:
"Recent advances in speech recognition",
3-12.
Fallside, Frank:
"On the acquisition of speech by machines, ASM",
13-14.
Continuous Speech Recognition
Ramesh, P. / Wilpon, Jay G. / McGee, M. A. / Roe, David B. / Lee, Chin-Hui / Rabiner, Lawrence R.:
"Speaker independent recognition of spontaneously spoken connected digits",
17-20.
Gopalakrishnan, P. S. / Nahamoo, David:
"Immediate recognition of embedded command words",
21-24.
Wilcox, Lynn D. / Bush, Marcia A.:
"HMM-based wordspotting for voice editing and indexing",
25-28.
Baker, Janet M.:
"Large vocabulary speaker-adaptive continuous speech recognition research overview at Dragon Systems",
29-32.
Sgardoni, Victoria / Gaganelis, Dimitrios A. / Frangoulis, Eleftherios D.:
"Continuous density HMM context dependent phones for speech recognition over the telephone",
33-36.
Segmental Speech Synthesis
Shirai, Katsuhiko / Hashimoto, K. / Kobayashi, T.:
"Text-to-speech synthesizer using superposition of sinusoidal waves generated by synchronized oscillators",
39-42.
Guerti, M. / Bailly, G.:
"Synthesis-by-rule using COMPOST: modelling resonance trajectories",
43-46.
Ishikawa, Yasushi / Nakajima, Kunio:
"Neural network based spectral interpolation method for speech synthesis by rule",
47-50.
Garnier-Rizet, Marine:
"A rule-based segmental synthesis module for French",
51-54.
Human Factors
Fraser, Norman M. / Gilbert, G. Nigel:
"Effects of system voice quality on user utterances in speech dialogue systems",
57-60.
Day, P. / Grünupp, A. / Muthig, K.-P.:
"A human factors study of speech-to-text technology: consequences of discrete speech",
61-64.
Murray, Iain R. / Arnott, John L. / Newell, Alan F.:
"A comparison of document composition using a listening typewriter and conventional office systems",
65-68.
Vossen, Paulus H.:
"Evaluating speech input and output in a CAD-system using the hidden-operator method",
69-72.
Zajicek, M. / Hewitt, J.:
"Mixed mode input for a standard wordprocessor: investigating links between input mode, speech and keyboard, and specific task areas",
73-76.
Robust Isolated Word Recognition
Lockwood, P. / Boudy, J.:
"Experiments with a non-linear spectral subtractor (NSS), hidden Markov models and the projection, for robust speech recognition in cars",
79-82.
Lockwood, P. / Baillargeat, C. / Gillot, J. M. / Boudy, J. / Faucon, G.:
"Noise reduction for speech enhancement in cars: non-linear spectral subtraction / Kalman filtering",
83-86.
Fellbaum, Klaus / Becker, Dieter:
"Isolated word recognition with integrated noise reduction",
87-90.
Hernando, Javier / Nadeu, Climent:
"A comparative study of parameters and distances for noisy speech recognition",
91-94.
Neural Nets: Phonetic Features, Phoneme Recognition, and Time Alignment
Laaksonen, Jorma T.:
"A new reliability-based phoneme segmentation method for the "neural" phonetic typewriter",
97-100.
Apolloni, Bruno / Pazienti, Francesco / Trotta, Vincenzo:
"Isolated word adaptive recognizer based on neural networks",
101-104.
Hataoka, Nobuo / Waibel, Alex H.:
"Evaluation of speaker-independent phoneme recognition on TIMIT database using TDNNs",
105-108.
Morgan, Nelson / Bourlard, Hervé / Wooters, C. / Kohn, Phil / Cohen, M.:
"Phonetic context in hybrid HMM/MLP continuous speech recognition",
109-112.
Andrews, E. C. / Mason, J. S.:
"Neural network classification of complex-valued speech features",
113-116.
Norris, Dennis:
"Rewiring lexical networks on the fly",
117-120.
Elenius, K. / Takacs, G.:
"Phoneme recognition with an artificial neural network",
121-124.
Jianxin, Jiang / Kechu, Yi / Zheng, Hu:
"A new self-organization algorithm of forming a phoneme map",
125-128.
Ran, Shuping / Millar, J. Bruce:
"Phoneme classification using neural networks based on acoustic-phonetic structure",
129-132.
Dodd, Nigel / Macfarlane, Donald / Marland, Chris:
"Networks for speech recognition structurally optimised by genetic techniques implemented on parallel hardware",
133-136.
Phonetics I, II
Pittam, J. / Ingram, J.:
"Influence of Vietnamese tone and prosody on the acquisition of English stress patterns",
139-142.
Sendlmeier, Walter F.:
"The voiced/unvoiced distinction of initial stops by normal and hearing impaired listeners",
143-146.
Nathan, Krishna S.:
"Comparison of formant transition based stop classifiers: time-varying and time-invariant signal models",
147-150.
Benoît, Christian / Abry, Christian / Roe, L. J.:
"The effect of context on labiality in French",
151-156.
Datta, A. K. / Ganguli, N. R. / Mukherjee, B.:
"Nasalisation in Bengali speech sounds acoustic-phonetic study",
157-160.
Ganguli, N. R.:
"Vowel formant frequency distribution of a major Indian language",
161-164.
Harmegnies, Bernard / Bruyninckx, M. / Llisterri, Joaquim / Poch, Dolors:
"Effects of language change on voice quality in bilingual speakers, corpus content effect",
165-168.
Shevchenko, T. I. / Skopintseva, T. S.:
"Effects of social and regional backgrounds on LTAS in British English",
169-172.
Heuvel, Henk van den / Cranen, Bert / Rietveld, Toni:
"Speaker related variability in the durations of Dutch speech segments",
251-254.
Liljencrants, Johan:
"Numerical simulations of glottal flow",
255-258.
Jansen, Joop / Cranen, Bert / Boves, Louis:
"Modelling of source characteristics of speech sounds by means of the LF-model",
259-262.
Herzel, H. / Wendler, J.:
"Evidence of chaos in phonatory samples",
263-266.
Van, L. Trinh / Guérin, Bernard / Castelli, E.:
"Source-tract coupling and the subglottal system in an articulatory synthesizer",
267-270.
Multilingual Speech Recognition Systems (Special Session)
Bamberg, Paul / Demedts, Anne / Elder, John / Huang, Caroline / Ingold, Charles / Mandel, Mark / Manganaro, Linda / Even, Stijn van:
"Phoneme-based training for large-vocabulary recognition in six European languages",
175-182.
Cerf-Danon, Helene / DeGennaro, Steven / Ferretti, Marco / Gonzalez, Jorge / Keppel, Eric:
"TANGORA - a large vocabulary speech recognition system for five languages",
183-192.
Ney, Hermann / Billi, Roberto:
"Prototype systems for large-vocabulary speech recognition: Polyglot and SPICOS",
193-200.
Spoken Language Parsing
Wright, J. H.:
"Adaptation of grammar-based language models for continuous speech recognition",
203-206.
Su, Keh-Yih / Chiang, Tung-Hui / Lin, Yi-Chung:
"A robustness and discrimination oriented score function for integrating speech and language processing",
207-210.
Baggia, Paolo / Fissore, Lorenzo / Gerbino, E. / Giachin, Egidio P. / Rullent, C.:
"Improving speech understanding performance through feedback verification",
211-214.
Corazza, A. / Mori, Renato De / Gretter, R. / Satta, G.:
"Computation of upper-bounds for island-driven stochastic parsers",
215-218.
Andry, Francois / Thornton, Simon:
"A parser for speech lattices using a UCG grammar",
219-222.
Young, Sheryl / Matessa, Michael:
"Using pragmatic and semantic knowledge to correct parsing of spoken language utterances",
223-227.
Speech Coding I-IV
Abrantes, A. J. / Marques, J. S. / Trancoso, Isabel M.:
"Hybrid sinusoidal modeling of speech without voicing decision",
231-234.
Marques, J. S. / Trancoso, Isabel M. / Abrantes, A. J.:
"Harmonic coding of speech: an experimental study",
235-238.
Rowe, David / Cowley, William / Perkis, Andrew:
"A multiband excitation linear predictive speech coder",
239-242.
Leung, S. H. / Lai, K. L. / Wong, O. Y. / Luk, A.:
"A new coded excitation model using multifrequency decomposition",
245-248.
Sereno, Daniele:
"Frame substitution and adaptive post-filtering in speech coding",
595-598.
Atungsiri, S. A. / Soheili, R. / Kondoz, A. M. / Evans, B. G.:
"Effective lost speech frame reconstruction for CELP coders",
599-602.
Nagabuchi, Hiromi / Kitawaki, Nobuhiko:
"Evaluation and improvement of coded speech quality degraded by cell loss in ATM networks",
603-606.
Vigier, Alain J.:
"Combined source-channel coding for a very noisy channel",
607-610.
Rosina, G. / Agostino, M. Sant' / Turco, E. / Vetrano, L.:
"Testing and quality enhancement of the GSM full rate voice channel",
611-614.
Kipper, U. / Reininger, Herbert / Wolf, Dietrich:
"Low bit rate speech coding using CELP with adaptive excitation codebook",
893-896.
Fuldseth, A. / Harborg, E. / Johansen, F. T. / Knudsen, J. E.:
"A real-time implementable 7 kHz speech coder at 16 kbit/s",
897-900.
Zarkadis, D. J.:
"Adaptive spectral weighting for vector predictive coding of the LPC-spectra",
901-904.
Saoudi, Samir / Boucher, J. Marc / Guyader, Alain Le:
"Medium band speech coding using optimal scalar quantization of LSP",
905-908.
Secker, Philip / Perkis, Andrew:
"Joint source and channel coding of line spectrum pairs",
909-912.
Chan, C. F. / Law, K. W.:
"An algorithm for computing LSP frequencies directly from the reflection coefficients",
913-916.
Meyer, Peter / Peters, W. / Paulus, J.:
"Variable rate speech coding using perceptive thresholds and adaptive VUS detection",
809-812.
Suddle, M. R. / Atungsiri, S. A. / Kondoz, A. M. / Evans, B. G.:
"A secure and robust CELP coder for land and satellite mobile systems",
813-816.
Ribeiro, C. M. / Trancoso, Isabel M.:
"A 4.8 kbps CELP coder with post-processing",
817-820.
Law, K. W. / Wong, O. Y. / Chan, C. F.:
"A real-time high quality joint-excitation linear predictive coder at 8 kbps",
821-824.
Deiacovo, Rosario Drogo / Montagna, Roberto:
"Some experiments in perceptual masking of quantizing noise in analysis-by-synthesis speech coders",
825-828.
Yang, Gao / Leich, Henri / Boite, René:
"A very high-quality CELP coder at the rate of 2400 bps",
829-832.
Liu, Z. Yong:
"An effective pulse adaptive code-excited linear predictive coder at 4 kb/s",
835-838.
Chan, C. F. / Leung, S. H.:
"A vocoder using high-order LPC filter with very few non-zero coefficients",
839-842.
Assessment, Intelligibility and Aids for Disabled
Rossi, Mario / Espesser, Robert / Pavlovic, Chaslav:
"The effects of an internal reference system and cross-modality matching on the subjective rating of speech synthesisers",
273-276.
Sydeserff, H. A. / Caley, R. J. / Isard, Stephen D. / Jack, Mervyn A. / Monaghan, Alex I. C. / Verhoeven, J.:
"Evaluation of speech synthesis techniques in a comprehension task",
277-280.
Howard-Jones, P. A.:
"'SOAP' - a speech output assessment package for controlled multilingual evaluation of synthetic speech",
281-284.
Houtgast, Tammo / Verhave, Jan A.:
"A physical approach to speech quality assessment: correlation patterns in the speech spectrogram",
285-288.
Miyata, H. / Houtgast, Tammo:
"Weighted MTF for predicting speech intelligibility in reverberant sound fields",
289-292.
Jekosch, Ute:
"Speech intelligibility studies for the European Hermes spaceplane",
293-296.
Wei, Jianing / Faulkner, Andrew / Fourcin, Adrian:
"An application of speech processing and encoding scheme for Chinese lexical tone and consonant perception by hearing impaired listeners",
299-302.
Kanevsky, D. / Gopalakrishnan, P. / Danis, C. / Daggett, G. / Epstein, E. / Nahamoo, David:
"On the development of a phone communication aid for the hearing impaired",
303-306.
Anglade, Yolande / Pierrel, Jean-Marie / Junqua, Jean-Claude:
"A spoken language interface for a telephone switchboard operator center",
307-310.
Murray, Iain R. / Arnott, John L. / Alm, Norman / Newell, Alan F.:
"A communication system for the disabled with emotional synthetic speech produced by rule",
311-314.
Speech Synthesis: Techniques and Applications
Portele, Thomas / Steffan, Birgit / Preuß, Rainer / Hess, Wolfgang:
"German speech synthesis by concatenation of non-parametric units",
317-320.
Abbattista, Giuseppe / Riccio, Antonello / Mumolo, Enzo:
"Automatic document reader with speech output capabilities",
321-324.
King, R. W.:
"Tools and processes for developing low-cost and high-quality text-to-speech synthesis for communication aids",
325-329.
Hermansky, Hynek / Cox Jr., Louis Anthony:
"Perceptual linear predictive (PLP) analysis-resynthesis technique",
329-332.
Greisbach, Reinhold / Kröger, Bernd J. / Esser, O. / Plaßmann, G.:
"A display technique for measurements of natural and synthetic articulatory dynamics",
333-336.
Chang, Yueh-Chin / Lee, Yi-Fan / Shia, Bang-Er / Wang, Hsiao-Chuan:
"Statistical models for the Chinese text-to-speech system",
337-340.
Taylor, P. A. / Nairn, I. A. / Sutherland, A. M. / Jack, Mervyn A.:
"A realtime speech synthesis system",
341-344.
Valbret, H. / Moulines, E. / Tubach, Jean-Pierre:
"Voice transformation using PSOLA technique",
345-348.
Giustiniani, M. / Pierucci, Piero:
"Phonetic ergodic HMM for speech synthesis",
349-352.
Delogu, C. / Paoloni, P. / Pocci, P. / Sementina, C.:
"Quality evaluation of text-to-speech synthesizers using magnitude estimation, categorical estimation, pair comparison and reaction time methods",
353-356.
Zingte, H. / Hennebois, Cl.:
"Helping young children to associate sounds and letters through speech synthesis",
357-360.
Bourlard, Hervé:
"Neural nets and hidden Markov models: review and generalizations",
363-369.
Jayant, N. S. / Johnston, J. D. / Shoham, Y.:
"Coding of wideband speech",
373-379.
Probabilistic Language Models for Speech Recognition
Pieraccini, Roberto / Levin, Esther:
"Stochastic representation of semantic structure for speech understanding",
383-386.
Matheson, Colin / McInnes, Fergus R.:
"Incorporating probabilities into the dualgram language model",
387-390.
Giachin, Egidio P.:
"A dynamic programming based framework for stochastic spoken language understanding",
391-394.
Prieto, Natividad / Vidal, Enrique:
"Learning language models through the ECGI method",
395-398.
Cremonini, R. / Ferretti, M. / Galimberti, M. C. / Maltese, Giulio / Mancini, Federico:
"Using a generative grammar to train a probabilistic language model for speaker-independent speech recognition",
399-402.
Speech Recognition and Phonetic Modelling
Shirai, Katsuhiko / Kitagawa, E. / Endo, T.:
"Optimal construction of context sensitive quantizer for phoneme recognition in continuous speech",
405-408.
O'Kane, M. / Kenne, P. / Landy, D. / Atkins, S.:
"Generalising from single-speaker recognition in a feature-based recogniser",
409-412.
Hirsch, H. G. / Meyer, Peter / Ruehl, Hans-Wilhelm:
"Improved speech recognition using high-pass filtering of subband envelopes",
413-416.
Gong, Yifan / Haton, Jean-Paul:
"Comparing two phoneme identification methods using a continuous speech recognizer",
417-420.
Ederveen, D. / Boves, Louis:
"Knowledge-based phoneme recognition",
421-424.
Speaker Identification and Verification
Kraayeveld, J. / Rietveld, A. C. M. / Heuven, V. J. van:
"Speaker characterization in Dutch using prosodic parameters",
427-430.
Hunt, Alan K.:
"New commercial applications of telephone-network-based speech recognition and speaker verification",
431-434.
Bonastre, Jean-Francois / Meloni, Henri / Langlais, Philippe:
"Analytical strategy for speaker identification",
435-438.
Xu, L. / Mason, J. S.:
"Optimization of perceptually-based spectral transforms in speaker identification",
439-442.
Pitch Determination and Voice Separation
Cheveigne, Alain de:
"A mixed speech F0 estimation algorithm",
445-448.
Jones, Edward / Ambikairajah, Eliathamby:
"A perceptually-based pitch extractor for band-limited speech",
449-452.
Gu, Yu Hua:
"A robust pseudo perceptual pitch estimator",
453-456.
Degan, N. Dal / Fratti, M.:
"Pitch estimation based on a "narrowed" autocorrelation function",
457-460.
Speech Recognition: Understanding Systems
Nakagawa, Seiichi / Hirata, Yoshimitsu / Murase, Isao:
"The syntax-oriented spoken Japanese understanding system SPOJUS-SYNO II",
463-466.
Bergmann, H. / Hamer, H.-H. / Noll, A. / Paeseler, A. / Tomaschewski, H.:
"An adaptable man-machine interface using connected-word recognition",
467-470.
Poza, M. J. / Torre, C. de la / Tapias, D. / Villarrubia, L.:
"An approach to automatic recognition of keywords in unconstrained speech using parametric models",
471-474.
Hetherington, I. Lee / Leung, Hong C. / Zue, Victor W.:
"Toward vocabulary-independent recognition of telephone speech",
475-478.
Cole, Ronald / Roginski, Krist / Fanty, Mark:
"English alphabet recognition with telephone speech",
479-482.
Fiset, J.-Y. / Robert, J.-M. / Descout, Raymond:
"Evolutionary language models in air traffic control training",
483-486.
Jones, G. J. F. / Wright, J. H. / Wrigley, E. N. / Carey, M. J. / Parris, Eluned S.:
"Isolated-word sentence recognition using probabilistic context-free grammar",
487-489.
Hood, Mitchell:
"Lexical access in a speech understanding and dialogue system",
490-493.
Haeb-Umbach, Reinhold / Ney, Hermann:
"A look-ahead search technique for large vocabulary continuous speech recognition",
495-498.
Teixeira, Carlos J. / Trancoso, Isabel M.:
"Spectral subtraction for front-end noise reduction in a speech recognizer",
499-502.
Speech Databases, Analysis and Assessment
Lamel, Lori F. / Gauvain, Jean-Luc / Eskenazi, Maxine:
"BREF, a large vocabulary spoken corpus for French",
505-508.
Mathan, Luc / Morin, Dominique:
"Speech field databases: development and analysis",
509-512.
Itahashi, Shuichi:
"Large scale Japanese dialect speech corpora",
513-516.
Vossen, Paulus H.:
"Outline of a design-oriented evaluation framework for speech-driven applications",
517-520.
Winski, Richard / Kordi, Kamran:
"Assessment of continuous speech recognisers using recogniser sensitivity analysis",
521-524.
Bourjot, C. / Boyer, A. / Fohr, D.:
"A tool for assessment of acoustic phonetic lattices",
525-528.
Steeneken, Herman J. M. / Velden, Jeroen G. van:
"RAMOS - recognizer assessment by means of manipulation of speech applied to connected speech recognition",
529-532.
Alphen, Paul van / Pols, Louis C. W.:
"Comparing various feature vectors in automatic speech recognition",
533-536.
Zue, Victor W. / Glass, James / Goodine, David / Hirschman, Lynette / Leung, Hong C. / Phillips, Michael / Polifroni, Joseph / Seneff, Stephanie:
"The MIT ATIS system; preliminary development, spontaneous speech data collection, and performance evaluation",
537-540.
Benaouicha, S. / Rajouani, A. / Zyoute, M.:
"Construction of an Arabic speech data base - duration model of Arabic vowels",
541-544.
Denbigh, P. N. / Zhao, J.:
"Pitch extraction and separation of overlapping speech",
545-548.
Neural Nets I, II
Bengio, Yoshua / Mori, Renato De / Flammia, Giovanni / Kompe, Ralf:
"Phonetically motivated acoustic parameters for continuous speech recognition using artificial neural networks",
551-554.
Carey, Michael J. / Parris, Eluned S.:
"Adapting input transformations using alpha-nets for whole word speech recognition",
555-558.
Niles, Les T.:
"TIMIT phoneme recognition using an HMM-derived recurrent neural network",
559-562.
Husoy, P. O. / Svendsen, T.:
"ANN-based speech recognition using a preprocessor for non-linear time compression",
563-566.
Sorensen, Helge B. D. / Hartmann, Uwe:
"A self-structuring neural noise reduction model",
567-570.
Petek, Bojan / Waibel, Alex H. / Tebelskis, Joseph M.:
"Integrated phoneme-function word architecture of hidden control neural networks for continuous speech recognition",
1407-1410.
Zhang, X. / Mason, J. S. / Andrews, E. C.:
"Multiple dynamic features to enhance neural net based speaker verification",
1411-1414.
Haffner, Patrick / Waibel, Alex H.:
"Time-delay neural networks embedding time alignment: a performance analysis",
1415-1418.
Fukuda, Yohji / Matsumoto, Haruya:
"Phoneme recognition using recurrent neural networks",
1419-1423.
Komori, Yasuhiro / Hatazaki, Kaichiro:
"An integration of knowledge and neural networks toward a phoneme typewriter without a language model",
1423-1426.
Parsing and Lexical Access
Hosaka, Junko / Takezawa, Toshiyuki / Ehara, Terumasa:
"Utilizing empirical data for postposition classification toward spoken Japanese speech recognition",
573-576.
Phillips, Michael / Glass, James / Zue, Victor W.:
"Automatic learning of lexical representations for sub-word unit based speech recognition systems",
577-580.
Lacouture, Roxane / Mori, Renato De:
"Lexical tree compression",
581-584.
Riley, Michael D. / Ljolje, Andrej:
"Lexical access with a statistically-derived phonetic network",
585-588.
Antoniol, G. / Brugnara, F. / Giuliani, D.:
"Admissible strategies for acoustic matching with a large vocabulary",
589-592.
Modelling Duration in Speech
Macarron, Alejandro / Escalada, Gregorio / Rodriguez, Miguel Angel:
"Generation of duration rules for a Spanish text-to-speech synthesizer",
617-620.
Mortamet, L.:
"Implementing duration expert rules into a text-to-speech synthesis system",
621-624.
Kaiki, Nobuyoshi / Mimura, Katsuhiko / Sagisaka, Yoshinori:
"Statistical modeling of segmental duration and power control for Japanese",
625-628.
Campbell, W. Nick:
"Phrase-level factors affecting timing in speech",
629-632.
Karjalainen, Matti / Altosaar, Toomas:
"Phoneme duration rules for speech synthesis by neural networks",
633-636.
Automatic Speech Recognition: Algorithms I-III
McInnes, Fergus R.:
"Context-sensitive phoneme lattice generation using interpolated demi-diphone and triphone models",
639-642.
Song, J. M. / Thomas, T. / Patel, M.:
"Experiments of 991-word speaker independent continuous speech recognition on DARPA RM task",
643-646.
Meloni, Henri / Bechet, F. / Gilles, P.:
"Bottom-up acoustic-phonetic decoding for the selection of word cohorts from a large vocabulary",
647-650.
Peinado, Antonio M. / Roman, Ramon / Segura, Jose C. / Rubio, Antonio J. / Garcia, Pedro / Diaz, Jesus E.:
"Entropic training for HMM speech recognition",
651-654.
Kenny, P. / Parthasarathy, S. / Gupta, V. N. / Lennig, Matthew / Mermelstein, Paul / O'Shaughnessy, Douglas:
"Energy, duration and Markov models",
655-658.
Nijtmans, J. J.:
"A new recursive Markov model with a new state pruning approach for large vocabulary continuous speech recognition",
659-663.
McInnes, Fergus R.:
"Context-sensitive phoneme lattice generation using interpolated demi-diphone and triphone models",
663-666.
Nowell, Peter / Thompson, Henry S.:
"An efficient implementation of the n-best algorithm for lexical access",
667-670.
Falaschi, Alessandro / Pucci, Massimo:
"Automatic derivation of HMM alternative pronunciation network topologies",
671-674.
Galiano, Isabel / Casacuberta, Francisco / Sanchis, Emilio:
"On the structure of subword units for a speaker independent continuous speech task",
675-678.
Zhao, Yunxin / Wakita, Hisashi / Zhuang, Xinhua:
"Generate word transcription dictionary from sentence utterances and evaluate its effect on speaker-independent continuous speech recognition",
679-682.
Varga, A. P. / Moore, Roger K.:
"Simultaneous recognition of concurrent speech signals using hidden Markov model decomposition",
1175-1178.
Ballantyne, I. A. / Sutherland, A. M. / Hannah, J. M. / Jack, Mervyn A.:
"A large vocabulary parallel processing continuous speech recognition system",
1179-1182.
Rose, Richard C. / Hofstetter, Edward M.:
"Techniques for robust word spotting in continuous speech messages",
1183-1186.
Falaschi, Alessandro / Micozzi, Alfredo:
"Word spotting by CSR through vector quantized background models",
1187-1190.
Junqua, Jean-Claude / Wakita, Hisashi:
"Towards an artificial laboratory for the design and simulation of cooperative speech processing algorithms",
1191-1194.
Edwards, K. / McInnes, Fergus R. / Jack, Mervyn A.:
"Accent specific modifications for continuous speech recognition based on a sub-word lattice approach",
1195-1198.
Lleida, Eduardo / Marino, Jose B. / Nadeu, Climent / Oliveras, Albert:
"Two level continuous speech recognition using demisyllable-based HMM word spotting",
1199-1202.
Applebaum, Ted H. / Hanson, Brian A.:
"Tradeoffs in the design of regression features for word recognition",
1203-1206.
Bahl, Lalit R. / Brown, Peter F. / Souza, Peter V. de / Mercer, Robert L. / Nahamoo, David:
"A fast algorithm for deleted interpolation",
1209-1212.
Franzini, Michael A. / Waibel, Alex H. / Lee, Kai-Fu:
"Recent work in continuous speech recognition using the connectionist Viterbi training procedure",
1213-1216.
Steinbiss, Volker:
"A search organization for large-vocabulary recognition based on n-best decoding",
1217-1220.
Gong, Yifan / Haton, Jean-Paul:
"VINICS: a continuous speech recognizer based on a new robust formulation",
1221-1224.
Sagayama, Shigeki:
"A matrix representation of HMM-based speech recognition algorithms",
1225-1228.
Segmentation
Dalsgaard, Paul / Andersen, Ove / Barry, William:
"Multi-lingual acoustic-phonetic features for a number of European languages",
685-688.
Kabre, H. / Pérennou, Guy / Vigouroux, Nadine:
"A non-linear filtering method applied to automatic segmentation of multilingual speech corpora",
689-692.
Cosi, Piero / Falavigna, Daniele / Omologo, Maurizio:
"A preliminary statistical evaluation of manual and automatic segmentation discrepancies",
693-696.
McQueen, J. M. / Briscoe, E. J.:
"A computational tool for examining lexical segmentation in continuous speech",
697-700.
Schmidt, M. S. / Watson, G. S.:
"The evaluation and optimization of automatic speech segmentation",
701-704.
Feng, G. / Achab, N. / Combescure, R.:
"On-line speech segmentation using adaptive models: application to variable rate speech coding",
705-708.
Taylor, P. A. / Isard, Stephen D.:
"Automatic diphone segmentation",
709-711.
Ottesen, Georg E.:
"An automatic diphone segmentation system",
713-716.
Brierton, R. A. / Cheetham, B. M. G.:
"An evaluation of spectral transitivity functions for speech segmentation in variable frame-rate speech vocoding",
717-720.
Automatic Speech Recognition: Applications
Compernolle, Dirk Van / Smolders, J. / Jaspers, P. / Hellemans, T.:
"Speaker clustering for dialectic robustness in speaker independent recognition",
723-726.
Yashchin, Dina / Ortel, William C. G.:
"Experience with speech recognition in automating telephone operator functions",
727-730.
Canavesio, F. / Fissore, Lorenzo / Oreglia, M. / Ruscitti, P.:
"HMM modeling in the public telephone network environment: experiments and results",
731-734.
Morin, Dominique:
"Influence of field data in HMM training for a vocal server",
735-738.
Ciaramella, A. / Fissore, Lorenzo / Pacchiotti, A. / Pacifici, R.:
"An isolated word speech recognizer prototype for mobile-radio applications",
739-742.
Natural Language Processing
Monaghan, James / Cheepen, Christine:
"Linguistic modelling for a speech interface in the office context",
745-748.
Carlo, Andrea Di / Falcone, Rino:
"Ill-formedness problem in the spoken language processing",
749-752.
Maltese, Giulio / Mancini, Federico:
"A technique to automatically assign parts-of-speech to words taking into account word-ending information through a probabilistic model",
753-756.
Pelillo, Marcello / Refice, Mario:
"Syntactic category disambiguation through relaxation processes",
757-760.
Wrigley, E. N. / Wright, J. H.:
"Computational requirements of probabilistic LR parsing for speech recognition using a natural language grammar",
761-764.
Symbolic Processing in Speech Synthesis
Tiboni, J. / Pérennou, G.:
"Phonotypical transcription through the GEPH expert system",
767-770.
Williams, Briony / Maier, Franziska:
"A spelling corrector for use in text-to-speech synthesis for English",
771-774.
Russi, Thomas:
"Robust and efficient parsing for applications such as text-to-speech conversion",
775-778.
Luk, Robert W. P. / Damper, Robert I.:
"Stochastic transduction for English text-to-phoneme conversion",
779-782.
Sub-Lexical Unit Modelling
Hwang, M. Y. / Huang, X. D.:
"Acoustic distribution clustering in phonetic hidden Markov models",
785-788.
Blomberg, M.:
"Modelling articulatory inter-timing variation in a speech recognition system based on synthetic references",
789-792.
Torkkola, Kari / Kokkonen, Mikko / Kurimo, Mikko / Utela, Pekka:
"Improving short-time speech frame recognition results by using context",
793-796.
Rentzepopoulos, P. A. / Kokkinakis, George K.:
"Phoneme to grapheme conversion using HMM",
797-800.
Parfitt, S. H. / Sharman, R. A.:
"A bi-directional model of English pronunciation",
801-804.
Speech Understanding and Dialogue
Goodine, David / Seneff, Stephanie / Hirschman, Lynette / Phillips, Michael:
"Full integration of speech and language understanding in the MIT spoken language system",
845-848.
Yamaoka, Takayuki / Iida, Hitoshi:
"Dialogue interpretation model and its application to next utterance prediction for spoken language processing",
849-852.
Boogers, W.:
"Dialogue construction by compilation",
853-856.
Nogaito, Izuru / Takahashi, Masahiko / Kuroiwa, Shingo / Yato, Fumihiro:
"Dialogue management in an extension number guidance system",
857-860.
Segarra, Encarna / Garcia, Pedro:
"Automatic learning of acoustic and syntactic-semantic levels in continuous speech understanding",
861-864.
Baggia, Paolo / Ciaramella, A. / Clementino, D. / Fissore, Lorenzo / Gerbino, E. / Giachin, Egidio P. / Micca, G. / Nebbia, L. / Pacifici, R. / Pirani, G. / Rullent, C.:
"A man-machine dialogue system for speech access to e-mail information using the telephone: implementation and first results",
865-868.
Assessment
Bezooijen, Renee van / Pols, Louis C. W.:
"Performance of text-to-speech conversion for Dutch: a comparative evaluation of allophone and diphone based synthesis at the level of the segment, the word, and the paragraph",
871-874.
Benoît, Christian / Emerard, Francoise / Schnabel, Betina / Tseva, A.:
"Quality comparisons of prosodic and of acoustic components of various synthesisers",
875-878.
Grice, Martine / Vagges, Kiki / Hirst, Daniel:
"Assessment of intonation in text-to-speech synthesis systems - a pilot test in English and Italian",
879-882.
Monaghan, Alex I. C.:
"Evaluation of the naturalness of prosody generated by the CSTR TTS system",
883-886.
Halka, Ulrich:
"Speech-model processes for objective quality measurements of speech-coding systems",
887-890.
Speech Recognition: Stochastic Modelling
Euler, S.:
"Adaptation techniques in tied density hidden Markov models",
919-922.
Jouvet, D. / Bartkova, K. / Monné, Jean:
"On the modelization of allophones in an HMM based speech recognition system",
923-926.
Jouvet, D. / Mauuary, L. / Monné, Jean:
"Automatic adjustments of the structure of Markov models for speech recognition applications",
927-930.
Leung, Hong C. / Hetherington, I. Lee / Zue, Victor W.:
"Speech recognition using stochastic explicit-segment modeling",
931-934.
Dubois, D.:
"Comparison of time-dependent acoustic features for a speaker-independent speech recognition system",
935-938.
Gauvain, Jean-Luc / Lee, Chin-Hui:
"Bayesian learning for hidden Markov model with Gaussian mixture state observation densities",
939-942.
Speech Interfaces: Systems and Applications
Ruehl, Hans-Wilhelm:
"Voice controlled mail ordering via telephone using SPREIN",
945-948.
Dobler, Stefan / Armbruester, Werner / Meyer, Peter / Ruehl, Hans-Wilhelm:
"A voice dialling device for mobile radio",
949-952.
Smaili, K. / Charpillet, F. / Pierrel, Jean-Marie / Haton, Jean-Paul:
"A continuous speech recognition approach for the design of a dictation machine",
953-956.
Thomson, David L. / Wilpon, Jay G. / Sukkar, Rafid A. / Prezas, Dimitrios P.:
"Automatic speech recognition in the Spanish telephone network",
957-960.
Billi, Roberto / Buttafava, P. / Stefani, P. De / Gamba, M. / Voltolini, D.:
"Computer-aided, voice-based, medical report preparation: an application to radiology",
961-964.
Carlos, Filipe N. / Carmona, Jose P. / Chagas, Pedro M. / Oliveira, Luis C. / Serralheiro, Antonio J. / Trancoso, Isabel M.:
"A recognition / synthesis system applied to database access through the telephone network",
965-968.
Helle, Seppo:
"An experiment in using a hypertext system in phonetics and speech processing education",
969-972.
Antoniol, G. / Brugnara, F. / Palma, F. Dalla / Lazzari, G. / Moser, E.:
"A.RE.S.: an interface for automatic reporting by speech",
973-976.
Schultheiß, U. / Lochschmidt, B.:
"COGNITO - an experimental voice-controlled telecommunication system",
977-980.
Bernstein, Jared / Rtischev, Dimitry:
"A voice interactive language instruction system",
981-984.
Rooney, Edmund / Hiller, Steven / Laver, John / Benedetto, Maria-Gabriella Di:
"Macro and micro features for automated pronunciation improvement in the SPELL system",
985-988.
Neural Nets: Comparative Studies, Lexical Recognition
Devillers, Laurence / Dugast, Christian:
"Comparison of continuous mixture densities and TDNN in a Viterbi-framework: experiments on speaker dependent DARPA RM1+",
991-994.
Thurston, Peter / Norris, Dennis:
"A comparison of two compression functions used for noisy vowel detection with back-propagation networks",
995-998.
Ferreiros, J. / Castro, A. / Pardo, J. M.:
"Comparison between two different approaches in speaker-independent isolated digit recognition",
999-1002.
Poirier, Franck:
"DVQ: dynamic vector quantization application to speech processing",
1003-1006.
Bengio, Yoshua / Mori, Renato De / Flammia, Giovanni / Kompe, Ralf:
"A comparative study on hybrid acoustic phonetic decoders based on artificial neural networks",
1007-1010.
Sawai, Hidefumi / Nakamura, Satoru:
"Time-delay neural network architectures for high-performance speaker-independent recognition",
1011-1014.
Wittenburg, P. / Couwenberg, R.:
"Recurrent neural nets as building blocks for human word recognition",
1015-1018.
Mekuria, Fisseha / Fjällbrant, Tore:
"A neural net model for vector quantization",
1019-1022.
Russell, N. H. / Fallside, Frank / Robinson, A. J. / Prager, R. W.:
"Lexical access using a recurrent error propagation network",
1023-1026.
Brauer, Peter / Hedelin, Per / Huber, Dieter / Knagenhjelm, Petter / Molno, Johan:
"Model or non-model based classifiers",
1027-1030.
Altosaar, Toomas / Karjalainen, Matti:
"Event-based recognition and analysis of speech by neural networks",
1031-1034.
Jelinek, Frederick:
"Up from trigrams! - the struggle for improved language models",
1037-1040.
Carlson, Rolf:
"Synthesis: modelling variability and constraints",
1043-1048.
Dialogue and Translation
Guyomard, M. / Siroux, J. / Cozannet, A.:
"The role of dialogue in speech recognition: the case of the yellow",
1051-1054.
Gerbino, E. / Baggia, Paolo:
"Interpretation of context-dependent utterances in man-machine dialogue",
1055-1058.
Eggins, S. / Vonwiller, Julie P. / Matthiessen, C. M. I. / Sefton, P.:
"The description of minor clauses in information-seeking telephone dialogues",
1059-1062.
Roe, David B. / Pereira, Fernando / Sproat, Richard W. / Riley, Michael D. / Moreno, Pedro J. / Macarron, Alejandro:
"Toward a spoken language translator for restricted-domain context-free languages",
1063-1066.
Subramaniam, N. Venkata / Alwar, N. / Mallikarjuna, G. / Rao, P. Prabhakar / Raman, S.:
"Bidirectional machine translation in Indian languages",
1067-1070.
Speech Analysis and Signal Representation
Papaodysseus, C. / Koukoutsis, E. / Triantafyllou, C. / Vasilatos, C.:
"Exact monitoring of the numerical error in various speech algorithms",
1073-1076.
Koreman, Jacques / Cranen, Bert / Boves, Louis:
"Automatic computation and comparison of dynamically varying voice source parameters",
1077-1080.
Alku, Paavo:
"Glottal wave analysis with pitch synchronous iterative adaptive inverse filtering",
1081-1084.
Galas, Thierry / Rodet, Xavier:
"Generalized functional approximation for source-filter system modeling",
1085-1088.
Bimbot, Frederic / Atal, Bishnu S.:
"An evaluation of temporal decomposition",
1089-1092.
Discriminant Training and Speaker Adaptation
Zünkler, Klaus:
"A discriminative recognizer for isolated and continuous speech using statistical separability measures",
1095-1098.
Schmidbauer, O. / Höge, H.:
"Speaker adaptation based on articulatory features",
1099-1102.
Brugnara, F. / Mori, Renato De / Giuliani, D. / Omologo, M.:
"A parallel HMM approach to speech recognition",
1103-1106.
Nitta, Tsuneo / Iwasaki, Jun'ichi / Matsu'ura, Hiroshi:
"Speaker independent word recognition using HMMs with an orthogonalized phonetic segment codebook",
1107-1110.
Fung, Pascale / Kawahara, Tatsuya / Doshita, Shuji:
"Unsupervised speaker normalization by speaker Markov model converter for speaker-independent speech recognition",
1111-1114.
Perception I
Son, Rob J. J. H. van / Pols, Louis C. W.:
"The influence of formant track shape on the perception of synthetic vowels",
1117-1120.
Howard-Jones, P. A.:
"Fluctuation of noise background: measurement and significance in relation to speech masking",
1121-1124.
Ma, C. / Willems, L. F.:
"The audibility of narrow band noise in flat spectral complex sounds",
1125-1128.
Laan, Gitta P. M. / Bergem, Dick R. van / Beinum, Florien J. Koopmans-van:
"The importance of spectral quality of vowels for the intelligibility of sentences",
1129-1132.
Steeneken, Herman J. M. / Houtgast, Tammo:
"On the mutual dependency of octave-band-specific contributions to speech intelligibility",
1133-1136.
Ooyen, Brit van / Cutler, Anne / Norris, Dennis:
"Detection times for vowels versus consonants",
1451-1454.
Bergem, Dick R. van:
"The influence of sentence accent, word stress, and word class on the quality of vowels",
1455-1458.
Beinum, Florien J. Koopmans-van:
"A peak-and-level model for focus words in read and spontaneous natural speech and in synthetic speech",
1459-1462.
Ingram, J. / Pittam, J.:
"Connected speech processes in second language learning",
1463-1466.
Speech Synthesis and Prosody
Potapova, Rodmonga K.:
"Modification of acoustic features in Russian connected speech",
1141-1144.
Strik, Helmer / Boves, Louis:
"On the relation between voice source characteristics and prosody",
1145-1148.
Stensby, Sverre:
"Prosody in a rule-based Norwegian text-to-speech system",
1149-1152.
Madhukumar, A. S. / Rajendran, S. / Sekhar, C. Chandra / Yegnanarayana, B.:
"Synthesizing intonation for speech in Hindi",
1153-1156.
Hieronymus, James L. / Williams, Briony J.:
"An investigation of the relation between perceived pitch accent and automatically-located accent in British English",
1157-1160.
Quazza, S.:
"Modelling Italian intonation in a text-to-speech system",
1161-1164.
O'Malley, Michael H. / Resnick, Howard / Caisse, Michelle:
"An analysis of strategies for finding prosodic clues in text",
1165-1168.
Balestri, Marcello:
"A coded dictionary for stress assignment rules in Italian",
1169-1172.
Text-to-Speech Synthesis Systems
Lindert, Enrico te / Leeuwen, Hugo C. van:
"Speech maker: text-to-speech conversion based on a multi-level, synchronized data structure",
1231-1234.
Lewis, E. / Tatham, Mark A. A.:
"A new text-to-speech synthesis system",
1235-1238.
Oliveira, Luis C. / Viana, M. Ceu / Trancoso, Isabel M.:
"DIXI - Portuguese text-to-speech system",
1239-1242.
Hansen, P. Molbaek / Petersen, N. Reinholt / Rischel, J. / Henriksen, C.:
"Higher-level linguistic information in a text-to-speech system for Danish",
1243-1246.
Olaszy, Gabor:
"Adaptation of the MULTIVOX text-to-speech system to Italian",
1247-1250.
Phonetic Modelling
Niyogi, Partha / Zue, Victor W.:
"Correlation analysis of vowels and their application to speech recognition",
1253-1256.
Holmes, John N.:
"Use of phonetic knowledge when designing and training stochastic models for speech recognition",
1257-1260.
Kaspar, B. / Schuhmacher, K.:
"Modelling phones by microsegments in a phonetically oriented recognition system",
1261-1264.
Kim, Il K. / Lee, H. S.:
"An extended LVQ2 algorithm and its application to phoneme classification",
1265-1268.
Dix, P. J. / Vernooij, G. J. / Bloothooft, G.:
"A hierarchical broad phonetic classification scheme",
1269-1272.
Generation of Prosody
Hirschberg, Julia:
"Using text analysis to predict intonational boundaries",
1275-1278.
Horne, Merle:
"Why do speakers accent 'given' information?",
1279-1282.
Vonwiller, Julie P. / King, R. W. / Lloyd, R. W. T.:
"Automatic prosody assignment for interactive synthesized dialogue systems",
1283-1286.
Youd, Nick / House, Jill:
"Generating intonation in a voice dialogue system",
1287-1290.
Delmonte, Rodolfo / Dolci, Roberto:
"Computing linguistic knowledge for text-to-speech systems with PROSO",
1291-1294.
Speech Processing and Analysis
Acker, C. / Vary, Peter / Ostendarp, H.:
"Acoustic echo cancellation using prediction residual signals",
1297-1300.
Dabis, H. S. / Wrench, Alan A.:
"An evaluation of adaptive noise cancelling for speech recognition",
1301-1304.
Mumolo, Enzo / Riccio, Antonello / Abbattista, Giuseppe:
"An efficient algorithm for real-time voiced/unvoiced decision",
1305-1308.
Aarset, Tim / Gold, Ben:
"Models of pitch perception",
1309-1312.
Corney, P. / Mason, J. S.:
"A new perspective on LPC excitation using singular value decomposition",
1315-1318.
Verhelst, Werner / Borger, Marcel:
"Intra-speaker transplantation of speech characteristics: an application of waveform vocoding techniques and DTW",
1319-1322.
Leung, S. H. / Wong, O. Y. / Lai, K. L.:
"Decomposition of the LPC excitation using wavelet functions",
1327-1330.
Ambikairajah, Eliathamby / Kilmartin, Liam:
"An adaptive cochlear model for speech recognition",
1331-1334.
Jacovitti, Gianni / Pierucci, Piero / Falaschi, Alessandro:
"Speech segmentation and classification using higher order moments",
1335-1338.
Automatic Speech Recognition: Hardware and Noise Reduction
Ciaramella, A. / Clementino, D. / Pacifici, R.:
"A PC-housed speaker independent large vocabulary continuous telephonic speech recognizer",
1341-1344.
Aktas, Abdulmesih / Zünkler, Klaus:
"Speaker independent continuous HMM-based recognition of isolated words on a real-time multi-DSP system",
1345-1348.
Tsopanoglou, A. / Kyriakis-Bitzaros, E. D. / Mourjopoulos, J. / Kokkinakis, George K.:
"A real time speech decoder using instantaneous frequency and energy",
1349-1352.
Schultheiß, M. / Lacroix, A.:
"Fast hardware for efficient parallel processing of speech signals",
1353-1356.
Sedivy, Jan / Filcev, Jiri / Uhlir, Jan / Vanek, Tomas / Hanzl, Vaclav / Oliva, Zdenek / Kotek, Petr:
"The one chip speech recognition system",
1357-1361.
Villarrubia, L. / Poza, M. J. / Crespo, C.:
"Influence of the telephone line on automatic speech recognition",
1363-1366.
Hermansky, Hynek / Morgan, Nelson / Bayya, Aruna / Kohn, Phil:
"Compensation for the effect of the communication channel in auditory-like analysis of speech (RASTA-PLP)",
1367-1370.
Junqua, Jean-Claude / Reaves, Ben / Mak, Brian:
"A study of endpoint detection algorithms in adverse conditions: incidence on a DTW and HMM recognizer",
1371-1374.
Dvorak, Susanne / Hormann, Thomas:
"High-performance speech recognition in noise by continuously updated reference templates",
1375-1378.
Vicsi, Klara:
"Speech enhancement in the case of speech recognizers",
1379-1381.
Gomez-Mena, Juan / Santos-Suarez, J. / Garcia-Gomez, Ramón:
"A robust feature extraction method for automatic speech recognition in noisy environments",
1383-1386.
Sub-Word Units for Automatic Speech Recognition
Fissore, Lorenzo / Giachin, Egidio P. / Laface, P. / Micca, G.:
"Selection of speech units for a speaker-independent CSR task",
1389-1392.
Giachin, Egidio P. / Lee, Chin-Hui / Rabiner, Lawrence R. / Rosenberg, Aaron E. / Pieraccini, Roberto:
"Word juncture modeling using inter-word context-dependent phone-like units",
1393-1396.
Nagai, Akito / Sagayama, Shigeki / Kita, Kenji:
"Phoneme-context-dependent LR parsing algorithms for HMM-based continuous speech recognition",
1397-1400.
Drexler, H. / Roddeman, R. / Boves, Louis / Strik, Helmer:
"Optimizing lexical fast search in a large vocabulary isolated word speech recognition system",
1401-1404.
Auditory Modelling
Fjällbrant, Tore / Mekuria, Fisseha:
"Signal processing using an auditory filter bank with side-lobes and phase-jumps",
1429-1431.
Dijk, J. S. C. van:
"Notes on auditive coding of sophisticated signals",
1433-1436.
Beham, Manfred:
"An auditorily based spectral transformation of speech signals",
1437-1440.
Morris, Andrew C. / Escudier, Pierre / Schwartz, Jean-Luc:
"On and off units detect information bottle-necks for speech recognition",
1441-1444.
Pozas-Alvarez, Jose A.:
"A new logic operator-based auditory system model",
1445-1448.
Speech Interfaces: Dialogue and Human Factors
Peckham, Jeremy:
"Speech understanding and dialogue over the telephone: an overview of progress in the SUNDIAL project",
1469-1472.
Tubach, Jean-Pierre / Doignon, P.:
"A system for natural spoken language queries: design, implementation and assessment",
1473-1476.
Deville, G. / Mousel, P.:
"Operational validation of syntactic-semantic models in a spoken man-machine dialogue system",
1477-1480.
Gaiffe, B. / Romary, L. / Pierrel, Jean-Marie:
"References in a multimodal dialogue: towards a unified processing",
1481-1485.
Lefebvre, P. / Duncan, G. / Poirier, F.:
"The user-UNIX dialogue: a novel integrated approach to enhancing the operating system interface",
1487-1490.
Arndt, Bodo:
"Adoption of verbal and visual dialogue behaviour in document handling systems",
1491-1494.
Smeele, P. M. T. / Sittig, A. C.:
"The contribution of vision to speech perception",
1495-1497.
Lickley, R. J. / Shillcock, R. C. / Bard, E. G.:
"Processing disfluent speech: how and when are disfluencies found?",
1499-1502.
Choiniere, A. / Robert, J.-M. / Descout, Raymond:
"Building a user interface for a speech recognition-based telephone application system",
1503-1506.
Murray, A. C. / Frankish, C. R. / Jones, D. M.:
"System design and human factors in auditory interfaces",
1507-1510.