Keynotes
Rossi, Mario:
"Is syntactic structure prosodically recoverable?",
KN01-KN08.
Zue, Victor W.:
"Conversational interfaces: advances and challenges",
KN09-KN18.
Santen, Jan P. H. van:
"Prosodic modelling in text-to-speech synthesis",
KN19-KN28.
Junqua, Jean-Claude:
"Impact of the unknown communication channel on automatic speech recognition: a review KN-29",
KN29-KN32.
Bellegarda, Jerome:
"Statistical techniques for robust ASR: review and perspectives",
KN33-KN36.
Lippmann, Richard / Carlson, Beth A.:
"Using missing feature theory to actively select features for robust speech recognition with interruptions, filtering and noise KN-37",
KN37-KN40.
Acoustic Modelling
Dupont, Stéphane / Bourlard, Hervé:
"Using multiple time scales in a multi-stream speech recognition system",
3-6.
Wakita, Yumi / Singer, Harald / Sagisaka, Yoshinori:
"Speech recognition using HMM-state confusion characteristics",
7-10.
Chesta, Cristina / Laface, Pietro / Ravera, Franco:
"Bottom-up and top-down state clustering for robust acoustic modeling",
11-14.
Schlüter, Ralf / Macherey, W. / Kanthak, S. / Ney, Hermann / Welling, Lutz:
"Comparison of optimization methods for discriminative training criteria",
15-18.
Lee, Clark Z. / O'Shaughnessy, Douglas:
"Clustering beyond phoneme contexts for speech recognition",
19-22.
Chengalvarayan, Rathinavelu:
"Influence of outliers in training the parametric trajectory models for speech recognition",
23-26.
Holter, Trym / Svendsen, Torbjorn:
"Incorporating linguistic knowledge and automatic baseform generation in acoustic subword unit based speech recognition",
1159-1162.
Beyerlein, Peter / Ullrich, Meinhard / Wilcox, Patricia:
"Modelling and decoding of crossword context dependent phones in the Philips large vocabulary continuous speech recognition system",
1163-1166.
Hanna, Philip / Ming, Ji / O'Boyle, Peter / Smith, F. Jack:
"Modelling inter-frame dependence with preceeding and succeeding frames",
1167-1170.
Jones, Rhys James / Downey, Simon / Mason, John S.:
"Continuous speech recognition using syllables",
1171-1174.
Willett, Daniel / Rigoll, Gerhard:
"A new approach to generalized mixture tying for continuous HMM-based speech recognition",
1175-1178.
Beulen, Klaus / Bransch, Elmar / Ney, Hermann:
"State tying for context dependent phoneme models",
1179-1182.
Duchateau, Jacques / Demuynck, Kris / Compernolle, Dirk Van:
"A novel node splitting criterion in decision tree construction for semi-continuous HMMs",
1183-1186.
Blomberg, Mats:
"Creating unseen triphones by phone concatenation in the spectral, cepstral and formant domains",
1187-1190.
Pfau, Thilo / Beham, Manfred / Reichl, W. / Ruske, Günther:
"Creating large subword units for speech recognition",
1191-1194.
Goldberger, Jacob / Burshtein, David / Franco, Horacio:
"Segmental modeling using a continuous mixture of non-parametric models",
1195-1198.
Chang, Jane W. / Glass, James R.:
"Segmentation and modeling in segment-based recognition",
1199-1202.
Hauenstein, Alfred:
"Using syllables in a hybrid HMM-ANN recognition system",
1203-1206.
Hariharan, Ramalingam / Hakkinen, Juha / Laurila, Kari / Suontausta, Janne:
"Noise robust segment-based word recognition using vector quantisation",
1207-1210.
Rodriguez, Luis Javier / Torres, Ines M.:
"Viterbi based splitting of phoneme HMM's",
1211-1214.
Marino, José B. / Nogueiras, Albin / Bonafonte, Antonio:
"The demiphone: an efficient subword unit for continuous speech recognition",
1215-1218.
Kojima, Hiroaki / Tanaka, Kazuyo:
"Organizing phone models based on piecewise linear segment lattices of speech samples",
1219-1222.
Rogina, Ivica:
"Automatic architecture design by likelihood-based context clustering with crossvalidation",
1223-1226.
Roweis, Sam / Alwan, Abeer:
"Towards articulatory speech recognition: learning smooth maps to recover articulator information",
1227-1230.
Tsopanoglou, Anastasios / Fakotakis, Nikos:
"Selection of the most effective set of subword units for an HMM-based speech recognition system",
1231-1234.
Cerisara, Christophe / Haton, Jean-Paul / Mari, Jean-Francois / Fohr, Dominique:
"Multi-band continuous speech recognition",
1235-1238.
Bitar, Nabil N. / Espy-Wilson, Carol Y.:
"The design of acoustic parameters for speaker-independent speech recognition",
1239-1242.
Dynamic Articulatory Measurements
Candille, Laurence / Méloni, Henri:
"Adaptation of natural articulatory movements to the control of the command parameters of a production model",
27-30.
Stone, Maureen / Lundberg, Andrew / Davis, Edward / Gullapalli, Rao / NessAiver, Moriel:
"Three-dimensional coarticulatory strategies of tongue movement",
31-34.
Parlangeau, Nathalie / Andre-Obrecht, Regine:
"From laryngographic and acoustic signals to voicing gestures",
35-38.
Vilkman, Erkki / Takalo, Raija / Maatta, Taisto / Laukkanen, Anne-Maria / Nummenranta, Jaana / Lipponen, Tero:
"Ultrasonographic measurement of cricothyroid space in speech",
39-42.
Demolin, Didier / George, M. / Lecuit, V. / Metens, T. / Soquet, A. / Raeymaekers, H.:
"Coarticulation and articulatory compensations studied by dynamic MRI",
43-46.
Badin, Pierre / Baricchi, Enrico / Vilain, Anne:
"Determining tongue articulation: from discrete fleshpoints to continuous shadow",
47-50.
Language Identification
Zissman, Marc A.:
"Predicting, diagnosing and improving automatic language identification performance",
51-54.
Corredor-Ardoy, Cristobal / Gauvain, Jean Luc / Adda-Decker, Martine / Lamel, Lori:
"Language identification with language-independent acoustic models",
55-58.
Parris, Eluned S. / Lloyd-Thomas, Harvey / Carey, Michael J. / Wright, Jerry H.:
"Bayesian methods for language verification",
59-62.
Kwan, HingKeung / Hirose, Keikichi:
"Use of recurrent network for unknown language rejection in language identification system",
63-66.
Andersen, Ove / Dalsgaard, Paul:
"Language-identification based on cross-language acoustic models and optimised information combination",
67-70.
Navratil, Jiri / Zühlke, Werner:
"Phonetic-context mapping in language identification",
71-74.
Neural Networks for Speech and Language Processing
Rahim, Mazin / Bengio, Yoshua / LeCun, Yann:
"Discriminative feature and model design for automatic speech recognition",
75-78.
Rottland, Jörg / Neukirchen, Christoph / Willett, Daniel / Rigoll, Gerhard:
"Large vocabulary speech recognition with context dependent MMI-connectionist / HMM systems using the WSJ database",
79-82.
Moudenc, Thierry / Mercier, Guy:
"Automatic selection of segmental acoustic parameters by means of neural-fuzzy networks for reordering the n-best HMM hypotheses",
83-86.
Kurimo, Mikko:
"Comparison results for segmental training algorithms for mixture density HMMs",
87-90.
Castano, Asuncion / Casacuberta, Francisco:
"A connectionist approach to machine translation",
91-94.
Pican, Nicolas / Mari, Jean-Francois / Fohr, Dominique:
"Continuous speech recognition using a context sensitive ANN and HMM2s",
95-98.
Training Techniques; Efficient Decoding in ASR
Shinoda, Koichi / Watanabe, Takao:
"Acoustic modeling based on the MDL principle for speech recognition",
99-102.
Modi, Piyush / Rahim, Mazin:
"Discriminative utterance verification using multiple confidence measures",
103-106.
Bocchieri, Enrico / Mak, Brian:
"Subspace distribution clustering for continuous observation density hidden Markov models",
107-110.
Nock, H. J. / Gales, M. J. F. / Young, Steve J.:
"A comparative study of methods for phonetic decision-tree state clustering",
111-114.
Kaltenmeier, Alfred / Franke, Jürgen:
"Comparing Gaussian and polynomial classification in SCHMM-based recognition systems",
115-118.
Girardi, Alexandre / Singer, Harald / Shikano, Kiyohiro / Nakamura, Satoshi:
"Maximum likelihood successive state splitting algorithm for tied-mixture HMNET",
119-122.
McDermott, Erik / Katagiri, Shigeru:
"String-level MCE for continuous phoneme recognition",
123-126.
Rivlin, Ze'ev / Sankar, Ananth / Bratt, Harry:
"HMM state clustering across allophone class boundaries",
127-130.
Mohri, Mehryar / Riley, Michael:
"Weighted determinization and minimization for large vocabulary speech recognition",
131-134.
Phillips, Steven / Rogers, Anne:
"Parallel speech recognition",
135-138.
Ortmanns, Stefan / Firzlaff, Thorsten / Ney, Hermann:
"Fast likelihood computation methods for continuous mixture densities in large vocabulary speech recognition",
139-142.
Demuynck, Kris / Duchateau, Jacques / Compernolle, Dirk van:
"A static lexicon network representation for cross-word context dependent phones",
143-146.
Padmanabhan, Mukund / Bahl, L. R. / Nahamoo, D. / Souza, Pieter de:
"Decision-tree based quantization of the feature space of a speech recognizer",
147-150.
Ravishankar, Mosur / Bisiani, R. / Thayer, E.:
"Sub-vector clustering to improve memory and speed performance of acoustic likelihood computation",
151-154.
Hovell, Simon:
"The incorporation of path merging in a dynamic network recogniser",
155-158.
Novak, Miroslav:
"Improvement on connected digits recognition using duration constraints in the asynchronous decoding scheme",
159-162.
Stolcke, Andreas / Konig, Yochai / Weintraub, Mitchel:
"Explicit word error minimization in n-best list rescoring",
163-166.
Nguyen, Long / Schwartz, Richard:
"Efficient 2-pass n-best decoder",
167-170.
Iwasaki, Tomohiro / Abe, Yoshiharu:
"A memory management method for a large word network",
171-174.
Prosody
Romano, Antonio:
"Persistence of prosodic features between dialectal and standard Italian utterances in six sub-varieties of a region of southern Italy (salento): first assessments of the results of a recognition test and an instrumental analysis",
175-178.
Vereecken, Halewijn / Vorstermans, Annemie / Martens, Jean-Pierre / Coile, Bert van:
"Improving the phonetic annotation by means of prosodic phrasing",
179-182.
Ode, Cecilia:
"A descriptive study of prosodic phenomena in Mpur (west Papuan Phylum)",
183-186.
Mixdorff, Hansjörg / Fujisaki, Hiroya:
"Automated quantitative analysis of F0 contours of utterances from a German ToBI-labeled speech database",
187-190.
Tournemire, Stéphanie de:
"Identification and automatic generation of prosodic contours for a text-to-speech synthesis system in French",
191-194.
Ni, Jin-Fu / Wang, Ren-Hua / Hirose, Keikichi:
"Quantitative analysis and formulation of tone concatenation in Chinese F0 contours",
195-198.
Brindöpke, Christel / Pahde, Arno / Kummert, Franz / Sagerer, Gerhard:
"An environment for the labelling and testing of melodic aspects of speech",
199-202.
Casacuberta, David / Aguilar, Lourdes / Marin, Rafael:
"PROPAUSE: a syntactico-prosodic system designed to assign pauses",
203-206.
Warnke, Volker / Kompe, Ralf / Niemann, Heinrich / Nöth, Elmar:
"Integrated dialog act segmentation and classification using prosodic features and language models",
207-210.
Donzel, Monique E. van / Koopmans-van Beinum, Florien J.:
"Evaluation of prosodic characteristics in retold stories in Dutch by means of semantic scales",
211-214.
Bruce, Gosta / Filipsson, Marcus / Frid, Johan / Granström, Björn / Gustafson, Kjell / Horne, Merle / House, David:
"Text-to-intonation in spontaneous Swedish",
215-218.
Morlec, Yann / Bailly, Gérard / Auberge, Véronique:
"Synthesising attitudes with global rhythmic and intonation contours",
219-222.
Gibbon, Dafydd / Sassen, Claudia:
"Prosody-particle pairs as discourse control signs",
223-226.
Elsner, Anja:
"Focus detection with additional information of phrase boundaries and sentence mode",
227-230.
Bosch, Laura / Sebastian-Galles, Nuria:
"The role of prosody in infants' native-language discrimination abilities: the case of two phonologically close languages",
231-234.
Buder, Eugene H. / Eriksson, Anders:
"Prosodic cycles and interpersonal synchrony in American English and Swedish",
235-238.
Strangert, Eva:
"Relating prosody to syntax: boundary signalling in Swedish",
239-242.
Nakai, Mitsuru / Shimodaira, Hiroshi:
"On representation of fundamental frequency of speech for prosody analysis using reliability function",
243-246.
Kim, Seong-Hwan / Kim, Jin-Young:
"Efficient method of establishing words tone dictionary for Korean TTS system",
247-250.
D'Imperio, Mariapaola / House, David:
"Perception of questions and statements in Neapolitan Italian",
251-254.
Keyword and Topic Spotting
Lin, Qiguang / Lubensky, Dave / Picheny, Michael / Rao, P. Srinivasa:
"Key-phrase spotting using an integrated language model of n-grams and finite-state grammar",
255-258.
Junkawitsch, Jochen / Ruske, Gunther / Höge, Harald:
"Efficient methods for detecting keywords in continuous speech",
259-262.
Lau, Raymond / Seneff, Stephanie:
"Providing sublexical constraints for word spotting within the ANGIE framework",
263-266.
Bartkova, Katarina / Jouvet, Denis:
"Usefulness of phonetic parameters in a rejection procedure of an HMM-based speech recognition system",
267-270.
Yamashita, Yoichi / Mizoguchi, Riichiro:
"Keyword spotting using F0 contour matching",
271-274.
Nöth, Elmar / Harbeck, Stefan / Niemann, Heinrich / Warnke, Volker:
"A frame and segment based approach for topic spotting",
275-278.
Robustness in Recognition and Signal Processing
Paliwal, Kuldip K. / Sagisaka, Yoshinori:
"Cyclic autocorrelation-based linear prediction analysis of speech",
279-282.
Zeljkovic, Ilija / Narayanan, Shrikanth:
"Novel filler acoustic models for connected digit recognition",
283-286.
Shozakai, Makoto / Nakamura, Satoshi / Shikano, Kiyohiro:
"A non-iterative model-adaptive e-CMN/PMC approach for speech recognition in car environments",
287-290.
Torre, Angel de la / Peinado, Antonio M. / Rubio, Antonio J. / Garcia, Pedro:
"Discriminative feature extraction for speech recognition in noise",
291-294.
Brendborg, Michael K. / Lindberg, Borge:
"Noise robust recognition using feature selective modeling",
295-298.
Abrash, Victor:
"Mixture input transformations for adaptation of hybrid connectionist speech recognizers",
299-302.
Hwang, Tai-Hwei / Lee, Lee-Min / Wang, Hsiao-Chuan:
"Adaptation of time differentiated cepstrum for noisy speech recognition",
1075-1078.
Kanedera, Noboru / Arai, Takayuki / Hermansky, Hynek / Pavel, Misha:
"On the importance of various modulation frequencies for speech recognition",
1079-1082.
Hong, Wei-Tyng / Chen, Sin-Horng:
"A robust RNN-based pre-classification for noisy Mandarin speech recognition",
1083-1086.
Rahim, Mazin:
"A parallel environment model (PEM) for speech recognition and adaptation",
1087-1090.
Schless, Volker / Class, Fritz:
"Adaptive model combination for robust speech recognition in car environments",
1091-1094.
Gerven, Stefaan Van / Xie, Fei:
"A comparative study of speech detection methods",
1095-1098.
Doukas, Nikos / Naylor, Patrick / Stathaki, Tania:
"Voice activity detection using source separation techniques",
1099-1102.
Taniguchi, Tomohiko / Kajita, Shoji / Takeda, Kazuya / Itakura, Fumitada:
"Voice activity detection using source separation techniques",
1103-1106.
Avendano, Carlos / Tibrewala, Sangita / Hermansky, Hynek:
"Multiresolution channel normalization for ASR in reverberant environments",
1107-1110.
Martinez, Rafael / Alvarez, Agustin / Gomez, Vilda Pedro / Perez, Mercedes / Nieto, Victor / Rodellar, Victoria:
"A speech pre-processing technique for end-point detection in highly non-stationary environments",
1111-1114.
Docio-Fernandez, Laura / Garcia-Mateo, Carmen:
"Application of several channel and noise compensation techiques for robust speaker recognition",
1115-1118.
Agaiby, Hany / Moir, Thomas J.:
"Knowing the wheat from the weeds in noisy speech",
1119-1122.
Kim, Do Yeong / Kim, Nam Soo / Un, Chong Kwan:
"Model-based approach for robust speech recognition in noisy environements with multiple noise sources",
1123-1126.
Chu, Y.C. / Jie, Charlie / Tung, Vincent / Lin, Ben / Lee, Richard:
"Normalization of speaker variability by spectrum warping for robust speech recognition",
1127-1130.
Maes, Stephane H.:
"LPC poles tracker for music/speech/noise segmentation and music cancellation",
1131-1134.
Kim, Doh-Suk / , Jae-Hoon Jeong(1) / Le, Soo-Young / Kil, Rhee M.:
"Comparative evaluations of several front-ends for robust speech recognition",
1135-1138.
Gouvea, Evandro B. / Stern, Richard M.:
"Speaker normalization through formant-based warping of the frequency scale",
1139-1142.
Westphal, Martin:
"The use of cepstral means in conversational speech recognition",
1143-1146.
Huerta, Juan M. / Stern, Richard M.:
"Compensation for environmental and speaker variability by normalization of pole locations",
1147-1150.
Puel, Jean-Baptiste / André-Obrecht, Régine:
"Cellular phone speech recognition: noise compensation vs. robust architectures",
1151-1154.
Chiang, Tung-Hui:
"Speech recognition in noise using on-line HMM adaptation",
1155-1158.
Modelling of Prosody
Malliopoulos, Christos / Mikros, George:
"Metrical representations of demarcation and constituency in noun phrases",
303-306.
Pirker, Hannes / Alter, Kai / Rank, Erhard / Matiasek, John / Trost, Harald / Kubin, Gernot:
"A system of stylized intonation contours in German",
307-310.
Hirose, Keikichi / Iwano, Kouji:
"A method of representing fundamental frequency contours of Japanese using statistical models of moraic transition",
311-314.
Fotinea, Evita F. / Vlahakis, Michael A. / Carayannis, George V.:
"Modeling arbitrarily long sentence-Spanning F0 contours by parametric concatenation of word-Spanning patterns",
315-318.
Son, Rob J. J. H. van / Santen, Jan P. H. van:
"Strong interaction between factors influencing consonant duration",
319-322.
Gros, Jerneja / Pavesic, Nikola / Mihelic, France:
"Speech timing in Slovenian TTS",
323-326.
Microphone Arrays for Speech Enhancement
Dorbecker, Matthias:
"Small microphone arrays with optimized directivity for speech enhancement",
327-330.
Inoue, Masaaki / Nakamura, Satoshi / Yamada, Takeshi / Shikano, Kiyohiro:
"Microphone array design measures for hands-free speech recognition",
331-334.
Akagi, Masato / Mizumachi, Mitsunori:
"Noise reduction by paired microphones",
335-338.
Mahmoudi, Djamila:
"A microphone array for speech enhancement using multiresolution wavelet transform",
339-342.
Nagata, Yoshifumi / Tsuboi, Hiroyuki:
"A two-channel adaptive microphone array with target tracking",
343-346.
Giuliani, Diego / Matassoni, Marco / Omologo, Maurizio / Svaizer, Piergiorgio:
"Use of different microphone array configurations for hands-free speech recognition in noisy and reverberant environment",
347-350.
Multilingual Recognition
Wang, Chao / Glass, James / Meng, Helen / Polifroni, Joe / Seneff, Stephanie / Zue, Victor W.:
"YINHE: a Mandarin Chinese version of the GALAXY system",
351-354.
Bonaventura, Patrizia / Gallocchio, Filippo / Micca, Giorgio:
"Multilingual speech recognition for flexible vocabularies",
355-358.
Weng, Fuliang / Bratt, Harry / Neumeyer, Leonardo / Stolcke, Andreas:
"A study of multilingual speech recognition",
359-362.
Billa, Jayadev / Ma, Kristine / McDonough, John W. / Zavaliagkos, George / Miller, David R. / Ross, Kenneth N. / El-Jaroudi, Amro:
"Multilingual speech recognition: the 1996 byblos callhome system",
363-366.
Schultz, Tanja / Koll, Detlef / Waibel, Alex:
"Japanese LVCSR on the spontaneous scheduling task with JANUS-3",
367-370.
Schultz, Tanja / Waibel, Alex:
"Fast bootstrapping of LVCSR systems with multilingual phoneme sets",
371-374.
Language Specific Speech Analysis
Pompino-Marschall, Bernd / Mooshammer, Christine:
"Factors of variation in the production of the German dorsal fricative",
375-378.
Thomas, Kimberly:
"EPG and aerodynamic evidence for the coproduction and coarticulation of clicks in Isizulu",
379-382.
Geumann, Anja:
"Formant trajectory dynamics in Swabian diphthongs",
383-386.
Wood, Sidney A. J.:
"The gestural organization of vowels and consonants: a cinefluorographic study of articulator gestures in Greenlandic",
387-388.
Anderson, Victoria B.:
"The perception of coronals in Western Arrernte",
389-392.
Espy-Wilson, Carol Y. / Narayanan, Shrikanth / Boyce, Suzanne E. / Alwan, Abeer:
"Acoustic modelling of American English /r/",
393-396.
Feature Estimation, Pitch, and Prosody
Hansen, Anya Varnich:
"Acoustic parameters optimised for recognition of phonetic features",
397-400.
Halberstadt, Andrew K. / Glass, James R.:
"Heterogeneous acoustic measurements for phonetic classification 1",
401-404.
Milner, Ben:
"Cepstral-time matrices and LDA for improved connected digit and sub-word recognition accuracy",
405-408.
Vuuren, Sarel van / Hermansky, Hynek:
"Data-driven design of RASTA-like filters",
409-412.
Nicholson, Simon / Milner, Ben / Cox, Stephen:
"Evaluating feature set performance using the f-ratio and j-measures",
413-416.
Hernando, Javier / Nadeu, Climent:
"Robust speech parameters located in the frequency domain",
417-420.
Gaillard, Francois / Berthommier, Frederic / Feng, Gang / Schwartz, Jean-Luc:
"A modified zero-crossing method for pitch detection in presence of interfering sources",
445-448.
Simonin, Jacques / Mokbel, Chafic:
"Using simulated annealing expectation maximization algorithm for hidden Markov model parameters estimation",
449-452.
Fant, Gunnar / Hertegard, Stellan / Kruckenberg, Anita / Liljencrants, Johan:
"Covariation of subglottal pressure, F0 and glottal parameters",
453-456.
Delopoulos, Anastasios / Rangoussi, Maria:
"The fractal behaviour of unvoiced plosives: a means for classification",
457-460.
Ohno, Sumio / Fujisaki, Hiroya / Taguchi, Hideyuki:
"A method for analysis of the local speech rate using an inventory of reference units",
461-464.
Fujisaki, Hiroya / Ohno, Sumio / Yagi, Takashi:
"Analysis and modeling of fundamental frequency contours of Greek utterances",
465-468.
Martinez, Fernando / Tapias, Daniel / Alvarez, Jorge / Leon, Paloma:
"Characteristics of slow, average and fast speech and their effects in large vocabulary continuous speech recognition",
469-472.
Lee, Sungbok / Potamianos, Alexandros / Narayanan, Shrikanth:
"Analysis of children's speech: duration, pitch and formants",
473-476.
Traunmüller, Hartmut / Eriksson, Anders:
"A method of measuring formant frequencies at high fundamental frequencies",
477-480.
Brondsted, Tom / Madsen, Jens Printz:
"Analysis of speaking rate variations in stress-timed languages",
481-484.
Micallef, Paul / Chilton, Ted:
"Automatic identification of phoneme boundaries using a mixed parameter model",
485-488.
Koval, Serguei / Bekasova, Veronika / Khitrov, Michael / Raev, Andrey:
"Pitch detection reliability assessment for forensic applications",
489-492.
Hu, Zhihong / Barnard, Etienne:
"Efficient estimation of perceptual features for speech recognition",
493-496.
Malayath, Narendranath / Hermansky, Hynek / Kain, Alexander:
"Towards decomposing the sources of variability in speech",
497-500.
Chengalvarayan, Rathinavelu:
"Use of vector-valued dynamic weighting coefficients for speech recognition: maximum likelihood approach",
501-504.
Beet, S. W. / Baghai-Ravary, L.:
"Automatic segmentation: data-driven units of speech",
505-508.
Bajic, Dejan:
"On robust time-varying AR speech analysis based on t-distribution",
509-512.
Tambakas, Dimitris / Tzima, Iliana / Fakotakis, Nikos / Kokkinakis, George:
"A simple phoneme energy model for the Greek language and its application to speech recognition",
513-516.
Noad, James E. H. / Whiteside, Sandra P. / Green, Phil:
"A macroscopic analysis of an emotional speech corpus",
517-520.
Shimodaira, Hiroshi / Nakai, Mitsuru / Kumata, Akihiro:
"Restoration of pitch pattern of speech based on a pitch generation model",
521-524.
Agranovski, A. V. / Berg, O. Y. / Lednov, D. A.:
"The research of correlation between pitch and skin galvanic reaction at change of human emotional state",
525-528.
Montacié, Claude / Caraty, Marie-José / Lefèvre, Fabrice:
"K-NN versus Gaussian in HMM-based recognition system",
529-532.
Doval, Boris / d'Alessandro, Christophe / Diard, Benoit:
"Spectral methods for voice source parameters estimation",
533-536.
Speech Coding
Vrecken, Olivier van der / Pierret, Nicolas / Dutoit, Thierry / Pagel, Vincent / Malfrere, Fabrice:
"A simple and efficient algorithm for the compression of MBROLA segment databases",
421-424.
Zolfaghari, Parham / Robinson, Tony:
"A segmental formant vocoder based on linearly varying mixture of Gaussians",
425-428.
Chennoukh, Samir / Sinder, Daniel / Richard, Gael / Flanagan, James L.:
"Voice mimic system using an articulatory codebook for estimation of vocal tract shape",
429-432.
Mudugamuwa, Damith J. / Bradley, Alan B.:
"Adaptive transform coding for linear predictive residual",
433-436.
Takahashi, Akira / Kitawaki, Nobuhiko / Usai, Paolino / Atkinson, David:
"Performance evaluation of objective quality measures for coded speech",
437-440.
Ismail, Mohamed / Ponting, Keith:
"Between recognition and synthesis - 300 bits/second speech coding",
441-444.
Villette, Stephane / Stefanovic, Milos / Atkinson, Ian / Kondoz, Ahmet:
"High quality split-band LPC vocoder and its fixed point real time implementation",
1243-1246.
Chang, Wen-Whei / Chang, Hwai-Tsu / Meng, Wan-Yu:
"Missing packet recovery techniques for DM coded speech",
1247-1250.
Vu, Hai Le / Lois, Laszlo:
"Spectral sensitivity of LSP parameters and their transformed coefficients",
1251-1254.
Ramasubramanian, V. / Paliwal, Kuldip K.:
"Reducing the complexity of the LPC vector quantizer using the k-d tree search algorithm",
1255-1258.
Lemma, Aweke N. / Kleijn, W. Bastiaan / Deprettere, Ed F.:
"Quantization using wavelet based temporal decomposition of the LSF",
1259-1262.
Xydeas, Costas S. / Ilk, Gokhan H.:
"A novel 1.7/2.4 kb/s DCT based prototype interpolation speech coding system",
1263-1266.
Choi, Yong-Soo / Kang, Hong-Goo / Park, Sang-Wook / Yoo, Jae-Ha / Youn, Dae-Hee:
"Improved regular pulse VSELP coding of speech at low bit-rates",
1267-1270.
Cho, Yong Duk / Kim, Hong Kook / Kim, Moo Young / Kim, Sang Ryong:
"Joint estimation of pitch, band magnitudes, and v\UV decisions for MBE vocoder",
1271-1274.
Kovesi, Balazs / Saoudi, Samir / Boucher, Jean Marc / Horvath, Gábor:
"A new distance measure in LPC coding: application for real time situations",
1275-1278.
Vepyek, Peter / Bradley, Alan B.:
"Consideration of processing strategies for very-low-rate compression of wideband speech signals with known text transcription",
1279-1282.
Görtz, Norbert:
"Zero-redundancy error protection for CELP speech codecs",
1283-1286.
Matmti, Ridha / Jelinek, Milan / Adoul, Jean-Pierre:
"Low bit rate speech coding using an improved HSX model",
1287-1290.
Ribeiro, Carlos M. / Trancoso, Isabel:
"Phonetic vocoding with speaker adaptation",
1291-1294.
Baudoin, Geneviève / Cernocky, Jan / Chollet, Gérard:
"Quantization of spectral sequences using variable length spectral segments for speech coding at very low bit rate",
1295-1298.
Ghaemmaghami, Shahrokh / Deriche, Mohamed / Boashash, Boualem:
"On modeling event functions in temporal decomposition based speech coding",
1299-1302.
Torres, Soledad / Casajús-Quirós, F. Javier:
"Phase quantization by pitch-cycle waveform coding in low bit rate sinusoidal coders",
1303-1306.
Botinis, Antonis / Fourakis, Marios / Hawks, John W.:
"A perceptual study of the greek vowel space using synthetic stimuli",
1307-1310.
Han, Woo-Jin / Kim, Sung-Joo / Oh, Yung-Hwan:
"Mixed multi-band excitation coder using frequency domain mixture function (FDMF) for a low-bit rate speech coding",
1311-1314.
Fingscheidt, Tim / Scheufen, Olaf:
"Robust GSM speech decoding using the channel decoder's soft output",
1315-1318.
Seymour, Carl W. / Robinson, Tony A.:
"A low-bit-rate speech coder using adaptive line spectral frequency prediction 1319",
1319-1322.
Speech Synthesis Techniques
Ding, Wen / Campbell, Nick:
"Optimising unit selection with voice source and formants in the CHATR speech synthesis system",
537-540.
Abe, Masanobu / Mizuno, Hideyuki / Takahashi, Satoshi / Nakajima, Shin'ya:
"A new framework to provide high-controllability speech signal and the development of a workbench for it",
541-544.
Banga, Eduardo R. / Garcia-Mateo, Carmen / Fernandez-Salgado, Xavier:
"Shape-invariant prosodic modification algorithm for concatenative text-to-speech synthesis",
545-548.
Hwang, Shaw-Hwa / Chen, Sin-Horng / Chang, Saga:
"An RNN-based spectral information generation for Mandarin text-to-speech",
549-552.
Santen, Jan P. H. van / Buchsbaum, Adam L.:
"Methods for optimal text selection",
553-556.
Gimenez de los Galanes, Francisco M. / Talkin, David:
"High resolution prosody modification for speech synthesis",
557-560.
Karaali, Orhan / Corrigan, Gerald / Gerson, Ira / Massey, Noel:
"Text-to-speech conversion with neural networks: a recurrent TDNN approach",
561-564.
Högberg, Jesper:
"Data driven formant synthesis",
565-568.
King, Simon / Portele, Thomas / Höfer, Florian:
"Speech synthesis using non-uniform units in the Verbmobil project",
569-572.
Trancoso, Isabel / Vianna, M. Ceu:
"On the pronunciation mode of acronyms in several European languages",
573-576.
Rietveld, Toni / Kerkhoff, Joop / Emons, M. J. W. M. / Meijer, E.J. / Sanderman, Angelien A. / Sluijter, Agaath M. C.:
"Evaluation of speech synthesis systems for Dutch in tele-communication applications in GSM and PSTN networks",
577-580.
Angelini, Bianca / Barolo, Claudia / Falavigna, Daniele / Omologo, Maurizio / Sandri, Stefano:
"Automatic diphone extraction for an Italian text-to-speech synthesis system",
581-584.
Keller, Eric:
"Simplification of TTS architecture vs. operational quality",
585-588.
Fries, Georg / Wirth, Antje:
"Felix - a TTS system with improved pre-processing and source signal generation",
589-592.
Edgington, Mike:
"Investigating the limitations of concatenative synthesis",
593-596.
Teixeira de Jesus, Luis Miguel / Cawley, Gavin C.:
"Speech coding and synthesis using parametric curves",
597-600.
Black, Alan W. / Taylor, Paul:
"Automatically clustering similar units for unit selection in speech synthesis",
601-604.
Jiang, Li / Hon, Hsiao-Wuen / Huang, Xuedong:
"Improvements on a trainable letter-to-sound converter",
605-608.
Bae, Myungjin / Kim, Kyuhong / Lee, Woncheol:
"On a cepstral pitch alteration technique for prosody control in the speech synthesis system with high quality",
609-612.
Stylianou, Yannis / Dutoit, Thierry / Schroeter, Juergen:
"Diphone concatenation using a harmonic plus noise model of speech",
613-616.
Technology for S&L Acquisition, Speech Processing Tools
Sabah, Gérard:
"The "sketchboard": a dynamic interpretative memory and its use for spoken language understanding",
617-620.
Zhou, Qiru / Lee, Chin-Hui / Chou, Wu / Pargellis, Andrew:
"Speech technology integration and research platform: a system study",
621-624.
Geller, Dieter / Lieb, Markus / Budde, Wolfgang / Muelhens, Oliver / Zinke, Manfred:
"Speech recognition on SPHERIC - an IC for command and control applications",
625-628.
McCandless, Michael K. / Glass, James R.:
"MUSE: a scripting language for the development of interactive speech analysis and recognition tools",
629-632.
Witt, Silke / Young, Steve J.:
"Language learning based on non-native speech recognition",
633-636.
Kilian, Ute / Bader, Klaus:
"Task modelling by sentence templates",
637-640.
Kitaazawa, Shigeyoshi / Ichikawa, Hideya / Kobayashi, Satoshi / Nishinuma, Yukihiro:
"Extraction and representation rhythmic components of spontaneous speech",
641-644.
Kim, Yoon / Franco, Horacio / Neumeyer, Leonardo:
"Automatic pronunciation scoring of specific phone segments for language instruction",
645-648.
Ronen, Orith / Neumeyer, Leonardo / Franco, Horacio:
"Automatic detection of mispronunciation for language instruction",
649-652.
Alvarez, Agustin / Martinez, Rafael / Nieto, Victor / Rodellar, Victoria / Gomez, Pedro:
"Continuous formant-tracking applied to visual representations of the speech and speech recognition",
653-656.
Kawai, Goh / Hirose, Keikichi:
"A CALL system using speech recognition to train the pronunciation of Japanese long vowels, the mora nasal and mora obstruents",
657-660.
Nouza, Jan / Holada, Miroslav / Hajek, Daniel:
"An educational and experimental workbench for visual processing of speech data",
661-664.
Choi, Yong-Soo / Kang, Hong-Goo / Kim, Sung-Youn / Park, Young-Cheol / Youn, Dae-Hee:
"A 3 channel digital CVSD bit-rate conversion system using a general purpose DSP",
665-668.
Delmonte, Rodolfo / Petrea, Mirela / Bacalu, Ciprian:
"SLIM prosodic module for learning activities in a foreign language",
669-672.
Kaspar, Bernhard / Schuhmacher, Karlheinz / Feldes, Stefan:
"Barge-in revised",
673-676.
Akbar, Mohammad:
"Waveedit, an interactive speech processing environment for microsoft windows platform",
677-680.
Ehsani, Farzad / Bernstein, Jared / Najmi, Amir / Todic, Ognjen:
"Subarashii: Japanese interactive spoken language education",
681-684.
Goddeau, David / Goldenthal, William / Weikart, Chris:
"Deploying speech applications over the web",
685-688.
Schalkwyk, Johan / Villiers, Jacques de / Vuuren, Sarel van / Vermeulen, Pieter:
"CSLUsh: an extendible research environment",
689-692.
Ferenczi, Tibor / Nemeth, Geza / Olaszy, Gabor / Gaspar, Zoltan:
"A flexible client-server model for multilingual CTS/TTS development",
693-696.
Laine, Unto K.:
"Critically sampled PR filterbanks of nonuniform resolution based on block recursive FAMlet transform",
697-700.
Minematsu, Nobuaki / Ohashi, Nariaki / Nakagawa, Seiichi:
"Automatic detection of accent in English words spoken by Japanese students",
701-704.
Taniguchi, Yasuhiro / Reyes, Allan A. / Suzuki, Hideyuki / Nakagawa, Seiichi:
"An English conversation and pronunciation CAI system using speech recognition technology",
705-708.
Sutton, Stephen / Kaiser, Ed / Cronk, A. / Cole, Ron:
"Bringing spoken language systems to the classroom",
709-712.
Cucchiarini, Catia / Boves, Lou:
"Automatic assessment of foreign speakers' pronunciation of dutch",
713-716.
Holzrichter, John F. / Burnett, Greg C.:
"Use of low power EM radar sensors for speech articulator measurements",
717-720.
Epps, Julien / Dowd, Annette / Smith, John / Wolfe, Joe:
"Real time measurements of the vocal tract resonances during speech",
721-724.
Phonetics and Phonology
Cavalcante Albano, Eleonora / Aparecida Aquino, Patricia:
"Linguistic criteria for building and recording units for concatenative speech synthesis in brazilian portuguese",
725-728.
Kvale, Knut / Foldvik, Arne Kjell:
""four-and-twenty, twenty-four". what's in a number?",
729-732.
Moraes, Joao Antonio de:
"Vowel nasalization in Brazilian Portuguese: an articulatory investigation",
733-736.
Steriopolo, Elena:
"Rhythmic organization pecularities of the spoken text",
737-738.
Rueber, Bernhard:
"Obtaining confidence measures from sentence probabilities",
739-742.
Zu, Yiqing:
"Sentence design for speech synthesis and speech recognition database by phonetic rules",
743-746.
Draxler, Christoph / Burger, Susanne:
"Identification of regional variants of high German from digit sequences in German telephone speech",
747-750.
Kavitskaya, Darya:
"Aerodynamic constraints on the production of palatalized trills: the case of the Slavic trilled [r]",
751-754.
Seong, Cheol-jae / Kim, Sanghun:
"An experimental phonetic study of the interrelationship between prosodic phrase and syntactic structure",
755-758.
Heid, Sebastian J. G. G.:
"Individual differences between vowel systems of German speakers",
759-762.
Batliner, Anton / Kießling, Andreas / Kompe, Ralf / Niemann, Heinrich / Nöth, Elmar:
"Tempo and its change in spontaneous speech",
763-766.
Petek, Bojan / Sustarsic, Rastislav:
"A corpus-based approach to diphthong analysis of standard Slovenian",
767-770.
Aguilar, Lourdes / Gimenez, Julia A. / Machuca, Maria / Marin, Rafael / Riera, Montse:
"Catalan vowel duration",
771-774.
Caputo, Maria Rosaria:
"The intonation of vocatives in spoken Neapolitan Italian",
775-778.
Magno Caldognetto, Emanuela / Zmarich, Claudio / Ferrero, Franco:
"A comparative acoustic study of spontaneous and read Italian speech",
779-782.
Refice, Mario / Savino, Michelina / Grice, Martine:
"A contribution to the estimation of naturalness in the intonation of Italian spontaneous speech",
783-786.
Moosmüller, Sylvia:
"Diphthongs and the process of monophthongization in Austrian German: a first approach",
787-790.
Hoskins, Steve:
"The prosody of broad and narrow focus in English: two experiments",
791-794.
Turk, Alice / White, Laurence:
"The domain of accentual lengthening in Scottish English",
795-798.
Bessac, Mariette / Caelen-Haumont, Geneviève:
"Spontaneous dialogue: some results about the F0 predictions of a pragmatic model of information processing",
799-802.
Demolin, Didier / Teston, Bernard:
"Phonetic characteristics of double articulations in some Mangbutu-efe languages",
803-806.
Hernaez, Inmaculada / Gaminde, Inaki / Etxebarria, Borja / Etxebarria, Pilartxo:
"Intonation modeling for the southern dialects of the Basque language 807",
807-809.
O'Boyle, Peter / Ming, Ji / Owens, Marie / Smith, F. Jack:
"From phone identification to phone clustering using mutual information",
2391-2394.
Berrah, Ahmed-Reda / Laboissiere, Rafael:
"Phonetic code emergence in a society of speech robots: explaining vowel systems and the MUAF principle",
2395-2398.
Moen, Inger / Simonsen, Hanne Gram:
"Effects of voicing on /t,d/ tongue/palate contact in English and norwegian",
2399-2402.
Ladefoged, Peter / Fant, Gunnar:
"Fieldwork techniques for relating formant frequency, amplitude and bandwidth",
2403-2406.
Wang, Xue / Pols, Louis C.W.:
"Word juncture modelling based on the TIMIT database",
2407-2410.
Ueyama, Motoko:
"The phonology and phonetics of second language intonation: the case of "Japanese English"",
2411-2414.
Confidence Measures in ASR
Fetter, Pablo / Haiber, Udo / Regel-Brietzmann, Peter:
"A low-cost phonetic transcription method",
811-814.
Chase, Lin:
"Word and acoustic confidence annotation for large vocabulary speech recognition",
815-818.
Bergen, Zachary / Ward, Wayne:
"A senone based confidence measure for speech recognition",
819-822.
Bernstein, Erica / Evans, Ward R.:
"OOV utterance detection based on the recognizer response function",
823-826.
Kemp, Thomas / Schaaf, Thomas:
"Estimating confidence using word lattices",
827-830.
Siu, Man-hung / Gish, Herbert / Richardson, Fred:
"Improved estimation, evaluation and applications of confidence measures for speech recognition",
831-834.
Speaker and Language Identification
Hussain, Salleh / McInnes, Fergus R. / Jack, Mervyn A.:
"Improved speaker verification system with limited training data on telephone quality speech",
835-838.
Li, Qi / Juang, Biing-Hwang / Zhou, Qiru / Lee, Chin-Hui:
"Verbal information verification",
839-842.
Sarma, Sridevi V. / Zue, Victor W.:
"A segment-based speaker verification system using SUMMIT",
843-846.
Sokolov, Michael:
"Speaker verification on the world wide web",
847-850.
Lindberg, Johan / Melin, Håkan:
"Text-prompted versus sound-prompted passwords in speaker verification systems",
851-854.
Schmidt, Michael / Golden, John / Gish, Herbert:
"GMM sample statistic log-likelihoods for text-independent speaker recognition",
855-858.
Perception of Prosody
Rietveld, Toni / Gussenhoven, Carlos:
"The influence of phrase boundaries on perceived prominence in two-peak intonation contours",
859-862.
Caspers, Johanneke:
"Testing the meaning of four dutch pitch accent types",
863-866.
Mersdorf, Joachim J. / Domhover, Thomas:
"A perceptual study for modelling speaker-dependent intonation in TTS and dialog systems",
867-870.
Aubergé, Veronique / Grepillat, Tuulikki / Rilliard, A.:
"Can we perceive attitudes before the end of sentences? the gating paradigm for prosodic contours",
871-874.
Heldner, Mattias / Strangert, Eva:
"To what extent is perceived focus determined by F0-cues?",
875-878.
House, David / Hermes, Dik / Beaugendre, Frédéric:
"Temporal-alignment categories of accent-lending rises and falls",
879-882.
Applications of Speech Technology
Lau, Raymond / Flammia, Giovanni / Pao, Christine / Zue, Victor W.:
"Webgalaxy - integrating spoken language and hypertext navigation",
883-886.
Carey, Michael J. / Parris, Eluned S. / Tattersall, Graham D.:
"Pitch estimation of singing for re-synthesis and musical transcription",
887-890.
Jones, Christian Martyn / Dlay, Satnam Singh:
"Automated lip synchronisation for human-computer interaction and special effect animation",
891-894.
Hemphill, Charles T. / Muthusamy, Yeshwant K.:
"Developing web-based speech applications",
895-898.
Verhelst, Werner:
"Automatic post-synchronization of speech utterances",
899-902.
Robert-Ribes, Jordi / Mukhtar, Rami G.:
"Automatic generation of hyperlinks between audio and transcript",
903-906.
Möller, Sebastian / Schonweiler, Rainer:
"Analysis of infant cries for the early detection of hearing impairment",
1759-1762.
Hatzis, A. / Green, P.D. / Howard, S.J.:
"Optical logo-therapy (OLT): a computer-based real time visual feedback application for speech training",
1763-1766.
Lin, Sung-Chien / Chien, Lee-Feng / Chen, Ming-Chiuan / Lee, Lin-Shan / Chen, Ker-Jiann:
"Intelligent retrieval of very large Chinese dictionaries with speech queries",
1767-1770.
Leonardi, Fulvio / Micca, Giorgio / Militello, Sheyla / Nigra, Mario:
"Preliminary results of a multilingual interactive voice activated telephone service for people-on-the-move",
1771-1774.
Dubois, Jean-Christophe / Anglade, Yolande / Fohr, Dominique:
"Assessment of an operational dialogue system used by a blind telephone switchboard operator",
1775-1778.
Rubio, Antonio J. / Garcia, Pedro / Torre, Angel de la / Segura, Jose C. / Diaz-Verdejo, Jesus / Benitez, Maria C. / Sanchez, Victoria / Peinado, Antonio M. / Lopez-Soler, Juan M. / Perez-Cordoba, Jose L.:
"STACC: an automatic service for information access using continuous speech recognition through telephone line",
1779-1782.
Lopez-Cozar, Ramon / Garcia, Pedro / Diaz-Verdejo, Jesus / Rubio, Antonio J.:
"A voice activated dialogue system for fast-food restaurant applications",
1783-1786.
Shields, Paul W. / Campbell, Douglas R.:
"Multi-microphone sub-band adaptive signal processing for improvement of hearing aid performance",
1787-1790.
Piroth, Hans Georg / Arnhold, Thomas:
"Tactile transmission of intonation and stress",
1791-1794.
Huttunen, Kerttu / Korkko, Pentti / Sorri, Martti:
"Hearing impairment simulation: an interactive multimedia programme on the internet for students of speech therapy",
1795-1798.
Ciocea, Sorin / Schoentgen, Jean / Crevier-Buchman, Lisa:
"Analysis of dysarthric speech by means of formant-to-area mapping",
1799-1802.
Lobanov, Boris M. / Brickle, Simon V. / Kubashin, Andrey V. / Levkovskaja, Tatiana V.:
"An intelligent telephone answering system using speech recognition",
1803-1806.
Ackermann, Ulla / Angelini, Bianca / Brugnara, Fabio / Federico, Marcello / Giuliani, Diego / Gretter, Roberto / Niemann, Heinrich:
"Speedata: a prototype for multilingual spoken data-entry",
1807-1810.
Karjalainen, Matti / Boda, Peter / Somervuo, Panu / Altosaar, Toomas:
"Applications for the hearing-impaired: evaluation of finnish phoneme recognition methods",
1811-1814.
Alarotu, Nina / Lennes, Mietta / Altosaar, Toomas / Malm, Anja / Karjalainen, Matti:
"Applications for the hearing-impaired: comprehension of finnish text with phoneme errors",
1815-1818.
Ehrlich, Ute / Hanrieder, Gerhard / Hitzenberger, Ludwig / Heisterkamp, Paul / Mecklenburg, Klaus / Regel-Brietzmann, Peter:
"Access - automated call center through speech understanding system",
1819-1822.
Anthony, E. Richard / Bowen, Charles / Peet, Margot T. / Tammaro, Susan:
"Integrating a radio model with a spoken language interface for military simulations",
1823-1826.
Falavigna, Daniele / Gretter, Roberto:
"On field experiments of continuous digit recognition over the telephone network",
1827-1830.
Menendez-Pidal, Xavier / Polikoff, James B. / Bunnell, H.Timothy:
"An HMM-based phoneme recognizer applied to assessment of dysarthric speech",
1831-1834.
Torre, Celinda de la / Alonso, Gonzalo:
"Multiapplication platform based on technology for mobile telephone network services",
1835-1838.
Os, Els den / Boves, Lou / James, David / Winski, Richard / Fridh, Kurt:
"Field test of a calling card service based on speaker verification and automatic speech recognition",
1839-1842.
Julia, Luc E. / Cheyer, Adam J.:
"Speech: a privileged modality",
1843-1846.
Spontaneous Speech Recognition
Gauvain, Jean-Luc / Lamel, Lori / Adda, Gilles / Adda-Decker, Martine:
"Transcription of broadcast news",
907-910.
Alleva, Fil / Huang, Xuedong / Hwang, Mei-Yuh / Jiang, Li:
"Can continuous speech recognizers handle isolated speech?",
911-914.
Matsuok, Tatsuo / Taguchi, Yuichi / Ohtsuki, Katsutoshi / Furui, Sadaoki / Shirai, Katsuhiko:
"Toward automatic transcription of Japanese broadcast news",
915-918.
Cettolo, Mauro / Corazza, Anna:
"Automatic detection of semantic boundaries",
919-922.
Bauche, Etienne / Gajic, Bojana / Minami, Yasuhiro / Matsuoka, Tatsuo / Furui, Sadaoki:
"Connected digit recognition in spontaneous speech",
923-926.
Kubala, Francis / Jin, Hubert / Matsoukas, Spyros / Nguyen, Long / Schwartz, Richard / Makhoul, John:
"Advances in transcription of broadcast news",
927-930.
Language Specific Segmental Features
Cambier-Langeveld, Tina / Nespor, Marina / Heuven, Vincent J. van:
"The domain of final lengthening in production and perception in Dutch",
931-934.
Meunier, Christine:
"Voicing assimilation as a cue for cluster identification",
935-938.
Riele, Saskia M.M. te / Loef, Manon / Herwijnen, O. van:
"On the perceptual relevance of degemination in Dutch",
939-942.
Fougeron, Cecile / Steriade, Donca:
"Does deletion of French SCHWA lead to neutralization of lexical distinctions?",
943-946.
Bruyninckx, Marielle / Harmegnies, Bernard:
"An approach of the catalan palatals discrimination based on durational patterns of spectral evolution",
947-950.
Gros, Jerneja / Pavesic, Nikola / Mihelic, France:
"Syllable and segment duration at different speaking rates in the Slovenian language",
951-954.
Speaker Recognition
Li, Wei-Ying / O'Shaughnessy, Douglas:
"Hybrid networks based on RBFN and GMM for speaker recognition",
955-958.
He, Jialong / Liu, Li / Palm, Günther:
"A discriminative training algorithm for Gaussian mixture speaker models",
959-962.
Reynolds, Douglas A.:
"Comparison of background normalization methods for text-independent speaker verification",
963-966.
Kimball, Owen / Schmidt, Michael / Gish, Herbert / Waterman, Jason:
"Speaker verification with limited enrollment data",
967-970.
Bimbot, Frédéric / Hutter, Hans-Peter / Jaboulet, Cedric / Koolwaaij, Johan W. / Lindberg, Johan / Pierrot, Jean-Benoit:
"Speaker verification in the telephone network: research activities in the cave project",
971-974.
Kuitert, Mark / Boves, Lou:
"Speaker verification with GSM coded telephone speech",
975-978.
Rosenberg, Aaron E. / Parthasarathy, S.:
"Speaker identification with user-selected password phrases",
1371-1374.
Olsen, Jesper O.:
"Speaker verification based on phonetic decision making",
1375-1378.
Ariyaeeinia, A. M. / Sivakumaran, P.:
"Analysis and comparison of score normalisation methods for text-dependent speaker verification",
1379-1382.
Jauquet, Frederic / Verlinde, Patrick / Vloeberghs, Claude:
"Automatic speaker recognition on a vocoder link",
1383-1386.
Bimbot, Frederic / Genoud, Dominiqne:
"Likelihood ratio adjustment for the compensation of model mismatch in speaker verification",
1387-1390.
Sönmez, M. Kemal / Heck, Larry / Weintraub, Mitchel / Shriberg, Elizabeth:
"A lognormal tied mixture model of pitch for prosody based speaker recognition",
1391-1394.
Speech Synthesis: Linguistic Analysis
Campbell, Nick / Hebert, Tony / Black, Ezra:
"Parsers, prominence, and pauses",
979-982.
Béchet, Frédéric / El-Bèze, Marc:
"Automatic assignment of part-of-speech to out-of-vocabulary words for text-to-speech processing",
983-986.
Gili Fivela, Barbara / Quazza, Silvia:
"Text-to-prosody parsing in an Italian speech synthesizer. recent improvements",
987-990.
Krenn, Brigitte:
"Tagging syllables",
991-994.
Black, Alan W. / Taylor, Paul:
"Assigning phrase breaks from part-of-speech sequences",
995-998.
Widera, Christina / Portele, Thomas / Wolters, Maria:
"Prediction of word prominence",
999-1002.
Speech Analysis and Modelling
Kuwabara, Hisao:
"Acoustic and perceptual properties of phonemes in continuous speech as a function of speaking rate",
1003-1006.
Narayanan, Shrikanth / Alwan, Abeer / Song, Yong:
"New results in vowel production: MRI, EPG, and acoustic data",
1007-1010.
Arai, Takayuki / Greenberg, Steven:
"The temporal properties of spoken Japanese are similar to those of English",
1011-1014.
Esposito, Anna:
"The amplitudes of the peaks in the spectrum: data from /a/ context",
1015-1018.
Bolfan-Stosic, Natalija / Hedjever, Mladen:
"Acoustical characteristics of speech and voice in speech pathology",
1019-1022.
Kipp, Andreas / Wesenick, Maria-Barbara / Schiel, Florian:
"Pronuncation modeling applied to automatic segmentation of spontaneous speech",
1023-1026.
Downey, Simon / Wiseman, Richard:
"Dynamic and static improvements to lexical baseforms",
1027-1030.
Hauenstein, Andreas:
"Signal driven generation of word baseforms from few examples",
1031-1034.
Botha, Elizabeth C. / Pols, Louis C. W.:
"Modeling the acoustic differences between L1 and L2 speech: the short vowels of africaans and south-african English",
1035-1038.
Vaxelaire, Béatrice / Sock, Rudolph:
"Laryngeal movements and speech rate: an x-ray investigation",
1039-1042.
Eriksson, Anders / Wretling, Pär:
"How flexible is the human voice? - a case study of mimicry",
1043-1046.
Strik, Helmer:
"The effect of low-pass filtering on estimated voice source parameters",
1047-1050.
Fosnot, Susan M.:
"Vowel development of /i/ and /u/ in 15-36 month old children at risk and not at risk to stutter",
1051-1054.
Wrench, Alan / McIntosh, Alan / Hardcastle, William:
"Optopalatograph: development of a device for measuring tongue movement in 3D",
1055-1058.
Gutierrez-Arriola, Juana M. / Gimenez de los Galanes, Francisco M. / Savoji, Mohammed H. / Pardo, José M.:
"Speech synthesis and prosody modification using segmentation and modelling of the excitation signal",
1059-1062.
Savariaux, Christophe / Boë, Louis-Jean / Perrier, Pascal:
"How can the control of the vocal tract limit the speaker's capability to produce the ultimate perceptive objectives of speech? 1063",
1063-1066.
Jovanovic, Goran S.:
"A step toward general model for symbolic description of the speech signal 1067",
1067-1070.
Furukawa, Kiyoshi / Nakazawa, Masayuki / Endo, Takashi / Oka, Ryuichi:
"Referring in long term speech by using orientation patterns obtained from vector field of spectrum pattern",
1071-1074.
Dialogue Systems: Design and Applications
Barnett, J. / Anderson, S. / Broglio, J. / Singh, M. / Hudson, R. / Kuo, S. W.:
"Experiments in spoken queries for document retrieval",
1323-1326.
Seide, Frank / Kellner, Andreas:
"Towards an automated directory information system",
1327-1330.
Larsen, Lars Bo:
"A strategy for mixed-initiative dialogue control",
1331-1334.
Hugunin, Jim / Zue, Victor W.:
"On the design of effective speech-based interfaces for desktop applications",
1335-1338.
Denecke, Matthias / Waibel, Alex:
"Dialogue strategies guiding users to their communicative goals",
1339-1342.
Issar, Sunil:
"A speech interface for forms on WWW",
1343-1346.
Flammia, Giovanni / Zue, Victor W.:
"Learning the structure of mixed initiative dialogues using a corpus of annotated conversations 1",
1871-1874.
Pieraccini, Roberto / Levin, Esther / Eckert, Wieland:
"AMICA: the AT&t mixed initiative conversational architecture",
1875-1878.
Abella, Alicia / Gorin, Allen L.:
"Generating semantically consistent inputs to a dialog manager",
1879-1882.
Levin, Esther / Pieraccini, Roberto:
"A stochastic model of computer-human interaction for learning dialogue strategies",
1883-1886.
Boros, Manuela / Aretoulaki, Maria / Gallwitz, Florian / Noth, Elmar / Niemann, Heinrich:
"Semantic processing of out-of-vocabulary words in a spoken dialogue system",
1887-1890.
Maier, Elisabeth:
"Clarification dialogues in VERBMOBIL",
1891-1894.
Speech Production Modelling
Arslan, Levent M. / Talkin, David:
"Voice conversion by codebook mapping of line spectral frequencies and excitation spectrum",
1347-1350.
Mokbel, C. / Gravier, G. / Chollet, Gérard:
"Optimal state dependent spectral representation for HMM modeling : a new theoretical framework",
1351-1354.
Potamianos, Alexandros / Maragos, Petros:
"Speech analysis and synthesis using an AM-FM modulation model",
1355-1358.
Mawass, Khaled / Badin, Pierre / Bailly, Gérard:
"Synthesis of fricative consonants by audiovisual-to-articulatory inversion",
1359-1362.
Claes, Tom / Dologlou, Ioannis / Bosch, Louis ten / Compernolle, Dirk Van:
"New transformations of cepstral parameters for automatic vocal tract length normalization in speech recognition",
1363-1366.
Dobrisek, S. / Mihelic, F. / Pavesic, N.:
"A multiresolutionally oriented approach for determination of cepstral features in speech recognition",
1367-1370.
Speech Enhancement and Noise Mitigation
Haulick, Tim / Linhard, Klaus / Schrogmeier, Peter:
"Residual noise suppression using psychoacoustic criteria",
1395-1398.
Yegnanarayana, B. / Avendano, Carlos / Hermansky, Hynek / Murthy, P. Satyanarayana:
"Processing linear prediction residual for speech enhancement",
1399-1402.
Gustafsson, Stefan / Martin, Rainer:
"Combined acoustic echo control and noise reduction for mobile communications",
1403-1406.
Lee, Ki Yong / Rheem, Jae Yeol:
"A nonstationary autoregressive HMM and its application to speech enhancement",
1407-1410.
Yoma, Nestor Becerra / McInnes, Fergus R. / Jack, Mervyn A.:
"Spectral subtraction and mean normalization in the context of weighted matching algorithms",
1411-1414.
Tsoukalas, D. E. / Mourjopoulos, J. / Kokkinakis, George:
"Improving the intelligibility of noisy speech using an audible noise suppression technique",
1415-1418.
Girin, Laurent / Feng, Gang / Schwartz, Jean-Luc:
"Noisy speech enhancement by fusion of auditory and visual information: a study of vowel transitions",
2555-2558.
Engelsberg, Andreas / Gulzow, Thomas:
"Spectral subtraction using a non-critically decimated discrete wavelet transform",
2559-2562.
Chien, Jen-Tzung / Wang, Hsiao-Chuan / Lee, Chin-Hui:
"Bayesian affine transformation of HMM parameters for instantaneous and supervised adaptation in telephone speech recognition",
2563-2566.
Lawrence, Craig / Rahim, Mazin:
"Integrated bias removal techniques for robust speech recognition \lambda",
2567-2570.
Langmann, Detlev / Fischer, Alexander / Wuppermann, Friedhelm / Haeb-Umbach, Reinhold / Eisele, Thomas:
"Acoustic front ends for speaker-independent digit recognition in car environments",
2571-2574.
Delphin-Poulat, Lionel / Mokbel, Chafic:
"Signal bias removal using the multi-path stochastic equalization technique",
2575-2578.
Miksic, Andrej / Horvat, Bogomir:
"Subband echo cancellation in automatic speech dialog systems",
2579-2582.
Tolba, Hesham / O'Shaughnessy, Douglas:
"Speech enhancement via energy separation",
2583-2586.
Unoki, Masashi / Akagi, Masato:
"A method of signal extraction from noisy signal",
2587-2590.
Sika, Jiri / Davidek, Vratislav:
"Multi-channel noise reduction using wavelet filter bank",
2591-2594.
Abdallah, Imad / Montresor, Silvio / Baudry, Marc:
"Speech signal detection in noisy environement using a local entropic criterion",
2595-2598.
Moreno, Pedro J. / Eberman, Brian:
"A new algorithm for robust speech recognition: the delta vector taylor series approach",
2599-2602.
Cole, David / Moody, Miles / Sridharan, Sridha:
"Robust enhancement of reverberant speech using iterative noise removal",
2603-2606.
Jones, D. J. / Watson, Scott D. / Evans, K. G. / Cheetham, B. M. G. / Reeve, R. A.:
"A network speech echo canceller with comfort noise",
2607-2610.
Hussain, Amir / Campbell, Douglas R. / Moir, Thomas J.:
"A new metric for selecting sub-band processing in adaptive speech enhancement systems",
2611-2614.
Kobatake, Hidefumi / Suzuki, Hideta:
"Estimation of LPC cepstrum vector of speech contaminated by additive noise and its application to speech enhancement",
2615-2618.
Tibrewala, Sangita / Hermansky, Hynek:
"Multi-band and adaptation approaches to robust speech recognition",
2619-2622.
Masgrau, Enrique / Lleida, Eduardo / Vicente, Luis:
"Non-quadratic criterion algorithms for speech enhancement",
2623-2626.
Spoken Language Understanding
Wright, J. H. / Gorin, Allen L. / Riccardi, Giuseppe:
"Automatic acquisition of salient grammar fragments for call-type classification",
1419-1422.
Minker, Wolfgang:
"Stochastically-based natural language understanding across tasks and languages",
1423-1426.
Riley, Michael / Pereira, Fernando / Mohri, Mehryar:
"Transducer composition for context-dependent network expansion",
1427-1430.
Lieske, Christian / Bos, Johan / Emele, Martin / Gambac, Bjorn / Rupp, C.J. :
"Giving prosody a meaning",
1431-1434.
Papineni, Kishore A. / Roukos, Salim / Ward, Todd R.:
"Feature-based language understanding",
1435-1438.
Amengual, Juan Carlos / Benedi, Jose Miguel / Beulen, Klaus / Casacuberta, Francisco / Castano, Asuncion / Castellanos, Antonio / Jimenez, Victor M. / Llorens, David / Marzal, Andres / Ney, Hermann / Prat, Federico / Vida, Enrique / Vila, Juan Miguel:
"Speech translation based on automatically trainable finite-state models",
1439-1442.
Language Model Adaptation
Gotoh, Yoshihiko / Renals, Steve:
"Document space models using latent semantic analysis",
1443-1446.
Martin, Sven C. / Liermann, Jörg / Ney, Hermann:
"Adaptive topic - dependent language modelling using word - based varigrams",
1447-1450.
Bellegarda, Jerome R.:
"A latent semantic analysis framework for large-Span language modeling",
1451-1454.
Schwartz, Richard / Imai, Toru / Kubala, Francis / Nguyen, Long / Makhoul, John:
"A maximum likelihood model for topic classification of broadcast news",
1455-1458.
Popovici, Cosmin / Baggia, Paolo:
"Language modelling for task-oriented domains",
1459-1462.
Lin, Sung-Chien / Tsai, Chi-Lung / Chien, Lee-Feng / Chen, Ker-Jiann / Lee, Lin-Shan:
"Chinese language model adaptation based on document classification and multiple domain-specific language models",
1463-1466.
Prosody and Speech Recognition/Understanding
Langlais, Philippe:
"Estimating prosodic weights in a syntactic-rhythmical prediction system",
1467-1470.
Ozeki, Kazuhiko / Kousaka, Kazuyuki / Zhang, Yujie:
"Syntactic information contained in prosodic features of Japanese utterances",
1471-1474.
Chung, Grace / Seneff, Stephanie:
"Hierarchical duration modelling for speech recognition using the ANGIE framework",
1475-1478.
Strom, Volker / Elsner, Anja / Hess, Wolfgang / Kasper, Walter / Klein, Alexandra / Krieger, Hans Ulrich / Spilker, Jörg / Weber, Hans / Gorz, Gunther:
"On the use of prosody in a speech-to-speech translator",
1479-1482.
Heuven, Vincent J. van / Haan, Judith / Pacilly, Jos J.A.:
"Automatic recognition of sentence type from prosody in dutch",
1483-1486.
Munteanu, Paul / Caillaud, Bertrand / Serignat, Jean-Francois / Caelen-Haumont, Genevicve:
"Automatic word demarcation based on prosody",
1487-1490.
Wideband Speech Coding
Kataoka, A. / Kurihara, S. / Sasaki, S. / Hayashi, S.:
"A 16-kbit/s wideband speech codec scalable with g.729",
1491-1494.
Lynch, M. / Ambikairajah, E. / Davis, A.:
"Comparison of auditory masking models for speech coding",
1495-1498.
Amodio, A. / Feng, G.:
"Wideband speech coding based on the MBE structure",
1499-1502.
Guimaraes, Marcos Perreau / Moreau, Nicolas / Bonnet, Madeleine:
"Perceptual filter comparisons for wideband and FM bandwidth audio coders",
1503-1506.
Chan, Cheung-Fat / Chu, Man-Tak:
"Wideband coding of speech using neural network gain adaptation",
1507-1510.
Salavedra, Josep M.:
"Wideband-speech APVQ coding from 16 to 32 kbps",
1511-1514.
Speech Recognition in Adverse Environments CSR and Error Analysis
Hung, Wei-Wen / Wang, Hsiao-Chuan:
"A comparative analysis of blind channel equalization methods for telephone speech recognition",
1515-1518.
Hung, Wei-Wen / Wang, Hsiao-Chuan:
"HMM retraining based on state duration alignment for noisy speech recognition",
1519-1522.
Komori, Yasuhiro / Kosaka, Tetsuo / Yamamoto, Hiroki / Yamada, Masayuki:
"Fast parallel model combination noise adaptation processing",
1523-1526.
Endo, Takashi / Nagaya, Shigeki / Nakazawa, Masayuki / Furukawa, Kiyoshi / Oka, Ryuichi:
"Speech recognition module for CSCW using a microphone array",
1527-1530.
Han, Jiqing / Han, Munsung / Park, Gyu-Bong / Park, Jeongue / Gao, Wen:
"Relative mel-frequency cepstral coefficients compensation for robust telephone speech recognition",
1531-1534.
Yamamoto, Seiichi / Naito, Masaki / Kuroiwa, Shingo:
"Robust speech detection method for speech recognition system for telecommunication networks and its field trial",
1535-1538.
Mauuary, Laurent / Karray, Lamia:
"The tuning of speech detection in the context of a global evaluation of a voice response system",
1539-1542.
Chen, C. Julian / Gopinath, Ramesh A. / Monkowski, Michael D. / Picheny, Michael A. / Shen, Katherine:
"New methods in continuous Mandarin speech recognition",
1543-1546.
Spina, Michelle S. / Zue, Victor W.:
"Automatic transcription of general audio data: effect of environment segmentation on phonetic recognition 1",
1547-1550.
Ng, Alfred Ying Pang / Chan, L. W. / Ching, P. C.:
"Automatic recognition of continuous Cantonese speech with very large vocabulary",
1551-1554.
Gong, Yifan:
"Source normalization training for HMM applied to noisy telephone speech recognition",
1555-1558.
Neto, Joao P. / Martins, Ciro A. / Almeida, Luis B.:
"The development of a speaker independent continuous speech recognizer for portuguese",
1559-1562.
Chase, Lin:
"Blame assignment for errors made by large vocabulary speech recognizers",
1563-1566.
Nakamura, Atsushi:
"Predicting speech recognition performance",
1567-1570.
Watson, Scott D. / Cheetham, Barry M.G. / Barrett, P.A. / Wong, W.T.K. / Lewi, A.V.:
"A voice activity detector for the ITU-t 8kbit/s speech coding standard g.729",
1571-1574.
Muthusamy, Yeshwant K. / Godfrey, John J.:
"Vocabulary-independent recognition of american Spanish phrases and digit strings",
1575-1578.
Hemphill, Charles T. / Muthusamy, Yeshwant K.:
"Developing web-based speech applications",
1575-1578.
Meyer, Michael / Hild, Hermann:
"Recognition of spoken and spelled proper names",
1579-1582.
Kobayashi, Takao / Masuko, Takashi / Tokuda, Keiichi:
"HMM compensation for noisy speech recognition based on cepstral parameter generation",
1583-1586.
Nokas, George / Dermatas, Evangelos / Kokkinakis, George:
"On the robustness of the critical-band adaptive filtering method for multi-source noisy speech recognition",
1587-1590.
Guan, Cun-tai / Leung, Shu-hung / Lau, Wing-hong:
"A space transformation approach for robust speech recognition in noisy environments",
1591-1594.
Vaich, Tzur / Cohen, Arnon:
"Robust isolated word recognition using WSP-PMC combination",
1595-1598.
Multimodal Speech Processing, Emerging Techniques and Applications
Raptis, Spyros / Carayannis, George V.:
"Fuzzy logic for rule-based formant speech synthesis",
1599-1602.
Jourlin, Pierre / Luettin, Juergen / Genoud, Dominique / Wassner, Hubert:
"Integrating acoustic and labial information for speaker identification and verification",
1603-1606.
Ng, Kenney / Zue, Victor W.:
"Subword unit representations for spoken document retrieval",
1607-1610.
Teissier, Pascal / Schwartz, Jean-Luc / Guerin-Dugue, Anne:
"Non-linear representations, sensor reliability estimation and context-dependent fusion in the audiovisual recognition of speech in noise",
1611-1614.
Renevey, Philippe / Drygajlo, Andrzej:
"Securized flexible vocabulary voice messaging system on unix workstation with ISDN connection",
1615-1618.
Mokbel, Houda / Jouvet, Denis:
"Automatic derivation of multiple variants of phonetic transcriptions from acoustic signals",
1619-1622.
Nakamura, Satoshi / Nagai, Ron / Shikano, Kiyohiro:
"Improved bimodal speech recognition using tied-mixture HMMs and 5000 word audio-visual synchronous database",
1623-1626.
Depambour, Philippe / Andre-Obrecht, Regine / Delyon, Bernard:
"On the use of phone duration and segmental processing to label speech signal",
1627-1630.
Paping, Martin / Fahnle, Thomas:
"Automatic detection of disturbing robot voice- and ping pong-effects in GSM transmitted speech",
1631-1634.
Martino, Joseph Di:
"Speech synthesis using phase vocoder techniques",
1635-1638.
Sarukkai, Ramesh R. / Hunter, Craig:
"Integration of eye fixation information with speech recognition systems",
1639-1643.
Nakatoh, Yoshihisa / Tsushima, M. / Norimatsu, T.:
"Generation of broadband speech from narrowband speech using piecewise linear mapping",
1643-1646.
Rogers, Ian E.C.:
"An assessment of the benefits active noise reduction systems provide to speech intelligibility in aircraft noise environments",
1647-1650.
Beskow, Jonas / Elenius, Kjell / McGlashan, Scott:
"OLGA - a dialogue system with an animated talking agent",
1651-1654.
Robbe, Sandrine / Carbonell, Noelle / Valot, Claude:
"Towards usable multimodal command languages: definition and ergonomic assessment of constraints on users' spontaneous speech and gestures",
1655-1658.
Suhm, Bernhard / Waibel, Alex:
"Exploiting repair context in interactive error recovery",
1659-1662.
Reveret, Lionel / Garcia, Frederique / Benoit, Christian / Vatikiotis-Bateson, Eric:
"An hybrid image processing approach to liptracking independent of head orientation",
1663-1666.
Goff, Bertrand Le:
"Automatic modeling of coarticulation in text-to-visual speech synthesis",
1667-1670.
Adjoudani, Ali / Guiard-Marigny, Thierry / Goff, Bertrand Le / Reveret, Lionel / Benoit, Christian:
"A multimedia platform for audio-visual speech processing",
1671-1674.
Fujisaki, Hiroya / Kameda, Hiroyuki / Ohno, Sumio / Ito, Takuya / Tajima, Ken / Abe, Kenji:
"An intelligent system for information retrieval over the internet through spoken dialogue",
1675-1678.
Yardimci, Yasemin / Cetin, A. Enis / Ansari, Rashid:
"Data hiding in speech using phase coding",
1679-1682.
Burnham, Denis / Fowler, John / Nicol, Michelle:
"CAVE: an on-line procedure for creating and running auditory-visual speech perception experiments-hardware, software, and advantages",
1683-1686.
Databases, Tools and Evaluations
Schiel, Florian / Draxler, Christoph / Tillmann, Hans G.:
"The bavarian archive for speech signals: resources for the speech community",
1687-1690.
Draxler, Christoph:
"WWWTranscribe - a modular transcription system based on the world wide web",
1691-1694.
Engberg, Inger S. / Hansen, Anya Varnich / Andersen, Ove / Dalsgaard, Paul:
"Design, recording and verification of a danish emotional speech database",
1695-1698.
Eskenazi, Maxine / Hogan, C. / Allen, J. / Frederking, R.:
"Issues in database creation: recording new populations, faster and better labelling",
1699-1702.
Feldes, Stefan / Kaspar, Bernhard / Jouvet, Denis:
"Design and analysis of a German telephone speech database for phoneme based training",
1703-1706.
Neto, Joao P. / Martins, Ciro A. / Meinedo, Hugo / Almeida, Luis B.:
"The design of a large vocabulary speech corpus for portuguese",
1707-1710.
Nord, Lennart / Hammarberg, Britta / Lundstrom, Elisabet:
"Continued investigations of laryngectomee speech in noise - measurements and intelligibility tests",
1711-1714.
Rothkrantz, L.J.M. / Manintveld, W.A.Th. / Rats, M.M.M. / Vark, R.J. van / Vreught, J.P.M. de / Koppelaar, H.:
"An appreciation study of an ASR inquiry system",
1715-1718.
Bensaber, Kamel / Munteanu, Paul / Serignat, Jean-Francois / Perrier, Pascal:
"Object-oriented modeling of articulatory data for speech research information systems",
1719-1722.
Kim, Woosung / Koo, Myoung-Wan:
"A Korean speech corpus for train ticket reservation aid system based on speech recognition",
1723-1726.
Dutton, Dawn / Kamm, Candace / Boyce, Susan:
"Recall memory for earcons",
1727-1730.
Mella, O. / Fohr, D.:
"Semi-automatic phonetic labelling of large corpora",
1731-1734.
Grocholewski, Stefan:
"CORPORA - speech database for Polish diphones",
1735-1738.
Müller, Christel / Ziem, Thomas:
"Multilingual speech interfaces (MSI) and dialogue design environments for computer telephony services",
1739-1742.
Hansen, John H. L. / Bou-Ghazale, Sahar E.:
"Getting started with SUSAS: a speech under simulated and actual stress database",
1743-1746.
Taylor, Paul / Tanenblatt, Michael / Isard, Amy:
"A markup language for text-to-speech synthesis richard sproat",
1747-1750.
Itahashi, Shuichi / Ueda, Naoko / Yamamoto, Mikio:
"Several measures for selecting suitable speech CORPORA",
1751-1754.
Chatzi, Irene / Fakotakis, Nikos / Kokkinakis, George:
"Greek speech database for creation of voice driven teleservices",
1755-1758.
Speaker Adaptation I
Huo, Qiang / Lee, Chin-Hui:
"Combined on-line model adaptation and Bayesian predictive classification for robust speech recognition",
1847-1850.
Aubert, Xavier / Thelen, Eric:
"Speaker adaptive training applied to continuous mixture density modeling",
1851-1854.
Illina, Irina / Gong, Yifan:
"Speaker normalization training for mixture stochastic trajectory model",
1855-1858.
Digalakis, V.:
"On-line adaptation of hidden Markov models using incremental estimation algorithms",
1859-1862.
Kannan, Ashvin / Ostendorf, Mari:
"Modeling dependency in adaptation of acoustic models using multiscale tree processes",
1863-1866.
Heck, Larry / Sankar, Ananth:
"Acoustic clustering and adaptation for robust speech recognition",
1867-1870.
Assessment Methods
Martin, Alvin / Doddington, George / Kamm, Terri / Ordowski, Mark / Przybocki, Mark:
"The DET curve in assessment of detection task performance",
1895-1898.
Klaus, Harald / Diedrich, Ekkehard / Dehnel, Astrid / Berger, Jens:
"Speech quality evaluation of hands-free terminals",
1899-1902.
Pallett, David S. / Fiscus, Jonathan G. / Fisher, William M. / Garofolo, John S.:
"Use of broadcast news materials for speech recognition benchmark tests",
1903-1906.
Fraser, Norman M.:
"Spoken dialogue system evaluation: a first framework for reporting results",
1907-1910.
Bernsen, Niels Ole / Dybkjaer, Hans / Dybkjaer, Laila / Zinkevicius, Vytautas:
"Generality and transferability. two issues in putting a dialogue evaluation tool into practical use",
1911-1914.
Leeuwen, David A. van / Steeneken, Herman J. M.:
"Within-speaker variability of the word error rate for a continuous speech recognition system",
1915-1918.
Education for Language and Speech Communication
Huckvale, Mark / Benoit, Christian / Bowerman, C. / Eriksson, Anders / Rosner, M. / Tatham, M. / Williams, Briony:
"Opportunities for computer-aided instruction in phonetics and speech communication provided by the internet",
1919-1922.
Bloothooft, Gerrit:
"The landscape of future education in speech communication sciences",
1923-1926.
Sjölander, Kare / Gustafson, Joakim:
"An integrated system for teaching spoken dialogue systems technology",
1927-1930.
Beck, Janet / Camilleri, Bernard / Chantrain, Hilde / Klippi, Anu / Leterme, Marianne / Lehtihalmes, Matti / PeterSchneider, PeterSchneider / Vieregge, Wilhelm / Wigforss, Eva:
"Communication science within education for logopedics/speech and language therapy in europe: the state of the art",
1931-1934.
Green, Phil / Espain, Carlos / The Spoken Language Engineering Working Group of the Socrates Thematic Sciences Network in Speech Communication:
"Education in spoken language engineering in europe",
1935-1938.
Hazan, Valerie / Dommelen, Wim van:
"A survey of phonetics education in Europe",
1939-1942.
Hybrid Systems for ASR
Tu, Xin / Yan, Yonghong / Cole, Ron:
"Matching training and testing criteria in hybrid speech recognition systems",
1943-1946.
Dupont, Stephane / Ris, Christophe / Deroo, Olivier / Fontaine, Vincent / Boite, Jean-Marc / Zanoni, L.:
"Context independent and context dependent hybrid HMM/ANN systems for vocabulary independent tasks",
1947-1950.
Hennebert, J. / Ris, Christophe / Bourlard, Hervè / Renals, Steve / Morgan, Nelson:
"Estimation of global posteriors and forward-backward training of hybrid HMM/ANN systems",
1951-1954.
Williams, Gethin / Renals, Steve:
"Confidence measures for hybrid HMM/ANN speech recognition",
1955-1958.
Cook, Gary D. / Waterhouse, Steve R. / Robinson, A.J.:
"Ensemble methods for connectionist acoustic modelling",
1959-1962.
Fritsch, Jürgen / Finke, Michael:
"Improving performance on switchboard by combining hybrid HME/HMM and mixture of Gaussians acoustic models",
1963-1966.
Topic and Dialogue Dependent Language Modelling
Witschel, Petra / Höge, Harald:
"Experiments in adaptation of language models for commercial applications",
1967-1970.
Kneser, Reinhard / Peters, Jochen / Klakow, Dietrich:
"Language model adaptation using dynamic marginals",
1971-1974.
Iyer, Rukmini / Ostendorf, Mari:
"Transforming out-of-domain estimates to improve in-domain language models",
1975-1978.
Rao, P. Srinivasa / Dharanipragada, Satya / Roukos, Salim:
"MDI adaptation of language models across corpora",
1979-1982.
Ries, Klaus:
"A class based approach to domain adaptation and constraint integration for empirical m-gram models",
1983-1986.
Seymore, Kristie / Rosenfeld, Ronald:
"Using story topics for language model adaptation",
1987-1990.
Lipreading
Luettin, Juergen:
"Towards speaker independent continuous speechreading",
1991-1994.
Goldenthal, William / Waters, Keith / Thong, Jean-Manuel Van / Glickman, Oren:
"Driving synthetic mouth gestures: phonetic recognition for faceme!",
1995-1998.
Rogozan, Alexandrina / Deleglise, Paul:
"Continuous visual speech recognition using geometric lip-shape models and neural networks",
1999-2002.
Beskow, Jonas / Dahlquist, Martin / Granström, Björn / Lundeberg, Magnus / Spens, Karl-Erik / Öhman, Tobias:
"The teleface project multi-modal speech-communication for the hearing impaired",
2003-2006.
Stiefelhagen, Rainer / Meier, Uwe / Yang, Jie:
"Real-time lip-tracking for lipreading",
2007-2010.
Reveret, Lionel:
"From raw images of the lips to articulatory parameters: a viseme-based prediction",
2011-2014.
Articulatory Modelling
Mathieu, Bruno / Laprie, Yves:
"Adaptation of Maeda's model for acoustic to articulatory inversion",
2015-2018.
Payan, Yohan / Perrier, Pascal:
"Why should speech control studies based on kinematics be considered with caution? insights from a 2d biomechanical model of the tongue.",
2019-2022.
Sanguineti, Vittorio / Laboissiere, Rafael / Ostry, David J.:
"An integrated model of the biomechanics and neural control of the tongue, jaw, hyoid and larynx system",
2023-2026.
Mohammad, M. / Moore, E. / Carter, J.N. / Shadle, C.H. / Gunn, S.J.:
"Using MRI to image the moving vocal tract during speech",
2027-2030.
Vatikiotis-Bateson, Eric / Yehia, Hani:
"Unified physiological model of audible-visible speech production",
2031-2034.
Loevenbruck, Hélène / Perrier, Pascal:
"Motor control information recovering from the dynamics with the EP hypothesis",
2035-2038.
Front-Ends and Adaptation to Acoustics Speaker Adaptation
Komori, Yasuhiro / Kosaka, Tetsuo / Yamada, Masayuki / Yamamoto, Hiroki:
"Speaker adaptation for context-dependent HMM using spatial relation of both phoneme context hierarchy and speakers",
2039-2042.
Yamada, Masayuki / Komori, Yasuhiro / Kosaka, Tetsuo / Yamamoto, Hiroki:
"Fast algorithm for speech recognition using speaker cluster HMM",
2043-2046.
Hazen, Timothy J. / Glass, James R.:
"A comparison of novel techniques for instantaneous speaker adaptation",
2047-2050.
Yamaguchi, Yoshikazu / Takahashi, Satoshi / Sagayama, Shigeki:
"Fast adaptation of acoustic models to environmental noise using jacobian adaptation algorithm",
2051-2054.
Zeljkovic, Ilija / Narayanan, Shrikanth / Potamianos, Alexandros:
"Unsupervised HMM adaptation based on speech-silence discrimination",
2055-2058.
Afify, Mohamed / Gong, Yifan / Haton, Jean-Paul:
"Correlation based predictive adaptation of hidden Markov models",
2059-2062.
Diakoloukas, Vassilios / Digalakis, Vassilios:
"Adaptation of hidden Markov models using multiple stochastic transformations",
2063-2066.
Gales, M. J. F.:
"Transformation smoothing for speaker and environmental adaptation",
2067-2070.
Fontaine, Vincent / Ris, Christophe / Boite, Jean-Marc:
"Nonlinear discriminant analysis for improved speech recognition",
2071-2074.
Tchorz, Jurgen / Kasper, Klaus / Reininger, Herbert / Kollmeier, Bilger:
"On the interplay between auditory-based features and locally recurrent neural networks for robust speech recognition in noise",
2075-2078.
Morgan, Nelson / Fosler, Eric / Mirghafori, Nikki:
"Speech recognition using on-line estimation of speaking rate",
2079-2082.
Holmes, John N. / Holmes, Wendy J. / Garner, Philip N.:
"Using formant frequencies in speech recognition",
2083-2086.
Zhan, Puming / Westphal, Martin / Finke, Michael / Waibel, Alex:
"Speaker normalization and speaker adaptation - a combination for conversational speech recognition",
2087-2090.
Gao, Yuqing / Padmanabhan, Mukund / Picheny, Michael:
"Speaker adaptation based on pre-clustering training speakers",
2091-2094.
Lincoln, Mike / Cox, Stephen / Ringland, Simon:
"A fast method of speaker normalisation using formant estimation",
2095-2098.
Welling, Lutz / Haberland, N. / Ney, Hermann:
"Acoustic front-end optimization for large vocabulary speech recognition",
2099-2102.
Logan, B. T. / Robinson, A. J.:
"Improving autoregressive hidden Markov model recognition accuracy using a non-linear frequency scale with application to speech enhancement",
2103-2106.
Nitta, Tsuneo / Kawamura, Akinori:
"Designing a reduced feature-vector set for speech recognition by using KL/GPD competitive training",
2107-2110.
Chen, Scott Shaobing / DeSouza, Peter:
"Speaker adaptation by correlation (ABC)",
2111-2114.
Speech Perception
Ainsworth, William A. / Meyer, Georg F.:
"Preliminary experiments on the perception of double semivowels",
2115-2118.
Schiller, Niels O.:
"Does syllable frequency affect production time in a delayed naming task?",
2119-2122.
Morris, Andrew C. / Bloothooft, Gerrit / Barry, William J. / Andreeva, Bistra / Koreman, Jacques:
"Human and machine identification of consonantal place of articulation from vocalic transition segments",
2123-2126.
Barker, Jon / Cooke, Martin:
"Modelling the recognition of spectrally reduced speech",
2127-2130.
Pallier, Christophe / Cutler, Anne / Sebastian-Galles, Nuria:
"Prosodic structure and phonetic processing: a cross-linguistic study",
2131-2134.
Son, Rob J. J. H. van / Pols, Louis C. W.:
"The correlation between consonant identification and the amount of acoustic consonant reduction",
2135-2138.
Bonneau, Anne:
"Relevant spectral information for the identification of vowel features from bursts",
2139-2142.
Li, Aijun:
"Perceptual study of intersyllabic formant transitions in synthesized V1-V2 in standard Chinese",
2143-2146.
Skljarov, Oleg P.:
"Role of perception of rhythmically organized speech in consolidation process of long-term memory traces (LTM-traces) and in speech production controlling",
2147-2150.
Lugt, Arie H. van der:
"SEQUENTIAL PROBABILITIES AS a CUE FOR SEGMENTATION",
2151-2154.
Jansens, Susan / Bloothooft, Gerrit / Krom, Guus de:
"PERCEPTION AND ACOUSTICS OF EMOTIONS IN SINGING",
2155-2158.
Pallier, Christophe:
"Phonemes and syllables in speech perception: size of attentional focus in French",
2159-2162.
Tokuma, Shinichi:
"Quality of a vowel with formant undershoot: a preliminary perceptual study",
2163-2166.
Koster, Mariette / Cutler, Anne:
"Segmental and suprasegmental contributions to spoken-word recognition in dutch",
2167-2170.
Behne, Dawn M. / Czigler, Peter E. / Sullivan, Kirk P. H.:
"Perception of vowel duration and spectral characteristics in Swedish",
2171-2174.
Neagu, Adrien / Bailly, Gerard:
"Relative contributions of noise burst and vocalic transitions to the perceptual identification of stop consonants",
2175-2178.
Kitagawa, Satoshi / Hashimoto, Makoto / Higuchi, Norio:
"Effect of speaker familiarity and background noise on acoustic features used in speaker identification",
2179-2182.
Pitermann, Michel:
"Dynamic versus static specification for the perceptual identity of a coarticulated vowel",
2183-2186.
Plauche, Madelaine / Delogu, Cristina / Ohala, John J.:
"Asymmetries in consonant confusion",
2187-2190.
Dumay, Nicolas / Radeau, Monique:
"Rime and syllabic effects in phonological priming between French spoken words",
2191-2194.
Zhu, Weizhong / Kasuya, Hideki:
"Roles of static and dynamic features of formant trajectories in the perception of talk indedivduality",
2195-2198.
Dialogue Systems: Linguistic Structures, Modelling and Evaluation
Lin, Chih-mei / Narayanan, Shrikanth / Ritenour, Russell:
"Database management and analysis for spoken dialog systems: methodology and tools",
2199-2202.
Kamm, Candace / Narayanan, Shrikanth / Dutton, Dawn / Ritenour, Russell:
"Evaluating spoken dialog systems for telecommunication services",
2203-2206.
Pouteau, Xavier / Krahmer, Emiel / Landsbergen, Jan:
"Robust spoken dialogue management for driver information systems",
2207-2210.
Lee, Yue-Shi / Chen, Hsin-Hsi:
"Using acoustic and prosodic cues to correct Chinese speech repairs",
2211-2214.
Dahlbäck, Nils / Jönsson, Arne:
"Integrating domain specific focusing in dialogue models",
2215-2218.
Walker, Marilyn / Hindle, Donald / Fromer, Jeanne / Fabbrizio, Giuseppe Di / Mestel, Craig:
"Evaluating competing agent strategies for a voice email agent",
2219-2222.
Byron, Donna K. / Heeman, Peter A.:
"Discourse marker use in task-oriented spoken dialog \lambda",
2223-2226.
Zue, Victor W. / Seneff, Stephanie / Glass, James / Hetherington, Lee / Hurley, Edward / Meng, Helen / Pao, Christine / Polifroni, Joseph / Schloming, Rafael / Schmid, Philipp:
"From interface to content: translingual access and delivery of on-line information",
2227-2230.
Alexandersson, Jan / Reithinger, Norbert:
"Learning dialogue structures from a corpus",
2231-2234.
Reithinger, Norbert / Klesen, Martin:
"Dialogue act classification using language models",
2235-2238.
Pernel, Didier:
"User's multiple goals in spoken dialogue",
2239-2242.
Suzuki, Noriko / Inokuchi, Seiji / Ishii, K. / Okada, Michio:
"Chatting with interactive agent",
2243-2246.
Churcher, Gavin E. / Atwell, Eric S. / Souter, Clive:
"Generic template for the evaluation of dialogue management systems",
2247-2250.
Niimi, Yasuhisa / Nishimoto, Takuya / Kobayashi, Yutaka:
"Analysis of interactive strategy to recover from misrecognition of utterances including multiple information items",
2251-2254.
Mathieu, Francois-Arnould / Gaiffe, Bertrand / Pierrel, Jean-Marie:
"A referential approach to reduce perplexity in the vocal command system comppa",
2255-2258.
Thanopoulos, Aristomenis / Fakotakis, Nikos / Kokkinakis, George:
"Linguistic processor for a spoken dialogue system based on island parsing techniques",
2259-2262.
Mellor, Brian / Baber, Chris:
"Modelling of speech-based user interfaces",
2263-2266.
Hockey, Beth Ann / Rossen-Knill, Deborah / Spejewski, Beverly / Stone, Matthew / Isard, Stephen:
"Can you predict responses to yes/no questions? yes, no, and stuff",
2267-2270.
Möller, Jens-Uwe:
"Dia-moLE: an unsupervised learning approach to adaptive dialogue models for spoken dialogue systems",
2271-2274.
Gustafson, Joakim / Larsson, Anette / Carlson, Rolf / Hellman, K.:
"How do system questions influence lexical choices in user answers?",
2275-2278.
Speaker Recognition and Language Identification
Yuo, Kuo-Hwei / Wang, Hsiao-Chuan:
"Gaussian mixture models with common principal axes and their application in text-independent speaker identification",
2279-2282.
Dersch, Dominik R. / King, Robin W.:
"Speaker models designed from complete data sets: a new approach to text-independent speaker verification",
2283-2286.
Vergin, Rivarol / O'Shaughnessy, Douglas:
"A double Gaussian mixture modeling approach to speaker recognition",
2287-2290.
Afify, Mohamed / Gong, Yifan / Haton, Jean-Paul:
"An acoustic subword unit approach to non-linguistic speech feature identification",
2291-2294.
Tadj, Chakib / Dumouchel, Pierre / Fang, Yu:
"N-best GMM's for speaker identification",
2295-2298.
Gravier, Guillaume / Mokbel, Chafic / Chollet, Gerard:
"Model dependent spectral representations for speaker recognition",
2299-2302.
Auckenthaler, Roland / Mason, John S.:
"Equalizing sub-band error rates in speaker recognition",
2303-2306.
Slomka, Stefan / Sridharan, Sridha:
"Automatic gender identification under adverse conditions",
2307-2310.
Lavner, Yizhar / Gath, Isak / Rosenhouse, Judith:
"Acoustic features and perceptive processes in the identification of familiar voices",
2311-2314.
Rodriguez-Linares, Leandro / Garcia-Mateo, Carmen:
"On the use of acoustic segmentation in speaker identification",
2315-2318.
Steeneken, Herman J. M. / Leeuwen, David A. van:
"Speaker recognition by humans and machines",
2319-2322.
Kumpf, Karsten / King, Robin W.:
"Foreign speaker accent classification using phoneme-dependent accent discrimination models and comparisons with human perception benchmarks",
2323-2326.
Liu, Li / He, Jialong / Palm, Günther:
"A comparison of human and machine in speaker recognition",
2327-2330.
Goddijn, Simo M. A. / Krom, Guus de:
"Evaluation of second language learners' pronunciation using hidden Markov models",
2331-2334.
Eberman, Brian / Moreno, Pedro J.:
"Delta vector taylor series environment compensation for speaker recognition",
2335-2338.
Hume, Jonathan:
"Wavelet-like regression features in the cepstral domain for speaker recognition",
2339-2342.
Chengalvarayan, Rathinavelu:
"Minimum classification error linear regression (MCELR) for speaker adaptation using HMM with trend functions",
2343-2346.
Fakotakis, Nikos / Georgila, Kallirroi / Tsopanoglou, Anastasios:
"A continuous HMM text-independent speaker recognition system based on vowel spotting",
2347-2350.
Koolwaaij, Johan W. / Boves, Lou:
"On the independence of digits in connected digit strings",
2351-2354.
Koolwaaij, Johan W. / Boves, Lou:
"A new procedure for classifying speakers in speaker verification systems",
2355-2358.
Montacié, Claude / Caraty, Marie-José:
"SOUND CHANNEL VIDEO INDEXING",
2359-2362.
Hernando, Javier / Nadeu, Climent:
"CDHMM speaker recognition by means of frequency filtering of filter-bank energies",
2363-2366.
Style and Accent Recognition
Humphries, J. J. / Woodland, P. C.:
"Using accent-specific pronunciation modelling for improved large vocabulary continuous speech recognition",
2367-2370.
Potamianos, Alexandros / Narayanan, Shrikanth / Lee, Sungbok:
"Automatic speech recognition for children",
2371-2374.
Teixeira, Carlos / Trancoso, Isabel / Serralheiro, Antonio:
"Recognition of non-native accents",
2375-2378.
Finke, Michael / Waibel, Alex:
"Speaking mode dependent pronunciation modeling in large vocabulary conversational speech recognition",
2379-2382.
Shriberg, Elizabeth / Bates, Rebecca / Stolcke, Andreas:
"A prosody only decision-tree model for disfluency detection",
2383-2386.
Bou-Ghazale, Sahar E. / Hansen, John H. L.:
"A novel training approach for improving speech recognition under adverse stressful conditions",
2387-2390.
Towards Robust ASR for Car and Telephone Applications
Fissore, L. / Micca, Giorgio / Vair, C.:
"Methods for microphone equalization in speech recognition",
2415-2418.
Nakamura, Satoshi / Shikano, Kiyohiro:
"Room acoustics and reverberation: impact on hands-free recognition",
2419-2422.
Faucon, Gerard / Bouquin-Jeannes, Regine Le:
"Echo and noise reduction for hands-free terminals - state of the art -",
2423-2426.
Haeb-Umbach, Reinhold:
"Robust speech recognition for wireless networks and mobile telephony",
2427-2430.
Compernolle, Dirk Van:
"Speech recognition in the car from phone dialing to car navigation",
2431-2434.
Language-Specific Systems
Williams, Briony / Isard, Stephen:
"A keyvowel approach to the synthesis of regional accents of English",
2435-2438.
Ferencz, Attila / Arsinte, Radu / Nagy, Istvan / Ratiu, Teodora / Ferencz, Maria / Toderean, Gavril / Zaiu, Diana / Kovacs, Tunde-Csilla / Simon, Lajos:
"Experimental implementation of pitch-synchronous synthesis methods for the ROMVOX text-to-speech system",
2439-2442.
Möbius, Bernd / Sproat, Richard / Santen, Jan P. H. van / Olive, Joseph P.:
"The bell labs German text-to-speech system: an overview",
2443-2446.
Fitt, Susan:
"The generation of regional pronunciations of English for speech synthesis",
2447-2450.
Pavlova, Elena / Pavlov, Yuri / Sproat, Richard / Shih, Chilin / Santen, Jan P. H. van:
"Bell laboratories Russian text-to-speech system",
2451-2454.
Bonafonte, Antonio / Esquerra, Ignasi / Febrer, Albert / Vallverdu, Francesc:
"A bilingual text-to-speech system in Spanish and catalan",
2455-2458.
Pronunciation Models
Cremelie, Nick / Martens, Jean-Pierre:
"Automatic rule-based generation of word pronunciation networks",
2459-2462.
Elvira, Jose Maria / Torrecilla, Juan Carlos / Caminero, Javier:
"Creating user defined new vocabularies for voice dialing",
2463-2466.
Ravishankar, Mosur / Eskenazi, Maxine:
"Automatic generation of context-dependent pronunciations",
2467-2470.
Fukada, Toshiaki / Sagisaka, Yoshinori:
"Automatic generation of a pronunciation dictionary based on a pronunciation network",
2471-2474.
Jost, Uwe / Heine, Henrik / Evermann, Gunnar:
"What is wrong with the lexicon - an attempt to model pronunciations probabilistically",
2475-2478.
Markey, Kevin L. / Ward, Wayne:
"Lexical tuning based on triphone confidence estimation",
2479-2482.
Auditory Modelling and Psychoacoustics
Berthommier, Frédéric / Meyer, Georg:
"Improving of amplitude modulation maps for F0-dependent segregation of harmonic sounds",
2483-2486.
Kortekaas, Reinier / Kohlrausch, Armin:
"Psychophysical evaluation of PSOLA: natural versus synthetic speech",
2487-2490.
Lublinskaja, Valentina V. / Koroleva, Inna V. / Kornev, A.N. / Iagounova, Elena V.:
"Perception of noised words by normal children and children with speech and language impairments",
2491-2494.
Meyer, Georg F. / Ainsworth, William A.:
"Modelling the perception of simultaneous semi-vowels",
2495-2498.
Perdigao, Fernando S. / Sa, Luis V.:
"PROPERTIES OF AUDITORY MODEL REPRESENTATIONS",
2499-2502.
Marta, Eduardo Sa / Sa, Luis Vieira de:
"Impact of "ascending sequence" AI (auditory primary cortex) cells on stop consonant perception",
2503-2506.
Voice Conversion and Data Driven F0-Models
Santen, Jan P. H. van:
"Combinatorial issues in text-to-speech synthesis",
2507-2510.
Boeffard, Olivier / Emerard, F.:
"Application-dependent prosodic models for text-to-speech synthesis and automatic design of learning database corpus using genetic algorithm",
2511-2514.
Lopez-Gonzalo, Eduardo / Rodriguez-Garcia, Jose M. / Hernandez-Gomez, Luis / Villar, Juan M.:
"Automatic corpus-based training of rules for prosodic generation in text-to-speech",
2515-2518.
Kim, Eun-Kyoung / Lee, Sangho / Oh, Yung-Hwan:
"Hidden Markov model based voice conversion using dynamic characteristics of speaker",
2519-2522.
Yoshimura, Takayoshi / Masuko, Takashi / Tokuda, Keiichi / Kobayashi, Takao / Kitamura, Tadashi:
"Speaker interpolation in HMM-based speech synthesis system",
2523-2526.
Darsinos, Vassilios / Galanis, Dimitrios / Kokkinakis, George:
"Designing a speaker adaptable formant-based text-to-speech system",
2527-2530.
Vocal Tract Analysis
Maragos, Petros / Potamianos, Alexandros:
"On using fractal features of speech sounds in automatic speech recognition",
2531-2534.
Richards, Hywel B. / Bridle, John S. / Hunt, Melvyn J. / Mason, John S.:
"Dynamic constraint weighting in the context of articulatory parameter estimation",
2535-2538.
Lee, Minkyu / Childers, Donald G.:
"Estimation of vocal tract front cavity resonance in unvoiced fricative speech",
2539-2542.
Teixeira, Antonio / Vaz, Francisco / Principe, Jose Carlos:
"A software tool to study portuguese vowels",
2543-2546.
Schoentgen, Jean / Ciocea, Sorin:
"Post-synchronization via formant-to-area mapping of asynchronously recorded speech signals and area functions",
2547-2550.
Yu, Zhenli L. / Ching, P.C.:
"Geometrically and acoustically optimized codebook for unique mapping from formants to vocal-tract shape",
2551-2554.
F0 and Duration Modelling, Spoken language processing
Riedi, Marcel:
"Modeling segmental duration with multivariate adaptive regression splines",
2627-2630.
Malfrere, Fabrice / Dutoit, Thierry:
"High-quality speech synthesis for phonetic speech segmentation",
2631-2634.
Campbell, Nick / Itoh, Yoshiharu / Ding, Wen / Higuchi, Norio:
"Factors affecting perceived quality and intelligibility in the CHATR concatenative speech synthesiser",
2635-2638.
Neukirchen, Christoph / Willett, Daniel / Rigoll, Gerhard:
"Reduced lexicon trees for decoding in a MMIi-connectionist/HMM speech recognition system",
2639-2642.
Veronis, Jean / Cristo, Philippe Di / Courtois, Fabienne / Lagrue, Benoit:
"A stochastic model of intonation for French text-to-speech synthesis",
2643-2646.
Sanderman, Angelien A. / Collier, Renè:
"Phonetic rules for a phonetic-to-speech system",
2647-2650.
Santen, Jan van / Shih, Chilin / Möbius, Bernd / Tzoukermann, Evelyne / Tanenblatt, Michael:
"Multi-lingual duration modeling",
2651-2654.
Barbosa, Plinio A.:
"A model of segment (and pause) duration generation for Brazilian Portuguese text-to-speech synthesis",
2655-2658.
Halber, Ariane / Roussel, David:
"Parsing strategy for spoken language interfaces with a lexicalized tree grammar",
2659-2662.
Amtrup, Jan W. / Heine, Henrik / Jost, Uwe:
"What's in a word graph evaluation and enhancement of word lattices?",
2663-2666.
Tillmann, C. / Vogel, S. / Ney, Hermann / Zubiaga, A. / Sawaf, H.:
"Accelerated DP based search for statistical translation",
2667-2670.
Fujisawa, Ken / Hirai, Toshio / Higuchi, Norio:
"Use of pitch pattern improvement in the CHATR speech synthesis system",
2671-2674.
Corrigan, G. / Massey, N. / Karaali, O.:
"Generating segment durations in a text-zo-speech system: a hybrid rule-based/neural network approach",
2675-2678.
Ishikawa, Yasushi / Ebihara, Takashi:
"On the global FO shape model using a transition network for Japanese text-to-speech systems",
2679-2682.
Colás, José / Montero, Juan M. / Ferreiros, Javier / Pardo, José M.:
"An alternative and flexible approach in robust information retrieval systems",
2683-2686.
Horiguchi, Keiko / Franz, Alexander:
"A probabilistic approach to analogical speech translation",
2687-2690.
Caraty, Marie-José / Montacié, Claude / Lefèvre, Fabrice:
"Dynamic lexicon for a very large vocabulary vocal dictation",
2691-2694.
Language Modelling
Segarra, E. / Hurtado, L.:
"Construction of language models using the morphic generator grammatical inference (MGGI) methodology",
2695-2698.
Zhang, Shuwu / Huang, Taiyi:
"An integrated language modeling with n-gram model and WA model for speech recognition",
2699-2702.
Wang, Ye-Yi / Waibel, Alex:
"Statistical analysis of dialogue structure",
2703-2706.
Clarkson, Philip / Rosenfeld, Ronald:
"Statistical language modeling using the CMU-cambridge toolkit",
2707-2710.
Adda, Gilles / Adda-Decker, Martine / Gauvain, Jean-Luc / Lamel, Lori:
"Text normalization and speech recognition in French",
2711-2714.
Damnati, G. / Simonin, J.:
"A novel tree-based clustering algorithm for statistical language modeling",
2715-2718.
Matsunaga, Shoichi / Sagayama, Shigeki:
"Variable-length language modeling integrating global constraints",
2719-2722.
Smaili, K. / Zitouni, I. / Charpillet, F. / Haton, Jean-Paul:
"An hybrid language model for a continuous dictation prototype",
2723-2726.
Pérennou, Guy / Pousse, L.:
"Dealing with pronunciation variants at the language model level for the continuous automatic speech recognition of French",
2727-2730.
Schukat-Talamazzini, Ernst Günter / Gallwit, Florian / Harbeck, Stefan / Warnke, Volker:
"Rational interpolation of maximum likelihood predictors in stochastic language modeling",
2731-2734.
Ito, Akinori / Saitoh, Hideyuki / Katoh, Masaharu / Kohda, Masaki:
"N-gram language model adaptation using small corpus for spoken dialog recognition",
2735-2738.
Siu, Manhung / Ostendorf, Mari:
"Variable n-gram language modeling and extensions for conversational speech",
2739-2742.
Geutner, Petra:
"Fuzzy class rescoring: a part-of-speech language model",
2743-2746.
Nagai, Akito / Ishikawa, Yasushi:
"Speech understanding based on integrating concepts by conceptual dependency",
2747-2750.
Brugnara, Fabio / Federico, Marcello:
"Dynamic language models for interactive speech applications",
2751-2754.
Demetriou, George / Atwell, Eric / Souter, Clive:
"Large-scale lexical semantics for speech recognition support",
2755-2758.
Tsukada, Hajime / Yamamoto, Hirofumi / Sagisaka, Yoshinori:
"Integration of grammar and statistical language constraints for partial word-sequence recognition",
2759-2762.
Taylor, Paul / King, Simon / Isard, Stephen / Wright, Helen / Kowtko, Jacqueline:
"Using intonation to constrain language models in speech recognition",
2763-2766.
Heeman, Peter A. / Allen, James F.:
"Incorporating POS tagging into language modeling",
2767-2770.
Uhrik, C. / Ward, W.:
"Confidence metrics based on n-gram language model backoff behaviors",
2771-2774.
Chelba, Ciprian / Engle, David / Jelinek, Frederick / Jimenez, Victor / Khudanpur, Sanjeev / Mangu, Lidia / Printz, Harry / Ristad, Eric / Rosenfeld, Ronald / Stolcke, Andreas / Wu, Dekai:
"Structure and performance of a dependency language model",
2775-2778.
Stolcke, Andreas:
"Modeling linguistic segment and turn boundaries for n-best rescoring of spontaneous speech",
2779-2782.
Kenne, P. E. / O'Kane, Mary:
"Hybrid language models: is simpler better?",
2783-2786.
Brants, Thorsten:
"Internal and external tagsets in part-of-speech tagging",
2787-2790.
Auditory Modelling and Psychoacoustics, Neural Networks for Speech Processing and Recognition
Varin, Laurent / Berthommier, Frédéric:
"A probabilistic model of double-vowel segregation",
2791-2794.
Houshang, Habibzadeh V. / Shigeyoshi, Kitazawa:
"Stimulus signal estimation from auditory-neural transduction inverse processing",
2795-2798.
Tadj, Chakib / Dumouchel, Pierre / Poirier, Franck:
"FDVQ based keyword spotter which incorporates a semi-supervised learning for primary processing",
2799-2802.
Lublinskaja, V. V. / Sappok, Christian:
"The initial time Span of auditory processing used for speaker attribution of the speech signal",
2803-2806.
Ström, Nikko:
"Sparse connection and pruning in large dynamic artificial neural networks",
2807-2810.
Teodorescu, Roxana / Compernolle, Dirk Van / Dologlou, Ioannis:
"A modular initialization scheme for better speech recognition performance using hybrid systems of MLPs/HMMs",
2811-2814.
Chernigovskaya, Tatiana V.:
"Lateralization for auditory perception of foreign words",
2815-2818.
Kosarev, Yuri / Jarov, Pavel / Osipov, Alexander:
"The structural weighted sets method for continuous speech and text recognition",
2819-2822.
Sumner, C. J. / Gillies, D. F.:
"Lateral inhibitory networks for auditory processing",
2823-2826.
Reetz, Henning:
"Missing fundamentals: a problem of auditory or mental processing?",
2827-2830.
Freitag, F. / Monte, E. / Salavedra, J.:
"Predictive neural networks applied to phoneme recognition",
2831-2834.
Suhardi / Fellbaum, Klaus:
"Empirical comparison of two multilayer perceptron-based keyword speech recognition algorithms",
2835-2838.
Fukada, Toshiaki / Aveline, Sophie / Schuster, Mike / Sagisaka, Yoshinori:
"Segment boundary estimation using recurrent neural networks",
2839-2842.
Schuster, Mike:
"Incorporation of HMM output constraints in hybrid NN/HMM systems during training",
2843-2846.
Babkina, Ludmila / Koval, Sergey / Molchanov, Alexander:
"Principles of the hearing periphery functioning in new methods of pitch detection and speech enhancement",
2847-2850.
Meunier, Christine / Content, Alain / Frauenfelder, Uli H. / Kearns, Ruth:
"The locus of the syllable effect: prelexical or lexical?",
2851-2854.
Lickley, Robin J. / Bard, Ellen G.:
"On not remembering disfluencies",
2855-2858.
Andringa, T.:
"Using an auditory model and leaky autocorrelators to tune in to speech",
2859-2862.