Plenary Talks
Church, Kenneth Ward:
"Speech and language processing: where have we been and where are we going?",
1-4.
Kollmeier, Birger:
"Auditory principles in speech processing - do computers need silicon ears ?",
5-8.
Aurora Noise Robustness on SMALL Vocabulary Databases
Yao, Kaisheng / Visser, Erik / Kwon, Oh-Wook / Lee, Te-Won:
"A speech processing front-end with eigenspace normalization for robust speech recognition in noisy automobile environments",
9-12.
Lai, Yiu-Pong / Siu, Man-Hung:
"Maximum likelihood normalization for robust speech recognition",
13-16.
Stouten, Veronique / Hamme, Hugo van / Demuynck, Kris / Wambacq, Patrick:
"Robust speech recognition using model-based feature enhancement",
17-20.
Wu, Jian / Huo, Qiang:
"Several HKU approaches for robust speech recognition and their evaluation on Aurora connected digit recognition tasks",
21-24.
Wang, Yadong / Hansen, Jesse / Allu, Gopi Krishna / Kumaresan, Ramdas:
"Average instantaneous frequency (AIF) and average log-envelopes (ALE) for ASR with the Aurora 2 database",
25-28.
Sasou, Akira / Asano, Futoshi / Tanaka, Kazuyo / Nakamura, Satoshi:
"Adaptation of acoustic model using the gain-adapted HMM decomposition method",
29-32.
ISCA Special Interest Group Session: "Hot Topics" in Speech Science and Technology
Bonastre, Jean-Francois / Bimbot, Frédéric / Boe, Louis-Jean / Campbell, Joseph P. / Reynolds, Douglas A. / Magrin-Chagnolleau, Ivan:
"Person authentication by voice: a need for caution",
33-36.
Bailly, Gérard / Campbell, Nick / Möbius, Bernd:
"ISCA special session: hot topics in speech synthesis",
37-40.
Gelder, Beatrice de:
"Perceiving emotions by ear and by eye",
41-44.
Greenberg, Steven:
"Strategies for automatic multi-tier annotation of spoken language corpora",
45-48.
Lee, Lin-shan / Ho, Yuan / Chen, Jia-fu / Chen, Shun-Chuan:
"Why is the special structure of the language important for Chinese spoken language processing? - examples on spoken document retrieval, segmentation and summarization",
49-52.
Speech Signal Processing 1-4
Weruaga, Luis / Kepesi, Marian:
"Speech analysis with the short-time chirp transform",
53-56.
Arroabarren, Ixone / Carlosena, Alfonso:
"Glottal spectrum based inverse filtering",
57-60.
Kiran, G.V. / Sreenivas, T.V.:
"A novel method of analysing and comparing responses of hearing aid algorithms using auditory time-frequency representation",
61-64.
Paliwal, Kuldip K. / Atal, Bishnu S.:
"Frequency-related representation of speech",
65-68.
Raykar, Vikas C. / Duraiswami, Ramani / Yegnanarayana, B. / Prasanna, S.R. Mahadeva:
"Tracking a moving speaker using excitation source information",
69-72.
Deng, Li / Bazzi, Issam / Acero, Alex:
"Tracking vocal tract resonances using an analytical nonlinear predictor and a target-guided temporal constraint",
73-76.
Lashkari, Khosrow / Miki, Toshio:
"Optimization of the CELP model in the LSP domain",
1709-1712.
Gillett, Ben / King, Simon:
"Transforming voice quality",
1713-1716.
Hioka, Yusuke / Hamada, Nozomu:
"DOA estimation of speech signal using equilateral-triangular microphone array",
1717-1720.
Potamitis, Ilyas / Tremoulis, George / Fakotakis, Nikos / Kokkinakis, George:
"Multi-array fusion for beamforming and localization of moving speakers",
1721-1724.
Shao, Xu / Milner, Ben P. / Cox, Stephen J.:
"Integrated pitch and MFCC extraction for speech reconstruction and speech recognition applications",
1725-1728.
Laaksonen, Lasse / Himanen, Sakari / Heikkinen, Ari / Nurminen, Jani:
"Exploiting time warping in AMR-NB and AMR-WB speech coders",
1729-1732.
Grashey, Stephan:
"A new approach to voice activity detection based on self-organizing maps",
1733-1736.
Shiga, Yoshinori / King, Simon:
"Estimating the spectral envelope of voiced speech using multi-frame analysis",
1737-1740.
Jafer, Essa / Mahdi, Abdulhussain E.:
"Adaptive noise estimation using second generation and perceptual wavelet transforms",
1741-1744.
Bourgeois, Julien:
"A clustering approach to on-line audio source separation",
1745-1748.
Shiga, Yoshinori / King, Simon:
"Estimation of voice source and vocal tract characteristics based on multi-frame analysis",
1749-1752.
En-Najjary, Taoufik / Rosec, Olivier / Chonavel, Thierry:
"A new method for pitch prediction from spectral envelope and its application in voice conversion",
1753-1756.
Orlandi, Marco / Santarelli, Alfiero / Falavigna, Daniele:
"Maximum likelihood endpoint detection with time-domain features",
1757-1760.
Arroabarren, Ixone / Carlosena, Alfonso:
"Unified analysis of glottal source spectrum",
1761-1764.
Bouzid, Aicha / Ellouze, Noureddine:
"Local regularity analysis at glottal opening and closure instants in electroglottogram signal using wavelet transform modulus maxima",
2837-2840.
Schaffoner, M. / Katz, M. / Kruger, S.E. / Wendemuth, A.:
"Improved robustness of automatic speech recognition using a new class definition in linear discriminant analysis",
2841-2844.
Turk, Oytun / Arslan, Levent M.:
"Voice conversion methods for vocal tract and pitch contour modification",
2845-2848.
Schreiner, Olaf:
"Modulation spectrum for pitch and speech pause detection",
2849-2852.
Dimitriadis, Dimitrios / Maragos, Petros:
"Robust energy demodulation based on continuous models with application to speech recognition",
2853-2856.
Kim, Jong Uk / Kim, SangGyun / Yoo, Chang D.:
"A robust and sensitive word boundary decision algorithm",
2857-2860.
Seo, Seongho / Jang, Dalwon / Lee, Sunil / Yoo, Chang D.:
"A novel transcoding algorithm for SMV and g.723.1 speech coders via direct parameter transformation",
2861-2864.
Jang, Dalwon / Seo, Seongho / Lee, Sunil / Yoo, Chang D.:
"A novel rate selection algorithm for transcoding CELP-type codec and SMV",
2865-2868.
Choy, G. / Hermann, D. / Brennan, R.L. / Schneider, T. / Sheikhzadeh, H. / Cornu, E.:
"Subband-based acoustic shock limiting algorithm on a low-resource DSP system",
2869-2872.
Pelle, Patricia A. / Capeletto, Matias L.:
"Pitch estimation using phase locked loops",
2873-2876.
Arifianto, Dhany / Kobayashi, Takao:
"Performance evaluation of IFAS-based fundamental frequency estimator in noisy environment",
2877-2880.
Kruschke, Hans / Lenz, Michael:
"Estimation of the parameters of the quantitative intonation model with continuous wavelet analysis",
2881-2884.
Rodriguez, Francisco Romero / Liu, Wei M. / Evans, Nicholas W.D. / Mason, John S.D.:
"Morphological filtering of speech spectrograms in the context of additive noise",
2885-2888.
Lathoud, Guillaume / McCowan, Iain A. / Moore, Darren C.:
"Segmenting multiple concurrent speakers using microphone arrays",
2889-2892.
Nagarajan, T. / Murthy, Hema A. / Hegde, Rajesh M.:
"Segmentation of speech into syllable-like units",
2893-2896.
Petrillo, Massimo / Cutugno, Francesco:
"A syllable segmentation algorithm for English and italian",
2913-2916.
Verma, Ashish / Kumar, Arun:
"Modeling speaking rate for voice fonts",
2917-2920.
Pohjalainen, Jouni:
"A new HMM-based approach to broad phonetic classification of speech",
2921-2924.
Zhong, Xin / Clements, Mark A. / Lim, Sung:
"Acoustic change detection and segment clustering of two-way telephone conversations",
2925-2928.
Levin, David N.:
"Blind normalization of speech from different channels",
2929-2932.
Gurijala, A.R. / Deller Jr., J.R.:
"Speech watermarking by parametric embedding with an l_(infinity) fidelity criterion",
2933-2936.
Phonology and Phonetics I
Tseng, Shu-Chuan:
"Features of contracted syllables of spontaneous Mandarin",
77-80.
Samudravijaya, K.:
"Durational characteristics of hindi stop consonants",
81-84.
Isei-Jaakkola, Toshiko:
"Quantity comparison of Japanese and finnish in various word structures",
85-88.
Baltazani, Mary:
"Broad focus across sentence types in greek",
89-92.
Hansakunbuntheung, Chatchawarn / Tesprasit, Virongrong / Siricharoenchai, Rungkarn / Sagisaka, Yoshinori:
"Analysis and modeling of syllable duration for Thai speech synthesis",
93-96.
Chen, Aoju:
"Reaction time as an indicator of discrete intonational contrasts in English",
97-100.
Gibbon, Dafydd:
"Corpus-based syntax-prosody tree matching",
761-764.
Ying, D.W. / Gao, W. / Wang, W.Q.:
"A new approach to segment and detect syllables from high-speed speech",
765-768.
Son, R.J.J.H. van / Pols, Louis C.W.:
"Information structure and efficiency in speech production",
769-772.
Corazza, Anna / Bosch, Louis ten:
"Learning rule ranking by dynamic construction of context-free grammars using AND/OR graphs",
773-776.
Zvonik, Elena / Cummins, Fred:
"The effect of surrounding phrase lengths on pause duration",
777-780.
Okawa, Shigeki / Shirai, Katsuhiko:
"Statistical estimation of phoneme's most stable point based on universal constraint",
781-784.
Beringer, N.:
"Independent automatic segmentation by self-learning categorial pronunciation rules",
785-788.
Braun, Bettina / Ladd, D. Robert:
"Prosodic correlates of contrastive and non-contrastive themes in German",
789-792.
Chen, Yiya:
"Accentual lengthening in standard Chinese: evidence from four-syllable constituents",
793-796.
Kanokphara, Supphanat:
"Syllable structure based phonetic units for context-dependent continuous Thai speech recognition",
797-800.
Hu, Fang:
"An acoustic phonetic analysis of diphthongs in ningbo Chinese",
801-804.
Otake, Takashi / Sakamoto, Yoko:
"Latent ability to manipulate phonemes by Japanese preliterates in roman alphabet",
805-808.
Pfitzinger, Hartmut R.:
"The /i/-/a/-/u/-ness of spoken vowels",
809-812.
Topics in Prosody and Emotional Speech
Gillett, Ben / King, Simon:
"Transforming F0 contours",
101-104.
Cook, Norman D. / Fujisawa, Takeshi / Takami, Kazuaki:
"Evaluation of the affect of speech intonation using a model of the perception of interval dissonance and harmonic tension",
105-108.
Lai, Wen-Hsing / Wang, Yih-Ru / Chen, Sin-Horng:
"A new pitch modeling approach for Mandarin speech",
109-112.
Zervas, P. / Maragoudakis, M. / Fakotakis, Nikos / Kokkinakis, George:
"Bayesian induction of intonational phrase breaks",
113-116.
Ehrette, T. / Chateau, N. / d'Alessandro, Christophe / Maffiolo, V.:
"Predicting the perceptive judgment of voices in a telecom context: selection of acoustic parameters",
117-120.
Mattys, Sven L.:
"Stress-based speech segmentation revisited",
121-124.
Kwon, Oh-Wook / Chan, Kwokleung / Hao, Jiucang / Lee, Te-Won:
"Emotion recognition by speech signals",
125-128.
Tamburini, Fabio:
"Automatic prosodic prominence detection in speech using acoustic features: an unsupervised system",
129-132.
Hozjan, Vladimir / Kacic, Zdravko:
"Improved emotion recognition with large set of statistical features",
133-136.
Charnvivit, Patavee / Thubthong, Nuttakorn / Maneenoi, Ekkarit / Luksaneeyanawin, Sudaporn / Jitapunkul, Somchai:
"Recognition of intonation patterns in Thai utterance",
137-140.
Hirose, Keikichi / Furuyama, Yusuke / Narusawa, Shuichi / Minematsu, Nobuaki / Fujisaki, Hiroya:
"Use of linguistic information for automatic extraction of f_0 contour generation process model parameters",
141-144.
Dohen, Marion / Loevenbruck, Hélčne / Cathiard, Marie-Agnes / Schwartz, Jean-Luc:
"Potential audiovisual correlates of contrastive focus in French",
145-148.
Hatano, Toshie / Horiuchi, Yasuo / Ichikawa, Akira:
"How does human segment the speech by prosody ?",
149-152.
Walker, B.D. / Lackey, B.C. / Muller, J.S. / Schone, P.J.:
"Language-reconfigurable universal phone recognition",
153-156.
Lee, Chul Min / Narayanan, Shrikanth:
"Emotion recognition using a data-driven fuzzy inference system",
157-160.
Suzuki, Noriko / Yabuta, Yohei / Takeuchi, Yugo / Katagiri, Yasuhiro:
"Effects of voice prosody by computers on human behaviors",
161-164.
Jokisch, Oliver / Kuhne, Marco:
"An investigation of intensity patterns for German",
165-168.
Teixeira, Joao Paulo / Freitas, Diamantino:
"Segmental durations predicted with a neural network",
169-172.
Yamashita, Takumi / Sagisaka, Yoshinori:
"Generation and perception of f_0 markedness in conversational speech with adverbs expressing degrees",
173-176.
Mixdorff, Hansjorg / Bach, Nguyen Hung / Fujisaki, Hiroya / Luong, Mai Chi:
"Quantitative analysis and synthesis of syllabic tones in vietnamese",
177-180.
Kiriyama, Shinya / Mitsuta, Yoshifumi / Hosokawa, Yuta / Hashimoto, Yoshikazu / Ito, Toshihiko / Kitazawa, Shigeyoshi:
"Japanese prosodic labeling support system utilizing linguistic information",
181-184.
Auberge, Veronique / Audibert, Nicolas / Rilliard, Albert:
"Why and how to control the authentic emotional speech corpora",
185-188.
Devillers, Laurence / Vasilescu, Ioana:
"Prosodic cues for emotion characterization in real-life spoken dialogs",
189-192.
Language Modeling, Discourse and Dialog
Polifroni, Joseph / Chung, Grace / Seneff, Stephanie:
"Towards the automatic generation of mixed-initiative dialogue systems from web content",
193-196.
Filisko, Edward / Seneff, Stephanie:
"A context resolution server for the galaxy conversational systems",
197-200.
Hardy, Hilda / Baker, Kirk / Bonneau-Maynard, Hélčne / Devillers, Laurence / Rosset, Sophie / Strzalkowski, Tomek:
"Semantic and dialogic annotation for automated multilingual customer service",
201-204.
Nicholson, H.B.M. / Bard, E.G. / Anderson, A.H. / Flecha-Garcia, M.L. / Kenicer, D. / Smallwood, L. / Mullin, J. / Lickley, R.J. / Chen, Y.:
"Disfluency under feedback and time-pressure",
205-208.
Heeman, Peter A. / Yang, Fan / Strayer, Susan E.:
"Control in task-oriented dialogues",
209-212.
McTait, Kevin / Adda-Decker, Martine:
"The 300k LIMSI German broadcast news transcription system",
213-216.
Tian, Jilei / Suontausta, Janne / Hakkinen, Juha:
"Weighted entropy training for the decision tree based text-to-phoneme mapping",
217-220.
Ogawa, Yoshihiko / Yamamoto, Hirofumi / Sagisaka, Yoshinori / Kikui, Genichiro:
"Word class modeling for speech recognition with out-of-task words using a hierarchical language model",
221-224.
Ordelman, Roeland / Hessen, Arjan van / Jong, Franciska de:
"Compound decomposition in dutch large vocabulary speech recognition",
225-228.
Savova, Guergana / Bachenko, Joan:
"Designing for errors: similarities and differences of disfluency rates and prosodic characteristics across domains",
229-232.
Wester, Mirjam:
"Syllable classification using articulatory-acoustic features",
233-236.
Zitouni, Imed / Siohan, Olivier / Lee, Chin-Hui:
"Hierarchical class n-gram language models: towards better estimation of unseen events in speech recognition",
237-240.
Barrachina, Sergio / Vilar, Juan Miguel:
"Incremental and iterative monolingual clustering algorithms",
241-244.
Venkataraman, Anand / Wang, Wen:
"Techniques for effective vocabulary selection",
245-248.
Galescu, Lucian:
"Recognition of out-of-vocabulary words with sub-lexical language models",
249-252.
Bonneau-Maynard, Hélčne / Rosset, Sophie:
"A semantic representation for spoken dialogs",
253-256.
Adda-Decker, Martine:
"A corpus-based decompounding algorithm for German lexical modeling in LVCSR",
257-260.
Lee, Kyong-Nim / Chung, Minhwa:
"Modeling cross-morpheme pronunciation variations for korean large vocabulary continuous speech recognition",
261-264.
Speech Synthesis: Unit Selection 1, 2
Zhou, Yi / Zu, Yiqing:
"Unit selection based on voice recognition",
265-268.
Xu, Jun / Choy, Thomas / Dong, Minghui / Guan, Cuntai / Li, Haizhou:
"On unit analysis for Cantonese corpus-based TTS",
269-272.
Lambert, T. / Breen, Andrew P. / Eggleton, Barry / Cox, Stephen J. / Milner, Ben P.:
"Unit selection in concatenative TTS synthesis systems based on mel filter bank amplitudes and phonetic context",
273-276.
Bozkurt, Baris / Ozturk, Ozlem / Dutoit, Thierry:
"Text design for TTS speech corpus building using a modified greedy selection",
277-280.
Park, Seung Seop / Kim, Chong Kyu / Kim, Nam Soo:
"Discriminative weight training for unit-selection based speech synthesis",
281-284.
Rutten, Peter / Fackrell, Justin:
"The application of interactive speech unit selection in TTS systems",
285-288.
Diaz, Francisco Campillo / Banga, Eduardo R.:
"On the design of cost functions for unit-selection speech synthesis",
289-292.
Vepa, Jithendra / King, Simon:
"Kalman-filter based join cost for unit-selection speech synthesis",
293-296.
Toda, Tomoki / Kawai, Hisashi / Tsuzaki, Minoru:
"Optimizing integrated cost function for segment selection in concatenative speech synthesis based on perceptual evaluations",
297-300.
Matousek, Jindrich / Tihelka, Daniel / Psutka, Josef:
"Automatic segmentation for czech concatenative speech synthesis using statistical approach with boundary-specific correction",
301-304.
Kuo, Chih-Chung / Kuo, Chi-Shiang / Chen, Jau-Hung / Chang, Sen-Chia:
"Automatic speech segmentation and verification for concatenative synthesis",
305-308.
Paulo, Sergio / Oliveira, Luis C.:
"DTW-based phonetic alignment using multiple acoustic features",
309-312.
Kominek, John / Bennett, Christina L. / Black, Alan W.:
"Evaluating and correcting phoneme segmentation for unit selection synthesis",
313-316.
Klabbers, Esther / Santen, Jan P.H. van:
"Control and prediction of the impact of pitch modification on synthetic speech quality",
317-320.
Aylett, Matthew / Fackrell, Justin / Rutten, Peter:
"My voice, your prosody: sharing a speaker specific prosody model across speakers in unit selection TTS",
321-324.
Tesprasit, Virongrong / Charoenpornsawat, Paisarn / Sornlertlamvanich, Virach:
"Learning phrase break detection in Thai text-to-speech",
325-328.
Kain, Alexander B. / Santen, Jan P.H. van:
"A speech model of acoustic inventories based on asynchronous interpolation",
329-332.
Hirose, Keikichi / Ono, Takayuki / Minematsu, Nobuaki:
"Corpus-based synthesis of fundamental frequency contours of Japanese using automatically-generated prosodic corpus and generation process model",
333-336.
Kishore, S.P. / Black, Alan W.:
"Unit size in unit selection speech synthesis",
1317-1320.
Schweitzer, Antje / Braunschweiler, Norbert / Klankert, Tanja / Möbius, Bernd / Sauberlich, Bettina:
"Restricted unlimited domain synthesis",
1321-1324.
Francois, Hélčne / Boeffard, Olivier:
"Evaluation of units selection criteria in corpus-based speech synthesis",
1325-1328.
Pucher, Michael / Neubarth, Friedrich / Rank, Erhard / Niklfeld, Georg / Guan, Qi:
"Combining non-uniform unit selection with diphone based synthesis",
1329-1332.
Alias, Francesc / Llora, Xavier:
"Evolutionary weight tuning based on diphone pairs for unit selection speech synthesis",
1333-1336.
Andersen, Ove / Hoequist, Charles:
"Keeping rare events rare",
1337-1340.
Aurora Noise Robustness on LARGE Vocabulary Databases
Parihar, N. / Picone, Joseph:
"Analysis of the Aurora large vocabulary evaluations",
337-340.
Hilger, Florian / Ney, Hermann:
"Evaluation of quantile based histogram equalization with filter combination on the Aurora 3 and 4 databases",
341-344.
Rigazio, Luca / Nguyen, Patrick / Kryze, David / Junqua, Jean-Claude:
"Large vocabulary noise robustness on Aurora4",
345-348.
Stouten, Veronique / Hamme, Hugo van / Duchateau, Jacques / Wambacq, Patrick:
"Evaluation of model-based feature enhancement on the AURORA-4 task",
349-352.
Segura, Jose C. / Ramirez, Javier / Benitez, Carmen / Torre, Angel de la / Rubio, Antonio J.:
"Improved feature extraction based on spectral noise reduction and nonlinear feature normalization",
353-356.
Kim, Young Joon / Kim, Hyun Woo / Lim, Woohyung / Kim, Nam Soo:
"Feature compensation technique for robust speech recognition in noisy environments",
357-360.
Multilingual Speech-to-Speech Translation
Ney, Hermann:
"The statistical approach to machine translation and a roadmap for speech translation",
361-364.
Gao, Yuqing:
"Coupling vs. unifying: modeling techniques for speech-to-speech translation",
365-368.
Waibel, Alex / Badran, Ahmed / Black, Alan W. / Frederking, Robert / Gates, Donna / Lavie, Alon / Levin, Lori / Lenzo, Kevin A. / Tomokiyo, Laura Mayfield / Reichert, Jurgen / Schultz, Tanja / Wallace, Dorcas / Woszczyna, Monika / Zhang, Jing:
"Speechalator: two-way speech-to-speech translation on a consumer PDA",
369-372.
Franco, Horacio / Zheng, Jing / Precoda, Kristin / Cesari, Federico / Abrash, Victor / Vergyri, Dimitra / Venkataraman, Anand / Bratt, Harry / Richey, Colleen / Sarich, Ace:
"Development of phrase translation systems for handheld computers: from concept to field",
373-376.
Federico, Marcello:
"Evaluation frameworks for speech translation technologies",
377-380.
Kikui, Genichiro / Sumita, Eiichiro / Takezawa, Toshiyuki / Yamamoto, Seiichi:
"Creating corpora for speech-to-speech translation",
381-384.
Prosody
Minematsu, Nobuaki / Matsuoka, Bungo / Hirose, Keikichi:
"Prosodic analysis and modeling of the NAGAUTA singing to synthesize its prosodic patterns from the standard notation",
385-388.
Gharavian, D. / Ahadi, S.M.:
"Statistical evaluation of the influence of stress on pitch frequency and phoneme durations in farsi language",
389-392.
Chen, K. / Borys, S. / Hasegawa-Johnson, Mark / Cole, J.:
"Prosody dependent speech recognition with explicit duration modelling at intonational phrase boundaries",
393-396.
Teixeira, Joao Paulo / Freitas, Diamantino / Fujisaki, Hiroya:
"Prediction of fujisaki model's phrase commands",
397-400.
Muto, Makiko / Sagisaka, Yoshinori / Naito, Takuro / Maeki, Daiju / Kondo, Aki / Shirai, Katsuhiko:
"Corpus-based modeling of naturalness estimation in timing control for non-native speech",
401-404.
Ishi, Carlos Toshinori / Mokhtari, Parham / Campbell, Nick:
"Perceptually-related acoustic-prosodic features of phrase finals in spontaneous speech",
405-408.
Language Modeling
Langlois, David / Smaili, Kamel / Haton, Jean-Paul:
"Efficient linear combination for distant n-gram models",
409-412.
Emami, Ahmad:
"Improving a connectionist based syntactical language model",
413-416.
Nakano, Mikio / Hazen, Timothy J.:
"Using untranscribed user utterances for improving language models based on confidence scoring",
417-420.
Chang, Pi-Chuan / Liao, Shuo-Peng / Lee, Lin-shan:
"Improved Chinese broadcast news transcription by language modeling with temporally consistent training corpora and iterative phrase extraction",
421-424.
Mori, Shinsuke / Nishimura, Masafumi / Itoh, Nobuyasu:
"Language model adaptation using word clustering",
425-428.
Lane, Ian R. / Kawahara, Tatsuya / Matsui, Tomoko / Nakamura, Satoshi:
"Hierarchical topic classification for dialog speech recognition based on language model switching",
429-432.
Speech Modeling and Features 1-4
Alku, Paavo / Backstrom, Tom:
"Linear predictive method with low-frequency emphasis",
433-436.
Jain, Pratibha / Hermansky, Hynek:
"Beyond a single critical-band in TRAP based ASR",
437-440.
Valente, Fabio / Wellekens, Christian:
"Variational Bayesian GMM for speech recognition",
441-444.
Wada, Yamato / Sugiyama, Masahide:
"Time alignment for scenario and sounds with voice, music and BGM",
445-448.
Nguyen, Phu Chien / Akagi, Masato:
"Efficient quantization of speech excitation parameters using temporal decomposition",
449-452.
Kommer, Robert van / Hirsbrunner, Beat:
"Distributed genetic algorithm to discover a wavelet packet best basis for speech recognition",
453-456.
Huang, Chao-Shih / Lee, Chin-Hui / Wang, Hsiao-Chuan:
"New model-based HMM distances with applications to run-time ASR error estimation and model tuning",
457-460.
Kaburagi, Tokihiko / Kawai, Koji:
"Analysis of voice source characteristics using a constrained polynomial model",
461-464.
Ni, Jinfu / Kawai, Hisashi:
"Tone pattern discrimination combining parametric modeling and maximum likelihood estimation",
465-468.
Wrigley, Stuart N. / Brown, Guy J. / Wan, Vincent / Renals, Steve:
"Feature selection for the classification of crosstalk in multi-channel audio",
469-472.
Liu, Jingwei:
"A DTW-based DAG technique for speech and speaker feature analysis",
473-476.
Somervuo, Panu / Chen, Barry / Zhu, Qifeng:
"Feature transformations and combinations for improving ASR performance",
477-480.
Tseng, Chiu-yu:
"On the role of intonation in the organization of Mandarin Chinese speech prosody",
481-484.
Ohkawa, Yuichi / Yoshida, Akihiro / Suzuki, Motoyuki / Ito, Akinori / Makino, Shozo:
"An optimized multi-duration HMM for spontaneous speech recognition",
485-488.
Kim, Hyoung-Gook / Berdahl, Edgar / Moreau, Nicolas / Sikora, Thomas:
"Speaker recognition using MPEG-7 descriptors",
489-492.
Macherey, Wolfgang / Ney, Hermann:
"A comparative study on maximum entropy and discriminative training for acoustic modeling in automatic speech recognition",
493-496.
Zolnay, Andras / Schluter, Ralf / Ney, Hermann:
"Extraction methods of voicing feature for robust speech recognition",
497-500.
Armani, Luca / Matassoni, Marco / Omologo, Maurizio / Svaizer, Piergiorgio:
"Use of a CSP-based voice activity detector for distant-talking ASR",
501-504.
Omar, Mohamed Kamal / Hasegawa-Johnson, Mark:
"Maximum conditional mutual information projection for speech recognition",
505-508.
Gibbon, Dafydd / Gut, Ulrike / Hell, Benjamin / Looks, Karin / Thies, Alexandra / Trippel, Thorsten:
"A computational model of arm gestures in conversation",
813-816.
Pitsikalis, Vassilis / Kokkinos, Iasonas / Maragos, Petros:
"Nonlinear analysis of speech signals: generalized dimensions and lyapunov exponents",
817-820.
Motlicek, Petr / Cernocký, Jan:
"Time-domain based temporal processing with application of orthogonal transformations",
821-824.
Schwarz, Petr / Matejka, Pavel / Cernocký, Jan:
"Recognition of phoneme strings using TRAP technique",
825-828.
Fegyo, Tibor / Mihajlik, Peter / Tatai, Peter:
"Comparative study on hungarian acoustic model sets and training methods",
829-832.
Cheveigne, Alain de / Baskind, Alexis:
"F_0 estimation of one or several voices",
833-836.
Sivadas, Sunil / Hermansky, Hynek:
"In search of target class definition in tandem feature extraction",
837-840.
Adami, Andre G. / Hermansky, Hynek:
"Segmentation of speech for speaker and language recognition",
841-844.
Li, Xiang / Stern, Richard M.:
"Feature generation based on maximum classification probability for improved speech recognition",
845-848.
Yao, Kaisheng / Paliwal, Kuldip K. / Lee, Te-Won:
"Speech recognition with a generative factor analyzed hidden Markov model",
849-852.
Chen, Barry / Chang, Shuangyu / Sivadas, Sunil:
"Learning discriminative temporal patterns in speech: development of novel TRAPS-like classifiers",
853-856.
Scanlon, Patricia / Ellis, Daniel P.W. / Reilly, Richard:
"Using mutual information to design class-specific phone recognizers",
857-860.
Duxans, Helenca / Bonafonte, Antonio:
"Estimation of GMM in voice conversion including unaligned data",
861-864.
Tokuda, Keiichi / Zen, Heiga / Kitamura, Tadashi:
"Trajectory modeling based on HMMs with the explicit relationship between static and dynamic features",
865-868.
Bauerecker, Hermann / Nadeu, Climent / Padrell, Jaume:
"On the advantage of frequency-filtering features for speech recognition with variable sampling frequencies. experiments with speechdatcar databases",
869-872.
Mixdorff, Hansjorg / Fujisaki, Hiroya / Chen, Gao Peng / Hu, Yu:
"Towards the automatic extraction of fujisaki model parameters for Mandarin",
873-876.
Airey, S.S. / Gales, M.J.F.:
"Product of Gaussians as a distributed representation for speech recognition",
877-880.
Petrinovic, Davor:
"Harmonic weighting for all-pole modeling of the voiced speech",
881-884.
Nishizawa, Nobuyuki / Hirose, Keikichi / Minematsu, Nobuaki:
"Estimation of resonant characteristics based on AR-HMM modeling and spectral envelope conversion of vowel sounds",
885-888.
Hermansky, Hynek / Jain, Pratibha:
"Band-independent speech-event categories for TRAP based ASR",
1013-1016.
Grezl, Frantisek / Hermansky, Hynek:
"Local averaging and differentiating of spectral plane for TRAP-based ASR",
1017-1020.
Wolfel, Matthias / McDonough, John / Waibel, Alex:
"Minimum variance distortionless response on a warped frequency scale",
1021-1024.
Wang, Xuechuan / O'Shaughnessy, Douglas:
"Improving the efficiency of automatic speech recognition by feature transformation and dimensionality reduction",
1025-1028.
Stadermann, Jan / Rigoll, Gerhard:
"Distributed speech recognition on the WSJ task",
1029-1032.
Stuker, Sebastian / Metze, Florian / Schultz, Tanja / Waibel, Alex:
"Integrating multilingual articulatory features into speech recognition",
1033-1036.
Petek, Bojan:
"Locus equations determination using the speechdat(II)",
2301-2304.
Emonts, Michael / Lonsdale, Deryle:
"A memory-based approach to Cantonese tone recognition",
2305-2308.
Escudero, David / Cardenoso, Valentin / Bonafonte, Antonio:
"Experimental evaluation of the relevance of prosodic features in Spanish using machine learning techniques",
2309-2312.
Nakatani, Tomohiro / Irino, Toshio / Zolfaghari, Parham:
"Dominance spectrum based v/UV classification and f_0 estimation",
2313-2316.
Fujisaki, Hiroya / Narusawa, Shuichi / Ohno, Sumio / Freitas, Diamantino:
"Analysis and modeling of f_0 contours of portuguese utterances based on the command-response model",
2317-2320.
Jackson, Philip J.B. / Moreno, David M. / Russell, Martin J. / Hernando, Javier:
"Covariation and weighting of harmonically decomposed streams for ASR",
2321-2324.
Speech Enhancement 1, 2
Heracleous, Panikos / Nakamura, Satoshi / Shikano, Kiyohiro:
"A semi-blind source separation method for hands-free speech recognition of multiple talkers",
509-512.
Krasny, Leonid / Khayrallah, Ali:
"Influence of the waveguide propagation on the antenna performance in a car cabin",
513-516.
Potamitis, Ilyas / Tremoulis, George / Fakotakis, Nikos:
"Multi-speaker DOA tracking using interactive multiple models and probabilistic data association",
517-520.
Lu, Ching-Ta / Wang, Hsiao-Chuan:
"Speech enhancement using weighting function based on the variance of wavelet coefficients",
521-524.
Potamitis, Ilyas / Fishler, Eran:
"Microphone array voice activity detection and noise suppression using wideband generalized likelihood ratio",
525-528.
Saric, Zoran / Jovicic, Slobodan:
"Adaptive beamforming in room with reverberation",
529-532.
Ju, Gwo-hwa / Lee, Lin-shan:
"Perceptually-constrained generalized singular value decomposition-based approach for enhancing speech corrupted by colored noise",
533-536.
Yamajo, Hiroaki / Saruwatari, Hiroshi / Takatani, Tomoya / Nishikawa, Tsuyoki / Shikano, Kiyohiro:
"Blind separation and deconvolution for convolutive mixture of speech using SIMO-model-based ICA and multichannel inverse filtering",
537-540.
Raza, D.G. / Chan, C.F.:
"Quality enhancement of CELP coded speech by using an MFCC based Gaussian mixture model",
541-544.
Kim, Hyoung-Gook / Schwab, Markus / Moreau, Nicolas / Sikora, Thomas:
"Enhancement of noisy speech for noise robust front-end and speech reconstruction at back-end of DSR system",
545-548.
Wei, Jianqiang / Du, Limin / Yan, Zhaoli / Zeng, Hui:
"Improved kalman filter-based speech enhancement",
549-552.
Irino, Toshio / Patterson, Roy D. / Kawahara, Hideki:
"Speech segregation based on fundamental event information using an auditory vocoder",
553-556.
Yan, Zhaoli / Du, Limin / Wei, Jianqiang / Zeng, Hui:
"Time delay estimation based on hearing characteristic",
557-560.
Stolbov, M. / Koval, S. / Khitrov, M.:
"Parametric multi-band automatic gain control for noisy speech enhancement",
561-564.
Iser, Bernd / Schmidt, Gerhard:
"Neural networks versus codebooks in an application for bandwidth extension of speech signals",
565-568.
Jafer, Essa / Mahdi, Abdulhussain E.:
"Wavelet-based perceptual speech enhancement using adaptive threshold estimation",
569-572.
Potamitis, Ilyas / Fakotakis, Nikos / Kokkinakis, George:
"A trainable speech enhancement technique based on mixture models for speech and noise",
573-576.
Fu, Qiang / Wan, Eric A.:
"Perceptual wavelet adaptive denoising of speech",
577-580.
Yegnanarayana, B. / Prasanna, S.R. Mahadeva / Doss, Mathew Magimai:
"Enhancement of speech in multispeaker environment",
581-584.
Mizumachi, Mitsunori / Nakamura, Satoshi:
"Noise reduction using paired-microphones on non-equally-spaced microphone arrangement",
585-588.
Hodoshima, Nao / Arai, Takayuki / Inoue, Tsuyoshi / Kinoshita, Keisuke / Kusumoto, Akiko:
"Improving speech intelligibility by steady-state suppression as pre-processing in small to medium sized halls",
1365-1368.
Lee, Chen-Long / Yang, Ya-Ru / Chang, Wen-Whei / Chiang, Yuan-Chuan:
"Enhancement of hearing-impaired Mandarin speech",
1369-1372.
Alvarez, A. / Nieto, V. / Gomez, P. / Martinez, R.:
"Speech enhancement for a car environment using LP residual signal and spectral subtraction",
1373-1376.
Ju, Gwo-hwa / Lee, Lin-shan:
"Speech enhancement and improved recognition accuracy by integrating wavelet transform and spectral subtraction algorithm",
1377-1380.
Mahe, Gael / Gilloire, Andre:
"Multi-referenced correction of the voice timbre distortions in telephone networks",
1381-1384.
Lee, J.J. / Lee, J.H. / Lee, K.Y.:
"Efficient speech enhancement based on left-right HMM with state sequence detection using LRT",
1385-1388.
Gnaba, H. / Alouane, M. Turki-Hadj / Jaidane-Saidane, M. / Scalart, P.:
"Introduction of the CELP structure of the GSM coder in the acoustic echo canceller for the GSM network",
1389-1392.
Sodoyer, David / Girin, Laurent / Jutten, Christian / Schwartz, Jean-Luc:
"Extracting an AV speech source from a mixture of signals",
1393-1396.
Puder, Henning:
"Speech enhancement for hands-free car phones by adaptive compensation of harmonic engine noise components",
1397-1400.
Hou, Zhaorong / Jia, Ying:
"Enhance low-frequency suppression of GSC beamforming",
1401-1404.
Srinivasan, Sriram / Samuelsson, Jonas / Kleijn, W. Bastiaan:
"Speech enhancement using a-priori information",
1405-1408.
Hogden, John / Valdez, Patrick / Katagiri, Shigeru / McDermott, Erik:
"Blind inversion of multidimensional functions for speech enhancement",
1409-1412.
Abutalebi, H.R. / Sheikhzadeh, H. / Brennan, R.L. / Freeman, G.H.:
"Convergence improvement for oversampled subband adaptive noise and echo cancellation",
1413-1416.
Unoki, Masashi / Sakata, Keigo / Akagi, Masato:
"A speech dereverberation method based on the MTF concept",
1417-1420.
Kim, SangGyun / Kim, Jong Uk / Yoo, Chang D.:
"Accuracy improved double-talk detector based on state transition diagram",
1421-1424.
Natarajan, Ajay / Hansen, John H.L. / Arehart, Kathryn / Rossi-Katz, Jessica A.:
"Perceptual based speech enhancement for normal-hearing and hearing-impaired individuals",
1425-1428.
Ortega, Alfonso / Lleida, Eduardo / Masgrau, Enrique:
"Residual echo power estimation for speech reinforcement systems in vehicles",
1429-1432.
Qian, Yasheng / Kabal, Peter:
"Dual-mode wideband speech recovery from narrowband speech",
1433-1436.
Al-Naimi, Khaldoon / Sturt, Christian / Kondoz, Ahmet:
"A robust noise and echo canceller",
1437-1440.
Nix, Johannes / Kleinschmidt, Michael / Hohmann, Volker:
"Computational auditory scene analysis by using statistics of high-dimensional speech dynamics and sound source direction",
1441-1444.
Spoken Dialog Systems 1, 2
Witt, Silke M. / Williams, Jason D.:
"Two studies of open vs. directed dialog strategies in spoken dialog systems",
589-592.
O'Neill, Ian / Hanna, Philip / Liu, Xingkun / McTear, Michael:
"The queen's communicator: an object-oriented dialogue manager",
593-596.
Bohus, Dan / Rudnicky, Alexander I.:
"Ravenclaw: dialog management using hierarchical task decomposition and an expectation agenda",
597-600.
Macherey, Klaus / Ney, Hermann:
"Features for tree based dialogue course management",
601-604.
Torres, Francisco / Sanchis, Emilio / Segarra, Encarna:
"Development of a stochastic dialog manager driven by semantics",
605-608.
Takeuchi, Masashi / Kitaoka, Norihide / Nakagawa, Seiichi:
"Generation of natural response timing using decision tree based on prosodic and linguistic information",
609-612.
Bell, Linda / Gustafson, Joakim:
"Child and adult speaker adaptation during error resolution in a publicly available spoken dialogue system",
613-616.
Esteve, Yannick / Raymond, Christian / Bechet, Frédéric / Mori, Renato De:
"Conceptual decoding for spoken dialog systems",
617-620.
Wang, Huei-Ming / Lin, Yi-Chung:
"Sentence verification in spoken dialogue system",
621-624.
Kitaoka, Norihide / Kakutani, Naoko / Nakagawa, Seiichi:
"Detection and recognition of correction utterance in spontaneously spoken dialog",
625-628.
Ekanadham, Chaitanya J.K. / Huerta, Juan M.:
"Topic-specific parser design in an air travel natural language understanding application",
629-632.
Cox, Stephen J. / Cawley, Gavin:
"The use of confidence measures in vector based call-routing",
633-636.
Bechet, Frédéric / Riccardi, Giuseppe / Hakkani-Tur, Dilek Z.:
"Multi-channel sentence classification for spoken dialogue language modeling",
637-640.
Seneff, Stephanie / Wang, Chao / Hazen, Timothy J.:
"Automatic induction of n-gram language models from a natural language grammar",
641-644.
Vilar, David / Castro, Maria Jose / Sanchis, Emilio:
"Connectionist classification and specific stochastic models in the understanding process of a dialogue system",
645-648.
Boye, Johan / Wiren, Mats:
"Robust parsing of utterances in negotiative dialogue",
649-652.
Wu, Chung-Hsien / Yan, Gwo-Lang:
"Flexible speech act identification of spontaneous speech with disfluency",
653-656.
Dohsaka, Kohji / Yasuda, Norihito / Aikawa, Kiyoaki:
"Efficient spoken dialogue control depending on the speech recognition rate and system's database",
657-660.
Takahashi, Shin-ya / Morimoto, Tsuyoshi / Maeda, Sakashi / Tsuruta, Naoyuki:
"Robust speech understanding based on expected discourse plan",
661-664.
Isobe, T. / Hayakawa, S. / Murao, H. / Mizutani, T. / Takeda, Kazuya / Itakura, Fumitada:
"A study on domain recognition of spoken dialogue systems",
1889-1892.
He, Wei / Li, Honglian / Yuan, Baozong:
"Domain adaptation augmented by state-dependence in spoken dialog systems",
1893-1896.
Portele, Thomas / Goronzy, Silke / Emele, Martin / Kellner, Andreas / Torge, Sunna / Vrugt, Jurgen te:
"Smartkom-home - an advanced multi-modal interface to home entertainment",
1897-1900.
Xu, Yunbiao / Di, Fengying / Araki, Masahiro / Niimi, Yasuhisa:
"Methods to improve its portability of a spoken dialog system both on task domains and languages",
1901-1904.
Fegyo, Tibor / Mihajlik, Peter / Szarvas, Mate / Tatai, Peter / Tatai, Gabor:
"Voxenter^TM - intelligent voice enabled call center for hungarian",
1905-1908.
Huang, Qiang / Cox, Stephen J.:
"Automatic call-routing without transcriptions",
1909-1912.
Turunen, Markku / Hakulinen, Jaakko:
"Jaspis^2 - an architecture for supporting distributed spoken dialogues",
1913-1916.
Zibert, Janez / Martincic-Ipsic, Sanda / Hajdinjak, Melita / Ipsic, Ivo / Mihelic, France:
"Development of a bilingual spoken dialog system for weather information retrieval",
1917-1920.
Allen, James / Attwater, David / Durston, Peter / Farrell, Mark:
"Improving "how may i help you?" systems using the output of recognition lattices",
1921-1924.
Andorno, M. / Fissore, L. / Laface, P. / Nigra, M. / Popovici, C. / Ravera, F. / Vair, C.:
"Incremental learning of new user formulations in automatic directory assistance",
1925-1928.
Baca, Julie A. / Zheng, Feng / Gao, Hualin / Picone, Joseph:
"Dialog systems for automotive environments",
1929-1932.
Neto, Joao P. / Mamede, Nuno J. / Cassaca, Renato / Oliveira, Luis C.:
"The development of a multi-purpose spoken dialogue system",
1933-1936.
Goronzy, Silke / Valsan, Zica / Emele, Martin / Schimanowski, Juergen:
"The dynamic, multi-lingual lexicon in smartkom",
1937-1940.
Higashinaka, Ryuichiro / Miyazaki, Noboru / Nakano, Mikio / Aikawa, Kiyoaki:
"Evaluating discourse understanding in spoken dialogue systems",
1941-1944.
Larsen, Lars Bo:
"Assessment of spoken dialogue system usability - what are we really measuring?",
1945-1948.
Smeele, Paula M.T. / Waals, Juliette A.J.S.:
"Evaluation of a speech-driven telephone information service using the PARADISE framework: a closer look at subjective measures",
1949-1952.
Moller, Sebastian / Skowronek, Janto:
"Quantifying the impact of system characteristics on perceived quality dimensions of a spoken dialogue service",
1953-1956.
Ramaswamy, Ganesh N. / Zilca, Ran D. / Alecksandrovich, Oleg:
"A programmable policy manager for conversational biometrics",
1957-1960.
Hazen, Timothy J. / Jones, Douglas A. / Park, Alex / Kukolich, Linda C. / Reynolds, Douglas A.:
"Integration of speaker recognition into conversational spoken dialogue systems",
1961-1964.
Robust Speech Recognition - Noise Compensation
Obuchi, Yasunari / Stern, Richard M.:
"Normalization of time-derivative parameters using histogram equalization",
665-668.
Zhang, Zhipeng / Otsuji, Kiyotaka / Furui, Sadaoki:
"Tree-structured noise-adapted HMM modeling for piecewise linear-transformation-based adaptation",
669-672.
Zhu, Donglai / Nakamura, Satoshi / Paliwal, Kuldip K. / Wang, Renhua:
"Maximum likelihood sub-band weighting for robust speech recognition",
673-676.
Kim, Wooil / Ahn, Sungjoo / Ko, Hanseok:
"Feature compensation scheme based on parallel combined mixture model",
677-680.
Droppo, Jasha / Deng, Li / Acero, Alex:
"A comparison of three non-linear observation models for noisy speech features",
681-684.
Daoudi, Khalid / Deviren, Murat:
"A new supervised-predictive compensation scheme for noisy speech recognition",
685-688.
Forensic Speaker Recognition
Drygajlo, Andrzej / Meuwly, Didier / Alexander, Anil:
"Statistical methods and Bayesian interpretation of evidence in forensic automatic speaker recognition",
689-692.
Gonzalez-Rodriguez, J. / Garcia-Romero, D. / Garcia-Gomar, M. / Ramos-Castro, D. / Ortega-Garcia, J.:
"Robust likelihood ratio estimation in Bayesian forensic speaker recognition",
693-696.
Nakasone, Hirotaka:
"Automated speaker recognition in real world conditions: controlling the uncontrollable",
697-700.
Pfister, Beat / Beutler, Rene:
"Estimating the weight of evidence in forensic speaker verification",
701-704.
Gfroerer, Stefan:
"Auditory-instrumental forensic speaker recognition",
705-708.
Kerstholt, J.H. / Jansen, E.J.M. / Amelsvoort, A.G. van / Broeders, A.P.A.:
"Earwitness line-ups: effects of speech duration, retention interval and acoustic environment on identification accuracy",
709-712.
Emotion in Speech
Amir, Noam / Ziv, Shirley / Cohen, Rachel:
"Characteristics of authentic anger in hebrew speech",
713-716.
Seppanen, Tapio / Vayrynen, Eero / Toivanen, Juhani:
"Prosody-based classification of emotions in spoken finnish",
717-720.
Rahurkar, Mandar A. / Hansen, John H.L.:
"Frequency distribution based weighted sub-band approach for classification of emotional/stressful content in speech",
721-724.
Liscombe, Jackson / Venditti, Jennifer / Hirschberg, Julia:
"Classifying subject ratings of emotional speech using acoustic features",
725-728.
Yacoub, Sherif / Simske, Steve / Lin, Xiaofan / Burns, John:
"Recognition of emotions in interactive voice response systems",
729-732.
Batliner, Anton / Zeissler, Viktor / Frank, Carmen / Adelhardt, Johann / Shi, Rui P. / Nöth, Elmar:
"We are not amused - but how do you know? user states in a multi-modal dialogue system",
733-736.
Dialog System User and Domain Modeling
Bernsen, Niels Ole:
"On-line user modelling in a mobile spoken dialogue system",
737-740.
Pakucs, Botond:
"Towards dynamic multi-domain dialogue processing",
741-744.
Komatani, Kazunori / Ueno, Shinichi / Kawahara, Tatsuya / Okuno, Hiroshi G.:
"User modeling in spoken dialogue systems for flexible guidance generation",
745-748.
Seneff, Stephanie / Chung, Grace / Wang, Chao:
"Empowering end users to personalize dialogue systems through spoken interaction",
749-752.
Raux, Antoine / Langner, Brian / Black, Alan W. / Eskenazi, Maxine:
"LET's GO: improving spoken dialog systems for the elderly and non-natives",
753-756.
Hakulinen, Jaakko / Turunen, Markku / Salonen, Esa-Pekka:
"Agents for integrated tutoring in spoken dialogue systems",
757-760.
Topics in Speech Recognition and Segmentation
Kim, Taeyoon / Ko, Hanseok:
"Utterance verification under distributed detection and fusion framework",
889-892.
Ho, Simon / Mak, Brian:
"Joint estimation of thresholds in a bi-threshold verification problem",
893-896.
Nefti, Samir / Boeffard, Olivier / Moudenc, Thierry:
"Confidence measures for phonetic segmentation of continuous speech",
897-900.
Wiggers, Pascal / Rothkrantz, Leon J.M.:
"Using confidence measures and domain knowledge to improve speech recognition",
901-904.
Thambiratnam, K. / Sridharan, Sridha:
"Isolated word verification using cohort word-level verification",
905-908.
Au, Wing-Hei / Siu, Man-Hung:
"A new approach to minimize utterance verification error rate for a specific operating point",
909-912.
Yan, Binfeng / Guo, Rui / Zhu, Xiaoyan:
"Continuous speech recognition and verification based on a combination score",
913-916.
Fabian, Tibor / Lieb, Robert / Ruske, Gunther / Thomae, Matthias:
"Impact of word graph density on the quality of posterior probability based confidence measures",
917-920.
Heracleous, Panikos / Shimizu, Tohru:
"An efficient keyword spotting technique using a complementary language for filler models training",
921-924.
Levit, Michael / Alshawi, Hiyan / Gorin, Allen / Nöth, Elmar:
"Context-sensitive evaluation and correction of phone recognition output",
925-928.
Deng, Yonggang / Mahajan, Milind / Acero, Alex:
"Estimating speech recognition error rate without acoustic test data",
929-932.
Bisani, M. / Ney, Hermann:
"Multigram-based grapheme-to-phoneme conversion for LVCSR",
933-936.
Beutler, Rene / Pfister, Beat:
"Integrating statistical and rule-based knowledge for continuous German speech recognition",
937-940.
Vandecatseye, An / Martens, Jean-Pierre:
"A fast, accurate and stream-based speaker segmentation and clustering algorithm",
941-944.
Cheng, Shi-sian / Wang, Hsin-Min:
"A sequential metric-based audio segmentation method via the Bayesian information criterion",
945-948.
Srivastava, Amit / Kubala, Francis:
"Sentence boundary detection in arabic speech",
949-952.
Franz, Martin / Ramabhadran, Bhuvana / Ward, Todd / Picheny, Michael:
"Automated transcription and topic segmentation of large spoken archives",
953-956.
Liu, Yang / Shriberg, Elizabeth / Stolcke, Andreas:
"Automatic disfluency identification in conversational speech using multiple knowledge sources",
957-960.
Yamamoto, Natsuo / Ogata, Jun / Ariki, Yasuo:
"Topic segmentation and retrieval system for lecture videos based on spontaneous speech recognition",
961-964.
Robust Speech Recognition - Acoustic Modeling
Markov, Konstantin / Dang, Jianwu / Iizuka, Yosuke / Nakamura, Satoshi:
"Hybrid HMM/BN ASR system integrating spectrum and articulatory features",
965-968.
Stemmer, Georg / Zeissler, Viktor / Hacker, Christian / Nöth, Elmar / Niemann, Heinrich:
"Context-dependent output densities for hidden Markov models in speech recognition",
969-972.
Shinozaki, Takahiro / Furui, Sadaoki:
"Time adjustable mixture weights for speaking rate fluctuation",
973-976.
Wu, Jian / Huo, Qiang:
"A switching linear Gaussian hidden Markov model and its application to nonstationary noise compensation for robust speech recognition",
977-980.
Tyagi, Vivek / McCowan, Iain A. / Bourlard, Hervé / Misra, Hemant:
"On factorizing spectral dynamics for robust speech recognition",
981-984.
Jia, Chuan / Ding, Peng / Xu, Bo:
"Joint model and feature based compensation for robust speech recognition under non-stationary noise environments",
985-988.
Advanced Machine Learning Algorithms for Speech and Language Processing
Cortes, Corinna / Haffner, Patrick / Mohri, Mehryar:
"Weighted automata kernels - general framework and algorithms",
989-992.
Altun, Yasemin / Hofmann, Thomas:
"Large margin methods for label sequence learning",
993-996.
Ratsch, Gunnar:
"Robust multi-class boosting",
997-1000.
Saul, Lawrence K. / Sha, Fei / Lee, Daniel D.:
"Statistical signal processing with nonnegativity constraints",
1001-1004.
Garg, Ashutosh / Warmuth, Manfred K.:
"Inline updates for HMMs",
1005-1008.
Roweis, Sam T.:
"Factorial models and refiltering for speech separation and denoising",
1009-1012.
Multi-Modal Spoken Language Processing
Klein, Alexandra / Trost, Harald:
"Using corpus-based methods for spoken access to news texts on the web",
1037-1040.
Brungart, Douglas S. / Simpson, Brian D. / Kordik, Alex:
"Cross-modal informational masking due to mismatched audio cues in a speechreading task",
1041-1044.
Berthommier, Frédéric:
"Audiovisual speech enhancement based on the association between speech envelope and video features",
1045-1048.
Wasinger, Rainer / Stahl, Christoph / Krueger, Antonio:
"Robust speech interaction in a mobile environment through the use of multiple and different media input types",
1049-1052.
Woltjer, Rogier / Tan, Wah Jin / Chen, Fang:
"Speech-based, manual-visual, and multi-modal interaction with an in-car computer - evaluation of a pilot study",
1053-1056.
Prodanov, Plamen / Drygajlo, Andrzej:
"Bayesian networks for spoken dialogue management in multimodal systems of tour-guide robots",
1057-1060.
Speech Coding and Transmission
Chu, Wai C. / Miki, Toshio:
"Optimization of window and LSF interpolation factor for the ITU-t g.729 speech coding standard",
1061-1064.
Chang, Joon-Hyuk / Shin, Jong-Won / Kim, Nam Soo:
"Likelihood ratio test with complex laplacian model for voice activity detection",
1065-1068.
Nurminen, Jani:
"Multi-mode quantization of adjacent speech parameters using a low-complexity prediction scheme",
1069-1072.
Sinervo, Ulpu / Nurminen, Jani / Heikkinen, Ari / Saarinen, Jukka:
"Multi-mode matrix quantizer for low bit rate LSF quantization",
1073-1076.
Mertz, Frank / Taddei, Herve / Varga, Imre / Vary, Peter:
"Voicing controlled frame loss concealment for adaptive multi-rate (AMR) speech frames in voice-over-IP",
1077-1080.
Lahdekorpi, Marja / Nurminen, Jani / Heikkinen, Ari / Saarinen, Jukka:
"Perceptual irrelevancy removal in narrowband speech coding",
1081-1084.
Jeu, Charles du / Charbit, Maurice / Chollet, Gérard:
"Very-low-rate speech compression by indexation of polyphones",
1085-1088.
Sanchez, Victoria / Peinado, Antonio M. / Gomez, Angel M. / Perez-Cordoba, Jose L.:
"Entropy-optimized channel error mitigation with application to speech recognition over wireless",
1089-1092.
Krishnan, Venkatesh / Anderson, David V.:
"Robust jointly optimized multistage vector quantization for speech coding",
1093-1096.
Pobloth, Harald / Vafin, Renat / Kleijn, W. Bastiaan:
"Polar quantization of sinusoids from speech signal blocks",
1097-1100.
Yoon, Sung-Wan / Choi, Jin-Kyu / Kang, Hong-Goo / Youn, Dae-Hee:
"Transcoding algorithm for g.723.1 and AMR speech coders: for interoperability between voIP and mobile networks",
1101-1104.
Petrinovic, Davorka / Petrinovic, Davor:
"Quality-complexity trade-off in predictive LSF quantization",
1105-1108.
Kikuiri, Kei / Naka, Nobuhiko / Ohya, Tomoyuki:
"Variable bit rate control with trellis diagram approximation",
1109-1112.
Srinivasamurthy, Naveen / Ortega, Antonio / Narayanan, Shrikanth:
"Towards optimal encoding for classification with applications to distributed speech recognition",
1113-1116.
Raad, Mohammed / Burnett, Ian / Mertins, Alfred:
"Multi-rate extension of the scalable to lossless PSPIHT audio coder",
1117-1120.
Shabestary, Turaj Zakizadeh / Hedelin, Per / Norden, Fredrik:
"Entropy constrained quantization of LSP parameters",
1121-1124.
Speech Recognition - Search and Lexicon Modeling
Kobayashi, Akio / Och, Franz J. / Ney, Hermann:
"Named entity extraction from Japanese broadcast news",
1125-1128.
Park, Young-Hee / Ahn, Dong-Hoon / Chung, Minhwa:
"Morpheme-based lexical modeling for korean broadcast news transcription",
1129-1132.
Wachter, Mathias De / Demuynck, Kris / Compernolle, Dirk van / Wambacq, Patrick:
"Data driven example based continuous speech recognition",
1133-1136.
Astrov, Sergey / Andrassy, Bernt:
"Large vocabulary speaker independent isolated word recognition for embedded systems",
1137-1140.
Seward, Alexander:
"Low-latency incremental speech transcription in the synface project",
1141-1144.
Kanthak, S. / Ney, Hermann:
"Multilingual acoustic modeling using graphemes",
1145-1148.
Fujii, Atsushi / Itou, Katunobu / Akiba, Tomoyosi / Ishikawa, Tetsuya:
"A cross-media retrieval system for lecture videos",
1149-1152.
Fujii, Atsushi / Itou, Katunobu:
"Building a test collection for speech-driven web retrieval",
1153-1156.
Novak, Miroslav / Ruiz, Diego:
"Confidence measure driven scalable two-pass recognition strategy for large list grammars",
1157-1160.
Abdou, Sherif / Scordilis, Michael S.:
"An efficient, fast matching approach using posterior probability estimates in speech recognition",
1161-1164.
Hacioglu, Kadri / Pellom, Bryan / Ciloglu, Tolga / Ozturk, Ozlem / Kurimo, Mikko / Creutz, Mathias:
"On lexicon creation for turkish LVCSR",
1165-1168.
Chen, Stanley F.:
"Compiling large-context phonetic decision trees into finite-state transducers",
1169-1172.
Maskey, Sameer Raj / Hirschberg, Julia:
"Automatic summarization of broadcast news using structural features",
1173-1176.
Yan, Yonghong / Zheng, Chengyi / Zhang, Jianping / Pan, Jielin / Han, Jiang / Liu, Jian:
"A dynamic cross-reference pruning strategy for multiple feature fusion at decoder run time",
1177-1180.
Lamere, Paul / Kwok, Philip / Walker, William / Gouvea, Evandro / Singh, Rita / Raj, Bhiksha / Wolf, Peter:
"Design of the CMU sphinx-4 decoder",
1181-1184.
Cilingir, Onur / Demirekler, Mubeccel:
"A new decoder design for large vocabulary turkish speech recognition",
1185-1188.
Speech Technology Applications
Green, Phil / Carmichael, James / Hatzis, Athanassios / Enderby, Pam / Hawley, Mark / Parker, Mark:
"Automatic speech recognition with sparse training data for dysarthric speakers",
1189-1192.
Inoue, Akira / Mikami, Takayoshi / Yamashita, Yoichi:
"Prediction of sentence importance for speech summarization using prosodic parameters",
1193-1196.
Wang, Chong-kai / Lyu, Ren-Yuan / Chiang, Yuang-Chin:
"An automatic singing transcription system with multilingual singing lyric recognizer and robust melody tracker",
1197-1200.
Goto, Masataka / Omoto, Yukihiro / Itou, Katunobu / Kobayashi, Tetsunori:
"Speech shift: direct speech-input-mode switching through intentional control of voice pitch",
1201-1204.
Matsushita, Masahiko / Nishizaki, Hiromitsu / Utsuro, Takehito / Kodama, Yasuhiro / Nakagawa, Seiichi:
"Evaluating multiple LVCSR model combination in NTCIR-3 speech-driven web retrieval task",
1205-1208.
Wang, Kuansan:
"Semantic object synchronous understanding in SALT for highly interactive user interface",
1209-1212.
Kneissler, Jan / Kienappel, Anne K. / Klakow, Dietrich:
"Information retrieval based call classification",
1213-1216.
Larson, Martha / Eickeler, Stefan:
"Using syllable-based indexing features and language models to improve German spoken document retrieval",
1217-1220.
Sundaram, Shiva / Narayanan, Shrikanth:
"An empirical text transformation method for spontaneous speech synthesizers",
1221-1224.
Gul, Yilmaz / Ariyaeeinia, Aladdin M. / Dewhirst, Oliver:
"A new approach to reducing alarm noise in speech",
1225-1228.
Yu, Dong / Wang, Kuansan / Mahajan, Milind / Mau, Peter / Acero, Alex:
"Improved name recognition with user modeling",
1229-1232.
Bawab, Ziad Al / Locher, Ivo / Xue, Jianxia / Alwan, Abeer:
"Speech recognition over bluetooth wireless channels",
1233-1236.
Kitayama, Koji / Goto, Masataka / Itou, Katunobu / Kobayashi, Tetsunori:
"Speech starter: noise-robust endpoint detection by using filled pauses",
1237-1240.
Boulianne, Gilles / Beaumont, Jean-Francois / Cardinal, Patrick / Comeau, Michel / Ouellet, Pierre / Dumouchel, Pierre:
"Automatic segmentation of film dialogues into phonemes and graphemes",
1241-1244.
Brousseau, Julie / Beaumont, Jean-Francois / Boulianne, Gilles / Cardinal, Patrick / Chapdelaine, Claude / Comeau, Michel / Osterrath, Frédéric / Ouellet, Pierre:
"Automated closed-captioning of live TV broadcast news in French",
1245-1248.
Jan, E.E. / Maison, Benoit / Mangu, Lidia / Zweig, Geoffrey:
"Automatic construction of unique signatures and confusable sets for natural language directory assistance applications",
1249-1252.
Meng, Helen M. / Li, Yuk-Chi / Fung, Tien-Ying / Ho, Man-Cheuk / Keung, Chi-Kin / Lo, Tin-Hang / Lo, Wai-Kit / Ching, P.C.:
"Recent enhancements in CU VOCAL for Chinese TTS-enabled applications",
1253-1256.
Trancoso, Isabel / Neto, Joao P. / Meinedo, Hugo / Amaral, Rui:
"Evaluation of an alert system for selective dissemination of broadcast news",
1257-1260.
Mittal, U. / Ashley, J.P. / Cruz-Zeno, E.M.:
"Low complexity joint optimization of excitation parameters in analysis-by-synthesis speech coding",
1261-1264.
Horlock, James / King, Simon:
"Named entity extraction from word lattices",
1265-1268.
Belfield, William / Gish, Herbert:
"A topic classification system based on parametric trajectory mixture models",
1269-1272.
Robust Speech Recognition - Front-end Processing
Yao, Kaisheng / Paliwal, Kuldip K. / Nakamura, Satoshi:
"Model based noisy speech recognition with environment parameters estimated by noise adaptive speech recognition with prior",
1273-1276.
Seltzer, Michael L. / Droppo, Jasha / Acero, Alex:
"A harmonic-model-based front end for robust speech recognition",
1277-1280.
Yapanel, Umit H. / Hansen, John H.L.:
"A new perspective on feature extraction for robust in-vehicle speech recognition",
1281-1284.
Sekiya, Toshiyuki / Ogawa, Tetsuji / Kobayashi, Tetsunori:
"Speech recognition of double talk using SAFIA-based audio segregation",
1285-1288.
Zhang, Xianxian / Hansen, John H.L.:
"CFA-BF: a novel combined fixed/adaptive beamforming for robust speech recognition in real car environments",
1289-1292.
Potamianos, Gerasimos / Neti, Chalapathy:
"Audio-visual speech recognition in challenging environments",
1293-1296.
Spoken Language Processing for e-Inclusion
Karlsson, Inger / Faulkner, Andrew / Salvi, Giampiero:
"SYNFACE - a talking face telephone",
1297-1300.
Vesnicer, Bostjan / Zibert, Janez / Dobrisek, Simon / Pavesic, Nikola / Mihelic, France:
"A voice-driven web browser for blind people",
1301-1304.
Muller, Christian / Wittig, Frank / Baus, Jorg:
"Exploiting speech for recognizing elderly users to respond to their special needs",
1305-1308.
Newell, Alan F.:
"Spoken language and e-inclusion",
1309-1312.
Stemmer, Georg / Hacker, Christian / Steidl, Stefan / Nöth, Elmar:
"Acoustic normalization of children's speech",
1313-1316.
Language and Accent Identification
Martin, Alvin F. / Przybocki, Mark A.:
"NIST 2003 language recognition evaluation",
1341-1344.
Singer, E. / Torres-Carrasquillo, P.A. / Gleason, T.P. / Campbell, W.M. / Reynolds, Douglas A.:
"Acoustic, phonetic, and discriminative approaches to automatic language identification",
1345-1348.
Chen, Stanley F. / Maison, Benoit:
"Using place name data to train language identification models",
1349-1352.
Angkititrakul, Pongtep / Hansen, John H.L.:
"Use of trajectory models for automatic accent classification",
1353-1356.
Ramasubramanian, V. / Jayram, A.K.V. Sai / Sreenivas, T.V.:
"Language identification using parallel sub-word recognition - an ergodic HMM equivalence",
1357-1360.
BenZeghiba, Mohamed Faouzi / Bourlard, Hervé:
"On the combination of speech and speaker recognition",
1361-1364.
Speech Recognition - Adaptation 1, 2
Pitz, Michael / Ney, Hermann:
"Vocal tract normalization as linear transformation of MFCC",
1445-1448.
Wang, Zhirong / Schultz, Tanja:
"Non-native spontaneous speech recognition through polyphone decision tree specialization",
1449-1452.
Ariki, Yasuo / Shigemori, Takeru / Kaneko, Tsuyoshi / Ogata, Jun / Fujimoto, Masakiyo:
"Live speech recognition in sports games by adaptation of acoustic model and language model",
1453-1456.
Oh, Se-Jin / Kim, Kwang-Dong / Roh, Duk-Gyoo / Sung, Woo-Chang / Chung, Hyun-Yeol:
"Speaker adaptation using regression classes generated by phonetic decision tree-based successive state splitting",
1457-1460.
Kim, Jiun / Chung, Jaeho:
"Reduction of dimension of HMM parameters using ICA and PCA in MLLR framework for speaker adaptation",
1461-1464.
Zhang, Huayun / Xu, Bo:
"Geometric constrained maximum likelihood linear regression on Mandarin dialect adaptation",
1465-1468.
Akiba, Tomoyosi / Itou, Katunobu / Fujii, Atsushi:
"Adapting language models for frequent fixed phrases by emphasizing n-gram subsets",
1469-1472.
Kienappel, Anne K.:
"Learning intra-speaker model parameter correlations from many short speaker segments",
1473-1476.
Kam, Patgi / Lee, Tan / Soong, Frank K.:
"Modeling Cantonese pronunciation variation by acoustic model refinement",
1477-1480.
Park, Jong Se / Song, Hwa Jeon / Kim, Hyung Soon:
"Performance improvement of rapid speaker adaptation based on eigenvoice and bias compensation",
1481-1484.
Fang, Xiaoshan / Gao, Jianfeng / Li, Jianfeng / Sheng, Huanye:
"Training data optimization for language model adaptation",
1485-1488.
Aalburg, Stefanie / Hoege, Harald:
"Approaches to foreign-accented speaker-independent speech recognition",
1489-1492.
Yamade, Shingo / Lee, Akinobu / Saruwatari, Hiroshi / Shikano, Kiyohiro:
"Unsupervised speaker adaptation based on HMM sufficient statistics in various noisy environments",
1493-1496.
Lauri, Fabrice / Illina, Irina / Fohr, Dominique / Korkmazsky, Filipp:
"Using genetic algorithms for rapid speaker adaptation",
1497-1500.
Barreaud, Vincent / Illina, Irina / Fohr, Dominique / Korkmazsky, Filipp:
"Structural state-based frame synchronous compensation",
1501-1504.
Lawson, Aaron D. / Harris, David M. / Grieco, John J.:
"Effect of foreign accent on speech recognition in the NATO n-4 corpus",
1505-1508.
Nedel, Jon P. / Stern, Richard M.:
"Duration normalization and hypothesis combination for improved spontaneous speech recognition",
1509-1512.
Chou, Wu / He, Xiaodong:
"Maximum a posteriori linear regression (MAPLR) variance adaptation for continuous density HMMS",
1513-1516.
Myrvoll, Tor Andre / Soong, Frank K.:
"On divergence based clustering of normal distributions and its application to HMM adaptation",
1517-1520.
Balakrishnan, Sreeram V.:
"Fast incremental adaptation using maximum likelihood regression and stochastic gradient descent",
1521-1524.
Axelrod, Scott / Goel, Vaibhava / Kingsbury, Brian / Visweswariah, Karthik / Gopinath, Ramesh:
"Large vocabulary conversational speech recognition with a subspace constraint on inverse covariance matrices",
1613-1616.
Jang, Gyucheol / Jin, Minho / Yoo, Chang D.:
"Speaker adaptation based on confidence-weighted training",
1617-1620.
Abad, Alberto / Nadeu, Climent / Hernando, Javier / Padrell, Jaume:
"Jacobian adaptation based on the frequency-filtered spectral energies",
1621-1624.
Matrouf, Driss / Bellot, Olivier / Nocera, Pascal / Linares, Georges / Bonastre, Jean-Francois:
"Structural linear model-space transformations for speaker adaptation",
1625-1628.
He, Xiaodong / Chou, Wu:
"Minimum classification error (MCE) model adaptation of continuous density HMMS",
1629-1632.
Gunawardana, Asela / Acero, Alex:
"Adapting acoustic models to new domains and conditions using untranscribed data",
1633-1636.
Speech Resources and Standards
Bijankhan, Mahmood / Sheykhzadegan, Javad / Roohani, Mahmood R. / Zarrintare, Rahman / Ghasemi, Seyyed Z. / Ghasedi, Mohammad E.:
"Tfarsdat - the telephone farsi speech database",
1525-1528.
Hartikainen, Elviira / Maltese, Giulio / Moreno, Asunción / Shammass, Shaunie / Ziegenhain, Ute:
"Large lexica for speech-to-speech translation: from specification to creation",
1529-1532.
Oflazer, Kemal / Inkelas, Sharon:
"A pronunciation lexicon for turkish based on two-level morphology",
1533-1536.
Zheng, Hong / Lu, Yiqing:
"Using both global and local hidden Markov models for automatic speech unit segmentation",
1537-1540.
Heuvel, Henk van den / Choukri, Khalid / Hoge, Harald / Maegaard, Bente / Odijk, Jan / Mapelli, Valerie:
"Quality control of language resources at ELRA",
1541-1544.
Bael, Christophe van / Binnenpoorte, Diana / Strik, Helmer / Heuvel, Henk van den:
"Validation of phonetic transcriptions based on recognition performance",
1545-1548.
Hernaez, I. / Luengo, I. / Navas, E. / Zubizarreta, M. / Gaminde, I. / Sanchez, J.:
"The basque speech_dat (II) database: a description and first test recognition results",
1549-1552.
Maase, Jens / Hirschfeld, Diane / Koloska, Uwe / Westfeld, Timo / Helbig, Jorg:
"Towards an evaluation standard for speech control concepts in real-world scenarios",
1553-1556.
Draxler, Chr.:
"Orientel: recording telephone speech of turkish speakers in Germany",
1557-1560.
Backfried, Gerhard / Caldes, Roser Jaquemot:
"Spanish broadcast news transcription",
1561-1564.
Digalakis, Vassilios / Oikonomidis, Dimitrios / Pratsolis, D. / Tsourakis, N. / Vosnidis, C. / Chatzichrisafis, N. / Diakoloukas, V.:
"Large vocabulary continuous speech recognition in greek: corpus and an automatic dictation system",
1565-1568.
Daubias, Philippe / Deleglise, Paul:
"The LIUM-AVS database : a corpus to test lip segmentation and speechreading systems in natural conditions",
1569-1572.
Salor, Ozgul / Pellom, Bryan / Demirekler, Mubeccel:
"Implementation and evaluation of a text-to-speech synthesis system for turkish",
1573-1576.
Kolar, Jachym / Romportl, Jan / Psutka, Josef:
"The czech speech and prosody database both for ASR and TTS purposes",
1577-1580.
Kishida, Itsuki / Irie, Yuki / Yamaguchi, Yukiko / Matsubara, Shigeki / Kawaguchi, Nobuo / Inagaki, Yasuyoshi:
"Construction of an advanced in-car spoken dialogue corpus and its characteristic analysis",
1581-1584.
Jones, Douglas A. / Wolf, Florian / Gibson, Edward / Williams, Elliott / Fedorenko, Evelina / Reynolds, Douglas A. / Zissman, Marc:
"Measuring the readability of automatic speech-to-text transcripts",
1585-1588.
Mana, Nadia / Burger, Susanne / Cattoni, Roldano / Besacier, Laurent / MacLaren, Victoria / McDonough, John / Metze, Florian:
"The NESPOLE! voIP multilingual corpora in tourism and medical domains",
1589-1592.
Conejero, David / Gimenez, Jesus / Arranz, Victoria / Bonafonte, Antonio / Pascual, Neus / Castell, Nuria / Moreno, Asunción:
"Lexica and corpora for speech-to-speech translation: a trilingual approach",
1593-1596.
Cieri, Christopher / Miller, David / Walker, Kevin:
"From switchboard to fisher: telephone collection protocols, their uses and yields",
1597-1600.
Meister, Einar / Lasn, Jurgen / Meister, Lya:
"Development of the estonian speechdat-like database",
1601-1604.
Serralheiro, Antonio / Trancoso, Isabel / Caseiro, Diamantino / Chambel, Teresa / Carrico, Luis / Guimaraes, Nuno:
"Towards a repository of digital talking books",
1605-1608.
Strassel, Stephanie / Miller, David / Walker, Kevin / Cieri, Christopher:
"Shared resources for robust speech-to-text technology",
1609-1612.
Towards Synthesizing Expressive Speech
Campbell, Nick:
"Towards synthesising expressive speech; designing and collecting expressive speech data",
1637-1640.
Banziger, Tanja / Morel, Michel / Scherer, Klaus R.:
"Is there an emotion signature in intonational patterns? and can it be used in synthesis?",
1641-1644.
Eide, E. / Bakis, R. / Hamza, W. / Pitrelli, J.:
"Multilayered extensions to the speech synthesis markup language for describing expressiveness",
1645-1648.
Black, Alan W.:
"Unit selection and emotional speech",
1649-1652.
d'Alessandro, Christophe / Doval, Boris:
"Voice quality modification for emotional speech synthesis",
1653-1656.
Santen, Jan P.H. van / Black, Lois / Cohen, Gilead / Kain, Alexander B. / Klabbers, Esther / Mishra, Taniya / Villiers, Jacques de / Niu, Xiaochuan:
"Applications of computer generated expressive speech for communication disorders",
1657-1660.
Speaker Verification
Leeuwen, David A. van:
"Speaker verification systems and security considerations",
1661-1664.
Hebert, Matthieu / Heck, Larry P.:
"Phonetic class-based speaker verification",
1665-1668.
Suhadi, Suhadi / Stan, Sorel / Fingscheidt, Tim / Beaugeant, Christophe:
"An evaluation of VTS and IMM for speaker verification in noise",
1669-1672.
Ganchev, Todor / Tasoulis, Dimitris K. / Vrahatis, Michael N. / Fakotakis, Nikos:
"Locally recurrent probabilistic neural network for text-independent speaker verification",
1673-1676.
Li, Stan Z. / Zhang, Dong / Ma, Chengyuan / Shum, Heung-Yeung / Chang, Eric:
"Learning to boost GMM based speaker verification",
1677-1680.
Yu, Eric W.M. / Mak, Man-Wai / Sit, Chin-Hung / Kung, Sun-Yuan:
"Speaker verification based on g.729 and g.723.1 coder parameters and handset mismatch compensation",
1681-1684.
Dialog System Generation
Whittaker, Stephen / Walker, Marilyn / Maloor, Preetam:
"Should i tell all?: an experiment on conciseness in spoken dialogue",
1685-1688.
Meng, Helen M. / Yip, Wing Lin / Mok, Oi Yan / Chan, Shuk Fong:
"Natural language response generation in mixed-initiative dialogs using task goals and dialog acts",
1689-1692.
Hirose, Keikichi / Tago, Junji / Minematsu, Nobuaki:
"Speech generation from concept for realizing conversation with an agent in a virtual room",
1693-1696.
Walker, Marilyn / Prasad, Rashmi / Stent, Amanda:
"A trainable generator for recommendations in multimodal dialog",
1697-1700.
Kawahara, Tatsuya / Ito, Ryosuke / Komatani, Kazunori:
"Spoken dialogue system for queries on appliance manuals using hierarchical confirmation strategy",
1701-1704.
Kallulli, Dalina:
"SAG: a procedural tactical generator for dialog systems",
1705-1708.
Robust Speech Recognition 1-4
Luo, Yu / Du, Limin:
"A hidden Markov model-based missing data imputation approach",
1765-1768.
Yamada, Takeshi / Okada, Jiro / Takeda, Kazuya / Kitaoka, Norihide / Fujimoto, Masakiyo / Kuroiwa, Shingo / Yamamoto, Kazumasa / Nishiura, Takanobu / Mizumachi, Mitsunori / Nakamura, Satoshi:
"Integration of noise reduction algorithms for Aurora2 task",
1769-1772.
Singh, Rita / Warmuth, Manfred K. / Raj, Bhiksha / Lamere, Paul:
"Classification with free energy at raised temperatures",
1773-1776.
Ding, Pei / Shi, Bertram E. / Fung, Pascale / Cao, Zhigang:
"Flooring the observation probability for robust ASR in impulsive noise",
1777-1780.
Fujimoto, Masakiyo / Ariki, Yasuo:
"Combination of temporal domain SVD based speech enhancement and GMM based speech estimation for ASR in noise - evaluation on the AURORA2 task -",
1781-1784.
Fousek, Petr / Pollak, Petr:
"Additive noise and channel distortion-robust parametrization tool - performance evaluation on Aurora 2 & 3",
1785-1788.
Dupont, Stephane / Ris, Christophe:
"Robust feature extraction and acoustic modeling at multitel: experiments on the Aurora databases",
1789-1792.
Kotnik, Bojan / Kacic, Zdravko / Horvat, Bogomir:
"Noise robust speech parameterization based on joint wavelet packet decomposition and autoregressive modeling",
1793-1796.
Couvreur, Christophe / Gedge, Oren / Linhard, Klaus / Shammass, Shaunie / Vantieghem, Johan:
"Database adaptation for ASR in cross-environmental conditions in the SPEECON project",
1797-1800.
Motlicek, Petr / Cernocký, Jan:
"Autoregressive modeling based feature extraction for Aurora3 DSR task",
1801-1804.
Trentin, Edmondo / Matassoni, Marco / Gori, Marco:
"Evaluation on the Aurora 2 database of acoustic models that are less noise-sensitive",
1805-1808.
Macias-Guarasa, J. / Ordonez, J. / Montero, J.M. / Ferreiros, J. / Cordoba, R. / D'Haro, L.F.:
"Revisiting scenarios and methods for variable frame rate analysis in automatic speech recognition",
1809-1812.
Parveen, Shahla / Green, Phil:
"Multitask learning in connectionist robust ASR using recurrent neural networks",
1813-1816.
Misra, Hemant / Morris, Andrew:
"Confusion matrix based entropy correction in multi-stream combination",
1817-1820.
Zhang, Huayun / Han, Zhaobing / Xu, Bo:
"Dynamic channel compensation based on maximum a posteriori estimation",
2137-2140.
Docio-Fernandez, Laura / Gelbart, David / Morgan, Nelson:
"Far-field ASR on inexpensive microphones",
2141-2144.
Tsuge, Satoru / Kuroiwa, Shingo / Kita, Kenji:
"Evaluation of ETSI advanced DSR front-end and bias removal method on the Japanese newspaper article sentences speech corpus",
2145-2148.
Soon, Chng Chin / Andrassy, Bernt / Bauer, Josef / Ruske, Gunther:
"Environment adaptive control of noise reduction parameters for improved robustness of ASR",
2149-2152.
Denda, Yuki / Nishiura, Takanobu / Kawahara, Hideki:
"Speech enhancement with microphone array and fourier / wavelet spectral subtraction in real noisy environments",
2153-2156.
Nishiura, Takanobu / Nakamura, Satoshi / Miki, Kazuhiro / Shikano, Kiyohiro:
"Environmental sound source identification based on hidden Markov model for robust speech recognition",
2157-2160.
Jancovic, Peter / Kokuer, Munevver / Murtagh, Fionn:
"High-likelihood model based on reliability statistics for robust combination of features: application to noisy speech recognition",
2161-2164.
Demiroglu, Cenk / Anderson, David V.:
"Noise robust digit recognition with missing frames",
2165-2168.
Cui, Xiaodong / Bernard, Alexis / Alwan, Abeer:
"A noise-robust ASR back-end technique based on weighted viterbi recognition",
2169-2172.
Ghulam, Muhammad / Fukuda, Takashi / Nitta, Tsuneo:
"Voice quality normalization in an utterance for robust ASR",
2173-2176.
Akbacak, Murat / Hansen, John H.L.:
"Environmental sniffing: robust digit recognition for an in-vehicle environment",
2177-2180.
Hwang, Tai-Hwei:
"Energy contour extraction for in-car speech recognition",
2181-2184.
Fukuda, Takashi / Nitta, Tsuneo:
"Noise-robust ASR by using distinctive phonetic features approximated with logarithmic normal distribution of HMM",
2185-2188.
Fukuda, Takashi / Nitta, Tsuneo:
"Noise-robust automatic speech recognition using orthogonalized distinctive phonetic feature vectors",
2189-2192.
Yoma, Nestor Becerra / Brito, Ivan / Silva, Jorge:
"Language model accuracy and uncertainty in noise cancelling in the stochastic weighted viterbi algorithm",
2193-2196.
Eneman, Koen / Duchateau, Jacques / Moonen, Marc / Compernolle, Dirk van / Hamme, Hugo van:
"Assessment of dereverberation algorithms for large vocabulary speech recognition systems",
2689-2692.
Milner, Ben P. / James, A.B.:
"Analysis and compensation of packet loss in distributed speech recognition using interleaving",
2693-2696.
Milner, Ben P.:
"Non-linear compression of feature vectors using transform coding and non-uniform bit allocation",
2697-2700.
Chien, Jen-Tzung / Furui, Sadaoki:
"Predictive hidden Markov model selection for decision tree state tying",
2701-2704.
Nakadai, Kazuhiro / Matsuura, Daisuke / Okuno, Hiroshi G. / Tsujino, Hiroshi:
"Three simultaneous speech recognition by integration of active audition and face recognition for humanoid",
2705-2708.
Fujinaga, Katsuhisa / Kokubo, Hiroaki / Yamamoto, Hirofumi / Kikui, Genichiro / Shimodaira, Hiroshi:
"Mis-recognized utterance detection using multiple language models generated by clustered sentences",
2709-2712.
Sun, Hui / Zhang, Guoliang / Zheng, Fang / Xu, Mingxing:
"Using word confidence measure for OOV words detection in a spontaneous spoken dialog system",
2713-2716.
Manabe, Hiroyuki / Hiraiwa, Akira / Sugimura, Toshiaki:
"Speech recognition using EMG; mime speech recognition",
2717-2720.
Jitsuhiro, Takatoshi / Matsui, Tomoko / Nakamura, Satoshi:
"Automatic generation of non-uniform context-dependent HMM topologies based on the MDL criterion",
2721-2724.
Kitaoka, Norihide / Shingu, Masahisa / Nakagawa, Seiichi:
"Comparison of effects of acoustic and language knowledge on spontaneous speech perception/recognition between human and automatic speech recognizer",
2725-2728.
Gorrell, Genevieve:
"Using statistical language modelling to identify new vocabulary in a grammar-based speech recognition system",
2729-2732.
Gomez, Angel M. / Peinado, Antonio M. / Sanchez, Victoria / Rubio, Antonio J.:
"A source model mitigation technique for distributed speech recognition over lossy packet channels",
2733-2736.
Russell, Martin J. / Jackson, Philip J.B.:
"The effect of an intermediate articulatory layer on the performance of a segmental HMM",
2737-2740.
Liu, Yi / Fung, Pascale:
"Automatic phone set extension with confidence measure for spontaneous speech",
2741-2744.
Paredes, R. / Sanchis, A. / Vidal, E. / Juan, A.:
"Utterance verification using an optimized k-nearest neighbour classifier",
2745-2748.
Fu, Guokang / Li, Ta-Hsin:
"A segment-based algorithm of speech enhancement for robust speech recognition",
3029-3032.
Gemello, Roberto / Mana, Franco / Albesano, Dario / Mori, Renato De:
"Robust multiple resolution analysis for automatic speech recognition",
3033-3036.
Afify, Mohamed:
"An accurate noise compensation algorithm in the log-spectral domain for robust speech recognition",
3037-3040.
Ramirez, Javier / Segura, Jose C. / Benitez, Carmen / Torre, Angel de la / Rubio, Antonio J.:
"A new adaptive long-term spectral estimation voice activity detector",
3041-3044.
Carey, Michael J.:
"Robust speech recognition using non-linear spectral smoothing",
3045-3048.
Miao, Cailian / Wang, Yangsheng:
"A novel use of residual noise model for modified PMC",
3049-3052.
Cerisara, Christophe / Illina, Irina:
"Robust speech recognition to non-stationary noise based on model-driven approaches",
3053-3056.
Cerisara, Christophe:
"Towards missing data recognition with cepstral features",
3057-3060.
Haverinen, Hemmo / Kiss, Imre:
"On-line parametric histogram equalization techniques for noise robust embedded speech recognition",
3061-3064.
Yu, An-Tze / Wang, Hsiao-Chuan:
"Compensation of channel distortion in line spectrum frequency domain",
3065-3068.
Martin, Arnaud / Mauuary, Laurent:
"Voicing parameter and energy based speech/non-speech detection for speech recognition in adverse conditions",
3069-3072.
Hamme, Hugo van:
"Two correction models for likelihoods in robust speech recognition using missing feature theory",
3073-3076.
Sujatha, J. / Kumar, K.R. Prasanna / Ramakrishnan, K.R. / Balakrishnan, N.:
"Spectral maxima representation for robust automatic speech recognition",
3077-3080.
Endo, Toshiki / Kuroiwa, Shingo / Nakamura, Satoshi:
"Missing feature theory applied to robust speech recognition over IP network",
3081-3084.
Tolba, Hesham / Selouani, Sid-Ahmed / O'Shaughnessy, Douglas:
"Comparative experiments to evaluate the use of auditory-based acoustic distinctive features and formant cues for robust automatic speech recognition in low-SNR car environments",
3085-3088.
Hamme, Hugo van:
"Robust speech recognition using missing feature theory in the cepstral or LDA domain",
3089-3092.
Liao, Yuan-Fu / Lin, Jeng-Shien / Tsai, Wei-Ho:
"Bandwidth mismatch compensation for robust speech recognition",
3093-3096.
Morris, Robert W. / Arrowood, Jon A. / Clements, Mark A.:
"Markov chain monte carlo methods for noise robust feature extraction using the autoregressive model",
3097-3100.
Hilario, Joan Mari / Class, Fritz:
"A comparative study of some discriminative feature reduction algorithms on the AURORA 2000 and the daimlerchrysler in-car ASR tasks",
3101-3104.
Speech Recognition - Large Vocabulary 1, 2
Psutka, Josef / Ircing, Pavel / Psutka, J.V. / Radova, Vlasta / Byrne, William J. / Hajic, Jan / Mirovsky, Jiri / Gustman, Samuel:
"Large vocabulary ASR for spontaneous czech in the MALACH project",
1821-1824.
Riccardi, Giuseppe / Hakkani-Tur, Dilek Z.:
"Active and unsupervised learning for automatic speech recognition",
1825-1828.
Yapanel, Umit H. / Dharanipragada, Satya / Hansen, John H.L.:
"Perceptual MVDR-based cepstral coefficients (PMCCs) for high accuracy speech recognition",
1829-1832.
Gao, Sheng / Lee, Chin-Hui:
"A discriminative decision tree learning approach to acoustic modeling",
1833-1836.
Nguyen, Patrick / Rigazio, Luca / Junqua, Jean-Claude:
"Large corpus experiments for broadcast news recognition",
1837-1840.
Jitapunkul, Somchai / Maneenoi, Ekkarit / Ahkuputra, Visarut / Luksaneeyanawin, Sudaporn:
"Performance evaluation of phonotactic and contextual onset-rhyme models for speech recognition of Thai language",
1841-1844.
Qian, Yao / Lee, Tan / Li, Yujia:
"Overlapped di-tone modeling for tone recognition in continuous Cantonese speech",
1845-1848.
Nishida, Masafumi / Kawahara, Tatsuya:
"Speaker model selection using Bayesian information criterion for speaker indexing and speaker adaptation",
1849-1852.
Sturm, Janienke / Kessens, Judith M. / Wester, Mirjam / Wet, Febe de / Sanders, Eric / Strik, Helmer:
"Automatic transcription of football commentaries in the MUMIS project",
1853-1856.
Peters, S. Douglas:
"On the limits of cluster-based acoustic modeling",
1857-1860.
Lyu, Dau-Cheng / Liang, Min-Siong / Chiang, Yuang-Chin / Hsu, Chun-Nan / Lyu, Ren-Yuan:
"Large vocabulary taiwanese (min-nan) speech recognition using tone features and statistical pronunciation modeling",
1861-1864.
Dognin, Pierre L. / El-Jaroudi, Amro:
"A new spectral transformation for speaker normalization",
1865-1868.
Yu, Hua / Schultz, Tanja:
"Enhanced tree clustering with single pronunciation dictionary for conversational speech recognition",
1869-1872.
Ircing, Pavel / Psutka, Josef:
"Fitting class-based language models into weighted finite-state transducer framework",
1873-1876.
Lefevre, Fabrice / Gauvain, Jean-Luc / Lamel, Lori:
"Multi-source training and adaptation for generic speech recognition",
1877-1880.
Kingsbury, Brian / Mangu, Lidia / Saon, George / Zweig, Geoffrey / Axelrod, Scott / Goel, Vaibhava / Visweswariah, Karthik / Picheny, Michael:
"Toward domain-independent conversational speech recognition",
1881-1884.
Zhang, Rong / Rudnicky, Alexander I.:
"Comparative study of boosting and non-boosting training for constructing ensembles of acoustic models",
1885-1888.
Ding, Peng / Chen, Zhenbiao / Hu, Sheng / Zhang, Shuwu / Xu, Bo:
"Discriminative optimization of large vocabulary Mandarin conversational speech recognition system",
1965-1968.
Schalkwyk, Johan / Hetherington, Lee / Story, Ezra:
"Speech recognition with dynamic grammars using finite-state transducers",
1969-1972.
Demuynck, Kris / Laureys, Tom / Compernolle, Dirk van / Hamme, Hugo van:
"FLavor: a flexible architecture for LVCSR",
1973-1976.
Saon, George / Zweig, Geoffrey / Kingsbury, Brian / Mangu, Lidia / Chaudhari, Upendra:
"An architecture for rapid decoding of large vocabulary conversational speech",
1977-1980.
Povey, D. / Gales, M.J.F. / Kim, D.Y. / Woodland, P.C.:
"MMI-MAP and MPE-MAP for acoustic model adaptation",
1981-1984.
Doumpiotis, Vlasios / Tsakalidis, Stavros / Byrne, William J.:
"Lattice segmentation and minimum Bayes risk discriminative training",
1985-1988.
Robust Methods in Processing of Natural Language Dialogues
Zechner, Klaus:
"Spoken language condensation in the 21st century",
1989-1992.
Furui, Sadaoki:
"Robust methods in automatic speech recognition and understanding",
1993-1998.
Delmonte, Rodolfo:
"Parsing spontaneous speech",
1999-2004.
Speaker Identification
Reynolds, Douglas A.:
"Model compression for GMM based speaker recognition systems",
2005-2008.
Navratil, Jiri / Ramaswamy, Ganesh N.:
"The awe and mystery of t-norm",
2009-2012.
Bonastre, Jean-Francois / Morin, Philippe / Junqua, Jean-Claude:
"Gaussian dynamic warping (GDW) method applied to text-dependent speaker detection and verification",
2013-2016.
Ferrer, Luciana / Bratt, Harry / Gadde, Venkata R.R. / Kajarekar, Sachin S. / Shriberg, Elizabeth / Sonmez, Kemal / Stolcke, Andreas / Venkataraman, Anand:
"Modeling duration patterns for speaker recognition",
2017-2020.
Lucey, Simon / Chen, Tsuhan:
"Improved speaker verification through probabilistic subspace adaptation",
2021-2024.
Yu, Peng / Seide, Frank / Ma, Chengyuan / Chang, Eric:
"An improved model-based speaker segmentation system",
2025-2028.
Speech Synthesis: Miscellaneous 1, 2
Bellegarda, Jerome R.:
"A latent analogy framework for grapheme-to-phoneme conversion",
2029-2032.
Chen, Stanley F.:
"Conditional and joint models for grapheme-to-phoneme conversion",
2033-2036.
Pfister, Beat / Romsdorfer, Harald:
"Mixed-lingual text analysis for polyglot TTS synthesis",
2037-2040.
Zhang, Jason Y. / Black, Alan W. / Sproat, Richard:
"Identifying speakers in children's stories for speech synthesis",
2041-2044.
Stevens, Catherine / Lees, Nicole / Vonwiller, Julie:
"Experimental tools to evaluate intelligibility of text-to-speech (TTS) synthesis: effects of voice gender and signal quality",
2045-2048.
Tomokiyo, Laura Mayfield / Black, Alan W. / Lenzo, Kevin A.:
"Arabic in my hand: small-footprint synthesis of egyptian arabic",
2049-2052.
Bennett, Christina L. / Black, Alan W.:
"Using acoustic models to choose pronunciation variations for synthetic voices",
2937-2940.
Yan, Qin / Vaseghi, Saeed / Ho, Ching-Hsiang / Rentzos, Dimitrios / Turajlic, Emir:
"Comparative analysis and synthesis of formant trajectories of british and broad australian accents",
2941-2944.
Ramirez, Miguel Arjona:
"Cycle extraction for perfect reconstruction and rate scalability",
2945-2948.
Teixeira, Antonio / Jesus, Luis M.T. / Martinez, Roberto:
"Adding fricatives to the portuguese articulatory synthesiser",
2949-2952.
Iriondo, Ignasi / Alias, Francesc / Sanchis, Javier / Melenchon, Javier:
"A hybrid method oriented to concatenative text-to-speech synthesis",
2953-2956.
Zhao, Yong / Chu, Min / Peng, Hu / Chang, Eric:
"Custom-tailoring TTS voice font - keeping the naturalness when reducing database size",
2957-2960.
Speech Perception
Srinivasan, Soundararajan / Wang, DeLiang:
"Schema-based modeling of phonemic restoration",
2053-2056.
Kuwabara, Hisao:
"Perception of voice-individuality for distortions of resonance/source characteristics and waveforms",
2057-2060.
Sato, Tsutomu:
"The perceptual cues of a high level pitch-accent pattern in Japanese: pitch-accent patterns and duration",
2061-2064.
Iwaki, Mamoru / Nakamura, Norio:
"Illusory continuity of intermittent pure tone in binaural listening and its dependency on interaural time difference",
2065-2068.
Minematsu, Nobuaki / Guo, Changchen / Hirose, Keikichi:
"CART-based factor analysis of intelligibility reduction in Japanese English",
2069-2072.
Toth, Laszlo / Kocsor, Andras:
"Harmonic alternatives to sine-wave speech",
2073-2076.
Picovici, Dorel / Mahdi, Abdulhussain E.:
"Non-intrusive assessment of perceptual speech quality using a self-organising map",
2077-2080.
Dufour, Sophie / Peereman, Ronald:
"Inhibitory priming effect in auditory word recognition: the role of the phonological mismatch length between primes and targets",
2081-2084.
Scharenborg, Odette / Bosch, Louis ten / Boves, Lou:
"Recognising `real-life' speech with spem: a speech-based computational model of human speech recognition",
2085-2088.
Rosenhouse, Judith / Kishon-Rabin, Liat:
"The effect of speech rate and noise on bilinguals' speech perception: the case of native speakers of arabic in israel",
2089-2092.
Turk, Oytun / Arslan, Levent M.:
"Subjective evaluations for perception of speaker identity through acoustic feature transplantations",
2093-2096.
Scharenborg, Odette / McQueen, James M. / Bosch, Louis ten / Norris, Dennis:
"Modelling human speech recognition using automatic speech recognition paradigms in speM",
2097-2100.
Saito, Mutsumi / Shiraishi, Kimio / Fukudome, Kimitoshi:
"The effect of amplitude compression on wide band telephone speech for hearing-impaired elderly people",
2101-2104.
Otake, Takashi / Komatsu, Miki:
"Word activation model by Japanese school children without knowledge of roman alphabet",
2105-2108.
Harding, Sue / Meyer, Georg:
"Multi-resolution auditory scene analysis: robust speech recognition using pattern-matching from a noisy signal",
2109-2112.
Matsui, Hisami / Kawahara, Hideki:
"Investigation of emotionally morphed speech perception and its structure using a high quality speech manipulation system",
2113-2116.
Paliwal, Kuldip K. / Alsteris, Leigh:
"Usefulness of phase spectrum in human speech perception",
2117-2120.
Tokuma, Shinichi:
"Perception of English lexical stress by English and Japanese speakers: effect of duration and "realistic" intensity change",
2121-2124.
Welby, Pauline:
"French intonational rises and their role in speech seg mentation [sic]",
2125-2128.
Tokuma, Won:
"Physical and perceptual configurations of Japanese fricatives from multidimensional scaling analyses",
2129-2132.
Au, Ching-Pong:
"An acquisition model of speech perception with considerations of temporal information",
2133-2136.
Multi-Modal Processing and Speech Interface Design
Potamitis, Ilyas / Georgila, K. / Fakotakis, Nikos / Kokkinakis, George:
"An integrated system for smart-home control of appliances based on remote speech interaction",
2197-2200.
Jin, Jianhong / Russell, Martin J. / Carey, Michael J. / Chapman, James / Lloyd-Thomas, Harvey / Tattersall, Graham:
"A spoken language interface to an electronic programme guide",
2201-2204.
Lopes, L. Seabra / Teixeira, Antonio / Rodrigues, M. / Gomes, D. / Teixeira, C. / Ferreira, L. / Soares, P. / Girao, J. / Senica, N.:
"Towards a personal robot with language interface",
2205-2208.
Williams, Jason D. / Shaw, Andrew T. / Piano, Lawrence / Abt, Michael:
"Preference, perception, and task completion of open, menu-based, and directed prompts for call routing: a case study",
2209-2212.
Hatzis, Athanassios / Green, Phil / Carmichael, James / Cunningham, Stuart / Palmer, Rebecca / Parker, Mark / O'Neill, Peter:
"An integrated toolkit deploying speech technology for computer based speech training with application to dysarthric speakers",
2213-2216.
Suhm, Bernhard:
"Towards best practices for speech user interface design",
2217-2220.
Stallard, David / Makhoul, John / Choi, Frederick / Macrostie, Ehry / Natarajan, Premkumar / Schwartz, Richard / Zawaydeh, Bushra:
"Design and evaluation of a limited two-way speech translator",
2221-2224.
Dusan, Sorin / Gadbois, Gregory J. / Flanagan, James:
"Multimodal interaction on PDA's integrating speech and pen inputs",
2225-2228.
Gieselmann, Petra / Denecke, Matthias:
"Towards multimodal interaction with an intelligent room",
2229-2232.
Pieraccini, Roberto / Dayanidhi, Krishna / Bloom, Jonathan / Dahan, Jean-Gui / Phillips, Michael / Goodman, Bryan R. / Prasad, K. Venkatesh:
"A multimodal conversational interface for a concept vehicle",
2233-2236.
Ma, L. / Smith, D.J. / Milner, Ben P.:
"Context awareness using environmental noise classification",
2237-2240.
Shiraishi, Tatsuya / Toda, Tomoki / Kawanami, Hiromichi / Saruwatari, Hiroshi / Shikano, Kiyohiro:
"Simple designing methods of corpus-based visual speech synthesis",
2241-2244.
Sturm, Janienke / Bakx, Ilse / Cranen, Bert / Terken, Jacques:
"Comparing the usability of a user driven and a mixed initiative multimodal dialogue system for train timetable information",
2245-2248.
Massaro, Dominic W. / Light, Joanna:
"Read my tongue movements: bimodal learning to perceive and produce non-native speech /r/ and /l/",
2249-2252.
Perez, Jesus F. Guitarte / Lukas, Klaus / Frangi, Alejandro F.:
"Low resource lip finding and tracking algorithm for embedded devices",
2253-2256.
Asano, Futoshi / Motomura, Yoichi / Asoh, Hideki / Yoshimura, Takashi / Ichimura, Naoyuki / Yamamoto, Kiyoshi / Kitawaki, Nobuhiko / Nakamura, Satoshi:
"Detection and separation of speech segment using audio and video information fusion",
2257-2260.
Engwall, Olov / Beskow, Jonas:
"Resynthesis of 3d tongue movements from facial data",
2261-2264.
Trippel, Thorsten / Sasaki, Felix / Hell, Benjamin / Gibbon, Dafydd:
"Acquiring lexical information from multilevel temporal annotations",
2265-2268.
Cosi, Piero / Fusaro, Andrea / Tisato, Graziano:
"LUCIA a new italian talking-head based on a modified cohen-massaro's labial coarticulation model",
2269-2272.
Mukherjee, Niloy / Roy, Deb:
"A visual context-aware multimodal system for spoken language processing",
2273-2276.
Speech Recognition - Language Modeling
Piantanida, Juan P. / Estienne, Claudio F.:
"Maximum entropy good-turing estimator for language modeling",
2277-2280.
Li, Xiaolong / Zhao, Yunxin:
"Exploiting order-preserving perfect hashing to speedup n-gram language model lookahead",
2281-2284.
Oikonomidis, Dimitrios / Digalakis, Vassilios:
"Stem-based maximum entropy language models for inflectional languages",
2285-2288.
Krbec, Pavel / Podvesky, Petr / Hajic, Jan:
"Combination of a hidden tag model and a traditional n-gram model: a case study in czech speech recognition",
2289-2292.
Siivola, Vesa / Hirsimaki, Teemu / Creutz, Mathias / Kurimo, Mikko:
"Unlimited vocabulary speech recognition based on morphs discovered in an unsupervised manner",
2293-2296.
Szarvas, Mate / Furui, Sadaoki:
"Evaluation of the stochastic morphosyntactic language model on a one million word hungarian dictation task",
2297-2300.
Feature Analysis and Cross-Language Processing of Chinese Spoken Language
Lee, Lin-shan / Chen, Shun-Chuan:
"Automatic title generation for Chinese spoken documents considering the special structure of the language",
2325-2328.
Xu, Bo / Zhang, Shuwu / Zong, Chengqing:
"Statistical speech-to-speech translation with multilingual speech recognition and bilingual-chunk parsing",
2329-2332.
Du, Limin / Chen, Boxing:
"Automatic extraction of bilingual chunk lexicon for spoken language translation",
2333-2336.
Lo, Wai-Kit / Li, Yuk-Chi / Levow, Gina / Wang, Hsin-Min / Meng, Helen M.:
"Multi-scale document expansion in English-Mandarin cross-language spoken document retrieval",
2337-2340.
Tseng, Chiu-yu:
"Mandarin speech prosody: issues, pitfalls and directions",
2341-2344.
Li, Aijun / Wang, Xia:
"A contrastive investigation of standard Mandarin and accented Mandarin",
2345-2348.
Tao, Jianhua:
"Emotion control of Chinese speech synthesis in natural environment",
2349-2352.
Speech Production and Physiology
Leonov, A.S. / Sorokin, V.N.:
"Optimality criteria in inverse problems for tongue-jaw interaction",
2353-2356.
Sasaki, Koji / Miki, Nobuhiro / Miyanaga, Yoshikazu:
"FEM analysis based on 3-d time-varying vocal tract shape",
2357-2360.
Dang, Jianwu / Honda, Kiyoshi:
"Consideration of muscle co-contraction in a physiological articulatory model",
2361-2364.
Manfredi, Claudia / Peretti, Giorgio:
"Robust techniques for pre- and post-surgical voice analysis",
2365-2368.
Schnell, K. / Lacroix, A.:
"Analysis of lossy vocal tract models for speech production",
2369-2372.
Khioe, Beatrice Fung-Wah:
"Temporal properties of the nasals and nasalization in Cantonese",
2373-2376.
Bettens, F. / Grenez, F. / Schoentgen, J.:
"Estimation of vocal noise in running speech by means of bi-directional double linear prediction",
2377-2380.
Mahdi, Abdulhussain E.:
"Visualisation of the vocal tract based on estimation of vocal area functions and formant frequencies",
2381-2384.
Sciamarella, Denisse / d'Alessandro, Christophe:
"Reproducing laryngeal mechanisms with a two-mass model",
2385-2388.
Bostik, Milan / Sigmund, Milan:
"Methods for estimation of glottal pulses waveforms exciting voiced speech",
2389-2392.
Zhang, Zhaoyan / Espy-Wilson, Carol / Tiede, Mark:
"Acoustic modeling of american English lateral approximants",
2393-2396.
Takano, Sayoko / Honda, Kiyoshi / Masaki, Shinobu / Shimada, Yasuhiro / Fujimoto, Ichiro:
"Translation and rotation of the cricothyroid joint revealed by phonation-synchronized high-resolution MRI",
2397-2400.
Speech Synthesis: Voice Conversion and Miscellaneous Topics
Kawanami, Hiromichi / Iwami, Yohei / Toda, Tomoki / Saruwatari, Hiroshi / Shikano, Kiyohiro:
"GMM-based voice conversion applied to emotional speech synthesis",
2401-2404.
Rentzos, Dimitrios / Vaseghi, Saeed / Yan, Qin / Ho, Ching-Hsiang / Turajlic, Emir:
"Probability models of formant parameters for voice conversion",
2405-2408.
Ye, Hui / Young, Steve:
"Perceptually weighted linear transformations for voice conversion",
2409-2412.
Chen, Yining / Chu, Min / Chang, Eric / Liu, Jia / Liu, Runsheng:
"Voice conversion with smoothed GMM and MAP adaptation",
2413-2416.
Salor, Ozgul / Demirekler, Mubeccel / Pellom, Bryan:
"A system for voice conversion based on adaptive filtering and line spectral frequency distance optimization for text-to-speech synthesis",
2417-2420.
Mori, Hiroki / Kasuya, Hideki:
"Speaker conversion in ARX-based source-formant type speech synthesis",
2421-2424.
Breen, Andrew P. / Minnis, Steve / Eggleton, Barry:
"Implementing an SSML compliant concatenative TTS system",
2425-2428.
Gu, Zhenglai / Mori, Hiroki / Kasuya, Hideki:
"Acoustic variations of focused disyllabic words in Mandarin Chinese: analysis, synthesis and perception",
2429-2432.
Quintana-Morales, Pedro / Navarro-Mesa, Juan L.:
"An approach to common acoustical pole and zero modeling of consecutive periods of voiced speech",
2433-2436.
Deng, Huiqun / Beddoes, Michael / Ward, Rabab / Hodgson, Murray:
"Estimating the vocal-tract area function and the derivative of the glottal wave from a speech signal",
2437-2440.
Zolfaghari, Parham / Nakatani, Tomohiro / Irino, Toshio / Kawahara, Hideki / Itakura, Fumitada:
"Glottal closure instant synchronous sinusoidal model for high quality speech analysis/synthesis",
2441-2444.
Karjalainen, Matti:
"Mixed physical modeling techniques applied to speech production",
2445-2448.
Fagel, Sascha / Sendlmeier, Walter F.:
"An expandable web-based audiovisual text-to-speech synthesis system",
2449-2452.
Nikleczy, P. / Olaszy, G.:
"A reconstruction of farkas kempelen's speaking machine",
2453-2456.
Gu, Wentao / Hirose, Keikichi:
"Acoustic model selection and voice quality assessment for HMM-based Mandarin speech synthesis",
2457-2460.
Yamagishi, Junichi / Onishi, Koji / Masuko, Takashi / Kobayashi, Takao:
"Modeling of various speaking styles and emotions for HMM-based speech synthesis",
2461-2464.
Maia, R. da S. / Zen, Heiga / Tokuda, Keiichi / Kitamura, Tadashi / Resende Jr., F.G.V.:
"Towards the development of a brazilian portuguese text-to-speech system based on HMM",
2465-2468.
Vozila, Paul / Adams, Jeff / Lobacheva, Yuliya / Thomas, Ryan:
"Grapheme to phoneme conversion and dictionary verification using graphonemes",
2469-2472.
Fackrell, Justin / Skut, Wojciech / Hammervold, Kathrine:
"Improving the accuracy of pronunciation prediction for unit selection TTS",
2473-2476.
Mishra, Taniya / Klabbers, Esther / Santen, Jan P.H. van:
"Detection of list-type sentences",
2477-2480.
Acoustic Modelling 1, 2
Prieto, Ramon / Jiang, Jing / Choi, Chi-Ho:
"A new pitch synchronous time domain phoneme recognizer using component analysis and pitch clustering",
2481-2484.
Kojima, Hiroaki / Tanaka, Kazuyo:
"Mixed-lingual spoken word recognition by using VQ codebook sequences of variable length segments",
2485-2488.
Lahti, Tommi / Viikki, Olli / Vasilache, Marcel:
"Low memory acoustic models for HMM based speech recognition",
2489-2492.
Fonollosa, Jose A.R.:
"Nearest-neighbor search algorithms based on subcodebook selection and its application to speech recognition",
2493-2496.
Omar, Mohamed Kamal / Hasegawa-Johnson, Mark:
"Non-linear maximum likelihood feature transformation for speech recognition",
2497-2500.
Suk, Soo-Young / Jung, Ho-Youl / Chung, Hyun-Yeol:
"Automatic generation of context-independent variable parameter models using successive state and mixture splitting",
2501-2504.
Zgank, Andrej / Kacic, Zdravko / Horvat, Bogomir:
"Data driven generation of broad classes for decision tree construction in acoustic modeling",
2505-2508.
Olsen, Peder A. / Dharanipragada, Satya:
"An efficient integrated gender detection scheme and time mediated averaging of gender dependent acoustic models",
2509-2512.
Ogata, Jun / Ariki, Yasuo:
"Syllable-based acoustic modeling for Japanese spontaneous speech recognition",
2513-2516.
Cetin, Ozgur / Ostendorf, Mari:
"Cross-stream observation dependencies for multi-stream speech recognition",
2517-2520.
Mak, Brian / Chan, Kin-Wah:
"Pruning transitions in a hidden Markov model with optimal brain surgeon",
2521-2524.
Magimai-Doss, Mathew / Stephenson, Todd A. / Bourlard, Hervé:
"Using pitch frequency information in speech recognition",
2525-2528.
Livescu, Karen / Glass, James / Bilmes, Jeff:
"Hidden feature models for speech recognition using dynamic Bayesian networks",
2529-2532.
Hu, Wei / Zhang, Yimin / Diao, Qian / Huang, Shan:
"An efficient viterbi algorithm on DBNs",
2533-2536.
Zhang, Li / Edmondson, William:
"Speech recognition based on syllable recovery",
2537-2540.
Abu-Amer, Tarek / Carson-Berndsen, Julie:
"HARTFEX: a multi-dimensional system of HMM based recognisers for articulatory features extraction",
2541-2544.
Maison, Benoit:
"Automatic baseform generation from acoustic data",
2545-2548.
Spiess, Thurid / Wrede, Britta / Fink, Gernot A. / Kummert, Franz:
"Data-driven pronunciation modeling for ASR using acoustic subword units",
2549-2552.
Vanhoucke, Vincent / Sankar, Ananth:
"Variable length mixtures of inverse covariances",
2605-2608.
Neukirchen, Christoph:
"Semi-tied full deviation matrices for laplacian density models",
2609-2612.
Visweswariah, Karthik / Axelrod, Scott / Gopinath, Ramesh:
"Acoustic modeling with mixtures of subspace constrained exponential models",
2613-2616.
Goel, Vaibhava / Axelrod, Scott / Gopinath, Ramesh / Olsen, Peder A. / Visweswariah, Karthik:
"Discriminative estimation of subspace precision and mean (SPAM) models",
2617-2620.
Yoshizawa, Shinichi / Shikano, Kiyohiro:
"Model-integration rapid training based on maximum likelihood for speech recognition",
2621-2624.
Lima, Amaro / Zen, Heiga / Nankaku, Yoshihiko / Miyajima, Chiyomi / Tokuda, Keiichi / Kitamura, Tadashi:
"On the use of kernel PCA for feature extraction in speech recognition",
2625-2628.
Time is of the Essence - Dynamic Approaches to Spoken Language
Greenberg, Steven:
"Time is of the essence - dynamic approaches to spoken language",
2553-2556.
Grant, Ken W. / Greenberg, Steven:
"Spectro-temporal interactions in auditory and auditory-visual speech processing",
2557-2560.
Poeppel, David:
"Brain imaging correlates of temporal quantization in spoken language",
2561-2564.
Saltzman, Elliot:
"Temporal aspects of articulatory control",
2565-2568.
Keller, Brigitte Zellner:
"The temporal organisation of speech as gauged by speech synthesis",
2569-2572.
Kleinschmidt, Michael:
"Localized spectro-temporal features for automatic speech recognition",
2573-2576.
Atlas, Les:
"Modulation spectral filtering of speech",
2577-2580.
Topics in Speech Recognition
Moore, Roger K.:
"A comparison of the data requirements of automatic speech recognition systems and human listeners",
2581-2584.
Tang, Min / Seneff, Stephanie / Zue, Victor W.:
"Modeling linguistic features in speech recognition",
2585-2588.
Ramabhadran, Bhuvana / Huang, Jing / Chaudhari, Upendra / Iyengar, Giridharan / Nock, Harriet J.:
"Impact of audio segmentation and segment clustering on automated transcription accuracy of large spoken archives",
2589-2592.
Beaufays, Francoise / Sankar, Ananth / Williams, Shaun / Weintraub, Mitch:
"Learning linguistically valid pronunciations from acoustic data",
2593-2596.
Minematsu, Nobuaki / Osaki, Koichi / Hirose, Keikichi:
"Improvement of non-native speech recognition by effectively modeling frequently observed pronunciation habits",
2597-2600.
Nakajima, Yoshitaka / Kashioka, Hideki / Shikano, Kiyohiro / Campbell, Nick:
"Non-audible murmur recognition",
2601-2604.
Speaker and Language Recognition
Mami, Yassine / Charlet, Delphine:
"Speaker modeling from selected neighbors applied to speaker recognition",
2629-2632.
Zetterholm, Elisabeth / Sullivan, Kirk P.H. / Green, James / Eriksson, Erik / Doorn, Jan van / Czigler, Peter E.:
"Who knows carl bildt? - and what if you don't?",
2633-2636.
Vivaracho-Pascual, C. / Ortega-Garcia, J. / Alonso-Romero, L. / Moro-Sancho, Q.:
"Improving the competitiveness of discriminant neural networks in speaker verification",
2637-2640.
Kinnunen, Tomi / Hautamaki, Ville / Franti, Pasi:
"On the fusion of dissimilarity-based classifiers for speaker identification",
2641-2644.
Ming, Ji / Stewart, Darryl / Hanna, Philip / Corr, Pat / Smith, Jack / Vaseghi, Saeed:
"Robust speaker identification using posterior union models",
2645-2648.
Zilca, Ran D. / Navratil, Jiri / Ramaswamy, Ganesh N.:
""syncpitch": a pseudo pitch synchronous algorithm for speaker recognition",
2649-2652.
Kwon, Soonil / Narayanan, Shrikanth:
"A method for on-line speaker indexing using generic reference models",
2653-2656.
Mihoubi, M. / Boulianne, Gilles / Dumouchel, Pierre:
"Discriminative training and maximum likelihood detector for speaker identification",
2657-2660.
Kajarekar, Sachin S. / Adami, Andre G. / Hermansky, Hynek:
"Novel approaches for one- and two-speaker detection",
2661-2664.
Campbell, Joseph P. / Reynolds, Douglas A. / Dunn, Robert B.:
"Fusing high- and low-level features for speaker recognition",
2665-2668.
Sivakumaran, P. / Fortuna, J. / Ariyaeeinia, Aladdin M.:
"Score normalisation applied to open-set, text-independent speaker identification",
2669-2672.
Arcienega, Mijail / Drygajlo, Andrzej:
"On the number of Gaussian components in a mixture: an application to speaker verification tasks",
2673-2676.
Salvi, Giampiero:
"Using accent information in ASR models for Swedish",
2677-2680.
Nakajima, Hideharu / Nagata, Masaaki / Asano, Hisako / Abe, Masanobu:
"Estimating Japanese word accent from syllable sequence using support vector machine",
2681-2684.
Cordoba, R. / Prime, G. / Macias-Guarasa, J. / Montero, J.M. / Ferreiros, J. / Pardo, J.M.:
"PPRLM optimization for language identification in air traffic control tasks",
2685-2688.
Spoken Language Understanding and Translation
Chen, Hsin-Hsi:
"Spoken cross-language access to image collection via captions",
2749-2752.
Jamoussi, Salma / Smaili, Kamel / Haton, Jean-Paul:
"Understanding process for speech recognition",
2753-2756.
Takezawa, Toshiyuki / Kikui, Genichiro:
"Collecting machine-translation-aided bilingual dialogues for corpus-based speech translation",
2757-2760.
Wutiwiwatchai, Chai / Furui, Sadaoki:
"Combination of finite state automata and neural network for spoken language understanding",
2761-2764.
Horlock, James / King, Simon:
"Discriminative methods for improving named entity extraction on speech data",
2765-2768.
Gu, Liang / Gao, Yuqing / Picheny, Michael:
"Improving statistical natural concept generation in interlingua-based speech-to-speech translation",
2769-2772.
Goulian, Jerome / Antoine, Jean-Yves / Poirier, Franck:
"How NLP techniques can improve speech understanding: ROMUS - a robust chunk based message understanding system using link grammars",
2773-2776.
Chelba, Ciprian / Acero, Alex:
"Discriminative training of n-gram classifiers for speech and text routing",
2777-2780.
Honal, Matthias / Schultz, Tanja:
"Correction of disfluencies in spontaneous speech using a noisy-channel approach",
2781-2784.
Koumpis, Konstantinos / Renals, Steve:
"Multi-class extractive voicemail summarization",
2785-2788.
Tur, Gokhan / Rahim, Mazin / Hakkani-Tur, Dilek Z.:
"Active labeling for spoken language understanding",
2789-2792.
Tur, Gokhan / Hakkani-Tur, Dilek Z.:
"Exploiting unlabeled utterances for spoken language understanding",
2793-2796.
Liu, Fu-Hua / Gao, Yuqing / Gu, Liang / Picheny, Michael:
"Noise robustness in speech to speech translation",
2797-2800.
Siu, K.C. / Meng, Helen M. / Wong, C.C.:
"Example-based bi-directional Chinese-English machine translation with semi-automatically induced grammars",
2801-2804.
Wrede, Britta / Shriberg, Elizabeth:
"Spotting "hot spots" in meetings: human judgments and prosodic cues",
2805-2808.
Wang, Ye-Yi / Acero, Alex:
"Combination of CFG and n-gram modeling in semantic grammar learning",
2809-2812.
Chen, Shun-Chuan / Lee, Lin-shan:
"Automatic title generation for Chinese spoken documents using an adaptive k nearest-neighbor approach",
2813-2816.
Hori, Takaaki / Hori, Chiori / Minami, Yasuhiro:
"Speech summarization using weighted finite-state transducers",
2817-2820.
Lee, Yun-Tien / Chen, Shun-Chuan / Lee, Lin-shan:
"Cross domain Chinese speech understanding and answering based on named-entity extraction",
2821-2824.
Hori, Chiori / Hori, Takaaki / Furui, Sadaoki:
"Evaluation method for automatic speech summarization",
2825-2828.
Li, Li / Liu, Feng / Chou, Wu:
"An information theoretic approach for using word cluster information in natural language call routing",
2829-2832.
Sista, Sreenivasa / Srivastava, Amit / Kubala, Francis / Schwartz, Richard:
"Unsupervised topic discovery applied to segmentation of news transcriptions",
2833-2836.
Towards a Roadmap for Speech Technology
Heisterkamp, Paul:
""do not attempt to light with match!": some thoughts on progress and research goals in spoken dialog systems",
2897-2900.
Granstrom, Bjorn / House, David:
"Multimodality and speech technology: verbal and non-verbal communication in talking agents",
2901-2904.
Cole, Ronald A.:
"Roadmaps, journeys and destinations speculations on the future of speech technology research",
2905-2908.
Moore, Roger K.:
"Spoken language output: realising the vision",
2909-2912.
Speaker Recognition and Verification
Kenny, P. / Mihoubi, M. / Dumouchel, Pierre:
"New MAP estimators for speaker recognition",
2961-2964.
Moreno, Pedro J. / Ho, Purdy P.:
"A new SVM approach to speaker identification and verification using probabilistic distance kernels",
2965-2968.
Cheung, Ming-Cheung / Mak, Man-Wai / Kung, Sun-Yuan:
"Adaptive decision fusion for multi-sample speaker verification over GSM networks",
2969-2972.
Yiu, Kwok-Kwong / Mak, Man-Wai / Kung, Sun-Yuan:
"Environment adaptation for robust speaker verification",
2973-2976.
Zigel, Yaniv / Cohen, Arnon:
"On cohort selection for speaker verification",
2977-2980.
Tadj, C. / Benlahouar, A.:
"Speaker characterization using principal component analysis and wavelet transform for speaker verification",
2981-2984.
Akita, Yuya / Kawahara, Tatsuya:
"Unsupervised speaker indexing using anchor models and automatic transcription of discussions",
2985-2988.
Scherer, Klaus R. / Grandjean, D. / Johnstone, T. / Klasmeyer, G. / Banziger, Tanja:
"A statistical approach to assessing speech and voice variability in speaker verification",
2989-2992.
Tsai, Wei-Ho / Wang, Hsin-Min / Rodgers, Dwight:
"Automatic singer identification of popular music recordings via estimation and modeling of solo vocal signal",
2993-2996.
Vescovi, Michele / Cettolo, Mauro / Rizzi, Romeo:
"A DP algorithm for speaker change detection",
2997-3000.
Lapidot, Itshak:
"SOM as likelihood estimator for speaker clustering",
3001-3004.
Minematsu, Nobuaki / Yamauchi, Keita / Hirose, Keikichi:
"Automatic estimation of perceptual age using speaker modeling techniques",
3005-3008.
Rifkin, Ryan:
"Speaker recognition using local models",
3009-3012.
Vogt, Robbie / Pelecanos, Jason / Sridharan, Sridha:
"Dependence of GMM adaptation on feature post-processing for speaker recognition",
3013-3016.
Nakagawa, Seiichi / Zhang, Wei:
"Text-independent speaker recognition by speaker-specific GMM and speaker adapted syllable-based HMM",
3017-3020.
Padrta, Ales / Radova, Vlasta:
"On the amount of speech data necessary for successful speaker identification",
3021-3024.
Turk, Ulrich / Schiel, Florian:
"Speaker verification based on the German veridat database",
3025-3028.
Multi-Lingual Spoken Language Processing
Fischer, V. / Janke, E. / Kunzmann, S.:
"Recent progress in the decoding of non-native speech with multilingual acoustic models",
3105-3108.
Kuo, Wei-Chih / Lin, Li-Feng / Wang, Yih-Ru / Chen, Sin-Horng:
"An NN-based approach to prosodic information generation for synthesizing English words embedded in Chinese text",
3109-3112.
Matsunaga, S. / Ogawa, A. / Yamaguchi, Yoshikazu / Imamura, A.:
"Speaker adaptation for non-native speakers using bilingual English lexicon and acoustic models",
3113-3116.
Le, Viet Bac / Bigi, Brigitte / Besacier, Laurent / Castelli, Eric:
"Using the web for fast language model construction in minority languages",
3117-3120.
Cheng, Yan Ming / Liu, Chen / Wei, Yuan-Jun / Melnar, Lynette / Ma, Changxue:
"An approach to multilingual acoustic modeling for portable devices",
3121-3124.
Martin, Terrence / Svendsen, Torbjorn / Sridharan, Sridha:
"Cross-lingual pronunciation modelling for indonesian speech recognition",
3125-3128.
Kim, Woosung / Khudanpur, Sanjeev:
"Language model adaptation using cross-lingual information",
3129-3132.
Wong, Eddie / Martin, Terrence / Svendsen, Torbjorn / Sridharan, Sridha:
"Multilingual phone clustering for recognition of spontaneous indonesian speech utilising pronunciation modelling techniques",
3133-3136.
Srinivasamurthy, Naveen / Narayanan, Shrikanth:
"Language-adaptive persian speech recognition",
3137-3140.
Killer, Mirjam / Stuker, Sebastian / Schultz, Tanja:
"Grapheme based speech recognition",
3141-3144.
Interdisciplinary
Petrushin, Valery A.:
"Learning Chinese tones",
3145-3148.
Hirose, Keikichi / Gendrin, Frédéric / Minematsu, Nobuaki:
"A pronunciation training system for Japanese lexical accents with corrective feedback in learner's voice",
3149-3152.
Mouri, Taro / Hirose, Keikichi / Minematsu, Nobuaki:
"Considerations on vowel durations for Japanese CALL system",
3153-3156.
Kato, Hiroaki / Nukinay, Masumi / Kawaharay, Hideki / Akahane-Yamada, Reiko:
"Influence of recording equipment on the identification of second language phoneme contrasts",
3157-3160.
Tam, Yik-Cheung / Mostow, Jack / Beck, Joseph E. / Banerjee, Satanjeev:
"Training a confidence measure for a reading tutor that listens",
3161-3164.
Banerjee, Satanjeev / Beck, Joseph E. / Mostow, Jack:
"Evaluating the effect of predicting oral reading miscues",
3165-3168.
Holada, Miroslav / Nouza, Jan:
"VISPER II - enhanced version of the educational software for speech processing courses",
3169-3172.
Lu, Meirong / Takagi, Kazuyuki / Ozeki, Kazuhiko:
"The use of multiple pause information in dependency structure analysis of spoken Japanese sentences",
3173-3176.
Takagi, Kazuyuki / Okimoto, Mamiko / Ogawa, Yoshio / Ozeki, Kazuhiko:
"A neural network approach to dependency analysis of Japanese sentences using prosodic information",
3177-3180.
Asano, Hisako / Nagata, Masaaki / Abe, Masanobu:
"Say-as classification for alphabetic words in Japanese texts",
3181-3184.
Ishihara, Kazushi / Tsubota, Yasushi / Okuno, Hiroshi G.:
"Automatic transformation of environmental sounds into sound-imitation words based on Japanese syllable structure",
3185-3188.
Zen, Heiga / Tokuda, Keiichi / Kitamura, Tadashi:
"Decision tree-based simultaneous clustering of phonetic contexts, dimensions, and state positions for acoustic modeling",
3189-3192.
Nakagawa, Seiichi / Mori, Kazumasa / Nakamura, Naoki:
"A statistical method of evaluating pronunciation proficiency for English words spoken by Japanese",
3193-3196.