Keynotes
Fitch, W. Tecumseh:
"The evolution of spoken language: a comparative approach",
1-8.
Young, Steve:
"Talking to machines (statistically speaking)",
9-16.
Speech Recognition in Noise - I
Macho, Duncan / Mauuary, Laurent / Noé, Bernhard / Cheng, Yan Ming / Ealey, Doug / Jouvet, Denis / Kelleher, Holly / Pearce, David / Saadoun, Fabien:
"Evaluation of a noise-robust DSR front-end on Aurora databases",
17-20.
Adami, Andre / Burget, Lukás / Dupont, Stephane / Garudadri, Hari / Grezl, Frantisek / Hermansky, Hynek / Jain, Pratibha / Kajarekar, Sachin / Morgan, Nelson / Sivadas, Sunil:
"Qualcomm-ICSI-OGI features for ASR",
21-24.
Kleinschmidt, Michael / Gelbart, David:
"Improving word accuracy with Gabor feature extraction",
25-28.
Droppo, Jasha / Deng, Li / Acero, Alex:
"Evaluation of SPLICE on the Aurora 2 and 3 tasks",
29-32.
Mak, Brian / Tam, Yik-Cheung:
"Performance of discriminatively trained auditory features on Aurora2 and Aurora3",
33-36.
Segura, José C. / Benítez, M.C. / Torre, Ángel de la / Rubio, Antonio J.:
"Feature extraction combining spectral noise reduction and cepstral histogram equalization for robust ASR",
225-228.
Chen, Jingdong / Dimitriadis, Dimitris / Jiang, Hui / Li, Qi / Myrvoll, Tor André / Siohan, Olivier / Soong, Frank K.:
"Bell labs approach to Aurora evaluation on connected digit recognition",
229-232.
Kim, Hong Kook / Rose, Richard C.:
"Algorithms for distributed speech recognition in a noisy automobile environment",
233-236.
Hilger, Florian / Molau, Sirko / Ney, Hermann:
"Quantile based histogram equalization for online applications",
237-240.
Chen, Chia-Ping / Filali, Karim / Bilmes, Jeff A.:
"Frontend post-processing and backend model enhancement on the Aurora 2.0/3.0 databases",
241-244.
Ida, Masaki / Nakamura, Satoshi:
"HMM COmposition-based rapid model adaptation using a priori noise GMM adaptation evaluation on Aurora2 corpus",
437-440.
Hung, Jeih-weih / Lee, Lin-shan:
"Data-driven temporal filters obtained via different optimization criteria evaluated on Aurora2 database",
441-444.
Kotnik, Bojan / Vlaj, Damjan / Kacic, Zdravko / Horvat, Bogomir:
"Efficient additive and convolutional noise reduction procedures",
445-448.
Lieb, Markus / Fischer, Alexander:
"Progress with the philips continuous ASR system on the Aurora 2 noisy digits database",
449-452.
Wu, Jian / Huo, Qiang:
"An environment compensated minimum classification error training approach and its evaluation on Aurora2 database",
453-456.
Yao, Kaisheng / Zhu, Dong-Lai / Nakamura, Satoshi:
"Evaluation of a noise adaptive speech recognition system on the Aurora 3 database",
457-460.
Docío-Ferández, Laura / García-Mateo, Carmen:
"Distributed speech recognition over IP networks on the Aurora 3 database",
461-464.
Fujimoto, M. / Ariki, Yasuo:
"Evaluation of noisy speech recognition based on noise reduction and acoustic model adaptation on the Aurora2 tasks",
465-468.
Saon, George / Huerta, Juan M.:
"Improvements to the IBM Aurora 2 multi-condition system",
469-472.
Jain, Pratibha / Hermansky, Hynek / Kingsbury, Brian:
"Distributed speech recognition using noise-robust MFCC and traps-estimated manner features",
473-476.
Kitaoka, Norihide / Nakagawa, Seiichi:
"Evaluation of spectral subtraction with smoothing of time direction on the Aurora 2 task",
477-480.
Cui, Xiaodong / Iseli, Markus / Zhu, Qifeng / Alwan, Abeer:
"Evaluation of noise robust features on the Aurora databases",
481-484.
Evans, Nicholas W. D. / Mason, John S.:
"Computationally efficient noise compensation for robust automatic speech recognition assessed under the Aurora 2/3 framework",
485-488.
Farooq, O. / Datta, S.:
"Mel-scaled wavelet filter based features for noisy unvoiced phoneme recognition",
1017-1020.
Onoe, Kazuo / Segi, Hiroyuki / Kobayakawa, Takeshi / Sato, Shoei / Imai, Toru / Ando, Akio:
"Filter bank subtraction for robust speech recognition",
1021-1024.
Morris, Andrew C. / Payne, Simon / Bourlard, Hervé:
"Low cost duration modelling for noise robust speech recognition",
1025-1028.
Gong, Yifan:
"A comparative study of approximations for parallel model combination of static and dynamic parameters",
1029-1032.
Motícek, Petr / Burget, Lukás:
"Noise estimation for efficient speech enhancement and robust speech recognition",
1033-1036.
Çetin, Özgür / Nock, Harriet J. / Kirchhoff, Katrin / Bilmes, Jeff A. / Ostendorf, Mari:
"The 2001 GMTK-based SPINE ASR system",
1037-1040.
Hung, Wei-Wen:
"Using adaptive signal limiter together with weighting techniques for noisy speech recognition",
1041-1044.
Yamade, Shingo / Matsunami, Kanako / Baba, Akira / Lee, Akinobu / Saruwatari, Hiroshi / Shikano, Kiyohiro:
"Spectral subtraction in noisy environments applied to speaker adaptation based on HMM sufficient statistics",
1045-1048.
Siu, Manhung / Chan, Yu-Chung:
"Robust speech recognition against short-time noise",
1049-1052.
Toma, M. / Lodi, A. / Guerrieri, R.:
"Word endpoints detection in the presence of non-stationary noise",
1053-1056.
Pujol Marsal, Pere / Pol Font, Susagna / Hagen, Astrid / Bourlard, Hervé / Nadeu, Climent:
"Comparison and combination of RASTA-PLP and FF features in a hybrid HMM/MLP speech recognition system",
1057-1060.
Xu, Tao / Cao, Zhigang:
"Robust MMSE-FW-LAASR scheme at low SNRs",
1061-1064.
Zolnay, András / Schlüter, Ralf / Ney, Hermann:
"Robust speech recognition using a voiced-unvoiced feature",
1065-1068.
Wet, Febe de / Veth, Johan de / Cranen, Bert / Boves, Lou:
"Accumulated kullback divergence for analysis of ASR performance in the presence of noise",
1069-1072.
Kingsbury, Brian / Jain, Pratibha / Adami, Andre:
"A hybrid HMM/traps model for robust voice activity detection",
1073-1076.
Zheng, Chengyi / Yan, Yonghong:
"Run time information fusion in speech recognition",
1077-1080.
Arrowood, Jon A. / Clements, Mark A.:
"Using observation uncertainty in HMM decoding",
1561-1564.
Stuttle, M. N. / Gales, M. J. F.:
"Combining a Gaussian mixture model front end with MFCC parameters",
1565-1568.
Droppo, Jasha / Acero, Alex / Deng, Li:
"Noise from corrupted speech log mel-spectral energies",
1569-1572.
Lima, Carlos / Almeida, Luís B. / Monteiro, João L.:
"Improving the role of unvoiced speech segments by spectral normalisation in robust speech recognition",
1573-1576.
Gadde, Venkata Ramana Rao / Stolcke, Andreas / Vergyri, Dimitra / Zheng, Jing / Sönmez, Kemal / Venkataraman, Anand:
"Building an ASR system for noisy environments: SRI’s 2001 SPINE evaluation system",
1577-1580.
Experimental Phonetics
Son, Rob J. J. H. van / Pols, Louis C. W.:
"Evidence for efficiency in vowel production",
37-40.
Aylett, Matthew P.:
"Stochastic suprasegmentals: relationship between the spectral characteristics of vowels, redundancy and prosodic structure",
41-44.
Serkhane, J. / Schwartz, Jean-Luc / Boë, Louis Jean / Davis, B. / Matyear, C.:
"Motor specifications of a baby robot via the analysis of infants² vocalizations",
45-48.
Koenig, Laura L. / Lucero, Jorge C.:
"Oral-laryngeal control patterns for fricatives in 5-year-olds and adults",
49-52.
Delvaux, Véronique / Metens, Thierry / Soquet, Alain:
"French nasal vowels: acoustic and articulatory properties",
53-56.
Speech Recognition: Adaptation
Kenny, P. / Boulianne, G. / Dumouchel, Pierre:
"Maximum likelihood estimation of eigenvoices and residual variances for large vocabulary speech recognition tasks",
57-60.
Pusateri, Ernest J. / Hazen, Timothy J.:
"Rapid speaker adaptation using speaker clustering",
61-64.
Huang, Chao / Chen, Tao / Chang, Eric:
"Adaptive model combination for dynamic speaker selection training",
65-68.
Kwan, Ka-Yan / Lee, Tan / Yang, Chen:
"Unsupervised n-best based model adaptation using model-level confidence measures",
69-72.
Nguyen, Patrick / Rigazio, Luca / Wellekens, Christian / Junqua, Jean-Claude:
"LU factorization for feature transformation",
73-76.
Ding, Guo-Hong / Zhu, Yi-Fei / Li, Chengrong / Xu, Bo:
"Implementing vocal tract length normalization in the MLLR framework",
1389-1392.
Kim, Dong Kook / Kim, Nam Soo:
"Markov models based on speaker space model evolution",
1393-1396.
Li, Baojie / Hirose, Keikichi / Minematsu, Nobuaki:
"Robust speech recognition using inter-speaker and intra-speaker adaptation",
1397-1400.
Lima, Carlos / Almeida, Luís B. / Monteiro, João L.:
"Continuous environmental adaptation of a speech recogniser in telephone line conditions",
1401-1404.
Illina, Irina:
"Tree-structured maximum a posteriori adaptation for a segment-based speech recognition system",
1405-1408.
Plötz, Thomas / Fink, Gernot A.:
"Robust time-synchronous environmental adaptation for continuous speech recognition systems",
1409-1412.
Niesler, Thomas / Willett, Daniel:
"Unsupervised language model adaptation for lecture speech transcription",
1413-1416.
Li, Yongxin / Erdogan, Hakan / Gao, Yuqing / Marcheret, Etienne:
"Incremental on-line feature space MLLR adaptation for telephony speech recognition",
1417-1420.
Molau, Sirko / Hilger, Florian / Keysers, Daniel / Ney, Hermann:
"Enhanced histogram normalization in the acoustic feature space",
1421-1424.
Levin, David N.:
"Blind normalization of speech from different channels and speakers",
1425-1428.
Ogata, Jun / Ariki, Yasuo:
"Unsupervised acoustic model adaptation based on phoneme error minimization",
1429-1432.
Zhou, Bowen / Hansen, John H. L.:
"Improved structural maximum likelihood eigenspace mapping for rapid speaker adaptation",
1433-1436.
Torre, Ángel de la / Fohr, Dominique / Haton, Jean-Paul:
"Statistical adaptation of acoustic models to noise conditions for robust speech recognition",
1437-1440.
Brugnara, F. / Cettolo, M. / Federico, M. / Giuliani, D.:
"Issues in automatic transcription of historical audio data",
1441-1444.
Language Identification
Stockmal, Verna / Bond, Zinny S.:
"Same talker, different language: a replication",
77-80.
Jayram, A. K. V. Sai / Ramasubramanian, V. / Sreenivas, T. V.:
"Automatic language identification using acoustic sub-word units",
81-84.
Maddieson, Ian / Vasilescu, Ioana:
"Factors in human language identification",
85-88.
Torres-Carrasquillo, Pedro A. / Singer, Elliot / Kohler, Mary A. / Greene, Richard J. / Reynolds, Douglas A. / Deller Jr., J. R.:
"Approaches to language identification using Gaussian mixture models and shifted delta cepstral features",
89-92.
Wong, Eddie / Sridharan, Sridha:
"Methods to improve Gaussian mixture model based language identification system",
93-96.
Speech Synthesis
Jing, Hongyan / Tzoukermann, Evelyne:
"Part-of-speech tagging in French text-to-speech synthesis: experiments in tagset selection",
97-100.
Uebler, Ulla:
"Grapheme-to-phoneme conversion using pseudo-morphological units",
101-104.
Bisani, M. / Ney, Hermann:
"Investigations on joint-multigram models for grapheme-to-phoneme conversion",
105-108.
Galescu, Lucian / Allen, James F.:
"Pronunciation of proper names with a joint n-gram model for bi-directional grapheme-to-phoneme conversion",
109-112.
Jilka, Matthias / Syrdal, Ann K.:
"The AT&t German text-to-speech system: realistic linguistic description",
113-116.
Li, Haiping / Chen, Fangxin / Shen, Liqin:
"Generating script using statistical information of the context variation unit vector",
117-120.
Kuo, Chih-Chung / Huang, Jing-Yi:
"Efficient and scalable methods for text script generation in corpus-based TTS design",
121-124.
Rutten, Peter / Aylett, Matthew P. / Fackrell, Justin / Taylor, Paul:
"A statistically motivated database pruning technique for unit selection synthesis",
125-128.
Wu, Yi-Jian / Hu, Yu / Wu, Xiaoru / Wang, Ren-Hua:
"A new method of building decision tree based on target information",
129-132.
Yamagishi, Junichi / Tamura, Masatsune / Masuko, Takashi / Tokuda, Keiichi / Kobayashi, Takao:
"A context clustering technique for average voice model in HMM-based speech synthesis",
133-136.
Tsuzaki, Minoru / Kawai, Hisashi:
"Feature extraction for unit selection in concatenative speech synthesis: comparison between AIM, LPC, and MFCC",
137-140.
Campillo-Díaz, Francisco / Banga, Eduardo R.:
"Combined prosody and candidate unit selections for corpus-based text-to-speech systems",
141-144.
Kim, Yeon-Jun / Conkie, Alistair:
"Automatic segmentation combining an HMM-based approach and spectral boundary correction",
145-148.
Sethy, Abhinav / Narayanan, Shrikanth S.:
"Refined speech segmentation for concatenative speech synthesis",
149-152.
Breen, Andrew / Eggleton, Barry / Dion, Peter / Minnis, Steve:
"Refocussing on the text normalisation process in text-to-speech systems",
153-156.
Vepa, Jithendra / Ayachitam, Jahnavi / Reddy, K. V. K. Kalpana:
"A text-to-speech synthesis system for telugu",
157-160.
Freitas, Diamantino / Braga, Daniela:
"Towards an intonation module for a portuguese TTS system",
161-164.
Saito, Takashi / Sakamoto, Masaharu:
"Applying a hybrid intonation model to a seamless speech synthesizer",
165-168.
Hirai, Toshio / Tenpaku, Seiichi / Shikano, Kiyohiro:
"Using start/end timings of spectral transitions between phonemes in concatenative speech synthesis",
2357-2360.
Ni, Jinfu / Kawai, Hisashi:
"Design of a Mandarin sentence set for corpus-based speech synthesis by use of a multi-tier algorithm taking account of the varied prosodic and spectral characteristics",
2361-2364.
Mori, Hiroki / Ohtsuka, Takahiro / Kasuya, Hideki:
"A data-driven approach to source-formant type text-to-speech system",
2365-2368.
Shi, Yu / Chang, Eric / Peng, Hu / Chu, Min:
"Power spectral density based channel equalization of large speech database for concatenative TTS system",
2369-2372.
Meng, Helen M. / Keung, Chi Kin / Siu, Kai Chung / Fung, Tien Ying / Ching, P. C.:
"CU VOCAL: corpus-based syllable concatenation for Chinese speech synthesis across domains and dialects",
2373-2376.
Lu, Jinlin / Kawai, Hisashi:
"Perceptual evaluation of naturalness due to substitution of Chinese syllable for concatenative speech synthesis",
2377-2380.
Chazan, Dan / Hoory, Ron / Kons, Zvi / Silberstein, Dorel / Sorin, Alexander:
"Reducing the footprint of the IBM trainable speech synthesis system",
2381-2384.
Lee, Sung-Joo / Kim, Hyung Soon:
"Computationally efficient time-scale modification of speech using 3 level clipping",
2385-2388.
Shuang, Zhi-Wei / Hu, Yu / Ling, Zhen-Hua / Wang, Ren-Hua:
"A miniature Chinese TTS system based on tailored corpus",
2389-2392.
Song, Hoeun / Kim, Jaein / Lee, Kyongrok / Kim, Jinyoung:
"Phonetic normalization using z-score in segmental prosody estimation for corpus-based TTS system",
2393-2396.
Kawahara, Hideki / Zolfaghari, Parham / Cheveigné, Alain de:
"On F0 trajectory optimization for very high-quality speech manipulation",
2397-2400.
Lee, Tan / Kochanski, Greg / Shih, Chilin / Li, Yujia:
"Modeling tones in continuous Cantonese speech",
2401-2404.
Dong, Minghui / Lua, Kim-Teng:
"Pitch contour model for Chinese text-to-speech using CART and statistical model",
2405-2408.
Navas, Eva / Hernáez, Inmaculada / Sánchez, Juan María:
"Basque intonation modelling for text to speech conversion",
2409-2412.
Low, Phuay Hui / Vaseghi, Saeed:
"Application of microprosody models in text to speech synthesis",
2413-2416.
Zhao, Sheng / Tao, Jianhua / Cai, Lianhong:
"Prosodic phrasing with inductive learning",
2417-2420.
Milner, Ben / Shao, Xu:
"Speech reconstruction from mel-frequency cepstral coefficients using a source-filter model",
2421-2424.
Kawanami, Hiromichi / Masuda, Tsuyoshi / Toda, Tomoki / Shikano, Kiyohiro:
"Designing Japanese speech database covering wide range in prosody for hybrid speech synthesizer",
2425-2428.
Multimodal Spoken Language Processing
Bühler, Dirk / Minker, Wolfgang / Häußler, Jochen / Krüger, Sven:
"Flexible multimodal human-machine interaction in mobile environments",
169-172.
Kaiser, Edward C. / Cohen, Philip R.:
"Implementation testing of a hybrid symbolic/statistical multimodal architecture",
173-176.
Yamakata, Yoko / Kawahara, Tatsuya / Okuno, Hiroshi G.:
"Belief network based disambiguation of object reference in spoken dialogue system for robot",
177-180.
Beskow, Jonas / Edlund, Jens / Nordstrand, Magnus:
"Specification and realisation of multimodal output in dialogue systems",
181-184.
Quek, Francis / Xiong, Yingen / McNeill, David:
"Gestural trajectory symmetries and discourse segmentation",
185-188.
Quek, Francis / McNeill, David / Bryll, Robert / Harper, Mary:
"Gestural spatialization in natural discourse segmentation",
189-192.
Nakadai, Kazuhiro / Okuno, Hiroshi G. / Kitano, Hiroaki:
"Real-time sound source localization and separation for robot audition",
193-196.
Ma, Jiyong / Yan, Jie / Cole, Ronald:
"CU animate tools for enabling conversations with animated characters",
197-200.
Cohen, Philip R. / Coulston, Rachel / Krout, Kelly:
"Multiparty multimodal interaction: a preliminary analysis",
201-204.
Poller, Peter / Müller, Jochen:
"Distributed audio-visual speech synchronization",
205-208.
Daubias, Philippe / Deléglise, Paul:
"Lip-reading based on a fully automatic statistical model",
209-212.
Liu, Xiaoxing / Zhao, Yibao / Pi, Xiaobo / Liang, Luhong / Nefian, Ara V.:
"Audio-visual continuous speech recognition using a coupled hidden Markov model",
213-216.
Dybkjær, Laila / Bernsen, Niels Ole:
"Data, annotation schemes and coding tools for natural interactivity",
217-220.
Quek, Francis / Shi, Yang / Kirbas, Cemil / Wu, Shunguang:
"VisSTA: a tool for analyzing multimodal discourse data",
221-224.
Perception: Non-Native
Lambacher, Stephen / Martens, William / Kakehi, Kazuhiko:
"The influence of identification training on identification and production of the american English mid and low vowels by native speakers of Japanese",
245-248.
Tajima, Keiichi / Akahane-Yamada, Reiko / Yamada, Tsuneo:
"Perceptual learning of second-language syllable rhythm by elderly listeners",
249-252.
Clarke, Constance M.:
"Perceptual adjustment to foreign-accented English with short term exposure",
253-256.
Burnham, Denis K. / Brooker, Ron:
"Absolute pitch and lexical tones: tone perception by non-musician, musician, and absolute pitch non-tonal language speakers",
257-260.
Broersma, Mirjam:
"Comprehension of non-native speech: inaccurate phoneme processing and activation of lexical competitors",
261-264.
Dialog Systems I: Evaluation
Minker, Wolfgang:
"Overview on recent activities in speech understanding and dialogue systems evaluation",
265-268.
Walker, Marilyn A. / Rudnicky, Alexander I. / Prasad, Rashmi / Aberdeen, John / Bratt, Elizabeth Owen / Garofolo, John S. / Hastie, Helen / Le, Audrey N. / Pellom, Bryan / Potamianos, Alex / Passonneau, Rebecca / Roukos, Salim / Sanders, Gregory A. / Seneff, Stephanie / Stallard, David:
"DARPA communicator: cross-system results for the 2001 evaluation",
269-272.
Walker, Marilyn A. / Rudnicky, Alexander I. / Aberdeen, John / Bratt, Elizabeth Owen / Garofolo, John S. / Hastie, Helen / Le, Audrey N. / Pellom, Bryan / Potamianos, Alex / Passonneau, Rebecca / Prasad, Rashmi / Roukos, Salim / Sanders, Gregory A. / Seneff, Stephanie / Stallard, David:
"DARPA communicator evaluation: progress from 2000 to 2001",
273-276.
Sanders, Gregory A. / Le, Audrey N. / Garofolo, John S.:
"Effects of word error rate in the DARPA communicator data during 2000 and 2001",
277-280.
Sidner, Candace L. / Forlines, Clifton:
"Subset languages for conversing with collaborative interface agents",
281-284.
Voice Conversion
Watanabe, Tomomi / Murakami, Takahiro / Namba, Munehiro / Hoya, Tetsuya / Ishida, Yoshihisa:
"Transformation of spectral envelope for voice conversion based on radial basis function networks",
285-288.
Turk, Oytun / Arslan, Levent M.:
"Subband based voice conversion",
289-292.
Mashimo, Mikiko / Toda, Tomoki / Kawanami, Hiromichi / Kashioka, Hideki / Shikano, Kiyohiro / Campbell, Nick:
"Evaluation of cross-language voice conversion using bilingual and non-bilingual databases",
293-296.
Gustafson, Joakim / Sjölander, Kåre:
"Voice transformations for improving children²s speech recognition in a publicly available dialogue system",
297-300.
Spoken Language Resources
Burger, Susanne / MacLaren, Victoria / Yu, Hua:
"The ISL meeting corpus: the impact of meeting type on speech style",
301-304.
López-Cózar, R. / Torre, Ángel de la / Segura, José C. / Rubio, Antonio J. / López-Soler, J. M.:
"A new method for testing dialogue systems based on simulations of real-world conditions",
305-308.
Ludwig, Thorsten:
"Comfort noise detection and GSM-FR-codec detection for speech-quality evaluations in telephone networks",
309-312.
Cucchiarini, Catia / Binnenpoorte, Diana:
"Validation and improvement of automatic phonetic transcriptions",
313-316.
Aman, Shigeaki / Kato, Kazumi / Kondo, Tadahisa:
"Development of Japanese infant speech database and speaking rate analysis",
317-320.
Dong, Minghui / Lua, Kim-Teng:
"Automatic prosodic break labeling for Mandarin Chinese speech data",
321-324.
Zitouni, Imed / Olive, Joseph / Iskra, Dorota / Choukri, Khalid / Emam4, Ossama / Gedge, Oren / Maragoudakis, Emmanuel / Tropf, Herbert / Moreno, Asunción / Rodriguez, Albino Nogueiras / Heuft, Barbara / Siemund, Rainer:
"Orientel: speech-based interactive communication applications for the mediterranean and the middle east",
325-328.
Alvarez, Yolanda Vazquez / Huckvale, Mark:
"The reliability of the ITU-t p.85 standard for the evaluation of text-to-speech systems",
329-332.
Demuynck, Kris / Laureys, Tom / Gillis, Steven:
"Automatic generation of phonetic transcriptions for large speech corpora",
333-336.
Minker, Wolfgang:
"Overview on recent activities in speech understanding and dialogue systems evaluation",
337-340.
Bennett, Christina / Rudnicky, Alexander I.:
"The carnegie mellon communicator corpus",
341-344.
Schultz, Tanja:
"Globalphone: a multilingual speech and text database developed at karlsruhe university",
345-348.
Salor, Özgül / Pellom, Bryan / Çiloglu, Tolga / Hacioglu, Kadri / Demirekler, Mübeccel:
"On developing new text and audio corpora and speech recognition tools for the turkish language",
349-352.
Martell, Craig:
"FORM: an extensible, kinematically-based gesture annotation scheme",
353-356.
Hosom, John-Paul:
"Automatic phoneme alignment based on acoustic-phonetic modeling",
357-360.
Gupta, Narendra K. / Bangalore, Srinivas / Rahim, Mazin:
"Extracting clauses for spoken language understanding in conversational systems",
361-364.
Lefèvre, F. / Bonneau-Maynard, H.:
"Issues in the development of a stochastic speech understanding system",
365-368.
Pfitzinger, Hartmut R.:
"10 years of phondat-II: a reassessment",
369-372.
Speech Recognition: Search
Kumar, Shankar / Byrne, William:
"Risk based lattice cutting for segmental minimum Bayes-risk decoding",
373-376.
Wendt, Sascha / Fink, Gernot A. / Kummert, Franz:
"Dynamic search-space pruning for time-constrained speech recognition",
377-380.
Lee, Raymond H. / Choi, Eric H. C.:
"A Gaussian selection method for multi-mixture HMM based continuous speech recognition",
381-384.
Dong, Rong / Zhu, Jie:
"On use of duration modeling for continuous digits speech recognition",
385-388.
Zweig, Geoffrey / Saon, George / Yvon, F.:
"Arc minimization in finite state decoding graphs with cross-word acoustic context",
389-392.
Zheng, Jing / Franco, Horacio:
"Fast hierarchical grammar optimization algorithm toward time and space efficiency",
393-396.
Abdou, Sherif / Scordilis, Michael:
"Dynamic tuning of language model score in speech recognition using a confidence measure",
397-400.
Zhang, Xiao / Zhao, Yunxin:
"Minimum perfect hashing for fast n-gram language model lookup",
401-404.
Li, Xiang / Singh, Rita / Stern, Richard M.:
"Combining search spaces of heterogeneous recognizers for improved speech recogniton",
405-408.
Auditory Models and Hearing Aids
Pellant, Karel / Mejzlík, Jan / Prikryl, Karel / Skvor, Zdenek:
"Transmission characteristics of outer ear canal",
409-412.
Kates, James M.:
"Hearing-aid benefits and limitations: predictions from a cochlear model",
413-416.
Nelson, Peggy B. / DiGiovanni, Jeffrey J. / Schlauch, Robert S.:
"A psychoacoustic basis for spectral sharpening",
417-420.
Huettel, Lisa G. / Collins, Leslie M.:
"Model-based predictions of intensity discrimination for normal- and impaired-hearing listeners",
421-424.
Assmann, Peter F. / Nearey, Terrance M. / Scott, Jack M.:
"Modeling the perception of frequency-shifted vowels",
425-428.
Mackersie, Carol L.:
"The relationship between pure-tone sequential stream segregation and perceptual separation of male and female talkers by listeners with hearing loss",
429-432.
Johansson, Mathias / Blomberg, Mats / Elenius, Kjell / Hoffsten, Lars-Erik / Torberger, Anders:
"A phoneme recognizer for the hearing impaired",
433-436.
Multi-Lingual and Non-Native Spoken Language Processing
Fischer, V. / Janke, E. / Kunzmann, S.:
"Likelihood combination and recognition output voting for the decoding of non-native speech with multilingual HMMs",
489-492.
Angkititrakul, Pongtep / Hansen, John H. L.:
"Stochastic trajectory model analysis for accent classification",
493-496.
Tian, Jilei / Häkkinen, Juha / Viikki, Olli:
"Multilingual pronunciation modeling for improving multilingual speech recognition",
497-500.
Tian, Jilei / Häkkinen, Juha / Riis, Søren / Jensen, Kåre Jean:
"On text-based language identification for multilingual speech recognition systems",
501-504.
Ma1, Bin / Guan, Cuntai / Li, Haizhou / Lee, Chin-Hui:
"Multilingual speech recognition with language identification",
505-508.
Chengalvarayan, Rathi:
"Robust HMM training for unified dutch and German speech recognition",
509-512.
Khudanpur, Sanjeev / Kim, Woosung:
"Using cross-language cues for story-specific language modeling",
513-516.
Zhao, Bing / Vogel, Stephan:
"Full-text story alignment models for Chinese-English bilingual news corpora",
517-520.
Sooful, Jayren J. / Botha, Elizabeth C.:
"Comparison of acoustic distance measures for automatic cross-language phoneme mapping",
521-524.
He, Xiaodong / Zhao, Yunxin:
"Maximum expected likelihood based model selection and adaptation for nonnative English speakers",
525-528.
Minematsu, Nobuaki / Kurata, Gakuto / Hirose, Keikichi:
"Integration of MLLR adaptation with pronunciation proficiency adaptation for non-native speech recognition",
529-532.
Nguyen, Thu / Ingram, John:
"Native and vietnamese production of compound and phrasal stress patterns",
533-536.
Prosody in Spoken Dialogue Systems
Caspers, Johanneke:
"On the function of the late rise and the early fall in dutch dialogue: a perception experiment",
537-540.
Esposito, Anna / Duncan, Susan / Quek, Francis:
"Holds as gestural correlates to empty and filled speech pauses",
541-544.
Itoh, Toshihiko / Kai, Atsuhiko / Konishi, Tatsuhiro / Itoh, Yukihiro:
"Linguistic and acoustic changes of user²s utterances caused by different dialogue situations",
545-548.
Ward, Nigel / Nakagawa, Satoshi:
"Automatic user-adaptive speaking rate selection for information delivery",
549-552.
Skantze, Gabriel:
"Coordination of referring expressions in multimodal human-computer dialogue",
553-556.
Cerrato, Loredana:
"A comparison between feedback strategies in human-to-human and human-machine communication",
557-560.
Darves, Courtney / Oviatt, Sharon:
"Adaptation of users² spoken dialogue patterns in a conversational interface",
561-564.
Speaker Segmentation and Adaptation
Rosenberg, Aaron E. / Gorin, Allen / Liu, Zhu / Parthasarathy, S.:
"Unsupervised speaker segmentation of telephone conversations",
565-568.
Sivakumaran, P. / Ariyaeeinia, A.M. / Fortuna, J.:
"An effective unsupervised scheme for multiple-speaker-change detection",
569-572.
Ajmera, J. / Bourlard, Hervé / Lapidot, I. / McCowan, Iain A.:
"Unknown-multiple speaker clustering using HMM",
573-576.
Meignier, Sylvain / Bonastre, Jean-François / Magrin-Chagnolleau, Ivan:
"Speaker utterances tying among speaker segmented audio documents using hierarchical classification: towards speaker indexing of audio databases",
577-580.
Mariéthoz, Johnny / Bengio, Samy:
"A comparative study of adaptation methods for speaker verification",
581-584.
Farrell, Kevin R.:
"Speaker verification with data fusion and model adaptation",
585-588.
Mirghafori, Nikki / Heck, Larry P.:
"An adaptive speaker verification system with speaker dependent a priori decision thresholds",
589-592.
Spoken Language Understanding
Roy, Deb / Gorniak, Peter / Mukherjee, Niloy / Juster, Josh:
"A trainable spoken language understanding system for visual object selection",
593-596.
Béchet, F. / Gorin, Allen / Wright, Jerry / Tur, D. Hakkani:
"Named entity extraction from spontaneous speech in how may i help you?",
597-600.
Bousquet-Vernhettes, Caroline / Vigouroux, Nadine:
"Recognition error processing for speech understanding",
601-604.
Pargellis, Andrew / Fosler-Lussier, Eric / Tsai, Augustine:
"Using part-of-speech tags, context thresholding, and trigram contexts to improve the auto-induction of semantic classes",
605-608.
Wang1, Ye-Yi / Acero, Alex / Chelba, Ciprian / Frey, Brendan / Wong, Leon:
"Combination of statistical and rule-based approaches for spoken language understanding",
609-612.
Xie, Guodong / Zong, Chengqing / Xu, Bo:
"Chinese spoken language analyzing based on combination of statistical and rule methods",
613-616.
Pfannerer, Norbert:
"A maximum entropy semantic parser using word classes",
617-620.
INTERSPEECH
Gurijala, A. / R. Deller Jr., J. / Seadle, M. S. / Hansen, John H. L.:
"Speech watermarking through parametric modeling",
621-624.
Hong, Kai Sze / Salleh, Sh-Hussain:
"An education software in teaching automatic speech recognition (ASR)",
625-628.
Xiao, Benfang / Girand, Cynthia / Oviatt, Sharon:
"Multimodal integration patterns in children",
629-632.
Scharenborg, Odette / Boves, Lou / Veth, Johan de:
"ASR in a human word recognition model: generating phonemic input for shortlist",
633-636.
Wu, Chung-Hsien / Chiu, Yu-Hsien / Cheng, Kung-Wei:
"Sign language translation using an error tolerant retrieval algorithm",
637-640.
Turk, Oytun / Sayli, Omer / Dutagaci, Helin / Arslan, Levent M.:
"A sound source classification system based on subband processing",
641-644.
Zhang, Ying / Zhao, Bing / Yang, Jie / Waibel, Alex:
"Automatic sign translation",
645-648.
Wenndt, Stanley J. / Cupples, Edward J. / Floyd, Richard M.:
"A study on the classification of whispered and normally phonated speech",
649-652.
Tatara, Kiyoshi / Ito, Taisuke / Zolfaghari, Parham / Takeda, Kazuya / Itakura, Fumitada:
"Experiments on recognition of lavalier microphone speech and whispered speech in real world environments",
653-656.
Iwaki, Mamoru / Seki, Hiromi:
"An effect of amplitude modulation on perceptual segregation of tone sequences",
657-660.
Sanders, Eric / Ruiter, Marina / Beijer, Lilian / Strik, Helmer:
"Automatic recognition of dutch dysarthric speech: a pilot study",
661-664.
Engwall, Olov:
"Evaluation of a system for concatenative articulatory visual speech synthesis",
665-668.
Sato, Marc / Schwartz, Jean-Luc / Cathiard, Marie-Agnès / Abry, Christian / Loevenbruck, Hélène:
"Intrasyllabic articulatory control constraints in verbal working memory",
669-672.
Campbell, Nick:
"Towards a grammar of spoken language: incorporating paralinguistic information",
673-676.
Li, Qun / Russell, Martin J.:
"An analysis of the causes of increased error rates in children²s speech recognition",
2337-2340.
Öster, Anne-Marie:
"A new computer-based analytical speech perception test for prelingually deaf children and children with speech disorders",
2341-2344.
Fell, Harriet J. / MacAuslan, Joel / Ferrier, Linda J. / Worst, Susan G. / Chenausky, Karen:
"Vocalization age as a clinical tool",
2345-2348.
Cosi, Piero / Cohen, Michael M. / Massaro, Dominic W.:
"Baldini: baldi speaks italian!",
2349-2352.
Cavé, Christian / Guaïtella, Isabelle / Santi;, Serge:
"Eyebrow movements and voice variations in dialogue situations: an experimental investigation",
2353-2356.
Large Vocabulary Speech Recognition
Córdoba, R. / Macías-Guarasa, J. / Ferreiros, J. / Montero, J. M. / Pardo, José M.:
"State clustering improvements for continuous HMMs in a Spanish large vocabulary recognition system",
677-680.
Rotovnik, Tomaz / Maucec, Mirjam Sepesy / Horvat, Bogomir / Kacic, Zdravko:
"A comparison of HTK, ISIP and julius in slovenian large vocabulary continuous speech recognition",
681-684.
Jia, Lei / Xu, Bo:
"Parametric trajectory segment model for LVCSR",
685-688.
Diéguez-Tirado, F. Javier / Cardenal-López, Antonio:
"Efficient precalculation of LM contexts for large vocabulary continuous speech recognition",
689-692.
Chengalvarayan, Rathi:
"Integrating multiple pronunciations during MCE-based acoustic model training for large vocabulary speech recognition",
693-696.
Laureys, Tom / Vandeghinste, Vincent / Duchateau, Jacques:
"A hybrid approach to compounds in LVCSR",
697-700.
Utsuro, Takehito / Harada, Tetsuji / Nishizaki, Hiromitsu / Nakagawa, Seiichi:
"A confidence measure based on agreement among multiple LVCSR models - correlation between pair of acoustic models and confidence",
701-704.
Nouza, Jan / Drabkova, Jindra:
"Combining lexical and morphological knowledge in language model for inflectional (czech) language",
705-708.
Nguyen, Long / Guo, Xuefeng / Makhoul, John:
"Modeling frequent allophones in Japanese speech recognition",
709-712.
Chen, Feili / Zhu, Jie / Song, Wentao:
"The structure and its implementation of hidden dynamic HMM for Mandarin speech recognition",
713-716.
Shinozaki, Takahiro / Furui, Sadaoki:
"A new lexicon optimization method for LVCSR based on linguistic and acoustic characteristics of words",
717-720.
Langlois, David / Smaïli, Kamel / Haton, Jean-Paul:
"Retrieving phrases by selecting the history: application to automatic speech recognition",
721-724.
Ahn, Dong-Hoon / Chung, Minhwa:
"Compact subnetwork-based large vocabulary continuous speech recognition",
725-728.
Dutagaci, Helin / Arslan, Levent M.:
"A comparison of four language models for large vocabulary turkish speech recognition",
729-732.
Integration of Speech Technology in Language Learning
Hincks, Rebecca:
"Speech recognition for language teaching and evaluating: a study of existing commercial products",
733-736.
Raux, Antoine / Kawahara, Tatsuya:
"Automatic intelligibility assessment and diagnosis of critical pronunciation errors for computer-assisted pronunciation learning",
737-740.
Hirata, Yukari:
"Effects of production training with visual feedback on the acquisition of Japanese pitch and durational contrasts",
741-744.
Minematsu, Nobuaki / Kobashikawa, Satoshi / Hirose, Keikichi / Erickson, Donna:
"Acoustic modeling of sentence stress using differential features between syllables for English rhythm learning system development",
745-748.
Imoto, Kazunori / Tsubota, Yasushi / Raux, Antoine / Kawahara, Tatsuya / Dantsuji, Masatake:
"Modeling and automatic detection of English sentence stress for computer-assisted English prosody learning system",
749-752.
Tsubota, Yasushi / Kawahara, Tatsuya / Dantsuji, Masatake:
"Recognition and verification of English by Japanese students for computer-assisted language learning system",
1205-1208.
Neri, Ambra / Cucchiarini, Catia / Strik, Helmer:
"Feedback in computer assisted pronunciation training: technology push or demand pull?",
1209-1212.
Minematsu, Nobuaki / Kurata, Gakuto / Hirose, Keikichi:
"Corpus-based analysis of English spoken by Japanese students in view of the entire phonemic system of English",
1213-1216.
Hardison, Debra M.:
"Computer-assisted second-language speech learning: generalization of prosody-focused training",
1217-1220.
Mostow, Jack / Beck, Joseph / Winter, S. Vanessa / Wang, Shaojun / Tobin, Brian:
"Predicting oral reading miscues",
1221-1224.
Kim, Chanwoo / Sung, Wonyong:
"Implementation of an intonational quality assessment system",
1225-1228.
Ariki, Yasuo / Ogata, Jun:
"English call system with functions of speech segmentation and pronunciation evaluation using speech recognition technology",
1229-1232.
Perception of Prosody
Mixdorff, Hansjörg / Luksaneeyanawin, Sudaporn / Fujisaki, Hiroya / Charnvivit, Patavee:
"Perception of tone and vowel quantity in Thai",
753-756.
Kinoshita, Keisuke / Behne, Dawn M. / Arai, Takayuki:
"Duration and F0 as perceptual cues to Japanese vowel quantity",
757-760.
Muto, Makiko / Kato, Hiroaki / Tsuzaki, Minoru / Sagisaka, Yoshinori:
"Effects of intra-phrase position on acceptability of changes in segmental duration in sentence speech",
761-764.
Barac-Cikoja, Dragana / Revoile, Sally:
"Perception of prosodic phrasing by hearing-impaired listeners",
765-768.
Aasland, Wendi A. / Baum, Shari R.:
"Processing of temporal cues marking phrasal boundaries in individuals with brain damage",
769-772.
Speech Enhancement I
Herbordt, W. / Ying, J. / Buchner, H. / Kellermann, W.:
"A real-time acoustic human-machine front-end for multimedia applications integrating robust adaptive beamforming and stereophonic acoustic echo cancellation",
773-776.
Lu, Ching-Ta / Wang, Hsiao-Chuan:
"Enhancement of single channel speech using perception-based wavelet transform",
777-780.
Lin, L. / Holmes, W. H. / Ambikairajah, E.:
"Speech enhancement based on a perceptual modification of wiener filtering",
781-784.
Attias, Hagai / Deng, Li:
"A new approach to speech enhancement by a microphone array using EM and mixture models",
785-788.
Kim, Sang G. / Yoo, Chang D.:
"Acoustic echo cancellation based on m-channel IIR cosine-modulated filter bank",
789-792.
Saruwatari, Hiroshi / Sawai, Katsuyuki / Lee, Akinobu / Shikano, Kiyohiro / Kaminuma, Atsunobu / Sakata, Masao:
"Speech enhancement in car environment using blind source separation",
1781-1784.
Potamitis, I. / Fakotakis, Nikos / Kokkinakis, George:
"Speech enhancement based on combining perceptual enhancement and short-time spectral attenuation",
1785-1788.
Nishiura, Takanobu / Nakamura, Satoshi / Okada, Yuka / Yamada, Takeshi / Shikano, Kiyohiro:
"Suitable design of adaptive beamformer based on average speech spectrum for noisy speech recognition",
1789-1792.
Tam, King / Sheikhzadeh, Hamid / Schneider, Todd:
"Highly oversampled subband adaptive filters for noise cancellation on a low-resource DSP system",
1793-1796.
Hu, Yi / Loizou, Philipos C.:
"A perceptually motivated subspace approach for speech enhancement",
1797-1800.
Ju, Gwo-hwa / Lee, Lin-shan:
"Speech enhancement based on generalized singular value decomposition approach",
1801-1804.
Kim, Jong Uk / Yoo, Chang D.:
"Subspace speech enhancement using subband whitening filter",
1805-1808.
Chang, Sungwook / Jung, Sungil / Kwon, Y. / Yang, Sung-il:
"Speech enhancement using wavelet packet transform",
1809-1812.
Deng, Li / Droppo, Jasha / Acero, Alex:
"Sequential MAP noise estimation and a phase-sensitive model of the acoustic environment",
1813-1816.
Nakadai, Kazuhiro / Okuno, Hiroshi G. / Kitano, Hiroaki:
"Auditory fovea based speech enhancement and its application to human-robot dialog system",
1817-1820.
Visser, Erik / Otsuka, Manabu / Lee, Te-Won:
"A spatio-temporal speech enhancement scheme for robust speech recognition",
1821-1824.
Berthommier, Frédéric / Choi, Seungjin:
"Comparative evaluation of CASA and BSS models for subband cocktail-party speech separation",
1825-1828.
Kim, Hyoung-Gook / Ruwisch, Dietmar:
"Speech enhancement in non-stationary noise environments",
1829-1832.
Mizumachi, Mitsunori / Nakamura, Satoshi:
"The 2ch hybrid subtractive beamformer applied to line sound sources",
1833-1836.
Speech Recognition: In-Vehicle
Yapanel, Umit / Zhang, Xianxian / Hansen, John H. L.:
"High performance digit recognition in real car environments",
793-796.
Shinde, Tetsuya / Takeda, Kazuya / Itakura, Fumitada:
"Multiple regression of log-spectra for in-car speech recognition",
797-800.
Gong, Yifan / Netsch, Lorin:
"Experiments on speaker-independent voice command recognition using in-vehicle hands free speech",
801-804.
Kadambe, Shubha:
"Application of over-complete blind source separation for robust automatic speech recognition",
805-808.
Beaufays, Françoise / Boies, Daniel / Weintraub, Mitch:
"Porting channel robustness across languages",
809-812.
Mechanisms for Dialogue Processing
Takahashi, Yasuhiro / Dohsaka, Kohji / Aikawa, Kiyoaki:
"An efficient dialogue control method using decision tree-based estimation of out-of-vocabulary word attributes",
813-816.
Bellegarda, Jerome R.:
"Semantic inference: a data-driven solution for NL interaction",
817-820.
Wright, Jerry / Abella, Alicia / Gorin, Allen:
"Unified task knowledge for spoken language understanding and dialog management",
821-824.
Lee, Yun-Tien / Wu, Cheng-Huang / Lee, Yumin / Lee, Lin-shan:
"Distributed Chinese keyword spotting and verification for spoken dialogues under wireless environment",
825-828.
Higashinaka, Ryuichiro / Miyazaki, Noboru / Nakano, Mikio / Aikawa, Kiyoaki:
"A method for evaluating incremental utterance understanding in spoken dialogue systems",
829-832.
Kakutani, Naoko / Kitaoka, Norihide / Nakagawa, Seiichi:
"Detection and recognition of repaired speech on misrecognized utterances for speech input of car navigation system",
833-836.
Eklund, Robert:
"Ingressive speech as an indication that humans are talking to humans (and not to machines)",
837-840.
Soltau, Hagen / Metze, Florian / Waibel, Alex:
"Compensating for hyperarticulation by modeling articulatory properties",
841-844.
Goubanova, Olga V.:
"Forms of introduction in map task dialogues: case of L2 Russian speakers",
845-848.
Veilleux, Nanette M.:
"Bridges: regions between discourse segments",
849-852.
Guillevic, Didier / Gandrabur, Simona / Normandin, Yves:
"Robust semantic confidence scoring",
853-856.
Müller, Ludek / Bartos, Tomás:
"Statistically based approach to rejection of incorrectly recognized words",
857-860.
Sato, Ryo / Higashinaka, Ryuichiro / Tamoto, Masafumi / Nakano, Mikio / Aikawa, Kiyoaki:
"Learning decision trees to determine turn-taking by spoken dialogue systems",
861-864.
Hamimed, H. / Damnati, G.:
"Integration of phonetic length properties in the acoustic models of false starts and out-of-vocabulary words",
865-868.
Zhao, Yibao / Zhou, Guojun:
"N-word-sequence frequency noise mitigation for SLM based on binomial distribution",
869-872.
Lee, Chul Min / Narayanan, Shrikanth S. / Pieraccini, Roberto:
"Combining acoustic and language information for emotion recognition",
873-876.
Hacioglu, Kadri / Ward, Wayne:
"A figure of merit for the analysis of spoken dialog systems",
877-880.
Language Modeling
Akiba, Tomoyosi / Itou, Katunobu / Fujii, Atsushi / Ishikawa, Tetsuya:
"Selective back-off smoothing for incorporating grammatical constraints into the n-gram language model",
881-884.
Zitouni, Imed / Siohan, Olivier / Kuo, Hong-Kwang Jeff / Lee, Chin-Hui:
"Backoff hierarchical class n-gram language modelling for automatic speech recognition systems",
885-888.
Picard, Francis / Boucher, Dominique / Lapalme, Guy:
"Constructing small language models from grammars",
889-892.
Zhang, Rong / Rudnicky, Alexander I.:
"Improve latent semantic analysis based language model by integrating multiple level knowledge",
893-896.
Sicilia-Garcia, Elvira I. / Ming, Ji / Smith, F. Jack:
"Individual word language models and the frequency approach",
897-900.
Stolcke, Andreas:
"SRILM - an extensible language modeling toolkit",
901-904.
Whittaker, E. W. D. / Klakow, D.:
"Efficient construction of long-range language models using log-linear interpolation",
905-908.
Corazza, Anna:
"Integration of two stochastic context-free grammars",
909-912.
Rayner, Manny / Hockey, Beth Ann / Dowding, John:
"Grammar specialisation meets language modelling",
913-916.
Huang, Jing / Zweig, Geoffrey:
"Maximum entropy model for punctuation annotation from speech",
917-920.
Mori, Shinsuke:
"An automatic sentence boundary detector based on a structured language model",
921-924.
Wu1, Genqing / Zheng, Fang / Wu1, Wenhu / Xu, Mingxing / Jin, Ling:
"Improved katz smoothing for language modeling in speech recogniton",
925-928.
Mori, Renato De / Estève, Yannick / Raymond, Christian:
"On the use of structures in language models for dialogue",
929-932.
Erdogan, Hakan / Sarikaya, Ruhi / Gao, Yuqing / Picheny, Michael:
"Semantic structured language models",
933-936.
Prosody and Speech Recognition - I
Hirose, Keikichi / Minematsu, Nobuaki / Terao, Makoto:
"Statistical language modeling with prosodic boundaries and its use for continuous speech recognition",
937-940.
Iwano, Koji / Seki, Takahiro / Furui, Sadaoki:
"Noise robust speech recognition using F0 contour extracted by hough transform",
941-944.
Almasganj, Farshad / Dehnavi, Farhad D. / Bijankhan, Mahmood:
"Sharing relative stress of cross-word syllables and lexical stress to spontaneous speech recognition",
945-948.
Baron, Don / Shriberg, Elizabeth / Stolcke, Andreas:
"Automatic punctuation and disfluency detection in multi-party meetings using prosodic and lexical cues",
949-952.
Sun, Xuejing:
"Pitch accent prediction using ensemble machine learning",
953-956.
Escudero-Mancebo, D. / González-Ferreras, C. / Cardeñoso-Payo, V.:
"Quantitative evaluation of relevant prosodic factors for text-to-speech synthesis in Spanish",
1165-1168.
Thubthong, Nuttakorn / Kijsirikul, Boonserm / Luksaneeyanawin, Sudaporn:
"Tone recognition in Thai continuous speech based on coarticulaion, intonation and stress effects",
1169-1172.
Takagi, Kazuyuki / Kubota, Hajime / Ozeki, Kazuhiko:
"Combination of pause and F0 information in dependency analysis of Japanese sentences",
1173-1176.
Horiuchi, Yasuo / Ohsuga, Tomoko / Ichikawa, Akira:
"Estimating syntactic structure from F0 contour and pause duration in Japanese speech",
1177-1180.
Yamashita, Yoichi / Inoue, Akira:
"Extraction of important sentences using F0 information for speech summarization",
1181-1184.
Kitamura, Tatsuya / Itoh, Kayo / Itoh, Toshihiko / Kitazawa, Shigeyoshi:
"Influence of prosody, context, and word order in the identification of focus in Japanese dialogue",
1185-1188.
Kai, Atsuhiko / Nonomura, Yukari / Itoh, Toshihiko / Konishi, Tatsuhiro / Itoh, Yukihiro:
"Influence of different dialogue situations on user²s behavior in spoken corrections",
1189-1192.
Yang, Li-chiung:
"Interpreting meaning from context: modeling the prosody of discourse markers in speech",
1193-1196.
Bartkova, Katarina / Gac, David Le / Charlet, Delphine / Jouvet, Denis:
"Prosodic parameter for speaker identification",
1197-1200.
Shigeyoshi, Kitazawa / Toshihiko, Itoh / Tatsuya, Kitamura:
"Juncture segmentation of Japanese prosodic unit based on the spectrographic features",
1201-1204.
Pathology of Voice and Speech Production
Svec, Jan G. / Sram, Frantisek:
"Kymographic imaging of the vocal fold oscillations",
957-960.
Mády, K. / Sader, R. / Zimmermann, A. / Hoole, P. / Beer, A. / Zeilhofer, H.-F. / Hannig, Ch.:
"Assessment of consonant articulation in glossectomee speech by dynamic MRI",
961-964.
Wrench, Alan / Gibbon, Fiona / McNeill, Alison M. / Wood, Sara:
"An EPG therapy protocol for remediation and assessment of articulation disorders",
965-968.
Patel, Rupal:
"How speakers with and without speech impairment mark the question statement contrast",
969-972.
Zahorian, Stephen A. / Zimmer, A. Matthew / Meng, Fansheng:
"Vowel classification for computer-based visual feedback for speech training for the hearing impaired",
973-976.
Model Based Speech Processing I
Alku, Paavo / Bäckström, Tom:
"All-pole modeling of wide-band speech using weighted sum of the LSP polynomials",
977-980.
Schoentgen, Jean:
"Analysis and synthesis of the phonatory excitation signal by means of a pair of polynomial shaping functions",
981-984.
Vintsiuk, Taras K.:
"Optimal speech signal partition into one-quasiperiodical segments",
985-988.
Rufiner, Hugo L. / Rocha, Luis F. / Close, John Goddard:
"Sparse and independent representations of speech signals based on parametric models",
989-992.
Funaki, Keiichi:
"Improvement of the ELS-based time-varying complex speech analysis",
993-996.
Acoustic Modeling
Chin, K. K. / Woodland, P. C.:
"Maximum mutual information training of hidden Markov models with vector linear predictors",
997-1000.
Hamaker, J. E. / Picone, J. / Ganapathiraju, A.:
"A sparse modeling approach to speech recognition based on relevance vector machines",
1001-1004.
Chelba, Ciprian / Morton, Rachel:
"Mutual information phone clustering for decision tree induction",
1005-1008.
Horn, Kevin S. Van:
"Rethinking derived acoustic features in speech recognition",
1009-1012.
Markov, Konstantin / Nakamura, Satoshi:
"Modeling HMM state distributions with Bayesian networks",
1013-1016.
Tsakalidis, Stavros / Doumpiotis, Vlasios / Byrne, William:
"Discriminative linear transforms for feature normalization and speaker adaptation in HMM estimation",
2585-2588.
Okuda, Kozo / Kawahara, Tatsuya / Nakamura, Satoshi:
"Speaking rate compensation based on likelihood criterion in acoustic model training and decoding",
2589-2592.
Bacchiani, Michiel:
"Combining maximum likelihood and maximum a posteriori estimation for detailed acoustic modeling of context dependency",
2593-2596.
Huang, Jing / Goel, Vaibhava / Gopinath, Ramesh / Kingsbury, Brian / Olsen, Peder / Visweswariah, Karthik:
"Large vocabulary conversational speech recognition with the extended maximum likelihood linear transformation (EMLLT) model",
2597-2600.
Zhang, Jin-Song / Nakamura, Satoshi:
"Modeling varying pauses to develop robust acoustic models for recognizing noisy conversational speech",
2601-2604.
Song, Hwa Jeon / Kim, Hyung Soon:
"Improving phone-level discrimination in LDA with subphone-level classes",
2625-2628.
Ou, Zhijian / Wang, Zuoying:
"A combined model of statics-dynamics of speech optimized using maximum mutual information",
2629-2632.
Takahashi, Nobutoshi / Nakagawa, Seiichi:
"Syllable recognition using syllable-segment statistics and syllable-based HMM",
2633-2636.
Thirion, J. W. F. / Botha, Elizabeth C.:
"Recurrent neural network-enhanced HMM speech recognition systems",
2637-2640.
Yun, Young-Sun:
"Sharing trend information of trajectory in segmental-feature HMM",
2641-2644.
Salomon, Jesper / King, Simon / Salomon, Jesper:
"Framewise phone classification using support vector machines",
2645-2648.
Stewart, Darryl / Ji, Ming / Hanna, Philip / Smith, F. Jack:
"A state-tying approach to building syllable HMMs",
2649-2652.
Lee, Weifeng / Sekhar, C. Chandra / Takeda, Kazuya / Itakura, Fumitada:
"Recognition of continuous speech segments of monophone units using support vector machines",
2653-2656.
Park, Junho / Ko, Hanseok:
"Construction of decision tree from data driven clustering",
2657-2660.
Lee, Akinobu / Mera, Yuuichiro / Saruwatari, Hiroshi / Shikano, Kiyohiro:
"Selective multi-path acoustic model based on database likelihoods",
2661-2664.
Stephenson, Todd A. / Magimai-Doss, Mathew / Bourlard, Hervé:
"Auxiliary variables in conditional Gaussian mixtures for automatic speech recognition",
2665-2668.
Watanabe, Shinji / Minami, Yasuhiro / Nakamura, Atsushi / Ueda, Naonori:
"Constructing shared-state hidden Markov models based on a Bayesian approach",
2669-2672.
Ogawa, Tetsuji / Kobayashi, Tetsunori:
"Generalization of state-observation-dependency in partly hidden Markov models",
2673-2676.
Phonetics
Esling, John H.:
"Laryngoscopic analysis of tibetan chanting modes and their relationship to register in sino-tibetan",
1081-1084.
Murray, Kathleen / Simonsen, Betina:
"A corpus-based study of danish laryngealization",
1085-1088.
Warner, Natasha / Jongman, Allard / Mücke, Doris:
"Variability in direction of dorsal movement during production of /l/",
1089-1092.
Xu, Yi / Liu, Fang:
"Segmentation of glides with tonal alignment as reference",
1093-1096.
Maddieson, Ian / Larson, Julie:
"Variability in the production of glottalized sonorants: data from yapese",
1097-1100.
Tuan, Vu Ngoc / d’Alessandro, Christophe / Rosset, Sophie:
"A phonetic study of vietnamese tones: acoustic and electroglottographic measurements",
1101-1104.
Chung, Hyunsong:
"Segment duration in spoken korean",
1105-1108.
Zvonik, Elena / Cummins, Fred:
"Pause duration and variability in read texts",
1109-1112.
Pfitzinger, Hartmut R.:
"Intrinsic phone durations are speaker-specific",
1113-1116.
Tronnier, Mechtild:
"Preaspirated stops in southern Swedish",
1117-1120.
Warner, Natasha / Weber, Andrea:
"Stop epenthesis at syllable boundaries",
1121-1124.
Raymond, William D. / Pitt, Mark / Johnson, Keith / Hume, Elizabeth / Makashay, Matthew / Dautricourt, Robin / Hilts, Craig:
"An analysis of transcription consistency in spontaneous speech from the buckeye corpus",
1125-1128.
Aoyagi, Makiko:
"Contextual effects on voicing judgment of stop consonants in Japanese",
1129-1132.
Joto, Akiyo / Imaishi, Motohisa / Nagase, Yoshiki / Funatsu, Seiya:
"Discrimination of English vowels in consonantal contexts by native speakers of Japanese and its relations to dynamic information of formants",
1133-1136.
Call Classification and Routing
Tur, Gokhan / Wright, Jerry / Gorin, Allen / Riccardi, Giuseppe / Hakkani-Tür, Dilek:
"Improving spoken language understanding using word confusion networks",
1137-1140.
Li, Li / Chou, Wu:
"Improving latent semantic indexing based classifier with information gain",
1141-1144.
Kuo, Hong-Kwang Jeff / Lee, Chin-Hui / Zitouni, Imed / Fosler-Lussier, Eric / Ammicht, Egbert:
"Discriminative training for call classification and routing",
1145-1148.
Cox, Stephen:
"Speech and language processing for a constrained speech translation system",
1149-1152.
Chotimongkol, Ananlada / Rudnicky, Alexander I.:
"Automatic concept identification in goal-oriented conversations",
1153-1156.
Levit, Michael / Nöth, Elmar / Gorin, Allen:
"Using EM-trained string-edit distances for approximate matching of acoustic morphemes",
1157-1160.
Natarajan, Premkumar / Prasad, Rohit / Suhm, Bernhard / McCarthy, Daniel:
"Speech-enabled natural language call routing: BBN call director",
1161-1164.
Acoustic Speech Modeling
Gao, Sheng / Zhang, Jin-Song / Nakamura, Satoshi / Lee, Chin-Hui / Chua, Tat-seng:
"Weighted graph based decision tree optimization for high accuracy acoustic modeling",
1233-1236.
Zhang, Li / Edmondson, William H.:
"Speech recognition using syllable patterns",
1237-1240.
Brink, Janus D. / Botha, Elizabeth C.:
"A comparison of L1 and african-mother-tongue acoustic models for south african English speech recognition",
1241-1244.
Somervuo, Panu:
"Speech modeling using variational Bayesian mixture of Gaussians",
1245-1248.
Chen, Tao / Huang, Chao / Chang, Eric / Wang, Jingchun:
"On the use of Gaussian mixture model for speaker variability analysis",
1249-1252.
Jackson, Philip J.B. / Russell, Martin J.:
"Models of speech dynamics in a segmental-HMM recognizer using intermediate linear representations",
1253-1256.
Zen, Heiga / Tokuda, Keiichi / Kitamura, Tadashi:
"Decision tree distribution tying based on a dimensional split technique",
1257-1260.
Speech Synthesis: Alternative Views
Huckvale, Mark:
"Speech synthesis, speech simulation and speech science",
1261-1264.
Bulut, Murtaza / Narayanan, Shrikanth S. / Syrdal, Ann K.:
"Expressive speech synthesis using a concatenative synthesizer",
1265-1268.
Shichiri, Kengo / Sawabe, Atsushi / Yoshimura, Takayoshi / Tokuda, Keiichi / Masuko, Takashi / Kobayashi, Takao / Kitamura, Tadashi:
"Eigenvoices for HMM-based speech synthesis",
1269-1272.
Marsi, Erwin / Busser, Bertjan / Daelemans, Walter / Hoste, Veronique / Reynaert, Martin / Bosch, Antal van den:
"Combining information sources for memory-based pitch accent placement",
1273-1276.
Swift, Mary D. / Campana, Ellen / Allen, James F. / Tanenhaus, Michael K.:
"Eye-fixation as a measure of real-time processing of synthesized words",
1277-1280.
Stent, Amanda / Walker, Marilyn A. / Whittaker, Steve / Maloor, Preetam:
"User-tailored generation for spoken dialogue: an experiment",
1281-1284.
Roy, Deb:
"A system that learns to describe objects in visual scenes",
1285-1288.
Finite State Transducers Applied to Spoken Language Processing
Mou, Xiaolong / Seneff, Stephanie / Zue, Victor:
"Integration of supra-lexical linguistic models with speech recognition using shallow parsing and finite state transducers",
1289-1292.
Shu, Han / Hetherington, I. Lee:
"EM training of finite-state transducers and its application to pronunciation modeling",
1293-1296.
Szarvas, Máté / Furui, Sadaoki:
"Finite-state transducer based hungarian LVCSR with explicit modeling of phonological changes",
1297-1300.
Caseiro, Diamantino / Trancoso, Isabel:
"Using dynamic WFST composition for recognizing broadcast news",
1301-1304.
Dolfing, Hans J. G. A.:
"Transducer search space modelings for large-vocabulary speech recognition",
1305-1308.
Kanthak, Stephan / Ney, Hermann / Riley, Michael / Mohri, Mehryar:
"A comparison of two LVR search optimization techniques",
1309-1312.
Mohri, Mehryar / Riley, Michael:
"An efficient algorithm for the n-best-strings problem",
1313-1316.
Speaker Modeling and Scoring
Xiang, Bing / Berger, Toby:
"Structural Gaussian mixture models for efficient text-independent speaker verification",
1317-1320.
Petry, A. / Barone, Dante A. C.:
"Text-dependent speaker verification using lyapunov exponents",
1321-1324.
BenZeghiba, Mohamed F. / Bourlard, Hervé:
"User-customized password speaker verification based on HMM/ANN and GMM models",
1325-1328.
Xin, Dong / Wu, Zhaohui / Yang, Yingchun:
"Exploiting support vector machines in hidden Markov models for speaker verification",
1329-1332.
Mami, Yassine / Charlet, Delphine:
"Speaker identification by location in an optimal space of anchor models",
1333-1336.
Park, Alex / Hazen, Timothy J.:
"ASR dependent techniques for speaker identification",
1337-1340.
Ding, Peng / Liu, Yang / Xu, Bo:
"Factor analyzed Gaussian mixture models for speaker identification",
1341-1344.
Jin, Qin / Schultz, Tanja / Waibel, Alex:
"Phonetic speaker identification",
1345-1348.
Navrátil, Jirí / Ramaswamy, Ganesh N.:
"DETAC: a discriminative criterion for speaker verification",
1349-1352.
Liu, Ming / Chang, Eric / Dai, Bei-qian:
"Hierarchical Gaussian mixture model for speaker verification",
1353-1356.
Kochanski, Greg / Lopresti, Daniel / Shih, Chilin:
"A reverse turing test using speech",
1357-1360.
Ahn, Sungjoo / Kang, Sunmee / Ko, Hanseok:
"On effective speaker verification based on subword model",
1361-1364.
Xiang, Bing:
"Speaker verification using Gaussian component strings in dynamic trajectory space",
1365-1368.
Heck, Larry P. / Genoud, Dominique:
"Combining speaker and speech recognition systems",
1369-1372.
Li, Qi / Jiang, Hui / Zhou, Qiru / Zheng, Jinsong:
"Automatic enrollment for speaker authentication",
1373-1376.
Andorno, M. / Laface, P. / Gemello, Roberto:
"Experiments in confidence scoring for word and sentence verification",
1377-1380.
Huggins, Mark C. / Grieco, John J.:
"Confidence metrics for speaker identification",
1381-1384.
Elenius, Daniel / Blomberg, Mats:
"Characteristics of a low reject mode speaker verification system",
1385-1388.
Issues in Audio-Visual Spoken Language Processing
Bernstein, Lynne E. / Burnham, Denis K. / Schwartz, Jean-Luc:
"Special session: issues in audiovisual spoken language processing (when, where, and how?)",
1445-1448.
Deligne, Sabine / Potamianos, Gerasimos / Neti, Chalapathy:
"Audio-visual speech enhancement with AVCDCN (audio-visual codebook dependent cepstral normalization)",
1449-1452.
Bailly, Gérard:
"Audiovisual speech synthesis. from ground truth to models",
1453-1456.
Vatikiotis-Bateson, Eric / Hill, Harold / Kamachi, Miyuki / Lander, Karen / Munhall, Kevin G.:
"The stimulus as basis for audiovisual integration",
1457-1460.
Rosenblum, Lawrence D.:
"The perceptual basis for audiovisual speech integration",
1461-1464.
Hardison, Debra M.:
"Sources of variability in the perceptual training of /r/ and /l/: interaction of adjacent vowel, word position, talkers² visual and acoustic cues",
1465-1468.
Hazan, Valerie / Sennema, Anke / Faulkner, Andrew:
"Audiovisual perception in L2 learners",
1685-1688.
Kirk, Karen Iler / Pisoni, David B. / Lachs, Lorin:
"Audiovisual integration of speech by children and adults with cochlear implants",
1689-1692.
Sekiyama, Kaoru / Sugita, Yoichi:
"Auditory-visual speech perception examined by brain imaging and reaction time",
1693-1696.
Ponton, Curtis W. / Auer, Edward T. / Bernstein, Lynne E.:
"Neurocognitive basis for audiovisual speech perception: evidence from event-related potentials",
1697-1700.
Lewkowicz, David J.:
"Perception and integration of audiovisual speech in human infants",
1701-1704.
Bailly, Gérard / Badin, Pierre:
"Seeing tongue movements from outside",
1913-1916.
Wojdel, Jacek C. / Wiggers, Pascal / Rothkrantz, Leon J.M.:
"An audio-visual corpus for multimodal speech recognition in dutch language",
1917-1920.
Wiggers, Pascal / Wojdel, Jacek C. / Rothkrantz, Leon J.M.:
"Medium vocabulary continuous audio-visual speech recognition",
1921-1924.
Heckmann, Martin / Kroschel, Kristian / Savariaux, Christophe / Berthommier, Frédéric:
"DCT-based video features for audio-visual speech recognition",
1925-1928.
Erdener, V. Dogu / Burnham, Denis K.:
"The effect of auditory-visual information and orthographic background in L2 acquisition",
1929-1932.
Krahmer, Emiel / Ruttkay, Zsófia / Swerts, Marc / Wesselink, Wieger:
"Perceptual evaluation of audiovisual cues for prominence",
1933-1936.
Schwartz, Jean-Luc / Berthommier, Frédéric / Savariaux, Christophe:
"Audio-visual scene analysis: evidence for a "very-early" integration process in audio-visual speech perception",
1937-1940.
Zelezný, Milos / Císar, Petr / Krnoul, Zdenek / Novák, Jan:
"Design of an audio-visual speech corpus for the czech audio-visual speech synthesis",
1941-1944.
Attina, Virginie / Beautemps, Denis / Cathiard, Marie-Agnès:
"Coordination of hand and orofacial movements for CV sequences in French cued speech",
1945-1948.
Attina, Virginie / Cathiard, Marie-Agnès / Beautemps, Denis:
"Controling anticipatory behavior for rounding in French cued speech",
1949-1952.
Sodoyer, David / Girin, Laurent / Jutten, Christian / Schwartz, Jean-Luc:
"Audio-visual speech sources separation: a new approach exploiting the audio-visual coherence of speech stimuli",
1953-1956.
House, David:
"Intonational and visual cues in the perception of interrogative mode in Swedish",
1957-1960.
Lucey, Simon / Sridharan, Sridha / Chandran, Vinod:
"A link between cepstral shrinking and the weighted product rule in audio-visual speech recognition",
1961-1964.
Speech Technology Applications
Endo, Taku / Ward, Nigel / Terada, Minoru:
"Can confidence scores help users post-editing speech recognizer output?",
1469-1472.
Watanabe, Masatoshi / Sugiyama, Masahide:
"Information retrieval based on speech recognition results",
1473-1476.
Lemmelä, Saija-Maaria / Boda, Péter Pál:
"Efficient combination of type-in and wizard-of-oz tests in speech interface development process",
1477-1480.
Macherey, Wolfgang / Viechtbauer, Jörg / Ney, Hermann:
"Probabilistic retrieval based on document representations",
1481-1484.
Nishimoto, Takuya / Araki, Masahiro / Niimi, Yasuhisa:
"Radiodoc: a voice-accessible document system",
1485-1488.
Goto, Masataka / Itou, Katunobu / Hayamizu, Satoru:
"Speech completion: on-demand completion assistance using filled pauses for speech input interfaces",
1489-1492.
Wilkie, Jenny / Jack, Mervyn A. / Littlewood, Peter:
"Design of system-initiated digressive proposals for automated banking dialogues",
1493-1496.
Toth, Arthur R. / Harris, Thomas K. / Sanders, James / Shriver, Stefanie / Rosenfeld, Roni:
"Towards every-citizen²s speech interface: an application generator for speech interfaces to databases",
1497-1500.
Iyer, Rukmini / Ma, Jeffrey / Gish, Herbert / Kimball, Owen:
"Training topic classifiers for conversational speech with limited data",
1501-1504.
Nishizaki, Hiromitsu / Nakagawa, Seiichi:
"Comparing isolately spoken keywords with spontaneously spoken queries for Japanese spoken document retrieval",
1505-1508.
Lai, Jennifer C. / Lee, Kwan Min:
"Choosing speech or touchtone modality for navigation within a telephony natural language system",
1509-1512.
Lo, Wai-Kit / Meng, Helen M. / Ching, P. C.:
"Multi-scale and multi-model integration for improved performance in Chinese spoken document retrieval",
1513-1516.
Ogata, Kohichi / Sonoda, Yorinobu:
"Development of a GUI-based articulatory speech synthesis system",
1517-1520.
Speech Production: Models and Physiology
Dang, Jianwu / Honda, Masaaki / Honda, Kiyoshi:
"Investigation of coarticulation based on electromagnetic articulographic data",
1521-1524.
Niikawa, Takuya / Ando, Takanori / Matsumura, Masafumi:
"Frequency dependence of vocal-tract length",
1525-1528.
Maeda, Shinji / Toda, Martine / Carlen, Andreas J. / Meftahi, Lyes:
"Functional modeling of face movements during speech",
1529-1532.
Mochida, Takemi / Honda, Masaaki / Hayashi, Kouki / Kuwae, Toshiharu / Tanahashi, Kunihiro / Nishikawa, Kazufumi / Takanishi, Atsuo:
"Control system for talking robot to replicate articulatory movement of natural speech",
1533-1536.
Finan, Donald S. / Smith, Anne / Ho, Michael:
"Feed the tiger: a method for evoking reliable jaw stretch reflexes in children",
1537-1540.
Kaburagi, Tokihiko / Wakamiya, Kohei / Honda, Masaaki:
"Three-dimensional electromagnetic articulograph based on a nonparametric representation of the magnetic field",
2297-2300.
Laprie, Yves / Ouni, Slim:
"Introduction of constraints in an acoustic-to-articulatory inversion method based on a hypercubic articulatory table",
2301-2304.
Hiroya, Sadao / Honda, Masaaki:
"Acoustic-to-articulatory inverse mapping using an HMM-based speech production model",
2305-2308.
Hashimoto, Kiyoshi:
"Modeling articulatory dynamics in autoregressive linear system",
2309-2312.
Sciamarella, Denisse / d’Alessandro, Christophe:
"A study of the two-mass model in terms of acoustic parameters",
2313-2316.
Tools for Spoken Language Resources
Kolossa, Dorothea / Huo, Qiang:
"Using time-stretched pulses for accurate splitting of speech utterances played back in noisy reverberant environments",
1541-1544.
Maekawa, Kikuo / Kikuchi, Hideaki / Igarashi, Yosuke / Venditti, Jennifer:
"X-JToBI: an extended j-toBI for spontaneous speech",
1545-1548.
Strik, Helmer / Daelemans, Walter / Binnenpoorte, Diana / Sturm, Janienke / Vriend, F. De / Cucchiarini, Catia:
"Dutch HLT resources: from BLARK to priority lists",
1549-1552.
Yang, Fan / Strayer, Susan E. / Heeman, Peter A.:
"ACT: a graphical dialogue annotation comparison tool",
1553-1556.
Yu, Ha-Jin / Kim, Jin Suk:
"A training prompts generation algorithm for connected spoken word recognition",
1557-1560.
Speech Recognition - Practical Issues
Cornu, Etienne / Sheikhzadeh, Hamid / Brennan, Robert:
"A low-resource, miniature implementation of the ETSI distributed speech recognition front-end",
1581-1584.
Astrov, Sergey:
"Memory space reduction for hidden Markov models in low-resource speech recognition systems",
1585-1588.
Wang, Xia / Iso-Sipilä, Juha:
"Low complexity Mandarin speaker-independent isolated word recognition",
1589-1592.
Kiss, Imre / Vasilache, Marcel:
"Low complexity techniques for embedded ASR systems",
1593-1596.
Reinhard, Klaus / Junkawitsch, Jochen / Kießling, Andreas / Dobler, Stefan:
"Optimization of hidden Markov models for embedded systems",
1597-1600.
Filali, Karim / Li, Xiao / Bilmes, Jeff A.:
"Data-driven vector clustering for low-memory footprint ASR",
1601-1604.
Jiang, Hui / Lee, Chin-Hui:
"Utterance verification based on neighborhood information and Bayes factors",
1605-1608.
Lahti, Tommi / Suontausta, Janne:
"Vocabulary independent OOV detection using support vector machines",
1609-1612.
Bazzi, Issam / Glass, James:
"A multi-class approach for modelling out-of-vocabulary words",
1613-1616.
Duchateau, Jacques / Wambacq, Patrick:
"Unconstrained versus constrained acoustic normalisation in confidence scoring",
1617-1620.
Falavigna, Daniele / Gretter, Roberto / Riccardi, Giuseppe:
"Acoustic and word lattice based algorithms for confidence scores",
1621-1624.
Wang, Huei-Ming / Lin, Yi-Chung:
"Error-tolerant spoken language understanding with confidence measuring",
1625-1628.
Perception
Weil, Shawn A.:
"Comparing intelligibility of several non-native accent classes in noise",
1629-1632.
Ishizuka, Kentaro / Aikawa, Kiyoaki:
"Effect of F0 fluctuation and amplitude modulation of natural vowels on vowel identification in noisy environments",
1633-1636.
Yoneyama, Kiyoko:
"Similarities of words in noise in Japanese",
1637-1640.
Brungart, Douglas S. / Kordik, Alexander J. / Das, Koel / Shaw, Arnab K.:
"The effects of F0 manipulation on the perceived distance of speech",
1641-1644.
Janse, Esther:
"Time-compressing natural and synthetic speech",
1645-1648.
Xue, Jianxia / Takayanagi, Sumiko / Bernstein, Lynne E.:
"Accounting for perceptual identification of consonants and vowels through acoustic dissimilarity",
1649-1652.
Wade, Travis / Eakin, Deborah K. / Webb, Russell / Agah, Arvin / Brown, Frank / Jongman, Allard / Gauch, John / Schreiber, Thomas A. / Sereno, Joan:
"Modeling recognition of speech sounds with minerva2",
1653-1656.
Kearns, Ruth / Norris, Dennis / Cutler, Anne:
"Syllable processing in English",
1657-1660.
Kuijpers, Cecile / Donselaar, Wilma van / Cutler, Anne:
"Perceptual effects of assimilation-induced violation of final devoicing in dutch",
1661-1664.
Yip, Michael C.W.:
"Access to homophonic meanings during spoken language comprehension: effects of context and neighborhood density",
1665-1668.
Magrin-Chagnolleau, Ivan / Barkat, Melissa / Meunier, Fanny:
"Intelligibility of reverse speech in French: a perceptual study",
1669-1672.
Serniclaes, Willy / Carré, René:
"Contextual effects in the perception of fricative place of articulation: a rotational hypothesis",
1673-1676.
Sock, Rudolph / Vaxelaire, Béatrice / Hecker, Véronique / Hirsch, Fabrice:
"What relationship between protrusion anticipation and auditory perception?",
1677-1680.
Carré, René / Liénard, Jean Sylvain / Marsico, Egidio / Serniclaes, Willy:
"On the role of the "schwa" in the perception of plosive consonants",
1681-1684.
Nguyen, Noël / Jankowski, Ludovic / Habib, Michel:
"The perception of stop consonant sequences in dyslexic and normal children",
2565-2568.
Otake, Takashi / Iijima, Akemi:
"Submoraic awareness by Japanese school children: evidence from a novel game",
2569-2572.
Markham, D. / Hazan, Valerie:
"Speaker intelligibility of adults and children",
2573-2576.
Yamashita, Yasuki / Matsumoto, Hiroshi:
"Acoustical correlates to SD ratings of speaker characteristics in two speaking styles",
2577-2580.
Ormanci, Eda / Nikbay, U. Hakan / Turk, Oytun / Arslan, Levent M.:
"Subjective assessment of frequency bands for perception of speaker identity",
2581-2584.
Speech to Speech Translation - I
Stallard, David / Natarajan, Premkumar / Noamany, Mohammed / Schwartz, Richard / Makhoul, John:
"Design for a speech-to-speech translator for field use",
1705-1708.
Black, Alan W. / Brown, Ralf D. / Frederking, Robert / Lenzo, Kevin / Moody, John / Rudnicky, Alexander I. / Singh, Rita / Steinbrecher, Eric:
"Rapid development of speech-to-speech translation systems",
1709-1712.
Imamura, Kenji / Sumita, Eiichiro:
"Bilingual corpus cleaning focusing on translation literality",
1713-1716.
Tanaka, Hideki / Nightingale, Stephen / Kashioka, Hideki / Matsumoto, Kenji / Nishiwaki, Masamchi / Kumano, Tadashi / Maruyama, Takehiko:
"Speech to speech translation system for monologues-data driven approach",
1717-1720.
Gispert, Adrià de / Mariño, José B.:
"Using x-grams for speech-to-speech translation",
1885-1888.
Watanabe, Taro / Sumita, Eiichiro:
"Statistical machine translation decoder based on phrase",
1889-1892.
Sumita, Eiichiro / Akiba, Yasuhiro / Imamura, Kenji:
"Reliability measures for translation quality",
1893-1896.
Zhou, Bowen / Gao, Yuqing / Sorensen, Jeffrey / Diao, Zijian / Picheny, Michael:
"Statistical natural language generation for speech-to-speech machine translation systems",
1897-1900.
Vogel, Stephan / Tribble, Alicia:
"Improving statistical machine translation for a speech-to-speech translation task",
1901-1904.
Rossato, Solange / Blanchon, Hervé / Besacier, Laurent:
"Speech-to-speech translation system evaluation: results for French for the NESPOLE! project first showcase",
1905-1908.
Kauers, Manuel / Vogel, Stephan / Fügen, Christian / Waibel, Alex:
"Interlingua based statistical machine translation",
1909-1912.
Speech Processing
Nishizawa, Nobuyuki / Hirose, Keikichi / Minematsu, Nobuaki:
"Separation of voiced source characteristics and vocal tract transfer function characteristics for speech sounds by iterative analysis based on AR-HMM model",
1721-1724.
Narusawa, Shuichi / Minematsu, Nobuaki / Hirose, Keikichi / Fujisaki, Hiroya:
"Automatic extraction of model parameters from fundamental frequency contours of English utterances",
1725-1728.
Murakami, Takahiro / Namba, Munehiro / Hoya, Tetsuya / Ishida, Yoshihisa:
"Pitch extraction of speech signals using an eigen-based subspace method",
1729-1732.
Nakatani, Tomohiro / Irino, Toshio:
"Robust fundamental frequency estimation against background noise and spectral distortion",
1733-1736.
Quatieri, Thomas F.:
"2-d processing of speech with application to pitch estimation",
1737-1740.
Speech Recognition: Broadcast and Courtroom Transcription
Saraclar, Murat / Riley, Michael / Bocchieri, Enrico / Goffin, Vincent:
"Towards automatic closed captioning : low latency real time broadcast news transcription",
1741-1744.
Prasad, Rohit / Nguyen, Long / Schwartz, Richard / Makhoul, John:
"Automatic transcription of courtroom speech",
1745-1748.
Nguyen, Long / Guo, Xuefeng / Schwartz, Richard / Makhoul, John:
"Japanese broadcast news transcription",
1749-1752.
Hecht, Robert / Riedler, Jürgen / Backfried, Gerhard:
"German broadcast news transcription",
1753-1756.
Imai, Toru / Matsui, Atsushi / Homma, Shinichi / Kobayakawa, Takeshi / Onoe, Kazuo / Sato, Shoei / Ando, Akio:
"Speech recognition with a re-speak method for subtitling live broadcasts",
1757-1760.
Duration, Tempo, and Intonation
Takamaru, Keiichi / Hiroshige, Makoto / Araki, Kenji / Tochinai, Koji:
"Evaluation of the method to detect Japanese local speech rate deceleration applying the variable threshold with a constant term",
1761-1764.
Kirkham, Sandra P.:
"Tempo modulations in English: selected pilot study results",
1765-1768.
Smith, Caroline L.:
"Modeling durational variability in reading aloud a connected text",
1769-1772.
Hifny, Yasser / Rashwan, Mohsen:
"Duration modeling for arabic text to speech synthesis",
1773-1776.
Jokisch, Oliver / Ding, Hongwei / Kruschke, Hans / Strecha, Guntram:
"Learning syllable duration and intonation of Mandarin Chinese",
1777-1780.
Speech Coding and Transmission
Patwardhan, Pushkar / Rao, Preeti:
"Controlling perceived degradation in spectrum envelope modeling via predistortion",
1837-1840.
Veprek, Peter / Bradley, Alan B.:
"Benefit and cost analysis of using the improved vector quantizer design algorithm for glottal source waveform compression",
1841-1844.
Zhong, Xin / Arrowood, Jon A. / Clements, Mark A.:
"Speech coding and transmission for improved automatic recognition",
1845-1848.
Nguyen, Phu Chien / Ochi, Takao / Akagi, Masato:
"Coding speech at very low rates using straight and temporal decomposition",
1849-1852.
Nieminen, Toni P.:
"Floating-point adaptive multi-rate wideband speech codec",
1853-1856.
Halmi, Omar / Tolba, Hesham / Guerchi, Driss / O’Shaughnessy, Douglas:
"On improving the performance of analysis-by-synthesis coding using a multi-magnitude algebraic code-book excitation signal",
1857-1860.
Humphreys, K. / Lawlor, R.:
"Improved performance speech codec for mobile communications",
1861-1864.
Yakhnich, Evgeni / Bistritz, Yuval:
"Fixed-length segment coding of LSF parameters",
1865-1868.
Parsa, Vijay / Jamieson, Donald G.:
"Interaction of voice over internet protocol speech coders and disordered speech samples",
1869-1872.
Kelleher, Holly / Pearce, David / Ealey, Doug / Mauuary, Laurent:
"Speech recognition performance comparison between DSR and AMR transcoded speech",
1873-1876.
Hirsch, Hans-Günter:
"The influence of speech coding on recognition performance in telecommunication networks",
1877-1880.
Moharir, Gautam / Patwardhan, Pushkar / Rao, Preeti:
"Spectral enhancement preprocessing for the HNM coding of noisy speech",
1881-1884.
Spoken Document Retrieval
Brun, Armelle / Smaïli, Kamel / Haton, Jean-Paul:
"Contribution to topic identification by using word similarity",
1965-1968.
Zhou, Bowen / Hansen, John H. L.:
"Speechfind: an experimental on-line spoken document retrieval system for historical audio archives",
1969-1972.
Suzuki, Yoshimi / Fukumoto, Fumiyo / Sekiguchi, Yoshihiro:
"Topic tracking using subject templates",
1973-1976.
Asami, Katsushi / Takezawa, Toshiyuki / Kikui, Genichiro:
"Topic detection of an utterance for speech dialogue processing",
1977-1980.
Liu, Daben / Ma, Jeffrey / Xu, Dongxin / Srivastava, Amit / Kubala, Francis:
"Real-time rich-content transcription of Chinese broadcast news",
1981-1984.
Wang, Chun-Jen / Chen, Berlin / Lee, Lin-shan:
"Improved Chinese spoken document retrieval with hybrid modeling and data-driven indexing features",
1985-1988.
Larson, Martha / Eickeler, Stefan / Paaß, Gerhard / Leopold, Edda / Kindermann, Jörg:
"Exploring sub-word features and linear support vector machines for German spoken document classification",
1989-1992.
Wester, Mirjam / Kessens, Judith M. / Strik, Helmer:
"Goal-directed ASR in a multimedia indexing and searching environment (MUMIS)",
1993-1996.
Logan, Beth / Thong, J. M. Van:
"Confusion-based query expansion for OOV words in spoken document retrieval",
1997-2000.
Wickramaratna, J. T. / Woodland, P. C.:
"Cluster identification for speaker-environment tracking",
2001-2004.
Pinquier, Julien / Rouas, Jean-Luc / André-Obrecht, Régine:
"Robust speech / music classification in audio documents",
2005-2008.
Karnebäck, Stefan:
"Expanded examinations of a low frequency modulation feature for speech/music discrimination",
2009-2012.
Ezzaidi, Hassan / Rouat, Jean:
"Speech, music and songs discrimination in the context of handsets variability",
2013-2016.
Acoustic Correlates and Recognition of Emotion
Scherer, Klaus R. / Grandjean, D. / Johnstone, Tom / Klasmeyer, Gudrun / Bänziger, Thomas:
"Acoustic correlates of task load and stress",
2017-2020.
Rahurkar, Mandar A. / Hansen, John H. L. / Meyerhoff, James / Saviolakis, George / Koenig, Michael:
"Frequency band analysis for stress detection using a teager energy operator based feature",
2021-2024.
Yuan, Jiahong / Shen, Liqin / Chen, Fangxin:
"The acoustic realization of anger, fear, joy and sadness in Chinese",
2025-2028.
Tato, Raquel / Santos, Rocío / Kompe, Ralf / Pardo, J. M.:
"Emotional space improves emotion recognition",
2029-2032.
Chuang, Ze-Jing / Wu, Chung-Hsien:
"Emotion recognition from textual input using an emotional semantic network",
2033-2036.
Ang, Jeremy / Dhillon, Rajdip / Krupski, Ashley / Shriberg, Elizabeth / Stolcke, Andreas:
"Prosody-based automatic detection of annoyance and frustration in human-computer dialog",
2037-2040.
Makarova, Veronika / Petrushin, Valery A.:
"RUSLANA: a database of Russian emotional utterances",
2041-2044.
Dialog Strategy Design
O’Neill, Ian M. / McTear, Michael F.:
"A pragmatic confirmation mechanism for an object-based spoken dialogue manager",
2045-2048.
Torge, Sunna / Rapp, Stefan / Kompe, Ralf:
"Serving complex user wishes with an enhanced spoken dialogue system",
2049-2052.
Chung, Grace / Seneff, Stephanie:
"Integrating speech with keypad input for automatic entry of spelling and pronunciation of new words",
2053-2056.
Campana, Ellen / Brown-Schmidt, Sarah / Tanenhaus, Michael K.:
"Reference resolution by human partners in a natural interactive problem-solving task",
2057-2060.
Ferrer, Luciana / Shriberg, Elizabeth / Stolcke, Andreas:
"Is the speaker done yet? faster and more accurate end-of-utterance detection using prosody",
2061-2064.
Gorrell, Genevieve / Lewin, Ian / Rayner, Manny:
"Adding intelligent help to mixed-initiative spoken dialogue systems",
2065-2068.
Shin, Jongho / Narayanan, Shrikanth S. / Gerber, Laurie / Kazemzadeh, Abe / Byrd, Dani:
"Analysis of user behavior under error conditions in spoken dialogs",
2069-2072.
Speech Synthesis - Prosody
Jiang, Yinglong / Murphy, Peter:
"Production based pitch modification of voiced speech",
2073-2076.
Sun, Xuejing:
"F0 generation for speech synthesis using a multi-tier approach",
2077-2080.
Strom, Volker:
"From text to prosody without toBI",
2081-2084.
Hirose, Keikichi / Eto, Masaya / Minematsu, Nobuaki:
"Improved corpus-based synthesis of fundamental frequency contours using generation process model",
2085-2088.
Buhmann, Jeska / Martens, Jean-Pierre / Macken, Lieve / Coile, Bert Van:
"Intonation modelling for the synthesis of structured documents",
2089-2092.
Meron, Joram:
"Applying fallback to prosodic unit selection from a small imitation database",
2093-2096.
Tao, Jianhua / Cai, Lianhong:
"Clustering and feature learning based F0 prediction for Chinese speech synthesis",
2097-2100.
Speech Features
Weber, Katrin / Wet, Febe de / Cranen, Bert / Boves, Lou / Bengio, Samy / Bourlard, Hervé:
"Evaluation of formant-like features for ASR",
2101-2104.
Al-Dulaimy, Fadhil H. T. / Wang, Zuoying:
"Entropy of energy operator as feature for large vocabulary Mandarin speaker independent speech recognition",
2105-2108.
Zhang, Yiyan / Liu, Wenju / Xu, Bo / Zhang, Huayun:
"Improving parametric trajectory modeling by integration of pitch and tone information",
2109-2112.
Tolba, Hesham / Selouani, Sid-Ahmed / O’Shaughnessy, Douglas:
"Comparative experiments to evaluate the use of auditory-based acoustic distinctive features and formant cues for automatic speech recognition using a multi-stream paradigm",
2113-2116.
Leung, Ka-Yee / Siu, Manhung:
"Speech recognition using combined acoustic and articulatory information with retraining of acoustic model parameters",
2117-2120.
Wilkinson, N. J. / Russell, Martin J.:
"Improved phone recognition on TIMIT using formant frequency data and confidence measures",
2121-2124.
Kitaoka, Norihide / Yamada, Daisuke / Nakagawa, Seiichi:
"Speaker independent speech recognition using features based on glottal sound source",
2125-2128.
Omar, Mohamed Kamal / Chen, Ken / Hasegawa-Johnson, Mark / Brandman, Yigal:
"An evaluation of using mutual information for selection of acoustic-features representation of phonemes for speech recognition",
2129-2132.
Metze, Florian / Waibel, Alex:
"A flexible stream architecture for ASR using articulatory features",
2133-2136.
Ljolje, Andrej:
"Speech recognition using fundamental frequency and voicing in acoustic modeling",
2137-2140.
Karnjanadecha, Montri / Kimsawad, Patimakorn:
"A comparison of front-end analyses for Thai speech recognition",
2141-2144.
Turunen, Jari / Tanttu, Juha T. / Loula, Pekka:
"New model for speech residual signal shaping with static nonlinearity",
2145-2148.
Ho, Ching-Hsiang / Rentzos, Dimitrios / Vaseghi, Saeed:
"Formant model estimation and transformation for voice morphing",
2149-2152.
Megyesi, Beáta / Gustafson-Capková, Sofia:
"Production and perception of pauses and their linguistic context in read and spontaneous speech in Swedish",
2153-2156.
Manfredi, Claudia / Matassini, Lorenzo:
"Non-linear techniques for dysphonic voice analysis and correction",
2157-2160.
Sasou, Akira / Tanaka, Kazuyo:
"Adaptive estimation of time-varying features from high-pitched speech based on an excitation source HMM",
2161-2164.
Toda, Martine / Maeda, Shinji / Carlen, Andreas J. / Meftahi, Lyes:
"Lip gestures in English sibilants: articulatory - acoustic relationship",
2165-2168.
Malayath, Naren / Hermansky, Hynek:
"Bark resolution from speech data",
2169-2172.
Special Topics in Robust Speech Recognition
Selouani, Sid-Ahmed / O’Shaughnessy, Douglas:
"Noise-robust speech recognition in car environments using genetic algorithms and a mel-cepstral subspace approach",
2173-2176.
Axelrod, Scott / Gopinath, Ramesh / Olsen, Peder:
"Modeling with a subspace constraint on inverse covariance matrices",
2177-2180.
McCowan, Iain A. / Morris, Andrew C. / Bourlard, Hervé:
"Improving speech recognition performance of small microphone arrays using missing data techniques",
2181-2184.
Gelbart, David / Morgan, Nelson:
"Double the trouble: handling noise and reverberation in far-field automatic speech recognition",
2185-2188.
Couvreur, Laurent / Ris;, Christophe:
"Model-based independent component analysis for robust multi-microphone automatic speech recognition",
2189-2192.
Yu, An-Tze / Wang, Hsiao-Chuan:
"Compensation of channel effect on line spectrum frequencies",
2193-2196.
Zhang, Huayun / Han, Zhaobing / Xu, Bo:
"Codebook dependent dynamic channel estimation for Mandarin speech recognition over telephone",
2197-2200.
Gemello, Roberto / Mana1, Franco / Pegoraro, Paolo / Mori, Renato De:
"Robust multiple resolution analysis for automatic speech recognition",
2201-2204.
Peinado, Antonio M. / Sánchez, Victoria / Pérez-Córdoba, José L. / Segura, José C. / Rubio, Antonio J.:
"HMM-based methods for channel error mitigation in distributed speech recognition",
2205-2208.
Fingscheidt, Tim / Aalburg, Stefanie / Stan, Sorel / Beaugeant, Christophe:
"Network-based vs. distributed speech recognition in adaptive multi-rate wireless systems",
2209-2212.
Bernard, Alexis / Alwan, Abeer:
"Channel noise robustness for low-bitrate remote speech recognition",
2213-2216.
Peláez-Moreno, C. / Gallardo-Antolín, A. / Vicente-Peña, J. / Díaz-de-María, F.:
"Influence of transmission errors on ASR systems",
2217-2220.
Tsuge, Satoru / Kuroiwa, Shingo / Shishibori, Masami / Ren, Fuji / Kita, Kenji:
"Robust feature extraction in a variety of input devices on the basis of ETSI standard DSR front-end",
2221-2224.
Tan, Zheng-Hua / Dalsgaard, Paul:
"Channel error protection scheme for distributed speech recognition",
2225-2228.
Muthusamy, Yeshwant / Gong, Yifan / Gupta, Roshan:
"The effects of speech compression on speech recognition and text-to-speech synthesis",
2229-2232.
Milner, Ben / Shao, Xu:
"Transform-based feature vector compression for distributed speech recognition",
2233-2236.
Distributed Multimodal Dialog Management Using Internet Technologies - I
Johnston, Michael / Bangalore, Srinivas / Stent, Amanda / Vasireddy, Gunaranjan / Ehlen, Patrick:
"Multimodal language processing for mobile information access",
2237-2240.
Wang, Kuansan:
"SALT: a spoken language interface for web-based multimodal dialog systems",
2241-2244.
Bennett, Christina / Llitjós, Ariadna Font / Shriver, Stefanie / Rudnicky, Alexander I. / Black, Alan W.:
"Building voiceXML-based applications",
2245-2248.
Chai, Joyce:
"Operations for context-based multimodal interpretation in conversational systems",
2249-2252.
Liu, Feng / Saad, Antoine / Li, Li / Chou, Wu:
"A distributed multimodal dialogue system based on dialogue system and web convergence",
2253-2256.
Katsurada, Kouichi / Ootani, Yoshihiko / Nakamura, Yusaku / Kobayashi, Satoshi / Yamada, Hirobumi / Nitta, Tsuneo:
"A modality-independent MMI system architecture",
2549-2552.
Armaroli, Cristiana / Azzini, Ivano / Ferrario, Lorenza / Giorgino, Toni / Nardelli, Luca / Orlandi, Marco / Rognoni, Carla:
"An architecture for a multi-modal web browser",
2553-2556.
Ehlen, Patrick / Johnston, Michael / Vasireddy, Gunaranjan:
"Collecting mobile multimodal data for match",
2557-2560.
Meng, Helen M. / Ching, P. C. / Wong, Yee Fong / Chan, Cheong Chat:
"ISIS: a multi-modal, trilingual, distributed spoken dialog system developed with CORBA, java, XML and KQML",
2561-2564.
Phonology
Tsukada, Kimiko:
"An acoustic comparison between american English and australian English vowels",
2257-2260.
Jesus, Luis M.T. / Shadle, Christine H.:
"A case study of portuguese and English bilinguality",
2261-2264.
Dioubina, Olga I. / Pfitzinger, Hartmut R.:
"An IPA vowel diagram approach to analysing L1 effects on vowel production and perception",
2265-2268.
Helgason, Pétur / Gullbein, Sjúrðhur:
"Phonological norms in faroese speech synthesis",
2269-2272.
Mareüil, Philippe Boula de / Adda-Decker, Martine:
"Studying pronunciation variants in French by using alignment techniques",
2273-2276.
Hansson, Petra:
"Perceived boundary strength",
2277-2280.
Jun, Sun-Ah:
"Syntax over focus",
2281-2284.
Ohala, John J. / Roengpitya, Rungpat:
"Duration related phase realignment of Thai tones",
2285-2288.
Bosch, Louis ten:
"Probabilistic ranking of constraints",
2289-2292.
Komatsu, Masahiko / Tokuma, Shinichi / Tokuma, Won / Arai, Takayuki:
"Multi-dimensional analysis of sonority: perception, acoustics, and phonology",
2293-2296.
Feature Extraction for Speaker Recognition
Faúndez-Zanuy, Marcos / Nilsson, Mattias / Kleijn, W. Bastiaan:
"On the relevance of bandwidth extension for speaker verification",
2317-2320.
Sabac, Bogdan:
"Speaker recognition using discriminative features selection",
2321-2324.
Kinnunen, Tomi:
"Designing a speaker-discriminative adaptive filter bank for speaker recognition",
2325-2328.
Tsang, Chi-Leung / Mak, CMan-Wai / Kung, Sun-Yuan:
"Divergence-based out-of-class rejection for telephone handset identification",
2329-2332.
Ho, Purdy:
"A handset identifier using support vector machines",
2333-2336.
Issues in Speech Recognition
Faltlhauser, Robert / Ruske, Günther / Thomae, M.:
"Towards the question: why has speaking rate such an impact on speech recognition performance?",
2429-2432.
Arcienega, Mijail / Drygajlo, Andrzej:
"Robust voiced-unvoiced decision associated to continuous pitch tracking in noisy telephone speech",
2433-2436.
Yao, Kaisheng / Paliwal, Kuldip K. / Nakamura, Satoshi:
"Noise adaptive speech recognition with acoustic models trained from noisy speech evaluated on Aurora-2 database",
2437-2440.
Chen, Jingdong / Huang, Yiteng (Arden) / Li, Qi / Soong, Frank K.:
"Recognition of noisy speech using normalized moments",
2441-2444.
Chen, Chia-Ping / Bilmes, Jeff A. / Kirchhoff, Katrin:
"Low-resource noise-robust feature post-processing on Aurora 2.0",
2445-2448.
Deng, Li / Droppo, Jasha / Acero, Alex:
"Exploiting variances in robust feature extraction based on a parametric model of speech distortion",
2449-2452.
Ghulam, Muhammad / Fukuda, Takashi / Sato, Takaharu / Nitta, Tsuneo:
"Improving performance of an HMM-based ASR system by using monophone-level normalized confidence measure",
2453-2456.
Liu, Yi / Fung, Pascale:
"Model partial pronunciation variations for spontaneous Mandarin speech recognition",
2457-2460.
Zheng, Fang / Song, Zhanjiang / Fung, Pascale / Byrne, William:
"Reducing pronunciation lexicon confusion and using more data without phonetic transcription for pronunciation modeling",
2461-2464.
McDermott, Erik / Katagiri, Shigeru:
"Classification error from the theoretical Bayes classification risk",
2465-2468.
Klautau, Aldebaro / Jevtic, Nikola / Orlitsky, Alon:
"Combined binary classifiers with applications to speech recognition",
2469-2472.
Nagórski, Arkadiusz / Boves, Lou / Steeneken, Herman:
"Optimal selection of speech data for automatic speech recognition systems",
2473-2476.
Speech Pathology Processing and Treatment
Liotti, Mario / Ramig, Lorraine O. / Vogel, Deanie / New3, Pamela / Cook, Chris / Fox, Peter:
"Hypophonia in parkinson disease: neural correlates of voice treatment with LSVT revealed by PET",
2477-2480.
Duncan, Susan:
"Preliminary data on effects of behavioral and levodopa therapies on speech-accompanying gesture in parkinson²s disease",
2481-2484.
Quek, Francis / Harper, Mary / Haciahmetoglu, Yonca / Chen, Lei / Ramig, Lorraine O.:
"Speech pauses and gestural holds in parkinson²s disease",
2485-2488.
Spielman, Jennifer L. / Ramig, Lorraine O. / Borod, Joan C.:
"Oro-facial changes in parkinson²s disease following intensive voice therapy (LSVT)",
2489-2492.
Logemann, Jeri / Sundin, Ralph / Sundin, Jean:
"Swallowing and voice effects of lee silverman voice treatment (LSVT)",
2493-2496.
Speech Pathology Processing
Will, Leslie / Ramig, Lorraine O. / Spielman, Jennifer L.:
"Application of the lee silverman voice treatment (LSVT) to individuals with multiple sclerosis, ataxic dysarthria, and stroke",
2497-2500.
Farley, Becky G.:
"Think big, from voice to limb movement therapy",
2501-2504.
Parsa, Vijay / Jamieson, Donald G. / Stenning, Karen / Leeper, Herbert A.:
"On the estimation of signal-to-noise ratio in continuous speech for abnormal voices",
2505-2508.
Applications of Speech Signal Processing
Semenov, V. / Kovtonyuk, A. / Kalyuzhny, A.:
"Computationally efficient method of speech enhancement based on block representation of signal in state space and vector quantization **********************",
2509-2512.
Kondo, Kazuhiro / Nakagawa, Kiyoshi:
"Active speech cancellation for cellular speech",
2513-2516.
Muralishankar, R. / Ramakrishnan, A. G. / Prathibha, P.:
"Warped-LP residual resampling using DCT for pitch modification",
2517-2520.
Jung, E. / Schwarzbacher, A. / Humphreys, K. / Lawlor, R.:
"Application of real-time AMDF pitch-detection in a voice gender normalisation system",
2521-2524.
Laprie, Yves / Bonneau, Anne:
"A copy synthesis method to pilot the klatt synthesiser",
2525-2528.
Sakamoto, Masaharu / Saito, Takashi:
"Speaker recognizability evaluation of a voicefont-based text-to-speech system",
2529-2532.
Satué-Villar, Antonio / Fernández-Rubio, Juan:
"Time-frequency transforms and beamforming for speaker recognition",
2533-2536.
Kwon, Soonil / Narayanan, Shrikanth S.:
"Speaker change detection using a new weighted distance measure",
2537-2540.
Gómez-Cipriano, José L. / Nunes, Roger P. / Barone, Dante A. C.:
"FPGA hardware for speech recognition using hidden Markov models",
2541-2544.
Irino, Toshio / Minami, Yasuhiro / Nakatani, Tomohiro / Tsuzaki, Minoru / Tagawa, H.:
"Evaluation of a speech recognition / generation method based on HMM and straight",
2545-2548.
Speech Synthesis: Unit Selection
Vepa, Jithendra / King, Simon / Taylor, Paul:
"Objective distance measures for spectral discontinuities in concatenative speech synthesis",
2605-2608.
Hamza, Wael / Donovan, Robert:
"Data-driven segment preselection in the IBM trainable speech synthesis system",
2609-2612.
Peng, Hu / Zhao, Yong / Chu, Min:
"Perpetually optimizing the cost function for unit selection in a TTS system with one single run of MOS evaluation",
2613-2616.
Yi, Jon / Glass, James:
"Information-theoretic criteria for unit selection synthesis",
2617-2620.
Kawai, Hisashi / Tsuzaki, Minoru:
"Acoustic measures vs. phonetic features as predictors of audible discontinuity in concatenative speech synthesis",
2621-2624.
Dialog Systems and Applications
Wang, Hsien-Chang / Huang, Chieh-Yi / Yang, Chung-Hsien / Wang, Jhing-Fa:
"A study of multi-speaker dialogue system for mobile information retrieval",
2677-2680.
Fabbrizio, Giuseppe Di / Dutton, Dawn / Gupta, Narendra K. / Hollister, Barbara / Rahim, Mazin / Riccardi, Giuseppe / Schapire, Robert / Schroeter, Juergen:
"AT&t help desk",
2681-2684.
Trias-Sanz, Roger / Mariño, José B.:
"Basurde[lite], a machine-driven dialogue system for accessing railway timetable information",
2685-2688.
Coulston, Rachel / Oviatt, Sharon / Darves, Courtney:
"Amplitude convergence in children²s conversational speech with animated personas",
2689-2692.
Stallard, David:
"Flexible dialogue management in the talk²n’travel system",
2693-2696.
Oria, Daniela / Koskinen, Esa:
"E-mail goes mobile: the design and implementation of a spoken language interface to e-mail",
2697-2700.
Yoma, Néstor Becerra / Cortés, Angela / Hormazábal, Mauricio / López, Enrique:
"Wizard of oz evaluation of a dialogue with communicator system in chile",
2701-2704.
Carpenter, Bob / Caskey, Sasha / Dayanidhi, Krishna / Drouin, Caroline / Pieraccini, Roberto:
"A portable, server-side dialog framework for voiceXML",
2705-2708.
Takahashi, S. / Morimoto, T. / Maeda, S. / Tsuruta, N.:
"Spoken dialogue system for home health care",
2709-2712.
Padrell, Jaume / Hernando, Javier:
"ACIMET: access to meteorological information by telephone",
2713-2716.
Engel, Ralf:
"SPIN: language understanding for spoken dialogue systems using a production system approach",
2717-2720.
Original Workshop Website
The link to the original website will bring you to the workshop website as long as it is
maintained.