7th International Conference on Spoken Language Processing
Table of Contents
[ICSLP-2002] 7th International Conference on Spoken Language Processing (ICSLP2002 - INTERSPEECH 2002), Denver, Colorado, USA, September 16-20, 2002, ed. by John H. L. Hansen and Bryan Pellom, ISCA Archive, http://www.isca-speech.org/archive/icslp02
Names written in boldface refer to first authors, in CAPITAL letters to keynote and invited papers. Full papers can be accessed from the abstracts (ISCA members only). Please note that each abstract opens in a separate window.
A B C D E F G H I J K L M N O PQ R S T UV W XY Z
Acoustic Correlates and Recognition of Emotion Acoustic Modeling Acoustic Speech Modeling
Applications of Speech Signal Processing Auditory Models and Hearing Aids
Call Classification and Routing Dialog Strategy Design Dialog Systems and Applications Dialog Systems I: Evaluation
Distributed Multimodal Dialog Management Using Internet Technologies
Duration, Tempo, and Intonation Experimental Phonetics Feature Extraction for Speaker Recognition
Finite State Transducers Applied to Spoken Language Processing
Integration of Speech Technology in Language Learning INTERSPEECH
Issues in Audio-Visual Spoken Language Processing Issues in Speech Recognition
Language Identification Language Modeling Large Vocabulary Speech Recognition
Mechanisms for Dialogue Processing Model Based Speech Processing
Multi-Lingual and Non-Native Spoken Language Processing Multimodal Spoken Language Processing
Pathology of Voice and Speech Production Perception Perception of Prosody Perception: Non-Native
Phonetics Phonology Prosody and Speech Recognition Prosody in Spoken Dialogue Systems
Speaker Modeling and Scoring Speaker Segmentation and Adaptation
Special Topics in Robust Speech Recognition
Speech Coding and Transmission Speech Enhancement Speech Features
Speech Pathology Processing Speech Pathology Processing and Treatment
Speech Processing Speech Production: Models and Physiology
Speech Recognition in Noise Speech Recognition: Adaptation Speech Recognition: Broadcast and Courtroom Transcription
Speech Recognition: In-Vehicle Speech Recognition: Practical Issues Speech Recognition: Search
Speech Synthesis Speech Synthesis: Alternative Views Speech Synthesis: Prosody Speech Synthesis: Unit Selection
Speech Technology Applications Speech to Speech Translation Spoken Document Retrieval
Spoken Language Understanding Spoken Language Resources Tools for Spoken Language Resources Voice Conversion
Fitch, W. Tecumseh: "The evolution of spoken language: a comparative approach", 1-8.
Young, Steve: "Talking to machines (statistically speaking)", 9-16.
Macho, Duncan / Mauuary, Laurent / Noé, Bernhard / Cheng, Yan Ming / Ealey, Doug / Jouvet, Denis / Kelleher, Holly / Pearce, David / Saadoun, Fabien: "Evaluation of a noise-robust DSR front-end on Aurora databases", 17-20.
Adami, Andre / Burget, Lukás / Dupont, Stephane / Garudadri, Hari / Grezl, Frantisek / Hermansky, Hynek / Jain, Pratibha / Kajarekar, Sachin / Morgan, Nelson / Sivadas, Sunil: "Qualcomm-ICSI-OGI features for ASR", 21-24.
Kleinschmidt, Michael / Gelbart, David: "Improving word accuracy with Gabor feature extraction", 25-28.
Droppo, Jasha / Deng, Li / Acero, Alex: "Evaluation of SPLICE on the Aurora 2 and 3 tasks", 29-32.
Mak, Brian / Tam, Yik-Cheung: "Performance of discriminatively trained auditory features on Aurora2 and Aurora3", 33-36.
Segura, José C. / Benítez, M.C. / Torre, Ángel de la / Rubio, Antonio J.: "Feature extraction combining spectral noise reduction and cepstral histogram equalization for robust ASR", 225-228.
Chen, Jingdong / Dimitriadis, Dimitris / Jiang, Hui / Li, Qi / Myrvoll, Tor André / Siohan, Olivier / Soong, Frank K.: "Bell labs approach to Aurora evaluation on connected digit recognition", 229-232.
Kim, Hong Kook / Rose, Richard C.: "Algorithms for distributed speech recognition in a noisy automobile environment", 233-236.
Hilger, Florian / Molau, Sirko / Ney, Hermann: "Quantile based histogram equalization for online applications", 237-240.
Chen, Chia-Ping / Filali, Karim / Bilmes, Jeff A.: "Frontend post-processing and backend model enhancement on the Aurora 2.0/3.0 databases", 241-244.
Ida, Masaki / Nakamura, Satoshi: "HMM COmposition-based rapid model adaptation using a priori noise GMM adaptation evaluation on Aurora2 corpus", 437-440.
Hung, Jeih-weih / Lee, Lin-shan: "Data-driven temporal filters obtained via different optimization criteria evaluated on Aurora2 database", 441-444.
Kotnik, Bojan / Vlaj, Damjan / Kacic, Zdravko / Horvat, Bogomir: "Efficient additive and convolutional noise reduction procedures", 445-448.
Lieb, Markus / Fischer, Alexander: "Progress with the philips continuous ASR system on the Aurora 2 noisy digits database", 449-452.
Wu, Jian / Huo, Qiang: "An environment compensated minimum classification error training approach and its evaluation on Aurora2 database", 453-456.
Yao, Kaisheng / Zhu, Dong-Lai / Nakamura, Satoshi: "Evaluation of a noise adaptive speech recognition system on the Aurora 3 database", 457-460.
Docío-Ferández, Laura / García-Mateo, Carmen: "Distributed speech recognition over IP networks on the Aurora 3 database", 461-464.
Fujimoto, M. / Ariki, Yasuo: "Evaluation of noisy speech recognition based on noise reduction and acoustic model adaptation on the Aurora2 tasks", 465-468.
Saon, George / Huerta, Juan M.: "Improvements to the IBM Aurora 2 multi-condition system", 469-472.
Jain, Pratibha / Hermansky, Hynek / Kingsbury, Brian: "Distributed speech recognition using noise-robust MFCC and traps-estimated manner features", 473-476.
Kitaoka, Norihide / Nakagawa, Seiichi: "Evaluation of spectral subtraction with smoothing of time direction on the Aurora 2 task", 477-480.
Cui, Xiaodong / Iseli, Markus / Zhu, Qifeng / Alwan, Abeer: "Evaluation of noise robust features on the Aurora databases", 481-484.
Evans, Nicholas W. D. / Mason, John S.: "Computationally efficient noise compensation for robust automatic speech recognition assessed under the Aurora 2/3 framework", 485-488.
Farooq, O. / Datta, S.: "Mel-scaled wavelet filter based features for noisy unvoiced phoneme recognition", 1017-1020.
Onoe, Kazuo / Segi, Hiroyuki / Kobayakawa, Takeshi / Sato, Shoei / Imai, Toru / Ando, Akio: "Filter bank subtraction for robust speech recognition", 1021-1024.
Morris, Andrew C. / Payne, Simon / Bourlard, Hervé: "Low cost duration modelling for noise robust speech recognition", 1025-1028.
Gong, Yifan: "A comparative study of approximations for parallel model combination of static and dynamic parameters", 1029-1032.
Motícek, Petr / Burget, Lukás: "Noise estimation for efficient speech enhancement and robust speech recognition", 1033-1036.
Çetin, Özgür / Nock, Harriet J. / Kirchhoff, Katrin / Bilmes, Jeff A. / Ostendorf, Mari: "The 2001 GMTK-based SPINE ASR system", 1037-1040.
Hung, Wei-Wen: "Using adaptive signal limiter together with weighting techniques for noisy speech recognition", 1041-1044.
Yamade, Shingo / Matsunami, Kanako / Baba, Akira / Lee, Akinobu / Saruwatari, Hiroshi / Shikano, Kiyohiro: "Spectral subtraction in noisy environments applied to speaker adaptation based on HMM sufficient statistics", 1045-1048.
Siu, Manhung / Chan, Yu-Chung: "Robust speech recognition against short-time noise", 1049-1052.
Toma, M. / Lodi, A. / Guerrieri, R.: "Word endpoints detection in the presence of non-stationary noise", 1053-1056.
Pujol Marsal, Pere / Pol Font, Susagna / Hagen, Astrid / Bourlard, Hervé / Nadeu, Climent: "Comparison and combination of RASTA-PLP and FF features in a hybrid HMM/MLP speech recognition system", 1057-1060.
Xu, Tao / Cao, Zhigang: "Robust MMSE-FW-LAASR scheme at low SNRs", 1061-1064.
Zolnay, András / Schlüter, Ralf / Ney, Hermann: "Robust speech recognition using a voiced-unvoiced feature", 1065-1068.
Wet, Febe de / Veth, Johan de / Cranen, Bert / Boves, Lou: "Accumulated kullback divergence for analysis of ASR performance in the presence of noise", 1069-1072.
Kingsbury, Brian / Jain, Pratibha / Adami, Andre: "A hybrid HMM/traps model for robust voice activity detection", 1073-1076.
Zheng, Chengyi / Yan, Yonghong: "Run time information fusion in speech recognition", 1077-1080.
Arrowood, Jon A. / Clements, Mark A.: "Using observation uncertainty in HMM decoding", 1561-1564.
Stuttle, M. N. / Gales, M. J. F.: "Combining a Gaussian mixture model front end with MFCC parameters", 1565-1568.
Droppo, Jasha / Acero, Alex / Deng, Li: "Noise from corrupted speech log mel-spectral energies", 1569-1572.
Lima, Carlos / Almeida, Luís B. / Monteiro, João L.: "Improving the role of unvoiced speech segments by spectral normalisation in robust speech recognition", 1573-1576.
Gadde, Venkata Ramana Rao / Stolcke, Andreas / Vergyri, Dimitra / Zheng, Jing / Sönmez, Kemal / Venkataraman, Anand: "Building an ASR system for noisy environments: SRI’s 2001 SPINE evaluation system", 1577-1580.
Son, Rob J. J. H. van / Pols, Louis C. W.: "Evidence for efficiency in vowel production", 37-40.
Aylett, Matthew P.: "Stochastic suprasegmentals: relationship between the spectral characteristics of vowels, redundancy and prosodic structure", 41-44.
Serkhane, J. / Schwartz, Jean-Luc / Boë, Louis Jean / Davis, B. / Matyear, C.: "Motor specifications of a baby robot via the analysis of infants² vocalizations", 45-48.
Koenig, Laura L. / Lucero, Jorge C.: "Oral-laryngeal control patterns for fricatives in 5-year-olds and adults", 49-52.
Delvaux, Véronique / Metens, Thierry / Soquet, Alain: "French nasal vowels: acoustic and articulatory properties", 53-56.
Kenny, P. / Boulianne, G. / Dumouchel, Pierre: "Maximum likelihood estimation of eigenvoices and residual variances for large vocabulary speech recognition tasks", 57-60.
Pusateri, Ernest J. / Hazen, Timothy J.: "Rapid speaker adaptation using speaker clustering", 61-64.
Huang, Chao / Chen, Tao / Chang, Eric: "Adaptive model combination for dynamic speaker selection training", 65-68.
Kwan, Ka-Yan / Lee, Tan / Yang, Chen: "Unsupervised n-best based model adaptation using model-level confidence measures", 69-72.
Nguyen, Patrick / Rigazio, Luca / Wellekens, Christian / Junqua, Jean-Claude: "LU factorization for feature transformation", 73-76.
Ding, Guo-Hong / Zhu, Yi-Fei / Li, Chengrong / Xu, Bo: "Implementing vocal tract length normalization in the MLLR framework", 1389-1392.
Kim, Dong Kook / Kim, Nam Soo: "Markov models based on speaker space model evolution", 1393-1396.
Li, Baojie / Hirose, Keikichi / Minematsu, Nobuaki: "Robust speech recognition using inter-speaker and intra-speaker adaptation", 1397-1400.
Lima, Carlos / Almeida, Luís B. / Monteiro, João L.: "Continuous environmental adaptation of a speech recogniser in telephone line conditions", 1401-1404.
Illina, Irina: "Tree-structured maximum a posteriori adaptation for a segment-based speech recognition system", 1405-1408.
Plötz, Thomas / Fink, Gernot A.: "Robust time-synchronous environmental adaptation for continuous speech recognition systems", 1409-1412.
Niesler, Thomas / Willett, Daniel: "Unsupervised language model adaptation for lecture speech transcription", 1413-1416.
Li, Yongxin / Erdogan, Hakan / Gao, Yuqing / Marcheret, Etienne: "Incremental on-line feature space MLLR adaptation for telephony speech recognition", 1417-1420.
Molau, Sirko / Hilger, Florian / Keysers, Daniel / Ney, Hermann: "Enhanced histogram normalization in the acoustic feature space", 1421-1424.
Levin, David N.: "Blind normalization of speech from different channels and speakers", 1425-1428.
Ogata, Jun / Ariki, Yasuo: "Unsupervised acoustic model adaptation based on phoneme error minimization", 1429-1432.
Zhou, Bowen / Hansen, John H. L.: "Improved structural maximum likelihood eigenspace mapping for rapid speaker adaptation", 1433-1436.
Torre, Ángel de la / Fohr, Dominique / Haton, Jean-Paul: "Statistical adaptation of acoustic models to noise conditions for robust speech recognition", 1437-1440.
Brugnara, F. / Cettolo, M. / Federico, M. / Giuliani, D.: "Issues in automatic transcription of historical audio data", 1441-1444.
Stockmal, Verna / Bond, Zinny S.: "Same talker, different language: a replication", 77-80.
Jayram, A. K. V. Sai / Ramasubramanian, V. / Sreenivas, T. V.: "Automatic language identification using acoustic sub-word units", 81-84.
Maddieson, Ian / Vasilescu, Ioana: "Factors in human language identification", 85-88.
Torres-Carrasquillo, Pedro A. / Singer, Elliot / Kohler, Mary A. / Greene, Richard J. / Reynolds, Douglas A. / Deller Jr., J. R.: "Approaches to language identification using Gaussian mixture models and shifted delta cepstral features", 89-92.
Wong, Eddie / Sridharan, Sridha: "Methods to improve Gaussian mixture model based language identification system", 93-96.
Jing, Hongyan / Tzoukermann, Evelyne: "Part-of-speech tagging in French text-to-speech synthesis: experiments in tagset selection", 97-100.
Uebler, Ulla: "Grapheme-to-phoneme conversion using pseudo-morphological units", 101-104.
Bisani, M. / Ney, Hermann: "Investigations on joint-multigram models for grapheme-to-phoneme conversion", 105-108.
Galescu, Lucian / Allen, James F.: "Pronunciation of proper names with a joint n-gram model for bi-directional grapheme-to-phoneme conversion", 109-112.
Jilka, Matthias / Syrdal, Ann K.: "The AT&t German text-to-speech system: realistic linguistic description", 113-116.
Li, Haiping / Chen, Fangxin / Shen, Liqin: "Generating script using statistical information of the context variation unit vector", 117-120.
Kuo, Chih-Chung / Huang, Jing-Yi: "Efficient and scalable methods for text script generation in corpus-based TTS design", 121-124.
Rutten, Peter / Aylett, Matthew P. / Fackrell, Justin / Taylor, Paul: "A statistically motivated database pruning technique for unit selection synthesis", 125-128.
Wu, Yi-Jian / Hu, Yu / Wu, Xiaoru / Wang, Ren-Hua: "A new method of building decision tree based on target information", 129-132.
Yamagishi, Junichi / Tamura, Masatsune / Masuko, Takashi / Tokuda, Keiichi / Kobayashi, Takao: "A context clustering technique for average voice model in HMM-based speech synthesis", 133-136.
Tsuzaki, Minoru / Kawai, Hisashi: "Feature extraction for unit selection in concatenative speech synthesis: comparison between AIM, LPC, and MFCC", 137-140.
Campillo-Díaz, Francisco / Banga, Eduardo R.: "Combined prosody and candidate unit selections for corpus-based text-to-speech systems", 141-144.
Kim, Yeon-Jun / Conkie, Alistair: "Automatic segmentation combining an HMM-based approach and spectral boundary correction", 145-148.
Sethy, Abhinav / Narayanan, Shrikanth S.: "Refined speech segmentation for concatenative speech synthesis", 149-152.
Breen, Andrew / Eggleton, Barry / Dion, Peter / Minnis, Steve: "Refocussing on the text normalisation process in text-to-speech systems", 153-156.
Vepa, Jithendra / Ayachitam, Jahnavi / Reddy, K. V. K. Kalpana: "A text-to-speech synthesis system for telugu", 157-160.
Freitas, Diamantino / Braga, Daniela: "Towards an intonation module for a portuguese TTS system", 161-164.
Saito, Takashi / Sakamoto, Masaharu: "Applying a hybrid intonation model to a seamless speech synthesizer", 165-168.
Hirai, Toshio / Tenpaku, Seiichi / Shikano, Kiyohiro: "Using start/end timings of spectral transitions between phonemes in concatenative speech synthesis", 2357-2360.
Ni, Jinfu / Kawai, Hisashi: "Design of a Mandarin sentence set for corpus-based speech synthesis by use of a multi-tier algorithm taking account of the varied prosodic and spectral characteristics", 2361-2364.
Mori, Hiroki / Ohtsuka, Takahiro / Kasuya, Hideki: "A data-driven approach to source-formant type text-to-speech system", 2365-2368.
Shi, Yu / Chang, Eric / Peng, Hu / Chu, Min: "Power spectral density based channel equalization of large speech database for concatenative TTS system", 2369-2372.
Meng, Helen M. / Keung, Chi Kin / Siu, Kai Chung / Fung, Tien Ying / Ching, P. C.: "CU VOCAL: corpus-based syllable concatenation for Chinese speech synthesis across domains and dialects", 2373-2376.
Lu, Jinlin / Kawai, Hisashi: "Perceptual evaluation of naturalness due to substitution of Chinese syllable for concatenative speech synthesis", 2377-2380.
Chazan, Dan / Hoory, Ron / Kons, Zvi / Silberstein, Dorel / Sorin, Alexander: "Reducing the footprint of the IBM trainable speech synthesis system", 2381-2384.
Lee, Sung-Joo / Kim, Hyung Soon: "Computationally efficient time-scale modification of speech using 3 level clipping", 2385-2388.
Shuang, Zhi-Wei / Hu, Yu / Ling, Zhen-Hua / Wang, Ren-Hua: "A miniature Chinese TTS system based on tailored corpus", 2389-2392.
Song, Hoeun / Kim, Jaein / Lee, Kyongrok / Kim, Jinyoung: "Phonetic normalization using z-score in segmental prosody estimation for corpus-based TTS system", 2393-2396.
Kawahara, Hideki / Zolfaghari, Parham / Cheveigné, Alain de: "On F0 trajectory optimization for very high-quality speech manipulation", 2397-2400.
Lee, Tan / Kochanski, Greg / Shih, Chilin / Li, Yujia: "Modeling tones in continuous Cantonese speech", 2401-2404.
Dong, Minghui / Lua, Kim-Teng: "Pitch contour model for Chinese text-to-speech using CART and statistical model", 2405-2408.
Navas, Eva / Hernáez, Inmaculada / Sánchez, Juan María: "Basque intonation modelling for text to speech conversion", 2409-2412.
Low, Phuay Hui / Vaseghi, Saeed: "Application of microprosody models in text to speech synthesis", 2413-2416.
Zhao, Sheng / Tao, Jianhua / Cai, Lianhong: "Prosodic phrasing with inductive learning", 2417-2420.
Milner, Ben / Shao, Xu: "Speech reconstruction from mel-frequency cepstral coefficients using a source-filter model", 2421-2424.
Kawanami, Hiromichi / Masuda, Tsuyoshi / Toda, Tomoki / Shikano, Kiyohiro: "Designing Japanese speech database covering wide range in prosody for hybrid speech synthesizer", 2425-2428.
Bühler, Dirk / Minker, Wolfgang / Häußler, Jochen / Krüger, Sven: "Flexible multimodal human-machine interaction in mobile environments", 169-172.
Kaiser, Edward C. / Cohen, Philip R.: "Implementation testing of a hybrid symbolic/statistical multimodal architecture", 173-176.
Yamakata, Yoko / Kawahara, Tatsuya / Okuno, Hiroshi G.: "Belief network based disambiguation of object reference in spoken dialogue system for robot", 177-180.
Beskow, Jonas / Edlund, Jens / Nordstrand, Magnus: "Specification and realisation of multimodal output in dialogue systems", 181-184.
Quek, Francis / Xiong, Yingen / McNeill, David: "Gestural trajectory symmetries and discourse segmentation", 185-188.
Quek, Francis / McNeill, David / Bryll, Robert / Harper, Mary: "Gestural spatialization in natural discourse segmentation", 189-192.
Nakadai, Kazuhiro / Okuno, Hiroshi G. / Kitano, Hiroaki: "Real-time sound source localization and separation for robot audition", 193-196.
Ma, Jiyong / Yan, Jie / Cole, Ronald: "CU animate tools for enabling conversations with animated characters", 197-200.
Cohen, Philip R. / Coulston, Rachel / Krout, Kelly: "Multiparty multimodal interaction: a preliminary analysis", 201-204.
Poller, Peter / Müller, Jochen: "Distributed audio-visual speech synchronization", 205-208.
Daubias, Philippe / Deléglise, Paul: "Lip-reading based on a fully automatic statistical model", 209-212.
Liu, Xiaoxing / Zhao, Yibao / Pi, Xiaobo / Liang, Luhong / Nefian, Ara V.: "Audio-visual continuous speech recognition using a coupled hidden Markov model", 213-216.
Dybkjær, Laila / Bernsen, Niels Ole: "Data, annotation schemes and coding tools for natural interactivity", 217-220.
Quek, Francis / Shi, Yang / Kirbas, Cemil / Wu, Shunguang: "VisSTA: a tool for analyzing multimodal discourse data", 221-224.
Lambacher, Stephen / Martens, William / Kakehi, Kazuhiko: "The influence of identification training on identification and production of the american English mid and low vowels by native speakers of Japanese", 245-248.
Tajima, Keiichi / Akahane-Yamada, Reiko / Yamada, Tsuneo: "Perceptual learning of second-language syllable rhythm by elderly listeners", 249-252.
Clarke, Constance M.: "Perceptual adjustment to foreign-accented English with short term exposure", 253-256.
Burnham, Denis K. / Brooker, Ron: "Absolute pitch and lexical tones: tone perception by non-musician, musician, and absolute pitch non-tonal language speakers", 257-260.
Broersma, Mirjam: "Comprehension of non-native speech: inaccurate phoneme processing and activation of lexical competitors", 261-264.
Minker, Wolfgang: "Overview on recent activities in speech understanding and dialogue systems evaluation", 265-268.
Walker, Marilyn A. / Rudnicky, Alexander I. / Prasad, Rashmi / Aberdeen, John / Bratt, Elizabeth Owen / Garofolo, John S. / Hastie, Helen / Le, Audrey N. / Pellom, Bryan / Potamianos, Alex / Passonneau, Rebecca / Roukos, Salim / Sanders, Gregory A. / Seneff, Stephanie / Stallard, David: "DARPA communicator: cross-system results for the 2001 evaluation", 269-272.
Walker, Marilyn A. / Rudnicky, Alexander I. / Aberdeen, John / Bratt, Elizabeth Owen / Garofolo, John S. / Hastie, Helen / Le, Audrey N. / Pellom, Bryan / Potamianos, Alex / Passonneau, Rebecca / Prasad, Rashmi / Roukos, Salim / Sanders, Gregory A. / Seneff, Stephanie / Stallard, David: "DARPA communicator evaluation: progress from 2000 to 2001", 273-276.
Sanders, Gregory A. / Le, Audrey N. / Garofolo, John S.: "Effects of word error rate in the DARPA communicator data during 2000 and 2001", 277-280.
Sidner, Candace L. / Forlines, Clifton: "Subset languages for conversing with collaborative interface agents", 281-284.
Watanabe, Tomomi / Murakami, Takahiro / Namba, Munehiro / Hoya, Tetsuya / Ishida, Yoshihisa: "Transformation of spectral envelope for voice conversion based on radial basis function networks", 285-288.
Turk, Oytun / Arslan, Levent M.: "Subband based voice conversion", 289-292.
Mashimo, Mikiko / Toda, Tomoki / Kawanami, Hiromichi / Kashioka, Hideki / Shikano, Kiyohiro / Campbell, Nick: "Evaluation of cross-language voice conversion using bilingual and non-bilingual databases", 293-296.
Gustafson, Joakim / Sjölander, Kåre: "Voice transformations for improving children²s speech recognition in a publicly available dialogue system", 297-300.
Burger, Susanne / MacLaren, Victoria / Yu, Hua: "The ISL meeting corpus: the impact of meeting type on speech style", 301-304.
López-Cózar, R. / Torre, Ángel de la / Segura, José C. / Rubio, Antonio J. / López-Soler, J. M.: "A new method for testing dialogue systems based on simulations of real-world conditions", 305-308.
Ludwig, Thorsten: "Comfort noise detection and GSM-FR-codec detection for speech-quality evaluations in telephone networks", 309-312.
Cucchiarini, Catia / Binnenpoorte, Diana: "Validation and improvement of automatic phonetic transcriptions", 313-316.
Aman, Shigeaki / Kato, Kazumi / Kondo, Tadahisa: "Development of Japanese infant speech database and speaking rate analysis", 317-320.
Dong, Minghui / Lua, Kim-Teng: "Automatic prosodic break labeling for Mandarin Chinese speech data", 321-324.
Zitouni, Imed / Olive, Joseph / Iskra, Dorota / Choukri, Khalid / Emam4, Ossama / Gedge, Oren / Maragoudakis, Emmanuel / Tropf, Herbert / Moreno, Asunción / Rodriguez, Albino Nogueiras / Heuft, Barbara / Siemund, Rainer: "Orientel: speech-based interactive communication applications for the mediterranean and the middle east", 325-328.
Alvarez, Yolanda Vazquez / Huckvale, Mark: "The reliability of the ITU-t p.85 standard for the evaluation of text-to-speech systems", 329-332.
Demuynck, Kris / Laureys, Tom / Gillis, Steven: "Automatic generation of phonetic transcriptions for large speech corpora", 333-336.
Minker, Wolfgang: "Overview on recent activities in speech understanding and dialogue systems evaluation", 337-340.
Bennett, Christina / Rudnicky, Alexander I.: "The carnegie mellon communicator corpus", 341-344.
Schultz, Tanja: "Globalphone: a multilingual speech and text database developed at karlsruhe university", 345-348.
Salor, Özgül / Pellom, Bryan / Çiloglu, Tolga / Hacioglu, Kadri / Demirekler, Mübeccel: "On developing new text and audio corpora and speech recognition tools for the turkish language", 349-352.
Martell, Craig: "FORM: an extensible, kinematically-based gesture annotation scheme", 353-356.
Hosom, John-Paul: "Automatic phoneme alignment based on acoustic-phonetic modeling", 357-360.
Gupta, Narendra K. / Bangalore, Srinivas / Rahim, Mazin: "Extracting clauses for spoken language understanding in conversational systems", 361-364.
Lefèvre, F. / Bonneau-Maynard, H.: "Issues in the development of a stochastic speech understanding system", 365-368.
Pfitzinger, Hartmut R.: "10 years of phondat-II: a reassessment", 369-372.
Kumar, Shankar / Byrne, William: "Risk based lattice cutting for segmental minimum Bayes-risk decoding", 373-376.
Wendt, Sascha / Fink, Gernot A. / Kummert, Franz: "Dynamic search-space pruning for time-constrained speech recognition", 377-380.
Lee, Raymond H. / Choi, Eric H. C.: "A Gaussian selection method for multi-mixture HMM based continuous speech recognition", 381-384.
Dong, Rong / Zhu, Jie: "On use of duration modeling for continuous digits speech recognition", 385-388.
Zweig, Geoffrey / Saon, George / Yvon, F.: "Arc minimization in finite state decoding graphs with cross-word acoustic context", 389-392.
Zheng, Jing / Franco, Horacio: "Fast hierarchical grammar optimization algorithm toward time and space efficiency", 393-396.
Abdou, Sherif / Scordilis, Michael: "Dynamic tuning of language model score in speech recognition using a confidence measure", 397-400.
Zhang, Xiao / Zhao, Yunxin: "Minimum perfect hashing for fast n-gram language model lookup", 401-404.
Li, Xiang / Singh, Rita / Stern, Richard M.: "Combining search spaces of heterogeneous recognizers for improved speech recogniton", 405-408.
Pellant, Karel / Mejzlík, Jan / Prikryl, Karel / Skvor, Zdenek: "Transmission characteristics of outer ear canal", 409-412.
Kates, James M.: "Hearing-aid benefits and limitations: predictions from a cochlear model", 413-416.
Nelson, Peggy B. / DiGiovanni, Jeffrey J. / Schlauch, Robert S.: "A psychoacoustic basis for spectral sharpening", 417-420.
Huettel, Lisa G. / Collins, Leslie M.: "Model-based predictions of intensity discrimination for normal- and impaired-hearing listeners", 421-424.
Assmann, Peter F. / Nearey, Terrance M. / Scott, Jack M.: "Modeling the perception of frequency-shifted vowels", 425-428.
Mackersie, Carol L.: "The relationship between pure-tone sequential stream segregation and perceptual separation of male and female talkers by listeners with hearing loss", 429-432.
Johansson, Mathias / Blomberg, Mats / Elenius, Kjell / Hoffsten, Lars-Erik / Torberger, Anders: "A phoneme recognizer for the hearing impaired", 433-436.
Fischer, V. / Janke, E. / Kunzmann, S.: "Likelihood combination and recognition output voting for the decoding of non-native speech with multilingual HMMs", 489-492.
Angkititrakul, Pongtep / Hansen, John H. L.: "Stochastic trajectory model analysis for accent classification", 493-496.
Tian, Jilei / Häkkinen, Juha / Viikki, Olli: "Multilingual pronunciation modeling for improving multilingual speech recognition", 497-500.
Tian, Jilei / Häkkinen, Juha / Riis, Søren / Jensen, Kåre Jean: "On text-based language identification for multilingual speech recognition systems", 501-504.
Ma1, Bin / Guan, Cuntai / Li, Haizhou / Lee, Chin-Hui: "Multilingual speech recognition with language identification", 505-508.
Chengalvarayan, Rathi: "Robust HMM training for unified dutch and German speech recognition", 509-512.
Khudanpur, Sanjeev / Kim, Woosung: "Using cross-language cues for story-specific language modeling", 513-516.
Zhao, Bing / Vogel, Stephan: "Full-text story alignment models for Chinese-English bilingual news corpora", 517-520.
Sooful, Jayren J. / Botha, Elizabeth C.: "Comparison of acoustic distance measures for automatic cross-language phoneme mapping", 521-524.
He, Xiaodong / Zhao, Yunxin: "Maximum expected likelihood based model selection and adaptation for nonnative English speakers", 525-528.
Minematsu, Nobuaki / Kurata, Gakuto / Hirose, Keikichi: "Integration of MLLR adaptation with pronunciation proficiency adaptation for non-native speech recognition", 529-532.
Nguyen, Thu / Ingram, John: "Native and vietnamese production of compound and phrasal stress patterns", 533-536.
Caspers, Johanneke: "On the function of the late rise and the early fall in dutch dialogue: a perception experiment", 537-540.
Esposito, Anna / Duncan, Susan / Quek, Francis: "Holds as gestural correlates to empty and filled speech pauses", 541-544.
Itoh, Toshihiko / Kai, Atsuhiko / Konishi, Tatsuhiro / Itoh, Yukihiro: "Linguistic and acoustic changes of user²s utterances caused by different dialogue situations", 545-548.
Ward, Nigel / Nakagawa, Satoshi: "Automatic user-adaptive speaking rate selection for information delivery", 549-552.
Skantze, Gabriel: "Coordination of referring expressions in multimodal human-computer dialogue", 553-556.
Cerrato, Loredana: "A comparison between feedback strategies in human-to-human and human-machine communication", 557-560.
Darves, Courtney / Oviatt, Sharon: "Adaptation of users² spoken dialogue patterns in a conversational interface", 561-564.
Rosenberg, Aaron E. / Gorin, Allen / Liu, Zhu / Parthasarathy, S.: "Unsupervised speaker segmentation of telephone conversations", 565-568.
Sivakumaran, P. / Ariyaeeinia, A.M. / Fortuna, J.: "An effective unsupervised scheme for multiple-speaker-change detection", 569-572.
Ajmera, J. / Bourlard, Hervé / Lapidot, I. / McCowan, Iain A.: "Unknown-multiple speaker clustering using HMM", 573-576.
Meignier, Sylvain / Bonastre, Jean-François / Magrin-Chagnolleau, Ivan: "Speaker utterances tying among speaker segmented audio documents using hierarchical classification: towards speaker indexing of audio databases", 577-580.
Mariéthoz, Johnny / Bengio, Samy: "A comparative study of adaptation methods for speaker verification", 581-584.
Farrell, Kevin R.: "Speaker verification with data fusion and model adaptation", 585-588.
Mirghafori, Nikki / Heck, Larry P.: "An adaptive speaker verification system with speaker dependent a priori decision thresholds", 589-592.
Roy, Deb / Gorniak, Peter / Mukherjee, Niloy / Juster, Josh: "A trainable spoken language understanding system for visual object selection", 593-596.
Béchet, F. / Gorin, Allen / Wright, Jerry / Tur, D. Hakkani: "Named entity extraction from spontaneous speech in how may i help you?", 597-600.
Bousquet-Vernhettes, Caroline / Vigouroux, Nadine: "Recognition error processing for speech understanding", 601-604.
Pargellis, Andrew / Fosler-Lussier, Eric / Tsai, Augustine: "Using part-of-speech tags, context thresholding, and trigram contexts to improve the auto-induction of semantic classes", 605-608.
Wang1, Ye-Yi / Acero, Alex / Chelba, Ciprian / Frey, Brendan / Wong, Leon: "Combination of statistical and rule-based approaches for spoken language understanding", 609-612.
Xie, Guodong / Zong, Chengqing / Xu, Bo: "Chinese spoken language analyzing based on combination of statistical and rule methods", 613-616.
Pfannerer, Norbert: "A maximum entropy semantic parser using word classes", 617-620.
Gurijala, A. / R. Deller Jr., J. / Seadle, M. S. / Hansen, John H. L.: "Speech watermarking through parametric modeling", 621-624.
Hong, Kai Sze / Salleh, Sh-Hussain: "An education software in teaching automatic speech recognition (ASR)", 625-628.
Xiao, Benfang / Girand, Cynthia / Oviatt, Sharon: "Multimodal integration patterns in children", 629-632.
Scharenborg, Odette / Boves, Lou / Veth, Johan de: "ASR in a human word recognition model: generating phonemic input for shortlist", 633-636.
Wu, Chung-Hsien / Chiu, Yu-Hsien / Cheng, Kung-Wei: "Sign language translation using an error tolerant retrieval algorithm", 637-640.
Turk, Oytun / Sayli, Omer / Dutagaci, Helin / Arslan, Levent M.: "A sound source classification system based on subband processing", 641-644.
Zhang, Ying / Zhao, Bing / Yang, Jie / Waibel, Alex: "Automatic sign translation", 645-648.
Wenndt, Stanley J. / Cupples, Edward J. / Floyd, Richard M.: "A study on the classification of whispered and normally phonated speech", 649-652.
Tatara, Kiyoshi / Ito, Taisuke / Zolfaghari, Parham / Takeda, Kazuya / Itakura, Fumitada: "Experiments on recognition of lavalier microphone speech and whispered speech in real world environments", 653-656.
Iwaki, Mamoru / Seki, Hiromi: "An effect of amplitude modulation on perceptual segregation of tone sequences", 657-660.
Sanders, Eric / Ruiter, Marina / Beijer, Lilian / Strik, Helmer: "Automatic recognition of dutch dysarthric speech: a pilot study", 661-664.
Engwall, Olov: "Evaluation of a system for concatenative articulatory visual speech synthesis", 665-668.
Sato, Marc / Schwartz, Jean-Luc / Cathiard, Marie-Agnès / Abry, Christian / Loevenbruck, Hélène: "Intrasyllabic articulatory control constraints in verbal working memory", 669-672.
Campbell, Nick: "Towards a grammar of spoken language: incorporating paralinguistic information", 673-676.
Li, Qun / Russell, Martin J.: "An analysis of the causes of increased error rates in children²s speech recognition", 2337-2340.
Öster, Anne-Marie: "A new computer-based analytical speech perception test for prelingually deaf children and children with speech disorders", 2341-2344.
Fell, Harriet J. / MacAuslan, Joel / Ferrier, Linda J. / Worst, Susan G. / Chenausky, Karen: "Vocalization age as a clinical tool", 2345-2348.
Cosi, Piero / Cohen, Michael M. / Massaro, Dominic W.: "Baldini: baldi speaks italian!", 2349-2352.
Cavé, Christian / Guaïtella, Isabelle / Santi;, Serge: "Eyebrow movements and voice variations in dialogue situations: an experimental investigation", 2353-2356.
Córdoba, R. / Macías-Guarasa, J. / Ferreiros, J. / Montero, J. M. / Pardo, José M.: "State clustering improvements for continuous HMMs in a Spanish large vocabulary recognition system", 677-680.
Rotovnik, Tomaz / Maucec, Mirjam Sepesy / Horvat, Bogomir / Kacic, Zdravko: "A comparison of HTK, ISIP and julius in slovenian large vocabulary continuous speech recognition", 681-684.
Jia, Lei / Xu, Bo: "Parametric trajectory segment model for LVCSR", 685-688.
Diéguez-Tirado, F. Javier / Cardenal-López, Antonio: "Efficient precalculation of LM contexts for large vocabulary continuous speech recognition", 689-692.
Chengalvarayan, Rathi: "Integrating multiple pronunciations during MCE-based acoustic model training for large vocabulary speech recognition", 693-696.
Laureys, Tom / Vandeghinste, Vincent / Duchateau, Jacques: "A hybrid approach to compounds in LVCSR", 697-700.
Utsuro, Takehito / Harada, Tetsuji / Nishizaki, Hiromitsu / Nakagawa, Seiichi: "A confidence measure based on agreement among multiple LVCSR models - correlation between pair of acoustic models and confidence", 701-704.
Nouza, Jan / Drabkova, Jindra: "Combining lexical and morphological knowledge in language model for inflectional (czech) language", 705-708.
Nguyen, Long / Guo, Xuefeng / Makhoul, John: "Modeling frequent allophones in Japanese speech recognition", 709-712.
Chen, Feili / Zhu, Jie / Song, Wentao: "The structure and its implementation of hidden dynamic HMM for Mandarin speech recognition", 713-716.
Shinozaki, Takahiro / Furui, Sadaoki: "A new lexicon optimization method for LVCSR based on linguistic and acoustic characteristics of words", 717-720.
Langlois, David / Smaïli, Kamel / Haton, Jean-Paul: "Retrieving phrases by selecting the history: application to automatic speech recognition", 721-724.
Ahn, Dong-Hoon / Chung, Minhwa: "Compact subnetwork-based large vocabulary continuous speech recognition", 725-728.
Dutagaci, Helin / Arslan, Levent M.: "A comparison of four language models for large vocabulary turkish speech recognition", 729-732.
Hincks, Rebecca: "Speech recognition for language teaching and evaluating: a study of existing commercial products", 733-736.
Raux, Antoine / Kawahara, Tatsuya: "Automatic intelligibility assessment and diagnosis of critical pronunciation errors for computer-assisted pronunciation learning", 737-740.
Hirata, Yukari: "Effects of production training with visual feedback on the acquisition of Japanese pitch and durational contrasts", 741-744.
Minematsu, Nobuaki / Kobashikawa, Satoshi / Hirose, Keikichi / Erickson, Donna: "Acoustic modeling of sentence stress using differential features between syllables for English rhythm learning system development", 745-748.
Imoto, Kazunori / Tsubota, Yasushi / Raux, Antoine / Kawahara, Tatsuya / Dantsuji, Masatake: "Modeling and automatic detection of English sentence stress for computer-assisted English prosody learning system", 749-752.
Tsubota, Yasushi / Kawahara, Tatsuya / Dantsuji, Masatake: "Recognition and verification of English by Japanese students for computer-assisted language learning system", 1205-1208.
Neri, Ambra / Cucchiarini, Catia / Strik, Helmer: "Feedback in computer assisted pronunciation training: technology push or demand pull?", 1209-1212.
Minematsu, Nobuaki / Kurata, Gakuto / Hirose, Keikichi: "Corpus-based analysis of English spoken by Japanese students in view of the entire phonemic system of English", 1213-1216.
Hardison, Debra M.: "Computer-assisted second-language speech learning: generalization of prosody-focused training", 1217-1220.
Mostow, Jack / Beck, Joseph / Winter, S. Vanessa / Wang, Shaojun / Tobin, Brian: "Predicting oral reading miscues", 1221-1224.
Kim, Chanwoo / Sung, Wonyong: "Implementation of an intonational quality assessment system", 1225-1228.
Ariki, Yasuo / Ogata, Jun: "English call system with functions of speech segmentation and pronunciation evaluation using speech recognition technology", 1229-1232.
Mixdorff, Hansjörg / Luksaneeyanawin, Sudaporn / Fujisaki, Hiroya / Charnvivit, Patavee: "Perception of tone and vowel quantity in Thai", 753-756.
Kinoshita, Keisuke / Behne, Dawn M. / Arai, Takayuki: "Duration and F0 as perceptual cues to Japanese vowel quantity", 757-760.
Muto, Makiko / Kato, Hiroaki / Tsuzaki, Minoru / Sagisaka, Yoshinori: "Effects of intra-phrase position on acceptability of changes in segmental duration in sentence speech", 761-764.
Barac-Cikoja, Dragana / Revoile, Sally: "Perception of prosodic phrasing by hearing-impaired listeners", 765-768.
Aasland, Wendi A. / Baum, Shari R.: "Processing of temporal cues marking phrasal boundaries in individuals with brain damage", 769-772.
Herbordt, W. / Ying, J. / Buchner, H. / Kellermann, W.: "A real-time acoustic human-machine front-end for multimedia applications integrating robust adaptive beamforming and stereophonic acoustic echo cancellation", 773-776.
Lu, Ching-Ta / Wang, Hsiao-Chuan: "Enhancement of single channel speech using perception-based wavelet transform", 777-780.
Lin, L. / Holmes, W. H. / Ambikairajah, E.: "Speech enhancement based on a perceptual modification of wiener filtering", 781-784.
Attias, Hagai / Deng, Li: "A new approach to speech enhancement by a microphone array using EM and mixture models", 785-788.
Kim, Sang G. / Yoo, Chang D.: "Acoustic echo cancellation based on m-channel IIR cosine-modulated filter bank", 789-792.
Saruwatari, Hiroshi / Sawai, Katsuyuki / Lee, Akinobu / Shikano, Kiyohiro / Kaminuma, Atsunobu / Sakata, Masao: "Speech enhancement in car environment using blind source separation", 1781-1784.
Potamitis, I. / Fakotakis, Nikos / Kokkinakis, George: "Speech enhancement based on combining perceptual enhancement and short-time spectral attenuation", 1785-1788.
Nishiura, Takanobu / Nakamura, Satoshi / Okada, Yuka / Yamada, Takeshi / Shikano, Kiyohiro: "Suitable design of adaptive beamformer based on average speech spectrum for noisy speech recognition", 1789-1792.
Tam, King / Sheikhzadeh, Hamid / Schneider, Todd: "Highly oversampled subband adaptive filters for noise cancellation on a low-resource DSP system", 1793-1796.
Hu, Yi / Loizou, Philipos C.: "A perceptually motivated subspace approach for speech enhancement", 1797-1800.
Ju, Gwo-hwa / Lee, Lin-shan: "Speech enhancement based on generalized singular value decomposition approach", 1801-1804.
Kim, Jong Uk / Yoo, Chang D.: "Subspace speech enhancement using subband whitening filter", 1805-1808.
Chang, Sungwook / Jung, Sungil / Kwon, Y. / Yang, Sung-il: "Speech enhancement using wavelet packet transform", 1809-1812.
Deng, Li / Droppo, Jasha / Acero, Alex: "Sequential MAP noise estimation and a phase-sensitive model of the acoustic environment", 1813-1816.
Nakadai, Kazuhiro / Okuno, Hiroshi G. / Kitano, Hiroaki: "Auditory fovea based speech enhancement and its application to human-robot dialog system", 1817-1820.
Visser, Erik / Otsuka, Manabu / Lee, Te-Won: "A spatio-temporal speech enhancement scheme for robust speech recognition", 1821-1824.
Berthommier, Frédéric / Choi, Seungjin: "Comparative evaluation of CASA and BSS models for subband cocktail-party speech separation", 1825-1828.
Kim, Hyoung-Gook / Ruwisch, Dietmar: "Speech enhancement in non-stationary noise environments", 1829-1832.
Mizumachi, Mitsunori / Nakamura, Satoshi: "The 2ch hybrid subtractive beamformer applied to line sound sources", 1833-1836.
Yapanel, Umit / Zhang, Xianxian / Hansen, John H. L.: "High performance digit recognition in real car environments", 793-796.
Shinde, Tetsuya / Takeda, Kazuya / Itakura, Fumitada: "Multiple regression of log-spectra for in-car speech recognition", 797-800.
Gong, Yifan / Netsch, Lorin: "Experiments on speaker-independent voice command recognition using in-vehicle hands free speech", 801-804.
Kadambe, Shubha: "Application of over-complete blind source separation for robust automatic speech recognition", 805-808.
Beaufays, Françoise / Boies, Daniel / Weintraub, Mitch: "Porting channel robustness across languages", 809-812.
Takahashi, Yasuhiro / Dohsaka, Kohji / Aikawa, Kiyoaki: "An efficient dialogue control method using decision tree-based estimation of out-of-vocabulary word attributes", 813-816.
Bellegarda, Jerome R.: "Semantic inference: a data-driven solution for NL interaction", 817-820.
Wright, Jerry / Abella, Alicia / Gorin, Allen: "Unified task knowledge for spoken language understanding and dialog management", 821-824.
Lee, Yun-Tien / Wu, Cheng-Huang / Lee, Yumin / Lee, Lin-shan: "Distributed Chinese keyword spotting and verification for spoken dialogues under wireless environment", 825-828.
Higashinaka, Ryuichiro / Miyazaki, Noboru / Nakano, Mikio / Aikawa, Kiyoaki: "A method for evaluating incremental utterance understanding in spoken dialogue systems", 829-832.
Kakutani, Naoko / Kitaoka, Norihide / Nakagawa, Seiichi: "Detection and recognition of repaired speech on misrecognized utterances for speech input of car navigation system", 833-836.
Eklund, Robert: "Ingressive speech as an indication that humans are talking to humans (and not to machines)", 837-840.
Soltau, Hagen / Metze, Florian / Waibel, Alex: "Compensating for hyperarticulation by modeling articulatory properties", 841-844.
Goubanova, Olga V.: "Forms of introduction in map task dialogues: case of L2 Russian speakers", 845-848.
Veilleux, Nanette M.: "Bridges: regions between discourse segments", 849-852.
Guillevic, Didier / Gandrabur, Simona / Normandin, Yves: "Robust semantic confidence scoring", 853-856.
Müller, Ludek / Bartos, Tomás: "Statistically based approach to rejection of incorrectly recognized words", 857-860.
Sato, Ryo / Higashinaka, Ryuichiro / Tamoto, Masafumi / Nakano, Mikio / Aikawa, Kiyoaki: "Learning decision trees to determine turn-taking by spoken dialogue systems", 861-864.
Hamimed, H. / Damnati, G.: "Integration of phonetic length properties in the acoustic models of false starts and out-of-vocabulary words", 865-868.
Zhao, Yibao / Zhou, Guojun: "N-word-sequence frequency noise mitigation for SLM based on binomial distribution", 869-872.
Lee, Chul Min / Narayanan, Shrikanth S. / Pieraccini, Roberto: "Combining acoustic and language information for emotion recognition", 873-876.
Hacioglu, Kadri / Ward, Wayne: "A figure of merit for the analysis of spoken dialog systems", 877-880.
Akiba, Tomoyosi / Itou, Katunobu / Fujii, Atsushi / Ishikawa, Tetsuya: "Selective back-off smoothing for incorporating grammatical constraints into the n-gram language model", 881-884.
Zitouni, Imed / Siohan, Olivier / Kuo, Hong-Kwang Jeff / Lee, Chin-Hui: "Backoff hierarchical class n-gram language modelling for automatic speech recognition systems", 885-888.
Picard, Francis / Boucher, Dominique / Lapalme, Guy: "Constructing small language models from grammars", 889-892.
Zhang, Rong / Rudnicky, Alexander I.: "Improve latent semantic analysis based language model by integrating multiple level knowledge", 893-896.
Sicilia-Garcia, Elvira I. / Ming, Ji / Smith, F. Jack: "Individual word language models and the frequency approach", 897-900.
Stolcke, Andreas: "SRILM - an extensible language modeling toolkit", 901-904.
Whittaker, E. W. D. / Klakow, D.: "Efficient construction of long-range language models using log-linear interpolation", 905-908.
Corazza, Anna: "Integration of two stochastic context-free grammars", 909-912.
Rayner, Manny / Hockey, Beth Ann / Dowding, John: "Grammar specialisation meets language modelling", 913-916.
Huang, Jing / Zweig, Geoffrey: "Maximum entropy model for punctuation annotation from speech", 917-920.
Mori, Shinsuke: "An automatic sentence boundary detector based on a structured language model", 921-924.
Wu1, Genqing / Zheng, Fang / Wu1, Wenhu / Xu, Mingxing / Jin, Ling: "Improved katz smoothing for language modeling in speech recogniton", 925-928.
Mori, Renato De / Estève, Yannick / Raymond, Christian: "On the use of structures in language models for dialogue", 929-932.
Erdogan, Hakan / Sarikaya, Ruhi / Gao, Yuqing / Picheny, Michael: "Semantic structured language models", 933-936.
Hirose, Keikichi / Minematsu, Nobuaki / Terao, Makoto: "Statistical language modeling with prosodic boundaries and its use for continuous speech recognition", 937-940.
Iwano, Koji / Seki, Takahiro / Furui, Sadaoki: "Noise robust speech recognition using F0 contour extracted by hough transform", 941-944.
Almasganj, Farshad / Dehnavi, Farhad D. / Bijankhan, Mahmood: "Sharing relative stress of cross-word syllables and lexical stress to spontaneous speech recognition", 945-948.
Baron, Don / Shriberg, Elizabeth / Stolcke, Andreas: "Automatic punctuation and disfluency detection in multi-party meetings using prosodic and lexical cues", 949-952.
Sun, Xuejing: "Pitch accent prediction using ensemble machine learning", 953-956.
Escudero-Mancebo, D. / González-Ferreras, C. / Cardeñoso-Payo, V.: "Quantitative evaluation of relevant prosodic factors for text-to-speech synthesis in Spanish", 1165-1168.
Thubthong, Nuttakorn / Kijsirikul, Boonserm / Luksaneeyanawin, Sudaporn: "Tone recognition in Thai continuous speech based on coarticulaion, intonation and stress effects", 1169-1172.
Takagi, Kazuyuki / Kubota, Hajime / Ozeki, Kazuhiko: "Combination of pause and F0 information in dependency analysis of Japanese sentences", 1173-1176.
Horiuchi, Yasuo / Ohsuga, Tomoko / Ichikawa, Akira: "Estimating syntactic structure from F0 contour and pause duration in Japanese speech", 1177-1180.
Yamashita, Yoichi / Inoue, Akira: "Extraction of important sentences using F0 information for speech summarization", 1181-1184.
Kitamura, Tatsuya / Itoh, Kayo / Itoh, Toshihiko / Kitazawa, Shigeyoshi: "Influence of prosody, context, and word order in the identification of focus in Japanese dialogue", 1185-1188.
Kai, Atsuhiko / Nonomura, Yukari / Itoh, Toshihiko / Konishi, Tatsuhiro / Itoh, Yukihiro: "Influence of different dialogue situations on user²s behavior in spoken corrections", 1189-1192.
Yang, Li-chiung: "Interpreting meaning from context: modeling the prosody of discourse markers in speech", 1193-1196.
Bartkova, Katarina / Gac, David Le / Charlet, Delphine / Jouvet, Denis: "Prosodic parameter for speaker identification", 1197-1200.
Shigeyoshi, Kitazawa / Toshihiko, Itoh / Tatsuya, Kitamura: "Juncture segmentation of Japanese prosodic unit based on the spectrographic features", 1201-1204.
Svec, Jan G. / Sram, Frantisek: "Kymographic imaging of the vocal fold oscillations", 957-960.
Mády, K. / Sader, R. / Zimmermann, A. / Hoole, P. / Beer, A. / Zeilhofer, H.-F. / Hannig, Ch.: "Assessment of consonant articulation in glossectomee speech by dynamic MRI", 961-964.
Wrench, Alan / Gibbon, Fiona / McNeill, Alison M. / Wood, Sara: "An EPG therapy protocol for remediation and assessment of articulation disorders", 965-968.
Patel, Rupal: "How speakers with and without speech impairment mark the question statement contrast", 969-972.
Zahorian, Stephen A. / Zimmer, A. Matthew / Meng, Fansheng: "Vowel classification for computer-based visual feedback for speech training for the hearing impaired", 973-976.
Alku, Paavo / Bäckström, Tom: "All-pole modeling of wide-band speech using weighted sum of the LSP polynomials", 977-980.
Schoentgen, Jean: "Analysis and synthesis of the phonatory excitation signal by means of a pair of polynomial shaping functions", 981-984.
Vintsiuk, Taras K.: "Optimal speech signal partition into one-quasiperiodical segments", 985-988.
Rufiner, Hugo L. / Rocha, Luis F. / Close, John Goddard: "Sparse and independent representations of speech signals based on parametric models", 989-992.
Funaki, Keiichi: "Improvement of the ELS-based time-varying complex speech analysis", 993-996.
Chin, K. K. / Woodland, P. C.: "Maximum mutual information training of hidden Markov models with vector linear predictors", 997-1000.
Hamaker, J. E. / Picone, J. / Ganapathiraju, A.: "A sparse modeling approach to speech recognition based on relevance vector machines", 1001-1004.
Chelba, Ciprian / Morton, Rachel: "Mutual information phone clustering for decision tree induction", 1005-1008.
Horn, Kevin S. Van: "Rethinking derived acoustic features in speech recognition", 1009-1012.
Markov, Konstantin / Nakamura, Satoshi: "Modeling HMM state distributions with Bayesian networks", 1013-1016.
Tsakalidis, Stavros / Doumpiotis, Vlasios / Byrne, William: "Discriminative linear transforms for feature normalization and speaker adaptation in HMM estimation", 2585-2588.
Okuda, Kozo / Kawahara, Tatsuya / Nakamura, Satoshi: "Speaking rate compensation based on likelihood criterion in acoustic model training and decoding", 2589-2592.
Bacchiani, Michiel: "Combining maximum likelihood and maximum a posteriori estimation for detailed acoustic modeling of context dependency", 2593-2596.
Huang, Jing / Goel, Vaibhava / Gopinath, Ramesh / Kingsbury, Brian / Olsen, Peder / Visweswariah, Karthik: "Large vocabulary conversational speech recognition with the extended maximum likelihood linear transformation (EMLLT) model", 2597-2600.
Zhang, Jin-Song / Nakamura, Satoshi: "Modeling varying pauses to develop robust acoustic models for recognizing noisy conversational speech", 2601-2604.
Song, Hwa Jeon / Kim, Hyung Soon: "Improving phone-level discrimination in LDA with subphone-level classes", 2625-2628.
Ou, Zhijian / Wang, Zuoying: "A combined model of statics-dynamics of speech optimized using maximum mutual information", 2629-2632.
Takahashi, Nobutoshi / Nakagawa, Seiichi: "Syllable recognition using syllable-segment statistics and syllable-based HMM", 2633-2636.
Thirion, J. W. F. / Botha, Elizabeth C.: "Recurrent neural network-enhanced HMM speech recognition systems", 2637-2640.
Yun, Young-Sun: "Sharing trend information of trajectory in segmental-feature HMM", 2641-2644.
Salomon, Jesper / King, Simon / Salomon, Jesper: "Framewise phone classification using support vector machines", 2645-2648.
Stewart, Darryl / Ji, Ming / Hanna, Philip / Smith, F. Jack: "A state-tying approach to building syllable HMMs", 2649-2652.
Lee, Weifeng / Sekhar, C. Chandra / Takeda, Kazuya / Itakura, Fumitada: "Recognition of continuous speech segments of monophone units using support vector machines", 2653-2656.
Park, Junho / Ko, Hanseok: "Construction of decision tree from data driven clustering", 2657-2660.
Lee, Akinobu / Mera, Yuuichiro / Saruwatari, Hiroshi / Shikano, Kiyohiro: "Selective multi-path acoustic model based on database likelihoods", 2661-2664.
Stephenson, Todd A. / Magimai-Doss, Mathew / Bourlard, Hervé: "Auxiliary variables in conditional Gaussian mixtures for automatic speech recognition", 2665-2668.
Watanabe, Shinji / Minami, Yasuhiro / Nakamura, Atsushi / Ueda, Naonori: "Constructing shared-state hidden Markov models based on a Bayesian approach", 2669-2672.
Ogawa, Tetsuji / Kobayashi, Tetsunori: "Generalization of state-observation-dependency in partly hidden Markov models", 2673-2676.
Esling, John H.: "Laryngoscopic analysis of tibetan chanting modes and their relationship to register in sino-tibetan", 1081-1084.
Murray, Kathleen / Simonsen, Betina: "A corpus-based study of danish laryngealization", 1085-1088.
Warner, Natasha / Jongman, Allard / Mücke, Doris: "Variability in direction of dorsal movement during production of /l/", 1089-1092.
Xu, Yi / Liu, Fang: "Segmentation of glides with tonal alignment as reference", 1093-1096.
Maddieson, Ian / Larson, Julie: "Variability in the production of glottalized sonorants: data from yapese", 1097-1100.
Tuan, Vu Ngoc / d’Alessandro, Christophe / Rosset, Sophie: "A phonetic study of vietnamese tones: acoustic and electroglottographic measurements", 1101-1104.
Chung, Hyunsong: "Segment duration in spoken korean", 1105-1108.
Zvonik, Elena / Cummins, Fred: "Pause duration and variability in read texts", 1109-1112.
Pfitzinger, Hartmut R.: "Intrinsic phone durations are speaker-specific", 1113-1116.
Tronnier, Mechtild: "Preaspirated stops in southern Swedish", 1117-1120.
Warner, Natasha / Weber, Andrea: "Stop epenthesis at syllable boundaries", 1121-1124.
Raymond, William D. / Pitt, Mark / Johnson, Keith / Hume, Elizabeth / Makashay, Matthew / Dautricourt, Robin / Hilts, Craig: "An analysis of transcription consistency in spontaneous speech from the buckeye corpus", 1125-1128.
Aoyagi, Makiko: "Contextual effects on voicing judgment of stop consonants in Japanese", 1129-1132.
Joto, Akiyo / Imaishi, Motohisa / Nagase, Yoshiki / Funatsu, Seiya: "Discrimination of English vowels in consonantal contexts by native speakers of Japanese and its relations to dynamic information of formants", 1133-1136.
Tur, Gokhan / Wright, Jerry / Gorin, Allen / Riccardi, Giuseppe / Hakkani-Tür, Dilek: "Improving spoken language understanding using word confusion networks", 1137-1140.
Li, Li / Chou, Wu: "Improving latent semantic indexing based classifier with information gain", 1141-1144.
Kuo, Hong-Kwang Jeff / Lee, Chin-Hui / Zitouni, Imed / Fosler-Lussier, Eric / Ammicht, Egbert: "Discriminative training for call classification and routing", 1145-1148.
Cox, Stephen: "Speech and language processing for a constrained speech translation system", 1149-1152.
Chotimongkol, Ananlada / Rudnicky, Alexander I.: "Automatic concept identification in goal-oriented conversations", 1153-1156.
Levit, Michael / Nöth, Elmar / Gorin, Allen: "Using EM-trained string-edit distances for approximate matching of acoustic morphemes", 1157-1160.
Natarajan, Premkumar / Prasad, Rohit / Suhm, Bernhard / McCarthy, Daniel: "Speech-enabled natural language call routing: BBN call director", 1161-1164.
Gao, Sheng / Zhang, Jin-Song / Nakamura, Satoshi / Lee, Chin-Hui / Chua, Tat-seng: "Weighted graph based decision tree optimization for high accuracy acoustic modeling", 1233-1236.
Zhang, Li / Edmondson, William H.: "Speech recognition using syllable patterns", 1237-1240.
Brink, Janus D. / Botha, Elizabeth C.: "A comparison of L1 and african-mother-tongue acoustic models for south african English speech recognition", 1241-1244.
Somervuo, Panu: "Speech modeling using variational Bayesian mixture of Gaussians", 1245-1248.
Chen, Tao / Huang, Chao / Chang, Eric / Wang, Jingchun: "On the use of Gaussian mixture model for speaker variability analysis", 1249-1252.
Jackson, Philip J.B. / Russell, Martin J.: "Models of speech dynamics in a segmental-HMM recognizer using intermediate linear representations", 1253-1256.
Zen, Heiga / Tokuda, Keiichi / Kitamura, Tadashi: "Decision tree distribution tying based on a dimensional split technique", 1257-1260.
Huckvale, Mark: "Speech synthesis, speech simulation and speech science", 1261-1264.
Bulut, Murtaza / Narayanan, Shrikanth S. / Syrdal, Ann K.: "Expressive speech synthesis using a concatenative synthesizer", 1265-1268.
Shichiri, Kengo / Sawabe, Atsushi / Yoshimura, Takayoshi / Tokuda, Keiichi / Masuko, Takashi / Kobayashi, Takao / Kitamura, Tadashi: "Eigenvoices for HMM-based speech synthesis", 1269-1272.
Marsi, Erwin / Busser, Bertjan / Daelemans, Walter / Hoste, Veronique / Reynaert, Martin / Bosch, Antal van den: "Combining information sources for memory-based pitch accent placement", 1273-1276.
Swift, Mary D. / Campana, Ellen / Allen, James F. / Tanenhaus, Michael K.: "Eye-fixation as a measure of real-time processing of synthesized words", 1277-1280.
Stent, Amanda / Walker, Marilyn A. / Whittaker, Steve / Maloor, Preetam: "User-tailored generation for spoken dialogue: an experiment", 1281-1284.
Roy, Deb: "A system that learns to describe objects in visual scenes", 1285-1288.
Mou, Xiaolong / Seneff, Stephanie / Zue, Victor: "Integration of supra-lexical linguistic models with speech recognition using shallow parsing and finite state transducers", 1289-1292.
Shu, Han / Hetherington, I. Lee: "EM training of finite-state transducers and its application to pronunciation modeling", 1293-1296.
Szarvas, Máté / Furui, Sadaoki: "Finite-state transducer based hungarian LVCSR with explicit modeling of phonological changes", 1297-1300.
Caseiro, Diamantino / Trancoso, Isabel: "Using dynamic WFST composition for recognizing broadcast news", 1301-1304.
Dolfing, Hans J. G. A.: "Transducer search space modelings for large-vocabulary speech recognition", 1305-1308.
Kanthak, Stephan / Ney, Hermann / Riley, Michael / Mohri, Mehryar: "A comparison of two LVR search optimization techniques", 1309-1312.
Mohri, Mehryar / Riley, Michael: "An efficient algorithm for the n-best-strings problem", 1313-1316.
Xiang, Bing / Berger, Toby: "Structural Gaussian mixture models for efficient text-independent speaker verification", 1317-1320.
Petry, A. / Barone, Dante A. C.: "Text-dependent speaker verification using lyapunov exponents", 1321-1324.
BenZeghiba, Mohamed F. / Bourlard, Hervé: "User-customized password speaker verification based on HMM/ANN and GMM models", 1325-1328.
Xin, Dong / Wu, Zhaohui / Yang, Yingchun: "Exploiting support vector machines in hidden Markov models for speaker verification", 1329-1332.
Mami, Yassine / Charlet, Delphine: "Speaker identification by location in an optimal space of anchor models", 1333-1336.
Park, Alex / Hazen, Timothy J.: "ASR dependent techniques for speaker identification", 1337-1340.
Ding, Peng / Liu, Yang / Xu, Bo: "Factor analyzed Gaussian mixture models for speaker identification", 1341-1344.
Jin, Qin / Schultz, Tanja / Waibel, Alex: "Phonetic speaker identification", 1345-1348.
Navrátil, Jirí / Ramaswamy, Ganesh N.: "DETAC: a discriminative criterion for speaker verification", 1349-1352.
Liu, Ming / Chang, Eric / Dai, Bei-qian: "Hierarchical Gaussian mixture model for speaker verification", 1353-1356.
Kochanski, Greg / Lopresti, Daniel / Shih, Chilin: "A reverse turing test using speech", 1357-1360.
Ahn, Sungjoo / Kang, Sunmee / Ko, Hanseok: "On effective speaker verification based on subword model", 1361-1364.
Xiang, Bing: "Speaker verification using Gaussian component strings in dynamic trajectory space", 1365-1368.
Heck, Larry P. / Genoud, Dominique: "Combining speaker and speech recognition systems", 1369-1372.
Li, Qi / Jiang, Hui / Zhou, Qiru / Zheng, Jinsong: "Automatic enrollment for speaker authentication", 1373-1376.
Andorno, M. / Laface, P. / Gemello, Roberto: "Experiments in confidence scoring for word and sentence verification", 1377-1380.
Huggins, Mark C. / Grieco, John J.: "Confidence metrics for speaker identification", 1381-1384.
Elenius, Daniel / Blomberg, Mats: "Characteristics of a low reject mode speaker verification system", 1385-1388.
Bernstein, Lynne E. / Burnham, Denis K. / Schwartz, Jean-Luc: "Special session: issues in audiovisual spoken language processing (when, where, and how?)", 1445-1448.
Deligne, Sabine / Potamianos, Gerasimos / Neti, Chalapathy: "Audio-visual speech enhancement with AVCDCN (audio-visual codebook dependent cepstral normalization)", 1449-1452.
Bailly, Gérard: "Audiovisual speech synthesis. from ground truth to models", 1453-1456.
Vatikiotis-Bateson, Eric / Hill, Harold / Kamachi, Miyuki / Lander, Karen / Munhall, Kevin G.: "The stimulus as basis for audiovisual integration", 1457-1460.
Rosenblum, Lawrence D.: "The perceptual basis for audiovisual speech integration", 1461-1464.
Hardison, Debra M.: "Sources of variability in the perceptual training of /r/ and /l/: interaction of adjacent vowel, word position, talkers² visual and acoustic cues", 1465-1468.
Hazan, Valerie / Sennema, Anke / Faulkner, Andrew: "Audiovisual perception in L2 learners", 1685-1688.
Kirk, Karen Iler / Pisoni, David B. / Lachs, Lorin: "Audiovisual integration of speech by children and adults with cochlear implants", 1689-1692.
Sekiyama, Kaoru / Sugita, Yoichi: "Auditory-visual speech perception examined by brain imaging and reaction time", 1693-1696.
Ponton, Curtis W. / Auer, Edward T. / Bernstein, Lynne E.: "Neurocognitive basis for audiovisual speech perception: evidence from event-related potentials", 1697-1700.
Lewkowicz, David J.: "Perception and integration of audiovisual speech in human infants", 1701-1704.
Bailly, Gérard / Badin, Pierre: "Seeing tongue movements from outside", 1913-1916.
Wojdel, Jacek C. / Wiggers, Pascal / Rothkrantz, Leon J.M.: "An audio-visual corpus for multimodal speech recognition in dutch language", 1917-1920.
Wiggers, Pascal / Wojdel, Jacek C. / Rothkrantz, Leon J.M.: "Medium vocabulary continuous audio-visual speech recognition", 1921-1924.
Heckmann, Martin / Kroschel, Kristian / Savariaux, Christophe / Berthommier, Frédéric: "DCT-based video features for audio-visual speech recognition", 1925-1928.
Erdener, V. Dogu / Burnham, Denis K.: "The effect of auditory-visual information and orthographic background in L2 acquisition", 1929-1932.
Krahmer, Emiel / Ruttkay, Zsófia / Swerts, Marc / Wesselink, Wieger: "Perceptual evaluation of audiovisual cues for prominence", 1933-1936.
Schwartz, Jean-Luc / Berthommier, Frédéric / Savariaux, Christophe: "Audio-visual scene analysis: evidence for a "very-early" integration process in audio-visual speech perception", 1937-1940.
Zelezný, Milos / Císar, Petr / Krnoul, Zdenek / Novák, Jan: "Design of an audio-visual speech corpus for the czech audio-visual speech synthesis", 1941-1944.
Attina, Virginie / Beautemps, Denis / Cathiard, Marie-Agnès: "Coordination of hand and orofacial movements for CV sequences in French cued speech", 1945-1948.
Attina, Virginie / Cathiard, Marie-Agnès / Beautemps, Denis: "Controling anticipatory behavior for rounding in French cued speech", 1949-1952.
Sodoyer, David / Girin, Laurent / Jutten, Christian / Schwartz, Jean-Luc: "Audio-visual speech sources separation: a new approach exploiting the audio-visual coherence of speech stimuli", 1953-1956.
House, David: "Intonational and visual cues in the perception of interrogative mode in Swedish", 1957-1960.
Lucey, Simon / Sridharan, Sridha / Chandran, Vinod: "A link between cepstral shrinking and the weighted product rule in audio-visual speech recognition", 1961-1964.
Endo, Taku / Ward, Nigel / Terada, Minoru: "Can confidence scores help users post-editing speech recognizer output?", 1469-1472.
Watanabe, Masatoshi / Sugiyama, Masahide: "Information retrieval based on speech recognition results", 1473-1476.
Lemmelä, Saija-Maaria / Boda, Péter Pál: "Efficient combination of type-in and wizard-of-oz tests in speech interface development process", 1477-1480.
Macherey, Wolfgang / Viechtbauer, Jörg / Ney, Hermann: "Probabilistic retrieval based on document representations", 1481-1484.
Nishimoto, Takuya / Araki, Masahiro / Niimi, Yasuhisa: "Radiodoc: a voice-accessible document system", 1485-1488.
Goto, Masataka / Itou, Katunobu / Hayamizu, Satoru: "Speech completion: on-demand completion assistance using filled pauses for speech input interfaces", 1489-1492.
Wilkie, Jenny / Jack, Mervyn A. / Littlewood, Peter: "Design of system-initiated digressive proposals for automated banking dialogues", 1493-1496.
Toth, Arthur R. / Harris, Thomas K. / Sanders, James / Shriver, Stefanie / Rosenfeld, Roni: "Towards every-citizen²s speech interface: an application generator for speech interfaces to databases", 1497-1500.
Iyer, Rukmini / Ma, Jeffrey / Gish, Herbert / Kimball, Owen: "Training topic classifiers for conversational speech with limited data", 1501-1504.
Nishizaki, Hiromitsu / Nakagawa, Seiichi: "Comparing isolately spoken keywords with spontaneously spoken queries for Japanese spoken document retrieval", 1505-1508.
Lai, Jennifer C. / Lee, Kwan Min: "Choosing speech or touchtone modality for navigation within a telephony natural language system", 1509-1512.
Lo, Wai-Kit / Meng, Helen M. / Ching, P. C.: "Multi-scale and multi-model integration for improved performance in Chinese spoken document retrieval", 1513-1516.
Ogata, Kohichi / Sonoda, Yorinobu: "Development of a GUI-based articulatory speech synthesis system", 1517-1520.
Dang, Jianwu / Honda, Masaaki / Honda, Kiyoshi: "Investigation of coarticulation based on electromagnetic articulographic data", 1521-1524.
Niikawa, Takuya / Ando, Takanori / Matsumura, Masafumi: "Frequency dependence of vocal-tract length", 1525-1528.
Maeda, Shinji / Toda, Martine / Carlen, Andreas J. / Meftahi, Lyes: "Functional modeling of face movements during speech", 1529-1532.
Mochida, Takemi / Honda, Masaaki / Hayashi, Kouki / Kuwae, Toshiharu / Tanahashi, Kunihiro / Nishikawa, Kazufumi / Takanishi, Atsuo: "Control system for talking robot to replicate articulatory movement of natural speech", 1533-1536.
Finan, Donald S. / Smith, Anne / Ho, Michael: "Feed the tiger: a method for evoking reliable jaw stretch reflexes in children", 1537-1540.
Kaburagi, Tokihiko / Wakamiya, Kohei / Honda, Masaaki: "Three-dimensional electromagnetic articulograph based on a nonparametric representation of the magnetic field", 2297-2300.
Laprie, Yves / Ouni, Slim: "Introduction of constraints in an acoustic-to-articulatory inversion method based on a hypercubic articulatory table", 2301-2304.
Hiroya, Sadao / Honda, Masaaki: "Acoustic-to-articulatory inverse mapping using an HMM-based speech production model", 2305-2308.
Hashimoto, Kiyoshi: "Modeling articulatory dynamics in autoregressive linear system", 2309-2312.
Sciamarella, Denisse / d’Alessandro, Christophe: "A study of the two-mass model in terms of acoustic parameters", 2313-2316.
Kolossa, Dorothea / Huo, Qiang: "Using time-stretched pulses for accurate splitting of speech utterances played back in noisy reverberant environments", 1541-1544.
Maekawa, Kikuo / Kikuchi, Hideaki / Igarashi, Yosuke / Venditti, Jennifer: "X-JToBI: an extended j-toBI for spontaneous speech", 1545-1548.
Strik, Helmer / Daelemans, Walter / Binnenpoorte, Diana / Sturm, Janienke / Vriend, F. De / Cucchiarini, Catia: "Dutch HLT resources: from BLARK to priority lists", 1549-1552.
Yang, Fan / Strayer, Susan E. / Heeman, Peter A.: "ACT: a graphical dialogue annotation comparison tool", 1553-1556.
Yu, Ha-Jin / Kim, Jin Suk: "A training prompts generation algorithm for connected spoken word recognition", 1557-1560.
Cornu, Etienne / Sheikhzadeh, Hamid / Brennan, Robert: "A low-resource, miniature implementation of the ETSI distributed speech recognition front-end", 1581-1584.
Astrov, Sergey: "Memory space reduction for hidden Markov models in low-resource speech recognition systems", 1585-1588.
Wang, Xia / Iso-Sipilä, Juha: "Low complexity Mandarin speaker-independent isolated word recognition", 1589-1592.
Kiss, Imre / Vasilache, Marcel: "Low complexity techniques for embedded ASR systems", 1593-1596.
Reinhard, Klaus / Junkawitsch, Jochen / Kießling, Andreas / Dobler, Stefan: "Optimization of hidden Markov models for embedded systems", 1597-1600.
Filali, Karim / Li, Xiao / Bilmes, Jeff A.: "Data-driven vector clustering for low-memory footprint ASR", 1601-1604.
Jiang, Hui / Lee, Chin-Hui: "Utterance verification based on neighborhood information and Bayes factors", 1605-1608.
Lahti, Tommi / Suontausta, Janne: "Vocabulary independent OOV detection using support vector machines", 1609-1612.
Bazzi, Issam / Glass, James: "A multi-class approach for modelling out-of-vocabulary words", 1613-1616.
Duchateau, Jacques / Wambacq, Patrick: "Unconstrained versus constrained acoustic normalisation in confidence scoring", 1617-1620.
Falavigna, Daniele / Gretter, Roberto / Riccardi, Giuseppe: "Acoustic and word lattice based algorithms for confidence scores", 1621-1624.
Wang, Huei-Ming / Lin, Yi-Chung: "Error-tolerant spoken language understanding with confidence measuring", 1625-1628.
Weil, Shawn A.: "Comparing intelligibility of several non-native accent classes in noise", 1629-1632.
Ishizuka, Kentaro / Aikawa, Kiyoaki: "Effect of F0 fluctuation and amplitude modulation of natural vowels on vowel identification in noisy environments", 1633-1636.
Yoneyama, Kiyoko: "Similarities of words in noise in Japanese", 1637-1640.
Brungart, Douglas S. / Kordik, Alexander J. / Das, Koel / Shaw, Arnab K.: "The effects of F0 manipulation on the perceived distance of speech", 1641-1644.
Janse, Esther: "Time-compressing natural and synthetic speech", 1645-1648.
Xue, Jianxia / Takayanagi, Sumiko / Bernstein, Lynne E.: "Accounting for perceptual identification of consonants and vowels through acoustic dissimilarity", 1649-1652.
Wade, Travis / Eakin, Deborah K. / Webb, Russell / Agah, Arvin / Brown, Frank / Jongman, Allard / Gauch, John / Schreiber, Thomas A. / Sereno, Joan: "Modeling recognition of speech sounds with minerva2", 1653-1656.
Kearns, Ruth / Norris, Dennis / Cutler, Anne: "Syllable processing in English", 1657-1660.
Kuijpers, Cecile / Donselaar, Wilma van / Cutler, Anne: "Perceptual effects of assimilation-induced violation of final devoicing in dutch", 1661-1664.
Yip, Michael C.W.: "Access to homophonic meanings during spoken language comprehension: effects of context and neighborhood density", 1665-1668.
Magrin-Chagnolleau, Ivan / Barkat, Melissa / Meunier, Fanny: "Intelligibility of reverse speech in French: a perceptual study", 1669-1672.
Serniclaes, Willy / Carré, René: "Contextual effects in the perception of fricative place of articulation: a rotational hypothesis", 1673-1676.
Sock, Rudolph / Vaxelaire, Béatrice / Hecker, Véronique / Hirsch, Fabrice: "What relationship between protrusion anticipation and auditory perception?", 1677-1680.
Carré, René / Liénard, Jean Sylvain / Marsico, Egidio / Serniclaes, Willy: "On the role of the "schwa" in the perception of plosive consonants", 1681-1684.
Nguyen, Noël / Jankowski, Ludovic / Habib, Michel: "The perception of stop consonant sequences in dyslexic and normal children", 2565-2568.
Otake, Takashi / Iijima, Akemi: "Submoraic awareness by Japanese school children: evidence from a novel game", 2569-2572.
Markham, D. / Hazan, Valerie: "Speaker intelligibility of adults and children", 2573-2576.
Yamashita, Yasuki / Matsumoto, Hiroshi: "Acoustical correlates to SD ratings of speaker characteristics in two speaking styles", 2577-2580.
Ormanci, Eda / Nikbay, U. Hakan / Turk, Oytun / Arslan, Levent M.: "Subjective assessment of frequency bands for perception of speaker identity", 2581-2584.
Stallard, David / Natarajan, Premkumar / Noamany, Mohammed / Schwartz, Richard / Makhoul, John: "Design for a speech-to-speech translator for field use", 1705-1708.
Black, Alan W. / Brown, Ralf D. / Frederking, Robert / Lenzo, Kevin / Moody, John / Rudnicky, Alexander I. / Singh, Rita / Steinbrecher, Eric: "Rapid development of speech-to-speech translation systems", 1709-1712.
Imamura, Kenji / Sumita, Eiichiro: "Bilingual corpus cleaning focusing on translation literality", 1713-1716.
Tanaka, Hideki / Nightingale, Stephen / Kashioka, Hideki / Matsumoto, Kenji / Nishiwaki, Masamchi / Kumano, Tadashi / Maruyama, Takehiko: "Speech to speech translation system for monologues-data driven approach", 1717-1720.
Gispert, Adrià de / Mariño, José B.: "Using x-grams for speech-to-speech translation", 1885-1888.
Watanabe, Taro / Sumita, Eiichiro: "Statistical machine translation decoder based on phrase", 1889-1892.
Sumita, Eiichiro / Akiba, Yasuhiro / Imamura, Kenji: "Reliability measures for translation quality", 1893-1896.
Zhou, Bowen / Gao, Yuqing / Sorensen, Jeffrey / Diao, Zijian / Picheny, Michael: "Statistical natural language generation for speech-to-speech machine translation systems", 1897-1900.
Vogel, Stephan / Tribble, Alicia: "Improving statistical machine translation for a speech-to-speech translation task", 1901-1904.
Rossato, Solange / Blanchon, Hervé / Besacier, Laurent: "Speech-to-speech translation system evaluation: results for French for the NESPOLE! project first showcase", 1905-1908.
Kauers, Manuel / Vogel, Stephan / Fügen, Christian / Waibel, Alex: "Interlingua based statistical machine translation", 1909-1912.
Nishizawa, Nobuyuki / Hirose, Keikichi / Minematsu, Nobuaki: "Separation of voiced source characteristics and vocal tract transfer function characteristics for speech sounds by iterative analysis based on AR-HMM model", 1721-1724.
Narusawa, Shuichi / Minematsu, Nobuaki / Hirose, Keikichi / Fujisaki, Hiroya: "Automatic extraction of model parameters from fundamental frequency contours of English utterances", 1725-1728.
Murakami, Takahiro / Namba, Munehiro / Hoya, Tetsuya / Ishida, Yoshihisa: "Pitch extraction of speech signals using an eigen-based subspace method", 1729-1732.
Nakatani, Tomohiro / Irino, Toshio: "Robust fundamental frequency estimation against background noise and spectral distortion", 1733-1736.
Quatieri, Thomas F.: "2-d processing of speech with application to pitch estimation", 1737-1740.
Saraclar, Murat / Riley, Michael / Bocchieri, Enrico / Goffin, Vincent: "Towards automatic closed captioning : low latency real time broadcast news transcription", 1741-1744.
Prasad, Rohit / Nguyen, Long / Schwartz, Richard / Makhoul, John: "Automatic transcription of courtroom speech", 1745-1748.
Nguyen, Long / Guo, Xuefeng / Schwartz, Richard / Makhoul, John: "Japanese broadcast news transcription", 1749-1752.
Hecht, Robert / Riedler, Jürgen / Backfried, Gerhard: "German broadcast news transcription", 1753-1756.
Imai, Toru / Matsui, Atsushi / Homma, Shinichi / Kobayakawa, Takeshi / Onoe, Kazuo / Sato, Shoei / Ando, Akio: "Speech recognition with a re-speak method for subtitling live broadcasts", 1757-1760.
Takamaru, Keiichi / Hiroshige, Makoto / Araki, Kenji / Tochinai, Koji: "Evaluation of the method to detect Japanese local speech rate deceleration applying the variable threshold with a constant term", 1761-1764.
Kirkham, Sandra P.: "Tempo modulations in English: selected pilot study results", 1765-1768.
Smith, Caroline L.: "Modeling durational variability in reading aloud a connected text", 1769-1772.
Hifny, Yasser / Rashwan, Mohsen: "Duration modeling for arabic text to speech synthesis", 1773-1776.
Jokisch, Oliver / Ding, Hongwei / Kruschke, Hans / Strecha, Guntram: "Learning syllable duration and intonation of Mandarin Chinese", 1777-1780.
Patwardhan, Pushkar / Rao, Preeti: "Controlling perceived degradation in spectrum envelope modeling via predistortion", 1837-1840.
Veprek, Peter / Bradley, Alan B.: "Benefit and cost analysis of using the improved vector quantizer design algorithm for glottal source waveform compression", 1841-1844.
Zhong, Xin / Arrowood, Jon A. / Clements, Mark A.: "Speech coding and transmission for improved automatic recognition", 1845-1848.
Nguyen, Phu Chien / Ochi, Takao / Akagi, Masato: "Coding speech at very low rates using straight and temporal decomposition", 1849-1852.
Nieminen, Toni P.: "Floating-point adaptive multi-rate wideband speech codec", 1853-1856.
Halmi, Omar / Tolba, Hesham / Guerchi, Driss / O’Shaughnessy, Douglas: "On improving the performance of analysis-by-synthesis coding using a multi-magnitude algebraic code-book excitation signal", 1857-1860.
Humphreys, K. / Lawlor, R.: "Improved performance speech codec for mobile communications", 1861-1864.
Yakhnich, Evgeni / Bistritz, Yuval: "Fixed-length segment coding of LSF parameters", 1865-1868.
Parsa, Vijay / Jamieson, Donald G.: "Interaction of voice over internet protocol speech coders and disordered speech samples", 1869-1872.
Kelleher, Holly / Pearce, David / Ealey, Doug / Mauuary, Laurent: "Speech recognition performance comparison between DSR and AMR transcoded speech", 1873-1876.
Hirsch, Hans-Günter: "The influence of speech coding on recognition performance in telecommunication networks", 1877-1880.
Moharir, Gautam / Patwardhan, Pushkar / Rao, Preeti: "Spectral enhancement preprocessing for the HNM coding of noisy speech", 1881-1884.
Brun, Armelle / Smaïli, Kamel / Haton, Jean-Paul: "Contribution to topic identification by using word similarity", 1965-1968.
Zhou, Bowen / Hansen, John H. L.: "Speechfind: an experimental on-line spoken document retrieval system for historical audio archives", 1969-1972.
Suzuki, Yoshimi / Fukumoto, Fumiyo / Sekiguchi, Yoshihiro: "Topic tracking using subject templates", 1973-1976.
Asami, Katsushi / Takezawa, Toshiyuki / Kikui, Genichiro: "Topic detection of an utterance for speech dialogue processing", 1977-1980.
Liu, Daben / Ma, Jeffrey / Xu, Dongxin / Srivastava, Amit / Kubala, Francis: "Real-time rich-content transcription of Chinese broadcast news", 1981-1984.
Wang, Chun-Jen / Chen, Berlin / Lee, Lin-shan: "Improved Chinese spoken document retrieval with hybrid modeling and data-driven indexing features", 1985-1988.
Larson, Martha / Eickeler, Stefan / Paaß, Gerhard / Leopold, Edda / Kindermann, Jörg: "Exploring sub-word features and linear support vector machines for German spoken document classification", 1989-1992.
Wester, Mirjam / Kessens, Judith M. / Strik, Helmer: "Goal-directed ASR in a multimedia indexing and searching environment (MUMIS)", 1993-1996.
Logan, Beth / Thong, J. M. Van: "Confusion-based query expansion for OOV words in spoken document retrieval", 1997-2000.
Wickramaratna, J. T. / Woodland, P. C.: "Cluster identification for speaker-environment tracking", 2001-2004.
Pinquier, Julien / Rouas, Jean-Luc / André-Obrecht, Régine: "Robust speech / music classification in audio documents", 2005-2008.
Karnebäck, Stefan: "Expanded examinations of a low frequency modulation feature for speech/music discrimination", 2009-2012.
Ezzaidi, Hassan / Rouat, Jean: "Speech, music and songs discrimination in the context of handsets variability", 2013-2016.
Scherer, Klaus R. / Grandjean, D. / Johnstone, Tom / Klasmeyer, Gudrun / Bänziger, Thomas: "Acoustic correlates of task load and stress", 2017-2020.
Rahurkar, Mandar A. / Hansen, John H. L. / Meyerhoff, James / Saviolakis, George / Koenig, Michael: "Frequency band analysis for stress detection using a teager energy operator based feature", 2021-2024.
Yuan, Jiahong / Shen, Liqin / Chen, Fangxin: "The acoustic realization of anger, fear, joy and sadness in Chinese", 2025-2028.
Tato, Raquel / Santos, Rocío / Kompe, Ralf / Pardo, J. M.: "Emotional space improves emotion recognition", 2029-2032.
Chuang, Ze-Jing / Wu, Chung-Hsien: "Emotion recognition from textual input using an emotional semantic network", 2033-2036.
Ang, Jeremy / Dhillon, Rajdip / Krupski, Ashley / Shriberg, Elizabeth / Stolcke, Andreas: "Prosody-based automatic detection of annoyance and frustration in human-computer dialog", 2037-2040.
Makarova, Veronika / Petrushin, Valery A.: "RUSLANA: a database of Russian emotional utterances", 2041-2044.
O’Neill, Ian M. / McTear, Michael F.: "A pragmatic confirmation mechanism for an object-based spoken dialogue manager", 2045-2048.
Torge, Sunna / Rapp, Stefan / Kompe, Ralf: "Serving complex user wishes with an enhanced spoken dialogue system", 2049-2052.
Chung, Grace / Seneff, Stephanie: "Integrating speech with keypad input for automatic entry of spelling and pronunciation of new words", 2053-2056.
Campana, Ellen / Brown-Schmidt, Sarah / Tanenhaus, Michael K.: "Reference resolution by human partners in a natural interactive problem-solving task", 2057-2060.
Ferrer, Luciana / Shriberg, Elizabeth / Stolcke, Andreas: "Is the speaker done yet? faster and more accurate end-of-utterance detection using prosody", 2061-2064.
Gorrell, Genevieve / Lewin, Ian / Rayner, Manny: "Adding intelligent help to mixed-initiative spoken dialogue systems", 2065-2068.
Shin, Jongho / Narayanan, Shrikanth S. / Gerber, Laurie / Kazemzadeh, Abe / Byrd, Dani: "Analysis of user behavior under error conditions in spoken dialogs", 2069-2072.
Jiang, Yinglong / Murphy, Peter: "Production based pitch modification of voiced speech", 2073-2076.
Sun, Xuejing: "F0 generation for speech synthesis using a multi-tier approach", 2077-2080.
Strom, Volker: "From text to prosody without toBI", 2081-2084.
Hirose, Keikichi / Eto, Masaya / Minematsu, Nobuaki: "Improved corpus-based synthesis of fundamental frequency contours using generation process model", 2085-2088.
Buhmann, Jeska / Martens, Jean-Pierre / Macken, Lieve / Coile, Bert Van: "Intonation modelling for the synthesis of structured documents", 2089-2092.
Meron, Joram: "Applying fallback to prosodic unit selection from a small imitation database", 2093-2096.
Tao, Jianhua / Cai, Lianhong: "Clustering and feature learning based F0 prediction for Chinese speech synthesis", 2097-2100.
Weber, Katrin / Wet, Febe de / Cranen, Bert / Boves, Lou / Bengio, Samy / Bourlard, Hervé: "Evaluation of formant-like features for ASR", 2101-2104.
Al-Dulaimy, Fadhil H. T. / Wang, Zuoying: "Entropy of energy operator as feature for large vocabulary Mandarin speaker independent speech recognition", 2105-2108.
Zhang, Yiyan / Liu, Wenju / Xu, Bo / Zhang, Huayun: "Improving parametric trajectory modeling by integration of pitch and tone information", 2109-2112.
Tolba, Hesham / Selouani, Sid-Ahmed / O’Shaughnessy, Douglas: "Comparative experiments to evaluate the use of auditory-based acoustic distinctive features and formant cues for automatic speech recognition using a multi-stream paradigm", 2113-2116.
Leung, Ka-Yee / Siu, Manhung: "Speech recognition using combined acoustic and articulatory information with retraining of acoustic model parameters", 2117-2120.
Wilkinson, N. J. / Russell, Martin J.: "Improved phone recognition on TIMIT using formant frequency data and confidence measures", 2121-2124.
Kitaoka, Norihide / Yamada, Daisuke / Nakagawa, Seiichi: "Speaker independent speech recognition using features based on glottal sound source", 2125-2128.
Omar, Mohamed Kamal / Chen, Ken / Hasegawa-Johnson, Mark / Brandman, Yigal: "An evaluation of using mutual information for selection of acoustic-features representation of phonemes for speech recognition", 2129-2132.
Metze, Florian / Waibel, Alex: "A flexible stream architecture for ASR using articulatory features", 2133-2136.
Ljolje, Andrej: "Speech recognition using fundamental frequency and voicing in acoustic modeling", 2137-2140.
Karnjanadecha, Montri / Kimsawad, Patimakorn: "A comparison of front-end analyses for Thai speech recognition", 2141-2144.
Turunen, Jari / Tanttu, Juha T. / Loula, Pekka: "New model for speech residual signal shaping with static nonlinearity", 2145-2148.
Ho, Ching-Hsiang / Rentzos, Dimitrios / Vaseghi, Saeed: "Formant model estimation and transformation for voice morphing", 2149-2152.
Megyesi, Beáta / Gustafson-Capková, Sofia: "Production and perception of pauses and their linguistic context in read and spontaneous speech in Swedish", 2153-2156.
Manfredi, Claudia / Matassini, Lorenzo: "Non-linear techniques for dysphonic voice analysis and correction", 2157-2160.
Sasou, Akira / Tanaka, Kazuyo: "Adaptive estimation of time-varying features from high-pitched speech based on an excitation source HMM", 2161-2164.
Toda, Martine / Maeda, Shinji / Carlen, Andreas J. / Meftahi, Lyes: "Lip gestures in English sibilants: articulatory - acoustic relationship", 2165-2168.
Malayath, Naren / Hermansky, Hynek: "Bark resolution from speech data", 2169-2172.
Selouani, Sid-Ahmed / O’Shaughnessy, Douglas: "Noise-robust speech recognition in car environments using genetic algorithms and a mel-cepstral subspace approach", 2173-2176.
Axelrod, Scott / Gopinath, Ramesh / Olsen, Peder: "Modeling with a subspace constraint on inverse covariance matrices", 2177-2180.
McCowan, Iain A. / Morris, Andrew C. / Bourlard, Hervé: "Improving speech recognition performance of small microphone arrays using missing data techniques", 2181-2184.
Gelbart, David / Morgan, Nelson: "Double the trouble: handling noise and reverberation in far-field automatic speech recognition", 2185-2188.
Couvreur, Laurent / Ris;, Christophe: "Model-based independent component analysis for robust multi-microphone automatic speech recognition", 2189-2192.
Yu, An-Tze / Wang, Hsiao-Chuan: "Compensation of channel effect on line spectrum frequencies", 2193-2196.
Zhang, Huayun / Han, Zhaobing / Xu, Bo: "Codebook dependent dynamic channel estimation for Mandarin speech recognition over telephone", 2197-2200.
Gemello, Roberto / Mana1, Franco / Pegoraro, Paolo / Mori, Renato De: "Robust multiple resolution analysis for automatic speech recognition", 2201-2204.
Peinado, Antonio M. / Sánchez, Victoria / Pérez-Córdoba, José L. / Segura, José C. / Rubio, Antonio J.: "HMM-based methods for channel error mitigation in distributed speech recognition", 2205-2208.
Fingscheidt, Tim / Aalburg, Stefanie / Stan, Sorel / Beaugeant, Christophe: "Network-based vs. distributed speech recognition in adaptive multi-rate wireless systems", 2209-2212.
Bernard, Alexis / Alwan, Abeer: "Channel noise robustness for low-bitrate remote speech recognition", 2213-2216.
Peláez-Moreno, C. / Gallardo-Antolín, A. / Vicente-Peña, J. / Díaz-de-María, F.: "Influence of transmission errors on ASR systems", 2217-2220.
Tsuge, Satoru / Kuroiwa, Shingo / Shishibori, Masami / Ren, Fuji / Kita, Kenji: "Robust feature extraction in a variety of input devices on the basis of ETSI standard DSR front-end", 2221-2224.
Tan, Zheng-Hua / Dalsgaard, Paul: "Channel error protection scheme for distributed speech recognition", 2225-2228.
Muthusamy, Yeshwant / Gong, Yifan / Gupta, Roshan: "The effects of speech compression on speech recognition and text-to-speech synthesis", 2229-2232.
Milner, Ben / Shao, Xu: "Transform-based feature vector compression for distributed speech recognition", 2233-2236.
Johnston, Michael / Bangalore, Srinivas / Stent, Amanda / Vasireddy, Gunaranjan / Ehlen, Patrick: "Multimodal language processing for mobile information access", 2237-2240.
Wang, Kuansan: "SALT: a spoken language interface for web-based multimodal dialog systems", 2241-2244.
Bennett, Christina / Llitjós, Ariadna Font / Shriver, Stefanie / Rudnicky, Alexander I. / Black, Alan W.: "Building voiceXML-based applications", 2245-2248.
Chai, Joyce: "Operations for context-based multimodal interpretation in conversational systems", 2249-2252.
Liu, Feng / Saad, Antoine / Li, Li / Chou, Wu: "A distributed multimodal dialogue system based on dialogue system and web convergence", 2253-2256.
Katsurada, Kouichi / Ootani, Yoshihiko / Nakamura, Yusaku / Kobayashi, Satoshi / Yamada, Hirobumi / Nitta, Tsuneo: "A modality-independent MMI system architecture", 2549-2552.
Armaroli, Cristiana / Azzini, Ivano / Ferrario, Lorenza / Giorgino, Toni / Nardelli, Luca / Orlandi, Marco / Rognoni, Carla: "An architecture for a multi-modal web browser", 2553-2556.
Ehlen, Patrick / Johnston, Michael / Vasireddy, Gunaranjan: "Collecting mobile multimodal data for match", 2557-2560.
Meng, Helen M. / Ching, P. C. / Wong, Yee Fong / Chan, Cheong Chat: "ISIS: a multi-modal, trilingual, distributed spoken dialog system developed with CORBA, java, XML and KQML", 2561-2564.
Tsukada, Kimiko: "An acoustic comparison between american English and australian English vowels", 2257-2260.
Jesus, Luis M.T. / Shadle, Christine H.: "A case study of portuguese and English bilinguality", 2261-2264.
Dioubina, Olga I. / Pfitzinger, Hartmut R.: "An IPA vowel diagram approach to analysing L1 effects on vowel production and perception", 2265-2268.
Helgason, Pétur / Gullbein, Sjúrðhur: "Phonological norms in faroese speech synthesis", 2269-2272.
Mareüil, Philippe Boula de / Adda-Decker, Martine: "Studying pronunciation variants in French by using alignment techniques", 2273-2276.
Hansson, Petra: "Perceived boundary strength", 2277-2280.
Jun, Sun-Ah: "Syntax over focus", 2281-2284.
Ohala, John J. / Roengpitya, Rungpat: "Duration related phase realignment of Thai tones", 2285-2288.
Bosch, Louis ten: "Probabilistic ranking of constraints", 2289-2292.
Komatsu, Masahiko / Tokuma, Shinichi / Tokuma, Won / Arai, Takayuki: "Multi-dimensional analysis of sonority: perception, acoustics, and phonology", 2293-2296.
Faúndez-Zanuy, Marcos / Nilsson, Mattias / Kleijn, W. Bastiaan: "On the relevance of bandwidth extension for speaker verification", 2317-2320.
Sabac, Bogdan: "Speaker recognition using discriminative features selection", 2321-2324.
Kinnunen, Tomi: "Designing a speaker-discriminative adaptive filter bank for speaker recognition", 2325-2328.
Tsang, Chi-Leung / Mak, CMan-Wai / Kung, Sun-Yuan: "Divergence-based out-of-class rejection for telephone handset identification", 2329-2332.
Ho, Purdy: "A handset identifier using support vector machines", 2333-2336.
Faltlhauser, Robert / Ruske, Günther / Thomae, M.: "Towards the question: why has speaking rate such an impact on speech recognition performance?", 2429-2432.
Arcienega, Mijail / Drygajlo, Andrzej: "Robust voiced-unvoiced decision associated to continuous pitch tracking in noisy telephone speech", 2433-2436.
Yao, Kaisheng / Paliwal, Kuldip K. / Nakamura, Satoshi: "Noise adaptive speech recognition with acoustic models trained from noisy speech evaluated on Aurora-2 database", 2437-2440.
Chen, Jingdong / Huang, Yiteng (Arden) / Li, Qi / Soong, Frank K.: "Recognition of noisy speech using normalized moments", 2441-2444.
Chen, Chia-Ping / Bilmes, Jeff A. / Kirchhoff, Katrin: "Low-resource noise-robust feature post-processing on Aurora 2.0", 2445-2448.
Deng, Li / Droppo, Jasha / Acero, Alex: "Exploiting variances in robust feature extraction based on a parametric model of speech distortion", 2449-2452.
Ghulam, Muhammad / Fukuda, Takashi / Sato, Takaharu / Nitta, Tsuneo: "Improving performance of an HMM-based ASR system by using monophone-level normalized confidence measure", 2453-2456.
Liu, Yi / Fung, Pascale: "Model partial pronunciation variations for spontaneous Mandarin speech recognition", 2457-2460.
Zheng, Fang / Song, Zhanjiang / Fung, Pascale / Byrne, William: "Reducing pronunciation lexicon confusion and using more data without phonetic transcription for pronunciation modeling", 2461-2464.
McDermott, Erik / Katagiri, Shigeru: "Classification error from the theoretical Bayes classification risk", 2465-2468.
Klautau, Aldebaro / Jevtic, Nikola / Orlitsky, Alon: "Combined binary classifiers with applications to speech recognition", 2469-2472.
Nagórski, Arkadiusz / Boves, Lou / Steeneken, Herman: "Optimal selection of speech data for automatic speech recognition systems", 2473-2476.
Liotti, Mario / Ramig, Lorraine O. / Vogel, Deanie / New3, Pamela / Cook, Chris / Fox, Peter: "Hypophonia in parkinson disease: neural correlates of voice treatment with LSVT revealed by PET", 2477-2480.
Duncan, Susan: "Preliminary data on effects of behavioral and levodopa therapies on speech-accompanying gesture in parkinson²s disease", 2481-2484.
Quek, Francis / Harper, Mary / Haciahmetoglu, Yonca / Chen, Lei / Ramig, Lorraine O.: "Speech pauses and gestural holds in parkinson²s disease", 2485-2488.
Spielman, Jennifer L. / Ramig, Lorraine O. / Borod, Joan C.: "Oro-facial changes in parkinson²s disease following intensive voice therapy (LSVT)", 2489-2492.
Logemann, Jeri / Sundin, Ralph / Sundin, Jean: "Swallowing and voice effects of lee silverman voice treatment (LSVT)", 2493-2496.
Will, Leslie / Ramig, Lorraine O. / Spielman, Jennifer L.: "Application of the lee silverman voice treatment (LSVT) to individuals with multiple sclerosis, ataxic dysarthria, and stroke", 2497-2500.
Farley, Becky G.: "Think big, from voice to limb movement therapy", 2501-2504.
Parsa, Vijay / Jamieson, Donald G. / Stenning, Karen / Leeper, Herbert A.: "On the estimation of signal-to-noise ratio in continuous speech for abnormal voices", 2505-2508.
Semenov, V. / Kovtonyuk, A. / Kalyuzhny, A.: "Computationally efficient method of speech enhancement based on block representation of signal in state space and vector quantization **********************", 2509-2512.
Kondo, Kazuhiro / Nakagawa, Kiyoshi: "Active speech cancellation for cellular speech", 2513-2516.
Muralishankar, R. / Ramakrishnan, A. G. / Prathibha, P.: "Warped-LP residual resampling using DCT for pitch modification", 2517-2520.
Jung, E. / Schwarzbacher, A. / Humphreys, K. / Lawlor, R.: "Application of real-time AMDF pitch-detection in a voice gender normalisation system", 2521-2524.
Laprie, Yves / Bonneau, Anne: "A copy synthesis method to pilot the klatt synthesiser", 2525-2528.
Sakamoto, Masaharu / Saito, Takashi: "Speaker recognizability evaluation of a voicefont-based text-to-speech system", 2529-2532.
Satué-Villar, Antonio / Fernández-Rubio, Juan: "Time-frequency transforms and beamforming for speaker recognition", 2533-2536.
Kwon, Soonil / Narayanan, Shrikanth S.: "Speaker change detection using a new weighted distance measure", 2537-2540.
Gómez-Cipriano, José L. / Nunes, Roger P. / Barone, Dante A. C.: "FPGA hardware for speech recognition using hidden Markov models", 2541-2544.
Irino, Toshio / Minami, Yasuhiro / Nakatani, Tomohiro / Tsuzaki, Minoru / Tagawa, H.: "Evaluation of a speech recognition / generation method based on HMM and straight", 2545-2548.
Vepa, Jithendra / King, Simon / Taylor, Paul: "Objective distance measures for spectral discontinuities in concatenative speech synthesis", 2605-2608.
Hamza, Wael / Donovan, Robert: "Data-driven segment preselection in the IBM trainable speech synthesis system", 2609-2612.
Peng, Hu / Zhao, Yong / Chu, Min: "Perpetually optimizing the cost function for unit selection in a TTS system with one single run of MOS evaluation", 2613-2616.
Yi, Jon / Glass, James: "Information-theoretic criteria for unit selection synthesis", 2617-2620.
Kawai, Hisashi / Tsuzaki, Minoru: "Acoustic measures vs. phonetic features as predictors of audible discontinuity in concatenative speech synthesis", 2621-2624.
Wang, Hsien-Chang / Huang, Chieh-Yi / Yang, Chung-Hsien / Wang, Jhing-Fa: "A study of multi-speaker dialogue system for mobile information retrieval", 2677-2680.
Fabbrizio, Giuseppe Di / Dutton, Dawn / Gupta, Narendra K. / Hollister, Barbara / Rahim, Mazin / Riccardi, Giuseppe / Schapire, Robert / Schroeter, Juergen: "AT&t help desk", 2681-2684.
Trias-Sanz, Roger / Mariño, José B.: "Basurde[lite], a machine-driven dialogue system for accessing railway timetable information", 2685-2688.
Coulston, Rachel / Oviatt, Sharon / Darves, Courtney: "Amplitude convergence in children²s conversational speech with animated personas", 2689-2692.
Stallard, David: "Flexible dialogue management in the talk²n’travel system", 2693-2696.
Oria, Daniela / Koskinen, Esa: "E-mail goes mobile: the design and implementation of a spoken language interface to e-mail", 2697-2700.
Yoma, Néstor Becerra / Cortés, Angela / Hormazábal, Mauricio / López, Enrique: "Wizard of oz evaluation of a dialogue with communicator system in chile", 2701-2704.
Carpenter, Bob / Caskey, Sasha / Dayanidhi, Krishna / Drouin, Caroline / Pieraccini, Roberto: "A portable, server-side dialog framework for voiceXML", 2705-2708.
Takahashi, S. / Morimoto, T. / Maeda, S. / Tsuruta, N.: "Spoken dialogue system for home health care", 2709-2712.
Padrell, Jaume / Hernando, Javier: "ACIMET: access to meteorological information by telephone", 2713-2716.
Engel, Ralf: "SPIN: language understanding for spoken dialogue systems using a production system approach", 2717-2720.
Original Workshop Website
The link to the original website will bring you to the workshop website as long as it is maintained.