Table of Contents and Access to Abstracts - Ordered by Sessions
Acoustic Features for Robust Speech Recognition
Acoustic Modeling
Acoustic Model Adaptation
Acoustics of Spoken Language 1, 2
Acoustics of Spoken Language (Poster)
Acquisition and Learning of Spoken Language 1, 2
Adaptation and Acquisition in Spoken Language Processing 1, 2
Adaptation and Acquisition in Spoken Language Processing (Poster)
Dialogue Systems and Speech Input
Discourse and Dialogue 1, 2
Generation and Synthesis of Spoken Language 1, 2
Generation and Synthesis of Spoken Language 3
Generation and Synthesis of Spoken Language (Poster)
Language Modeling
Language Resources and Technology Evaluation (Special Session)
Large Vocabulary Continuous Speech Recognition
Linguistics, Phonology, Phonetics, and Psycholinguistics 1, 2
Linguistics, Phonology, Phonetics, and Psycholinguistics 3
Linguistics, Phonology, Phonetics, and Psycholinguistics (Poster)
Miscellaneous Topics 1 [A,B,C,G,H,L,O,Q,X]
Miscellaneous Topics 2 [M,J]
Miscellaneous Topics 3 [D,E,F,I,P,N,R,S,U,W,Y,Z]
Multimodal, Translingual, and Dialogue Systems
Perception and Comprehension of Spoken Language 1, 2
Problems and Prospects of Trans-Lingual Communication (Special Session)
Production of Spoken Language
Production of Spoken Language (Poster)
Prosody 1, 2
Prosody (Poster)
Prosody and Paralinguistics (Special Session)
Prosody, Acquisition, and Learning
Recognition and Understanding of Spoken Language 1, 2
Recognition and Understanding of Spoken Language 3, 4
Robust Modeling
Rules and Corpora (Special Session)
Signal Analysis, Processing, and Feature Extraction 1, 2
Signal Analysis, Processing, and Feature Extraction (Poster)
Speaker, Dialect, and Language Recognition 1, 2
Speaker, Dialect, and Language Recognition 3
Speaker, Dialect, and Language Recognition (Poster)
Speech Coding and Transmission
Speech Interface and Dialogue Systems
Speech Perception, Comprehension, and Production (Special Session)
Speech Production Control (Special Session)
Speech, Facial Expression, and Gesture
Spoken and Multi-Modal Dialogue Systems
Spoken Language Processing
Spoken Language Resources, Labeling, and Assessment
Trans-Modal and Multi-Modal Human-Computer Interaction (Special Session)
Speech Production Control (Special Session)
Liljencrants, Johan / Fant, Gunnar / Kruckenberg, Anita:
"Subglottal pressure and prosody in Swedish",
vol. 1, 1-4.
Honda, Kiyoshi / Masaki, Shinobu / Shimada, Yasuhiro:
"Observation of laryngeal control for voicing and pitch change by magnetic resonance imaging technique",
vol. 1, 5-8.
Fujisaki, Hiroya / Tomana, Ryou / Narusawa, Shuichi / Ohno, Sumio / Wang, Changfu:
"Physiological mechanisms for fundamental frequency control in standard Chinese",
vol. 1, 9-12.
Carré, René:
"On vocal tract asymmetry/symmetry",
vol. 1, 13-16.
Engwall, Olov:
"Are static MRI measurements representative of dynamic speech? results from a comparative study using MRI, EPG and EMA",
vol. 1, 17-20.
Lu, Shinan / He, Lin / Yang, Yufang / Cao, Jianfen:
"Prosodic control in Chinese TTS system",
vol. 1, 21-24.
Gao, Yuqing / Bakis, Raimo / Huang, Jing / Xiang, Bing:
"Multistage coarticulation model combining articulatory, formant and cepstral features",
vol. 1, 25-28.
Fujimura, Osamu:
"Rhythmic organization and signal characteristics of speech",
vol. 1, 29-35.
Öhman, Sven E. G.:
"Oral culture in the 21st century: the case of speech processing",
vol. 1, 36-41.
Jiang, Jintao / Alwan, Abeer / Bernstein, Lynne E. / Keating, Patricia / Auer, Ed:
"On the correlation between facial movements, tongue movements and speech acoustics",
vol. 1, 42-45.
Linguistics, Phonology, Phonetics, and Psycholinguistics 1, 2
Whiteside, S. P. / Rixon, E.:
"Coarticulation patterns in identical twins: an acoustic case study",
vol. 1, 46-49.
Hanna, Philip / Stewart, Darryl / Ming, Ji / Smith, F. Jack:
"Improved lexicon formation through removal of co-articulation and acoustic recognition errors",
vol. 1, 50-53.
Lindström, Anders / Kasaty, Anna:
"A two-level approach to the handling of foreign items in Swedish speech technology applications",
vol. 1, 54-57.
Den, Yasuharu / Clark, Herbert H.:
"Word repetitions in Japanese spontaneous speech",
vol. 1, 58-61.
Jongman, Allard / Moore, Corinne B.:
"The role of language experience in speaker and rate normalization processes",
vol. 1, 62-65.
Müller, Achim F. / Tao, Jianhua / Hoffmann, Rüdiger:
"Data-driven importance analysis of linguistic and phonetic information",
vol. 1, 66-69.
Fujisaki, Hiroya / Shirai, Katsuhiko / Doshita, Shuji / Nakagawa, Seiichi / Hirose, Keikichi / Itahashi, Shuichi / Kawahara, Tatsuya / Ohno, Sumio /
Kikuchi, Hideaki / Abe, Kenji / Kiriyama, Shinya:
"Overview of an intelligent system for information retrieval based on human-machine dialogue through spoken language",
vol. 1, 70-73.
Yang, Li-chiung:
"The expression and recognition of emotions through prosody",
vol. 1, 74-77.
Swerts, Marc / Taniguchi, Miki / Katagiri, Yasuhiro:
"Prosodic marking of information status in tokyo Japanese",
vol. 1, 78-81.
Wrede, Britta / Fink, Gernot A. / Sagerer, Gerhard:
"Influence of duration on static and dynamic properties of German vowels in spontaneous speech",
vol. 1, 82-85.
Zheng, Bo / Wang, Bei / Yang, Yufang / Lu, Shinan / Cao, Jianfen:
"The regular accent in Chinese sentences",
vol. 1, 86-89.
Mella, Odile / Fohr, Dominique / Martin, Laurent / Carlen, Andreas:
"A tool for the synchronization of speech and mouth shapes: LIPS",
vol. 1, 90-93.
Kurdi, Mohamed-Zakaria:
"Semantic tree unification grammar: a new formalism for spoken language processing",
vol. 1, 94-97.
Discourse and Dialogue 1, 2
Kurematsu, Akira / Shionoya, Yousuke:
"Identification of utterance intention in Japanese spontaneous spoken dialogue by use of prosody and keyword information",
vol. 1, 98-101.
Abdou, Sherif / Scordilis, Michael:
"Improved speech understanding using dialogue expectation in sentence parsing",
vol. 1, 102-105.
Meng, Helen M. / Wai, Carmen / Pieraccini, Roberto:
"The use of belief networks for mixed-initiative dialog modeling",
vol. 1, 106-109.
McTear, Michael F. / Allen, Susan / Clatworthy, Laura / Ellison, Noelle / Lavelle, Colin / McCaffery, Helen:
"Integrating flexibility into a structured dialogue model: some design considerations",
vol. 1, 110-113.
Niimi, Yasuhisa / Oku, Tomoki / Nishimoto, Takuya / Araki, Masahiro:
"A task-independent dialogue controller based on the extended frame-driven method",
vol. 1, 114-117.
Xu, Wei / Rudnicky, Alex:
"Language modeling for dialog system",
vol. 1, 118-121.
Georgila, Kallirroi / Fanotakis, Nikos / Kokkinakis, George:
"Building stochastic language model networks based on simultaneous word/phrase clustering",
vol. 1, 122-125.
Yang, Li-chiung / Esposito, Richard:
"Prosody and topic structuring in spoken dialogue",
vol. 1, 126-129.
Maes, Stéphane H.:
"Elements of conversational computing - a paradigm shift",
vol. 1, 130-133.
Müller, Ludek / Jurcicek, Filip / Smidl, Lubos:
"Rejection and key-phrase spottin techniques using a mumble model in a czech telephone dialog system",
vol. 1, 134-137.
Paek, Tim / Horvitz, Eric / Ringger, Eric:
"Continuous listening for unconstrained spoken dialog",
vol. 1, 138-141.
Shriver, Stefanie / Black, Alan W. / Rosenfeld, Ronald:
"Audio signals in speech interfaces",
vol. 1, 142-145.
Boda, Péter Pál:
"Visualisation of spoken dialogues",
vol. 1, 146-149.
Zajicek, Mary:
"The construction of speech output to support elderly visually impaired users starting to use the internet",
vol. 1, 150-153.
Recognition and Understanding of Spoken Language 1, 2
Takagi, Kazuyuki / Oguro, Rei / Ozeki, Kazuhiko:
"Effects of word string language models on noisy broadcast news speech recognition",
vol. 1, 154-157.
Luo, Xiaoqiang / Franz, Martin:
"Semantic tokenization of verbalized numbers in language modeling",
vol. 1, 158-161.
Kato, Kazuomi / Nanjo, Hiroaki / Kawahara, Tatsuya:
"Automatic transcription of lecture speech using topic-independent language modeling",
vol. 1, 162-165.
Guillén, Rocio / Erman, Randal:
"Extending grammars based on similar-word recognition",
vol. 1, 166-169.
Whittaker, E. W. D. / Woodland, P. C.:
"Particle-based language modelling",
vol. 1, 170-173.
Choi, W. N. / Wong, Y. W. / Lee, Tan / Ching, P. C.:
"Lexical tree decoding with a class-based language model for Chinese speech recognition",
vol. 1, 174-177.
Visweswariah, K. / Printz, H. / Picheny, M.:
"Impact of bucketing on performance of linearly interpolated language models",
vol. 1, 178-181.
Zhang, Shuwu / Yamamoto, Hirofami / Sagisaka, Yoshinori:
"An embedded knowledge integration for hybrid language modelling",
vol. 1, 182-195.
Galescu, Lucian / Allen, James:
"Hierarchical statistical language models: experiments on in-domain adaptation",
vol. 1, 186-189.
Yamamoto, Hirofumi / Tanigaki, Kouichi / Sagisaka, Yoshinori:
"A language model for conversational speech recognition using information designed for speech translation",
vol. 1, 190-193.
Carpenter, Bob / Lerner, Sol / Pieraccini, Roberto:
"Optimizing BNF grammars through source transformations",
vol. 1, 194-197.
Wu, Jian / Zheng, Fang:
"On enhancing katz-smoothing based back-off language model",
vol. 1, 198-201.
Xu, Wei / Rudnicky, Alex:
"Can artificial neural networks learn language models?",
vol. 1, 202-205.
Savova, Guergana / Schonwetter, Michael / Pakhomov, Sergey:
"Improving language model perplexity and recognition accuracy for medical dictations via within-domain interpolation with literal and semi-literal corpora",
vol. 1, 206-209.
Weilhammer, Karl / Ruske, Günther:
"Placing structuring elements in a word sequence for generating new statistical language models",
vol. 1, 210-213.
Estève, Yannick / Béchet, Frédéric / Mori, Renato de:
"Dynamic selection of language models in a dialogue system",
vol. 1, 214-217.
Johnsen, Magne H. / Holter, Trym / Svendsen, Torbjørn / Harborg, Erik:
"Stochastic modeling of semantic content for use IN a spoken dialogue system",
vol. 1, 218-221.
Takara, Tomio / Nagaki, Eiji:
"Spoken word recognition using the artificial evolution of a set of vocabulary",
vol. 1, 222-225.
Horvitz, Eric / Paek, Tim:
"Deeplistener: harnessing expected utility to guide clarification dialog in spoken language systems",
vol. 1, 226-229.
Deng, Yunbin / Xu, Bo / Huang, Taiyi:
"Chinese spoken language understanding across domain",
vol. 1, 230-233.
Martin, Sven C. / Kellner, Andreas / Portele, Thomas:
"Interpolation of stochastic grammar and word bigram models in natural language understanding",
vol. 1, 234-237.
Kogure, Satoru / Nakagawa, Seiichi:
"A portable development tool for spoken dialogue systems",
vol. 1, 238-241.
Lin, Yi-Chung / Wang, Huei-Ming:
"Error-tolerant language understanding for spoken dialogue systems",
vol. 1, 242-245.
Ito, Akinori / Hori, Chiori / Katoh, Masaharu / Kohda, Masaki:
"Language modeling by stochastic dependency grammar for Japanese speech recognition",
vol. 1, 246-249.
Zhang, Ruiqiang / Black, Ezra / Finch, Andrew / Sagisaka, Yoshinori:
"A tagger-aided language model with a stack decoder",
vol. 1, 250-253.
Hirschberg, Julia / Litman, Diane / Swerts, Marc:
"Generalizing prosodic prediction of speech recognition errors",
vol. 1, 254-257.
Bellegarda, Jerome R. / Silverman, Kim E. A.:
"Toward unconstrained command and control: data-driven semantic inference",
vol. 1, 258-261.
Hanazawa, Ken / Sakai, Shinsuke:
"Continuous speech recognition with parse filtering",
vol. 1, 262-265.
Adda-Decker, Martine / Adda, Gilles / Lamel, Lori:
"Investigating text normalization and pronunciation variants for German broadcast transcription",
vol. 1, 266-269.
Wester, Mirjam / Fosler-Lussier, Eric:
"A comparison of data-derived and knowledge-based modeling of pronunciation variation",
vol. 1, 270-273.
Kessens, Judith M. / Strik, Helmer / Cucchiarini, Catia:
"A bottom-up method for obtaining information about pronunciation variation",
vol. 1, 274-277.
Zhang, Jiyong / Zheng, Fang / Xu, Mingxing / Fang, Ditang:
"Semi-continuous segmental probability modeling for continuous speech recognition",
vol. 1, 278-281.
Antoniou, Christos A. / Reynolds, T. Jeff:
"Acoustic modelling using modular/ensemble combinations of heterogeneous neural networks",
vol. 1, 282-285.
Hon, Hsiao-Wuen / Kumar, Shankar / Wang, Kuansan:
"Unifying HMM and phone-pair segment models",
vol. 1, 286-289.
Li, Ming / Yu, Tiecheng:
"Multi-group mixture weight HMM",
vol. 1, 290-292.
Kitazoe, Tetsuro / Ichiki, Tomoyuki / Funamori, Makoto:
"Application of pattern recognition neural network model to hearing system for continuous speech",
vol. 1, 293-296.
Smith, Nathan / Niranjan, Mahesan:
"Data-dependent kernels in svm classification of speech patterns",
vol. 1, 297-300.
Umesh, S. / Rose, Richard C. / Parthasarathy, S.:
"Exploiting frequency-scaling invariance properties of the scale transform for automatic speech recognition",
vol. 1, 301-304.
Fujimoto, Masahiro / Ogata, Jun / Ariki, Yasuo:
"Large vocabulary continuous speech recognition under real environments using adaptive sub-band spectral subtraction",
vol. 1, 305-308.
Gu, Liang / Rose, Kenneth:
"Perceptual harmonic cepstral coefficients as the front-end for speech recognition",
vol. 1, 309-312.
Tam, Yik-Cheung / Mak, Brian:
"Optimization of sub-band weights using simulated noisy speech in multi-band speech recognition",
vol. 1, 313-316.
Faltlhauser, Robert / Pfau, Thilo / Ruske, Günther:
"On the use of speaking rate as a generalized feature to improve decision trees",
vol. 1, 317-320.
Toyama, Jun / Shimbo, Masaru:
"Syllable recognition using glides based on a non-linear transformation",
vol. 1, 321-324.
Sönmez, Kemal / Plauché, Madelaine / Shriberg, Elizabeth / Franco, Horacio:
"Consonant discrimination in elicited and spontaneous speech: a case for signal-adaptive front ends in ASR",
vol. 1, 325-328.
Daoudi, Khalid / Fohr, Dominique / Antoine, Christophe:
"A new approach for multi-band speech recognition based on probabilistic graphical models",
vol. 1, 329-332.
Glotin, Hervé / Berthommier, Frédéric:
"Test of several external posterior weighting functions for multiband full combination ASR",
vol. 1, 333-336.
Okada, Kanji / Arai, Takayuki / Kanederu, Noburu / Momomura, Yasunori / Murahara, Yuji:
"Using the modulation wavelet transform for feature extraction in automatic speech recognition",
vol. 1, 337-340.
Zhu, Qifeng / Alwan, Abeer:
"AM-demodulation of speech spectra and its application io noise robust speech recognition",
vol. 1, 341-344.
Hagen, Astrid / Morris, Andrew:
"Comparison of HMM experts with MLP experts in the full combination multi-band approach to robust ASR",
vol. 1, 345-348.
Hagen, Astrid / Bourlard, Hervé:
"Using multiple time scales in the framework of multi-stream speech recognition",
vol. 1, 349-352.
Yu, Hua / Waibel, Alex:
"Streamlining the front end of a speech recognizer",
vol. 1, 353-356.
Raj, Bhiksha / Seltzer, Michael L. / Stern, Richard M.:
"Reconstruction of damaged spectrographic features for robust speech recognition",
vol. 1, 357-360.
Sturm, Janienke / Kamperman, Hans / Boves, Lou / Os, Els den:
"Impact of speaking style and speaking task on acoustic models",
vol. 1, 361-364.
Kadambe, Shubha / Burns, Ron:
"Encoded speech recognition accuracy improvement in adverse environments by enhancing formant spectral bands",
vol. 1, 365-368.
Barker, Jon / Josifovski, Ljubomir / Cooke, Martin / Green, Phil:
"Soft decisions in missing data techniques for robust automatic speech recognition",
vol. 1, 373-376.
Liu, Jian / Yu, Tiecheng:
"New tone recognition methods for Chinese continuous speech",
vol. 1, 377-380.
Zhang, Bo / Peng, Gang / Wang, William S.-Y.:
"Reliable bands guided similarity measure for noise-robust speech recognition",
vol. 1, 381-384.
Nitta, Tsuneo / Takigawa, Masashi / Fukuda, Takashi:
"A novel feature extraction using multiple acoustic feature planes for HMM-based speech recognition",
vol. 1, 385-388.
Zheng, Fang / Zhang, Guoliang:
"Integrating the energy information into MFCC",
vol. 1, 389-392.
Farooq, Omar / Datta, Sekharjit:
"Speaker independent phoneme recognition by MLP using wavelet features",
vol. 1, 393-396.
Couvreur, Laurent / Couvreur, Christophe / Ris, Christophe:
"A corpus-based approach for robust ASR in reverberant environments",
vol. 1, 397-400.
Bazzi, Issam / Glass, James R.:
"Modeling out-of-vocabulary words for robust speech recognition",
vol. 1, 401-404.
Gajic, Bojana / Rose, Richard C.:
"Hidden Markov model environmental compensation for automatic speech recognition on hand-held mobile devices",
vol. 1, 405-408.
Morris, Andrew C. / Josifovski, Ljubomir / Bourlard, Hervé / Cooke, Martin / Green, Phil:
"A neural network for classification with incomplete data: application to robust ASR",
vol. 1, 409-412.
Matsuda, Shigeki / Nakai, Mitsuru / Shimodaira, Hiroshi / Sagayama, Shigeki:
"Feature-dependent allophone clustering",
vol. 1, 413-416.
Yang, Qian / Martens, Jean-Pierre:
"Data-driven lexical modeling of pronunciation variations for ASR",
vol. 1, 417-420.
Tran, Dat / Wagner, Michael:
"Fuzzy entropy hidden Markov models for speech recognition",
vol. 1, 421-424.
Quillen, Carl:
"Adjacent node continuous-state HMM’s",
vol. 1, 425-428.
Sturm, Janienke / Sanders, Eric:
"Modelling phonetic context using head-body-tail models for connected digit recognition",
vol. 1, 429-432.
Bazzi, Issam / Katabi, Dina:
"Using support vector machines for spoken digit recognition",
vol. 1, 433-436.
Sun, Jiping / Jing, Xing / Deng, Li:
"Data-driven model construction for continuous speech recognition using overlapping articulatory features",
vol. 1, 437-440.
Vasilache, Marcel:
"Speech recognition using HMMs with quantized parameters",
vol. 1, 441-444.
Qi, Yingyong / Xin, Jack:
"A perception and PDE based nonlinear transformation for processing spoken words",
vol. 1, 445-448.
Blasig, Reinhard / Rose, Georg / Meyer, Carsten:
"Training of isolated word recognizers with continuous speech",
vol. 1, 449-452.
Production of Spoken Language
Tseng, Shu-Chuan:
"Repair patterns in spontaneous Chinese dialogs: morphemes, words, and phrases",
vol. 1, 453-456.
Dang, Jianwu / Honda, Kiyoshi:
"Improvement of a physiological articulatory model for synthesis of vowel sequences",
vol. 1, 457-460.
Motoki, Kunitoshi / Pelorson, Xavier / Badin, Pierre / Matsuzaki, Hiroki:
"Computation of 3-d vocal tract acoustics based on mode-matching technique",
vol. 1, 461-464.
Ménard, Lucie / Boë, Louis-Jean:
"Exploring vowel production strategies from infant to adult by means of articulatory inversion of formant data",
vol. 1, 465-468.
Smith, Gavin / Robinson, Tony:
"Segmentation of a speech waveform according to glottal open and closed phases using an autoregressive-HMM",
vol. 1, 469-472.
Orr, Rosemary / Cranen, Bert / Jong, Felix de / Boves, Lou:
"Comparison of inverse filtering of the flow signal and microphone signal",
vol. 1, 473-476.
Iseli, Markus R. / Alwan, Abeer:
"Inter- and intra-speaker variability of glottal flow derivative using the LF model",
vol. 1, 477-480.
Linguistics, Phonology, Phonetics, and Psycholinguistics 3
Blache, Philippe / Hirst, Daniel:
"Multi-level annotation for spoken language corpora",
vol. 1, 481-484.
Li, Aijun / Zheng, Fang / Byrne, William / Fung, Pascale / Kamm, Terri / Liu, Yi / Song, Zhanjiang / Ruhi, Umar / Venkataramani, Veera / Chen, XiaoXia:
"CASS: a phonetically transcribed corpus of mandarin spontaneous speech",
vol. 1, 485-488.
Yamamoto, Kazuhide / Sumita, Eiichiro:
"Multiple decision-tree strategy for input-error robustness: a simulation of tree combinations",
vol. 1, 489-492.
Chen, Zheng / Lee, Kai-Fu / Li, Ming-jing:
"Discriminative training on language model",
vol. 1, 493-496.
Gao, Jianfeng / Li, Mingjing / Lee, Kai-Fu:
"N-gram distribution based language model adaptation",
vol. 1, 497-500.
Palou, Francisco / Bravetti, P. / Emam, O. / Fischer, V. / Janke, Eric:
"Towards a common phone alphabet for multilingual speech recognition",
vol. 1, 501-504.
Belvin, Robert / Burns, Ron / Hein, Cheryl:
"What²s next: a case study in the multidimensionality of a dialog system",
vol. 1, 504-507.
Dialogue Systems and Speech Input
Higashida, Masanobu / Ohmori, Kumiko:
"A new dialogue control method based on human listening process to construct an interface for ascertaining a user²s inputs",
vol. 1, 508-511.
Wang, XianFang / Du, LiMin:
"Spoken language understanding in a Chinese spoken dialogue system engine",
vol. 1, 512-515.
Dharanipragada, Satya / Franz, Martin / McCarley, J. Scott / Papineni, K. / Roukos, Salim / Ward, T. / Zhu, W.-J.:
"Statistical methods for topic segmentation",
vol. 1, 516-519.
Chen, Berlin / Wang, Hsin-min / Lee, Lin-shan:
"Retrieval of mandarin broadcast news using spoken queries",
vol. 1, 520-523.
Hansen, John H. L. / Plucienkowski, Jay / Gallant, Stephen / Pellom, Bryan / Ward, Wayne:
""CU-move": robust speech processing for in-vehicle speech systems",
vol. 1, 524-527.
Kim, Ji-Hwan / Woodland, Philip C.:
"A rule-based named entity recognition system for speech input",
vol. 1, 528-531.
Sadigh, Mohammad Reza / Sheikhzadeh, Hamid / Jahangir, M. R. / Farzan, Arash:
"A rule-based approach to farsi language text-to-phoneme conversion",
vol. 1, 532-535.
Jongman, Allard / Wang, Yue / Sereno, Joan:
"Acoustic and perceptual properties of English fricatives",
vol. 1, 536-539.
Shattuck-Hufnagel, Stefanie / Veilleux, Nanette:
"The special phonological characteristics of monosyllabic function words in English",
vol. 1, 540-543.
López de Ipiña, Miren Karmele / Torres, In / Oñederra, Lourdes / Varona, Amparo / Rodríguez, Luis Javier:
"Selection of sublexical units for continuous speech recognition of basque",
vol. 1, 544-547.
Plauché, Madelaine C. / Sönmez, Kemal:
"Machine learning techniques for the identification of cues for stop place",
vol. 1, 548-551.
Widera, Christina:
"Strategies of vowel reduction - a speaker-dependent phenomenon",
vol. 1, 552-555.
Fox, Michelle A.:
"Syllable-final /s/ lenition in the LDC's callhome Spanish corpus",
vol. 1, 556-559.
Kurematsu, Akira / Nakazaki, Takeaki:
"Meaning extraction based on frame representation for Japanese spoken dialogue",
vol. 1, 560-563.
Caspers, Johanneke:
"Pitch accents, boundary tones and turn-taking in dutch map task dialogues",
vol. 1, 565-568.
Yamashita, Yoichi / Murai, Michiyo:
"An annotation scheme of spoken dialogues with topic break indexes",
vol. 1, 569-572.
Veilleux, Nanette:
"Application of the centering framework in spontaneous dialogues",
vol. 1, 573-576.
Mori, Hiroki / Kasuya, Hideki:
"Automatic lexicon generation and dialogue modeling for spontaneous speech",
vol. 1, 577-580.
Wolters, Maria / Mixdorff, Hansjörg:
"Evaluating radio news intonation - autosegmental versus superpositional modelling",
vol. 1, 581-584.
Falavigna, Daniele / Gretter, Roberto / Orlandi, Marco:
"A mixed language model for a dialogue system over ihe telephone",
vol. 1, 585-588.
Bell, Linda / Gustafson, Joakim:
"Positive and negative user feedback in a spoken dialogue corpus",
vol. 1, 589-592.
Cutler, Anne / Koster, Mariëtte:
"Stress and lexical activation in dutch",
vol. 1, 593-596.
Eldin, Safa Nasser / Nour, Hanna Abdel / Abdenbi, Rajouani:
"Automatic modeling and implementation of intonation for the arabic language in TTS systems",
vol. 1, 597-600.
Gadde, Venkata Ramana Rao:
"Modeling word durations",
vol. 1, 601-604.
Venditti, Jennifer J. / Santen, Jan P. H. van:
"Japanese intonation synthesis using superposition and linear alignment models",
vol. 1, 605-608.
Minowa, Toshimitsu / Mochizuki, Ryo / Nishimura, Hirofumi:
"Improving the naturalness of synthetic speech by utilizing the prosody of natural speech",
vol. 1, 609-612.
Chen, Sin-Horng / Ho, Chen-Chung:
"A hybrid statistical/RNN approach to prosody synthesis for taiwanese TTS",
vol. 1, 613-616.
Minematsu, Nobuaki / Fujisawa, Yukiko / Nakagawa, Seiichi:
"Performance comparison among HMM, DTW, and human abilities in terms of identifying stress patterns of word utterances",
vol. 1, 617-620.
Montero, Juan Manuel / Córdoba, Ricardo / Vallejo, José A. / Gutiérrez-Arriola, Juana / Enríquez, Emilia / Pardo, Juan Manuel:
"Restricted-domain female-voice synthesis in Spanish: from database design to ANN prosodic modeling",
vol. 1, 621-624.
Fernández-Salgado, Xavier / Banga, Eduardo R.:
"A hierarchical intonation model for synthesising F0 contours in galician language",
vol. 1, 625-628.
Applebaum, Ted H. / Kibre, Nick / Pearson, Steve:
"Features for F0 contour prediction",
vol. 1, 629-632.
Gu, Zhenglai / Mori, Hiroki / Kasuya, Hideki:
"Prosodic variation of focused syllables of disyllabic word in Mandarin Chinese",
vol. 1, 633-636.
Chu, Stephen M. / Huang, Thomas S.:
"Automatic head gesture learning and synthesis from prosodic cues",
vol. 1, 637-640.
Vainio, Martti / Altosaar, Toomas / Werner, Stefan:
"Measuring the importance of morphological information for finnish speech synthesis",
vol. 1, 641-644.
Jokisch, Oliver / Mixdorff, Hansjörg / Kruschke, Hans / Kordon, Ulrich:
"Learning the parameters of quantitative prosody models",
vol. 1, 645-648.
Narusawa, Shuichi / Fujisaki, Hiroya / Ohno, Sumio:
"A method for automatic extraction of parameters of the fundamental frequency contour",
vol. 1, 649-652.
Kitazoe, Tetsuro / Kim, Sung-Ill / Yoshitomi, Yasunari / Ikeda, Tatsuhiko:
"Recognition of emotional states using voice, face image and thermal image of face",
vol. 1, 653-656.
Watanuki, Keiko / Seki, Susumu / Miyoshi, Hideo:
"Turn taking and multimodal information in two-people dialog",
vol. 1, 657-660.
Abutalebi, Hamid Reza / Bijankhan, Mahmood:
"Implementation of a text-to-speech system for farsi language",
vol. 1, 661-664.
Huber, Richard / Batliner, Anton / Buckow, Jan / Nöth, Elmar / Warnke, Volker / Niemann, Heinrich:
"Recognition of emotion in a realistic dialogue scenario",
vol. 1, 665-668.
Barry, Johanna / Blamey, Peter / Lee, Kathy / Cheung, Dilys:
"Differentiation in tone production in cantonese-speaking hearing-impaired children",
vol. 1, 669-672.
Zundert, Martine van / Terken, Jacques:
"Learning effects for phonetic properties of synthetic speech",
vol. 1, 673-676.
Tomokiyo, Laura Mayfield / Wang, Le / Eskenazi, Maxine:
"An empirical study of the effectiveness of speech-recognition-based pronunciation training",
vol. 1, 677-680.
Deroo, Olivier / Ris, Christophe / Gielen, Sofie / Vanparys, Johan:
"Automatic detection of mispronounced phonemes for language learning tools",
vol. 1, 681-684.
Escalona, Horacio Meza / Kirschning, Ingrid / Villagómez, Ofelia Cervantes:
"Estimation of duration models for phonemes in m exican speech synthesis",
vol. 1, 685-688.
Wu, Xiaoru / Wang, Renhua / Hu, Guoping:
"Special text processing based external descriptor rule",
vol. 1, 689-692.
Yu, Zhenli / Zeng, Shangcui:
"Articulatory synthesis using a vocal-tract model of variable length",
vol. 1, 693-696.
Boula_de_Mareüil, Philippe:
"Linguistic-prosodic processing for text-to-speech synthesis in italian",
vol. 1, 697-700.
Eichner, Matthias / Wolff, Matthias / Hoffmann, Rüdiger:
"A unified approach for speech synthesis and speech recognition using stochastic Markov graphs",
vol. 1, 701-704.
Breen, Andrew / Salter, James:
"Using F0 within a phonologically motivated method of unit selection",
vol. 1, 705-708.
Blouin, Christophe J. / Bagshaw, Paul C.:
"Analysis of the degradation of French vowels induced by the TD-PSOLA algorithm, in text-to-speech context",
vol. 1, 709-712.
Janicki, Artur:
"Automatic construction of acoustic inventory for the concatenative speech synthesis for polish",
vol. 1, 713-716.
Hirschfeld, Diane / Wolff, Matthias:
"Universal and multilingual unit selection for DRESS",
vol. 1, 717-720.
Pan, Davis / Heng, Brian / Cheung, Shiufun / Chang, Ed:
"Improving speech synthesis for high intelligibility under adverse conditions",
vol. 1, 721-724.
Nishizawa, Nobuyuki / Minematsu, Nobuaki / Hirose, Keikichi:
"Development of a formant-based analysis-synthesis system and generation of high quality liquid sounds of Japanese",
vol. 1, 725-728.
Jokisch, Oliver / Eichner, Matthias:
"Synthesizing and evaluating an artificial language: klingon",
vol. 1, 729-732.
Olinsky, Craig / Black, Alan W.:
"Non-standard word and homograph resolution for asian language text analysis",
vol. 1, 733-736.
Sen, Zhang / Shirai, Katsuhiko:
"Re-estimation of LPC coefficients in the sense of l&inf; criterion",
vol. 1, 737-740.
Jung, Sung-Kyo / Choi, Yong-Soo / Park, Young-Cheol / Youn, Dae-Hee:
"An efficient codebook search algorithm for EVRC",
vol. 1, 741-744.
Kim, Jong-Kuk / Kim, Jeong-Jin / Bae, Myung-Jin:
"The reduction of the search time by the pre-determination of the grid bit in the g.723.1 MP-MLQ",
vol. 1, 745-749.
Möller, Sebastian / Bourlard, Hervé:
"Real-time telephone transmission simulation for speech recognizer and dialogue system evaluation and improvement",
vol. 1, 750-753.
Chengalvarayan, Rathinavelu / Thomson, David L.:
"HMM-based echo and announcement modeling approaches for noise suppression avoiding the problem of false triggers",
vol. 1, 754-757.
Chen, Fangxin:
"Speaker information enhancement",
vol. 1, 758-761.
Dolfing, Hans:
"Exhaustive search for lower-bound error-rates in vocal tract length normalization",
vol. 1, 762-765.
Macho, Dusan / Nadeu, Climent:
"Use of voicing information to improve the robustness of the spectral parameter set",
vol. 1, 766-769.
Yao, Kaisheng / Shi, Bertram E. / Nakamura, Satoshi / Cao, Zhigang:
"Residual noise compensation by a sequential EM algorithm for robust speech recognition in nonstationary noise",
vol. 1, 770-773.
Ye, Hui / Fung, Pascale / Huang, Taiyi:
"Principal mixture speaker adaptation for improved continuous speech recognition",
vol. 1, 774-777.
Altosaar, Toomas / Vainio, Martti:
"Reduced impedance mismatch in speech database access",
vol. 1, 778-781.
Tian, Jiapeng / Miwa, Jouji:
"Internet training system for listening and pronunciation of Chinese stop consonants",
vol. 1, 782-785.
Ishi, Carlos Toshinori / Hirose, Keikichi / Minematsu, Nobuaki:
"Identification of Japanese double-mora phonemes considering speaking rate for the use in CALL systems",
vol. 1, 786-790.
Speech Perception, Comprehension, and Production (Special Session)
Patterson, Roy D. / Uppenkamp, Stefan / Norris, Dennis / Marslen-Wilson, William / Johnsrude, Ingrid / Williams, Emma:
"Phonological processing in the auditory system: a new class of stimuli and advances in fmri techniques",
vol. 2, 1-4.
Tatsumi, Itaru F. / Senda, Michio / Ishii, Kenji / Mishina, Masahiro / Oyama, Masashi / Toyama, Hinako / Oda, Keiichi / Tanaka, Masayuki / Gondo, Yasuyuki:
"Brain regions responsible for word retrieval, speech production and deficient word fluency in elderly people: a PET activation study",
vol. 2, 5-10.
Alku, Paavo / Tiitinen, Hannu / Palomäki, Kalle J. / Sivonen, Päivi:
"MEG-measurements of brain activity reveal the link between human speech production and perception",
vol. 2, 11-14.
Patterson, Karalyn / Ralph, Matthew A. Lambon / Bird, Helen / Hodges, John R. / McClelland, James L.:
"Normal and impaired processing in quasi-regular domains
of language: the case of English past-tense verbs",
vol. 2, 15-19.
Martin, Nadine / Saffran, Eleanor M. / Dell, Gary S. / Schwartz, Myrna F. / Gupta, Prahlad:
"Neuropsychological and computational evidence for a model of lexical processing, verbal short-term memory and learning",
vol. 2, 20-25.
Fushimi, Takao / Ijuin, Mutsuo / Sakuma, Naoko / Tanaka, Masayuki / Kondo, Tadahisa / Amano, Shigeaki / Patterson, Karalyn / Tatsumi, Itaru F.:
"Normal and impaired reading of Japanese kanji and kana",
vol. 2, 26-31.
Ijuin, Mutsuo / Fushimi, Takao / Patterson, Karalyn / Sakuma, Naoko / Tanaka, Masayuki / Tatsumi, Itaru / Kondo, Tadahisa / Amano, Shigeaki:
"A connectionist approach to naming disorders of Japanese in dyslexic patients",
vol. 2, 32-37.
Wydell, Taeko N. / Shinkai, Takako:
"Impaired pronunciations of kanji words by Japanese CVA patients",
vol. 2, 38-41.
Uno, Akira / Kaneko, M. / Haruhara, N. / Kaga, M.:
"Disability of phonological versus visual information processes in Japanese dyslexic children",
vol. 2, 42-44.
Zhou, Xiaolin / Qu, Yanxuan:
"Lexical tone in the spoken word recognition of Chinese",
vol. 2, 45-50.
Prosody 1, 2
Zhou, Xiaolin / Zhuang, Jie:
"Lexical tone in the speech production of Chinese words",
vol. 2, 51-54.
Hu, Yu / Liu, Qin-Feng / Wang, Ren-Hua:
"Prosody generation in Chinese synthesis using the template of quantified prosodic unit and base intonation contour",
vol. 2, 55-58.
Chen, Yiqiang / Gao, Wen / Zhu, Tingshao / Ma, Jiyong:
"Multi-strategy data mining on Mandarin prosodic patterns",
vol. 2, 59-62.
Verhelst, Werner / Compernolle, Dirk van / Wambacq, Patrick:
"A unified view on synchronized overlap-add methods for prosodic modifications of speech",
vol. 2, 63-66.
Shih, Chilin / Kochanski, Greg P.:
"Chinese tone modeling with stem-ML",
vol. 2, 67-70.
Wightman, Colin W. / Syrdal, Ann K. / Stemmer, Georg / Conkie, Alistair / Beutnagel, Mark:
"Perceptually based automatic prosody labeling and prosodically enriched unit selection improve concatenative text-to-speech synthesis",
vol. 2, 71-74.
Müller, Achim F. / Tao, Jianhua / Hoffmann, Rüdiger:
"Data-driven importance analysis of linguistic and phonetic information",
vol. 2, 75-78.
Li, Zhiqiang / Banksira, Degif Petros:
"Tonal structure of yes-no question intonation in chaha",
vol. 2, 79-82.
Wang, Chao / Seneff, Stephanie:
"Improved tone recognition by normalizing for coarticulation and intonation effects",
vol. 2, 83-86.
Zhang, Jin-Song / Nakamura, Satoshi / Hirose, Keikichi:
"Discriminating Chinese lexical tones by anchoring F0 features",
vol. 2, 87-90.
Gussenhoven, Carlos / Chen, Aoju:
"Universal and language-specific effects in the perception of question intonation",
vol. 2, 91-94.
Tseng, Chiu-Yu / Chen, Da-De:
"The interplay and interaction between prosody and syntax: evidence from Mandarin Chinese",
vol. 2, 95-97.
Mixdorff, Hansjörg / Fujisaki, Hiroya:
"A quantitative description of German prosody offering symbolic labels as a by-product",
vol. 2, 98-101.
Speech Interface and Dialogue Systems
Rosenfeld, Roni / Zhu, Xiaojin / Toth, Arthur / Shriver, Stefanie / Lenzo, Kevin / Black, Alan W.:
"Towards a universal speech interface",
vol. 2, 102-105.
Russell, Dale:
"A domain model centered approach to spoken language dialog systems",
vol. 2, 106-109.
Fafiotte, Georges / Zhai, Jian-She:
"From multilingual multimodal spoken language acquisition towards on-line assistance to intermittent human interpreting: SIM*, a versatile environment for SLP",
vol. 2, 110-113.
Denecke, Matthias:
"Informational characterization of dialogue states",
vol. 2, 114-117.
Abe, Kenji / Kurokawa, Kazushige / Taketa, Kazunari / Ohno, Sumio / Fujisaki, Hiroya:
"A new method for dialogue management in an intelligent system for information retrieval",
vol. 2, 118-121.
Levin, Esther / Narayanan, Shrikanth / Pieraccini, Roberto / Biatov, Konstantin / Bocchieri, E. / Fabbrizio, Giuseppe Di / Eckert, Wieland / Lee, S. / Pokrovsky, A. / Rahim, Mazin / Ruscitti, P. / Walker, M.:
"The AT&t-DARPA communicator mixed-initiative spoken dialog system",
vol. 2, 122-125.
Multimodal, Translingual, and Dialogue Systems
Bangalore, Srinivas / Johnston, Michael:
"Integrating multimodal language processing with speech recognition",
vol. 2, 126-129.
Rudnicky, Alexander I. / Bennett, Christina / Black, Alan W. / Chotimongkol, Ananlada / Lenzo, Kevin / Oh, Alice / Singh, Rita:
"Task and domain specific modelling in the Carnegie Mellon communicator system",
vol. 2, 130-134.
Gustafson, Joakim / Bell, Linda / Beskow, Jonas / Boye, Johan / Carlson, Rolf / Edlund, Jens / Granström, Björn / House, David / Wirén, Mats:
"Adapt - a multimodal conversational dialogue system in an apartment domain",
vol. 2, 134-137.
Wang, Kuansan:
"Implementation of a multimodal dialog system using extended markup languages",
vol. 2, 138-141.
Seneff, Stephanie / Chuu, Chian / Cyphers, D. Scott:
"ORION: from on-line interaction to off-line delegation",
vol. 2, 142-145.
Duan, Lei / Franz, Alexander / Horiguchi, Keiko:
"Practical spoken language translation using compiled feature structure grammars",
vol. 2, 146-149.
Meng, Helen / Chan, Shuk Fong / Wong, Yee Fong / Fung, Tien Ying / Tsui, Wai Ching / Lo, Tin Hang / Chan, Cheong Chat / Chen, Ke / Wang, Lan / Wu, Ting Yao / Li, Xiaolong / Lee, Tan / Choi, Wing Nin / Wong, Yiu Wing / Ching, P. C. / Chi, Huisheng:
"ISIS: A multilingual spoken dialog system developed with
CORBA and KQML agents",
vol. 2, 150-153.
Hirasawa, Jun-Ichi / Miyazaki, Noboru / Nakano, Mikio / Aikawa, Kiyoaki:
"New feature parameters for detecting misunderstandings in a spoken dialogue system",
vol. 2, 154-157.
Production of Spoken Language (Poster)
Mokhtari, Parham / Clermont, Frantz / Tanaka, Kazuyo:
"Toward an acoustic-articulatory model of inter-speaker variability",
vol. 2, 158-161.
Perrier, Pascal / Perkell, Joseph / Payan, Yohan / Zandipour, Majid / Guenther, Frank / Khalighi, Ali:
"Degrees of freedom of tongue movements in speech may be constrained by biomechanics",
vol. 2, 162-165.
Vaxelaire, Béatrice / Sock, Rudolph / Perrier, Pascal:
"Gestural overlap, place of articulation and speech rate - an x-ray investigation",
vol. 2, 166-169.
Honda, Masaaki / Fujino, Akinori:
"Articulatory compensation and adaptation for unexpected palate shape perturbation",
vol. 2, 170-173.
Niikawa, Takuya / Matsumura, Masafumi / Tachimura, Takashi / Wada, Takeshi:
"Modeling of a speech production system based on MRI measurement of three-dimensional vocal tract shapes during fricative consonant phonation",
vol. 2, 174-177.
Ouni, Slim / Laprie, Yves:
"Improving acoustic-to-articulatory inversion by using hypercube codebooks",
vol. 2, 178-181.
Hamza, Wael M. / Rashwan, Mohsen A.:
"Concatenative arabic speech synthesis using large speech database",
vol. 2, 182-185.
Chen, Dong / Kuang, Jingming / Zhang, Yan:
"A new speech classifier based on Yinyang compensatory soft computing theory",
vol. 2, 186-189.
Möller, Sebastian / Jekosch, Ute / Raake, Alexander:
"New models predicting conversational effects of telephone transmission on speech communication quality",
vol. 2, 190-193.
Li, Jinyu / Luo, Xin / Wang, Ren-Hua:
"A novel search algorithm for LSF VQ",
vol. 2, 194-197.
Maes, Stéphane H. / Chazan, Dan / Cohen, Gilad / Hoory, Ron:
"Conversational networking: conversational protocols for transport, coding, and control",
vol. 2, 198-201.
Ohmura, Hiroshi / Sasou, Akira / Tanaka, Kazuyo:
"A low bit rate speech coding method using a formant-articulatory parameter nomogram",
vol. 2, 202-205.
Li, Ning / Molyneux, Derek J. / Ho, Meau Shin / Cheetham, B. M. G.:
"Variable bit-rate sinusoidal transform coding using variable order spectral estimation",
vol. 2, 206-209.
Choi, Yong-Soo / Ryu, Sueng-Kyun / Park, Young-Cheol / Youn, Dae-Hee:
"Efficient harmonic-CELP based hybrid coding of speech at low bit rates",
vol. 2, 210-213.
Jensen, Jesper / Hansen, John H. L.:
"Speech enhancement based on a constrained sinusoidal model",
vol. 2, 214-217.
Park, Sang-Wook / Ryu, Seung-Kyun / Park, Young-Cheol / Youn, Dae-Hee:
"A bark coherence function for perceived speech quality estimation",
vol. 2, 218-221.
Kiang, Jinyu / Deng, Kun / Huang, Ronghuai:
"A high-efficiency scheme for secure speech transmission using spatiotemporal chaos synchronization",
vol. 2, 222-225.
Speaker, Dialect, and Language Recognition (Poster)
Rodríguez_Liñares, Leandro / García_Mateo, Carmen:
"Application of speaker authentication technology to a telephone dialogue system",
vol. 2, 226-229.
Dutat, Michel / Magrin-Chagnolleau, Ivan / Bimbot, Frédéric:
"Language recognition using time-frequency principal component analysis and acoustic modeling",
vol. 2, 230-233.
Tanprasert, Chularat / Achariyakulporn, Varin:
"Comparative study of GMM, DTW, and ANN on Thai speaker identification system",
vol. 2, 234-237.
Schwardt, Ludwig / Preez, Johan du:
"Efficient mixed-order hidden Markov model inference",
vol. 2, 238-241.
Thyes, Olivier / Kuhn, Roland / Nguyen, Patrick / Junqua, Jean-Claude:
"Speaker identification and verification using eigenvoices",
vol. 2, 242-245.
Surendran, Arun C. / Lee, Chin-Hui:
"A priori threshold selection for fixed vocabulary speaker verification systems",
vol. 2, 246-249.
Jin, Qin / Waibel, Alex:
"Application of LDA to speaker recognition",
vol. 2, 250-253.
Schwardt, Ludwig / Preez, Johan du:
"Automatic language identification using mixed-order HMMs and untranscribed corpora",
vol. 2, 254-257.
Lindberg, Johan / Blomberg, Mats:
"On the potential threat of using large speech corpora for impostor selection in speaker verification",
vol. 2, 258-261.
Ortega-Garcia, J. / Rodriguez, J. G. / Merino, D. T.:
"Phonetic consistency in Spanish for pin-based speaker verification system",
vol. 2, 262-265.
Liu, Zhimin / Wu, Xihong / Zhen, Bin / Chi, Huisheng:
"An auditory feature extraction method based on forward-masking and its application in robust speaker identification and speech recognition",
vol. 2, 266-269.
Peters, S. Douglas / Hébert, Matthieu / Boies, Daniel:
"Transition-oriented hidden Markov models for speaker verification",
vol. 2, 270-273.
Tsoi, Pang Kuen / Fung, Pascale:
"An LLR-based technique for frame selection for GMM-based text-independent speaker identification",
vol. 2, 274-277.
Ma, Jiyong / Gao, Wen:
"Robust speaker recognition based on high order cumulant",
vol. 2, 278-281.
Si, Luo / Hu, Qi Xiu:
"Two-stage speaker identification system based on VQ and NBDGMM",
vol. 2, 282-285.
Mariethoz, Johnny / Lindberg, Johan / Bimbot, Frédéric:
"A MAP approach, with synchronous decoding and unit-based normalization for text-dependent speaker verification",
vol. 2, 286-289.
Pan, Zhibin / Kotani, Koji / Ohmi, Tadahiro:
"A fast search method of speaker identification for large population using pre-selection and hierarchical matching",
vol. 2, 290-293.
Wang, Lan / Chen, Ke / Chi, Huisheng:
"Optimal fusion of diverse feature sets for speaker identification: an alternative method",
vol. 2, 294-297.
Chaudhari, Upendra V. / Navrátil, Jiri / Maes, Stéphane H. / Gopinath, Ramesh:
"Transformation enhanced multi-grained modeling for text-independent speaker recognition",
vol. 2, 298-301.
Masuko, Takashi / Tokuda, Keiichi / Kobayashi, Takao:
"Imposture using synthetic speech against speaker verification based on spectrum and pitch",
vol. 2, 302-305.
Parveen, Shahla / Qadeer, Abdul / Green, Phil:
"Speaker recognition with recurrent neural networks",
vol. 2, 306-309.
Itoh, Yoshiroh / Toyama, Jun / Shimbo, Masaru:
"Speaker feature extraction from pitch information based on spectral subtraction for speaker identification",
vol. 2, 310-313.
Tsai, Wei-Ho / Che, Chiwei / Chang, Wen-Whei:
"Text-independent speaker identification using Gaussian mixture bigram models",
vol. 2, 314-317.
Ezzaidi, Hassan / Rouat, Jean:
"Comparison of MFCC and pitch synchronous AM, FM parameters for speaker identification",
vol. 2, 318-321.
Faúndez-Zanu, Marcos / Slupinski, Adam:
"Speaker verification in mismatch training and testing conditions",
vol. 2, 322-325.
Uchibe, Toshiaki / Kuroiwa, Shingo / Higuchi, Norio:
"Determination of threshold for speaker verification using speaker adaptation gain in likelihood during training",
vol. 2, 326-329.
Liu, Mingkuan / Xu, Bo:
"Accent-specific Mandarin adaptation based on pronunciation modeling technology",
vol. 2, 330-333.
Prosody and Paralinguistics (Special Session)
Lee, Hyun Bok:
"In search of paralinguistic features",
vol. 2, 334-340.
Fant, Gunnar / Kruckenberg, Anita:
"A prominence based model of Swedish intonation",
vol. 2, 341-344.
Kasuya, Hideki / Yoshizawa, Masanori / Maekawa, Kikuo:
"Roles of voice source dynamics as a conveyer of paralinguistic features",
vol. 2, 345-348.
Maekawa, Kikuo / Kagomiya, Takayuki:
"Influence of paralinguistic information on segmental articulation",
vol. 2, 349-352.
Ohno, Sumio / Sugiyama, Yoshimitsu / Fujisaki, Hiroya:
"Analysis and modeling of the effect of paralinguistic information upon the local speech rate",
vol. 2, 353-356.
Cao, Jianfen:
"Rhythm of spoken Chinese - linguistic and paralinguistic evidences -",
vol. 2, 357-360.
Eda, Sanae:
"Identification and discrimination of syntactically and pragmatically contrasting intonation patterns by native and non-native speakers of standard Japanese",
vol. 2, 361-364.
Erickson, Donna / Abramson, Arthur / Maekawa, Kikuo / Kaburagi, Tokihiko:
"Articulatory characteristics of emotional utterances in spoken English",
vol. 2, 365-368.
Hirose, Keikichi / Minematsu, Nobuaki / Kawanami, Hiromichi:
"Analytical and perceptual study on the role of acoustic features in realizing emotional speech",
vol. 2, 369-372.
Mozziconacci, Sylvie J. L. / Hermes, Dik J.:
"Expression of emotion and attitude through temporal speech variations",
vol. 2, 373-378.
Scherer, Klaus R.:
"A cross-cultural investigation of emotion inferences from voice and speech: implications for speech technology",
vol. 2, 379-382.
Kang, Bong-Seok / Han, Chul-Hee / Lee, Sang-Tae / Youn, Dae-Hee / Lee, Chungyong:
"Speaker dependent emotion recognition using speech signals",
vol. 2, 383-386.
Generation and Synthesis of Spoken Language 1, 2
Morais, Edmilson S. / Taylor, Paul / Violaro, Fábio:
"Concatenative text-to-speech synthesis based on prototype waveform interpolation (a time frequency approach)",
vol. 2, 387-390.
Wang, Ren-Hua / Ma, Zhongke / Li, Wei / Zhu, Donglai:
"A corpus-based Chinese speech synthesis with contextual dependent unit selection",
vol. 2, 391-394.
Coorman, Geert / Fackrell, Justin / Rutten, Peter / Coile, Bert Van:
"Segment selection in the L&h Realspeak laboratory TTS system",
vol. 2, 395-398.
Lyu, Ren-yuan / Fu, Zhen-hong / Chiang, Yuang-chin / Liu, Hui-mei:
"A Taiwanese (min-nan) text-to-speech (TTS) system based on automatically generated synthetic units",
vol. 2, 399-402.
Yamada, Masayuki / Okutani, Yasuo / Fukada, Toshiaki / Aso, Takashi / Komori, Yasuhiro:
"Puretalk: a high quality Japanese text-to-speech system",
vol. 2, 403-406.
Law, Ka Man / Lee, Tan:
"Using cross-syllable units for Cantonese speech synthesis",
vol. 2, 407-410.
Black, Alan W. / Lenzo, Kevin A.:
"Limited domain synthesis",
vol. 2, 411-414.
Nakatani, Christine H. / Chu-Carroll, Jennifer:
"Coupling dialogue and prosody computation in spoken dialogue generation",
vol. 2, 415-418.
Takara, Tomio / Izumi, Kazuto / Funaki, Keiichi:
"A study on the pitch pattern of a singing voice synthesis system based on the cepstral method",
vol. 2, 419-422.
Pearson, Steve / Kuhn, Roland / Fincke, Steven / Kibre, Nick:
"Automatic methods for lexical stress assignment and syllabification",
vol. 2, 423-426.
Goubanova, Olga / Taylor, Paul:
"Using bayesian belief networks for model duration in text-to-speech systems",
vol. 2, 427-430.
Hirschfeld, Diane:
"Comparing static and dynamic features for segmental cost function calculation in concatenative speech synthesis",
vol. 2, 435-438.
Jain, Pratibha / Hermansky, Hynek:
"Temporal patterns of critical-band spectrum for text-to-speech",
vol. 2, 439-441.
Speaker, Dialect, and Language Recognition 1, 2
Choi, Eric H. C. / Song, Jianming:
"Successive cohort selection (SCS) for text-independent speaker verification",
vol. 2, 442-445.
Tran, Dat / Wagner, Michael:
"Fuzzy normalisation methods for speaker verification",
vol. 2, 446-449.
Gu, Yong / Jongebloed, Hans / Iskra, Dorota / Os, Els den / Boves, Lou:
"Speaker verification in operational environments - monitoring for improved service operation",
vol. 2, 450-453.
Heck, Larry P. / Mirghafori, Nikki:
"On-line unsupervised adaptation in speaker verification",
vol. 2, 454-457.
Sivakumaran, P. / Ariyaeeinia, A. M. / Hewitt, Jill A.:
"Multiple sub-band systems for speaker verification",
vol. 2, 458-461.
Liu, Xiaoxing / Yuan, Baosheng / Yan, Yonghong:
"An orthogonal GMM based speaker verification system",
vol. 2, 462-465.
Jin, Qin / Waibel, Alex:
"A nave de-lambing method for speaker identification",
vol. 2, 466-469.
Reynolds, Douglas A. / Dunn, R. Bob / McLaughlin, Jack L.:
"The lincoln speaker recognition system: NIST eval2000",
vol. 2, 470-473.
Rosenberg, Aaron E. / Parthasarathy, S. / Hirschberg, Julia / Whittaker, Stephen:
"Foldering voicemail messages by caller using text independent speaker recognition",
vol. 2, 474-478.
Montacié, Claude / Caraty, Marie-José:
"Structural framework for combining speaker recognition methods",
vol. 2, 479-482.
Andrews, Walter D. / Campbell, Joseph P. / Reynolds, Douglas A.:
"Bootstrapping for speaker recognition",
vol. 2, 483-486.
Zhen, Bin / Wu, Xihong / Liu, Zhimin / Chi, Huisheng:
"On the importance of components of the MFCC in speech and speaker recognition",
vol. 2, 487-490.
Quatieri, Thomas F. / Dunn, R. Bob / Reynolds, Douglas A.:
"On the influence of rate, pitch, and spectrum on automatic speaker recognition performance",
vol. 2, 491-494.
Teunen, Remco / Shahshahani, Ben / Heck, Larry:
"A model-based transformational approach to robust speaker recognition",
vol. 2, 495-498.
Linguistics, Phonology, Phonetics, and Psycholinguistics (Poster)
Miller-Ockhuizen, Amanda / Sands, Bonny E.:
"Contrastive lateral clicks and variation in click types",
vol. 2, 499-502.
Matsui, Tomoko / Naito, Masaki / Sagisaka, Yoshinori / Okuda, Kozo / Nakamura, Satoshi:
"Analysis of acoustic models trained on a large-scale Japanese speech database",
vol. 2, 503-506.
Bijankhan, Mahmood:
"Farsi vowel compensatory lengthening: an experimental approach",
vol. 2, 507-510.
Wang, Yue / Sereno, Joan A. / Jongman, Allard / Hirsch, Joy:
"Cortical reorganization associated with the acquisition of Mandarin tones by american learners: an FMRI study",
vol. 2, 511-514.
Whiteside, S. P. / Varley, R. A. / Phillips, T. / Garety, H.:
"The production of real and non-words in adult stutterers and non-stutterers: an acoustic study",
vol. 2, 515-518.
Shimizu, Masaaki / Dantsuji, Masatake:
"A new proposal of laryngeal features for the tonal system of Vietnamese",
vol. 2, 519-522.
Zhang, Hong / Xu, Bo / Huang, Taiyi:
"How to choose training set for language modeling",
vol. 2, 523-526.
Cosi, Piero / Hosom, John-Paul:
"High performance "general purpose" phonetic recognition for Italian",
vol. 2, 527-530.
López_de_Ipiña, Miren Karmele / Torres, In / Oñederra, Lourdes
/ Varona, Amparo / Ezeiza, N. / Peñagarikano, M. / Hernandez, M. /
Rodriguez, Luis Javier:
"First approach to the selection of lexical units for continuous speech recognition of Basque",
vol. 2, 531-534.
Gow Jr., David W.:
"Assimilation, ambiguity, and the feature parsing problem",
vol. 2, 535-538.
Kajarakar, Sachin S. / Hermansky, Hynek:
"Optimization of units for continuous-digit recognition task",
vol. 2, 539-542.
Vasilescu, Ioana / Pellegrino, Francois / Hombert, Jean-Marie:
"Perceptual features for the identification of Romance languages",
vol. 2, 543-546.
Behne, Dawn M. / Czigler, Peter E. / Sullivan, Kirk P. H.:
"Perception of Swedish vowel quantity: tracing late stages of development",
vol. 2, 547-550.
Chotimongkol, Ananlada / Black, Alan W.:
"Statistically trained orthographic to sound models for Thai",
vol. 2, 551-554.
Fon, Janice / Johnson, Keith:
"Speech timing patterning as an indicator of discourse and syntactic boundaries",
vol. 2, 555-558.
Arvaniti, Amalia / Tserdanelis, Georgios:
"On the phonetics of geminates: evidence from Cypriot Greek",
vol. 2, 559-562.
Ouden, Hanny den / Wijk, Carel van / Swerts, Marc:
"A simple procedure to clarify the relation between text and prosody",
vol. 2, 563-566.
Tsukada, Kimiko:
"Effects of consonantal voicing on English diphthongs: a comparison of L1 and L2 production",
vol. 2, 567-570.
Ward, Nigel:
"The challenge of non-lexical speech sounds",
vol. 2, 571-574.
El-Imam, Yousif A.:
"A method to synthesize Arabic from short phonetic",
vol. 2, 575-578.
Schramm, Mauricio C. / Freitas, Luis Felipe R. / Zanuz, Adriano / Barone, Dante:
"A brazilian portuguese language corpus development",
vol. 2, 579-582.
Colin, C. / Radeau, Monique / Demolin, Didier / Soquet, A.:
"Visual lipreading of voicing for French stop consonants",
vol. 2, 583-586.
Chen, Yang / Robb, Michael:
"Acoustic features of vowel production in Mandarin speakers of English",
vol. 2, 587-590.
Belvin, Robert / Burns, Ron / Hein, Cheryl:
"Spoken language navigation systems for drivers",
vol. 2, 591-594.
Chen, Fang / Yuan, Baozong:
"An approach to intelligent Chinese dialogue system",
vol. 2, 595-598.
Wang, Huei-Ming / Lin, Yi-Chung:
"Goal-oriented table-driven design for dialogue manager",
vol. 2, 599-602.
Potamianos, Alexandros / Ammicht, Egbert / Kuo, Hong-Kwang J.:
"Dialogue management in the Bell Labs communicator system",
vol. 2, 603-606.
Han, Jiang / Wang, Yong:
"Dialogue management based on a hierarchical task structure",
vol. 2, 607-610.
Caspers, Johanneke:
"Melodic characteristics of backchannels in Dutch map task dialogues",
vol. 2, 611-614.
Swerts, Marc / Litman, Diane / Hirschberg, Julia:
"Corrections in spoken dialogue systems",
vol. 2, 615-618.
Fry, John:
"F0 correlates of topic and subject in spontaneous Japanese speech",
vol. 2, 619-622.
Tomokiyo, Mutsuko / Hollard, Solange:
"Specification of communicative acts of utterances based on dialogue corpus analysis",
vol. 2, 623-627.
Noguchi, Hiroaki / Katagiri, Yasuhiro / Den, Yasuharu:
"An experimental verification of the prosodic/lexical effects on the occurrence of backchannels",
vol. 2, 628-631.
Sato, Tsutomu / Maidment, John A.:
"The acoustic characteristics of Japanese identical vowel sequences in connected speech",
vol. 2, 632-635.
Spoken and Multi-Modal Dialogue Systems
Narayanan, Shrikanth / Fabbrizio, Giuseppe Di / Kamm, C. / Hubbell, James / Buntschuh, B. / Ruscitti, P. / Wright, Jerry H.:
"Effects of dialog initiative and multi-modal presentation strategies on large directory information access",
vol. 2, 636-639.
Thompson, William / Bliss, Harry:
"A declarative framework for building compositional dialog modules",
vol. 2, 640-643.
Wang, Kuansan:
"A plan-based dialog system with probabilistic inferences",
vol. 2, 644-647.
Komatani, Kazunori / Kawahara, Tatsuya:
"Generating effective confirmation and guidance using two-level confidence measures for dialogue systems",
vol. 2, 648.
Ström, Nikko / Seneff, Stephanie:
"Intelligent barge-in in conversational systems",
vol. 2, 652-655.
Breen, Andrew / Eggleton, Barry / Churcher, Gavin / Deans, Paul / Downey, Simon:
"A system for the research into multi-modal man-machine communication within a virtual environment",
vol. 2, 656-659.
Brugnara, Fabio / Cettolo, Mauro / Federico, Marcello / Giuliani, Diego:
"Advances in automatic transcription of Italian broadcast news",
vol. 2, 660-663.
Chuang, Shui-Lung / Pu, Hsiao-Tieh / Lu, Wen-Hsiang / Chien, Lee-Feng:
"Live thesaurus construction for interactive voice-based web search",
vol. 2, 664-667.
Suzuki, Yoshimi / Fukumoto, Fumiyo / Sekiguchi, Yoshihiro:
"Selecting TV news stories and newswire articles related to a target article of newswire using SVM",
vol. 2, 668-671.
Ng, Kenney:
"Towards an integrated approach for spoken document retrieval",
vol. 2, 672-675.
Logan, Beth / Moreno, Pedro / Thong, Jean-Manuel van / Whittaker, Ed:
"An experimental study of an audio indexing system for the web",
vol. 2, 676-679.
Jin, Rong / Hauptmann, Alex G.:
"Title generation for spoken broadcast news using a training corpus",
vol. 2, 680-683.
Weber, Manfred / Kemp, Thomas:
"Evaluating different information retrieval algorithms on real-world data",
vol. 2, 684-687.
Koumpis, Konstantinos / Renals, Steve:
"Transcription and summarization of voicemail speech",
vol. 2, 688-691.
Tsai, W. C. / Chu, Y. C.:
"Robust rejection for embedded systems",
vol. 2, 692-695.
Oviatt, Sharon:
"Multimodal signal processing in naturalistic noisy environments",
vol. 2, 696-699.
Chai, Joyce / Levesque, Sylvie / Budzikowska, Margorzata / Horvath, Veronika / Kambhatla, Nanda / Nicolov, Nicolas / Zadrozny, Wlodek:
"A multi-modal dialog system for business transactions",
vol. 2, 700-703.
Han, Jiang / Yan, Yonghong / Lin, Zhiwei / Wang, Yong / Liu, Jian / Liu, Danjun / Wang, Zhihui:
"Office message center - a spoken dialogue system",
vol. 2, 704-706.
Miyazaki, Noboru / Hirasawa, Jun-ichi / Nakano, Mikio / Aikawa, Kiyoaki:
"A new method for understanding sequences of utterances by multiple speakers",
vol. 2, 707-710.
Kikuchi, Hideaki / Shirai, Katsuhiko:
"Improvement of dialogue efficiency by dialogue control model according to performance of processes",
vol. 2, 711-714.
Wang, C. / Cyphers, D. Scott / Mou, Xiaolong / Polifroni, Joseph / Seneff, Stephanie / Yi, J. / Zue, Victor:
"MUXING: a telephone-access Mandarin conversational system",
vol. 2, 715-718.
Turunen, Markku / Hakulinen, Jaakko:
"Jaspis - a framework for multilingual adaptive speech applications",
vol. 2, 719-722.
Pellom, Bryan / Ward, Wayne / Pradhan, Sameer:
"The CU communicator: an architecture for dialogue systems",
vol. 2, 723-726.
Bilici, Vildan / Krahmer, Emiel / Riele, Saskia te / Veldhuis, Raymond:
"Preferred modalities in dialogue systems",
vol. 2, 727-730.
Béchet, Fréderic / Os, Elisabeth den / Boves, Lou / Sienel, Jürgen:
"Introduction to the IST-HLT project speech-driven multimodal automatic directory assistance (SMADA)",
vol. 2, 731-734.
Mao, Crusoe / Tuo, Tony / Liu, Danjun:
"Using HPSG to represent multi-modal grammar in multi-modal dialogue",
vol. 2, 735-738.
Dohsaka, Kohji / Yasuda, Norihito / Miyazaki, Noboru / Nakano, Mikio / Aikawa, Kiyoaki:
"An efficient dialogue control method under system²s limited knowledge",
vol. 2, 739-742.
Cheng, Ying / Gupta, Anurag / Lee, Raymond:
"A distributed spoken user interface based on open agent architecture (OAA)",
vol. 2, 743-746.
Speech, Facial Expression, and Gesture
Chu, Stephen M. / Huang, Thomas S.:
"Bimodal speech recognition using coupled hidden Markov models",
vol. 2, 747-750.
Ma, Jiyong / Gao, Wen:
"A parallel multi-stream model for sign language recognition",
vol. 2, 751-754.
Revéret, Lionel / Bailly, Gérard / Badin, Pierre:
"MOTHER: a new generation of talking heads providing a flexible articulatory control for video-realistic speech animation",
vol. 2, 755-758.
Minnis, Steve / Breen, Andrew:
"Modeling visual coarticulation in synthetic talking heads using a lip motion unit inventory with concatenative synthesis",
vol. 2, 759-762.
Generation and Synthesis of Spoken Language 3
Wu, Hua / Huang, Taiyi / Xu, Bo:
"A generation system for Chinese texts",
vol. 2, 763-767.
Seneff, Stephanie / Polifroni, Joseph:
"Formal and natural language generation in the Mercury conversational system",
vol. 2, 767-770.
Saito, Takashi / Sakamoto, Masaharu:
"A method of creating a new speaker²s voicefont in a text-to-speech system",
vol. 2, 771-774.
Huang, Jun / Levinson, Stephen / Hasegawa-Johnson, Mark:
"Signal approximation in Hilbert space and its application on articulatory speech synthesis",
vol. 2, 775-778.
Minematsu, Nobuaki / Nakagawa, Seiichi:
"Quality improvement of PSOLA analysis-synthesis using partial zero-phase conversion",
vol. 2, 779-782.
Lindgren, Hanna / Granberg, Jessica:
"A machine learning approach to Swedish word pronunciation",
vol. 2, 783-786.
Ohtsuka, Takahiro / Kasuya, Hideki:
"An improved speech analysis-synthesis algorithm based on the autoregressive with exogenous input speech production model",
vol. 2, 787-790.
Speaker, Dialect, and Language Recognition 3
Yuo, Kuo-Hwei / Hwang, Tai-Hwei / Wang, Hsiao-Chuan:
"Combination of temporal trajectory filtering and projection measure for robust speaker identification",
vol. 2, 791-794.
Zhao, Yunxin / Zhang, Xiao / He, Xiaodong / Schopp, Laura:
"A combined adaptive and decision tree based speech separation technique for telemedicine applications",
vol. 2, 795-798.
Bellot, Olivier / Matrouf, Driss / Merlin, Teva / Bonastre, Jean-François:
"Additive and convolutional noises compensation for speaker recognition",
vol. 2, 799-802.
Beaugendre, Frédéric / Claes, Tom / Hamme, Hugo van:
"Dialect adaptation for Mandarin Chinese speech recognition",
vol. 2, 803-806.
Scherer, Klaus R. / Johnstone, Tom / Klasmeyer, Gudrun / Bänziger, Thomas:
"Can automatic speaker verification be improved by training the algorithms on emotional speech?",
vol. 2, 807-810.
Wang, Zhong-Hua / Wu, Cheng / Lubensky, David:
"New distance measures for text-independent speaker identification",
vol. 2, 811-814.
Miscellaneous Topics 2 [M,J]
Zhao, Fengguang / Raghavan, Prabhu / Gupta, Sunil K. / Lu, Ziyi / , Wentao Gu (1) / Gu, Wentao:
"Automatic speech recognition in Mandarin for embedded platforms",
vol. 2, 815-818.
Li, Husheng / Liu, Jia / Liu, Runsheng:
"Confidence measure based unsupervised speaker adaptation",
vol. 2, 819-822.
Macías-Guarasa, Javier / Ferreiros, Javier / Colás, José / Gallardo-Antolín, A. / Pardo, Juan Manuel:
"Improved variable preselection list length estimation using NNs in a large vocabulary telephone speech recognition system",
vol. 2, 823-826.
Gallardo-Antolín, Ascensión / Ferreiros, Javier / Macías-Guarasa, Javier / Córdoba, R. de / Pardo, Juan Manuel:
"Incorporating multiple-HMM acoustic modeling in a modular large vocabulary speech recognition system in telephone environment",
vol. 2, 827-830.
Suontausta, Janne / Häkkinen, Juha:
"Decision tree based text-to-phoneme mapping for speech recognition",
vol. 2, 831-834.
Meunier, Jeff:
"Reduced traceback matrix storage for small footprint model alignment",
vol. 2, 835-838.
Vair, Claudio / Fissore, Luciano / Laface, Pietro:
"Dynamic adaptation of vocabulary independent HMMs to an application environment",
vol. 2, 839-842.
Gemello, Roberto / Moisa, Loreta / Laface, Pietro:
"Synergy of spectral and perceptual features in multi-source connectionist speech recognition",
vol. 2, 843-846.
Hariharan, Ramalingam / Viikki, Olli:
"High performance connected digit recognition through gender-dependent acoustic modelling and vocal tract length normalisation",
vol. 2, 847-850.
Eide, Ellen / Maison, Benoît / Kanevsky, D. / Olsen, P. / Chen, S. / Mangu, L. / Gales, M. / Novak, Miroslav / Gopinath, Ramesh:
"Transcription of broadcast news with a time constraint: IBM’s 10xRT HUB4 system",
vol. 2, 851-854.
Zweig, Geoffrey / Padmanabhan, Mukund:
"Exact alpha-beta computation in logarithmic space with application to MAP word graph construction",
vol. 2, 855-858.
Yamamoto, Kazumasa / Nakagawa, Seiichi:
"Relationship among speaking style, inter-phoneme's distance and speech recognition performance",
vol. 2, 859-862.
San-Segundo, Ruben / Colás, José / Ferreiros, Javier / Macías-Guarasa, Javier / Pardo, Juan Miguel:
"Spanish recogniser of continuously spelled names over the telephone",
vol. 2, 863-866.
Seide, Frank / Wang, Nick J.C.:
"Two-stream modeling of Mandarin tones",
vol. 2, 867-870.
Seyyed Salehi, Seyyed Ali:
"A neural network speech recognizer based on the both acoustic steady portions and transitions",
vol. 2, 871-874.
Hofmann, Marc / Lang, Manfred:
"Belief networks for a syntactic and semantic analysis of spoken utterances for speech understanding",
vol. 2, 875-878.
Sun, Jiping / Togneri, Roberto / Deng, Li:
"A robust speech understanding system using conceptual relational grammar",
vol. 2, 879-882.
Lau, Wai / Lee, Tan / Wong, Yiu Wing / Ching, P. C.:
"Incorporating tone information into Cantonese large-vocabulary continuous speech recognition",
vol. 2, 883-886.
Kaiser, Janez / Horvat, Bogomir / Kacic, Zdravko:
"A novel loss function for the overall risk criterion based discriminative training of HMM models",
vol. 2, 887-890.
Maucec, Mirjam Sepesy / Kacic, Zdravko / Horvat, Bogomir:
"Looking for topic similarities of highly inflected languages for language model adaptation",
vol. 2, 891-894.
Janiszek, David / Béchet, Frédéric / Mori, Renato De:
"Integrating MAP and linear transformation for language model adaptation",
vol. 2, 895-898.
Tan, Beng Tiong / Gu, Yong / Thomas, Trevor:
"Utterance verification based speech recognition system",
vol. 2, 899-902.
Chengalvarayan, Rathinavelu:
"Use of linear extrapolation based linear predictive cepstral features (LE-LPCC) for Tamil speech recognition",
vol. 2, 903-906.
Atake, Yoshinori / Irino, Toshio / Kawahara, Hideki / Lu, Jinlin / Nakamura, Satoshi / Shikano, Kiyohiro:
"Robust fundamental frequency estimation using instantaneous frequencies of harmonic components",
vol. 2, 907-910.
Varona, Amparo / Torres, In / López de Ipiña, Miren Karmele / Rodriguez, Luis Javier:
"Integrating different acoustic and syntactic language models in a continuous speech recognition system",
vol. 2, 911-914.
Schwenk, Holger / Gauvain, Jean-Luc:
"Combining multiple speech recognizers using voting and language model information",
vol. 2, 915-918.
Watanabe, Keisuke / Ishikawa, Yasushi:
"Dialogue management based on inferred behavioral goal - improving the accuracy of understanding by dialogue context -",
vol. 2, 919-922.
Schlüter, Ralf / Wessel, Frank / Ney, Hermann:
"Speech recognition using context conditional word posterior probabilities",
vol. 2, 923-926.
Meinedo, Hugo / Neto, Joao P.:
"The use of syllable segmentation information in continuous speech recognition hybrid systems applied to the Portuguese language",
vol. 2, 927-930.
Meinedo, Hugo / Neto, Joao P.:
"Combination of acoustic models in continuous speech recognition hybrid systems",
vol. 2, 931-934.
Leeuwen, David A. van / Wijngaarden, Sander J. van:
"Automatic speech recognition of non-native speakers using consonant-vowel-consonant (CVC) words",
vol. 2, 935-938.
Zhao, Gang / Xu, Hong:
"Understanding Chinese in spoken dialogue systems",
vol. 2, 939-942.
Berthommier, Frédéric / Glotin, Hervé / Tessier, Emmanuel:
"A front-end using the harmonicity cue for speech enhancement in loud noise",
vol. 2, 943-946.
Zhou, Qiru / Kosenko, Sergey:
"Lucent automatic speech recognition: a speech recognition engine for internet and telephony srvice applications",
vol. 2, 947-950.
Stephenson, Todd A. / Bourlard, Hervé / Bengio, Samy / Morris, Andrew C.:
"Automatic speech recognition using dynamic bayesian networks with both acoustic and articulatory variables",
vol. 2, 951-954.
Das, Subrata / Lubensky, David:
"Towards robust telephony speech recognition in office and automobile environments",
vol. 2, 955-958.
Kojima, Hiroaki / Tanaka, Kazuyo:
"Extracting phonological chunks based on piecewise linear segment lattices",
vol. 2, 959-962.
Galescu, Lucian / Allen, James:
"Evaluating hierarchical hybrid statistical language models",
vol. 2, 963-966.
Ogata, Jun / Ariki, Yasuo:
"An efficient lexical tree search for large vocabulary continuous speech recognition",
vol. 2, 967-970.
Jia, Bin / Zhu, Xiaoyan / Luo, Yupin / Hu, Dongcheng:
"Reliability evaluation of speech recognition in acoustic modeling",
vol. 2, 971-974.
Xu, Ching X.:
"Using GMM for voiced/voiceless segmentation and tone decision in Mandarin continuous speech recognition",
vol. 2, 975-978.
Yim, Chi H. / Au, Oscar C. / Wan, Wanggen / Keung, Cyan L. / Fung, Carrson C.:
"Auditory spectrum based features (ASBF) for robust speech recognition",
vol. 2, 979-982.
Chang, Eric / Zhou, Jianlai / Di, Shuo / Huang, Chao / Lee, Kai-Fu:
"Large vocabulary Mandarin speech recognition with different approaches in modeling tones",
vol. 2, 983-986.
Georgila, Kalirroi / Sgarbas, Kyriakos / Fanotakis, Nikos / Kokkinakis, George:
"Fast very large vocabulary recognition based on compact DAWG-structured language models",
vol. 2, 987-990.
Eklund, Robert:
"Crosslinguistic disfluency modeling: a comparative analysis of Swedish and tok pisin human-human ATIS dialogues",
vol. 2, 991-994.
Terashima, Shiro / Takeda, Kazuya / Itakura, Fumitada:
"Vector space representation of language probabilities through SVD of n-gram matrix",
vol. 2, 995-998.
Kato, Yoshihide / Matsubara, Shigeki / Toyama, Katsuhiko / Inagaki, Yasuyoshi:
"Spoken language parsing based on incremental disambiguation",
vol. 2, 999-1002.
Shimodaira, Hiroshi / Kato, Yutaka / Akae, Toshihiko / Nakai, Mitsuru / Sagayama, Shigeki:
"Jacobian adaptation of HMM with initial model selection for noisy speech recognition",
vol. 2, 1003-1006.
Shu, Han / Wooters, Chuck / Kimball, Owen / Colthurst, Thomas / Richardson, Fred / Matsoukas, Spyros / Gish, Herbert:
"The BBN Byblos 2000 conversational Mandarin LVCSR system",
vol. 2, 1007-1010.
Colthurst, Thomas / Kimball, Owen / Richardson, Fred / Shu, Han / Wooters, Chuck / Iyer, Rukmini / Gish, Herbert:
"The 2000 BBN Byblos LVCSR system",
vol. 2, 1011-1014.
Chen, Langzhou / Lamel, Lori / Adda, Gilles / Gauvain, Jean-Luc:
"Broadcast news transcription in Mandarin",
vol. 2, 1015-1018.
Li, Yang / Zhang, Tong / Levinson, Stephen E.:
"Word concept model: a knowledge representation for dialogue agents",
vol. 2, 1019-1022.
Miyajima, Chiyomi / Tokuda, Keiichi / Kitamura, Tadashi:
"Audio-visual speech recognition using MCE-based hmms and model-dependent stream weights",
vol. 2, 1023-1026.
Nanjo, Hiroaki / Lee, Akinobu / Kawahara, Tatsuya:
"Automatic diagnosis of recognition errors in large vocabulary continuous speech recognition systems",
vol. 2, 1027-1030.
Chiang, Yuang-Chin / Yang, Zhi-Siang / Lyu, Ren-Yuan:
"Taiwanese corpus collection via continuous speech recognition tool",
vol. 2, 1031-1034.
Yuan, Baosheng / Zhao, Qingwei / Guo, Qing / Zhang, Xiangdong / Lin, Zhiwei:
"Optimal maximum likelihood on phonetic decision tree acoustic model for LVCSR",
vol. 2, 1035-1038.
Markov, Konstantin P. / Nakamura, Satoshi:
"Frame level likelihood transformations for ASR and utterance verification",
vol. 2, 1038-1041.
Hazen, Timothy J. / Burianek, Theresa / Polifroni, Joseph / Seneff, Stephanie:
"Integrating recognition confidence scoring with language understanding and dialogue modeling",
vol. 2, 1042-1045.
Yu, Yibiao / Zhao, Heming:
"Speech recognition based on estimation of mutual information",
vol. 2, 1046-1049.
Guo, Qing / Yan, Yonghong / Lin, Zhiwei / Yuan, Baosheng / Zhao, Qingwei / Liu, Jian:
"Keyword spotting in auto-attendant system",
vol. 2, 1050-1052.
Ren, Weimin / Wang, Chengfa / Gao, Wen / Xu, Jinpei:
"A new approach for modeling OOV words",
vol. 2, 1053-1056.
El Méliani, Rachida / O'Shaughnessy, Douglas:
"Speech recognition using error spotting",
vol. 2, 1057-1060.
Yang, Chung-Ho / Hsieh, Ming-Shiun:
"Robust endpoint detection for in-car speech recognition",
vol. 2, 1061-1064.
Miwa, Jouji / Kumagai, Masaru:
"Internet speech analysis system using e-mail and web technology",
vol. 2, 1065-1068.
Loog, Marco / Haeb-Umbach, Reinhold:
"Multi-class linear dimension reduction by generalized Fisher criteria",
vol. 2, 1069-1072.
Holmes, Wendy J.:
"Improving the representation of time structure in front-ends for automatic speech recognition",
vol. 2, 1073-1076.
Kirchhoff, Katrin:
"Speech analysis by rule extraction from trained artificial neural networks",
vol. 2, 1077-1080.
Venugopal, Jaishree / Zahorian, Stephen A. / Karnjanadecha, Montri:
"Minimum mean square error spectral peak envelope estimation for automatic vowel classification",
vol. 2, 1081-1084.
Keung, Cyan L. / Au, Oscar C. / Yim, Chi H. / Fung, Carrson C.:
"Probabilistic compensation of unreliable feature components for robust speech recognition",
vol. 2, 1085-1087.
Wang, Congxiu / Li, Qihu / Zhao, Guoying / Yin, Li / Hao, Shuai / Meng, Da:
"A new tone conversion method for Mandarin by an adaptive linear prediction analysis",
vol. 2, 1088-1091.
Trans-Modal and Multi-Modal Human-Computer Interaction (Special Session)
Oviatt, Sharon:
"Multimodal interface research: a science without borders",
vol. 3, 1-6.
Munhall, K. G. / Kroos, C. / Kuratate, T. / Lucero, J. / Pitermann, M. / Vatikiotis-Bateson, Eric / Yehia, H.:
"Studies of audiovisual speech perception using production-based animation",
vol. 3, 7-10.
Neti, Chalapathi / Iyengar, Giridharan / Potamianos, Gerasimos / Senior, A. / Maison, Benoit:
"Perceptual interfaces for information interaction: joint processing of audio and visual information for human-computer interaction",
vol. 3, 11-14.
Gao, Wen / Ma, Jiyong / Wang, Rui / Yao, Hongxun:
"Towards robust lipreading",
vol. 3, 15-19.
Nakamura, Satoshi / Ito, Hidetoshi / Shikano, Kiyohiro:
"Stream weight optimization of speech and lip image sequence for audio-visual speech recognition",
vol. 3, 20-24.
Sako, Shinji / Tokuda, Keiichi / Masuko, Takashi / Kobayashi, Takao / Kitamura, Tadashi:
"HMM-based text-to-audio-visual speech synthesis",
vol. 3, 25-28.
Hewitt, Jill / Bateman, Andi / Lambourne, Andrew / Ariyaeeinia, A. / Sivakumaran, P.:
"Real-time speech-generated subtitles: problems and solutions",
vol. 3, 29-32.
Huang, Xuedong / Acero, Alex / Chelba, C. / Deng, Li / Duchene, D. / Goodman, Joshua / Hon, H. / Jacoby, D. / Jiang, L. /
Loynd, R. / Mahajan, M. / Mau, P. / Meredith, S. / Mughal, S. / Neto, S. / Plumpe, Mike / Wang, K. / Wang, Y.:
"Mipad: a next generation PDA prototype",
vol. 3, 33-36.
Huang, Fei / Yang, Jie / Waibel, Alex:
"Dialogue management for multimodal user registration",
vol. 3, 37-40.
Bernstein, Lynne E.:
"Segmental optical phonetics for human and machine speech processing",
vol. 3, 43-46.
Thathong, Umavasee / Jitapunkul, Somchai / Ahkuputra, Visarut / Maneenoi, Ekkarit / Thampanitchawong, Boonchai:
"Classification of Thai consonant naming using Thai tone",
vol. 3, 47-50.
Signal Analysis, Processing, and Feature Extraction 1, 2
Li, Qi / Soong, Frank K. / Siohan, Olivier:
"A high-performance auditory feature for robust speech recognition",
vol. 3, 51-54.
Xia, Kun / Espy-Wilson, Carol:
"A new strategy of formant tracking based on dynamic programming",
vol. 3, 55-58.
Lu, Xugang / Li, Gang / Wang, Lipo:
"Dominant subspace analysis for auditory spectrum",
vol. 3, 59-62.
Potamitis, Ilyas / Fanotakis, Nikos / Kokkinakis, George:
"Spectral and cepstral projection bases constructed by independent component analysis",
vol. 3, 63-66.
Krstulovic, Sacha:
"Relating LPC modeling to a factor-based articulatory model",
vol. 3, 67-70.
Shire, Michael L. / Chen, Barry Y.:
"On data-derived temporal processing in speech feature extraction",
vol. 3, 71-74.
Saon, George / Padmanabhan, Mukund:
"Minimum Bayes error feature selection",
vol. 3, 75-78.
Ellis, Daniel P. W. / Bilmes, Jeff A.:
"Using mutual information to design feature combinations",
vol. 3, 79-82.
Choi, Seungjin / Hong, Heonseok / Glotin, Hervé / Berthommier, Frédéric:
"Multichannel signal separation for cocktail party speech recognition: a dynamic recurrent network",
vol. 3, 83-86.
Prasad, V. Kamakshi / Murthy, Hema A.:
"An automatic algorithm for segmenting and labelling a connected digit sequence",
vol. 3, 87-90.
Yan, Hui / Zhang, Xuegong / Li, Yanda / Shen, Liqin / Zhu, Weibin:
"The signal reconstruction of speech by KPCA",
vol. 3, 91-93.
Saruwatari, Hiroshi / Kurita, Satoshi / Takeda, Kazuya / Itakura, Fumitada / Shikano, Kiyohiro:
"Blind source separation based on subband ICA and beamforming",
vol. 3, 94-97.
Estienne, Claudio / Pelle, Patricia:
"A synchrony front-end using phase-locked-loop techniques",
vol. 3, 98-101.
Hernando, Javier:
"On the use of filter-bank energies driven from the autocorrelation sequence for noisy speech recognition",
vol. 3, 102-105.
Language Modeling
Bod, Rens:
"Combining semantic and syntactic structure for language modeling",
vol. 3, 106-109.
Goodman, Joshua / Gao, Jianfeng:
"Language model size reduction by pruning and clustering",
vol. 3, 110-113.
Wu, Jun / Khudanpur, Sanjeev:
"Efficient training methods for maximum entropy language modeling",
vol. 3, 114-118.
Deligne, Sabine:
"Statistical language modeling with a class based n-multigram model",
vol. 3, 119-122.
Tanigaki, Koichi / Yamamoto, Hirofumi / Sagisaka, Yoshinori:
"A hierarchical language model incorporating class-dependent word models for OOV words recognition",
vol. 3, 123-126.
Zheng, Fang / Wu, Jian / Wu, Wenhu:
"Input Chinese sentences using digits",
vol. 3, 127-130.
Acoustic Modeling
Richardson, Matt / Bilmes, Jeff / Diorio, Chris:
"Hidden-articulator Markov models: performance improvements and robustness to noise",
vol. 3, 131-134.
Sandness, Eric D. / Hetherington, I. Lee:
"Keyword-based discriminative training of acoustic models",
vol. 3, 135-138.
Goel, Vaibhava / Kumar, Shankar / Byrne, William:
"Segmental minimum Bayes-risk ASR voting strategies",
vol. 3, 139-142.
Nock, Harriet J. / Young, Steve J.:
"Loosely coupled HMMs for ASR",
vol. 3, 143-146.
Weber, Katrin / Bengio, Samy / Bourlard, Hervé:
"HMM2- a novel approach to HMM emission probability estimation",
vol. 3, 147-150.
Singh, Rita / Raj, Bhiksha / Stern, Richard M.:
"Structured redefinition of sound units by merging and splitting for improved speech recognition",
vol. 3, 151-154.
Arsigny, V. / Chollet, Gérard / Gravier, Guillaume / Sigelle, Marc:
"Speech modeling with state constrained Markov fields over frequency bands",
vol. 3, 155-158.
Prosody (Poster)
Zhu, Weibin / Shen, Liqin / Miu, Xiaochuan:
"Duration modeling for Chinese synthesis from C-toBI labeled corpus",
vol. 3, 159-162.
Wang, Bei / Zheng, Bo / Lu, Shinan / Cao, Jianfen / Yang, Yufang:
"The pitch movement of word stress in Chinese",
vol. 3, 163-166.
Watanabe, Michiko / Ishi, Carlos Toshinori:
"The distribution of fillers in lectures in the Japanese language",
vol. 3, 167-170.
Harnud, Huhe / Zheng, Yuling / Chen, Jiayou:
"Research on stress in bisyllsblic words of Mongolian",
vol. 3, 171-174.
Imoto, Kazunori / Dantsuji, Masatake / Kawahara, Tatsuya:
"Modelling of the perception of English sentence stress for computer-assisted language learning",
vol. 3, 175-178.
Buhmann, Jeska / Vereecken, Halewijn / Fackrell, Justin / Martens, Jean-Pierre / Coile, Bert van:
"Data driven intonation modelling of 6 languages",
vol. 3, 179-182.
Blin, Laurent / Edgington, Mike:
"Prosody prediction using a tree-structure similarity metric",
vol. 3, 183-186.
Teixeira, Carlos / Franco, Horacio / Shriberg, Elizabeth / Precoda, Kristin / Sönmez, Kemal:
"Prosodic features for automatic text-independent evaluation of degree of nativeness for language learners",
vol. 3, 187-190.
Minematsu, Nobuaki / Nakagawa, Seiichi:
"Instantaneous estimation of prosodic pronunciation habits for Japanese students to learn English pronunciation",
vol. 3, 191-194.
Ni, Jinfu / Hirose, Keikichi:
"Synthesis of fundamental FDrequency contours of standard Chinese sentences from tone sandhi and focus conditions",
vol. 3, 195-198.
Zu, Yiqing / Chan, Xiaoxia / Li, Aijun / Hua, Wu / Sun, Guohua:
"Syllable duration and its functions in standard Chinese discourse",
vol. 3, 199-202.
Holm, Bleicke / Bailly, Gérard:
"Generating prosody by superposing multi-parametric overlapping contours",
vol. 3, 203-206.
Veldhuis, Raymond:
"Consistent pitch marking",
vol. 3, 207-210.
Jun, Sun-Ah / Lee, Sook-Hyang / Kim, Keeho / Lee, Yong-Ju:
"Labeler agreement in transcribing korean intonation with K-toBI",
vol. 3, 211-214.
Hirose, Yukiyoshi / Ozeki, Kazuhiko / Takagi, Kazuyuki:
"Effectiveness of prosodic features in syntactic analysis of read Japanese sentences",
vol. 3, 215-218.
Banno, Mieko:
"A study of F0 declination in Japanese: towards a discourse model of prosodic structure",
vol. 3, 219-222.
Sakurai, Atsuhiro / Minematsu, Nobuaki / Hirose, Keikichi:
"Data-driven intonation modeling using a neural network and a command response model",
vol. 3, 223-226.
Erdem, Caglayan / Holzapfel, Martin / Hoffmann, Rüdiger:
"Natural F0 contours with a new neural-network-hybrid approach",
vol. 3, 227-230.
Fackrell, Justin / Vereecken, Halewijn / Buhmann, Jeska / Martens, Jean-Pierre / Coile, Bert Van:
"Prosodic variation with text type",
vol. 3, 231-234.
Syrdal, Ann K. / McGory, Julia:
"Inter-transcriber reliability of toBI prosodic labeling",
vol. 3, 235-238.
Kochanski, Greg P. / Shih, Chilin:
"Stem-ML: language-independent prosody description",
vol. 3, 239-242.
Dong, Minghui / Lua, Kim Teng:
"Using prosody database in Chinese speech synthesis",
vol. 3, 243-246.
Erickson, Donna / Maekawa, Kikuo / Hashi, Michiko / Dang, Jianwu:
"Some articulatory and acoustic changes associated with emphasis in spoken English",
vol. 3, 247-250.
Janse, Esther / Sennema, Anke / Slis, Anneke:
"Fast speech timing in Dutch: durational correlates of lexical stress and pitch accent",
vol. 3, 251-254.
Hiroshige, Makoto / Suzuki, Kantaro / Araki, Kenji / Tochinai, Koji:
"On perception of word-based local speech rate in Japanese without focusing attention",
vol. 3, 255-258.
Sakurai, Atsuhiro / Iwano, Koji / Hirose, Keikichi:
"Modeling and generation of accentual phrase F0 contours based on discrete HMMs synchronized at mora-unit transitions",
vol. 3, 259-262.
Louw, Philippa H. / Roux, Justus. C. / Botha, Elizabeth. C.:
"Synthesizing prosody for commands in a Xhosa TTS system",
vol. 3, 263-266.
Generation and Synthesis of Spoken Language (Poster)
Christogiannis, Costas / Stavroulas, Yiannis / Vamvakoulas, Yiannis / Varvarigou, Theodora / Zappa, Agatha / Shih, Chilin / Arvaniti, Amalia:
"Design and implementation of a Greek text-to-speech system based on concatenative synthesis",
vol. 3, 267-270.
Baptist, Lauren / Seneff, Stephanie:
"GENESIS-II: a versatile system for language generation in conversational system applications",
vol. 3, 271-274.
Kim, Eun-Kyoung / Oh, Yung-Hwan:
"New analysis method for harmonic plus noise model based on time-domain periodicity score",
vol. 3, 275-278.
Toda, Tomoki / Lu, Jinlin / Saruwatari, Hiroshi / Shikano, Kiyohiro:
"Straight-based voice conversion algorithm based on Gaussian mixture model",
vol. 3, 279-282.
Libossek, Marion / Schiel, Florian:
"Syllable-based text-to-phoneme conversion for German",
vol. 2, 283-286.
Hain, Horst-Udo:
"A hybrid approach for grapheme-to-phoneme conversion based on a combination of partial string matching and a neural network",
vol. 3, 291-294.
Tillmann, Hans G. / Pfitzinger, Hartmut R.:
"Parametric high definition (PHD) speech synthesis-by-analysis: the development of a fundamentally new system creating connected speech by modifying lexically-represented language units",
vol. 3, 295-297.
Kwon, Chul H. / Lee, Minkyu / Olive, Joseph P.:
"A new synthesis algorithm using phase information for TTS systems",
vol. 3, 298-301.
Wouters, Johan / Macon, Michael W.:
"Unit fusion for concatenative speech synthesis",
vol. 3, 302-305.
Lenzo, Kevin A. / Black, Alan W.:
"Diphone collection and synthesis",
vol. 3, 306-309.
Portele, Thomas:
"Natural language generation for spoken dialogue",
vol. 3, 310-313.
Conkie, Alistair / Beutnagel, Mark C. / Syrdal, Ann K. / Brown, Philip E.:
"Preselection of candidate units in a unit selection-based text-to-speech synthesis system",
vol. 3, 314-317.
Jensen, Kare Jean / Riis, Søren:
"Self-organizing letter code-book for text-to-phoneme neural network model",
vol. 3, 318-321.
Yi, Jon R. W. / Glass, James R. / Hetherington, I. Lee:
"A flexible, scalable finite-state transducer architecture for corpus-based concatenative speech synthesis",
vol. 3, 322-325.
Wang, Changfu / Fujisaki, Hiroya / Tomana, Ryou / Ohno, Sumio:
"Analysis of fundamental frequency contours of standard Chinese in terms of the command-response model and its application to synthesis by rule of intonation",
vol. 3, 326-329.
Hirai, Toshio / Tenpaku, Seiichi / Shikano, Kiyohiro:
"Manipulating speech pitch periods according to optimal insertion/deletion position in residual signal for intonation control in speech synthesis",
vol. 3, 330-333.
Mittrapiyanuruk, Pradit / Hansakunbuntheung, Chatchawarn / Tesprasit, Virongrong / Sornlertlamvanich, Virach:
"Improving naturalness of Thai text-to-speech synthesis by prosodic rule",
vol. 3, 334-337.
Xu, Dawei / Mori, Hiroki / Kasuya, Hideki:
"Word-level F0 range in Mandarin Chinese and its application to inserting words into a sentence",
vol. 3, 338-341.
Isogai, Mitsuaki / Tanaka, Kimihito / Takano, Satoshi / Mizuno, Hideyuki / Abe, Masanobu / Nakajima, Sin’ya:
"A new Japanese TTS system based on speech-prosody database and speech modification",
vol. 3, 342-345.
San-Segundo, Ruben / Montero, Juan Manuel / Córdoba, Ricardo de / Gutiérrez-Arriola, Juana:
"Stress assignment in Spanish proper names",
vol. 3, 346-349.
Niu, Zhengyu / Chai, Peiqi:
"Segmentation of prosodic phrases for improving the naturalness of synthesized Mandarin Chinese speech",
vol. 3, 350-353.
Liu, Xiaohu / O'Shaughnessy, Douglas:
"Practical language modeling: an interpolating method",
vol. 3, 354-357.
Li, Gongjun / Dong, Na / Ishikawa, Toshiro:
"Combination of different n-grams based on their different assumptions",
vol. 3, 358-361.
Kawaguchi, Nobuo / Matsubara, Shigeki / Iwa, Hiroyuki / Kajita, Shoji / Takeda, Kazuya / Itakura, Fumitada / Inagaki, Yasuyoshi:
"Construction of speech corpus in moving car environment",
vol. 3, 362-365.
Lee, Yue-Shi / Chen, Hsin-Hsi:
"Parsing spoken dialogues",
vol. 3, 366-369.
Lindberg, Børge / Johansen, Finn Tore / Warakagoda, Narada / Lehtinen, Gunnar / Kacic, Zdravko / Zgank, Andrej / Elenius, Kjell / Salvi, Giampiero:
"A NOISE ROBUST MULTILINGUAL REFERENCE RECOGNISER BASED ON SPEECHDAT(II)",
vol. 3, 370-373.
Lv, Muhua / Cai, Lianhong:
"The design and application of a speech database for Chinese TTS system",
vol. 3, 378-381.
Chengalvarayan, Rathinavelu:
"Use of multiple classifiers for speech recognition in wireless CDMA network environments",
vol. 3, 382-385.
Franz, Alexander / Horiguchi, Keiko / Duan, Lei:
"An imperative programming language for spoken language translation",
vol. 3, 386-389.
Wakita, Yumi / Matsui, Kenji / Sagisaka, Yoshinori:
"Fine keyword clustering using a thesaurus and example sentences for speech translation",
vol. 3, 390-393.
Feng, JunLan / Wang, XianFang / Du, LiMin:
"Data collection and processing in a Chinese spontaneous speech corpus IIS_CSS",
vol. 3, 394-397.
Aizawa, Yasuyuki / Matsubara, Shigeki / Kawaguchi, Nobuo / Toyama, Katsuhiko / Inagaki, Yasuyoshi:
"Spoken language corpus for machine interpretation research",
vol. 3, 398-401.
Rules and Corpora (Special Session)
Santen, Jan van / Macon, Michael / Cronk, Andrew / Hosom, John-Paul / Kain, Alexander / Pagel, Vincent / Wouters, Johan:
"When will synthetic speech sound human: role of rules and data",
vol. 3, 402-409.
Syrdal, Ann K. / Wightman, Colin W. / Conkie, Alistair / Stylianou, Yannis / Beutnagel, Mark / Schroeter, Juergen / Strom, Volker / Lee, Ki-Seung / Makashay, Matthew J.:
"Corpus-based techniques in the AT&t nextgen synthesis system",
vol. 3, 410-415.
Campbell, Nick:
"Limitations to concatenative speech synthesis",
vol. 3, 416-419.
Kawai, Hisashi / Yamamoto, Seiichi / Higuchi, Norio / Shimizu, Tohru:
"A design method of speech corpus for text-to-speech synthesis taking account of prosody",
vol. 3, 420-425.
Sproat, Richard:
"Corpus-based methods and hand-built methods",
vol. 3, 426-428.
Picheny, Michael A.:
"Heredity and environment in speech recognition: the role of a priori information vs. data",
vol. 3, 429-433.
Kubozono, Haruo:
"A constraint-based analysis of compound accent in Japanese",
vol. 3, 438-441.
Iwahashi, Naoto:
"Language acquisition through a human-robot interface",
vol. 3, 442-447.
Sagisaka, Yoshinori / Yamamoto, Hirofumi / Tsuzaki, Minoru / Kato, Hiroaki:
"Rules, but what for? - rule description as efficient and robust abstraction of corpora and optimal fitting to applications -",
vol. 3, 448-451.
Perception and Comprehension of Spoken Language 1, 2
Makarova, Veronika:
"Cross-linguistic aspects of intonation perception",
vol. 3, 452-453.
Kubozono, Haruo / Haraguchi, Shosuke:
"Visual information and the perception of prosody",
vol. 3, 454-457.
Akagi, Masato / Kitakaze, Hironori:
"Perception of synthesized singing voices with fine fluctuations in their fundamental frequency contours",
vol. 3, 458-461.
Palomäki, Kalle J. / Alku, Paavo / Mäkinen, Ville / May, Patrick / Tiitinen, Hannu:
"Neuromagnetic study on localization of speech sounds",
vol. 3, 462-465.
Hirose, Yukiyoshi / Kakehi, Kazuhiko:
"Perception of identical vowel sequences in Japanese conversational speech",
vol. 3, 466-469.
Fernández, Santiago / Feijóo, Sergio:
"Acoustic cues to perception of vowel quality",
vol. 3, 470-473.
Klabbers, Esther / Veldhuis, Raymond / Koppen, Kim:
"A solution to the reduction of concatenation artefacts in speech synthesis",
vol. 3, 474-477.
Wang, Jhing-Fa / Wang, Hsien-Chang / Lee, Kin-Nan / Huang, Chieh-Yi:
"Domain-unconstrained language understanding based on CKIP-auto tag, how-net, and ART",
vol. 3, 478-481.
Powell, Chris / Zajicek, Mary / Duce, David:
"The generation of representations of word meanings from dictionaries",
vol. 3, 482-485.
Luk, Po Chui / Meng, Helen / Wang, Filung:
"Grammar partitioning and parser composition for natural language understanding",
vol. 3, 486-489.
Lai, Jennifer / Tsimhoni, Omer / Green, Paul:
"Comprehension of synthesized speech while driving and in the lab",
vol. 3, 490-493.
Tyler, Michael D. / Burnham, Denis K.:
"Orthographic influences on initial phoneme addition and deletion tasks: the effect of lexical status",
vol. 3, 494-497.
Zolfaghari, Parham / Atake, Yoshinori / Shikano, Kiyohiro / Kawahara, Hideki:
"Investigation of analysis and synthesis parameters of straight by subjective evaluation",
vol. 3, 498-501.
Spoken Language Processing
Pargellis, Andrew N. / Potamianos, Alexandros:
"Cross-domain classification using generalized domain acts",
vol. 3, 502-505.
Ramaswamy, Ganesh N. / Kleindienst, Jan:
"Hierarchical feature-based translation for scalable natural language understanding",
vol. 3, 506-509.
Potamianos, Alexandros / Kuo, Hong-Kwang J.:
"Statistical recursive finite state machine parsing for speech understanding",
vol. 3, 510-513.
Liu, Chaojun / Yan, Yonghong:
"Speaker change detection using minimum message length criterion",
vol. 3, 514-517.
Furui, Sadaoki / Maekawa, Kikuo / Isahara, Hitoshi / Ohdaira, Takahiro Shinozaki (1) and Takashi:
"Toward the realization of spontaneous speech recognition - introduction of a Japanese priority program and preliminary results -",
vol. 3, 518-521.
Takezawa, Toshiyuki / Sugaya, Fumiaki / Naito, Masaki / Yamamoto, Seiichi:
"A comparative study on acoustic and linguistic characteristics using speech from human-to-human and human-to-machine conversations",
vol. 3, 522-525.
Yoma, Néstor Becerra:
"Speaker dependent temporal constraints combined with speaker independent HMM for speech recognition in noise",
vol. 3, 526-529.
Acoustic Features for Robust Speech Recognition
Ito, Yoshihiro / Matsumoto, Hiroshi / Yamamoto, Kazumasa:
"Forward masking on a generalized logarithmic scale for robust speech recognition",
vol. 3, 530-533.
Christensen, Heidi / Lindberg, Børge / Andersen, Ove:
"Noise robustness of heterogeneous features employing minimum classification error feature space transformations",
vol. 3, 534-537.
Seltzer, Michael L. / Raj, Bhiksha / Stern, Richard M.:
"Classifier-based mask estimation for missing feature methods of robust speech recognition",
vol. 3, 538-541.
Hermus, Kris / Verhelst, Werner / Wambacq, Patrick:
"Optimized subspace weighting for robust speech recognition in additive noise environments",
vol. 3, 542-545.
Ming, Ji / Jancovic, Peter / Hanna, Philip / Stewart, Darryl / Smith, F. Jack:
"Robust feature selection using probabilistic union models",
vol. 3, 546-549.
Hariharan, Ramalingam / Kiss, Imre / Viikki, Olli / Tian, Jilei:
"Multi-resolution front-end for noise robust speech recognition",
vol. 3, 550-553.
O'Shaughnessy, Douglas / Gabrea, Marcel:
"Recognition of digit strings in noisy speech with limited resources",
vol. 3, 554-557.
Prosody, Acquisition, and Learning
Tajima, Keiichi / Erickson, Donna / Nagao, Kyoko:
"Factors affecting native Japanese speakers' production of intrusive (epenthetic) vowels in English words",
vol. 3, 558-561.
Zitouni, Imed / Smaïli, Kamel / Haton, Jean-Paul:
"Beyond the conventional statistical language models: the variable-length sequences approach",
vol. 3, 562-565.
Tsubota, Yasushi / Dantsuji, Masatake / Kawahara, Tatsuya:
"Computer-assisted English vowel learning system for Japanese speakers using cross language formant structures",
vol. 3, 566-569.
Holter, Trym / Harborg, Erik / Johnsen, Magne Hallstein / Svendsen, Torbjörn:
"ASR-based subtitling of live TV-programs for the hearing impaired",
vol. 3, 570-573.
Wu, Chung-Hsien / Chiu, Yu-Hsien / Guo, Chi-Shiang:
"Natural language processing for Taiwanese sign language to speech conversion",
vol. 3, 574-577.
Miwa, Jouji / Sasaki, Hiroshi / Tanno, Kazunori:
"Japanese spoken language learning system using java information technology",
vol. 3, 578-581.
Strik, Helmer / Cucchiarini, Catia / Binnenpoorte, Diana:
"L2 pronunciation quality in read and spontaneous speech",
vol. 3, 582-585.
Kitamura, Tomoko / Kinoshita, Keisuke / Arai, Takayuki / Kusumoto, Akiko / Murahara, Yuji:
"Designing modulation filters for improving speech intelligibility in reverberant environments",
vol. 3, 586-589.
Zhang, Lei / Han, Jiqing / Lv, Chengguo / Wang, Chengfa:
"An environment model-based robust speech recognition",
vol. 3, 590-593.
Vermaak, Jaco / Andrieu, Christophe / Doucet, Arnaud:
"Particle filtering for non-stationary speech modelling and enhancement",
vol. 3, 594-597.
Graciarena, Martin:
"Maximum likelihood noise HMMm estimation in model-based robust speech recognition",
vol. 3, 598-601.
Zeng, Qingsheng / O'Shaughnessy, Douglas:
"Microphone array within a handset or face mask for speech enhancement",
vol. 3, 602-605.
Wang, Chengfa / Wang, Qiusheng:
"Embedding visually recognizable watermarks into digital audio signals",
vol. 3, 606-609.
Iwaki, Mamoru:
"Auditory perception of amplitude modulated sinusoid using a pure tone and band-limited noises as modulation signals",
vol. 3, 610-613.
Geravanchizadeh, Masoud:
"Spectral voice conversion based on unsupervised clustering of acoustic space",
vol. 3, 614-617.
Pfitzinger, Hartmut R.:
"Removing hum from spoken language resources",
vol. 3, 618-621.
Amdal, Ingunn / Korkmazskiy, Filipp / Surendran, Arun C.:
"Joint pronunciation modelling of non-native speakers using data-driven methods",
vol. 3, 622-625.
Bell, Linda / Eklund, Robert / Gustafson, Joakim:
"A comparison of disfluency distribution in a unimodal and a multimodal speech interface",
vol. 3, 626-629.
Liu, Yi / Fung, Pascale:
"Modelling pronunciation variations in spontaneous Mandarin speech",
vol. 3, 630-633.
Suzuki, Tadashi / Ishii, Jun / Nakajima, Kunio:
"A method of generating English pronunciation dictionary for Japanese English recognition systems",
vol. 3, 634-637.
Bonneau-Maynard, Hélène / Devillers, L.:
"A framework for evaluating contextual understanding",
vol. 3, 638-641.
Deng, Yonggang / Huang, Taiyi / Xu, Bo:
"Towards high performance continuous Mandarin digit string recognition",
vol. 3, 642-645.
Aylett, Matthew:
"Stochastic suprasegmentals: relationships between redundancy, prosodic structure and care of articulation in spontaneous speech",
vol. 3, 646-649.
Sakamoto, Masaharu / Saitoh, Takashi:
"An automatic pitch-marking method using wavelet transform",
vol. 3, 650-653.
Takamaru, Keiichi / Hiroshige, Makoto / Araki, Kenji / Tochinai, Koji:
"A proposal of a model to extract Japanese voluntary speech rate control",
vol. 3, 654-657.
Makarova, Veronika:
"Acoustic characteristics of surprise in Russian questions",
vol. 3, 658-661.
Deng, Yonggang / Cao, Yang / Xu, Bo:
"Neural network based integration of multiple confidence measures for OOV detection",
vol. 3, 662-665.
Xu, Yi / Sun, Xuejing:
"How fast can we really change pitch? maximum speed of pitch change revisited",
vol. 3, 666-669.
Klabbers, Esther / Santen, Jan van:
"Predicting segmental durations for Dutch using the sums-of-products approach",
vol. 3, 670-673.
Cao, Yang / Huang, Taiyi / Xu, Bo / Li, Chengrong:
"A stochastic polynomial tone model for continuous Mandarin speech",
vol. 3, 674-677.
Gabrea, Marcel / O’Shaughnessy, Douglas:
"Detection of filled pauses in spontaneous conversational speech",
vol. 3, 678-681.
Lyberg, Bertil / Sangarig, Sonia:
"Some observations on different strategies for the timing of fundamental frequency events",
vol. 3, 682-685.
Wu, Zhiyong / Cai, Lianhong / Zhou, Tongchun:
"Research on dynamic characters of Chinese pitch contours",
vol. 3, 686-689.
Adaptation and Acquisition in Spoken Language Processing (Poster)
Zhao, Bing / Xu, Bo:
"Incorporating HMM-state sequence confusion for rapid MLLR adaptation to new speakers",
vol. 3, 690-693.
Zhang, Zhipeng / Furui, Sadaoki:
"An online incremental speaker adaptation method using speaker-clustered initial models",
vol. 3, 694-697.
Li, Guoqiang / Du, Limin / Hou, Ziqiang:
"Prior parameter transformation for unsupervised speaker adaptation",
vol. 3, 698-701.
Sarikaya, Ruhi / Hansen, John H. L.:
"Improved Jacobian adaptation for fast acoustic model adaptation in noisy speech recognition",
vol. 3, 702-705.
Fujita, Keiko / Ono, Yoshio / Nakatoh, Yoshihisa:
"A study of vocal tract length normalization with generation-dependent acoustic models",
vol. 3, 706-709.
Wang, Shaojun / Zhao, Yunxin:
"Optimal on-line Bayesian model selection for speaker adaptation",
vol. 3, 710-713.
Zhou, Bowen / Hansen, John H. L.:
"Unsupervised audio stream segmentation and clustering via the Bayesian information criterion",
vol. 3, 714-717.
Tsuge, Satoru / Fukada, Toshiaki / Kita, Kenji:
"Frame-period adaptation for speaking rate robust speech recognition",
vol. 3, 718-721.
Nieuwoudt, C. / Botha, Elizabeth C.:
"Cross-language use of acoustic information for automatic speech recognition",
vol. 3, 722-725.
Sato, Shoei / Imai, Toru / Tanaka, Hideki / Ando, Akio:
"Selective training of HMMs by using two-stage clustering",
vol. 3, 726-729.
Torre, Angel de la / Fohr, Dominique / Haton, Jean-Paul:
"Compensation of noise effects for robust speech recognition in car environments",
vol. 3, 730-733.
Kim, Dong Kook / Kim, Nam Soo:
"Bayesian speaker adaptation based on probabilistic principal component analysis",
vol. 3, 734-737.
Liu, Wai Kat / Fung, Pascale:
"MLLR-based accent model adaptation without accented data",
vol. 3, 738-741.
Chen, Kuan-Ting / Liau, Wen-Wei / Wang, Hsin-Min / Lee, Lin-Shan:
"Fast speaker adaptation using eigenspace-based maximum likelihood linear regression",
vol. 3, 742-745.
Potamianos, Gerasimos / Neti, Chalapathy:
"Stream confidence estimation for audio-visual speech recognition",
vol. 3, 746-749.
Komatsu, Masahiko / Tokuma, Won / Tokuma, Shinichi / Arai, Takayuki:
"The effect of reduced spectral information on Japanese consonant perception: comparison between L1 and L2 listeners",
vol. 3, 750-753.
Ciocca, Valter / Aisha, Rani / Francis, Alex / Wong, Lena:
"Can cantonese children with cochlear implants perceive lexical tones?",
vol. 3, 754-757.
Yip, Michael C. W.:
"Recognition of spoken words in the continuous speech: effects of transitional probability",
vol. 3, 758-761.
Salomon, Ariel / Espy-Wilson, Carol:
"Detection of speech landmarks using temporal cues",
vol. 3, 762-765.
Otake, Takashi / Cutler, Anne:
"A set of Japanese word cohorts rated for relative familiarity",
vol. 3, 766-769.
Yamakawa, Kimiko / Miyazono, Hiromitsu / Baba, Ryoji:
"The phonetic value of the devocalized vowel in Japanese - in case of velar plosive",
vol. 3, 770-773.
McQueen, James M. / Cutler, Anne / Norris, Dennis:
"Positive and negative influences of the lexicon on phonemic decision-making",
vol. 3, 778-781.
Weber, Andrea:
"Phonotactic and acoustic cues for word segmentation in English",
vol. 3, 782-785.
Janse, Esther:
"Intelligibility of time-compressed speech: three ways of time-compression",
vol. 3, 786-789.
Traunmüller, Hartmut:
"Evidence for demodulation in speech perception",
vol. 3, 790-793.
Large Vocabulary Continuous Speech Recognition
Gauvain, Jean-Luc / Lamel, Lori:
"Fast decoding for indexation of broadcast data",
vol. 3, 794-797.
Gao, Sheng / Xu, Bo / Zhang, Hong / Zhao, Bing / Li, Chengrong / Huang, Taiyi:
"Update progress of Sinohear: advanced Mandarin LVCSR system at NLPR",
vol. 3, 798-801.
Aubert, Xavier L. / Blasig, Reinhard:
"Combined acoustic and linguistic look-ahead for one-pass time-synchronous decoding",
vol. 3, 802-805.
Deng, Li / Acero, Alex / Plumpe, Mike / Huang, Xuedong:
"Large-vocabulary speech recognition under adverse acoustic environments",
vol. 3, 806-809.
Fischer, Volker / Kunzmann, S. J.:
"Acoustic language model classes for a large vocabulary continuous speech recognizer",
vol. 3, 810-813.
Kummert, Franz / Fink, Gernot A. / Sagerer, Gerhard:
"A hybrid speech recognizer combining HMMs and polynomial classification",
vol. 3, 814-817.
Huang, Chao / Chang, Eric / Zhou, Jianlai / Lee, Kai-Fu:
"Accent modeling based on pronunciation dictionary adaptation for large vocabulary Mandarin speech recognition",
vol. 3, 818-821.
Speech Coding and Transmission
Zhang, Jinzhong / He, Yingmin / Yu, Renshu:
"A mixed and code excitation LPC vocoder at 1.76 kb/s",
vol. 3, 822-825.
Kohata, Minoru / Mitsuya, Ikuya / Suzuki, Motoyuki / Makino, Shozo:
"Efficient segment quantization of LSP parameters for very low bit speech coding",
vol. 3, 826-829.
Ribeiro, Carlos M. / Trancoso, Isabel M. / Caseiro, Diamantino A.:
"Phonetic vocoder assessment",
vol. 3, 830-833.
Hu, Hongtao / Du, Limin:
"A new low bit rate speech coder based on intraframe waveform interpolation",
vol. 3, 834-837.
Chengalvarayan, Rathinavelu / Thomson, David L.:
"Discriminatively derived HMM-based announcement modeling approach for noise control avoiding the problem of false alarms",
vol. 3, 838-841.
Huerta, Juan M. / Stern, Richard M.:
"Instantaneous-distortion based weighted acoustic modeling for robust recognition of coded speech",
vol. 3, 842-845.
Acoustic Model Adaptation
Rajput, Nitendra / Subramaniam, L. Venkata / Verma, Ashish:
"Adapting phonetic decision trees between languages for continuous speech recognition",
vol. 3, 850-852.
Cox, Stephen:
"Speaker normalization in the MFCC domain",
vol. 3, 853-856.
Haeb-Umbach, Reinhold:
"Data-driven phonetic regression class tree estimation for MLLR adaptation",
vol. 3, 857-860.
Afify, Mohamed / Siohan, Olivier:
"Constrained maximum likelihood linear regression for speaker adaptation",
vol. 3, 861-864.
Choi, Woo-Yong / Kim, Hyung Soon:
"Predictive speaker adaptation based on least squares method",
vol. 3, 865-868.
Acero, Alex / Deng, Li / Kristjansson, Trausti / Zhang, Jerry:
"HMM adaptation using vector taylor series for noisy speech recognition",
vol. 3, 869-872.
Vergyri, Dimitra / Tsakalidis, Stavros / Byrne, William:
"Minimum risk acoustic clustering for multilingual acoustic model combination",
vol. 3, 873-876.
Miscellaneous 3 [D,E,F,I,P,N,R,S,U,W,Y,Z]
Oviatt, Sharon:
"Talking to thimble jellies: children²s conversational speech with animated characters",
vol. 3, 877-880.
Rodman, Robert / McAllister, David / Bitzer, Donald / Chappell, D.:
"A high-resolution glottal pulse tracker",
vol. 3, 881-884.
Alku, Paavo / Svec, Jan G. / Vilkman, Erkki / Sram, Frantisek:
"Analysis of voice production in breathy, normal and pressed phonation by comparing inverse filtering and videokymography",
vol. 3, 885-888.
Ito, Takayuki / Gomi, Hiroaki / Honda, Masaaki:
"Model of the mechanical linkage of the upper lip-jaw for the articulatory coordination",
vol. 3, 889-892.
Matsumura, Masafumi / Niikawa, Takuya / Torii, Taku / Yamasaki, Hitoshi / Hara, Hisanaga / Tachimura, Takashi / Wada, Takeshi:
"Measurement of palatolingual contact pressure and tongue force using a force-sensor-mounted palatal plate",
vol. 3, 893-896.
Engwall, Olov:
"A 3d tongue model based on MRI data",
vol. 3, 901-904.
Bae, Jae-Hyun / Byeon, Heo-Jin / Oh, Yung-Hwan:
"Speech quality improvement in TTS system using ABS/OLA sinusoidal model",
vol. 3, 905-908.
Bruyninckx, Marielle / Harmegnies, Bernard:
"A study of palatal segments' production by danish speakers",
vol. 3, 909-912.
Ramabhadran, Bhuvana / Gao, Yuqing / Picheny, Michael:
"Dynamic selection of feature spaces for robust speech recognition",
vol. 3, 913-916.
Fernández, Santiago / Feijóo, Sergio:
"A probabilistic model of integration of acoustic cues in FV syllables",
vol. 3, 917-920.
Bilmes, Jeff A. / Kirchhoff, Katrin:
"Directed graphical models of classifier combination: application to phone recognition",
vol. 3, 921.
Jan, E. E. / Botella Ordinas, Jaime / Saon, George / Roukos, Salim:
"Real-time multilingual HMM training robust to channel variations",
vol. 3, 925-928.
Wijngaarden, Sander J. van / Steeneken, Herman J.M.:
"The intelligibility of German and English speech to Dutch listeners",
vol. 3, 929-932.
Zhen, Bin / Wu, Xihong / Liu, Zhimin / Chi, Huisheng:
"On the use of bandpass liftering in speaker recognition",
vol. 3, 933-936.
Carré, René / Sprenger-Charolles, Liliane / Messaoud-Galusi, Souhila / Serniclaes, Willy:
"On auditory-phonetic short-term transformation",
vol. 3, 937-940.
Alwan, James J. Hant and Abeer:
"Predicting the perceptual confusion of synthetic plosive consonants in noise",
vol. 3, 941-944.
Larson, Martha / Willett, Daniel / Köhler, Joachim / Rigoll, Gerhard:
"Compound splitting and lexical unit recombination for improved performance of a speech recognition system for German parliamentary speeches",
vol. 3, 945-948.
Zundert, Martine van / Terken, Jacques:
"Learning and transfer of learning for synthetic speech",
vol. 3, 949-952.
Zhang, Yang / Kuhl, Patricia K. / Imada, Toshiaki / Iverson, Paul / Pruitt, John / Kotani, Makoto / Stevens, Erica:
"Neural plasticity revealed in perceptual training of a Japanese adult listener to learn american /l-r/ contrast: a whole-head magnetoencephalography study",
vol. 3, 953-956.
Joto, Akiyo:
"The effect of consonantal context and acoustic characteristics on the discrimination between the English vowel /i/ and /e/ by Japanese learners",
vol. 3, 957-960.
Zhao, Li / Lu, Wei / Jiang, Ye / Wu, Zhenyang:
"A study on emotional feature recognition in speech",
vol. 3, 961-964.
Godino-Llorente, Juan I. / Aguilera-Navarro, Santiago / Gómez-Vilda, Pedro:
"LPC, LPCC and MFCC parameterisation applied to the detection of voice impairments",
vol. 3, 965-968.
T'sou, Benjamin K. / Lai, Tom B. Y.:
"A complementary approach to computer-aided transcription: synergy of statistical-based and kbnowledge discovery paradigms",
vol. 3, 969-972.
Caraty, Marie-José / Montacié, Claude:
"Teraspeech’2000 : a 10,000 speakers database",
vol. 3, 973-976.
Dybkjær, Laila / Bernsen, Niels Ole:
"The MATE workbench - a tool in support of spoken dialogue annotation and information extraction",
vol. 3, 977-980.
Brun, Armelle / Langlois, David / Smaili, Kamel / Haton, Jean-Paul:
"Discarding impossible events from statistical language models",
vol. 3, 981-984.
Lepage, Yves / Auclerc, Nicolas / Shirai, Satoshi:
"A tool to build a treebank for conversational Chinese",
vol. 3, 985-988.
Auckenthaler, Roland / Carey, Michael / Maso, John:
"Parameter reduction in a text-independent speaker verification system",
vol. 3, 989-992.
Gu, Yong / Thomas, Trevor:
"Advances on HMM-based text-dependent speaker verification",
vol. 3, 993-996.
Stapert, Robert / Mason, John S. / Auckenthaler, Roland:
"Optimisation of GMM in speaker recognition",
vol. 3, 997-1000.
Zilca, Ran D. / Bistritz, Yuval:
"Distance-based Gaussian mixture model for speaker recognition over the telephone",
vol. 3, 1001-1004.
Liu, Jun-Hui / Chen, Ke:
"Pruning abnormal data for better making a decision in speaker verification",
vol. 3, 1005-1008.
Bosch, Louis ten:
"ASR, dialects, and acoustic/phonological distances",
vol. 3, 1009-1012.
Nishida, Masafumi / Ariki, Yasuo:
"Speaker verification by integrating dynamic and static features using subspace method",
vol. 3, 1013-1016.
Kim, Se-Hyun / Jang, Gil-Jin / Oh, Yung-Hwan:
"Improvement of speaker recognition system by individual information weighting",
vol. 3, 1017-1020.
Yoma, Néstor Becerra / Pegoraro, Tarciano Facco:
"Speaker verification in noise using temporal constraints",
vol. 3, 1021-1024.
Sabac, Bogdan / Gavat, Inge / Valsan, Zica:
"Speaker identification using discriminative features selection",
vol. 3, 1025-1028.
Magrin-Chagnolleau, Ivan / Gravier, Guilleaume / Seck, Mouhamadou / Boeffard, Olivier / Blouet, R. / Bimbot, Frédéric:
"A further investigation on speech features for speaker characterization",
vol. 3, 1029-1032.
Balleda, Jyotsana / Murthy, Hema A / Nagarajan, T.:
"Language identification from short segments of speech",
vol. 3, 1033-1036.
Kronenberg, Susanne / Kummert, Franz:
"Generation of utterances based on visual context information",
vol. 3, 1037-1040.
Rahim, Mazin / Pieraccini, Roberto / Eckert, Wieland / Levin, Esther / Fabbrizio, Giuseppe Di / Riccardi, Giuseppe / Kamm, Candy / Narayanan, Shrikanth:
"A spoken dialogue system for conference/workshop services",
vol. 3, 1041-1044.
Churcher, Gavin / Wyard, Peter:
"Developing robust, user-centred multimodal spoken language systems: the MUeSLI project",
vol. 3, 1045-1048.
Johnsen, Magne H. / Svendsen, Torbjørn / Amble, Tore / Holter, Trym / Harborg, Erik:
"TABOR - a norwegian spoken dialogue system for bus travel information",
vol. 3, 1049-1052.
Huang, Yinfei / Zheng, Fang / Xu, Mingxing / Yan, Pengju / Wu, Wenhu:
"Language understanding component for Chinese dialogue system",
vol. 3, 1053-1056.
Aoyama, Kazumi / Hirano, Izumi / Kikuchi, Hideaki / Shirai, Katsuhiko:
"Designing a domain independent platform of spoken dialogue system",
vol. 3, 1057-1060.
Zhou, Qiru / Saad, Antoine / Abdou, Sherif:
"An enhanced BLSTIP dialogue research platform",
vol. 3, 1061-1064.
Qu, Weidong / Shirai, Katsuhiko:
"Using machine learning method and subword unit representations for spoken document categorization",
vol. 3, 1065-1068.
Stark, Litza / Whittaker, Steve / Hirschberg, Julia:
"ASR satisficing: the effects of ASR accuracy on speech retrieval",
vol. 3, 1069-1072.
Nishizaki, Hiromitsu / Nakagawa, Seiichi:
"A system for retrieving broadcast news speech documents using voice input keywords and similarity between words",
vol. 3, 1073-1076.
Lai, Yu-Sheng / Lee, Kuen-Lin / Wu, Chung-Hsien:
"Intention extraction and semantic matching for internet FAQ retrieval using spoken language query",
vol. 3, 1077-1080.
Vark, Robert J. van / Haan, Jelle K. de / Rothkrantz, Leon J. M.:
"A domain-independent model to improve spelling in a web environment",
vol. 3, 1081-1084.
Takao, Seiichi / Ogata, Jun / Ariki, Yasuo:
"Expanded vector space model based on word space in cross media retrieval of news speech data",
vol. 3, 1085-1088.
Hansen, John H. L. / Zhou, Bowen / Akbacak, Murat / Sarikaya, Ruhi / Pellom, Bryan:
"Audio stream phrase recognition for a national gallery of the spoken word: "one small step"",
vol. 3, 1089-1092.
Nakajima, Hideharu / Sagisaka, Yoshinori / Yamamoto, Hirofumi:
"Pronunciation variants description using recognition error modeling with phonetic derivation hypotheses",
vol. 3, 1093-1096.
Tsukahara, Wataru / Ward, Nigel:
"Evaluating responsiveness in spoken dialog systems",
vol. 3, 1097-1100.
Kitawaki, Nobuhiko / Asano, Futoshi / Yamada, Takeshi:
"Characteristics of spoken language required for objective quality evaluation of echo cancellers",
vol. 3, 1101-1104.
Sugaya, Fumiaki / Takezawa, Toshiyuki / Yokoo, Akio / Sagisaka, Yoshinori / Yamamoto, Seiichi:
"Evaluation of the ATR-matrix speech translation system with a pair comparison method between the system and humans",
vol. 3, 1105-1108.
Maruyama, Ichiro / Abe, Yoshiharu / Ehara, Terumasa / Shirai, Katsuhiko:
"An automatic timing detection method for superimposing closed captions of TV programs",
vol. 3, 1109-1112.
Ogner, Marcel / Kacic, Zdravko:
"Normalized time-frequency speech representation in articulation training systems",
vol. 3, 1113-1116.
Torihara, Shinichi / Nagao, Katashi:
"Semantic transcoding: making the handicapped and the aged free from their barriers in obtaining information on the web",
vol. 3, 1117-1120.
Chengalvarayan, Rathinavelu:
"The use of nonlinear energy transformation for Tamil connected-digit speech recognition",
vol. 3, 1121-1124.
Chen, Aimin / Vaseghi, Saeed:
"State based sub-band Wiener filters for speech enhancement in car environments",
vol. 3, 1125-1128.
Hermus, Kris / Verhelst, Werner / Wambacq, Patrick / Lemmerling, Philippe:
"Total least squares based subband modelling for scalable speech representations with damped sinusoids",
vol. 3, 1129-1132.
Chang, Joon-Hyuk / Kim, Nam Soo:
"Speech enhancement: new approaches to soft decision",
vol. 3, 1133-1136.
Language Resources and Technology Evaluation (Special Session)
Glass, James / Polifroni, Joseph / Seneff, Stephanie / Zue, Victor:
"Data collection and performance evaluation of spoken dialogue systems: the MIT experience",
vol. 4, 1-4.
Lamel, Lori / Rosset, Sophie / Gauvain, Jean-Luc:
"Considerations in the design and evaluation of spoken language dialog systems",
vol. 4, 5-8.
Heckmann, Martin / Berthommier, Frédéric / Savario, Christophe / Kroschel, Kristian:
"Labeling audio-visual speech corpora and training an ANN/HMM audio-visual speech recognition system",
vol. 4, 9-12.
Li, Aijun / Lin, Maocan / Chen, XiaoXia / Zu, Yiqing / Sun, Guohua / Hua, Wu / Yin, Zhigang / Yan, Jingzhu:
"Speech corpus of Chinese discourse and the phonetic research",
vol. 4, 13-18.
Fiscus, Jonathan G. / Doddington, George R.:
"Results of the 1999 topic detection and tracking evaluation in Mandarin and English",
vol. 4, 19-24.
Nakamura, Satoshi / Watanuki, Keiko / Takezawa, Toshiyuki / Hayamizu, Satoru:
"Multimodal corpora for human-machine interaction research",
vol. 4, 25-28.
Pearce, David / Hirsch, Hans-Günter:
"The aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions",
vol. 4, 29-32.
Tillmann, Hans-Günther / Schiel, Florian / Draxler, Christoph / Hoole, Phil:
"The bavarian archive for speech signals - serving the speech community",
vol. 4, 33-36.
Millar, J. Bruce:
"The development of spoken language resources in oceania",
vol. 4, 37-40.
Soong, Frank K. / Woudenberg, Eric A.:
"Hands-free human-machine dialogue - corpora, technology and evaluation",
vol. 4, 41-44.
Acquisition and Learning of Spoken Language 1, 2
Riccardi, Giuseppe:
"On-line learning of acoustic and lexical units for domain-independent ASR",
vol. 4, 45-48.
Akiba, Tomoyosi / Itou, Katsunobu:
"Semi-automatic language model acquisition without large corpora",
vol. 4, 49-52.
Petrovska-Delacrétaz, Dijana / Gorin, Allen L. / Wright, Jerry H. / Riccardi, Giuseppe:
"Detecting acoustic morphemes in lattices for spoken language understanding",
vol. 4, 53-56.
Mizumachi, Mitsunori / Akagi, Masato / Nakamura, Satoshi:
"Design of robust subtractive beamformer for noisy speech recognition",
vol. 4, 57-60.
Sheikhzadeh, Hamid / Amirfattahi, Rassoul:
"Objective long-term assessment of speech quality changes in pre-lingual cochlear implant children",
vol. 4, 61-64.
Nöth, Elmar / Niemann, Heinrich / Haderlein, Tino / Decher, M. / Eysholdt, Uwe / Rosanowski, F. / Wittenberg, T.:
"Automatic stuttering recognition using hidden Markov models",
vol. 4, 65-68.
Roy, Deb:
"Grounded speech communication",
vol. 4, 69-72.
Jun, Sun-Ah / Oh, Mira:
"Acquisition of second language intonation",
vol. 4, 73-76.
Siu, Man-hung / Wong, Ka-Ming / Ching, Man-Yan / Lau, Mei-Sum:
"Computer-aided Mandarin pronunciation learning system",
vol. 4, 77-80.
McTear, Michael / Conn, Norma / Phillips, Nicola:
"Speech recognition software: a tool for people with dyslexia",
vol. 4, 81-84.
Bunnell, H. Timothy / Yarrington, Debra M. / Polikoff, James B.:
"STAR: articulation training for young children",
vol. 4, 85-88.
Acoustics of Spoken Language 1, 2
Nakai, Takayoshi / Ishida, Keizo / Suzuki, Hisayoshi:
"Sound pressure distributions and propagation paths in the vocal tract with the pyriform fossa and the larynx",
vol. 4, 89-92.
Czap, László:
"Lip representation by image ellipse",
vol. 4, 93-96.
Son, Rob J. J. H. van / Streefkerk, Barbertje M. / Pols, Louis C. W.:
"An acoustic profile of speech efficiency",
vol. 4, 97-100.
Meng, Helen M. / Lo, W. K. / Li, Yuk Chi / Ching, P. C.:
"Multi-scale audio indexing for Chinese spoken document retrieval",
vol. 4, 101-104.
Soltau, Hagen / Waibel, Alex:
"Phone dependent modeling of hyperarticulated effects#",
vol. 4, 105-108.
Guo, Qing / Yan, Yonghong / Yuan, Baosheng / Zhang, Xiangdong / Jia, Ying / Liu, Xiaoxing:
"Vocabulary-based acoustic model trim down and task adaptation",
vol. 4, 109-112.
Chen, Willa S. / Alwan, Abeer:
"Place of articulation cues for voiced and voiceless plosives and fricatives in syllable-initial position",
vol. 4, 113-116.
Chen, Jingdong / Paliwal, Kuldip K. / Nakamura, Satoshi:
"A block cosine transform and its application in speech recognition",
vol. 4, 117-120.
Hung, Jeih-Weih / Wang, Hsin-Min / Lee, Lin-Shan:
"Automatic metric-based speech segmentation for broadcast news via principal component analysis",
vol. 4, 121-124.
Gao, Yuqing / Li, Yongxin / Picheny, Michael:
"Maximal rank likelihood as an optimization function for speech recognition",
vol. 4, 125-128.
Pan, Yue / Waibel, Alex:
"The effects of room acoustics on MFCC speech parameter",
vol. 4, 129-132.
Hasegawa-Johnson, Mark:
"Time-frequency distribution of partial phonetic information measured using mutual information",
vol. 4, 133-136.
Recognition and Understanding of Spoken Language 3, 4
Jiang, Li / Huang, Xuedong:
"Subword-dependent speaker clustering for improved speech recognition",
vol. 4, 137-140.
Luo, Chunhua / Zheng, Fang / Xu, Mingxing:
"An equivalent-class based MMI learning method for MGCPM",
vol. 4, 141-144.
Wrench, Alan A. / Richmond, Korin:
"Continuous speech recognition using articulatory data",
vol. 4, 145-148.
Mak, Brian / Tam, Yik-Cheung:
"Asynchrony with trained transition probabilities improves performance in multi-band speech recognition",
vol. 4, 149-152.
Sivadas, Sunil / Jain, Pratibha / Hermansky, Hynek:
"Discriminative MLPs in HMM-based recognition of speech in cellular telephony",
vol. 4, 153-156.
Hanazawa, Toshiyuki / Ishii, Jun / Okato, Yohei / Nakajima, Kunio:
"Acoustic modeling for spontaneous speech recognition using syllable dependent models",
vol. 4, 157-160.
Jiang, Hui / Deng, Li:
"A robust training strategy against extraneous acoustic variations for spontaneous speech recognition",
vol. 4, 161-164.
Purnell, Darryl W. / Botha, Elizabeth C.:
"Improved performance and generalization of minimum classification error training for continuous speech recognition",
vol. 4, 165-168.
Jia, Ying / Yan, Yonghong / Yuan, Baosheng:
"Dynamic threshold setting via Bayesian information criterion (BIC) in HMM training",
vol. 4, 169-171.
Hain, Thomas / Woodland, Philip C.:
"Modelling sub-phone insertions and deletions in continuous speech recognition",
vol. 4, 172-175.
Fung, Carrson C. / Au, Oscar C. / Wan, Wanggen / Yim, Chi H. / Keung, Cyan L.:
"Improved acoustics modeling for speech recognition using transformation techniques",
vol. 4, 176-179.
Gu, Liang / Nayak, Jayanth / Rose, Kenneth:
"Discriminative training of tied-mixture HMM by deterministic annealing",
vol. 4, 183-186.
Kuo, Hong-Kwang Jeff / Lee, Chin-Hui:
"Discriminative training in natural language call routing",
vol. 4, 187-190.
Tanaka, Kazuyo / Kojima, Hiroaki:
"A speech recognition method with a language-independent intermediate phonetic code",
vol. 4, 191-194.
Lefèvre, Fabrice:
"Confidence measures based on the k-nn probability estimator",
vol. 4, 195-197.
Mukherjee, Niloy / Rajput, Nitendra / Subramaniam, L. Venkata / Verma, Ashish:
"On deriving a phoneme model for a new language",
vol. 4, 198-201.
Saito, Tomonobu / Hashimoto, Kiyoshi:
"Estimation of semantic case of Japanese dialogue by use of distance derived from statistics of dependency",
vol. 4, 202-205.
Cox, Stephen / Dasmahapatra, Srinandan:
"A semantically-based confidence measure for speech recognition",
vol. 4, 206-209.
Ganapathiraju, Aravind / Picone, Joseph:
"Support vector machines for automatic data cleanup",
vol. 4, 210-213.
Gu, Yong / Thomas, Trevor:
"Competition-based score analysis for utterance verification in name recognition",
vol. 4, 214-217.
Zhang, Yaxin:
"Utterance verification/rejection for speaker-dependent and speaker-independent speech recognition",
vol. 4, 218-221.
Petrushin, Valery A.:
"Emotion recognition in speech signal: experimental study, development, and application",
vol. 2, 222-225.
Lyu, Ren-yuan / Chen, Chi-yu / Chiang, Yuang-chin / Liang, Min-shung:
"A bi-lingual Mandarin/taiwanese (min-nan), large vocabulary, continuous speech recognition system based on the tong-yong phonetic alphabet (TYPA)",
vol. 2, 226-229.
Emam, Ossama / Gonzalez, Jorge / Günther, Carsten / Janke, Eric / Kunzmann, Siegfried / Maltese, Giulio / Waast-Richard, Claire:
"A data-driven methodology for the production of multilingual conversational systems",
vol. 2, 230-233.
Vaich, Tzur / Cohen, Arnon:
"Multi-path, context dependent SC-HMM architectures for improved connected word recognition",
vol. 4, 234-237.
Meron, Yoram / Hirose, Keikichi:
"Robust recognition using multiple utterances",
vol. 4, 238-241.
Cosi, Piero / Hosom, John-Paul / Tesser, Fabio:
"High performance Italian continuous "digit" recognition",
vol. 4, 242-245.
Fohr, Dominique / Mella, Odile / Antoine, Christophe:
"The automatic speech recognition engine ESPERE: experiments on telephone speech",
vol. 4, 246-249.
Kiss, Imre:
"A comparison of distributed and network speech recognition for mobile communication systems",
vol. 4, 250-253.
Frankel, Joe / Richmond, Korin / King, Simon / Taylor, Paul:
"An automatic speech recognition system using neural networks and linear dynamic models to recover and model articulatory traces",
vol. 4, 254-257.
Shobaki, Khaldoun / Hosom, John-Paul / Cole, Ronald A.:
"The OGI kids² speech corpus and recognizers",
vol. 4, 258-261.
Wu, Jian / Zheng, Fang:
"Reducing time-synchronous beam search effort using stage based look-ahead and language model rank based pruning",
vol. 4, 262-265.
Chung, Grace:
"A three-stage solution for flexible vocabulary speech understanding",
vol. 4, 266-269.
Barker, Jon / Cooke, Martin / Ellis, Daniel P. W.:
"Decoding speech in the presence of other sound sources",
vol. 4, 270-273.
Lee, Shi-Wook / Hirose, Keikichi / Minematsu, Nobuaki:
"Efficient search strategy in large vocabulary continuous speech recognition using prosodic boundary information",
vol. 4, 274-277.
Yu, Ha-Jin / Kim, Hoon / Hong, Joon-Mo / Kim, Min-Seong / Lee, Jong-Seok:
"Large vocabulary Korean continuous speech recognition using a one-pass algorithm",
vol. 4, 278-281.
Seward, Alexander:
"A tree-trellis n-best decoder for stochastic context-free grammars",
vol. 4, 282-285.
Nguyen, Patrick / Rigazio, Luca / Junqua, Jean-Claude:
"EWAVES: an efficient decoding algorithm for lexical tree based speech recognition",
vol. 4, 286-289.
Ogawa, Atsunori / Noda, Yoshiaki / Matsunaga, Shoichi:
"Novel two-pass search strategy using time-asynchronous shortest-first second-pass beam search",
vol. 4, 290-293.
Chan, Yu-Chung / Siu, Manhung / Mak, Brian:
"Pruning of state-tying tree using bayesian information criterion with multiple mixtures",
vol. 4, 294-297.
Liao, Yuan-Fu / Wang, Nick / Huang, Max / Huang, Hank / Seide, Frank:
"Improvements of the Philips 2000 Taiwan Mandarin benchmark system",
vol. 4, 298-301.
Neukirchen, Christoph / Aubert, Xavier / Dolfing, Hans:
"Extending the generation of word graphs for a cross-word m-gram decoder",
vol. 4, 302-305.
Zhao, Qingwei / Lin, Zhiwei / Yuan, Baosheng / Yan, Yonghong:
"Improvements in search algorithm for large vocabulary continuous speech recognition",
vol. 4, 306-309.
Yu, Hua / Tomokiyo, Takashi / Wang, Zhirong / Waibel, Alex:
"New developments in automatic meeting transcription",
vol. 4, 310-313.
Pan, Jielin / Yuan, Baosheng / Yan, Yonghong:
"Effective vector quantization for a highly compact acoustic model for LVCSR",
vol. 4, 318-321.
Yamamoto, Hiroki / Fukada, Toshiaki / Komori, Yasuhiro:
"Effective lexical tree search for large vocabulary continuous speech recognition",
vol. 4, 322-325.
Hori, Chiori / Furui, Sadaoki:
"Improvements in automatic speech summarization and evaluation methods",
vol. 4, 326-329.
Chang, Shuangyu / Shastri, Lokendra / Greenberg, Steven:
"Automatic phonetic transcription of spontaneous speech (american English)",
vol. 4, 330-333.
Novak, Miroslav / Picheny, Michael:
"Speed improvement of the tree-based time asynchronous search",
vol. 4, 334-337.
Huang, Jing / Kingsbury, B. / Mangu, L. / Padmanabhan, Mukund / Saon, George / Zweig, Geoffrey:
"Recent improvements in speech recognition performance on large vocabulary conversational speech (voicemail and switchboard)",
vol. 4, 338-341.
He, Lei / Fang, Ditang / Wu, Wenhu:
"Speaker normalization training and adaptation for speech recognition",
vol. 4, 342-345.
Tomokiyo, Laura Mayfield:
"Lexical and acoustic modeling of non-native speech in LVSCR",
vol. 4, 346-349.
Li, Baojie / Hirose, Keikichi / Minematsu, Nobuaki:
"Modeling phone correlation for speaker adaptive speech recognition",
vol. 4, 350-353.
Botterweck, Henrik:
"Very fast adaptation for large vocabulary continuous speech recognition using eigenvoices",
vol. 4, 354-357.
Zheng, Chengyi / Yan, Yonghong:
"Efficiently using speaker adaptation data",
vol. 4, 358-361.
Pfau, Thilo / Faltlhauser, Robert / Ruske, Günther:
"A combination of speaker normalization and speech rate normalization for automatic speech recognition",
vol. 4, 362-365.
Hwang, Tai-Hwei / Yuo, Kuo-Hwei / Wang, Hsiao-Chuan:
"Speech model compensation with direct adaptation of cepstral variance to noisy environment",
vol. 4, 366-369.
Wu, Ji / Wang, Zuoying:
"Gaussian similarity analysis and its application in speaker adaptation",
vol. 4, 370-373.
Itoh, Nobuyasu / Nishimura, Masafumi / Mori, Shinsuke:
"A method for style adaptation to spontaneous speech by using a semi-linear interpolation technique",
vol. 4, 374-377.
Geutner, Petra / Arevalo, Luis / Breuninger, Joerg:
"VODIS - voice-operated driver information systems: a usability study on advanced speech technologies for car environments",
vol. 4, 378-382.
Chou, Wu / Zhou, Qiru / Kuo, Hong-Kwang Jeff / Saad, Antoine / Attwater, David / Durston, Peter / Farrell, Mark / Scahill, Frank:
"Natural language call steering for service applications",
vol. 4, 382-385.
Hunsinger, Jörg / Lang, Manfred:
"A single-stage top-down probabilistic approach towards understanding spoken and handwritten mathematical formulas",
vol. 4, 386-389.
Raghavan, Prabhu / Gupta, Sunil K.:
"Low complexity connected digit recognition for mobile applications",
vol. 4, 390-393.
Nouza, Jan:
"Telephone speech recognition from large lists of Czech words",
vol. 4, 394-397.
Wu, Duanpei / Menendez-Pidal, X. / Olorenshaw, L. / Chen, R. / Tanaka, M. / Amador, M.:
"Speech and word detection algorithms for hands-free applications",
vol. 4, 398-401.
Rao, Ashwin / Roth, Bob / Nagesha, Venkatesh / McAllaster, Don / Liberman, Natalie / Gillick, Larry:
"Large vocabulary continuous speech recognition of read speech over cellular and landline networks",
vol. 4, 402-405.
Problems and Prospects of Trans-Lingual Communication (Special Session)
Yamamoto, Seiichi:
"Toward speech communications beyond language barrier - research of spoken language translation technologies at ATR -",
vol. 4, 406-411.
Blanchon, Hervé / Boitet, Christian:
"Speech translation for French within the c-STAR II consortium and future perspectives",
vol. 4, 412-417.
Zong, Chengqing / Wakita, Yumi / Xu, Bo / Chen, Zhenbiao / Matsui, Kenji:
"Japanese-to-Chinese spoken language translation based on the simple expression",
vol. 4, 418-421.
Bangalore, Srinivas / Riccardi, Giuseppe:
"Finite-state models for lexical reordering in spoken language translation",
vol. 4, 422-425.
Engel, Ralf:
"CHUNKY: an example based machine translation system for spoken dialogs",
vol. 4, 426-429.
Lazzari, Gianni:
"Spoken translation: challenges and opportunities",
vol. 4, 430-435.
Boitet, Christian / Guilbaud, Jean-Philippe:
"Analysis into a formal task-oriented pivot without clear abstract - semantics is best handled as "usual" translation",
vol. 4, 436-439.
Zong, Chengqing / Huang, Taiyi / Xu, Bo:
"An improved template-based approach to spoken language translation",
vol. 4, 440-443.
Watanabe, Takao / Okumura, Akitoshi / Sakai, Shinsuke / Yamabana, Kiyoshi / Doi, Shinichi / Hanazawa, Ken:
"An automatic interpretation system for travel conversation",
vol. 4, 444-447.
Gruhn, Rainer / Singer, Harald / Tsukada, Hajime / Naito, Masaki / Nishino, Atsushi / Nakamura, Atsushi / Sagisaka, Yoshinori / Nakamura, Satoshi:
"Cellular-phone based speech-to-speech translation system ATR-MATRIX",
vol. 4, 448-451.
Spoken Language Resources, Labeling, and Assessment
Beringer, Nicole / Ito, Tsuyoshi / Neff, Marcia:
"Generation of pronunciation rule sets for automatic segmentation of American English and Japanese",
vol. 4, 452-455.
Samudravijaya, K. / Rao, P. V. S. / Agrawal, S. S.:
"Hindi speech database",
vol. 4, 456-459.
Wang, Hsiao-Chuan / Seide, Frank / Tseng, Chiu-Yu / Lee, Lin-Shan:
"MAT-2000 - design, collection, and validation of a Mandarin 2000-speaker telephone speech database",
vol. 4, 460-463.
Sjölander, Kåre / Beskow, Jonas:
"Wavesurfer - an open source speech tool",
vol. 4, 464-467.
Campbell, Nick / Marumoto, Toru:
"Automatic labelling of voice-quality in speech databases for synthesis",
vol. 4, 468-471.
Timoney, Joe / Foley, J. Brian:
"Speech quality evaluation based on AM-FM time-frequency representations",
vol. 4, 472-475.
Kawahara, Tatsuya / Lee, Akinobu / Kobayashi, Tetsunori / Takeda, Kazuya / Minematsu, Nobuaki / Sagayama, Shigeki / Itou, Katsunobu / Ito, Akinori / Yamamoto, Mikio / Yamada, Atsushi / Utsuro, Takehito / Shikano, Kiyohiro:
"Free software toolkit for Japanese large vocabulary continuous speech recognition",
vol. 4, 476-479.
Robust Modeling
Huo, Qiang / Ma, Bin:
"Robust speech recognition based on off-line elicitation of multiple priors and on-line adaptive prior fusion",
vol. 4, 480-483.
Roberts, William J.J. / Furui, Sadaoki:
"Robust speech recognition via modeling spectral coefficients with HMM's with complex Gaussian components",
vol. 4, 484-487.
Wester, Mirjam / Kessens, Judith M. / Strik, Helmer:
"Pronunciation variation in ASR: which variation to model?",
vol. 4, 488-491.
Mou, Xiaolong / Zue, Victor:
"The use of dynamic reliability scoring in speech recognition",
vol. 4, 492-495.
Macías-Guarasa, Javier / Ferreiros, Javier / San-Segundo, Ruben / Montero, Juan Manuel / Pardo, Juan Manuel:
"Acoustical and lexical based confidence measures for a very large vocabulary telephone speech hypothesis-verification system",
vol. 4, 496-499.
Goronzy, Silke / Marasek, Krzysztof / Kompe, Ralf / Haag, Andreas:
"Phone-duration-based confidence measures for embedded applications",
vol. 4, 500-503.
Ganapathiraju, Aravind / Hamaker, Jonathan / Picone, Joseph:
"Hybrid SVM/HMM architectures for speech recognition",
vol. 4, 504-507.
Adaptation and Acquisition in Spoken Language Processing 1, 2
Sasaki, Koki / Jiang, Hui / Hirose, Keikichi:
"Rapid adaptation of n-gram language models using inter-word correlation for speech recognition",
vol. 4, 508-511.
Moore, Gareth / Young, Steve:
"Class-based language model adaptation using mixtures of word-class weights",
vol. 4, 512-515.
Sun, Jiasong / Cui, Xiaodong / Wang, Zuoying / Liu, Yang:
"A language model adaptation approach based on text classification",
vol. 4, 516-519.
Chung, Grace:
"Automatically incorporating unknown words in JUPITER",
vol. 4, 520-523.
Chengalvarayan, Rathinavelu:
"Look-ahead sequential feature vector normalization for noisy speech recognition",
vol. 4, 524-527.
Iwahashi, Naoto / Kawasaki, Akihiko:
"Speaker adaptation in noisy environments based on parameter estimation using uncertain data",
vol. 4, 528-531.
Acero, Alex / Altschuler, Steven / Wu, Lani:
"Speech/noise separation using two microphones and a VQ model of speech signals",
vol. 4, 532-535.
Bacchiani, Michiel:
"Using maximum likelihood linear regression for segment clustering and speaker identification",
vol. 4, 536-539.
Myrvoll, Tor André / Siohan, Olivier / Lee, Chin-Hui / Chou, Wu:
"Structural maximum a-posteriori linear regression for unsupervised speaker adaptation",
vol. 4, 540-543.
Chien, Jen-Tzung / Liao, Guo-Hong:
"Transformation-based Bayesian predictive classification for online environmental learning and robust speech recognition",
vol. 4, 544-547.
Pitz, Michael / Wessel, Frank / Ney, Hermann:
"Improved MLLR speaker adaptation using confidence measures for conversational speech recognition",
vol. 4, 548-551.
Chengalvarayan, Rathinavelu:
"Unified acoustic modeling for continuous speech recognition",
vol. 4, 552-555.
Dharanipragada, Satya / Padmanabhan, Mukund:
"A nonlinear unsupervised adaptation technique for speech recognition",
vol. 4, 556-559.
Doh, Sam-Joo / Stern, Richard M.:
"Using class weighting in inter-class MLLR",
vol. 4, 560-563.
Acoustics of Spoken Language (Poster)
Hosom, John-Paul / Cole, Ronald A.:
"Burst detection based on measurements of intensity discrimination",
vol. 4, 564-567.
Ferreiros López, Javier / Ellis, Daniel P. W.:
"Using acoustic condition clustering to improve acoustic change detection on broadcast news",
vol. 4, 568-571.
Nedel, Jon P. / Singh, Rita / Stern, Richard M.:
"Phone transition acoustic modeling: application to speaker independent and spontaneous speech systems",
vol. 4, 572-575.
Shen, Liqin / Fu, Guokang / Chai, Haixin / Qin, Yong:
"The measurement of acoustic similarity and its applications",
vol. 4, 576-579.
Yi, Sopae / Kim, Hyung Soon / Lee, One Good:
"Glottal parameters contributing to the perceotion of loud voices",
vol. 4, 580-583.
Schillo, Christoph / Fink, Gernot A. / Kummert, Franz:
"Grapheme based speech recognition for large vocabularies",
vol. 4, 584-587.
Nedel, Jon P. / Singh, Rita / Stern, Richard M.:
"Automatic subword unit refinement for spontaneous speech recognition via phone splitting",
vol. 4, 588-591.
Tarui, Takeshi:
"Rhythm timing in Japanese English",
vol. 4, 592-595.
Iwaki, Mamoru:
"A vocal tract area ratio estimation from spectral parameter extracted by straight",
vol. 4, 596-599.
Ramabhadran, Bhuvana / Gao, Yuqing:
"Decision tree based rate of speech modeling for speech recognition",
vol. 4, 600-603.
Padmanabhan, Mukund:
"Spectral peak tracking and its use in speech recognition",
vol. 4, 604-607.
Li, Yongxin / Gao, Yuqing / Erdogan, Hakan:
"Weighted pairwise scatter to improve linear discriminant analysis",
vol. 4, 608-611.
Matousek, Jindrich / Psutka, Josef:
"ARTIC: a new Czech text-to-speech system using statistical approach to speech segment database construction",
vol. 4, 612-615.
Chou, Wu / Siohan, Olivier / Myrvoll, Tor André / Lee, Chin-Hui:
"Extended maximum a posterior linear regression (EMAPLR) model adaptation for speech recognition",
vol. 4, 616-619.
Maneenoi, Ekkarit / Jitapunkul, Somchai / Ahkuputra, Visarut / Thathong, Umavasee / Thampanitchawong, Boonchai / Luksaneeyanawin, Sudaporn:
"Thai monophthong recognition using continuous density hidden Markov model and LPC cepstral coefficients",
vol. 4, 620-623.
Wu, Chung-Hsien / Chen, Yeou-Jiunn / Yang, Cher-Yao:
"Error recovery and sentence verification using statistical partial pattern tree for conversational speech",
vol. 4, 624-627.
Howitt, Andrew Wilson:
"Vowel landmark detection",
vol. 4, 628-631.
Meyer, Carsten / Rose, Georg:
"Rival training: efficient use of data in discriminative training",
vol. 4, 632-635.
Chen, Marilyn Y.:
"Nasal detection module for a knowledge-based speech recognition system",
vol. 4, 636-639.
Liu, Jun / Zhu, Xiaoyan / Jia, Bin:
"Semi-continuous segmental probability model for speech signals",
vol. 4, 640-643.
Jan, Ea-Ee / Botella Ordinas, Jaime:
"Cross-domain robust acoustic training",
vol. 4, 644-647.
Wang, Fan / Zheng, Fang / Wu, Wenhu:
"A c/v segmentation method for Mandarin speech based on multiscale fractal dimension",
vol. 4, 648-651.
Chen, Xiaoxia / Li, Aijun / Sun, Guohua / Hua, Wu / Yu, Zhigang:
"An application of SAMPA-c for standard Chinese",
vol. 4, 652-655.
Signal Analysis, Processing, and Feature Extraction
Lu, Wenkai / Zhang, Xuegong / Li, Yanda / Liqin, Shen / Weibin, Zhu:
"Joint speech signal enhancement based on spectral subtraction and SVD filter",
vol. 4, 656-659.
Krstulovic, Sacha / Bimbot, Frédéric:
"Inverse lattice filtering of speech with adapted non-uniform delays",
vol. 4, 660-663.
Kawahara, Hideki / Atake, Yoshinori / Zolfaghari, Parham:
"Accurate vocal event detection method based on a fixed-point analysis of mapping from time to weighted average group delay",
vol. 4, 664-667.
Huang, Jun / Padmanabhan, Mukund:
"Filterbank-based feature extraction for speech recognition and its application to voice mail transcription",
vol. 4, 668-671.
Murphy, Peter J.:
"A cepstrum-based harmonics-to-noise ratio in voice signals",
vol. 4, 672-675.
Sun, Xuejing:
"A pitch determination algorithm based on subharmonic-to-harmonic ratio",
vol. 4, 676-679.
Solé i Casals, Jordi / Monte i Moreno, Enric / Jutten, Christian / Taleb, Anisse:
"Source separation techniques applied to speech linear prediction",
vol. 4, 680-683.
Sugiyama, Masahide:
"Model based voice decomposition method",
vol. 4, 684-687.
Funaki, Keiichi:
"A time-varying complex speech analysis based on IV method",
vol. 4, 688-691.
Zolfaghari, Parham / Kawahara, Hideki:
"A sinusoidal model based on frequency-to-instantaneous frequency mapping",
vol. 4, 692-695.
Farooq, Omar / Datta, Sekharjit:
"Dynamic feature extraction by wavelet analysis",
vol. 4, 696-699.
Karnjanadecha, Montri / Zahorian, Stephen A.:
"An investigation of variable block length methods for calculation of spectral/temporal features for automatic speech recognition",
vol. 4, 700-703.
Sasou, Akira / Tanaka, Kazuyo:
"Glottal excitation modeling using HMM with application to robust analysis of speech signal",
vol. 4, 704-707.
Docío-Fernández, Laura / García-Mateo, Carmen:
"Automatic segmentation of speech based on hidden Markov models and acoustic features",
vol. 4, 708-711.
Kurematsu, Akira / Akegami, Youichi / Burge, Susanne / Jekat, Susanne / Lause, Brigitte / Maclaren, Victoria L. / Oppermann, Daniela / Schultz, Tanja:
"VERBMOBIL dialogues: multifaced analysis",
vol. 4, 712-715.
Zhang, Jin-Jie / Cao, Zhi-Gang / Ma, Zheng-Xin:
"A computation-efficient parameter adaptation algorithm for the generalized spectral subtraction method",
vol. 4, 716-719.
Araki, Masahiro / Ueda, Kiyoshi / Nishimoto, Takuya / Niimi, Yasuhisa:
"A semantic tagging tool for spoken dialogue corpus",
vol. 4, 720-723.
Li, Aijun / Chen, Xiaoxia / Sun, Guohua / Hua, Wu / Yin, Zhigang / Zu, Yiqing / Zheng, Fang / Song, Zhanjiang:
"The phonetic labeling on read and spontaneous discourse corpora",
vol. 4, 724-727.
Beringer, Nicole / Schiel, Florian:
"The quality of multilingual automatic segmentation using German MAUS",
vol. 4, 728-731.
Radová, Vlasta / Psutka, Josef:
"UWB_S01 corpus - a czech read-speech corpus",
vol. 4, 732-735.
Fabbrizio, Giuseppe Di / Narayanan, Shrikanth:
"Web-based monitoring, logging and reporting tools for multi-service multi-modal systems",
vol. 4, 736-739.
Strik, Helmer / Cucchiarini, Catia / Kessens, Judith M.:
"Comparing the recognition performance of CSRs: in search of an adequate metric and statistical significance test",
vol. 4, 740-743.
Raake, Alexander:
"Perceptual dimensions of speech sound quality in modern transmission systems",
vol. 4, 744-747.