Table of Contents and Access to Abstracts
Keynotes 1-4
Zue, Victor:
"On organic interfaces",
1-8.
Scott, Sophie K.:
"The neural basis of speech perception - a view from functional imaging",
9-13.
Waibel, Alex / Bernardin, Keni / Wölfel, Matthias:
"Computer-supported human-human multilingual communication",
14-21.
Oudeyer, Pierre-Yves:
"Self-organization in the evolution of shared systems of speech sounds: a computational study",
22-29.
Discriminative and Large Margin Techniques in Acoustic Modeling
Li, Jinyu / Lee, Chin-Hui:
"Soft margin feature extraction for automatic speech recognition",
30-33.
Yin, Yan / Jiang, Hui:
"A fast optimization method for large margin estimation of HMMs based on second order cone programming",
34-37.
Li, Hao-Zheng / O'Shaughnessy, Douglas:
"Frame margin probability discriminative training algorithm for noisy speech recognition",
38-41.
Valente, Fabio / Vepa, Jithendra / Plahl, Christian / Gollan, Christian / Hermansky, Hynek / Schlüter, Ralf:
"Hierarchical neural networks feature extraction for LVCSR system",
42-45.
Olsen, Peder A. / Hershey, John R.:
"Bhattacharyya error and divergence using variational importance sampling",
46-49.
Wu, Tingyao / Duchateau, Jacques / Compernolle, Dirk:
"Phoneme dependent frame selection preference",
50-53.
Speech Production I, II
Zhou, Xinhui / Espy-Wilson, Carol Y. / Tiede, Mark / Boyce, Suzanne:
"An articulatory and acoustic study of "retroflex" and "bunched" American English rhotic sound based on MRI",
54-57.
Martins, Paula / Carbone, Inês / Silva, Augusto / Teixeira, António J. S.:
"An MRI study of European Portuguese nasals",
58-61.
Takano, Sayoko / Matsuzaki, Hiroki / Motoki, Kunitoshi:
"A four-cube FEM model of the extrinsic and intrinsic tongue muscles to simulate the production of vowel /i/",
62-65.
Torres, Juan / Moore, Elliot:
"Performance evaluation of glottal quality measures from the perspective of vocal tract filter consistency",
66-69.
Singampalli, Veena D. / Jackson, Philip J. B.:
"Statistical identification of critical, dependent and redundant articulators",
70-73.
Qin, Chao / Carreira-Perpiñán, Miguel Á.:
"An empirical investigation of the nonuniqueness in the acoustic-to-articulatory mapping",
74-77.
Dusan, Sorin:
"Vocal tract length during speech production",
1366-1369.
Miki, Nobuhiro / Hayashi, Kyohei:
"Approximation method of subglottal system using ARMA filter",
1370-1373.
Toutios, Asterios / Margaritis, Konstantinos:
"Enhancing acoustic-to-EPG mapping with lip position information",
1374-1377.
Kaburagi, Tokihiko / Tanabe, Yosuke:
"A model of glottal flow incorporating viscous-inviscid interaction",
1378-1381.
Seeber, Kilian G.:
"Thinking outside the cube: modeling language processing tasks in a multiple resource paradigm",
1382-1385.
Cisonni, Julien / Hirtum, Annemie Van / Willems, Jan / Pelorson, Xavier:
"Experimental validation of direct and inverse glottal flow models for unsteady flow conditions",
1386-1389.
Nomura, Hideyuki / Funada, Tetsuo:
"Effect of unsteady glottal flow on the speech production process",
1390-1393.
Schneider, Katrin / Möbius, Bernd:
"Word stress correlates in spontaneous child-directed speech in German",
1394-1397.
Aron, Michael / Ferveur, Nicolas / Kerrien, Erwan / Berger, Marie-Odile / Laprie, Yves:
"Acquisition and synchronization of multimodal articulatory data",
1398-1401.
Robert, Vincent / Laprie, Yves / Bonneau, Anne:
"A phonetic concatenative approach of labial coarticulation",
1402-1405.
Turkmani, Aseel / Hilton, Adrian / Jackson, Philip J. B. / Edge, James:
"Visual analysis of lip coarticulation in VCV utterances",
1406-1409.
Airas, Matti / Alku, Paavo:
"Comparison of multiple voice source parameters in different phonation types",
1410-1413.
Knoll, Monja / Scharrer, Lisa:
"Acoustic and affective comparisons of natural and imaginary infant-, foreigner- and adult-directed speech",
1414-1417.
Araújo, André / Jesus, Luis M. T. / Costa, Isabel M.:
"Vowel production in two occlusal classes",
1418-1421.
Khatiwada, Rajesh:
"Nepalese retroflex stops: a static palatography study of inter- and intra-speaker variability",
1422-1425.
Lamoureux, Charles A. / Boucher, Victor J.:
"Effects of testosterone levels on temporal and intonational aspects of speech: more exploratory data",
1426-1428.
Phonetic Segmentation and Classification I, II
Karsmakers, Peter / Pelckmans, Kristiaan / Suykens, Johan / hamme, Hugo Van:
"Fixed-size kernel logistic regression for phoneme classification",
78-81.
Park, Seung Seop / Shin, Jong Won / Kim, Jong Kyu / Kim, Nam Soo:
"A multiple-model based framework for automatic speech segmentation",
82-85.
Jansen, Aren / Niyogi, Partha:
"Semi-supervised learning of speech sounds",
86-89.
Parate, Abhinav / Verma, Ashish / Basak, Jayanta:
"Evaluation of syllable stress using single class classifier",
90-93.
Huda, Mohammad Nurul / Muhammad, Ghulam / Horikawa, Junsei / Nitta, Tsuneo:
"Distinctive phonetic feature (DPF) based phone segmentation using hybrid neural networks",
94-97.
Goldman, J.-Ph. / Avanzi, M. / Simon, A.-C. / Lacheret, Anne / Auchlin, A.:
"A methodology for the automatic detection of perceived prominent syllables in spoken French",
98-101.
Niu, Xiaochuan / Santen, Jan P. H. van:
"Dual-channel acoustic detection of nasalization states",
1921-1924.
Pruthi, Tarun / Espy-Wilson, Carol Y.:
"Acoustic parameters for the automatic detection of vowel nasalization",
1925-1928.
Hou, Jun / Rabiner, Lawrence R. / Dusan, Sorin:
"On the use of time-delay neural networks for highly accurate classification of stop consonants",
1929-1932.
Golipour, Ladan / O'Shaughnessy, Douglas:
"A new approach for phoneme segmentation of speech signals",
1933-1936.
Stouten, Veronique / Demuynck, Kris / hamme, Hugo Van:
"Automatically learning the units of speech by non-negative matrix factorisation",
1937-1940.
Kalinli, Ozlem / Narayanan, Shrikanth S.:
"A saliency-based auditory attention model with applications to unsupervised prominent syllable detection in speech",
1941-1944.
An, Sung Jun / Kim, Young-Ik / Kil, Rhee Man:
"Zero-crossing-based ratio masking for sound segregation",
1945-1948.
Tanaka, Satomi / Tsuzaki, Minoru / Kato, Hiroaki / Sagisaka, Yoshinori:
"Event detection of speech signals based on auditory processing with a dynamic compressive gammachirp filterbank",
1949-1952.
Scharenborg, Odette / Ernestus, Mirjam / Wan, Vincent:
"Segmentation of speech: child's play?",
1953-1956.
Errity, Andrew / McKenna, John / Kirkpatrick, Barry:
"Dimensionality reduction methods applied to both magnitude and phase derived features",
1957-1960.
Discourse, Dialog and Conversation
Mori, Hiroki / Kasuya, Hideki:
"Voice source and vocal tract variations as cues to emotional states perceived from expressive conversational speech",
102-105.
Yang, Fan / Heeman, Peter A.:
"Exploring initiative strategies using computer simulation",
106-109.
Tseng, Chiu-yu / Su, Zhao-yu:
"From one base form to multiple output styles - predicting stylistic dynamics of discourse prosody",
110-113.
Crocco, Claudia / Savy, Renata:
"Topic in dialogue: prosodic and syntactic features",
114-117.
Watanabe, Michiko / Den, Yasuharu / Hirose, Keikichi / Miwa, Shusaku / Minematsu, Nobuaki:
"Features of pauses and conjunctions at syntactic and discourse boundaries in Japanese monologues",
118-121.
Spoken Dialog Systems I, II
Wootton, Craig / McTear, Michael / Anderson, Terry:
"Utilizing online content as domain knowledge in a multi-domain dynamic dialogue system",
122-125.
Schooten, Boris van / Rosset, Sophie / Galibert, Olivier / Max, Aurélien / Akker, Rieks op den / Illouz, Gabriel:
"Handling speech input in the Ritel QA dialogue system",
126-129.
Kim, Woosung:
"Online call quality monitoring for automating agent-based call centers",
130-133.
Möller, Sebastian / Engelbrecht, Klaus-Peter / Oulasvirta, Antti:
"Analysis of communication failures for spoken dialogue systems",
134-137.
Mann, Sandra / Berton, André / Ehrlich, Ute:
"How to access audio files of large data bases using in-car speech dialogue systems",
138-141.
Komatani, Kazunori / Kawahara, Tatsuya / Okuno, Hiroshi G.:
"Analyzing temporal transition of real user's behaviors in a spoken dialogue system",
142-145.
Sherwani, J. / Yu, Dong / Paek, Tim / Czerwinski, Mary / Ju, Yun-Cheng / Acero, Alex:
"Voicepedia: towards speech-based access to unstructured information",
146-149.
Rangarajan, Vivek / Bangalore, Srinivas / Narayanan, Shrikanth S.:
"Exploiting prosodic features for dialog act tagging in a discriminative modeling framework",
150-153.
Ai, Hua / Roque, Antonio / Leuski, Anton / Traum, David:
"Using information state to improve dialogue move identification in a spoken dialogue system",
154-157.
Chu, Shiu-Wah / O'Neill, Ian / Hanna, Philip:
"Using multiple strategies to manage spoken dialogue",
158-161.
Quinderé, Marcelo / Lopes, Luís Seabra / Teixeira, António J. S.:
"An information state based dialogue manager for a mobile robot",
162-165.
Yu, Dong / Ju, Yun-Cheng / Wang, Ye-Yi / Zweig, Geoffrey / Acero, Alex:
"Automated directory assistance system - from theory to practice",
2709-2712.
Zweig, Geoffrey / Nguyen, Patrick / Ju, Yun-Cheng / Wang, Ye-Yi / Yu, Dong / Acero, Alex:
"The voice-rate dialog system for consumer ratings",
2713-2716.
Winterboer, Andi / Hu, Jiang / Moore, Johanna D. / Nass, Clifford:
"The influence of user tailoring and cognitive load on user performance in spoken dialogue systems",
2717-2720.
Wang, Ye-Yi / Yu, Dong / Ju, Yun-Cheng / Zweig, Geoffrey / Acero, Alex:
"Confidence measures for voice search applications",
2721-2724.
Higashinaka, Ryuichiro / Dohsaka, Kohji / Amano, Shigeaki / Isozaki, Hideki:
"Effects of quiz-style information presentation on user understanding",
2725-2728.
Kuo, Hong-Kwang Jeff / Goel, Vaibhava:
"A data visualization and analysis method for natural language call routing system design",
2729-2732.
Accent and Language Identification I, II
Bauer, Josef G. / Andrassy, Bernt / Timoshenko, Ekaterina:
"Discriminative optimization of language adapted HMMs for a language identification system based on parallel phoneme recognizers",
166-169.
Sim, Khe Chai / Li, Haizhou:
"Fusion of contrastive acoustic models for parallel phonotactic spoken language identification",
170-173.
Wang, Liang / Ambikairajah, Eliathamby / Choi, Eric H. C.:
"Multi-layer Kohonen self-organizing feature map for language identification",
174-177.
Yin, Bo / Ambikairajah, Eliathamby / Chen, Fang:
"Hierarchical language identification based on automatic language clustering",
178-181.
Timoshenko, Ekaterina / Höge, Harald:
"Using speech rhythm for acoustic language identification",
182-185.
Wong, Ka-keung / Siu, Man-hung / Mak, Brian:
"A model-based estimation of phonotactic language verification performance",
186-189.
Rosner, Mike / Farrugia, Paulseph-John:
"A tagging algorithm for mixed language identification in a noisy domain",
190-193.
Toledano, Doroteo T. / Gonzalez-Dominguez, Javier / Abejon-Gonzalez, Alejandro / Spada, Danilo / Mateos-Garcia, Ismael / Gonzalez-Rodriguez, Joaquin:
"Improved language recognition using better phonetic decoders and fusion with MFCC and SDC features",
194-197.
Leeuwen, David A. van / Truong, Khiet P.:
"An open-set detection evaluation methodology applied to language and emotion recognition",
338-341.
Yang, Xi / Siu, Man-hung / Gish, Herbert / Mak, Brian:
"Boosting with anti-models for automatic language identification",
342-345.
Castaldo, Fabio / Colibro, Daniele / Dalmasso, Emanuele / Laface, Pietro / Vair, Claudio:
"Acoustic language identification using fast discriminative training",
346-349.
Li, Ming / Suo, Hongbin / Wu, Xiao / Lu, Ping / Yan, Yonghong:
"Spoken language identification using score vector modeling and support vector machine",
350-353.
Cordoba, R. / D'Haro, L. F. / Fernandez-Martinez, F. / Macias-Guarasa, J. / Ferreiros, J.:
"Language identification based on n-gram frequency ranking",
354-357.
Shen, Wade / Reynolds, Douglas:
"Improving phonotactic language recognition with acoustic adaptation",
358-361.
Education and Training
Bolanos, Daniel / Ward, Wayne / Vuuren, Sarel Van / Garrido, Javier:
"Syllable lattices as a basis for a children's speech reading tracker",
198-201.
Pan, Fuping / Zhao, Qingwei / Yan, Yonghong:
"Mandarin vowel pronunciation quality evaluation by using formant pattern recognition",
202-205.
Black, Matthew / Tepperman, Joseph / Lee, Sungbok / Price, Patti / Narayanan, Shrikanth S.:
"Automatic detection and classification of disfluent reading miscues in young children's speech for the purpose of assessment",
206-209.
Minematsu, Nobuaki / Kamata, K. / Asakawa, Satoshi / Makino, T. / Nishimura, T. / Hirose, Keikichi:
"Structural assessment of language learners' pronunciation",
210-213.
Samir, Abdurrahman / Abdou, Sherif Mahdy / Khalil, Ahmed Husien / Rashwan, Mohsen:
"Enhancing usability of CAPL system for Qur'an recitation learning",
214-217.
Wet, Febe de / Walt, Christa van der / Niesler, Thomas:
"Automatic large-scale oral language proficiency assessment",
218-221.
Robust ASR I, II
Denda, Yuki / Tanaka, Takamasa / Nakayama, Masato / Nishiura, Takanobu / Yamashita, Yoichi:
"Noise-robust hands-free voice activity detection with adaptive zero crossing detection using talker direction estimation",
222-225.
Álvarez, A. / Martínez, R. / Gómez, P. / Nieto, V. / Rodellar, V.:
"A robust mel-scale subband voice activity detector for a car platform",
226-229.
Ishizuka, Kentaro / Nakatani, Tomohiro / Fujimoto, Masakiyo / Miyazaki, Noboru:
"Noise robust front-end processing with voice activity detection based on periodic to aperiodic component ratio",
230-233.
Toh, A. M. / Togneri, Roberto / Nordholm, Sven:
"Feature and distribution normalization schemes for statistical mismatch reduction in reverberant speech recognition",
234-237.
Gibson, Matthew / Hain, Thomas:
"Temporal masking for unsupervised minimum Bayes risk speaker adaptation",
238-241.
Hsieh, Tsung-hsueh / Hung, Jeih-weih:
"Speech feature compensation based on pseudo stereo codebooks for robust speech recognition in additive noise environments",
242-245.
Dimitriadis, Dimitrios / Maragos, Petros / Lefkimmiatis, Stamatios:
"Multiband, multisensor robust features for noisy speech recognition",
246-249.
Sasou, Akira / Kojima, Hiroaki:
"Noise robust speech recognition for voice driven wheelchair",
250-253.
Hu, Yu / Huo, Qiang:
"Irrelevant variability normalization based HMM training using VTS approximation of an explicit model of environmental distortions",
1042-1045.
Buera, Luis / Miguel, Antonio / Lleida, Eduardo / Saz, Óscar / Ortega, Alfonso:
"On the jointly unsupervised feature vector normalization and acoustic model compensation for robust speech recognition",
1046-1049.
Tsao, Yu / Lee, Chin-Hui:
"An ensemble modeling approach to joint characterization of speaker and speaking environments",
1050-1053.
Lin, Shih-Hsiang / Yeh, Yao-Ming / Chen, Berlin:
"Cluster-based polynomial-fit histogram equalization (CPHEQ) for robust speech recognition",
1054-1057.
Martinez, Pedro M. / Segura, Jose C. / Garcia, Luz:
"Robust distributed speech recognition using histogram equalization and correlation information",
1058-1061.
Chien, Jen-Tzung / Shinoda, Koichi / Furui, Sadaoki:
"Predictive minimum Bayes risk classification for robust speech recognition",
1062-1065.
Ma, Ning / Barker, Jon / Green, Phil:
"Applying word duration constraints by using unrolled HMMs",
1066-1069.
Xiao, Xiong / Chng, Eng Siong / Li, Haizhou:
"Evaluating the temporal structure normalisation technique on the Aurora-4 task",
1070-1073.
Bořil, Hynek / Fousek, Petr / Höge, Harald:
"Two-stage system for robust neutral/Lombard speech recognition",
1074-1077.
Jitsuhiro, Takatoshi / Toriyama, Tomoji / Kogure, Kiyoshi:
"Noise suppression using search strategy with multi-model compositions",
1078-1081.
Nishiura, Takanobu / Hirano, Yoshiki / Denda, Yuki / Nakayama, Masato:
"Investigations into early and late reflections on distant-talking speech recognition toward suitable reverberation criteria",
1082-1085.
Windmann, Stefan / Haeb-Umbach, Reinhold:
"An approach to iterative speech feature enhancement and recognition",
1086-1089.
Hung, Jeih-weih:
"Optimization of temporal filters in the modulation frequency domain for constructing robust features in speech recognition",
1090-1093.
Petrick, Rico / Lohde, Kevin / Wolff, Matthias / Hoffmann, Rüdiger:
"The harming part of room acoustics in automatic speech recognition",
1094-1097.
Liao, Yuan Fu / Yang, Yh-Her / Hsu, Chi-Hui / Lee, Cheng-Chang / Zeng, Jing-Teng:
"A reference model weighting-based method for robust speech recognition",
1098-1101.
Nasersharif, Babak / Akbari, Ahmad / Homayounpour, Mohammad Mehdi:
"Mel sub-band filtering and compression for robust speech recognition",
1102-1105.
Adaptation in ASR I, II
Tang, Yun / Rose, Richard:
"Clustered maximum likelihood linear basis for rapid speaker adaptation",
254-257.
Teng, Wenxuan / Gravier, Guillaume / Bimbot, Frédéric / Soufflet, Frédéric:
"Rapid speaker adaptation by reference model interpolation",
258-261.
Gomez, Randy / Toda, Tomoki / Saruwatari, Hiroshi / Shikano, Kiyohiro:
"Rapid unsupervised speaker adaptation using single utterance based on MLLR and speaker selection",
262-265.
Mak, Brian / Hsiao, Roger:
"Robustness of several kernel-based fast adaptation methods on noisy LVCSR",
266-269.
Pylkkönen, Janne:
"Estimating VTLN warping factors by distribution matching",
270-273.
Liu, Ming / Zhou, Xi / Hasegawa-Johnson, Mark / Huang, Thomas S. / Zhang, Zhengyou:
"Frequency domain correspondence for speaker normalization",
274-277.
Nishida, Masafumi / Horiuchi, Yasuo / Ichikawa, Akira:
"Unsupervised training of adaptation rate using Q-learning in large vocabulary continuous speech recognition",
278-281.
Karafiát, Martin / Burget, Lukáš / Černocký, Jan / Hain, Thomas:
"Application of CMLLR in narrow band wide band adapted systems",
282-285.
Lévy, Christophe / Linarès, Georges / Bonastre, Jean-François:
"Fast adaptation of GMM-based compact models",
286-289.
Lööf, Jonas / Schlüter, Ralf / Ney, Hermann:
"Efficient estimation of speaker-specific projecting feature transforms",
1557-1560.
Omar, Mohamed Kamal:
"Regularized feature-based maximum likelihood linear regression for speech recognition",
1561-1564.
Morales, Omar Caballero / Cox, Stephen:
"Modelling confusion matrices to improve speech recognition accuracy, with an application to dysarthric speech",
1565-1568.
Huo, Qiang / Li, Wei:
"An active approach to speaker and task adaptation based on automatic analysis of vocabulary confusability",
1569-1572.
Zheng, Jing / Stolcke, Andreas:
"fMPE-MAP: improved discriminative adaptation for modeling new domains",
1573-1576.
Hazen, Timothy J. / McDermott, Erik:
"Discriminative MCE-based speaker adaptation of acoustic models for a spoken lecture processing task",
1577-1580.
Speaker Verification & Identification I-IV
Karam, Zahi N. / Campbell, William M.:
"A new kernel for SVM MLLR based speaker recognition",
290-293.
Lee, Kong-Aik / You, Changhuai / Li, Haizhou / Kinnunen, Tomi:
"A GMM-based probabilistic sequence kernel for speaker verification",
294-297.
Aronowitz, Hagai:
"Speaker recognition using kernel-PCA and intersession variability modeling",
298-301.
Dehak, Réda / Dehak, Najim / Kenny, Patrick / Dumouchel, Pierre:
"Linear and non linear kernel GMM supervector machines for speaker verification",
302-305.
Lopez-Moreno, Ignacio / Mateos-Garcia, Ismael / Ramos, Daniel / Gonzalez-Rodriguez, Joaquin:
"Support vector regression for speaker verification",
306-309.
Longworth, C. / Gales, M. J. F.:
"Derivative and parametric kernels for speaker verification",
310-313.
Calvo, Jose R. / Fernández, Rafael / Hernández, Gabriel:
"Application of shifted delta cepstral features in speaker verification",
734-737.
Ferrer, Luciana / Sönmez, Kemal / Shriberg, Elizabeth:
"A smoothing kernel for spatially related features and its application to speaker verification",
738-741.
Charlet, D. / Collet, M. / Bimbot, Frédéric:
"VZ-norm: an extension of z-norm to the multivariate case for anchor model based speaker verification",
742-745.
Lei, Howard / Mirghafori, Nikki:
"Word-conditioned HMM supervectors for speaker recognition",
746-749.
Tsai, Wei-Ho:
"Speaker clustering using direct maximization of a BIC-based score",
750-753.
Preti, A. / Bonastre, Jean-François / Matrouf, Driss / Capman, F. / Ravera, B.:
"Confidence measure based unsupervised target model adaptation for speaker verification",
754-757.
Bao, Huanjun / Xu, Ming-Xing / Zheng, Thomas Fang:
"Emotion attribute projection for speaker recognition on emotional speech",
758-761.
Zhang, Shi-Xiong / Mak, Man-Wai / Meng, Helen:
"High-level feature-based speaker verification via articulatory phonetic-class pronunciation modeling",
762-765.
Yingthawornsuk, T. / Keskinpala, H. Kaymaz / Wilkes, D. M. / Shiavi, R. G. / Salomon, R. M.:
"Direct acoustic feature using iterative EM algorithm and spectral energy for classifying suicidal speech",
766-769.
Garreton, Claudio / Yoma, Nestor Becerra / Huenupán, Fernando / Molina, Carlos:
"On comparing and combining intra-speaker variability compensation and unsupervised model adaptation in speaker verification",
770-773.
Zhao, Xianyu / Dong, Yuan / Yang, Hao / Zhao, Jian / Lu, Liang / Wang, Haila:
"Comparison of two kinds of speaker location representation for SVM-based speaker verification",
774-777.
Farrús, Mireia / Hernando, Javier / Ejarque, Pascual:
"Jitter and shimmer measurements for speaker recognition",
778-781.
Shan, Zhenyu / Yang, Yingchun / Ye, Ruizhi:
"Natural-emotion GMM transformation algorithm for emotional speaker recognition",
782-785.
Tseng, Ivy H. / Verscheure, Olivier / Turaga, Deepak S. / Chaudhari, Upendra V.:
"Optimized one-bit quantization for adapted GMM-based speaker verification",
786-789.
McLaren, Mitchell / Vogt, Robbie / Baker, Brendan / Sridharan, Sridha:
"A comparison of session variability compensation techniques for SVM-based speaker recognition",
790-793.
Fauve, Benoît / Evans, Nicholas / Pearson, Neil / Bonastre, Jean-François / Mason, John:
"Influence of task duration in text-independent speaker verification",
794-797.
Shriberg, Elizabeth / Ferrer, Luciana:
"A text-constrained prosodic system for speaker verification",
1226-1229.
Hannani, Asmaa El / Petrovska-Delacrétaz, Dijana:
"Fusing acoustic, phonetic and data-driven systems for text-independent speaker verification",
1230-1233.
Dehak, Najim / Kenny, Patrick / Dumouchel, Pierre:
"Continuous prosodic features and formant modeling with joint factor analysis for speaker verification",
1234-1237.
Vair, Claudio / Colibro, Daniele / Castaldo, Fabio / Dalmasso, Emanuele / Laface, Pietro:
"Loquendo - Politecnico di Torino's 2006 NIST speaker recognition evaluation system",
1238-1241.
Matrouf, Driss / Scheffer, Nicolas / Fauve, Benoît / Bonastre, Jean-François:
"A straightforward and efficient implementation of the factor analysis model for speaker verification",
1242-1245.
Hazen, Timothy J. / Schultz, Daniel:
"Multi-modal user authentication from video for mobile or variable-environment applications",
1246-1249.
Gerber, Michael / Beutler, René / Pfister, Beat:
"Quasi text-independent speaker-verification based on pattern matching",
1993-1996.
Solewicz, Yosef A. / Koppel, Moshe:
"Virtual fusion for speaker recognition",
1997-2000.
Chao, Yi-Hsiang / Tsai, Wei-Ho / Cheng, Shih-Sian / Wang, Hsin-Min / Chang, Ruei-Chuan:
"Evolutionary minimum verification error learning of the alternative hypothesis model for LLR-based speaker verification",
2001-2004.
Nakagawa, Seiichi / Asakawa, Kouhei / Wang, Longbiao:
"Speaker recognition by combining MFCC and phase information",
2005-2008.
Manocha, Sandeep / Espy-Wilson, Carol Y.:
"A semi-automatic approach for speaker mining of tapped telephone conversations",
2009-2012.
Yang, Hao / Dong, Yuan / Zhao, Xianyu / Zhao, Jian / Lu, Liang / Wang, Haila:
"Cluster adaptive training weights as features in SVM-based speaker verification",
2013-2016.
Okamoto, Hideki / Kojima, Mariko / Matsui, Tomoko / Kawanami, Hiromichi / Saruwatari, Hiroshi / Shikano, Kiyohiro:
"Study on speaker verification with non-audible murmur segments",
2017-2020.
Lu, Xugang / Dang, Jianwu:
"Dimension reduction for speaker identification based on mutual information",
2021-2024.
Lindh, Jonas / Eriksson, Anders:
"Robustness of long time measures of fundamental frequency",
2025-2028.
Prakash, Vinod / Hansen, John H. L.:
"Score distribution scaling for speaker recognition",
2029-2032.
Morris, A. C. / Koreman, J. / Ly-Van, B. / Sellahewa, H. / Jassim, S. / Gómez, R. Llarena:
"Global features for rapid identity verification with dynamic biometric data",
2033-2036.
Pham, Tuan Van / Neffe, Michael / Kubin, Gernot:
"Robust voice activity detection for narrow-bandwidth speaker verification under adverse environments",
2037-2040.
Huenupán, Fernando / Yoma, Nestor Becerra / Molina, Carlos / Garreton, Claudio:
"Speaker verification with multiple classifier fusion using Bayes based confidence measure",
2041-2044.
Chetty, Girija / Wagner, Michael:
"Audiovisual speaker identity verification based on lip motion features",
2045-2048.
Tur, Gokhan / Shriberg, Elizabeth / Stolcke, Andreas / Kajarekar, Sachin:
"Duration and pronunciation conditioned lexical modeling for speaker verification",
2049-2052.
Bonastre, Jean-François / Matrouf, Driss / Fredouille, Corinne:
"Artificial impostor voice transformation effects on false acceptance rates",
2053-2056.
Spoken Data Retrieval I, II
Miller, David R. H. / Kleber, Michael / Kao, Chia-Lin / Kimball, Owen / Colthurst, Thomas / Lowe, Stephen A. / Schwartz, Richard M. / Gish, Herbert:
"Rapid and accurate spoken term detection",
314-317.
Pan, Yi-cheng / Chang, Hung-lin / Chen, Berlin / Lee, Lin-shan:
"Subword-based position specific posterior lattices (s-PSPL) for indexing speech information",
318-321.
Merkel, Andreas / Klakow, Dietrich:
"Improved methods for language model based question classification",
322-325.
Akiba, Tomoyosi / Tsujimura, Hirofumi:
"Error-tolerant question answering for spoken documents",
326-329.
Hakkani-Tür, Dilek / Tur, Gokhan / Levit, Michael:
"Exploiting information extraction annotations for document retrieval in distillation tasks",
330-333.
Thambiratnam, K. / Seide, F.:
"Learning spoken document similarity and recommendation using supervised probabilistic latent semantic analysis",
334-337.
Wallace, Roy / Vogt, Robbie / Sridharan, Sridha:
"A phonetic search approach to the 2006 NIST spoken term detection evaluation",
2385-2388.
Itoh, Yoshiaki / Iwata, Kohei / Kojima, Kazunori / Ishigame, Masaaki / Tanaka, Kazuyo / Lee, Shi-wook:
"An integration method of retrieval results using plural subword models for vocabulary-free spoken document retrieval",
2389-2392.
Vergyri, Dimitra / Shafran, Izhak / Stolcke, Andreas / Gadde, Ramana R. / Akbacak, Murat / Roark, Brian / Wang, Wen:
"The SRI/OGI 2006 spoken term detection system",
2393-2396.
Goto, Masataka / Ogata, Jun / Eto, Kouichirou:
"PodCastle: a Web 2.0 approach to speech recognition research",
2397-2400.
Camelin, Nathalie / Béchet, Frédéric / Damnati, Géraldine / Mori, Renato De:
"Speech mining in noisy audio message corpus",
2401-2404.
Shao, Jian / Zhao, Qingwei / Zhang, Pengyuan / Liu, Zhaojie / Yan, Yonghong:
"A fast fuzzy keyword spotting algorithm based on syllable confusion network",
2405-2408.
Kim, Wooil / Hansen, John H. L.:
"Advances in SpeechFind: transcript reliability estimation employing confidence measure based on discriminative sub-word model for SDR",
2409-2412.
Favre, Benoit / Bonastre, Jean-François / Bellot, Patrice:
"An interactive timeline for speech database browsing",
2413-2416.
Speech Perception I, II
Yip, Michael C. W.:
"Spoken word recognition of Chinese homophones: a further investigation",
362-365.
Wolters, Maria / Campbell, Pauline / DePlacido, Christine / Liddell, Amy / Owens, David:
"The role of outer hair cell function in the perception of synthetic versus natural speech",
366-369.
Kusumoto, Akiko / Kain, Alexander B. / Hosom, John-Paul / Santen, Jan P. H. van:
"Hybridizing conversational and clear speech",
370-373.
Dufour, Sophie / Frauenfelder, Ulrich Hans:
"Neighborhood density and neighborhood frequency effects in French spoken word recognition",
374-377.
Irino, Toshio / Aoki, Yoshie / Hayashi, Yoshie / Kawahara, Hideki / Patterson, Roy D.:
"Discrimination and recognition of scaled word sounds",
378-381.
Tóth, László:
"Benchmarking human performance on the acoustic and linguistic subtasks of ASR systems",
382-385.
Yang, Lin / Zhang, Jianping / Yan, Yonghong:
"Contributions of temporal fine structure cues to Chinese speech recognition in cochlear implant simulation",
386-389.
Wu, Xihong / Chen, Jing / Yang, Zhigang / Huang, Qiang / Wang, Mengyuan / Li, Liang:
"Effect of number of masking talkers on speech-on-speech masking in Chinese",
390-393.
Bagou, Odile / Dufour, Sophie / Fougeron, Cécile / Content, Alain / Frauenfelder, Ulrich Hans:
"Do different boundary types induce subtle acoustic cues to which French listeners are sensitive?",
394-397.
Stadler, Svante / Leijon, Arne / Hagerman, Björn:
"An information theoretic approach to predict speech intelligibility for listeners with normal and impaired hearing",
398-401.
Wade, Travis / Möbius, Bernd:
"Speaking rate effects in a landmark-based phonetic exemplar model",
402-405.
Maniwa, Kazumi / Jongman, Allard / Wade, Travis:
"Acoustic correlates of intelligibility enhancements in clearly produced fricatives",
406-409.
Jürgens, Tim / Brand, Thomas / Kollmeier, Birger:
"Modelling the human-machine gap in speech reception: microscopic speech intelligibility prediction for normal-hearing subjects with an auditory model",
410-413.
Ikeno, Ayako / Hansen, John H. L.:
"Lombard speech impact on perceptual speaker recognition",
414-417.
Goy, Huiwen / Pichora-Fuller, Kathleen / Lieshout, Pascal van / Singh, Gurjit / Schneider, Bruce:
"Effect of within- and between-talker variability on word identification in noise by younger and older adults",
418-421.
Bunnell, H. Timothy / Schanen, N. Carolyn / Vallino, Linda D. / Morlet, Thierry G. / Polikoff, James B. / Driscoll, Jennette D. / Mantell, James T.:
"Speech perception in children with speech sound disorder",
422-425.
Wang, Huan / Hemmert, Werner:
"Speech coding and information processing by auditory neurons",
426-429.
Gilbert, Annie C. / Boucher, Victor J.:
"What do listeners attend to in hearing prosodic structures? investigating the human speech-parser using short-term recall",
430-433.
Brungart, Douglas S. / Iyer, Nandini:
"Time-compressed speech perception with speech and noise maskers",
1581-1584.
Cutler, Anne / Cooke, Martin / Lecumberri, Maria Luisa Garcia / Pasveer, Dennis:
"L2 consonant identification in noise: cross-language comparisons",
1585-1588.
Le, Jennifer T. / Best, Catherine T. / Tyler, Michael D. / Kroos, Christian:
"Effects of non-native dialects on spoken word recognition",
1589-1592.
Meyer, Julien / Meunier, Fanny / Dentel, Laure:
"Identification of natural whistled vowels by non-whistlers",
1593-1596.
Jesse, Alexandra / McQueen, James M.:
"Prelexical adjustments to speaker idiosyncrasies: are they position-specific?",
1597-1600.
Mitterer, Holger:
"Top-down effects on compensation for coarticulation are not replicable",
1601-1604.
Prosody: Prosodic Structure
Igarashi, Yosuke:
"Pitch pattern alternation in Goshogawara Japanese: evidence for a prosodic phrase above the domain for downstep",
434-437.
Nesterenko, Irina / Skrelin, Pavel:
"Some evidence on the phonetics and phonology of prosodic phrasing in Russian",
438-441.
Volín, Jan / Skarnitzl, Radek:
"Temporal downtrends in Czech read speech",
442-445.
Cho, Hyongsil / Hirst, Daniel:
"Empirical evidence for prosodic phrasing: pauses as linguistic annotation in Korean read speech",
446-449.
Dreyer, Markus / Shafran, Izhak:
"Exploiting prosody for PCFGs with latent annotations",
450-453.
Shi, Qin / Jiang, DanNing / Meng, FanPing / Qin, Yong:
"Combining length distribution model with decision tree in prosodic phrase prediction",
454-457.
Yang, Li-chiung:
"Duration and pauses as boundary-markers in speech: a cross-linguistic study",
458-461.
Prosodic Modeling I, II
Yu, Jian / Huang, Lixing / Tao, Jianhua / Wang, Xia:
"Modeling incompletion phenomenon in Mandarin dialog prosody",
462-465.
Tamm, Anne / Abari, Kálmán / Olaszy, Gábor:
"Accent assignment algorithm in Hungarian, based on syntactic analysis",
466-469.
Lin, Cheng-Yuan / Jao, Pei-Chi / Jang, J.-S. Roger:
"An effective initial/final duration prediction method for corpus-based singing voice synthesis of Mandarin Chinese",
470-473.
Németh, Géza / Fék, Márk / Csapó, Tamás Gábor:
"Increasing prosodic variability of text-to-speech synthesizers",
474-477.
Lolive, Damien / Barbot, Nelly / Boeffard, Olivier:
"Unsupervised HMM classification of F0 curves",
478-481.
Read, Ian / Cox, Stephen:
"Automatic pitch accent prediction for text-to-speech synthesis",
482-485.
Ni, Xinqiang / Chen, Yining / Soong, Frank K. / Chu, Min / Zhang, Ping:
"An unsupervised approach to automatic prosodic annotation",
486-489.
Inanoglu, Zeynep / Young, Steve:
"A system for transforming the emotion in speech: combining data-driven conversion techniques for prosody and voice quality",
490-493.
Chiang, Chen-Yu / Yu, Hsiu-Min / Wang, Yih-Ru / Chen, Sin-Horng:
"An automatic prosody labeling method for Mandarin speech",
494-497.
Hirose, Keikichi / Ochi, Keiko / Minematsu, Nobuaki:
"Corpus-based generation of prosodic features from text based on generation process model",
1274-1277.
Tian, Jilei / Nurminen, Jani / Kiss, Imre:
"Novel eigenpitch-based prosody model for text-to-speech synthesis",
1278-1281.
Strom, Volker / Nenkova, Ani / Clark, Robert / Vazquez-Alvarez, Yolanda / Brenier, Jason / King, Simon / Jurafsky, Dan:
"Modelling prominence and emphasis improves unit-selection synthesis",
1282-1285.
Takada, Seiya / Yagi, Yuji / Hirose, Keikichi / Minematsu, Nobuaki:
"A framework of reply speech generation for concept-to-speech conversion in spoken dialogue systems",
1286-1289.
Stocksmeier, Thorsten / Kopp, Stefan / Gibbon, Dafydd:
"Synthesis of prosodic attitudinal variants in German backchannel ja",
1290-1293.
Li, Ke / Greenberg, Yoko / Sagisaka, Yoshinori:
"Inter-language prosodic style modification experiment using word impression vector for communicative speech generation",
1294-1297.
Speech Analysis
Crammer, Koby:
"A conservative aggressive subspace tracker",
498-501.
Nilsson, Mattias / Kleijn, W. Bastiaan:
"Mutual information and the speech signal",
502-505.
Ezzat, Tony / Bouvrie, Jake / Poggio, Tomaso:
"Spectro-temporal analysis of speech using 2-D Gabor filters",
506-509.
Dekens, Tomas / Demol, Mike / Verhelst, Werner / Verhoeve, Piet:
"A comparative study of speech rate estimation techniques",
510-513.
Falk, Tiago H. / Yuan, Hua / Chan, Wai-Yip:
"Spectro-temporal processing for blind estimation of reverberation time and single-ended quality measurement of reverberant speech",
514-517.
Spectral Analysis, Formants and Vocal Tract Models
Waterschoot, Toon van / Moonen, Marc:
"Linear prediction of audio signals",
518-521.
Magi, Carlo / Bäckström, Tom / Alku, Paavo:
"Stabilised weighted linear prediction - a robust all-pole method for speech processing",
522-525.
Rudoy, Daniel / Spendley, Daniel N. / Wolfe, Patrick J.:
"Conditionally linear Gaussian models for estimating vocal tract resonances",
526-529.
Schnell, Karl / Lacroix, Arild:
"Time-varying pre-emphasis and inverse filtering of speech",
530-533.
Thiemann, Joachim / Kabal, Peter:
"Reconstructing audio signals from modified non-coherent Hilbert envelopes",
534-537.
Nguyen, Binh Phu / Akagi, Masato:
"A flexible spectral modification method based on temporal decomposition and Gaussian mixture model",
538-541.
Darch, Jonathan / Milner, Ben:
"A comparison of estimated and MAP-predicted formants and fundamental frequencies with a speech reconstruction application",
542-545.
Deng, Huiqun / O'Shaughnessy, Douglas:
"Effect of incomplete glottal closures on estimates of glottal waves via inverse filtering of vowel sounds",
546-549.
Kalgaonkar, Kaustubh / Clements, Mark A.:
"Vocal tract and area function estimation with both lip and glottal losses",
550-553.
Guruprasad, S. / Yegnanarayana, B. / Murty, K. Sri Rama:
"Detection of instants of glottal closure using characteristics of excitation source",
554-557.
Sturmel, Nicolas / D'Alessandro, Christophe / Doval, Boris:
"A comparative evaluation of the zeros of z transform representation for voice source estimation",
558-561.
Speech and Audio Processing for Intelligent Environments
Loenen, Evert van:
"Ambient intelligence - an overview",
(paper 1315 was not available at the time of publication).
Härmä, Aki:
"Ambient telephony: scenarios and research challenges",
562-565.
Obuchi, Yasunari / Amano, Akio:
"Always listening to you: creating exhaustive audio database in home environments",
566-569.
Schmalenstroeer, Joerg / Haeb-Umbach, Reinhold:
"Joint speaker segmentation, localization and identification for streaming audio",
570-573.
Lu, Yan-Chen / Cooke, Martin / Christensen, Heidi:
"Active binaural distance estimation for dynamic sources",
574-577.
Borgström, Bengt J. / Alwan, Abeer:
"A packetization and variable bitrate interframe compression scheme for vector quantizer-based distributed speech recognition",
578-581.
Wölfel, Matthias:
"Channel selection by class separability measures for automatic transcriptions on distant microphones",
582-585.
Wyatt, Danny / Choudhury, Tanzeem / Bilmes, Jeff:
"Conversation detection and speaker segmentation in privacy-sensitive situated speech data",
586-589.
Abad, Alberto / Segura, Carlos / Nadeu, Climent / Hernando, Javier:
"Audio-based approaches to head orientation estimation in a smart-room",
590-593.
Ion, Valentin / Haeb-Umbach, Reinhold:
"Multi-resolution soft features for channel-robust distributed speech recognition",
594-597.
Language Modeling I, II
Su, Yi / Jelinek, Frederick / Khudanpur, Sanjeev:
"Large-scale random forest language models for speech recognition",
598-601.
Akita, Yuya / Nemoto, Yusuke / Kawahara, Tatsuya:
"PLSA-based topic detection in meetings for adaptation of lexicon and language model",
602-605.
Sako, Atsushi / Takiguchi, Tetsuya / Ariki, Yasuo:
"Language modeling using PLSA-based topic HMM",
606-609.
Pan, Yi-cheng / Lee, Lin-shan:
"Lexicon adaptation with reduced character error (LARCE) - a new direction in Chinese language modeling",
610-613.
Wu, Meng-Sung / Chien, Jen-Tzung:
"Minimum rank error training for language modeling",
614-617.
Wang, Wen / Stolcke, Andreas:
"Integrating MAP, marginals, and unsupervised language model adaptation",
618-621.
Yamazaki, Hiroki / Iwano, Koji / Shinoda, Koichi / Furui, Sadaoki / Yokota, Haruo:
"Dynamic language model adaptation using presentation slides for lecture speech recognition",
2349-2352.
Munteanu, Cosmin / Penn, Gerald / Baecker, Ron:
"Web-based language modelling for automatic lecture transcription",
2353-2356.
Alumäe, Tanel / Kirt, Toomas:
"LSA-based language model adaptation for highly inflected languages",
2357-2360.
Heidel, Aaron / Chang, Hung-an / Lee, Lin-shan:
"Language model adaptation using latent Dirichlet allocation and an efficient topic inference algorithm",
2361-2364.
Yaman, Sibel / Chien, Jen-Tzung / Lee, Chin-Hui:
"Structural Bayesian language modeling and adaptation",
2365-2368.
Martins, Ciro / Teixeira, António J. S. / Neto, João:
"Vocabulary selection for a broadcast news transcription system using a morpho-syntactic approach",
2369-2372.
Bach, Nguyen / Noamany, Mohamed / Lane, Ian / Schultz, Tanja:
"Handling OOV words in Arabic ASR via flexible morphological constraints",
2373-2376.
Justo, Raquel / Torres, M. Inés:
"Phrases in category-based language models for Spanish and Basque ASR",
2377-2380.
Arısoy, Ebru / Sak, Haşim / Saraçlar, Murat:
"Language modeling for automatic Turkish broadcast news transcription",
2381-2384.
Prosody Production and Perception
Calhoun, Sasha:
"Predicting focus through prominence structure",
622-625.
Bulut, Murtaza / Lee, Sungbok / Narayanan, Shrikanth S.:
"Analysis of emotional speech prosody in terms of part of speech tags",
626-629.
Liu, Fang / Xu, Yi:
"The neutral tone in question intonation in Mandarin",
630-633.
Rochet-Capellan, Amélie / Schwartz, Jean-Luc / Laboissière, Rafael / Galvàn, Arturo:
"Pointing to a target while naming it with /pata/ or /tapa/: the effect of consonants and stress position on jaw-finger coordination",
634-637.
Hide, Øydis / Gillis, Steven / Govaerts, Paul:
"Suprasegmental aspects of pre-lexical speech in cochlear implanted children",
638-641.
Niebuhr, Oliver:
"Categorical perception in intonation: a matter of signal dynamics?",
642-645.
Multimodal Speech Recognition
Aboutabit, Noureddine / Beautemps, Denis / Clarke, Jeanne / Besacier, Laurent:
"A HMM recognition of consonant-vowel syllables from lip contours: the cued speech case",
646-649.
Lucey, Patrick / Potamianos, Gerasimos / Sridharan, Sridha:
"A unified approach to multi-pose audio-visual ASR",
650-653.
Seymour, Rowan / Stewart, Darryl / Ming, Ji:
"Audio-visual integration for robust speech recognition using maximum weighted stream posteriors",
654-657.
Hueber, Thomas / Chollet, Gérard / Denby, Bruce / Dreyfus, Gérard / Stone, Maureen:
"Continuous-speech phone recognition from ultrasound and optical images of the tongue and lips",
658-661.
Zhu, Bo / Hazen, Timothy J. / Glass, James:
"Multimodal speech recognition with ultrasonic sensors",
662-665.
Dean, David / Lucey, Patrick / Sridharan, Sridha / Wark, Tim:
"Fused HMM-adaptation of multi-stream HMMs for audio-visual speech recognition",
666-669.
Speech and Other Modalities
Ishi, Carlos T. / Ishiguro, Hiroshi / Hagita, Norihiro:
"Analysis of head motions and speech in spoken dialogue",
670-673.
Larsen, Lars Bo / Jensen, Kasper L. / Larsen, Søren / Rasmussen, Morten:
"A paradigm for mobile speech-centric services",
674-677.
Campr, Pavel / Hrúz, Marek / Železný, Miloš:
"Design and recording of Czech sign language corpus for automatic sign language recognition",
678-681.
Edlund, Jens / Beskow, Jonas:
"Pushy versus meek - using avatars to influence turn-taking behaviour",
682-685.
Wand, Michael / Jou, Szu-Chen Stan / Schultz, Tanja:
"Wavelet-based front-end for electromyographic speech recognition",
686-689.
Ferré, Gaëlle / Bertrand, Roxane / Blache, Philippe / Espesser, Robert / Rauzy, Stéphane:
"Intensive gestures in French and their multimodal correlates",
690-693.
Ouni, Slim / Ouni, Kais:
"Aspects of visual speech in Arabic",
694-697.
Burnham, Denis / Reynolds, Jessica / Vignali, Guillaume / Bollwerk, Sandra / Jones, Caroline:
"Rigid vs non-rigid face and head motion in phone and tone perception",
698-701.
Multimodal/Multimedia Signal Processing
Kjellström, Hedvig / Engwall, Olov / Abdou, Sherif Mahdy / Bälter, Olle:
"Audio-visual phoneme classification for pronunciation training applications",
702-705.
Grauwinkel, Katja / Dewitt, Britta / Fagel, Sascha:
"Visual information and redundancy conveyed by internal articulator dynamics in synthetic audiovisual speech",
706-709.
Zhou, Wei / Wang, Zengfu:
"A speech rate related lip movement model for speech animation",
710-713.
Wu, Guanyong / Zhu, Jie:
"An extension 2DPCA based visual feature extraction method for audio-visual speech recognition",
714-717.
Lee, Soo-jong / Park, Jun / Kim, Eung-kyeu:
"Preventing an external acoustic noise from being misrecognized as a speech recognition object by confirming the lip movement image signal",
718-721.
Hofer, Gregor / Shimodaira, Hiroshi:
"Automatic head motion prediction from speech data",
722-725.
Denda, Yuki / Nishiura, Takanobu / Yamashita, Yoichi:
"Omnidirectional audio-visual talker localizer with dynamic feature fusion based on validity and reliability criteria",
726-729.
Campbell, Nick / Douxchamps, Damien:
"Processing image and audio information for recognising discourse participation status through features of face and voice",
730-733.
Speech Enhancement
Wójcicki, Kamil K. / So, Stephen / Paliwal, Kuldip K.:
"The effect of the additivity assumption on time and frequency domain Wiener filtering for speech enhancement",
798-801.
Li, Junfeng / Sakamoto, Shuichi / Hongo, Satoshi / Akagi, Masato / Suzuki, Yôiti:
"Noise reduction based on adaptive β-order generalized spectral subtraction for speech enhancement",
802-805.
Das, Amit / Hansen, John H. L.:
"Class constrained ROVER based speech enhancement",
806-809.
Deger, Erhan / Molla, Md. Khademul Islam / Hirose, Keikichi / Minematsu, Nobuaki / Hasan, Md. Kamrul:
"EMD based soft-thresholding for speech enhancement",
810-813.
Borowicz, Adam / Petrovsky, Alexander:
"An approximate solution for perceptually constrained signal subspace speech enhancement method",
814-817.
Fingscheidt, Tim / Suhadi, Suhadi:
"Quality assessment of speech enhancement systems by separation of enhanced speech, noise, and echo",
818-821.
Aicha, Anis Ben / Jebara, Sofia Ben:
"Perceptual musical noise reduction using critical bands tonality coefficients and masking thresholds",
822-825.
Mauler, Dirk / Nagathil, Anil M. / Martin, Rainer:
"On optimal estimation of compressed speech for hearing aids",
826-829.
Hendriks, Richard C. / Jensen, Jesper / Heusdens, Richard:
"DFT domain subspace based noise tracking for speech enhancement",
830-833.
Krishnamurthy, Nitish / Hansen, John H. L.:
"Noise tracking for speech systems in adverse environments",
834-837.
Essebbar, Abderrahman / Poinsard, Tristan:
"Speech enhancement using multi-reference noise reduction in a vehicle environment",
838-841.
Warsitz, Ernst / Haeb-Umbach, Reinhold / Vu, Dang Hai Tran:
"Blind adaptive principal eigenvector beamforming for acoustical source separation",
842-845.
Koldovský, Zbyněk / Tichavský, Petr:
"Time-domain blind audio source separation using advanced ICA methods",
846-849.
Lee, S. W. / Soong, Frank K. / Ching, P. C.:
"Model-based speech separation with single-microphone input",
850-853.
Kinoshita, Keisuke / Delcroix, Marc / Nakatani, Tomohiro / Miyoshi, Masato:
"Multi-step linear prediction based speech dereverberation in noisy reverberant environment",
854-857.
Lee, Seung Yeol / Shin, Jong Won / Yun, Hwan Sik / Kim, Nam Soo:
"A statistical model based post-filtering algorithm for residual echo suppression",
858-861.
Huang, Xiaoshan / Zhao, Xiaoqun:
"An optimal speech enhancement under speech uncertainty probability and masking property of auditory system",
862-865.
Structure-based and Template-based Automatic Speech Recognition
Maier, Viktoria / Moore, Roger K.:
"Temporal episodic memory model: an evolution of Minerva2",
866-869.
Coro, Gianpaolo / Cutugno, Francesco / Caropreso, Fulvio:
"Speech recognition with factorial-HMM syllabic acoustic models",
870-873.
Wachter, Mathias De / Demuynck, Kris / Wambacq, Patrick / Compernolle, Dirk Van:
"Evaluating acoustic distance measures for template based recognition",
874-877.
Han, Yan / Boves, Lou:
"Hierarchical acoustic modeling based on random-effects regression for automatic speech recognition",
878-881.
Hämäläinen, Annika / Bosch, Louis ten / Boves, Lou:
"Construction and analysis of multiple paths in syllable models",
882-885.
Espy-Wilson, Carol Y. / Pruthi, Tarun / Juneja, Amit / Deshmukh, Om:
"Landmark-based approach to speech recognition: an alternative to HMMs",
886-889.
Asakawa, Satoshi / Minematsu, Nobuaki / Hirose, Keikichi:
"Automatic recognition of connected vowels only using speaker-invariant representation of speech dynamics",
890-893.
Togneri, Roberto / Deng, Li:
"A structured speech model parameterized by recursive dynamics and neural networks",
894-897.
Deng, Li / Strik, Helmer:
"Structure-based and template-based automatic speech recognition - comparing parametric and non-parametric approaches",
898-901.
Grangier, David / Bengio, Samy:
"Learning the inter-frame distance for discriminative template-based keyword detection",
902-905.
Yu, Dong / Deng, Li / Acero, Alex:
"Handling phonetic context and speaker variation in a structure-based speech recognizer",
906-909.
Robust ASR Against Noise and Reverberation
Segbroeck, Maarten Van / hamme, Hugo Van:
"Vector-quantization based mask estimation for missing data automatic speech recognition",
910-913.
Demange, Sébastien / Cerisara, Christophe / Haton, Jean-Paul:
"Accurate marginalization range for missing data recognition",
914-917.
Kühne, Marco / Togneri, Roberto / Nordholm, Sven:
"Smooth soft mel-spectrographic masks based on blind sparse source separation",
918-921.
Laidler, Jonathan / Cooke, Martin / Lawrence, Neil D.:
"Model-driven detection of clean speech patches in noise",
922-925.
Stern, Richard M. / Gouvêa, Evandro B. / Thattai, Govindarajan:
""polyaural" array processing for automatic speech recognition in degraded environments",
926-929.
Morales, Nicolás / Gu, Liang / Gao, Yuqing:
"Adding noise to improve noise robustness in speech recognition",
930-933.
Language Resources and Tools
Fosler-Lussier, Eric / Dilley, Laura / Tyson, Na'im / Pitt, Mark:
"The Buckeye corpus of speech: updates and enhancements",
934-937.
Barroso, N. / Ezeiza, A. / Gilisagasti, N. / Ipiña, K. López de / López, A. / López, J. M.:
"Development of multimodal resources for multilingual information retrieval in the Basque context",
938-941.
Schwartz, Reva / Shen, Wade / Campbell, Joseph / Paget, Shelley / Vonwiller, Julie / Estival, Dominique / Cieri, Christopher:
"Construction of a phonotactic dialect corpus using semiautomatic annotation",
942-945.
Abdennadher, Slim / Aly, Mohamed / Bühler, Dirk / Minker, Wolfgang / Pittermann, Johannes:
"BECAM tool - a semi-automatic tool for bootstrapping emotion corpus annotation and management",
946-949.
Cieri, Christopher / Corson, Linda / Graff, David / Walker, Kevin:
"Resources for new research directions in speaker recognition: the Mixer 3, 4 and 5 corpora",
950-953.
Heeman, Peter A. / McMillin, Andy / Yaruss, J. Scott:
"Intercoder reliability in annotating complex disfluencies",
954-957.
Single-channel Speech Enhancement
Radfar, M. H. / Dansereau, R. M.:
"Single channel speech separation using maximum a posteriori estimation",
958-961.
Suhadi, Suhadi / Fingscheidt, Tim:
"Speech enhancement with improved a posteriori SNR computation",
962-965.
Tat, Thang Vu / Seide, Germine / Unoki, Masashi / Akagi, Masato:
"Method of LP-based blind restoration for improving intelligibility of bone-conducted speech",
966-969.
Falk, Tiago H. / Stadler, Svante / Kleijn, W. Bastiaan / Chan, Wai-Yip:
"Noise suppression based on extending a speech-dominated modulation band",
970-973.
Abolhassani, Amin Haji / Selouani, Sid-Ahmed / O'Shaughnessy, Douglas / Harkat, Mohamed-Faouzi:
"Speech enhancement using PCA and variance of the reconstruction error model identification",
974-977.
Shin, Jong Won / Lim, Woohyung / Sung, Junesig / Kim, Nam Soo:
"Speech reinforcement based on partial specific loudness",
978-981.
Phonetics and Phonology
Rathcke, Tamara / Harrington, Jonathan:
"The phonetics and phonology of high and low tones in two falling f0-contours in standard German",
982-985.
John, Tina / Harrington, Jonathan:
"Temporal alignment of creaky voice in neutralised realisations of an underlying, post-nasal voicing contrast in German",
986-989.
Demol, Mike / Verhelst, Werner / Verhoeve, Piet:
"The duration of speech pauses in a multilingual environment",
990-993.
Gibbon, Dafydd / Bachan, Jolanta / Demenko, Grażyna:
"Syllable timing patterns in Polish: results from annotation mining",
994-997.
Kalimeris, Constandinos / Bakamidis, Stelios:
"Minimal pairs and functional loads of sound contrasts obtained from a list of Modern Greek words",
998-1001.
Wissing, Daan:
"More on acoustic correlates of stress",
1002-1005.
Woehrling, Cécile / Mareüil, Philippe Boula de:
"Comparing Praat and Snack formant measurements on two large corpora of northern and southern French",
1006-1009.
Barry, William / Andreeva, Bistra / Steiner, Ingmar:
"The phonetic exponency of phrasal accentuation in French and German",
1010-1013.
Christodoulou, Christiana:
"Phonetic geminates in Cypriot Greek: the case of voiceless plosives",
1014-1017.
Williams, Darcie / Poiré, François:
"Predicting vowel duration in spontaneous Canadian French speech",
1018-1021.
Chow, Ivan / Poiré, François:
"Rhotic variation and schwa epenthesis in Windsor French",
1022-1025.
Bürki, Audrey / Fougeron, Cécile / Gendrot, Cédric:
"On the categorical nature of the process involved in schwa elision in French",
1026-1029.
Hu, Yue-Ning / Chu, Min / Huang, Chao / Zhang, Yan-Ning:
"Exploring tonal variations via context-dependent tone models",
1030-1033.
Martin, Philippe / Li, Jun:
"Acoustic analysis of the neutral tone in Mandarin",
1034-1037.
Ho, Rerrario Shui-Ching / Sagisaka, Yoshinori:
"F0 analysis of perceptual distance among Cantonese level tones",
1038-1041.
Features for ASR
Hsu, Chang-wen / Lee, Lin-shan:
"Extended powered cepstral normalization (p-CN) with range equalization for robust features in speech recognition",
1106-1109.
Sakai, Makoto / Kitaoka, Norihide / Nakagawa, Seiichi:
"Selection of optimal dimensionality reduction method using Chernoff bound for segmental unit input HMM",
1110-1113.
Tyagi, Vivek:
"Fepstrum: an improved modulation spectrum for ASR",
1114-1117.
Macho, Dušan:
"Narrowband to wideband feature expansion for robust multilingual ASR",
1118-1121.
Li, Weifeng / Bourlard, Hervé:
"Non-linear spectral contrast stretching for in-car speech recognition",
1122-1125.
Li, Xiao-Bing / O'Shaughnessy, Douglas:
"Clustering-based two-dimensional linear discriminant analysis for speech recognition",
1126-1129.
Kubo, Yotaro / Okawa, Shigeki / Kurematsu, Akira / Shirai, Katsuhiko:
"A study on temporal features derived by analytic signal",
1130-1133.
Zahorian, Stephen A. / Singh, Tara / Hu, Hongbing:
"Dimensionality reduction of speech features using nonlinear principal components analysis",
1134-1137.
Sanand, D. R. / Kumar, D. Dinesh / Umesh, S.:
"Linear transformation approach to VTLN using dynamic frequency warping",
1138-1141.
Alencar, Vladimir Fabregas Surigué de / Alcaim, Abraham:
"Features interpolation domain for distributed speech recognition and performance for ITU-T G.723.1 codec",
1142-1145.
Sato, Shoei / Onoe, Kazuo / Kobayashi, Akio / Homma, Shinich / Imai, Toru / Takagi, Tohru / Kobayashi, Tetsunori:
"Dynamic integration of multiple feature streams for robust real-time LVCSR",
1146-1149.
Matsumasa, Hironori / Takiguchi, Tetsuya / Ariki, Yasuo / Li, Ichao / Nakabayashi, Toshitaka:
"PCA-based feature extraction for fluctuation in speaking style of articulation disorders",
1150-1153.
Valente, Fabio / Vepa, Jithendra / Hermansky, Hynek:
"Multi-stream features combination based on Dempster-Shafer rule for LVCSR system",
1154-1157.
Singh-Miller, Natasha / Collins, Michael / Hazen, Timothy J.:
"Dimensionality reduction for speech recognition using neighborhood components analysis",
1158-1161.
Su, Dan / Wu, Xihong / Chi, Huisheng:
"Probabilistic latent speaker analysis for large vocabulary speech recognition",
1162-1165.
Prasanna, S. R. Mahadeva / Hermansky, Hynek:
"MRASTA and PLP in automatic speech recognition",
1166-1169.
Objective Assessment of Voice and Speech Quality
Brückl, Markus:
"Women's vocal aging: a longitudinal approach",
1170-1173.
Cnockaert, Laurence / Schoentgen, Jean / Ozsancak, Canan / Auzou, Pascal / Grenez, Francis:
"Effect of intensive voice therapy on vocal tremor for Parkinson speakers",
1174-1177.
Alpan, A. / Kacha, A. / Grenez, Francis / Schoentgen, Jean:
"Assessment of vocal dysperiodicities in connected disordered speech",
1178-1181.
Laukkanen, Anne-Maria / Horáček, Jaromír / Švancara, Pavel / Lehtinen, Elina:
"Effects of FE modelled consequences of tonsillectomy on perceptual evaluation of voice",
1182-1185.
Verdonck-de Leeuw, Irma M. / Bosch, Louis ten / Chao, Li Ying / Rinkel, Rico N. P. M. / Borggreven, Pepijn A. / Boves, Lou / Leemans, C. René:
"Speech quality after major surgery of the oral cavity and oropharynx with microvascular soft tissue reconstruction",
1186-1189.
Bruijn, Christel de / Whiteside, Sandra:
"Voice fatigue and use of speech recognition: a study of voice quality ratings",
1190-1193.
Bonastre, Jean-François / Fredouille, Corinne / Ghio, A. / Giovanni, A. / Pouchoulin, G. / Révis, J. / Teston, B. / Yu, P.:
"Complementary approaches for voice disorder assessment",
1194-1197.
Pouchoulin, G. / Fredouille, Corinne / Bonastre, Jean-François / Ghio, A. / Giovanni, A.:
"Frequency study for the characterization of the dysphonic voices",
1198-1201.
Boucher, Victor J.:
"Acoustic correlates of laryngeal-muscle fatigue: findings for a phonometric prevention of acquired voice pathologies",
1202-1205.
Maier, Andreas / Schuster, Maria / Batliner, Anton / Nöth, Elmar / Nkenke, Emeka:
"Automatic scoring of the intelligibility in patients with cancer of the oral cavity",
1206-1209.
Duchateau, Jacques / Cleuren, Leen / hamme, Hugo Van / Ghesquière, Pol:
"Automatic assessment of children's reading level",
1210-1213.
Ferrer, Carlos / Hernández-Díaz, María E. / González, Eduardo:
"Using waveform matching techniques in the measurement of shimmer in voiced signals",
1214-1217.
Fraile, R. / Godino-Llorente, J. I. / Sáenz-Lechón, N. / Osma-Ruiz, V. / Gómez-Vilda, P.:
"Analysis of the impact of analogue telephone channel on MFCC parameters for voice pathology detection",
1218-1221.
Manfredi, C. / Bocchi, L. / Cantarella, G. / Peretti, G. / Guidi, G. / Mezzatesta, V.:
"Objective parameters from videokymographic images: a user-friendly interface",
1222-1225.
Discourse, Dialog and Emotion Expression
House, David:
"Integrating audio and visual cues for speaker friendliness in multimodal speech synthesis",
1250-1253.
Wesseling, Wieneke / Son, R. J. J. H. van / Pols, Louis C. W.:
"The influence of masking words on the prediction of TRPs in a shadowed dialog",
1254-1257.
Laskowski, Kornel / Burger, Susanne:
"Analysis of the occurrence of laughter in meetings",
1258-1261.
Barkhuysen, Pashiera / Krahmer, Emiel / Swerts, Marc:
"Incremental perception of acted and real emotional speech",
1262-1265.
Schlangen, David / Fernández, Raquel:
"Speaking through a noisy channel - experiments on inducing clarification behaviour in human-human dialogue",
1266-1269.
D'Alessandro, Christophe / Rilliard, Albert / Beux, Sylvain Le:
"Computerized chironomy: evaluation of hand-controlled intonation reiteration",
1270-1273.
Resource Acquisition and Preparation; Resource and System Evaluation
Habernal, Ivan / Konopík, Miloslav:
"JAAE: the Java abstract annotation editor",
1298-1301.
Nagino, Goshu / Shozakai, Makoto / Shikano, Kiyohiro:
"How to judge reusability of existing speech corpora for target task by utilizing statistical multidimensional scaling",
1302-1305.
Rutten, Peter:
"Feasibility of constructing an expressive speech corpus from television soap opera dialogue",
1306-1309.
Orr, Rosemary / Llinares, Bernat González i / Petersen, Françoise / Hüttenrauch, Helge / Böcker, Martin / Tate, Michael:
"Collection of empirical data for standardization of generic vocabularies in speech driven ICT devices and services",
1310-1313.
Selmini, Antonio Marcos / Violaro, Fábio:
"Acoustic-phonetic features for refining the explicit speech segmentation",
1314-1317.
Lecouteux, B. / Linarès, Georges / Beaugendre, Frédéric / Nocera, Pascal:
"Text island spotting in large speech databases",
1318-1321.
Paek, Tim / Ju, Yun-Cheng / Meek, Christopher:
"People watcher: a game for eliciting human-transcribed data for automated directory assistance",
1322-1325.
Kun, Andrew / Paek, Tim / Medenica, Zeljko:
"The effect of speech interface accuracy on driving performance",
1326-1329.
Zhang, Hua / Wang, Lijuan / Soong, Frank K. / Liu, Wenju:
"Context constrained-generalized posterior probability for verifying phone transcriptions",
1330-1333.
Angkititrakul, Pongtep / Kwak, DongGu / Choi, SangJo / Kim, JeongHee / PhucPhan, Anh / Sathyanarayana, Amardeep / Hansen, John H. L.:
"Getting start with UTDrive: driver-behavior modeling and assessment of distraction for in-vehicle speech systems",
1334-1337.
Kolluru, BalaKrishna / Gotoh, Yoshihiko:
"Relative evaluation of informativeness in machine generated summaries",
1338-1341.
Takezawa, Toshiyuki / Mizushima, Masahide / Shimizu, Tohru / Kikui, Genichiro:
"A method for evaluating task-oriented spoken dialog translation systems based on communication efficiency",
1342-1345.
Hooijdonk, Charlotte van / Commandeur, Edwin / Cozijn, Reinier / Krahmer, Emiel / Marsi, Erwin:
"Using eye movements for online evaluation of speech synthesis",
1346-1349.
Li, Jian / Sityaev, Dmitry / Hao, Jie:
"Sentence level intelligibility evaluation for Mandarin text-to-speech systems using semantically unpredictable sentences",
1350-1353.
Kessens, Judith / Leeuwen, David A. van:
"N-best: the Northern- and Southern-Dutch benchmark evaluation of speech recognition technology",
1354-1357.
Holter, Trym / Sørsdal, Svein:
"A MAP based approach to adaptive speech intelligibility measurements",
1358-1361.
Boonsuk, Sirinoot / Punyabukkana, Proadpran / Suchato, Atiwong:
"Phone boundary detection using selective refinements and context-dependent acoustic features",
1362-1365.
ASR: New Paradigms
Tan, Tien-Ping / Besacier, Laurent:
"Modeling context and language variation for non-native speech recognition",
1429-1432.
Zhao, Xufang / O'Shaughnessy, Douglas:
"An evaluation of cross-language adaptation and native speech training for rapid HMM construction based on very limited training data",
1433-1436.
Markov, Konstantin / Nakamura, Satoshi:
"Never-ending learning with dynamic hidden Markov network",
1437-1440.
Breslin, C. / Gales, M. J. F.:
"Building multiple complementary systems using directed decision trees",
1441-1444.
Nanjo, Hiroaki / Oku, Yuichi / Yoshimi, Takehiko:
"Automatic speech recognition framework for multilingual audio contents",
1445-1448.
Bouselmi, G. / Fohr, Dominique / Illina, I.:
"Combined acoustic and pronunciation modelling for non-native speech recognition",
1449-1452.
Emori, Tadashi / Onishi, Yoshifumi / Shinoda, Koichi:
"Automatic estimation of scaling factors among probabilistic models in speech recognition",
1453-1456.
Stoimenov, Emilian / McDonough, John:
"Memory efficient modeling of polyphone context with weighted finite-state transducers",
1457-1460.
Pylypenko, Valeriy:
"Extra large vocabulary continuous speech recognition algorithm based on information retrieval",
1461-1464.
Hetherington, I. Lee:
"PocketSUMMIT: small-footprint continuous speech recognition",
1465-1468.
Cincarek, Tobias / Shindo, Izumi / Toda, Tomoki / Saruwatari, Hiroshi / Shikano, Kiyohiro:
"Development of preschool children subsystem for ASR and Q&A in a real-environment speech-oriented guidance task",
1469-1472.
Ma, Chengyuan / Lee, Chin-Hui:
"A study on word detector design and knowledge-based pruning and rescoring",
1473-1476.
Colthurst, Thomas / Arvizo, Tresi / Kao, Chia-Lin / Kimball, Owen / Lowe, Stephen A. / Miller, David R. H. / Sciver, Jim Van:
"Parameter tuning for fast speech recognition",
1477-1480.
Bosch, Louis ten / Cranen, Bert:
"A computational model for unsupervised word discovery",
1481-1484.
Meyer, Bernd T. / Wächter, Matthias / Brand, Thomas / Kollmeier, Birger:
"Phoneme confusions in human and automatic speech recognition",
1485-1488.
Ohta, Kengo / Tsuchiya, Masatoshi / Nakagawa, Seiichi:
"Construction of spoken language model including fillers using filler prediction model",
1489-1492.
Kumaran, Raghunandan / Bilmes, Jeff / Kirchhoff, Katrin:
"Attention shift decoding for conversational speech recognition",
1493-1496.
Speech and Language Technology for Less-resourced Languages
Mihajlik, Péter / Fegyó, Tibor / Tüske, Zoltán / Ircing, Pavel:
"A morpho-graphemic approach for the recognition of spontaneous speech in agglutinative languages - like Hungarian",
1497-1500.
Yang, Mei / Zheng, Jing / Kathol, Andreas:
"A semi-supervised learning approach for morpheme segmentation for an Arabic dialect",
1501-1504.
Huyssteen, Gerhard B. van / Puttkammer, Martin J.:
"Accelerating the annotation of lexical data for less-resourced languages",
1505-1508.
Draxler, Christoph:
"On web-based creation of speech resources for less-resourced languages",
1509-1512.
Martinović, Miroslav / Vesić, Srdjan / Rakić, Goran:
"Building an information retrieval system for Serbian - challenges and solutions",
1513-1516.
Pauw, Guy De / Wagacha, Peter Waiganjo:
"Bootstrapping morphological analysis of Gĩkũyũ using unsupervised maximum entropy learning",
1517-1520.
Gros, Jerneja Žganec / Gruden, Stanislav:
"The voiceTRAN machine translation system",
1521-1524.
Paulo, Sérgio / Oliveira, Luís C.:
"MuLAS: a framework for automatically building multi-tier corpora",
1525-1528.
Ringersma, Jacquelijn / Kemps-Snijders, Marc:
"Creating multimedia dictionaries of endangered languages using LEXUS",
1529-1532.
Loftsson, Hrafn / Rögnvaldsson, Eiríkur:
"IceNLP: a natural language processing toolkit for Icelandic",
1533-1536.
Peche, Marius / Davel, Marelie / Barnard, Etienne:
"Phonotactic spoken language identification with limited training data",
1537-1540.
Abate, Solomon Teferra / Menzel, Wolfgang:
"Automatic speech recognition for an under-resourced language - Amharic",
1541-1544.
Nimaan, Abdillahi / Nocera, Pascal / Béchet, Frédéric / Bonastre, Jean-François:
"Information retrieval strategies for accessing African audio corpora",
1545-1548.
Siivola, Vesa / Creutz, Mathias / Kurimo, Mikko:
"Morfessor and variKN machine learning tools for speech and language technology",
1549-1552.
Jongtaveesataporn, Markpong / Thienlikit, Issara / Wutiwiwatchai, Chai / Furui, Sadaoki:
"Towards better language modeling for Thai LVCSR",
1553-1556.
Spoken Language Understanding
Raymond, Christian / Riccardi, Giuseppe:
"Generative and discriminative algorithms for spoken language understanding",
1605-1608.
Iosif, Elias / Potamianos, Alexandros:
"A soft-clustering algorithm for automatic induction of semantic classes",
1609-1612.
Gravano, Agustín / Benus, Stefan / Hirschberg, Julia / Mitchell, Shira / Vovsha, Ilia:
"Classification of discourse functions of affirmative words in spoken dialogue",
1613-1616.
Minescu, Bogdan / Damnati, Géraldine / Béchet, Frédéric / Mori, Renato De:
"Conditional use of word lattices, confusion networks and 1-best string hypotheses in a sequential interpretation strategy",
1617-1620.
Kolář, Jáchym / Liu, Yang / Shriberg, Elizabeth:
"Speaker adaptation of language models for automatic dialog act segmentation of meetings",
1621-1624.
Albalate, Amparo / Dimitrov, Dimitar / Pieraccini, Roberto:
"Unsupervised categorisation approaches for technical support automated agents",
1625-1628.
Pitch Extraction I, II
Wohlmayr, Michael / Képesi, Marián:
"Joint position-pitch extraction from multichannel audio",
1629-1632.
Kim, Hyun Soo:
"Morphological pre-processing technique and its applications on speech signal",
1633-1636.
Pelle, Patricia A. / Estienne, Claudio F.:
"A pitch extraction system based on phase locked loops and consensus decision",
1637-1640.
Legát, Milan / Matoušek, Jindřich / Tihelka, Daniel:
"A robust multi-phase pitch-mark detection algorithm",
1641-1644.
Molla, Md. Khademul Islam / Hirose, Keikichi / Minematsu, Nobuaki / Hasan, Md. Kamrul:
"Pitch estimation of noisy speech signals using empirical mode decomposition",
1645-1648.
Hirst, Daniel / Cho, Hyongsil / Kim, Sunhee / Yu, Hyunji:
"Evaluating two versions of the momel pitch modelling algorithm on a corpus of read speech in Korean",
1649-1652.
Hussein, Hussein / Jokisch, Oliver:
"Hybrid electroglottograph and speech signal based algorithm for pitch marking",
1653-1656.
Droppo, Jasha / Acero, Alex:
"A fine pitch model for speech",
2757-2760.
Ghosh, Prasanta Kumar / Ortega, Antonio / Narayanan, Shrikanth S.:
"Pitch period estimation using multipulse model and wavelet transform",
2761-2764.
Heckmann, Martin / Joublin, Frank / Goerick, Christian:
"Combining rate and place information for robust pitch extraction",
2765-2768.
Christensen, Heidi / Ma, Ning / Wrigley, Stuart N. / Barker, Jon:
"Integrating pitch and localisation cues at a speech fragment level",
2769-2772.
Liénard, Jean-Sylvain / Signol, François / Barras, Claude:
"Speech fundamental frequency estimation using the alternate comb",
2773-2776.
Rosenberg, Andrew / Hirschberg, Julia:
"Detecting pitch accent using pitch-corrected energy-based predictors",
2777-2780.
Speech Coding and Transmission
Chatterjee, Saikat / Sreenivas, T. V.:
"Normalized two stage SVQ for minimum complexity wide-band LSF quantization",
1657-1660.
Zhang, Peng / Bao, Chang-chun:
"A novel 2kb/s waveform interpolation speech coder based on non-negative matrix factorization",
1661-1664.
Ismail, Ahmed / Dakroury, Yasser / Abbas, Hazem:
"A novel energy distribution comparison approach for robust speech spectrum vector quantization",
1665-1668.
Ismail, Ahmed / Dakroury, Yasser / Abbas, Hazem:
"Novel low-band phase representation for low bit-rate speech coding",
1669-1672.
Wu, Chun-Feng / Lee, Cheng-Lung / Chang, Wen-Whei:
"Perceptual-based playout mechanisms for multi-stream voice over IP networks",
1673-1676.
Zopf, Robert / Thyssen, Jes / Chen, Juin-Hwey:
"Time-warping and re-phasing in packet loss concealment",
1677-1680.
Agiomyrgiannakis, Yannis / Stylianou, Yannis:
"The harmonic model codec (HMC) framework for VoIP",
1681-1684.
Agiomyrgiannakis, Yannis / Stylianou, Yannis:
"Bit-erasure channel decoding for GMM-based multiple description coding",
1685-1688.
Yuan, Hua / Falk, Tiago H. / Chan, Wai-Yip:
"Degradation-classification assisted single-ended quality measurement of speech",
1689-1692.
Raake, Alexander / Spors, Sascha / Ahrens, Jens / Ajmera, Jitendra:
"Concept and evaluation of a downward-compatible system for spatial teleconferencing using automatic speaker clustering",
1693-1696.
Lee, Min-Ki / Kim, Kyung-Tae / Kang, Hong-Goo / Youn, Dae Hee:
"Speech quality estimation using packet loss effects in CELP-type speech coders",
1697-1700.
Oshikiri, Masahiro / Ehara, Hiroyuki / Morii, Toshiyuki / Yamanashi, Tomofumi / Satoh, Kaoru / Yoshida, Koji:
"An 8-32 kbit/s scalable wideband coder extended with MDCT-based bandwidth extension on top of a 6.8 kbit/s narrowband CELP coder",
1701-1704.
Topics in Acoustic Modeling
Wielgat, Robert / Zieliński, Tomasz P. / Świętojański, Paweł / Żołądź, Piotr / Król, Daniel / Woźniak, Tomasz / Grabias, Stanisław:
"Comparison of HMM and DTW methods in automatic recognition of pathological phoneme pronunciation",
1705-1708.
Yu, K. / Gales, M. J. F. / Woodland, P. C.:
"Unsupervised training with directed manual transcription for recognising Mandarin broadcast audio",
1709-1712.
Wu, Hao / Wu, Xihong:
"Context dependent syllable acoustic model for continuous Chinese speech recognition",
1713-1716.
Oikonomidis, Dimitris / Diakoloukas, Vassilis / Digalakis, Vassilis:
"A sub-optimal Viterbi-like search for linear dynamic models classification",
1717-1720.
Heigold, Georg / Schlüter, Ralf / Ney, Hermann:
"On the equivalence of Gaussian HMM and Gaussian HMM-like hidden conditional random fields",
1721-1724.
Scanzio, Stefano / Laface, Pietro / Gemello, Roberto / Mana, Franco:
"Speeding-up neural network training using sentence and frame selection",
1725-1728.
Liu, Linquan / Zheng, Thomas Fang / Akabane, Makoto / Chen, Ruxin / Wu, Wenhu:
"Using a small development set to build a robust dialectal Chinese speech recognizer",
1729-1732.
Confidence Measures (and Related Topics)
Molina, Carlos / Yoma, Nestor Becerra / Huenupán, Fernando / Garreton, Claudio:
"Unsupervised re-scoring of observation probability in Viterbi based on reinforcement learning by using confidence measure and HMM neighborhood",
1733-1736.
Lin, Shiuan-Sung / Yvon, François:
"Optimization on decoding graphs by discriminative training",
1737-1740.
Huet, Stéphane / Gravier, Guillaume / Sébillot, Pascale:
"Morphosyntactic processing of n-best lists for improved recognition and confidence measure computation",
1741-1744.
Li, Xiang / Huerta, Juan M.:
"How predictable is ASR confidence in dialog applications?",
1745-1748.
Allauzen, Alexandre:
"Error detection in confusion network",
1749-1752.
Oba, Takanobu / Hori, Takaaki / Nakamura, Atsushi:
"An approach to efficient generation of high-accuracy and compact error-corrective models for speech recognition",
1753-1756.
Ketabdar, Hamed / Hannemann, Mirko / Hermansky, Hynek:
"Detection of out-of-vocabulary words in posterior based ASR",
1757-1760.
Grapheme-to-Phoneme Conversion
Braga, Daniela / Coelho, Luís / Resende Jr., Fernando Gil V.:
"Homograph ambiguity resolution in front-end design for Portuguese TTS systems",
1761-1764.
Choueiter, Ghinwa F. / Seneff, Stephanie / Glass, James:
"New word acquisition using subword modeling",
1765-1768.
Thomas, Samuel / Verma, Ashish:
"Language identification of person names using CF-IOF based weighing function",
1769-1772.
Heuvel, Henk van den / Martens, Jean-Pierre / Konings, Nanneke:
"G2P conversion of names: what can we do (better)?",
1773-1776.
Thangthai, Ausdang / Wutiwiwatchai, Chai / Ragchatjaroen, Anocha / Saychum, Sittipong:
"A learning method for Thai phonetization of English words",
1777-1780.
Werner, Steffen / Hoffmann, Rüdiger:
"Spontaneous speech synthesis by pronunciation variant selection - a comparison to natural speech",
1781-1784.
Tsourakis, Nikos / Digalakis, Vassilis:
"A generic methodology of converting transliterated text to phonetic strings - case study: Greeklish",
1785-1788.
Singh, Rita / Gouvêa, Evandro B. / Raj, Bhiksha:
"Probabilistic deduction of symbol mappings for extension of lexicons",
1789-1792.
Lexical and Prosodic Modeling
Astrov, Sergey / Hofer, Joachim / Höge, Harald:
"Use of syllable center detection for improved duration modeling in Chinese Mandarin connected digits recognition",
1793-1796.
Pellegrini, Thomas / Lamel, Lori:
"Using phonetic features in unsupervised word decompounding for ASR with application to a less-represented language",
1797-1800.
Qiang, Sheng / Qian, Yao / Soong, Frank K. / Xu, Congfu:
"Robust F0 modeling for Mandarin speech recognition in noise",
1801-1804.
Seppi, Dino / Falavigna, Daniele / Stemmer, Georg / Gretter, Roberto:
"Word duration modeling for word graph rescoring in LVCSR",
1805-1808.
Tamburini, Fabio / Wagner, Petra:
"On automatic prominence detection for German",
1809-1812.
Ananthakrishnan, Sankaranarayanan / Narayanan, Shrikanth S.:
"Prosody-enriched lattices for improved syllable recognition",
1813-1816.
Pinto, Joel / Lovitt, Andrew / Hermansky, Hynek:
"Exploiting phoneme similarities in hybrid HMM-ANN keyword spotting",
1817-1820.
Liu, C. E. / Thambiratnam, K. / Seide, F.:
"Online vocabulary adaptation using limited adaptation data",
1821-1824.
Speech Recognition by Automatic Attribute Transcription
Lee, Chin-Hui / Clements, Mark A. / Dusan, Sorin / Fosler-Lussier, Eric / Johnson, Keith / Juang, Biing-Hwang / Rabiner, Lawrence R.:
"An overview on automatic speech attribute transcription (ASAT)",
1825-1828.
Bromberg, Ilana / Qian, Qian / Hou, Jun / Li, Jinyu / Ma, Chengyuan / Matthews, Brett / Moreno-Daniel, Antonio / Morris, Jeremy / Siniscalchi, Sabato Marco / Tsao, Yu / Wang, Yu:
"Detection-based ASR in the automatic speech attribute transcription project",
1829-1832.
Lin, Chi-Yueh / Wang, Hsiao-Chuan:
"Attribute-based Mandarin speech recognition using conditional random fields",
1833-1836.
Strik, Helmer / Truong, Khiet P. / Wet, Febe de / Cucchiarini, Catia:
"Comparing classifiers for pronunciation error detection",
1837-1840.
Krajewski, Jarek / Kröger, Bernd:
"Using prosodic and spectral characteristics for sleepiness detection",
1841-1844.
Ore, Brian M. / Slyh, Raymond E.:
"Score fusion for articulatory feature detection",
1845-1848.
Speaker Diarization
Otterson, Scott:
"Improved location features for meeting speaker diarization",
1849-1852.
Han, Kyu J. / Narayanan, Shrikanth S.:
"A robust stopping criterion for agglomerative hierarchical clustering in a speaker diarization system",
1853-1856.
Huijbregts, Marijn / Wooters, Chuck:
"The blame game: performance analysis of speaker diarization system components",
1857-1860.
Aronowitz, Hagai:
"Trainable speaker diarization",
1861-1864.
Huang, Jing / Marcheret, Etienne / Visweswariah, Karthik:
"Improving speaker diarization for CHIL lecture meetings",
1865-1868.
Le, Viet-Bac / Mella, Odile / Fohr, Dominique:
"Speaker diarization using normalized cross likelihood ratio",
1869-1872.
First and Second Language Learning
Lee, Wai-Sum:
"Tone production by the speakers of different age-and-gender groups",
1873-1876.
Xu, Nan / Burnham, Denis / Kitamura, Christine:
"Vowels and tones in infant directed speech: hyperarticulation for both, but different developmental patterns",
1877-1880.
Ko, Eon-Suk:
"Acquisition of vowel duration in children speaking American English",
1881-1884.
Hirano, Hiroko / Hirose, Keikichi / Kawai, Goh / Gu, Wentao / Minematsu, Nobuaki:
"F0 models show Chinese speakers of Japanese insert intonational boundaries and drop pitch",
1885-1888.
Escudero, Paola / Kastelein, Jelle / Weiand, Klara / Son, R. J. J. H. van:
"Formal modelling of L1 and L2 perceptual learning: computational linguistics versus machine learning",
1889-1892.
Broersma, Mirjam:
"Kettle hinders cat, shadow does not hinder shed: activation of ‘almost embedded’ words in nonnative listening",
1893-1896.
Speech Synthesis I, II
Krstulović, Sacha / Hunecke, Anna / Schröder, Marc:
"An HMM-based speech synthesis system applied to German and its adaptation to a limited set of expressive football announcements",
1897-1900.
Gu, Liang / Zhang, Wei / Tahir, Lazkin / Gao, Yuqing:
"Statistical vowelization of Arabic text for speech synthesis in speech-to-speech translation systems",
1901-1904.
Liu, Wu / Huang, Dezhi / Dong, Yuan / Mao, Xinnian / Wang, Haila:
"A pair-based language model for the robust lexical analysis in Chinese text-to-speech synthesis",
1905-1908.
Maia, R. / Toda, Tomoki / Zen, Heiga / Nankaku, Yoshihiko / Tokuda, Keiichi:
"A trainable excitation model for HMM-based speech synthesis",
1909-1912.
Steigner, Jochen / Schröder, Marc:
"Cross-language phonemisation in German text-to-speech synthesis",
1913-1916.
Tachibana, Ryuki / Nagano, Tohru / Kurata, Gakuto / Nishimura, Masafumi / Babaguchi, Noboru:
"Preliminary experiments toward automatic generation of new TTS voices from recorded speech alone",
1917-1920.
Chomphan, Suphattharachai / Kobayashi, Takao:
"Implementation and evaluation of an HMM-based Thai speech synthesis system",
2849-2852.
Bonardo, Davide / Zovato, Enrico:
"Speech synthesis enhancement in noisy environments",
2853-2856.
Schmid, Helmut / Möbius, Bernd / Weidenkaff, Julia:
"Tagging syllable boundaries with joint n-gram models",
2857-2860.
Xu, Jun / Huang, Dezhi / Wang, Yongxin / Dong, Yuan / Cai, Lianhong / Wang, Haila:
"Hierarchical non-uniform unit selection based on prosodic structure",
2861-2864.
Birkholz, Peter:
"Control of an articulatory speech synthesizer based on dynamic approximation of spatial articulatory targets",
2865-2868.
Nishizawa, Nobuyuki / Kawai, Hisashi:
"A preselection method based on cost degradation from the optimal sequence for concatenative speech synthesis",
2869-2872.
Strecha, Guntram / Eichner, Matthias / Hoffmann, Rüdiger:
"Line cepstral quefrencies and their use for acoustic inventory coding",
2873-2876.
Cahill, Peter / Aioanei, Daniel / Carson-Berndsen, Julie:
"Articulatory acoustic feature applications in speech synthesis",
2877-2880.
Krul, Aleksandra / Damnati, Géraldine / Yvon, François / Boidin, Cédric / Moudenc, Thierry:
"Approaches for adaptive database reduction for text-to-speech synthesis",
2881-2884.
Tsai, Richard Tzong-Han / Hung, Hsi-Chuan / Dai, Hong-Jie / Hsu, Wen-Lian:
"Exploiting unlabeled internal data in conditional random fields to reduce word segmentation errors for Chinese texts",
2885-2888.
Kirkpatrick, Barry / O'Brien, Darragh / Scaife, Ronán / Errity, Andrew:
"On the role of spectral dynamics in unit selection speech synthesis",
2889-2892.
Langner, Brian / Black, Alan W.:
"ugloss: a framework for improving spoken language generation understandability",
2893-2896.
Schnell, Karl / Lacroix, Arild:
"Combination of LSF and pole based parameter interpolation for model-based diphone concatenation",
2897-2900.
Prahallad, Kishore / Toth, Arthur R. / Black, Alan W.:
"Automatic building of synthetic voices from large multi-paragraph speech databases",
2901-2904.
Gallardo-Antolín, A. / Barra, R. / Schröder, Marc / Krstulović, Sacha / Montero, J. M.:
"Automatic phonetic segmentation of Spanish emotional speech",
2905-2908.
Lin, Dacheng / Zhao, Yong / Soong, Frank K. / Chu, Min / Zhao, Jieyu:
"Iterative unit selection with unnatural prosody detection",
2909-2912.
Voice Conversion and Modification
Hanzlíček, Zdeněk / Matoušek, Jindřich:
"F0 transformation within the voice conversion framework",
1961-1964.
Erro, Daniel / Moreno, Asunción:
"Weighted frequency warping for voice conversion",
1965-1968.
Erro, Daniel / Moreno, Asunción:
"Frame alignment method for cross-lingual voice conversion",
1969-1972.
Nurminen, Jani / Tian, Jilei / Popa, Victor:
"Voicing level control with application in voice conversion",
1973-1976.
Percybrooks, Winston S. / Moore, Elliot:
"New algorithm for LPC residual estimation from LSF vectors for a voice conversion system",
1977-1980.
Ohtani, Yamato / Toda, Tomoki / Saruwatari, Hiroshi / Shikano, Kiyohiro:
"Speaker adaptive training for one-to-many eigenvoice conversion based on Gaussian mixture model",
1981-1984.
Petkov, Petko N. / Kleijn, W. Bastiaan:
"Improving the phase vocoder approach to pitch-shifting",
1985-1988.
Mesbahi, Larbi / Barreaud, Vincent / Boeffard, Olivier:
"Comparing GMM-based speech transformation systems",
1989-1992.
Improved Acoustic Modeling for ASR
Kuo, Jen-Wei / Lo, Hung-Yi / Wang, Hsin-Min:
"Improved HMM/SVM methods for automatic phoneme segmentation",
2057-2060.
Shinozaki, Takahiro / Kawahara, Tatsuya:
"Gaussian mixture optimization for HMM based on efficient cross-validation",
2061-2064.
Zen, Heiga / Nankaku, Yoshihiko / Tokuda, Keiichi:
"Model-space MLLR for trajectory HMMs",
2065-2068.
Ketabdar, Hamed / Bourlard, Hervé:
"In-context phone posteriors as complementary features for tandem ASR",
2069-2072.
Qian, Qian / He, Xiaodong / Deng, Li:
"Phone-discriminating minimum classification error (p-MCE) training for phonetic recognition",
2073-2076.
Lamel, Lori / Messaoudi, Abdel. / Gauvain, Jean-Luc:
"Improved acoustic modeling for transcribing Arabic broadcast data",
2077-2080.
McDermott, Erik / Nakamura, Atsushi:
"String and lattice based discriminative training for the corpus of spontaneous Japanese lecture transcription task",
2081-2084.
Kang, Byung-Ok / Jung, Ho-Young / Lee, Yun-Keun:
"Discriminative noise adaptive training approach for an environment migration",
2085-2088.
Chen, Jia-Yu / Olsen, Peder A. / Hershey, John R.:
"Word confusability - measuring hidden Markov model similarity",
2089-2092.
Deselaers, Thomas / Heigold, Georg / Ney, Hermann:
"Speech recognition with state-based nearest neighbour classifiers",
2093-2096.
Teunen, Remco / Akamine, Masami:
"HMM-based speech recognition using decision trees instead of GMMs",
2097-2100.
Gollan, Christian / Hahn, Stefan / Schlüter, Ralf / Ney, Hermann:
"An improved method for unsupervised training of LVCSR systems",
2101-2104.
Omar, Mohamed Kamal:
"A variational approach to robust maximum likelihood estimation for speech recognition",
2105-2108.
Yu, Kai / Rutenbar, Rob A.:
"Generating small, accurate acoustic models with a modified Bayesian information criterion",
2109-2112.
Bell, Peter / King, Simon:
"Sparse Gaussian graphical models for speech recognition",
2113-2116.
Sakti, Sakriani / Markov, Konstantin / Nakamura, Satoshi:
"An HMM acoustic model incorporating various additional knowledge sources",
2117-2120.
Varjokallio, Matti / Kurimo, Mikko:
"Comparison of subspace methods for Gaussian mixture models in speech recognition",
2121-2124.
Multilingualism in Speech and Language Processing
Schultz, Tanja / Black, Alan W. / Badaskar, Sameer / Hornyak, Matthew / Kominek, John:
"SPICE: web-based tools for rapid language adaptation in speech processing systems",
2125-2128.
Deprez, Filip / Odijk, Jan / Moortel, Jan De:
"Introduction to multilingual corpus-based concatenative speech synthesis",
2129-2132.
Stouten, Frederik / Martens, Jean-Pierre:
"Recognition of foreign names spoken by native speakers",
2133-2136.
Cordoba, R. / D'Haro, L. F. / Fernandez-Martinez, F. / Montero, J. M. / Barra, R.:
"Language identification using several sources of information with a multiple-Gaussian classifier",
2137-2140.
Solar, Carmen Del / Pérez, Guillermo / Florencio, Eva / Moral, David / Amores, Gabriel / Manchón, Pilar:
"Dynamic language change in MIMUS",
2141-2144.
Systems for LVCSR and Rich Transcription I, II
Lööf, Jonas / Gollan, Christian / Hahn, Stefan / Heigold, Georg / Hoffmeister, B. / Plahl, Christian / Rybach, David / Schlüter, Ralf / Ney, Hermann:
"The RWTH 2007 TC-STAR evaluation system for European English and Spanish",
2145-2148.
Koh, Eugene Chin Wei / Sun, Hanwu / Nwe, Tin Lay / Nguyen, Trung Hieu / Ma, Bin / Chng, Eng Siong / Li, Haizhou / Rahardja, Susanto:
"Using direction of arrival estimate and acoustic feature information in speaker diarization",
2149-2152.
Batista, Fernando / Caseiro, Diamantino / Mamede, Nuno / Trancoso, Isabel:
"Recovering punctuation marks for automatic speech recognition",
2153-2156.
Yeh, Jui-Feng / Wu, Chung-Hsien / Wu, Wei-Yen:
"Disfluency correction of spontaneous speech using conditional random fields with variable-length features",
2157-2160.
Huang, Jing / Marcheret, Etienne / Visweswariah, Karthik / Libal, Vit / Potamianos, Gerasimos:
"Detection, diarization, and transcription of far-field lecture speech",
2161-2164.
Hazen, Timothy J. / Sherry, Brennan / Adler, Mark:
"Speech-based annotation and retrieval of digital photographs",
2165-2168.
Guz, Umit / Cuendet, Sébastien / Hakkani-Tür, Dilek / Tur, Gokhan:
"Co-training using prosodic and lexical information for sentence segmentation",
2597-2600.
Estève, Yannick / Meignier, Sylvain / Deléglise, Paul / Mauclair, Julie:
"Extracting true speaker identities from transcriptions",
2601-2604.
Fu, Rong / Benest, Ian D.:
"An improved speaker diarization system",
2605-2608.
Stüker, Sebastian / Fügen, Christian / Kraft, Florian / Wölfel, Matthias:
"The ISL 2007 English speech transcription system for European Parliament speeches",
2609-2612.
Hwang, Mei-Yuh / Wang, Wen / Lei, Xin / Zheng, Jing / Cetin, Ozgur / Peng, Gang:
"Advances in Mandarin broadcast speech recognition",
2613-2616.
Ogata, Jun / Goto, Masataka / Eto, Kouichirou:
"Automatic transcription for a web 2.0 service to search podcasts",
2617-2620.
Language Learning and Assessment
Tepperman, Joseph / Kazemzadeh, Abe / Narayanan, Shrikanth S.:
"A text-free approach to assessing nonnative intonation",
2169-2172.
Lee, John / Seneff, Stephanie:
"Automatic generation of cloze items for prepositions",
2173-2176.
Waple, Christopher / Wang, Hongcui / Kawahara, Tatsuya / Tsubota, Yasushi / Dantsuji, Masatake:
"Evaluating and optimizing Japanese tutor system featuring dynamic question generation and interactive guidance",
2177-2180.
Cucchiarini, Catia / Neri, Ambra / Wet, Febe de / Strik, Helmer:
"ASR-based pronunciation training: scoring accuracy and pedagogical effectiveness of a system for Dutch L2 learners",
2181-2184.
Tepperman, Joseph / Black, Matthew / Price, Patti / Lee, Sungbok / Kazemzadeh, Abe / Gerosa, Matteo / Heritage, Margaret / Alwan, Abeer / Narayanan, Shrikanth S.:
"A Bayesian network classifier for word-level reading assessment",
2185-2188.
Multimodal Interaction: Analysis and Technology
Holzapfel, Hartwig / Waibel, Alex:
"Behavior models for learning and receptionist dialogs",
2189-2192.
Turunen, Markku / Hakulinen, Jaakko / Kainulainen, Anssi / Melto, Aleksi / Hurtig, Topi:
"Design of a rich multimodal interface for mobile spoken route guidance",
2193-2196.
Theune, Mariët / Hofs, Dennis / Kessel, Marco van:
"The virtual guide: a direction giving embodied conversational agent",
2197-2200.
Gandhe, Sudeep / Traum, David:
"Creating spoken dialogue characters from corpora without annotations",
2201-2204.
Hui, Pui-Yu / Zhou, Zhengyu / Meng, Helen:
"Complementarity and redundancy in multimodal user inputs with speech and pen gestures",
2205-2208.
Bell, Linda / Gustafson, Joakim:
"Children's convergence in referring expressions to graphical objects in a speech-enabled computer game",
2209-2212.
Emotion
Kawatsu, Hiromi / Ohno, Sumio:
"An analysis of individual differences in the F0 contour and the duration of anger utterances at several degrees",
2213-2216.
Arimoto, Yoshiko / Ohno, Sumio / Iida, Hitoshi:
"Acoustic features of anger utterances during natural dialog",
2217-2220.
Biadsy, Fadi / Hirschberg, Julia / Rosenberg, Andrew / Dakka, Wisam:
"Comparing American and Palestinian perceptions of charisma using acoustic-prosodic and lexical analysis",
2221-2224.
Busso, Carlos / Lee, Sungbok / Narayanan, Shrikanth S.:
"Using neutral speech models for emotional speech analysis",
2225-2228.
Satoh, N. / Yamauchi, K. / Matsunaga, S. / Yamashita, M. / Nakagawa, R. / Shinohara, K.:
"Emotion clustering using the results of subjective opinion tests for emotion recognition in infants' cries",
2229-2232.
Barra, R. / Montero, J. M. / Macias-Guarasa, J. / Gutiérrez-Arriola, J. / Ferreiros, J. / Pardo, J. M.:
"On the limitations of voice conversion techniques in emotion identification tasks",
2233-2236.
Dupuis, Kate / Pichora-Fuller, Kathleen:
"Use of lexical and affective prosodic cues to emotion by younger and older adults",
2237-2240.
Gupta, Purnima / Rajput, Nitendra:
"Two-stream emotion recognition for call center monitoring",
2241-2244.
Grichkovtsova, Ioulia / Lacheret, Anne / Morel, Michel:
"The role of intonation and voice quality in the affective speech perception",
2245-2248.
Vlasenko, Bogdan / Schuller, Björn / Wendemuth, Andreas / Rigoll, Gerhard:
"Combining frame and turn-level information for robust recognition of emotions within speech",
2249-2252.
Speakers: Expression, Emotion and Personality Recognition
Schuller, Björn / Batliner, Anton / Seppi, Dino / Steidl, Stefan / Vogt, Thurid / Wagner, Johannes / Devillers, Laurence / Vidrascu, Laurence / Amir, Noam / Kessous, Loic / Aharonson, Vered:
"The relevance of feature type for the automatic classification of emotional user states: low level descriptors and functionals",
2253-2256.
Quang, Vũ Minh / Besacier, Laurent / Castelli, Eric:
"Automatic question detection: prosodic-lexical features and crosslingual experiments",
2257-2260.
Tachibana, Makoto / Kawashima, Keigo / Yamagishi, Junichi / Kobayashi, Takao:
"Performance evaluation of HMM-based style classification with a small amount of training data",
2261-2264.
Truong, Khiet P. / Leeuwen, David A. van:
"Visualizing acoustic similarities between emotions in speech: an acoustic map of emotions",
2265-2268.
Hu, Hao / Xu, Ming-Xing / Wu, Wei:
"Fusion of global statistical and segmental spectral features for speech emotion recognition",
2269-2272.
Sethu, Vidhyasaharan / Ambikairajah, Eliathamby / Epps, Julien:
"Group delay features for emotion detection",
2273-2276.
Müller, Christian / Burkhardt, Felix:
"Combining short-term cepstral and long-term pitch features for automatic recognition of speaker age",
2277-2280.
Enos, Frank / Shriberg, Elizabeth / Graciarena, Martin / Hirschberg, Julia / Stolcke, Andreas:
"Detecting deception using critical segments",
2281-2284.
Nose, Takashi / Kato, Yoichi / Kobayashi, Takao:
"Style estimation of speech based on multiple regression hidden semi-Markov model",
2285-2288.
Zhang, Chi / Hansen, John H. L.:
"Analysis and classification of speech mode: whispered through shouted",
2289-2292.
First Language, Second Language, Cross-language
Bettoni-Techio, Melissa / Rauber, Andréia S. / Koerich, Rosana Denise:
"Perception and production of word-final alveolar stops by Brazilian Portuguese learners of English",
2293-2296.
Kluge, Denise Cristina / Rauber, Andréia S. / Reis, Mara Silvia / Bion, Ricardo A. Hoffmann:
"The relationship between the perception and production of English nasal codas by Brazilian learners of English",
2297-2300.
Utashiro, Takafumi / Kawai, Goh:
"CALL courseware for learning reactive tokens in face-to-face dialogs",
2301-2304.
Kiriyama, Shinya / Tsuji, Ryo / Kasami, Tomohiko / Ishikawa, Shogo / Otani, Naofumi / Horiuchi, Hiroaki / Takebayashi, Yoichi / Kitazawa, Shigeyoshi:
"The developmental analysis of demonstrative expression skills utilizing a multimodal infant behavior corpus",
2305-2308.
Lyakso, Elena E. / Frolova, Olga V.:
"Russian vowels system acoustic features development in ontogenesis",
2309-2312.
Alphen, Petra van / Bree, Elise de / Fikkert, Paula / Wijnen, Frank:
"The role of metrical stress in comprehension and production in Dutch children at-risk of dyslexia",
2313-2316.
Nakagawa, Seiichi / Ohta, Kei:
"A statistical method of evaluating pronunciation proficiency for presentation in English",
2317-2320.
Joto, Akiyo / Nagase, Yoshiki / Funatsu, Seiya:
"The intelligibility and its relations to acoustic characteristics of English /s/ and /ʃ/ produced by native speakers of Japanese",
2321-2324.
Goudbeek, Martijn / Swingley, Daniel / Kluender, Keith R.:
"The limits of multidimensional category learning",
2325-2328.
Uther, Maria / Uther, James / Athanasopoulos, Panos / Singh, Pushpendra / Akahane-Yamada, Reiko:
"Mobile adaptive CALL (MAC): a lightweight speech-based intervention for mobile language learners",
2329-2332.
Best, Catherine T. / Hallé, Pierre A. / Pardo, Jennifer S.:
"English and French speakers' perception of voicing distinctions in non-native lateral consonant syllable onsets",
2333-2336.
Lacerda, Francisco / Gustavsson, Lisa:
"Predicting the consequences of vocalizations in early infancy",
2337-2340.
Weenink, David / Chen, Guangqin / Chen, Zongyan / Konink, Stefan de / Vierkant, Dennis / Hagen, Eveline van / Son, R. J. J. H. van:
"Learning tone distinctions for Mandarin Chinese",
2341-2344.
Lai, Catherine / Gorman, Kyle / Yuan, Jiahong / Liberman, Mark:
"Perception of disfluency: language differences and listener bias",
2345-2348.
Novel Techniques for the NATO Non-native Air-traffic Control and HIWIRE Cockpit Databases
Pigeon, Stephane / Shen, Wade / Lawson, Aaron / Leeuwen, David A. van:
"Design and characterization of the non-native military air traffic communications database (nnMATC)",
2417-2420.
Shen, Wade / Reynolds, Douglas:
"A comparison of speaker clustering and speech recognition techniques for air situational awareness",
2421-2424.
Dimitriadis, Dimitrios / Segura, Jose C. / Garcia, Luz / Potamianos, Alexandros / Maragos, Petros / Pitsikalis, Vassilis:
"Advanced front-end for robust speech recognition in extremely adverse environments",
2425-2428.
Gemello, Roberto / Mana, Franco / Scanzio, Stefano:
"Experiments on HIWIRE database using denoising and adaptation with a hybrid HMM-ANN model",
2429-2432.
Smolenski, Brett Y.:
"Detection and removal of switching noise in push-to-talk and voice operated exchange communications systems",
2433-2436.
Buera, Luis / Miguel, Antonio / Saz, Óscar / Lleida, Eduardo / Ortega, Alfonso:
"Evaluation of the combined use of MEMLIN and MLLR on the non-native adaptation task of HIWIRE project database",
2437-2440.
Systems for Spoken Language Translation I, II
Déchelotte, Daniel / Schwenk, Holger / Adda, Gilles / Gauvain, Jean-Luc:
"Improved machine translation of speech-to-text outputs",
2441-2444.
Saleem, S. / Subramanian, K. / Prasad, R. / Stallard, David / Kao, Chia-Lin / Natarajan, P. / Suleiman, R.:
"Improvements in machine translation for English/Iraqi speech translation",
2445-2448.
Matusov, Evgeny / Hillard, Dustin / Magimai-Doss, Mathew / Hakkani-Tür, Dilek / Ostendorf, Mari / Ney, Hermann:
"Improving speech translation with automatic boundary prediction",
2449-2452.
Cattoni, Roldano / Bertoldi, Nicola / Federico, Marcello:
"Punctuating confusion networks for speech translation",
2453-2456.
Reddy, Aarthi / Rose, Richard / Désilets, Alain:
"Integration of ASR and machine translation models in a document translation task",
2457-2460.
Tam, Yik-Cheung / Schultz, Tanja:
"Bilingual LSA-based translation lexicon adaptation for spoken language translation",
2461-2464.
Stallard, David / Choi, Fred / Kao, Chia-Lin / Krstovski, Kriste / Natarajan, P. / Prasad, R. / Saleem, S. / Subramanian, K.:
"The BBN 2007 displayless English/Iraqi speech-to-speech translation system",
2817-2820.
Sarikaya, Ruhi / Deng, Yonggang / Gao, Yuqing:
"Context dependent word modeling for statistical machine translation using part-of-speech tags",
2821-2824.
Appling, Darren Scott / Campbell, Nick:
"Translating conversational speech to standard linguistic form",
2825-2828.
Lavecchia, Caroline / Smaïli, Kamel / Langlois, David / Haton, Jean-Paul:
"Using inter-lingual triggers for machine translation",
2829-2832.
Falavigna, Daniele / Bertoldi, Nicola / Brugnara, Fabio / Cattoni, Roldano / Cettolo, Mauro / Chen, Boxing / Federico, Marcello / Giuliani, Diego / Gretter, Roberto / Gupta, Deepa / Seppi, Dino:
"The IRST English-Spanish translation system for European Parliament speeches",
2833-2836.
Fügen, Christian / Kolss, Muntsin:
"The influence of utterance chunking on machine translation performance",
2837-2840.
Precoda, Kristin / Zheng, Jing / Vergyri, Dimitra / Franco, Horacio / Richey, Colleen / Kathol, Andreas / Kajarekar, Sachin:
"IraqComm: a next generation translation system",
2841-2844.
Rao, Sharath / Lane, Ian / Schultz, Tanja:
"Optimizing sentence segmentation for spoken language translation",
2845-2848.
Articulatory Features
Richmond, Korin:
"A multitask learning perspective on acoustic-articulatory inversion",
2465-2468.
Qin, Chao / Carreira-Perpiñán, Miguel Á.:
"A comparison of acoustic features for articulatory inversion",
2469-2472.
Scharenborg, Odette / Wan, Vincent:
"Can unquantised articulatory feature continuums be modelled?",
2473-2476.
Shah, Milind S. / Pandey, Prem C.:
"Estimation of place of articulation in stop consonants for visual feedback",
2477-2480.
Potard, Blaise / Laprie, Yves:
"Compact representations of the articulatory-to-acoustic mapping",
2481-2484.
Frankel, Joe / Magimai-Doss, Mathew / King, Simon / Livescu, Karen / Çetin, Özgür:
"Articulatory feature classifiers trained on 2000 hours of telephone speech",
2485-2488.
Wideband Speech Processing
Nour-Eldin, Amr H. / Kabal, Peter:
"Objective analysis of the effect of memory inclusion on bandwidth extension of narrowband speech",
2489-2492.
Geiser, Bernd / Taddei, Hervé / Vary, Peter:
"Artificial bandwidth extension without side information for ITU-T G.729.1",
2493-2496.
Pulakka, Hannu / Alku, Paavo / Laaksonen, Laura / Valve, Päivi:
"The effect of highband harmonic structure in the artificial bandwidth expansion of telephone speech",
2497-2500.
Kuroiwa, Shingo / Takashina, Masashi / Tsuge, Satoru / Fuji, Ren:
"Artificial bandwidth extension for speech signals using speech recognition",
2501-2504.
Guerchi, Driss / Rabie, Tamer / Louzi, Abdelrhani:
"Voicing-based codebook in low-rate wideband CELP coding",
2505-2508.
Duni, Ethan R. / Rao, Bhaskar D.:
"Performance of speaker-dependent wideband speech coding",
2509-2512.
Accessibility Issues
Dreuw, Philippe / Rybach, David / Deselaers, Thomas / Zahedi, Morteza / Ney, Hermann:
"Speech recognition techniques for a sign language recognition system",
2513-2516.
Nakamura, Keigo / Toda, Tomoki / Saruwatari, Hiroshi / Shikano, Kiyohiro:
"Impact of various small sound source signals on voice conversion accuracy in speech communication aid for laryngectomees",
2517-2520.
Cerva, Petr / Nouza, Jan:
"Design and development of voice controlled aids for motor-handicapped persons",
2521-2524.
Katsurada, Kouichi / Okuma, Yuji / Yano, Makoto / Iribe, Yurie / Nitta, Tsuneo:
"Management of static/dynamic properties in a multimodal interaction system",
2525-2528.
San-Segundo, R. / Pérez, A. / Ortiz, D. / D'Haro, L. F. / Torres, M. Inés / Casacuberta, F.:
"Evaluation of alternatives on speech to sign language translation",
2529-2532.
Németh, Géza / Olaszy, Gábor / Bartalis, Mátyás / Kiss, Géza / Zainkó, Csaba / Mihajlik, Péter:
"Speech based drug information system for aged and visually impaired persons",
2533-2536.
Nogueira, Waldo / Harczos, Tamás / Edler, Bernd / Ostermann, Jörn / Büchner, Andreas:
"Automatic speech recognition with a cochlear implant front-end",
2537-2540.
Suk, Soo-Young / Kojima, Hiroaki:
"Voice activated powered wheelchair with non-voice rejection algorithm",
2541-2544.
Sitbon, Laurianne / Bellot, Patrice / Blache, Philippe:
"Phonetic based sentence level rewriting of questions typed by dyslexic spellers in an information retrieval context",
2545-2548.
New Application Areas
Berton, André / Regel-Brietzmann, Peter / Block, Hans-Ulrich / Schachtl, Stefanie / Gehrke, Manfred:
"How to integrate speech-operated internet information dialogs into a car",
2549-2552.
Glass, James / Hazen, Timothy J. / Cyphers, Scott / Malioutov, Igor / Huynh, David / Barzilay, Regina:
"Recent progress in the MIT spoken lecture processing project",
2553-2556.
Fischer, Philipp / Österle, Andreas / Berton, André / Regel-Brietzmann, Peter:
"How to personalize speech applications for web-based information in a car",
2557-2560.
Ikeda, Satoshi / Komatani, Kazunori / Ogata, Tetsuya / Okuno, Hiroshi G.:
"Topic estimation with domain extensibility for guiding user's out-of-grammar utterances in multi-domain spoken dialogue systems",
2561-2564.
Nishimura, Ryota / Kitaoka, Norihide / Nakagawa, Seiichi:
"Prosody change and response timing analysis in spontaneously spoken dialogs and their modeling in a spoken dialog system",
2565-2568.
Tamura, Satoshi / Takamatsu, Kunihiko / Ogura, Shinji / Hayamizu, Satoru:
"GEMSIS - a novel application of speech recognition to emergency and disaster medicine",
2569-2572.
Coulston, Rachel / Klabbers, Esther / Villiers, Jacques de / Hosom, John-Paul:
"Application of speech technology in a home based assessment kiosk for early detection of Alzheimer's disease",
2573-2576.
Vybornova, Olga / Gemo, Monica / Moncarey, Ronald / Macq, Benoit:
"Ontology-based multimodal high level fusion involving natural language analysis for aged people home care application",
2577-2580.
Story Segmentation
Chan, Shing-kai / Xie, Lei / Meng, Helen:
"Modeling the statistical behavior of lexical chains to capture word cohesiveness for automatic story segmentation",
2581-2584.
Fung, James G. / Hakkani-Tür, Dilek / Magimai-Doss, Mathew / Shriberg, Elizabeth / Cuendet, Sébastien / Mirghafori, Nikki:
"Cross-linguistic analysis of prosodic features for sentence segmentation",
2585-2588.
Rosenberg, Andrew / Sharifi, Mehrbod / Hirschberg, Julia:
"Varying input segmentation for story boundary detection in English, Arabic and Mandarin broadcast news",
2589-2592.
Kolluru, BalaKrishna / Gotoh, Yoshihiko:
"Speaker role based structural classification of broadcast news stories",
2593-2596.
Prosody: Production
Jilka, Matthias / Möbius, Bernd:
"The influence of vowel quality features on peak alignment",
2621-2624.
Shue, Yen-Liang / Iseli, Markus / Veilleux, Nanette / Alwan, Abeer:
"Pitch accent versus lexical stress: quantifying acoustic measures related to the voice source",
2625-2628.
Benus, Stefan / Gravano, Agustín / Hirschberg, Julia:
"Prosody, emotions, and… ‘whatever’",
2629-2632.
Gu, Wentao / Ho, Rerrario Shui-Ching / Lee, Tan:
"Modeling tones in Hakka on the basis of the command-response model",
2633-2636.
Kentner, Gerrit:
"Length, ordering preference and intonational phrasing: evidence from pauses",
2637-2640.
Peters, Jörg / Hanssen, Judith / Gussenhoven, Carlos:
"Alignment of the second low target in Dutch falling-rising pitch contours",
2641-2644.
Moniz, Helena / Mata, Ana Isabel / Viana, M. Céu:
"On filled-pauses and prolongations in European Portuguese",
2645-2648.
Prosody: Perception
Olsberg, Michael / Xu, Yi / Green, Jeremy:
"Dependence of tone perception on syllable perception",
2649-2652.
Winkler, Ralf:
"Testing the relevance of speech rate, pitch and a glottal chink for the perception of age in synthesized speech using formant synthesis",
2653-2656.
Böhm, Tamás / Shattuck-Hufnagel, Stefanie:
"Utterance-final glottalization as a cue for familiar speaker recognition",
2657-2660.
Huang, Chun-Fang / Akagi, Masato:
"A rule-based speech morphing for verifying an expressive speech perception model",
2661-2664.
Helander, Elina E. / Nurminen, Jani:
"On the importance of pure prosody in the perception of speaker identity",
2665-2668.
Chen, Shi-Han / Kuo, Chih-Chung:
"Perceptual relevance of pitch contours of Mandarin tones and its efficacy in prosody generation of speech synthesis",
2669-2672.
Nishizaki, Hiromitsu / Sohmiya, Mitsuhiro / Kobayashi, Kenji / Sekiguchi, Yoshihiro:
"The effect of filled pauses in a lecture speech on impressive evaluation of listeners",
2673-2676.
Li, Yujia / Lee, Tan:
"Perceptual equivalence of approximated Cantonese tone contours",
2677-2680.
Shahid, Suleman / Krahmer, Emiel / Swerts, Marc:
"Audiovisual emotional speech of game playing children: effects of age and culture",
2681-2684.
Machine Learning for Spoken Dialog Systems
Lemon, Oliver / Pietquin, Olivier:
"Machine learning for spoken dialogue systems",
2685-2688.
Rieser, Verena / Lemon, Oliver:
"Learning dialogue strategies for interactive database search",
2689-2692.
Cuayáhuitl, Heriberto / Renals, Steve / Lemon, Oliver / Shimodaira, Hiroshi:
"Hierarchical dialogue optimization using semi-Markov decision processes",
2693-2696.
Ai, Hua / Litman, Diane J.:
"Knowledge consistent user simulations for dialog systems",
2697-2700.
Wu, Hsu-Chih / Seneff, Stephanie:
"Reducing recognition error rate based on context relationships among dialogue turns",
2701-2704.
Misu, Teruhisa / Kawahara, Tatsuya:
"Bayes risk-based optimization of dialogue management for document retrieval system with speech interface",
2705-2708.
Phonetics
Ulbrich, Christiane / Ulbrich, Horst:
"Realisations and alternations in German /r/-realisation",
2733-2736.
Doty, Christopher S. / Idemaru, Kaori / Guion, Susan G.:
"Singleton and geminate stops in Finnish - acoustic correlates",
2737-2740.
Bael, Christophe Van / Baayen, Harald / Strik, Helmer:
"Segment deletion in spontaneous speech: a corpus study using mixed effects models with crossed random effects",
2741-2744.
Zheng, Hongying / Tsang, Peter W. M. / Wang, William S.-Y.:
"Categorical perception of Cantonese tones in context: a cross-linguistic study",
2745-2748.
Chen, Yiya / Yuan, Jiahong:
"A corpus study of the 3rd tone sandhi in Standard Chinese",
2749-2752.
Harrington, Jonathan / Palethorpe, Sallyanne / Watson, Catherine I.:
"Age-related changes in fundamental frequency and formants: a longitudinal study of four speakers",
2753-2756.
Spoken Language Understanding and Summarization
Zhang, Jian / Chan, Ho Yin / Fung, Pascale / Cao, Lu:
"A comparative study on speech summarization of broadcast news and lecture speech",
2781-2784.
Murray, Gabriel / Renals, Steve:
"Towards online speech summarization",
2785-2788.
Yamagata, Tomoyuki / Sako, Atsushi / Takiguchi, Tetsuya / Ariki, Yasuo:
"System request detection in conversation based on acoustic and speaker alternation features",
2789-2792.
Levit, Michael / Boschee, Elizabeth / Freedman, Marjorie:
"Selecting on-topic sentences from natural language corpora",
2793-2796.
Kim, Seokhwan / Jeong, Minwoo / Lee, Gary Geunbae:
"A semi-supervised method for efficient construction of statistical spoken language understanding resources",
2797-2800.
Fujii, Yasuhisa / Kitaoka, Norihide / Nakagawa, Seiichi:
"Automatic extraction of cue phrases for important sentences in lecture speech and automatic lecture speech summarization",
2801-2804.
Chen, Yi-Ting / Chiu, Hsuan-Sheng / Wang, Hsin-Min / Chen, Berlin:
"A unified probabilistic generative framework for extractive spoken document summarization",
2805-2808.
Hébert, Matthieu:
"Generic class-based statistical language models for robust speech understanding in directed dialog applications",
2809-2812.
Seltzer, Michael L. / Ju, Yun-Cheng / Tashev, Ivan / Acero, Alex:
"Robust location understanding in spoken dialog systems using intersections",
2813-2816.
Voice Activity Detection and Sound Classification
Markaki, Maria / Wohlmayr, Michael / Stylianou, Yannis:
"Speech-nonspeech discrimination using the information bottleneck method and spectro-temporal modulation index",
2913-2916.
Jang, Keun Won / Kim, Dong Kook / Chang, Joon-Hyuk:
"A uniformly most powerful test for statistical model-based voice activity detection",
2917-2920.
Dines, John / Vepa, Jithendra:
"Direct optimisation of a multilayer perceptron for the estimation of cepstral mean and variance statistics",
2921-2924.
Huijbregts, Marijn / Wooters, Chuck / Ordelman, Roeland:
"Filtering the unknown: speech activity detection in heterogeneous video collections",
2925-2928.
Sangwan, Abhijeet / Krishnamurthy, Nitish / Hansen, John H. L.:
"Environmentally aware voice activity detector",
2929-2932.
Fujimoto, Masakiyo / Ishizuka, Kentaro:
"Noise robust voice activity detection based on switching Kalman filter",
2933-2936.
Jo, Q-Haing / Park, Yun-Sik / Lee, Kye-Hwan / Song, Ji-Hyun / Chang, Joon-Hyuk:
"Voice activity detection based on support vector machine using effective feature vectors",
2937-2940.
Murty, K. Sri Rama / Yegnanarayana, B. / Guruprasad, S.:
"Voice activity detection in degraded speech using excitation source information",
2941-2944.
Cournapeau, David / Kawahara, Tatsuya:
"Evaluation of real-time voice activity detection based on high order statistics",
2945-2948.
Guo, Yanmeng / Qian, Qian / Yan, Yonghong:
"Robust voice activity detection based on adaptive sub-band energy sequence analysis and harmonic detection",
2949-2952.
Fredouille, Corinne / Evans, Nicholas:
"The influence of speech activity detection and overlap on speaker diarization for meeting room recordings",
2953-2956.
Kim, Gibak / Cho, Nam Ik:
"Voice activity detection using the phase vector in microphone array",
2957-2960.
Flego, Federico / Zieger, Christian / Omologo, Maurizio:
"Adaptive weighting of microphone arrays for distant-talking F0 and voiced/unvoiced estimation",
2961-2964.
Murthy, A. Sreenivasa / Sekhar, S. Chandra / Sreenivas, T. V.:
"Robust and high-resolution voiced/unvoiced classification in noisy speech using a signal smoothness criterion",
2965-2968.
Sainath, Tara N. / Zue, Victor / Kanevsky, Dimitri:
"Audio classification using extended Baum-Welch transformations",
2969-2972.
Knox, Mary Tai / Mirghafori, Nikki:
"Automatic laughter detection using neural networks",
2973-2976.
Peng, Gang / Hwang, Mei-Yuh / Ostendorf, Mari:
"Automatic acoustic segmentation for speech recognition on broadcast recordings",
2977-2980.
Unreviewed Papers for Special Sessions
Birkholz, Peter:
"Articulatory synthesis of singing",
4001-4004.
Saitou, Takeshi / Goto, Masataka / Unoki, Masashi / Akagi, Masato:
"Vocal conversion from speaking voice to singing voice using STRAIGHT",
4005-4006.
Roebel, Axel / Fineberg, Joshua:
"Speech to chant transformation with the phase vocoder",
4007-4008.
Kenmochi, Hideki / Ohshita, Hayato:
"VOCALOID - commercial singing synthesizer based on sample concatenation",
4009-4010.
D'Alessandro, Nicolas / Dutoit, Thierry:
"RAMCESS/HandSketch: a multi-representation framework for realtime and expressive singing synthesis",
4011-4012.
Ternström, Sten / Sundberg, Johan:
"Formant-based synthesis of singing",
4013-4014.
Sloetjes, Han / Russel, Albert / Klassmann, Alexander:
"ELAN: a free and open-source multimedia annotation tool",
4015-4016.
Szakos, Jozsef / Glavitsch, Ulrike:
"SpeechIndexer in action: managing endangered Formosan languages",
4017-4019.
Ifukube, Tohru / Shimizu, Yasuyuki:
"A portable record player for wax cylinders using a laser-beam reflection method",
4020.