Table of Contents and Access to Abstracts
Language Modeling for Spoken Dialog Systems
Purver, Matthew / Ratiu, Florin / Cavedon, Lawrence:
"Robust interpretation in dialogue by combining confidence scores with contextual features",
paper 1314-Mon1A1O.1.
Ye, Hui / Young, Steve:
"A clustering approach to semantic decoding",
paper 1118-Mon1A1O.2.
Misu, Teruhisa / Kawahara, Tatsuya:
"A bootstrapping approach for developing language model of new spoken dialogue systems by selecting web texts",
paper 1167-Mon1A1O.3.
Horndasch, Axel / Nöth, Elmar / Batliner, Anton / Warnke, Volker:
"Phoneme-to-grapheme mapping for spoken inquiries to the semantic web",
paper 1635-Mon1A1O.4.
Weilhammer, Karl / Stuttle, Matthew N. / Young, Steve:
"Bootstrapping language models for dialogue systems",
paper 1482-Mon1A1O.5.
Feng, Junlan:
"Question answering with discriminative learning algorithms",
paper 1642-Mon1A1O.6.
Feature Enhancement for Robust ASR
Kenny, Patrick / Gupta, Vishwa / Boulianne, G. / Ouellet, Pierre / Dumouchel, Pierre:
"Feature normalization using smoothed mixture transformations",
paper 1026-Mon1A2O.1.
Hsieh, Chia-Hsin / Wu, Chung-Hsien / Lin, Jun-Yu:
"Stochastic vector mapping-based feature enhancement using prior model and environment adaptation for noisy speech recognition",
paper 1170-Mon1A2O.2.
Nasersharif, Babak / Akbari, Ahmad:
"A framework for robust MFCC feature extraction using SNR-dependent compression of enhanced mel filter bank energies",
paper 1632-Mon1A2O.3.
Faubel, Friedrich / Wölfel, Matthias:
"Coupling particle filters with automatic speech recognition for speech feature enhancement",
paper 1683-Mon1A2O.4.
Hsu, Chang-wen / Lee, Lin-shan:
"Extension and further analysis of higher order cepstral moment normalization (HOCMN) for robust features in speech recognition",
paper 1748-Mon1A2O.5.
Islam, Md. Babul / Matsumoto, Hiroshi / Yamamoto, Kazumasa:
"An improved mel-Wiener filter for mel-LPC based speech recognition",
paper 1830-Mon1A2O.6.
Dialog and Discourse
Hurtado, Lluis F. / Griol, David / Segarra, Encarna / Sanchis, Emilio:
"A stochastic approach for dialog management based on neural networks",
paper 1206-Mon1A3O.1.
Rotaru, Mihai / Litman, Diane J.:
"Discourse structure and speech recognition problems",
paper 1650-Mon1A3O.2.
Banerjee, Satanjeev / Rudnicky, Alexander I.:
"A TextTiling based approach to topic boundary detection in meetings",
paper 1827-Mon1A3O.3.
Schulz, Stefan / Donker, Hilko:
"An user-centered development of an intuitive dialog control for speech-controlled music selection in cars",
paper 1855-Mon1A3O.4.
Raux, Antoine / Bohus, Dan / Langner, Brian / Black, Alan W. / Eskenazi, Maxine:
"Doing research on a deployed spoken dialogue system: one year of Let's Go! experience",
paper 1794-Mon1A3O.5.
Liscombe, Jackson / Venditti, Jennifer J. / Hirschberg, Julia:
"Detecting question-bearing turns in spoken tutorial dialogues",
paper 1491-Mon1A3O.6.
The Speech Separation Challenge
Srinivasan, Soundararajan / Shao, Yang / Jin, Zhaozhang / Wang, DeLiang:
"A computational auditory scene analysis system for robust speech recognition",
paper 1547-Mon1WeS.1.
Han, Runqiang / Zhao, Pei / Gao, Qin / Zhang, Zhiping / Wu, Hao / Wu, Xihong:
"CASA based speech separation for robust speech recognition",
paper 2068-Mon1WeS.2.
Every, Mark R. / Jackson, Philip J.B.:
"Enhancement of harmonic content of speech based on a dynamic programming pitch tracking algorithm",
paper 1637-Mon1WeS.3.
Barker, Jon / Coy, André / Ma, Ning / Cooke, Martin:
"Recent advances in speech fragment decoding techniques",
paper 1479-Mon1WeS.4.
Virtanen, Tuomas:
"Speech recognition using factorial hidden Markov models for separation in the feature space",
paper 1850-Mon1WeS.5.
Ming, Ji / Hazen, Timothy J. / Glass, James R.:
"Combining missing-feature theory, speech enhancement and speaker-dependent/-independent modeling for speech separation",
paper 1377-Mon1WeS.6.
Kristjansson, T. / Hershey, J. / Olsen, P. / Rennie, S. / Gopinath, Ramesh:
"Super-human multi-talker speech recognition: the IBM 2006 speech separation challenge system",
paper 1775-Mon1WeS.7.
Deshmukh, Om D. / Espy-Wilson, Carol Y.:
"Modified phase opponency based solution to the speech separation challenge",
paper 1936-Mon1WeS.8.
Multilingual and Multi-Accent Processing
Lööf, J. / Bisani, M. / Gollan, Ch. / Heigold, G. / Hoffmeister, Björn / Plahl, Ch. / Schlüter, Ralf / Ney, Hermann:
"The 2006 RWTH parliamentary speeches transcription system",
paper 1545-Mon1BuP.1.
Bouselmi, G. / Fohr, D. / Illina, I. / Haton, Jean-Paul:
"Multilingual non-native speech recognition using phonetic confusion-based acoustic model modification and graphemic constraints",
paper 1569-Mon1BuP.2.
Chan, Joyce Y. C. / Ching, P. C. / Lee, Tan / Cao, Houwei:
"Automatic speech recognition of Cantonese-English code-mixing utterances",
paper 1065-Mon1BuP.3.
Zimmerman, M. / Hakkani-Tür, Dilek / Fung, J. / Mirghafori, N. / Gottlieb, L. / Shriberg, Elizabeth / Liu, Yang:
"The ICSI+ multilingual sentence segmentation system",
paper 1808-Mon1BuP.4.
Cheng, Yan Ming / Ma, Changxue / Melnar, Lynette:
"Cross-language evaluation of voice-to-phoneme conversions for voice-tag application in embedded platforms",
paper 1062-Mon1BuP.5.
Wang, Huanliang / Qian, Yao / Soong, Frank K. / Zhou, Jian-Lai / Han, Jiqing:
"A multi-space distribution (MSD) approach to speech recognition of tonal languages",
paper 1473-Mon1BuP.6.
Le, Viet Bac / Besacier, Laurent:
"Comparison of acoustic modeling techniques for Vietnamese and Khmer ASR",
paper 1662-Mon1BuP.7.
Liu, Yi / Fung, Pascale:
"Multi-accent Chinese speech recognition",
paper 1887-Mon1BuP.8.
Ghorshi, Seyed / Vaseghi, Saeed / Yan, Qin:
"Comparative analysis of formants of British, American and Australian accents",
paper 1252-Mon1BuP.9.
Liu, Linquan / Zheng, Thomas Fang / Wu, Wenhu:
"Automatic initial/final generation for dialectal Chinese speech recognition",
paper 1051-Mon1BuP.10.
Sarikaya, Ruhi / Emam, Ossama / Zitouni, Imed / Gao, Yuqing:
"Maximum entropy modeling for diacritization of Arabic text",
paper 1418-Mon1BuP.11.
Lihan, Slavomír / Juhár, Jozef / Cizmár, Anton:
"Comparison of Slovak and Czech speech recognition based on grapheme and phoneme acoustic models",
paper 1462-Mon1BuP.12.
Corpora, Annotation, and Assessment Metrics I, II
Jones, Rhys James / Choy, Ambrose / Williams, Briony:
"Integrating Festival and Windows",
paper 1380-Mon1CaP.1.
Munteanu, Cosmin / Penn, Gerald / Baecker, Ron / Toms, Elaine / James, David:
"Measuring the acceptable word error rate of machine-generated webcast transcripts",
paper 1756-Mon1CaP.2.
Nagino, Goshu / Shozakai, Makoto:
"Analyzing reusability of speech corpus based on statistical multidimensional scaling method",
paper 1382-Mon1CaP.3.
Fitt, Susan / Richmond, Korin:
"Redundancy and productivity in the speech technology lexicon - can we do better?",
paper 1202-Mon1CaP.4.
Yamada, Takeshi / Kumakura, Masakazu / Kitawaki, Nobuhiko:
"Word intelligibility estimation of noise-reduced speech",
paper 1443-Mon1CaP.5.
Draxler, Christoph:
"Exploring the unknown - collecting 1000 speakers over the internet for the ph@ttsessionz database of adolescent speakers",
paper 1217-Mon1CaP.6.
Murphy, Timothy / Picovici, Dorel / Mahdi, Abdulhussain E.:
"A new single-ended measure for assessment of speech quality",
paper 1538-Mon1CaP.7.
Ní Chasaide, Ailbhe / Wogan, John / Raghallaigh, Brian Ó / Bhriain, Áine Ní / Zoerner, Eric / Berthelsen, Harald / Gobl, Christer:
"Speech technology for minority languages: the case of Irish (Gaelic)",
paper 1378-Mon1CaP.8.
Fraga, Francisco José / Ynoguti, Carlos Alberto / Chiovato, André Godoi:
"Further investigations on the relationship between objective measures of speech quality and speech recognition rates in noisy environments",
paper 1877-Mon1CaP.9.
Grancharov, Volodya / Zhao, David Y. / Lindblom, Jonas / Kleijn, W. Bastiaan:
"Non-intrusive speech quality assessment with low computational complexity",
paper 1391-Mon1CaP.10.
Liang, Min-Siong / Lyu, Ren-Yuan / Chiang, Yuang-Chin:
"Using speech recognition technique for constructing a phonetically transcribed Taiwanese (Min-nan) text corpus",
paper 1442-Mon1CaP.11.
Zgank, Andrej / Rotovnik, Tomas / Grasic, Matej / Kos, Marko / Vlaj, Damjan / Kacic, Zdravko:
"Sloparl - Slovenian parliamentary speech and text corpus for large vocabulary continuous speech recognition",
paper 1493-Mon1CaP.12.
Toh, Siew Leng / Yang, Fan / Heeman, Peter A.:
"An annotation scheme for agreement analysis",
paper 1857-Mon1CaP.13.
Aoki, Hitoshi / Kurashima, Atsuko / Takahashi, Akira:
"Conversational quality estimation model for wideband IP-telephony services",
paper 1036-Tue2WeO.1.
Kilanski, Kelley / Malkin, Jonathan / Li, Xiao / Wright, Richard / Bilmes, Jeff A.:
"The vocal joystick data collection effort and vowel corpus",
paper 1885-Tue2WeO.2.
Sityaev, Dmitry / Knill, Katherine / Burrows, Tina:
"Comparison of the ITU-T P.85 standard to other methods for the evaluation of text-to-speech systems",
paper 1233-Tue2WeO.3.
Heeman, Peter A. / McMillin, Andy / Yaruss, J. Scott:
"An annotation scheme for complex disfluencies",
paper 1859-Tue2WeO.4.
Bael, Christophe Van / Boves, Lou / Heuvel, Henk van den / Strik, Helmer:
"Automatic phonetic transcription of large speech corpora: a comparative study",
paper 1173-Tue2WeO.5.
Shi, Yongmei / Zhou, Lina:
"Examining knowledge sources for human error correction",
paper 1530-Tue2WeO.6.
Speech Coding
Chang, Joon-Hyuk / Lim, Woohyung / Kim, Nam Soo:
"Signal modification incorporating perceptual weighting filter",
paper 1495-Mon1FoP.1.
Nurminen, Jani:
"Enhanced dynamic codebook reordering for advanced quantizer structures",
paper 1560-Mon1FoP.2.
Lee, Chang-Heon / Jung, Sung-Kyo / Eriksson, Thomas / Jun, Won-Suk / Kang, Hong-Goo:
"An efficient segment-based speech compression technique for hand-held TTS systems",
paper 1980-Mon1FoP.3.
Ramasubramanian, V. / Harish, D.:
"An unified unit-selection framework for ultra low bit-rate speech coding",
paper 2028-Mon1FoP.4.
Thyssen, Jes / Chen, Juin-Hwey:
"Efficient VQ techniques and general noise shaping in noise feedback coding",
paper 1254-Mon1FoP.5.
Qian, Yasheng / Hsu, Wei-Shou / Kabal, Peter:
"Classified comfort noise generation for efficient voice transmission",
paper 1307-Mon1FoP.6.
Kövesi, Balázs / Massaloux, Dominique / Virette, David / Bensa, Julien:
"Integration of a CELP coder in the ARDOR universal sound codec",
paper 1309-Mon1FoP.7.
Chatterjee, Saikat / Sreenivas, T. V.:
"Two stage transform vector quantization of LSFs for wideband speech coding",
paper 1433-Mon1FoP.8.
Chatterjee, Saikat / Sreenivas, T. V.:
"Comparison of prediction based LSF quantization methods using split VQ",
paper 1435-Mon1FoP.9.
Hofbauer, Konrad / Kubin, Gernot:
"High-rate data embedding in unvoiced speech",
paper 1906-Mon1FoP.10.
Anderson, Kyle D. / Gournay, Philippe:
"Pitch resynchronization while recovering from a late frame in a predictive speech decoder",
paper 1029-Mon1FoP.11.
Speech Enhancement I, II
Suhadi, Suhadi / Stan, Sorel / Fingscheidt, Tim:
"A novel environment-dependent speech enhancement method with optimized memory footprint",
paper 1181-Mon2A1O.1.
Zavarehei, Esfandiar / Vaseghi, Saeed / Yan, Qin:
"Weighted codebook mapping for noisy speech enhancement using harmonic-noise model",
paper 1244-Mon2A1O.2.
Jensen, J. / Hendriks, R. C. / Erkelens, J. S. / Heusdens, R.:
"MMSE estimation of complex-valued discrete Fourier coefficients with generalized gamma priors",
paper 1277-Mon2A1O.3.
Subramanya, Amarnag / Seltzer, Michael L. / Acero, Alex:
"Automatic removal of typed keystrokes from speech signals",
paper 1324-Mon2A1O.4.
Rank, Erhard / Kubin, Gernot:
"Lattice LP filtering for noise reduction in speech signals",
paper 1643-Mon2A1O.5.
Deshmukh, Om D. / Espy-Wilson, Carol Y.:
"Speech enhancement using modified phase opponency model",
paper 1699-Mon2A1O.6.
Jin, Wen / Scordilis, Michael:
"Single channel speech enhancement by frequency domain constrained optimization and temporal masking",
paper 1027-Tue3FoP.1.
Shin, Jong Won / Lee, Seung Yeol / Yun, Hwan Sik / Kim, Nam Soo:
"Speech enhancement based on residual noise shaping",
paper 1201-Tue3FoP.2.
Pulakka, Hannu / Laaksonen, Laura / Alku, Paavo:
"Quality improvement of telephone speech by artificial bandwidth expansion - listening tests in three languages",
paper 1245-Tue3FoP.3.
Shannon, Benjamin J. / Paliwal, Kuldip K.:
"Role of phase estimation in speech enhancement",
paper 1330-Tue3FoP.4.
Shannon, Benjamin J. / Paliwal, Kuldip K. / Nadeu, Climent:
"Speech enhancement based on spectral estimation from higher-lag autocorrelation",
paper 1331-Tue3FoP.5.
Krishnamurthy, Nitish / Hansen, John H. L.:
"Noise update modeling for speech enhancement: when do we do enough?",
paper 1396-Tue3FoP.6.
Shahina, A. / Yegnanarayana, B.:
"Mapping neural networks for bandwidth extension of narrowband speech",
paper 1840-Tue3FoP.7.
Das, Amit / Hansen, John H. L.:
"Decision directed constrained iterative speech enhancement",
paper 1866-Tue3FoP.8.
Murakami, Takahiro / Ishida, Yoshihisa:
"Adaptive filtering for attenuating musical noise caused by spectral subtraction",
paper 1919-Tue3FoP.9.
Hu, Yi / Loizou, Philipos C.:
"Evaluation of objective measures for speech enhancement",
paper 2007-Tue3FoP.10.
Song, Myung-Suk / Lee, Chang-Heon / Kang, Hong-Goo:
"Performance analysis of various single channel speech enhancement algorithms for automatic speech recognition",
paper 2073-Tue3FoP.11.
ASR Other I, II
Boulianne, G. / Beaumont, J.-F. / Boisvert, M. / Brousseau, J. / Cardinal, P. / Chapdelaine, C. / Comeau, M. / Ouellet, Pierre / Osterrath, F.:
"Computer-assisted closed-captioning of live TV broadcasts in French",
paper 1424-Mon2A2O.1.
Afify, Mohamed / Sarikaya, Ruhi / Kuo, Hong-Kwang Jeff / Besacier, Laurent / Gao, Yuqing:
"On the use of morphological analysis for dialectal Arabic speech recognition",
paper 1444-Mon2A2O.2.
Trancoso, Isabel / Nunes, Ricardo / Neves, Luís / Viana, Céu / Moniz, Helena / Caseiro, Diamantino / Mata, Ana Isabel:
"Recognition of classroom lectures in European Portuguese",
paper 1524-Mon2A2O.3.
Pellegrini, Thomas / Lamel, Lori:
"Investigating automatic decomposition for ASR in less represented languages",
paper 1776-Mon2A2O.4.
Nimaan, Abdillahi / Nocéra, Pascal / Bonastre, Jean-François:
"Automatic transcription of Somali language",
paper 1817-Mon2A2O.5.
Çetin, Özgür / Shriberg, Elizabeth:
"Analysis of overlaps in meetings by dialog factors, hot spots, speakers, and collection site: insights for automatic speech recognition",
paper 1915-Mon2A2O.6.
Takeda, Ryu / Yamamoto, Shun'ichi / Komatani, Kazunori / Ogata, Tetsuya / Okuno, Hiroshi G.:
"Improving speech recognition of two simultaneous speech signals by integrating ICA BSS and automatic missing feature mask generation",
paper 1729-Thu1CaP.1.
Kim, Wooil / Hansen, John H. L.:
"Missing-feature reconstruction for band-limited speech recognition in spoken document retrieval",
paper 1826-Thu1CaP.2.
Koo, Hahn / Cheng, Yan Ming:
"Incremental learning of MAP context-dependent edit operations for spoken phone number recognition in an embedded platform",
paper 1032-Thu1CaP.3.
Obuchi, Yasunari / Hataoka, Nobuo:
"Development and evaluation of speech database in automotive environments for practical speech recognition systems",
paper 1168-Thu1CaP.4.
Yu, Dong / Ju, Yun-Cheng / Acero, Alex:
"An effective and efficient utterance verification technology using word n-gram filler models",
paper 1408-Thu1CaP.5.
Górriz, J. M. / Ramírez, Javier / Puntonet, C. G. / Segura, José C.:
"An efficient bispectrum phase entropy-based algorithm for VAD",
paper 1440-Thu1CaP.6.
Cerva, Petr / Nouza, Jan / Silovsky, Jan:
"Two-step unsupervised speaker adaptation based on speaker and gender recognition and HMM combination",
paper 1441-Thu1CaP.7.
Nakamura, Satoshi / Fujimoto, Masakiyo / Takeda, Kazuya:
"CENSREC2: corpus and evaluation environments for in car continuous digit speech recognition",
paper 1726-Thu1CaP.8.
Chu, Cheng-Tao / Sung, Yun-Hsuan / Zhao, Yuan / Jurafsky, Daniel:
"Detection of word fragments in Mandarin telephone conversation",
paper 1730-Thu1CaP.9.
Huo, Qiang / Li, Wei:
"A DTW-based dissimilarity measure for left-to-right hidden Markov models and its application to word confusability analysis",
paper 1745-Thu1CaP.10.
Gómez, Angel M. / Ramos-Muñoz, Juan J. / Peinado, Antonio M. / Sánchez, Victoria:
"Multi-flow block interleaving applied to distributed speech recognition over IP networks",
paper 1365-Thu1CaP.11.
Lin, Edward C. / Yu, Kai / Rutenbar, Rob A. / Chen, Tsuhan:
"Moving speech recognition from software to silicon: the in silico vox project",
paper 1942-Thu1CaP.12.
Ma, Chengyuan / Tsao, Yu / Lee, Chin-Hui:
"A study on detection based automatic speech recognition",
paper 2053-Thu1CaP.13.
Chitturi, Rahul / Hasegawa-Johnson, Mark:
"Novel time domain multi-class SVMs for landmark detection",
paper 1904-Thu1CaP.14.
Modeling Prosodic Features
Ananthakrishnan, Sankaranarayanan / Narayanan, Shrikanth:
"Combining acoustic, lexical, and syntactic evidence for automatic unsupervised prosody labeling",
paper 1335-Mon2A3O.1.
Rosenberg, Andrew / Hirschberg, Julia:
"On the correlation between energy and pitch accent in read English speech",
paper 1294-Mon2A3O.2.
Hirose, Keikichi / Asano, Yasufumi / Minematsu, Nobuaki:
"Corpus-based generation of fundamental frequency contours using generation process model and considering emotional focuses",
paper 1902-Mon2A3O.3.
Dubeda, Tomás:
"Prosodic boundaries in Czech: an experiment based on delexicalized speech",
paper 1056-Mon2A3O.4.
Yi, Lifu / Li, Jian / Lou, Xiaoyan / Hao, Jie:
"Totally data-driven intonation prediction model using a novel F0 contour parametric representation",
paper 1465-Mon2A3O.5.
Dilley, Laura / Breen, Mara / Bolivar, Marti / Kraemer, John / Gibson, Edward:
"A comparison of inter-transcriber reliability for two systems of prosodic annotation: RaP (rhythm and pitch) and ToBI (tones and break indices)",
paper 1619-Mon2A3O.6.
Spoken Information Retrieval
Alphonso, Issac / Chang, Shuangyu:
"Saliency parsing for automated directory assistance",
paper 1421-Mon2WeO.1.
Iwata, Kohei / Itoh, Yoshiaki / Kojima, Kazunori / Ishigame, Masaaki / Tanaka, Kazuyo / Lee, Shi-wook:
"Open-vocabulary spoken document retrieval based on new subword models and subword phonetic similarity",
paper 1342-Mon2WeO.2.
Li, Xiang / Jan, Ea-Ee / Wu, Cheng / Lubensky, David:
"Improved topic classification over maximum entropy model using k-norm based new objectives",
paper 2066-Mon2WeO.3.
Pan, Yi-cheng / Chen, Jia-yu / Lee, Yen-shin / Fu, Yi-sheng / Lee, Lin-shan:
"Efficient interactive retrieval of spoken documents with key terms ranked by reinforcement learning",
paper 1577-Mon2WeO.4.
Sudoh, Katsuhito / Tsukada, Hajime / Isozaki, Hideki:
"Discriminative named entity recognition of speech data using speech recognition confidence",
paper 1153-Mon2WeO.5.
Turunen, Ville T. / Kurimo, Mikko:
"Using latent semantic indexing for morph-based spoken document retrieval",
paper 1220-Mon2WeO.6.
Front-End Methods for ASR
Schlüter, Ralf / Zolnay, András / Ney, Hermann:
"Feature combination using linear discriminant analysis and its pitfalls",
paper 1077-Mon2BuP.1.
Valente, Fabio / Hermansky, Hynek:
"Discriminant linear processing of time-frequency plane",
paper 1300-Mon2BuP.2.
Uraga, Esmeralda / Hain, Thomas:
"Automatic speech recognition experiments with articulatory data",
paper 1725-Mon2BuP.3.
Stouten, Frederik / Martens, Jean-Pierre:
"Speech recognition with phonological features: some issues to attend",
paper 1081-Mon2BuP.4.
Wölfel, Matthias / Fügen, Christian / Ikbal, Shajith / McDonough, John W.:
"Multi-source far-distance microphone selection and combination for automatic transcription of lectures",
paper 1253-Mon2BuP.5.
Breithaupt, Colin / Martin, Rainer:
"Statistical analysis and performance of DFT domain noise reduction filters for robust speech recognition",
paper 1537-Mon2BuP.6.
García, L. / Segura, José C. / Benítez, Carmen / Ramírez, Javier / Torre, Ángel de la:
"Normalization of the inter-frame information using smoothing filtering",
paper 1687-Mon2BuP.7.
Ghulam, Muhammad / Horikawa, Junsei / Nitta, Tsuneo:
"Comparative study on contributions of pitch-synchronization and peak-amplitude towards robustness issue of ASR",
paper 1781-Mon2BuP.8.
Ariki, Yasuo / Kato, Shunsuke / Takiguchi, Tetsuya:
"Phoneme recognition based on Fisher weight map to higher-order local auto-correlation",
paper 1883-Mon2BuP.9.
Boril, Hynek / Fousek, Petr / Pollák, Petr:
"Data-driven design of front-end filter bank for Lombard speech recognition",
paper 1803-Mon2BuP.10.
Ljolje, Andrej:
"Optimization of class weights for LDA feature transformations",
paper 2031-Mon2BuP.11.
Pylkkönen, Janne:
"LDA based feature estimation methods for LVCSR",
paper 1213-Mon2BuP.12.
Farahani, G. / Ahadi, S.M. / Homayounpour, M. Mehdi:
"Robust feature extraction based on spectral peaks of group delay and autocorrelation function and phase domain analysis",
paper 1563-Mon2BuP.13.
Panchapagesan, Sankaran:
"Frequency warping by linear transformation of standard MFCC",
paper 1924-Mon2BuP.14.
Language and Dialect Recognition
Reyes-Herrera, Ana Lilia / Villaseñor-Pineda, Luis / Montes-y-Gómez, Manuel:
"Automatic language identification using wavelets",
paper 1998-Mon2CaP.1.
Bauer, Josef G. / Timoshenko, Ekaterina:
"Minimum classification error training of hidden Markov models for acoustic language identification",
paper 1981-Mon2CaP.2.
Timoshenko, Ekaterina / Bauer, Josef G.:
"Unsupervised adaptation for acoustic language identification",
paper 1494-Mon2CaP.3.
Basavaraja, S. V. / Sreenivas, T. V.:
"Low complexity LID using pruned pattern tables of LZW",
paper 1398-Mon2CaP.4.
Yang, Xi / Zhai, Lu-Feng / Siu, Manhung / Gish, Herbert:
"Improved language identification using support vector machines for language modeling",
paper 1450-Mon2CaP.5.
Navratil, Jiri:
"Recent advances in phonotactic language recognition using binary-decision trees",
paper 1338-Mon2CaP.6.
Lin, Chi-Yueh / Wang, Hsiao-Chuan:
"Fusion of phonotactic and prosodic knowledge for language identification",
paper 1166-Mon2CaP.7.
Li, Haizhou / Ma, Bin / Tong, Rong:
"Vector-based spoken language recognition using output coding",
paper 1155-Mon2CaP.8.
Guijarrubia, Victor G. / Torres, M. Ines:
"Basque-Spanish language identification using phone-based methods",
paper 1892-Mon2CaP.9.
Ikeno, Ayako / Hansen, John H. L.:
"The role of prosody in the perception of US native English accents",
paper 1437-Mon2CaP.10.
Vieru-Dimulescu, Bianca / Boula de Mareüil, Philippe:
"Perceptual identification and phonetic analysis of 6 foreign accents in French",
paper 1251-Mon2CaP.11.
Huang, Rongqing / Hansen, John H. L.:
"Unsupervised Spanish dialect classification",
paper 1242-Mon2CaP.12.
Spoken Dialog Systems I, II
Gieselmann, Petra / Waibel, Alex:
"Dynamic extension of a grammar-based dialogue system: constructing an all-recipes knowing robot",
paper 1091-Mon2FoP.1.
Gruenstein, Alexander / Seneff, Stephanie / Wang, Chao:
"Scalable and portable web-based multimodal dialogue interaction with geographical databases",
paper 1095-Mon2FoP.2.
Ackermann, Chantal / Libossek, Marion:
"System- versus user-initiative dialog strategy for driver information systems",
paper 1172-Mon2FoP.3.
Krsmanovic, Filip / Spencer, Curtis / Jurafsky, Daniel / Ng, Andrew Y.:
"Have we met? MDP based speaker ID for robot dialogue",
paper 1193-Mon2FoP.4.
Son, Rob J. J. H. van / Wesseling, Wieneke / Pols, Louis C. W.:
"Prominent words as anchors for TRP projection",
paper 1235-Mon2FoP.5.
Cuayáhuitl, Heriberto / Renals, Steve / Lemon, Oliver / Shimodaira, Hiroshi:
"Learning multi-goal dialogue strategies using reinforcement learning with reduced state-action spaces",
paper 1282-Mon2FoP.6.
Mayer, Jörg / Jasinskaja, Ekaterina / Kölsch, Ulrike:
"Pitch range and pause duration as markers of discourse hierarchy: perception experiments",
paper 1290-Mon2FoP.7.
Roque, Antonio / Leuski, Anton / Rangarajan, Vivek / Robinson, Susan / Vaswani, Ashish / Narayanan, Shrikanth / Traum, David:
"Radiobot-CFF: a spoken dialogue system for military training",
paper 1828-Mon2FoP.8.
Yamada, Shinya / Itoh, Toshihiko / Araki, Kenji:
"Is voice quality enough? - study on how the situation and user's awareness influence the utterance features",
paper 1955-Mon2FoP.9.
Juhár, Jozef / Ondas, Stanislav / Cizmár, Anton / Rusko, Milan / Rozinaj, Gregor / Jarina, Roman:
"Development of Slovak GALAXY/VoiceXML based spoken language dialogue system to retrieve information from the internet",
paper 2056-Mon2FoP.10.
Degerstedt, Lars / Jönsson, Arne:
"LINTest: a development tool for testing dialogue systems",
paper 1555-Mon2FoP.11.
Ito, Akinori / Shimada, Keisuke / Suzuki, Motoyuki / Makino, Shozo:
"A user simulator based on VoiceXML for evaluation of spoken dialog systems",
paper 1358-Tue2A3O.1.
Jokinen, Kristiina / Hurtig, Topi:
"User expectations and real experience on a multimodal interactive system",
paper 1815-Tue2A3O.2.
Burkhardt, F. / Ajmera, J. / Englert, Roman / Stegmann, J. / Burleson, W.:
"Detecting anger in automated voice portal dialogs",
paper 1977-Tue2A3O.3.
Turunen, Markku / Hakulinen, Jaakko / Kainulainen, Anssi:
"Evaluation of a spoken dialogue system with usability tests and long-term pilot studies: similarities and differences",
paper 1978-Tue2A3O.4.
Weng, Fuliang / Varges, Sebastian / Raghunathan, Badri / Ratiu, Florin / Pon-Barry, Heather / Lathrop, Brian / Zhang, Qi / Bratt, Harry / Scheideck, Tobias / Xu, Kui / Purver, Matthew / Mishra, Rohit / Lien, Annie / Raya, M. / Peters, S. / Meng, Y. / Russell, J. / Cavedon, Lawrence / Shriberg, Elizabeth / Schmidt, H. / Prieto, R.:
"CHAT: a conversational helper for automotive tasks",
paper 2020-Tue2A3O.5.
Georgila, Kallirroi / Henderson, James / Lemon, Oliver:
"User simulation for spoken dialogue systems: learning and evaluation",
paper 2035-Tue2A3O.6.
Speaker Characterization and Recognition I-IV
Chao, Yi-Hsiang / Tsai, Wei-Ho / Wang, Hsin-Min / Chang, Ruei-Chuan:
"Improving the characterization of the alternative hypothesis via kernel discriminant analysis for likelihood ratio-based speaker verification",
paper 1431-Mon3A1O.1.
Lei, Zhenchun / Yang, Yingchun / Wu, Zhaohui:
"A discriminative method for speaker verification using the difference information",
paper 1952-Mon3A1O.2.
Scheffer, Nicolas / Bonastre, Jean-François:
"A multiclass framework for speaker verification within an acoustic event sequence system",
paper 1574-Mon3A1O.3.
Ma, Bin / Zhu, Donglai / Tong, Rong / Li, Haizhou:
"Speaker cluster based GMM tokenization for speaker recognition",
paper 1429-Mon3A1O.4.
Garreton, Claudio / Yoma, Nestor Becerra / Molina, Carlos / Huenupan, Fernando:
"Intra-speaker variability compensation in speaker verification with limited enrolling data",
paper 1425-Mon3A1O.5.
Chetty, Girija / Wagner, Michael:
"Speaking faces for face-voice speaker identity verification",
paper 2025-Mon3A1O.6.
Prahallad, Kishore / Sudhakar, Varanasi / Ranganatham, Veluru / Bharat, Krishna M. / Debashish, S. Roy:
"Significance of formants from difference spectrum for speaker identification",
paper 1583-Tue1CaP.1.
Zamalloa, Maider / Bordel, Germán / Rodríguez, Luis Javier / Penagarikano, Mikel / Uribe, Juan Pedro:
"Using genetic algorithms to weight acoustic features for speaker recognition",
paper 1240-Tue1CaP.2.
Padilla, Michael T. / Quatieri, Thomas F. / Reynolds, Douglas A.:
"Missing feature theory with soft spectral subtraction for speaker verification",
paper 1918-Tue1CaP.3.
Mary, Leena / Yegnanarayana, B.:
"Prosodic features for speaker verification",
paper 1999-Tue1CaP.4.
Liu, Ming / Huang, Thomas S.:
"Unsupervised learning of HMM topology for text-dependent speaker verification",
paper 1302-Tue1CaP.5.
Anguita, Jan / Hernando, Javier:
"On the use of Jacobian adaptation in real speaker verification applications",
paper 1604-Tue1CaP.6.
Liu, Ming / Ning, Huazhong / Huang, Thomas S. / Zhang, Zhengyou:
"A novel framework of text-independent speaker verification based on utterance transform and iterative cohort modeling",
paper 2034-Tue1CaP.7.
Prakash, Vinod / Hansen, John H. L.:
"A cohort - UBM approach to mitigate data sparseness for in-set/out-of-set speaker recognition",
paper 1847-Tue1CaP.8.
Varadarajan, Vaishnevi S. / Hansen, John H. L.:
"Analysis of Lombard effect under different types and levels of noise with application to in-set speaker ID systems",
paper 1599-Tue1CaP.9.
McCree, Alan:
"Reducing speech coding distortion for speaker identification",
paper 1989-Tue1CaP.10.
Kato, Tsuneo / Kawai, Hisashi:
"A text-prompted distributed speaker verification system implemented on a cellular phone and a mobile terminal",
paper 1896-Tue1CaP.11.
Vishnubhotla, Srikanth / Espy-Wilson, Carol Y.:
"Automatic detection of irregular phonation in continuous speech",
paper 1893-Tue1CaP.12.
Ramasubramanian, V. / Vijaywargiay, Deepak / Praveen, Kumar V.:
"Highly noise robust text-dependent speaker recognition based on hypothesized Wiener filtering",
paper 1474-Wed1A1O.1.
Fujihara, Hiromasa / Kitahara, Tetsuro / Goto, Masataka / Komatani, Kazunori / Ogata, Tetsuya / Okuno, Hiroshi G.:
"Speaker identification under noisy environments by using harmonic structure extraction and reliable frame weighting",
paper 1525-Wed1A1O.2.
Stergiou, Andreas / Pnevmatikakis, Aristodemos / Polymenakos, Lazaros C.:
"Enhancing the performance of a GMM-based speaker identification system in a multi-microphone setup",
paper 1608-Wed1A1O.3.
Longworth, C. / Gales, M. J. F.:
"Discriminative adaptation for speaker verification",
paper 1553-Wed1A1O.4.
Hatch, Andrew O. / Kajarekar, Sachin / Stolcke, Andreas:
"Within-class covariance normalization for SVM-based speaker recognition",
paper 1874-Wed1A1O.5.
Espy-Wilson, Carol Y. / Manocha, Sandeep / Vishnubhotla, Srikanth:
"A new set of features for text-independent speaker identification",
paper 1880-Wed1A1O.6.
Ofoegbu, Uchechukwu O. / Iyer, Ananth N. / Yantorno, Robert E. / Wenndt, Stanley J.:
"Detection of a third speaker in telephone conversations",
paper 1133-Wed3CaP.1.
Biatov, Konstantin / Köhler, Joachim:
"Improvement speaker clustering using global similarity features",
paper 1451-Wed3CaP.2.
Narayanaswamy, Balakrishnan / Gangadharaiah, Rashmi / Stern, Richard M.:
"Voting for two speaker segmentation",
paper 1932-Wed3CaP.3.
Preti, Alexandre / Bonastre, Jean-François:
"Unsupervised model adaptation for speaker verification",
paper 1554-Wed3CaP.4.
Zheng, Rong / Zhang, Shuwu / Xu, Bo:
"A quality measure method using Gaussian mixture models and divergence measure for speaker identification",
paper 1328-Wed3CaP.5.
Zhang, Yushi / Abdulla, Waleed H.:
"Gammatone auditory filterbank and independent component analysis for speaker identification",
paper 1354-Wed3CaP.6.
Wu, Wei / Zheng, Thomas Fang / Xu, Ming-Xing / Bao, Huan-Jun:
"Study on speaker verification on emotional speech",
paper 1124-Wed3CaP.7.
Farrús, M. / Garde, A. / Ejarque, P. / Luque, J. / Hernando, Javier:
"On the fusion of prosody, voice spectrum and face features for multimodal person verification",
paper 1256-Wed3CaP.8.
Pruthi, Tarun / Espy-Wilson, Carol Y.:
"An MRI based study of the acoustic effects of sinus cavities and its application to speaker recognition",
paper 1411-Wed3CaP.9.
Kojima, Mariko / Matsui, Tomoko / Kawanami, Hiromichi / Saruwatari, Hiroshi / Shikano, Kiyohiro:
"Speaker verification with non-audible murmur segments",
paper 1773-Wed3CaP.10.
Müller, Christian:
"Automatic recognition of speakers' age and gender on the basis of empirical studies",
paper 1031-Wed3CaP.11.
Fox, E. J. S. / Roberts, J. D. / Bennamoun, M.:
"Text-independent speaker identification in birds",
paper 1068-Wed3CaP.12.
Potamitis, Ilyas / Ganchev, Todor / Fakotakis, Nikos:
"Automatic acoustic identification of insects inspired by the speaker recognition paradigm",
paper 1505-Wed3CaP.13.
System Combination
Siniscalchi, Sabato Marco / Li, Jinyu / Lee, Chin-Hui:
"A study on lattice rescoring with knowledge scores for automatic speech recognition",
paper 1319-Mon3A2O.1.
Stüker, Sebastian / Fügen, Christian / Burger, Susanne / Wölfel, Matthias:
"Cross-system adaptation and combination for continuous speech recognition: the influence of phoneme set and acoustic front-end",
paper 1509-Mon3A2O.2.
Breslin, C. / Gales, M. J. F.:
"Generating complementary systems for speech recognition",
paper 1541-Mon3A2O.3.
Zhang, Rong / Rudnicky, Alexander I.:
"Investigations of issues for using multiple acoustic models to improve continuous speech recognition",
paper 1707-Mon3A2O.4.
Chen, I-Fan / Lee, Lin-shan:
"A new framework for system combination based on integrated hypothesis space",
paper 1728-Mon3A2O.5.
Hoffmeister, Björn / Klein, Tobias / Schlüter, Ralf / Ney, Hermann:
"Frame based system combination and a comparison with weighted ROVER and CNC",
paper 1523-Mon3A2O.6.
Interpreting Prosodic Variation
Yuan, Jiahong / Liberman, Mark / Cieri, Christopher:
"Towards an integrated understanding of speaking rate in conversation",
paper 1795-Mon3A3O.1.
Vu, Minh Quang / Trân, Ðô Ðat / Castelli, Eric:
"Prosody of interrogative and affirmative sentences in Vietnamese language: analysis and perceptive results",
paper 1844-Mon3A3O.2.
Venditti, Jennifer J. / Hirschberg, Julia / Liscombe, Jackson:
"Intonational cues to student questions in tutoring dialogs",
paper 1407-Mon3A3O.3.
Krahmer, Emiel / Swerts, Marc:
"Testing the effect of audiovisual cues to prominence via a reaction-time experiment",
paper 1288-Mon3A3O.4.
Gravano, Agustín / Hirschberg, Julia:
"Effect of genre, speaker, and word class on the realization of given and new information",
paper 1747-Mon3A3O.5.
Vainio, Martti / Järvikivi, Juhani / Werner, Stefan:
"Word order and tonal shape in the production of focus in short Finnish utterances",
paper 1595-Mon3A3O.6.
Articulatory Modeling
Kröger, Bernd J. / Birkholz, Peter / Kannampuzha, Jim / Neuschaefer-Rube, Christiane:
"Modeling sensory-to-motor mappings using neural nets and a 3D articulatory speech synthesizer",
paper 1192-Mon3WeS.1.
Fontecave, Julie / Berthommier, Frédéric:
"Semi-automatic extraction of vocal tract movements from cineradiographic data",
paper 1439-Mon3WeS.2.
Jou, Szu-Chen / Schultz, Tanja / Walliczek, Matthias / Kraft, Florian / Waibel, Alex:
"Towards continuous speech recognition using surface electromyography",
paper 1592-Mon3WeS.3.
Richmond, Korin:
"A trajectory mixture density network for the acoustic-articulatory inversion mapping",
paper 1790-Mon3WeS.4.
Metze, Florian:
"Articulatory features for "meeting" speech recognition",
paper 1891-Mon3WeS.5.
Krnoul, Zdenek / Zelezný, Milos / Müller, Ludek / Kanis, Jakub:
"Training of coarticulation models using dominance functions and visual unit selection methods for audio-visual speech synthesis",
paper 1905-Mon3WeS.6.
Acoustic Modeling I - Training and Topologies
Zhang, Le / Renals, Steve:
"Phone recognition analysis for trajectory HMM",
paper 1203-Mon3BuP.1.
Keshet, Joseph / Shalev-Shwartz, Shai / Bengio, Samy / Singer, Yoram / Chazan, Dan:
"Discriminative kernel-based phoneme sequence recognition",
paper 1284-Mon3BuP.2.
Morris, Jeremy / Fosler-Lussier, Eric:
"Combining phonetic attributes using conditional random fields",
paper 1287-Mon3BuP.3.
Nagarajan, T. / O'Shaughnessy, Douglas:
"Discriminative MLE training using a product of Gaussian likelihoods",
paper 1292-Mon3BuP.4.
Li, Hao-Zheng / O'Shaughnessy, Douglas:
"State-level variable modeling for phoneme classification",
paper 1332-Mon3BuP.5.
Li, Xiaolong / Deng, Li / Yu, Dong / Acero, Alex:
"A time-synchronous phonetic decoder for a long-contextual-span hidden trajectory model",
paper 1409-Mon3BuP.6.
Casar, Marta / Fonollosa, Jose A. R.:
"Analysis of HMM temporal evolution for automatic speech recognition and utterance verification",
paper 1586-Mon3BuP.7.
Tang, Min / Ganapathiraju, Aravind:
"Improvements to bucket box intersection algorithm for fast GMM computation in embedded speech recognition systems",
paper 1678-Mon3BuP.8.
Markov, Konstantin / Nakamura, Satoshi:
"Forward-backwards training of hybrid HMM/BN acoustic models",
paper 1838-Mon3BuP.9.
Gehrig, Dirk / Schaaf, Thomas:
"A comparative study of Gaussian selection methods in large vocabulary continuous speech recognition",
paper 1954-Mon3BuP.10.
Suk, Soo-Young / Hahm, Seong-Jun / Jung, Ho-Youl / Chung, Hyun-Yeol:
"A successive state and mixture splitting for optimizing the size of models in speech recognition",
paper 2022-Mon3BuP.11.
Ion, Valentin / Haeb-Umbach, Reinhold:
"Improved source modeling and predictive classification for channel robust speech recognition",
paper 1083-Mon3BuP.12.
Acoustic Signal Segmentation and Classification
Kühne, Marco / Togneri, Roberto:
"Automatic English stop consonants classification using wavelet analysis and hidden Markov models",
paper 1174-Mon3CaP.1.
Wu, Tingyao / Compernolle, Dirk Van / Duchateau, Jacques / Hamme, Hugo Van:
"Single frame selection for phoneme classification",
paper 1247-Mon3CaP.2.
Dusan, Sorin / Rabiner, Lawrence:
"On the relation between maximum spectral transition positions and phone boundaries",
paper 1317-Mon3CaP.3.
Yingthawornsuk, T. / Keskinpala, H. Kaymaz / France, D. / Wilkes, D. M. / Shiavi, R. G. / Salomon, R. M.:
"Objective estimation of suicidal risk using vocal output characteristics",
paper 1321-Mon3CaP.4.
Didiot, E. / Illina, I. / Mella, O. / Fohr, D. / Haton, Jean-Paul:
"A wavelet-based parameterization for speech/music segmentation",
paper 1361-Mon3CaP.5.
Nagino, Goshu / Shozakai, Makoto:
"Distance measure between Gaussian distributions for discriminating speaking styles",
paper 1383-Mon3CaP.6.
Pernkopf, Franz / Pham, Tuan Van:
"Bayesian networks for phonetic classification using time-scale features",
paper 1532-Mon3CaP.7.
Beringer, Nicole:
"Fast and effective retraining on contrastive vocal characteristics with bidirectional long short-term memory nets",
paper 1602-Mon3CaP.8.
Ma, Ning / Green, Phil / Coy, André:
"Exploiting dendritic autocorrelogram structure to identify spectro-temporal regions dominated by a single sound source",
paper 1639-Mon3CaP.9.
Leelaphattarakij, Pairote / Punyabukkana, Proadpran / Suchato, Atiwong:
"Locating phone boundaries from acoustic discontinuities using a two-staged approach",
paper 1734-Mon3CaP.10.
Fu, Qiang / Juang, Biing-Hwang:
"Investigation on rescoring using minimum verification error (MVE) detectors",
paper 1761-Mon3CaP.11.
Fu, Qiang / Moreno-Daniel, Antonio / Juang, Biing-Hwang / Zhou, Jian-Lai / Soong, Frank K.:
"Generalization of the minimum classification error (MCE) training based on maximizing generalized posterior probability (GPP)",
paper 1780-Mon3CaP.12.
Carlin, Michael A. / Smolenski, Brett Y. / Wenndt, Stanley J.:
"Unsupervised detection of whispered speech in the presence of normal phonation",
paper 1990-Mon3CaP.13.
Anguera, Xavier / Wooters, Chuck / Hernando, Javier:
"Friends and enemies: a novel initialization for speaker diarization",
paper 1661-Mon3CaP.14.
Linguistics, Phonology, and Phonetics I, II
Surana, Kushan / Slifka, Janet:
"Acoustic cues for the classification of regular and irregular phonation",
paper 1755-Mon3FoP.1.
Nitisaroj, Rattima:
"Realizations and representations of Thai tones in monomoraic syllables",
paper 1381-Mon3FoP.2.
Jacobi, Irene / Pols, Louis C. W. / Stroop, Jan:
"Measuring and comparing vowel qualities in a Dutch spontaneous speech corpus",
paper 1215-Mon3FoP.3.
Li, Aijun / Fang, Qiang / Xiong, Ziyu:
"Phonetic research on accented Chinese in three dialectal regions: Shanghai, Wuhan and Xiamen",
paper 1143-Mon3FoP.4.
Zhang, Chi / Wu, Ji / Xiao, Xi / Wang, Zuoying:
"Pronunciation variation modeling for Mandarin with accent",
paper 1849-Mon3FoP.5.
Nielsen, Kuniko Y.:
"Specificity and generalizability of spontaneous phonetic imitation",
paper 1326-Mon3FoP.6.
Bael, Christophe Van / Halteren, Hans van:
"On the sufficiency of automatic phonetic transcriptions for pronunciation variation research",
paper 1265-Mon3FoP.7.
Kazemzadeh, Abe / Tepperman, Joseph / Silva, Jorge / You, Hong / Lee, Sungbok / Alwan, Abeer / Narayanan, Shrikanth:
"Automatic detection of voice onset time contrasts for use in pronunciation assessment",
paper 1884-Mon3FoP.8.
Hirano, Hiroko / Kawai, Goh / Hirose, Keikichi / Minematsu, Nobuaki:
"Unfilled pauses in Japanese sentences read aloud by non-native learners",
paper 1871-Mon3FoP.9.
Hamabe, Ryoji / Uchimoto, Kiyotaka / Kawahara, Tatsuya / Isahara, Hitoshi:
"Detection of quotations and inserted clauses and its application to dependency structure analysis in spontaneous Japanese",
paper 1151-Mon3FoP.10.
Tseng, Chun-Han / Chen, Chia-Ping:
"Chinese input method based on reduced Mandarin phonetic alphabet",
paper 1944-Mon3FoP.11.
Suzuki, Yoshimi / Fukumoto, Fumiyo:
"Thesaurus expansion using similar word pairs from patent documents",
paper 1920-Mon3FoP.12.
Schone, Patrick:
"Low-resource autodiacritization of abjads for speech keyword search",
paper 1412-Mon3FoP.13.
Hertz, Susan R.:
"A model of the regularities underlying speaker variation: evidence from hybrid synthesis",
paper 1286-Tue3A3O.1.
Speyer, Augustin:
"Pauses as a tool to ensure rhythmic wellformedness",
paper 1406-Tue3A3O.2.
Watanabe, Michiko / Den, Yasuharu / Hirose, Keikichi / Miwa, Shusaku / Minematsu, Nobuaki:
"Factors affecting speakers' choice of fillers in Japanese presentations",
paper 1498-Tue3A3O.3.
Davel, Marelie / Barnard, Etienne:
"Developing consistent pronunciation models for phonemic variants",
paper 1760-Tue3A3O.4.
Lee, Jinsik / Kim, Seungwon / Lee, Gary Geunbae:
"Grapheme-to-phoneme conversion using automatically extracted associative rules for Korean TTS system",
paper 1405-Tue3A3O.5.
Charoenpornsawat, Paisarn / Schultz, Tanja:
"Example-based grapheme-to-phoneme conversion for Thai",
paper 1782-Tue3A3O.6.
Speech Translation
Riesa, Jason / Mohit, Behrang / Knight, Kevin / Marcu, Daniel:
"Building an English-Iraqi Arabic machine translation system for spoken utterances with limited resources",
paper 2012-Tue1A1O.1.
Maskey, Sameer / Zhou, Bowen / Gao, Yuqing:
"A phrase-level machine translation approach for disfluency detection using weighted finite state transducers",
paper 1886-Tue1A1O.2.
Lee, Jonghoon / Lee, Donghyeon / Lee, Gary Geunbae:
"Improving phrase-based Korean-English statistical machine translation",
paper 1371-Tue1A1O.3.
Stallard, David / Choi, Fred / Krstovski, Kriste / Natarajan, Prem / Prasad, Rohit / Saleem, Shirin:
"A hybrid phrase-based/statistical speech translation system",
paper 1732-Tue1A1O.4.
Wang, Chao / Seneff, Stephanie:
"High-quality speech translation in the flight domain",
paper 1135-Tue1A1O.5.
Hsiao, Roger / Venugopal, Ashish / Köhler, Thilo / Zhang, Ying / Charoenpornsawat, Paisarn / Zollmann, Andreas / Vogel, Stephan / Black, Alan W. / Schultz, Tanja / Waibel, Alex:
"Optimizing components for handheld two-way speech translation for an English-Iraqi Arabic system",
paper 1712-Tue1A1O.6.
Acoustic Modeling II - Adaptation
Sehr, Armin / Zeller, Marcus / Kellermann, Walter:
"Distant-talking continuous speech recognition based on a novel reverberation model in the feature domain",
paper 1111-Tue1A2O.1.
Lei, Xin / Hamaker, Jon / He, Xiaodong:
"Robust feature space adaptation for telephony speech recognition",
paper 1743-Tue1A2O.2.
Thatphithakkul, Nattanun / Kruatrachue, Boontee / Wutiwiwatchai, Chai / Marukatat, Sanparith / Boonpiam, Vataya:
"A simulated-data adaptation technique for robust speech recognition",
paper 1157-Tue1A2O.3.
Hirsch, Hans-Günter / Finster, Harald:
"A new HMM adaptation approach for the case of a hands-free speech input in reverberant rooms",
paper 1175-Tue1A2O.4.
Tsao, Yu / Lee, Chin-Hui:
"A vector space approach to environment modeling for robust speech recognition",
paper 1617-Tue1A2O.5.
Chien, Jen-Tzung / Ting, Chuan-Wei:
"Subspace modeling and selection for noisy speech recognition",
paper 1333-Tue1A2O.6.
Emotional Speech and Speaker State
Schuller, Björn / Köhler, Niels / Müller, Ronald / Rigoll, Gerhard:
"Recognition of interest in human conversational speech",
paper 1621-Tue1A3O.1.
Ai, Hua / Litman, Diane J. / Forbes-Riley, Kate / Rotaru, Mihai / Tetreault, Joel / Purandare, Amruta:
"Using system and user performance features to improve emotion detection in spoken tutoring dialogs",
paper 1682-Tue1A3O.2.
Devillers, Laurence / Vidrascu, Laurence:
"Real-life emotions detection with lexical and paralinguistic cues on human-human call center dialogs",
paper 1636-Tue1A3O.3.
Wilting, Janneke / Krahmer, Emiel / Swerts, Marc:
"Real vs. acted emotional speech",
paper 1093-Tue1A3O.4.
Neiberg, Daniel / Elenius, Kjell / Laskowski, Kornel:
"Emotion recognition in spontaneous speech using GMMs",
paper 1581-Tue1A3O.5.
Enos, Frank / Benus, Stefan / Cautin, Robin L. / Graciarena, Martin / Hirschberg, Julia / Shriberg, Elizabeth:
"Personality factors in human deception detection: comparing human to machine performance",
paper 1664-Tue1A3O.6.
Speech and Language in Education
Cleuren, Leen / Duchateau, Jacques / Sips, Alain / Ghesquière, Pol / Hamme, Hugo Van:
"Developing an automatic assessment tool for children's oral reading",
paper 1113-Tue1WeS.1.
Waple, Christopher / Tsubota, Yasushi / Dantsuji, Masatake / Kawahara, Tatsuya:
"Prototyping a CALL system for students of Japanese using dynamic diagram generation and interactive hints",
paper 1171-Tue1WeS.2.
Massaro, Dominic W. / Liu, Ying / Chen, Trevor H. / Perfetti, Charles:
"A multilingual embodied conversational agent for tutoring speech and language learning",
paper 1313-Tue1WeS.3.
Heilman, Michael / Collins-Thompson, Kevyn / Callan, Jamie / Eskenazi, Maxine:
"Classroom success of an intelligent tutoring system for lexical practice and reading comprehension",
paper 1325-Tue1WeS.4.
Petersen, Sarah E. / Ostendorf, Mari:
"Assessing the reading level of web pages",
paper 1610-Tue1WeS.5.
Mostow, Jack:
"Is ASR accurate enough for automated reading tutors, and how can we tell?",
paper 1796-Tue1WeS.6.
Tsurutani, Chiharu / Yamauchi, Yutaka / Minematsu, Nobuaki / Luo, Dean / Maruyama, Kazutaka / Hirose, Keikichi:
"Development of a program for self assessment of Japanese pronunciation by English learners",
paper 1805-Tue1WeS.7.
Tepperman, Joseph / Silva, Jorge / Kazemzadeh, Abe / You, Hong / Lee, Sungbok / Alwan, Abeer / Narayanan, Shrikanth:
"Pronunciation verification of children's speech for automatic literacy assessment",
paper 1814-Tue1WeS.8.
Abdou, Sherif Mahdy / Hamid, Salah Eldeen / Rashwan, Mohsen / Samir, Abdurrahman / Abdel-Hamid, Ossama / Shahin, Mostafa / Nazih, Waleed:
"Computer aided pronunciation learning system using speech recognition techniques",
paper 1888-Tue1WeS.9.
Speech Perception I, II
Lobdell, Bryce / Allen, Jont B.:
"An information theoretic tool for investigating speech perception",
paper 1209-Tue1BuP.1.
Morrison, Geoffrey Stewart:
"An adaptive sampling procedure for speech perception experiments",
paper 1147-Tue1BuP.2.
Viswanathan, Navin / Magnuson, James S. / Fowler, Carol A.:
"Disentangling gestural and auditory contrast accounts of compensation for coarticulation",
paper 2045-Tue1BuP.3.
Yip, Michael C. W.:
"The role of positional probability in the segmentation of Cantonese speech",
paper 1034-Tue1BuP.4.
Haque, Shahina / Takara, Tomio:
"Nasality perception of vowels in different language background",
paper 1108-Tue1BuP.5.
Hodoshima, Nao / Behne, Dawn / Arai, Takayuki:
"Steady-state suppression in reverberation: a comparison of native and nonnative speech perception",
paper 1819-Tue1BuP.6.
Joto, Akiyo:
"Effect of dynamic information of formants on discrimination of English vowels in consonantal contexts by Japanese listeners",
paper 1926-Tue1BuP.7.
Wang, Yue / Behne, Dawn / Jiang, Haisheng / Danyluck, Chad:
"Native and nonnative audio-visual perception of English fricatives in quiet and cafe-noise backgrounds",
paper 1798-Tue1BuP.8.
Grawunder, Sven / Bose, Ines / Hertha, Birgit / Trauselt, Franziska / Anders, Lutz Christian:
"Perceptive and acoustic measurement of average speaking pitch of female and male speakers in German radio news",
paper 1966-Tue1BuP.9.
Assmann, Peter F. / Dembling, Sophia / Nearey, Terrance M.:
"Effects of frequency shifts on perceived naturalness and gender information in speech",
paper 1710-Tue1BuP.10.
Tohyama, Hitomi / Matsubara, Shigeki:
"Influence of pause length on listeners' impressions in simultaneous interpretation",
paper 1959-Tue1BuP.11.
Schwarz, Iris-Corinna / Burnham, Denis:
"New measures to chart toddlers' speech perception and language development: a test of the lexical restructuring hypothesis",
paper 1864-Tue1BuP.12.
Torre, Ángel de la / Roldán, Cristina / Sainz, Manuel:
"Perception of fundamental frequency in cochlear implant patients",
paper 1477-Tue1BuP.13.
Creel, Sarah C. / Dahan, Delphine / Swingley, Daniel:
"Effects of featural similarity and overlap position on lexical confusions and overt similarity judgments",
paper 1298-Wed1A3O.1.
Mixdorff, Hansjörg / Hu, Yu:
"Word structure and tone perception in Mandarin",
paper 1609-Wed1A3O.2.
Woehrling, Cecile / Boula de Mareüil, Philippe:
"Identification of regional accents in French: perception and categorization",
paper 1261-Wed1A3O.3.
Phatak, Sandeep / Allen, Jont B.:
"Consonant and vowel confusions in speech-weighted noise",
paper 1061-Wed1A3O.4.
Broersma, Mirjam:
"Accident - execute: increased activation in nonnative listening",
paper 1511-Wed1A3O.5.
Scholz, Kirstin / Waltermann, Marcel / Huo, Lu / Raake, Alexander / Möller, Sebastian / Heute, Ulrich:
"Estimation of the quality dimension "directness/frequency content" for the instrumental assessment of speech quality",
paper 1219-Wed1A3O.6.
Speech Production, Physiology, and Pathology I, II
Pluymaekers, Mark / Ernestus, Mirjam / Baayen, R. Harald:
"Effects of word frequency on the acoustic durations of affixes",
paper 1241-Tue1FoP.1.
Niu, Xiaochuan / Kain, Alexander B. / Santen, Jan P. H. van:
"A noninvasive, low-cost device to study the velopharyngeal port during speech and some preliminary results",
paper 1829-Tue1FoP.2.
Aboutabit, Noureddine / Beautemps, Denis / Besacier, Laurent:
"Characterization of cued speech vowels from the inner lip contour",
paper 1515-Tue1FoP.3.
Gobl, Christer:
"Modelling aspiration noise during phonation using the LF voice source model",
paper 1718-Tue1FoP.4.
Wei, Jianguo / Lu, Xugang / Dang, Jianwu:
"A simulation based parameter optimization for a coarticulation model",
paper 1772-Tue1FoP.5.
Kacha, A. / Grenez, Francis / Schoentgen, Jean:
"Multivariate analysis of frame-based acoustic cues of dysperiodicities in connected speech",
paper 1388-Tue1FoP.6.
Kovacs, Tom / Finan, Donald S.:
"Effects of midline tongue piercing on spectral centroid frequencies of sibilants",
paper 1310-Tue1FoP.7.
Vijayalakshmi, P. / Reddy, M. R. / O’Shaughnessy, Douglas:
"Assessment of articulatory sub-systems of dysarthric speech using an isolated-style phoneme recognition system",
paper 1281-Tue1FoP.8.
Finan, Donald S. / Boliek, Carol A.:
"Respiratory/laryngeal interactions during sustained vowel production in children",
paper 1833-Tue1FoP.9.
Bunnell, H. Timothy / Polikoff, James B.:
"Acoustic characterization of children with speech delay",
paper 2057-Tue1FoP.10.
Saz, Oscar / Miguel, Antonio / Lleida, Eduardo / Ortega, Alfonso / Buera, Luis:
"Study of time and frequency variability in pathological speech and error reduction methods for automatic speech recognition",
paper 1266-Tue1FoP.11.
Iseli, Markus / Shue, Yen-Liang / Epstein, Melissa A. / Keating, Patricia / Kreiman, Jody / Alwan, Abeer:
"Voice source correlates of prosodic features in American English: a pilot study",
paper 1933-Thu1A3O.1.
Bosch, Louis ten / Baayen, R. Harald / Ernestus, Mirjam:
"On speech variation and word type differentiation by articulatory feature representations",
paper 1923-Thu1A3O.2.
Lee, Sungbok / Bresch, Erik / Adams, Jason / Kazemzadeh, Abe / Narayanan, Shrikanth:
"A study of emotional speech articulation using a fast magnetic resonance imaging technique",
paper 1792-Thu1A3O.3.
Kjellström, Hedvig / Engwall, Olov / Bälter, Olle:
"Reconstructing tongue movements from audio and video",
paper 1071-Thu1A3O.4.
Feng, Gang / Kotenkoff, Cyril:
"New considerations for vowel nasalization based on separate mouth-nose recording",
paper 1096-Thu1A3O.5.
Garnier, Maeva / Bailly, Lucie / Dohen, Marion / Welby, Pauline / Loevenbruck, Helene:
"An acoustic and articulatory study of Lombard speech: global effects on the utterance",
paper 1862-Thu1A3O.6.
Formant Estimation
Cnockaert, Laurence / Schoentgen, Jean / Auzou, Pascal / Ozsancak, Canan / Grenez, Francis:
"Tracking of involuntary formant frequency variations and application to parkinsonian speech",
paper 1043-Tue2A1O.1.
Weruaga, Luis / Al-Khayat, Amar:
"All-pole model estimation of vocal tract on the frequency domain",
paper 1188-Tue2A1O.2.
Darch, Jonathan / Milner, Ben:
"HMM-based MAP prediction of voiced and unvoiced formant frequencies from noisy MFCC vectors",
paper 1540-Tue2A1O.3.
Anand, Joseph M. / Guruprasad, S. / Yegnanarayana, B.:
"Extracting formants from short segments of speech using group delay functions",
paper 1848-Tue2A1O.4.
Özbek, I. Yücel / Demirekler, Mübeccel:
"Tracking of visible vocal tract resonances (VVTR) based on Kalman filtering",
paper 2029-Tue2A1O.5.
Chaari, Salma / Ouni, Kais / Ellouze, Noureddine:
"Wavelet ridge track interpretation in terms of formants",
paper 2030-Tue2A1O.6.
Language Processing Beyond and Below the Word-Level
Kurimo, Mikko / Creutz, Mathias / Varjokallio, Matti / Arsoy, Ebru / Saraclar, Murat:
"Unsupervised segmentation of words into morphemes - Morpho Challenge 2005 application to automatic speech recognition",
paper 1512-Tue2A2O.1.
Arsoy, Ebru / Saraclar, Murat:
"Lattice extension and rescoring based approaches for LVCSR of Turkish",
paper 1622-Tue2A2O.2.
Kobus, Catherine / Damnati, Geraldine / Delphin-Poulat, Lionel / Mori, Renato De:
"Exploiting semantic relations for a spoken language understanding application",
paper 1269-Tue2A2O.3.
Akita, Yuya / Saikou, Masahiro / Nanjo, Hiroaki / Kawahara, Tatsuya:
"Sentence boundary detection of spontaneous Japanese using statistical language model and support vector machines",
paper 1370-Tue2A2O.4.
Virpioja, Sami / Kurimo, Mikko:
"Compact n-gram models by incremental growing and clustering of histories",
paper 1231-Tue2A2O.5.
Camelin, Nathalie / Damnati, Geraldine / Bechet, Frederic / Mori, Renato De:
"Opinion mining in a telephone survey corpus",
paper 1417-Tue2A2O.6.
Robustness and Adaptation for ASR
Peinado, Antonio M. / Gómez, Angel M. / Sánchez, Victoria / Pérez-Córdoba, José L. / Rubio, Antonio J.:
"An integrated solution for error concealment in DSR systems over wireless channels",
paper 1273-Tue2BuP.1.
Gómez, Angel M. / Peinado, Antonio M. / Sánchez, Victoria / Carmona, José L. / Rubio, Antonio J.:
"Interleaving and MMSE estimation with VQ replicas for distributed speech recognition over lossy packet networks",
paper 1279-Tue2BuP.2.
Chen, Gang / Tolba, Hesham / O’Shaughnessy, Douglas:
"Noise-robust speech recognition of conversational telephone speech",
paper 1304-Tue2BuP.3.
Kuroiwa, Shingo / Tsuge, Satoru / Ren, Fuji:
"Lost speech reconstruction method using speech recognition based on missing feature theory and HMM-based speech synthesis",
paper 1347-Tue2BuP.4.
Selouani, Sid-Ahmed / O’Shaughnessy, Douglas:
"Speaker adaptation using evolutionary-based linear transform",
paper 1368-Tue2BuP.5.
Wang, Jingying / Wang, Zuoying:
"A speaker adaptation algorithm using principal curves in noisy environments",
paper 1374-Tue2BuP.6.
Clarke, Constance / Jurafsky, Daniel:
"Limitations of MLLR adaptation with Spanish-accented English: an error analysis",
paper 1611-Tue2BuP.7.
Liao, H. / Gales, M. J. F.:
"Issues with uncertainty decoding for noise robust speech recognition",
paper 1627-Tue2BuP.8.
Xu, Haitian / Rigazio, Luca / Kryze, David:
"Vector Taylor series based joint uncertainty decoding",
paper 1688-Tue2BuP.9.
Huo, Qiang / Zhu, Donglai:
"A maximum likelihood training approach to irrelevant variability compensation based on piecewise linear transformations",
paper 1740-Tue2BuP.10.
Mandal, Arindam / Ostendorf, Mari / Stolcke, Andreas:
"Speaker clustered regression-class trees for MLLR adaptation",
paper 1763-Tue2BuP.11.
Tan, Zheng-Hua / Dalsgaard, Paul / Lindberg, Børge:
"Robust speech recognition over mobile networks using combined weighted Viterbi decoding and subvector based error concealment",
paper 1825-Tue2BuP.12.
Zen, Heiga / Nankaku, Yoshihiko / Tokuda, Keiichi / Kitamura, Tadashi:
"Speaker adaptation of trajectory HMMs using feature-space MLLR",
paper 1958-Tue2BuP.13.
Povey, Daniel / Saon, George:
"Feature and model space speaker adaptation with full covariance Gaussians",
paper 2050-Tue2BuP.14.
Multimodal, Translation and Information Retrieval
Gispert, Adrià de / Mariño, José B.:
"Linguistic tuple segmentation in n-gram-based statistical machine translation",
paper 1049-Tue2CaP.1.
Oba, Takanobu / Hori, Takaaki / Nakamura, Atsushi:
"Sentence boundary detection using sequential dependency analysis combined with CRF-based chunking",
paper 1657-Tue2CaP.2.
Bangalore, Srinivas / Haffner, Patrick / Kanthak, Stephan:
"Sequence classification for machine translation",
paper 1722-Tue2CaP.3.
Itoh, Yoshiaki / Otake, Takayuki / Iwata, Kohei / Kojima, Kazunori / Ishigame, Masaaki / Tanaka, Kazuyo / Lee, Shi-wook:
"Two-stage vocabulary-free spoken document retrieval - subword identification and re-recognition of the identified sections",
paper 1865-Tue2CaP.4.
Surdeanu, Mihai / Dominguez-Sal, David / Comas, Pere R.:
"Design and performance analysis of a factoid question answering system for spontaneous speech transcriptions",
paper 1046-Tue2CaP.5.
Takezawa, Toshiyuki / Shimizu, Tohru:
"Performance improvement of dialog speech translation by rejecting unreliable utterances",
paper 1100-Tue2CaP.6.
Ettelaie, Emil / Georgiou, Panayiotis G. / Narayanan, Shrikanth:
"Cross-lingual dialog model for speech to speech translation",
paper 1858-Tue2CaP.7.
Akbacak, Murat / Hansen, John H. L.:
"A robust fusion method for multilingual spoken document retrieval systems employing tiered resources",
paper 1835-Tue2CaP.8.
Zhu, Weizhong / Zhou, Bowen / Prosser, Charles / Krbec, Pavel / Gao, Yuqing:
"Recent advances of IBM’s handheld speech translation system",
paper 1590-Tue2CaP.9.
Stenchikova, Svetlana / Hakkani-Tür, Dilek / Tur, Gokhan:
"QASR: question answering using semantic roles for speech interface",
paper 2054-Tue2CaP.10.
Maas, Jan F. / Wrede, Britta / Sagerer, Gerhard:
"Towards a multimodal topic tracking system for a mobile robot",
paper 1121-Tue2CaP.11.
Kaiser, Edward C. / Barthelmess, Paulo:
"Edge-splitting in a cumulative multimodal system, for a no-wait temporal threshold on information fusion, combined with an under-specified display",
paper 2016-Tue2CaP.12.
Hui, Pui-Yu / Meng, Helen M.:
"Joint interpretation of input speech and pen gestures for multimodal human-computer interaction",
paper 1834-Tue2CaP.13.
Advances in Acoustic Segmentation
Cournapeau, David / Kawahara, Tatsuya / Mase, Kenji / Toriyama, Tomoji:
"Voice activity detector based on enhanced cumulant of LPC residual and on-line EM algorithm",
paper 1375-Tue3A1O.1.
Huggins-Daines, David / Rudnicky, Alexander I.:
"A constrained Baum-Welch algorithm for improved phoneme segmentation and efficient training",
paper 1580-Tue3A1O.2.
Valente, Fabio:
"Infinite models for speaker clustering",
paper 1329-Tue3A1O.3.
Dines, John / Vepa, Jithendra / Hain, Thomas:
"The segmentation of multi-channel meeting recordings for automatic speech recognition",
paper 1548-Tue3A1O.4.
Kuo, Jen-Wei / Wang, Hsin-Min:
"Minimum boundary error training for automatic phonetic segmentation",
paper 1497-Tue3A1O.5.
Schuler, William / Miller, Tim / Wu, Stephen / Exley, Andrew:
"Dynamic evidence models in a DBN phone recognizer",
paper 1770-Tue3A1O.6.
Acoustic Modeling III - LVCSR
Ramabhadran, B. / Siohan, Olivier / Mangu, L. / Zweig, G. / Westphal, M. / Schulz, H. / Soneiro, A.:
"The IBM 2006 speech transcription system for European parliamentary speeches",
paper 2027-Tue3A2O.1.
Fügen, Christian / Wölfel, Matthias / McDonough, John W. / Ikbal, Shajith / Kraft, Florian / Laskowski, Kornel / Ostendorf, Mari / Stüker, Sebastian / Kumatani, Kenichi:
"Advances in lecture recognition: the ISL RT-06s evaluation system",
paper 1415-Tue3A2O.2.
Hwang, Mei-Yuh / Lei, Xin / Wang, Wen / Shinozaki, Takahiro:
"Investigation on Mandarin broadcast news speech recognition",
paper 1916-Tue3A2O.3.
Lei, Xin / Siu, Manhung / Hwang, Mei-Yuh / Ostendorf, Mari / Lee, Tan:
"Improved tone modeling for Mandarin broadcast news speech recognition",
paper 1752-Tue3A2O.4.
Huang, Jui-Ting / Lee, Lin-shan:
"Prosodic modeling in large vocabulary Mandarin speech recognition",
paper 1546-Tue3A2O.5.
Sun, Ying / Willett, Daniel / Brueckner, Raymond / Gruhn, Rainer / Bühler, Dirk:
"Experiments on Chinese speech recognition with tonal models and pitch estimation using the Mandarin SpeeCon data",
paper 1452-Tue3A2O.6.
Speech and Visual Processing
Beskow, Jonas / Granström, Björn / House, David:
"Visual correlates to prominence in several expressive modes",
paper 1922-Tue3WeO.1.
Barkhuysen, Pashiera / Krahmer, Emiel / Swerts, Marc:
"How auditory and visual prosody is used in end-of-utterance detection",
paper 1238-Tue3WeO.2.
Swerts, Marc / Krahmer, Emiel:
"The importance of different facial areas for signalling visual prominence",
paper 1289-Tue3WeO.3.
Chaloupka, Josef:
"Visual speech segmentation and speaker recognition for transcription of TV news",
paper 1485-Tue3WeO.4.
Cortés, G. / García, L. / Benítez, Carmen / Segura, José C.:
"HMM-based continuous sign language recognition using a fast optical flow parameterization of visual information",
paper 1543-Tue3WeO.5.
Shao, Xu / Barker, Jon:
"Audio-visual speech recognition in the presence of a competing speaker",
paper 1589-Tue3WeO.6.
Text-to-Speech I, II
Strom, Volker / Clark, Robert A. J. / King, Simon:
"Expressive prosody for unit-selection speech synthesis",
paper 1522-Tue3BuP.1.
Carlson, Rolf / Gustafson, Kjell / Strangert, Eva:
"Cues for hesitation in speech synthesis",
paper 1516-Tue3BuP.2.
Alías, Francesc / Socoró, Joan Claudi / Sevillano, Xavier / Iriondo, Ignasi / Gonzalvo, Xavier:
"Multi-domain text-to-speech synthesis by automatic text classification",
paper 1579-Tue3BuP.3.
Yi, Lifu / Li, Jian / Lou, Xiaoyan / Hao, Jie:
"Phrase break prediction using logistic generalized linear model",
paper 1468-Tue3BuP.4.
Clark, Robert A. J. / King, Simon:
"Joint prosodic and segmental unit selection speech synthesis",
paper 1262-Tue3BuP.5.
Kim, Yeon-Jun / Syrdal, Ann K. / Conkie, Alistair / Beutnagel, Mark C.:
"Phonetically enriched labeling in unit selection TTS synthesis",
paper 2055-Tue3BuP.6.
Bellegarda, Jerome R.:
"Further developments in LSM-based boundary training for unit selection TTS",
paper 1142-Tue3BuP.7.
Nose, Takashi / Yamagishi, Junichi / Kobayashi, Takao:
"A style control technique for speech synthesis using multiple regression HSMM",
paper 1184-Tue3BuP.8.
Ogata, Katsumi / Tachibana, Makoto / Yamagishi, Junichi / Kobayashi, Takao:
"Acoustic model training based on linear transformation and MAP modification for HSMM-based speech synthesis",
paper 1787-Tue3BuP.9.
Abdel-Hamid, Ossama / Abdou, Sherif Mahdy / Rashwan, Mohsen:
"Improving Arabic HMM based speech synthesis quality",
paper 1693-Tue3BuP.10.
Homayounpour, M. Mehdi / Namnabat, Majid:
"Farsbayan: a unit selection based Farsi speech synthesizer",
paper 1997-Tue3BuP.11.
Anberbir, Tadesse / Takara, Tomio:
"Amharic speech synthesis using cepstral method with stress generation rule",
paper 1107-Tue3BuP.12.
Thangthai, Ausdang / Hansakunbuntheung, Chatchawarn / Siricharoenchai, Rungkarn / Wutiwiwatchai, Chai:
"Automatic syllable-pattern induction in statistical Thai text-to-phone transcription",
paper 1964-Tue3BuP.13.
Oosthuizen, H. J. / Phihlela, S. T. / Manamela, M. J. D.:
"Development of prototype text-to-speech systems for Northern Sotho",
paper 1070-Tue3BuP.14.
You, Jiali / Chen, Yining / Chu, Min / Zhao, Yong / Wang, Jinlin:
"Identify language origin of personal names with normalized appearance number of web pages",
paper 1353-Tue3BuP.15.
Weiss, Christian / Hess, Wolfgang:
"Conditional random fields for hierarchical segment selection in text-to-speech synthesis",
paper 1090-Wed3BuP.1.
Krul, Aleksandra / Damnati, Géraldine / Yvon, François / Moudenc, Thierry:
"Corpus design based on the Kullback-Leibler divergence for text-to-speech synthesis application",
paper 1647-Wed3BuP.2.
Ling, Zhen-Hua / Wang, Ren-Hua:
"HMM-based unit selection using frame sized speech segments",
paper 1104-Wed3BuP.3.
Taylor, Paul:
"The target cost formulation in unit selection speech synthesis",
paper 1455-Wed3BuP.4.
Tihelka, Daniel / Matousek, Jindrich:
"Unit selection and its relation to symbolic prosody: a new approach",
paper 1618-Wed3BuP.5.
Wu, Yi-Jian / Guo, Wu / Wang, Ren-Hua:
"Minimum generation error criterion for tree-based clustering of context dependent HMMs",
paper 1373-Wed3BuP.6.
Kang, Heng / Liu, Wenju:
"Selective-LPC based representation of STRAIGHT spectrum and its applications in spectral smoothing",
paper 1109-Wed3BuP.7.
Jilka, Matthias / Möbius, Bernd:
"Towards a comprehensive investigation of factors relevant to peak alignment using a unit selection corpus",
paper 1565-Wed3BuP.8.
Utama, Robert J. / Syrdal, Ann K. / Conkie, Alistair:
"Six approaches to limited domain concatenative speech synthesis",
paper 1047-Wed3BuP.9.
Fischer, V. / Kunzmann, S.:
"From pre-recorded prompts to corporate voices: on the migration of interactive voice response applications",
paper 1042-Wed3BuP.10.
Park, Seung Seop / Shin, Jong Won / Kim, Nam Soo:
"Automatic speech segmentation with multiple statistical models",
paper 1199-Wed3BuP.11.
Pärssinen, Kimmo / Moberg, Marko:
"Evaluation of perceptual quality of control point reduction in rule-based synthesis",
paper 1178-Wed3BuP.12.
Coorman, Geert:
"Segment connection networks for corpus-based speech synthesis",
paper 1962-Wed3BuP.13.
Special Populations - Learners, Aged, Challenged
Tsuji, Ryo / Kasami, Tomohiko / Ishikawa, Shogo / Kiriyama, Shinya / Takebayashi, Yoichi / Kitazawa, Shigeyoshi:
"Observations of the spoken language acquisition process based on a multimodal infant behavior corpus",
paper 1953-Tue3CaP.1.
Marklund, Ellen / Lacerda, Francisco:
"Infants' ability to extract verbs from continuous speech",
paper 1986-Tue3CaP.2.
Bion, Ricardo A.H. / Escudero, Paola / Rauber, Andréia S. / Baptista, Barbara O.:
"Category formation and the role of spectral quality in the perception and production of English front vowels",
paper 1270-Tue3CaP.3.
Bijeljac-Babic, Ranka / Dodane, Christelle / Metta, Sabine / Gérard, Claire:
"Productions in bilinguism, early foreign language learning and monolinguism: a prosodic comparison",
paper 1582-Tue3CaP.4.
Hirata, Yukari / Whitehurst, Elizabeth / Cullings, Emily / Whiton, Jacob / Glenn, Carol:
"Training native English speakers to identify Japanese vowel length with fast rate sentences",
paper 1395-Tue3CaP.5.
Chen, Jiang-Chun / Hsu, Wei-Tang / Jang, J.-S. Roger / Lyu, Ren-Yuan / Chiang, Yuang-Chin:
"Formant-based English vowel assessment for Chinese in Taiwan",
paper 1968-Tue3CaP.6.
Metzner, Jörg / Schmittfull, Marcel / Schnell, Karl:
"Substitute sounds for ventriloquism and speech disorders",
paper 1426-Tue3CaP.7.
Wei, Si / Liu, Qing-Sheng / Hu, Yu / Wang, Ren-Hua:
"Automatic Mandarin pronunciation scoring for native learners with dialect accent",
paper 1669-Tue3CaP.8.
Fujita, Kengo / Kato, Tsuneo / Kawai, Hisashi:
"Quick individual fitting methods of simplified hearing compensation for elderly people",
paper 1879-Tue3CaP.9.
Li, Xiao / Malkin, Jonathan / Harada, Susumu / Bilmes, Jeff A. / Wright, Richard / Landay, James:
"An online adaptive filtering algorithm for the vocal joystick",
paper 1872-Tue3CaP.10.
Nakamura, Keigo / Toda, Tomoki / Saruwatari, Hiroshi / Shikano, Kiyohiro:
"Speaking aid system for total laryngectomees using voice conversion of body transmitted artificial speech",
paper 1839-Tue3CaP.11.
San-Segundo, R. / Barra, R. / D’Haro, L. F. / Montero, J. M. / Córdoba, R. / Ferreiros, J.:
"A Spanish speech to sign language translation system for assisting deaf-mute people",
paper 1243-Tue3CaP.12.
Klintfors, Eeva / Lacerda, Francisco:
"Potential relevance of audio-visual integration in mammals for computational modeling",
paper 1992-Tue3CaP.13.
Rytting, C. Anton:
"Finding the gaps: applying a connectionist model of word segmentation to noisy phone-recognized speech data",
paper 2062-Tue3CaP.14.
Robust ASR
Wang, Shizhen / Cui, Xiaodong / Alwan, Abeer:
"Rapid speaker adaptation using regression-tree based spectral peak alignment",
paper 1334-Wed1A2O.1.
Kim, Chanwoo / Chiu, Yu-Hsiang / Stern, Richard M.:
"Physiologically-motivated synchrony-based processing for robust automatic speech recognition",
paper 1975-Wed1A2O.2.
Walliczek, Matthias / Kraft, Florian / Jou, Szu-Chen / Schultz, Tanja / Waibel, Alex:
"Sub-word unit based non-audible speech recognition using surface electromyography",
paper 1596-Wed1A2O.3.
Vicente-Peña, Jesús / Díaz-de-María, Fernando / Kleijn, Bastiaan:
"Individual on-line variance adaptation of frequency filtered parameters for robust ASR",
paper 1221-Wed1A2O.4.
Zhang, Bing / Matsoukas, Spyros / Schwartz, Richard:
"Recent progress on the discriminative region-dependent transform for speech feature extraction",
paper 1573-Wed1A2O.5.
Rademacher, Jan / Wächter, Matthias / Mertins, Alfred:
"Improved warping-invariant features for automatic speech recognition",
paper 1216-Wed1A2O.6.
Speech Summarization
Nenkova, Ani:
"Summarization evaluation for text and speech: issues and approaches",
paper 2079-Wed1WeS.1.
Zhu, Xiaodan / Penn, Gerald:
"Summarization of spontaneous conversations",
paper 1899-Wed1WeS.2.
Chatain, Pierre / Whittaker, Edward / Mrozinski, Joanna / Furui, Sadaoki:
"Perplexity based linguistic model adaptation for speech summarisation",
paper 1677-Wed1WeS.3.
Lee, Lin-shan / Kong, Sheng-yi / Pan, Yi-cheng / Fu, Yi-sheng / Huang, Yu-tsun:
"Multi-layered summarization of spoken document archives by information extraction and semantic structuring",
paper 1568-Wed1WeS.4.
Maskey, Sameer / Hirschberg, Julia:
"Soundbite detection in broadcast news domain",
paper 1690-Wed1WeS.5.
Murray, Gabriel / Renals, Steve:
"Dialogue act compression via pitch contour preservation",
paper 1585-Wed1WeS.6.
Acoustic Modeling IV
Kubo, Toshiaki / Ogawa, Tetsuji / Kobayashi, Tetsunori:
"Manifold HLDA and its application to robust speech recognition",
paper 1949-Wed1BuP.1.
Buera, Luis / Lleida, Eduardo / Nolazco-Flores, Juan A. / Miguel, Antonio / Ortega, Alfonso:
"Time-dependent cross-probability model for multi-environment model based LInear normalization",
paper 1271-Wed1BuP.2.
Povey, Daniel:
"SPAM and full covariance for speech recognition",
paper 2047-Wed1BuP.3.
Sakti, Sakriani / Markov, Konstantin / Nakamura, Satoshi:
"The use of Bayesian network for incorporating accent, gender and wide-context dependency information",
paper 1812-Wed1BuP.4.
Wang, Yu / Fosler-Lussier, Eric:
"Integrating phonetic boundary discrimination explicitly into HMM systems",
paper 1820-Wed1BuP.5.
Xie, Zhimin / Niyogi, Partha:
"Robust acoustic-based syllable detection",
paper 1327-Wed1BuP.6.
He, Lei / Hao, Jie:
"A tone recognition framework for continuous Mandarin speech",
paper 1348-Wed1BuP.7.
Hämäläinen, Annika / Bosch, Louis ten / Boves, Lou:
"Pronunciation variant-based multi-path HMMs for syllables",
paper 1630-Wed1BuP.8.
Park, Junho / Ko, Hanseok:
"A new state-dependent phonetic tied-mixture model with head-body-tail structured HMM for real-time continuous phoneme recognition system",
paper 1982-Wed1BuP.9.
Zgank, Andrej / Kacic, Zdravko:
"Conversion from phoneme based to grapheme based acoustic models for speech recognition",
paper 1500-Wed1BuP.10.
Kim, Bong-Wan / Choi, Dae-Lim / Um, Yongnam / Lee, Yong-Ju:
"Phone vector DHMM to decode a phone recognizer's output",
paper 1903-Wed1BuP.11.
Nagarajan, T. / Vijayalakshmi, P. / O'Shaughnessy, Douglas:
"Combining multiple-sized sub-word units in a speech recognition system using baseform selection",
paper 1280-Wed1BuP.12.
Miguel, Antonio / Lleida, Eduardo / Juan, Alfons / Buera, Luis / Ortega, Alfonso / Saz, Oscar:
"Local transformation models for speech recognition",
paper 1275-Wed1BuP.13.
Large Vocabulary Speech Recognition
Imai, Toru / Sato, Shoei / Kobayashi, Akio / Onoe, Kazuo / Homma, Shinichi:
"Online speech detection and dual-gender speech recognition for captioning broadcast news",
paper 1103-Wed1CaP.1.
Hazen, Timothy J.:
"Automatic alignment and error correction of human generated transcripts for long speech recordings",
paper 1258-Wed1CaP.2.
Chang, Shuangyu:
"Improving speech recognition accuracy with multi-confidence thresholding",
paper 1346-Wed1CaP.3.
Servan, Christophe / Raymond, Christian / Béchet, Frédéric / Nocéra, Pascal:
"Conceptual decoding from word lattices: application to the spoken dialogue corpus MEDIA",
paper 1416-Wed1CaP.4.
Huang, Shilei / Xie, Xiang / Kuang, Jingming:
"Improving the performance of out-of-vocabulary word rejection by using support vector machines",
paper 1535-Wed1CaP.5.
Demuynck, Kris / Compernolle, Dirk Van / Hamme, Hugo Van:
"Robust phone lattice decoding",
paper 1631-Wed1CaP.6.
Lecouteux, Benjamin / Linarès, Georges / Nocéra, Pascal / Bonastre, Jean-François:
"Imperfect transcript driven speech recognition",
paper 1660-Wed1CaP.7.
Xue, Jian / Hu, Rusheng / Zhao, Yunxin:
"New improvements in decoding speed and latency for automatic captioning",
paper 1739-Wed1CaP.8.
Saleem, Shirin / Prasad, Rohit / Natarajan, Prem:
"Colloquial Iraqi ASR for speech translation",
paper 1771-Wed1CaP.9.
Hakamata, Tomohiro / Lee, Akinobu / Nankaku, Yoshihiko / Tokuda, Keiichi:
"Reducing computation on parallel decoding using frame-wise confidence scores",
paper 1878-Wed1CaP.10.
Ketabdar, Hamed / Vepa, Jithendra / Bengio, Samy / Bourlard, Hervé:
"Posterior based keyword spotting with a priori thresholds",
paper 1939-Wed1CaP.11.
Zhou, Zhengyu / Meng, Helen M. / Lo, Wai Kit:
"A multi-pass error detection and correction framework for Mandarin LVCSR",
paper 1947-Wed1CaP.12.
Nouza, Jan / Zdansky, Jindrich / Cerva, Petr / Kolorenc, Jan:
"Continual on-line monitoring of Czech spoken broadcast programs",
paper 1478-Wed1CaP.13.
Speech/Noise/Music Segmentation
Zhang, Shilei / Jiang, Hongchen / Zhang, Shuwu / Xu, Bo:
"Fast SVM training based on the choice of effective samples for audio classification",
paper 1073-Wed1FoP.1.
Schmalenstroeer, Joerg / Haeb-Umbach, Reinhold:
"Online speaker change detection by combining BIC with microphone array beamforming",
paper 1078-Wed1FoP.2.
Ramírez, Javier / Yélamos, Pablo / Górriz, J. M. / Segura, José C. / García, L.:
"Speech/non-speech discrimination combining advanced feature extraction and SVM learning",
paper 1134-Wed1FoP.3.
Jarifi, Safaa / Pastor, Dominique / Rosec, Olivier:
"Cooperation between global and local methods for the automatic segmentation of speech synthesis corpora",
paper 1160-Wed1FoP.4.
Heckmann, Martin / Moebus, Marco / Joublin, Frank / Goerick, Christian:
"Speaker independent voiced-unvoiced detection evaluated in different speaking styles",
paper 1249-Wed1FoP.5.
Anguera, Xavier / Wooters, Chuck / Pardo, Jose M.:
"Robust speaker diarization for meetings: ICSI RT06s evaluation system",
paper 1716-Wed1FoP.6.
Coy, André / Barker, Jon:
"A multipitch tracker for monaural speech segmentation",
paper 1979-Wed1FoP.7.
Chitturi, Rahul / Hasegawa-Johnson, Mark:
"Novel entropy based moving average refiners for HMM landmarks",
paper 1911-Wed1FoP.8.
Kim, Gibak / Cho, Nam Ik:
"Two-microphone voice activity detection in the presence of coherent interference",
paper 1917-Wed1FoP.9.
Myrvoll, Tor André / Matsui, Tomoko:
"On a greedy learning algorithm for dPLRM with applications to phonetic feature detection",
paper 2063-Wed1FoP.10.
Pitch Estimation
Moore II, Elliot / Torres, Juan:
"Improving glottal waveform estimation through rank-based glottal quality assessment",
paper 1296-Wed2A1O.1.
Alías, Francesc / Monzo, Carlos / Socoró, Joan Claudi:
"A pitch marks filtering algorithm based on restricted dynamic programming",
paper 1625-Wed2A1O.2.
Malyska, Nicolas / Quatieri, Thomas F.:
"Analysis of nonmodal phonation using minimum entropy deconvolution",
paper 1807-Wed2A1O.3.
Nakano, Tomoyasu / Goto, Masataka / Hiraga, Yuzuru:
"An automatic singing skill evaluation method for unknown melodies using pitch interval accuracy and vibrato features",
paper 1854-Wed2A1O.4.
Zahorian, Stephen A. / Dikshit, Princy / Hu, Hongbing:
"A spectral-temporal method for pitch tracking",
paper 1910-Wed2A1O.5.
Rahman, M. Shahidur / Tanaka, Hirobumi / Shimamura, Tetsuya:
"Pitch determination using aligned AMDF",
paper 1960-Wed2A1O.6.
Acoustic Modeling V - Novel Approaches
Han, Yan / Boves, Lou:
"Syllable-length path mixture hidden Markov models with trajectory clustering for continuous speech recognition",
paper 1460-Wed2A2O.1.
Cincarek, Tobias / Toda, Tomoki / Saruwatari, Hiroshi / Shikano, Kiyohiro:
"Acoustic modeling for spoken dialogue systems based on unsupervised utterance-based selective training",
paper 1481-Wed2A2O.2.
Lévy, Christophe / Linarès, Georges / Bonastre, Jean-François:
"GMM-based acoustic modeling for embedded speech recognition",
paper 1255-Wed2A2O.3.
Wachter, Mathias De / Demuynck, Kris / Compernolle, Dirk Van:
"Boosting HMM performance with a memory upgrade",
paper 1126-Wed2A2O.4.
Deng, Y. / Li, X. / Kwan, C. / Xu, R. / Raj, B. / Stern, Richard M. / Williamson, D.:
"An integrated approach to improve speech recognition rate for non-native speakers",
paper 1472-Wed2A2O.5.
Hu, Rusheng / Zhao, Yunxin:
"Bayesian decision tree state tying for conversational speech recognition",
paper 1263-Wed2A2O.6.
Corpus-Based Synthesis
Kirkpatrick, Barry / O’Brien, Darragh / Scaife, Ronán:
"Feature extraction for spectral continuity measures in concatenative speech synthesis",
paper 1385-Wed2A3O.1.
Sakai, Shinsuke / Kawahara, Tatsuya:
"Decision tree-based training of probabilistic concatenation models for corpus-based speech synthesis",
paper 1564-Wed2A3O.2.
Zhao, Yong / Peng, Di / Wang, Lijuan / Chu, Min / Chen, Yining / Yu, Peng / Guo, Jun:
"Constructing stylistic synthesis databases from audio books",
paper 1559-Wed2A3O.3.
Conkie, Alistair / Syrdal, Ann K.:
"Expanding phonetic coverage in unit selection synthesis through unit substitution from a donor voice",
paper 2001-Wed2A3O.4.
Taylor, Paul:
"Unifying unit selection and hidden Markov model speech synthesis",
paper 1456-Wed2A3O.5.
Black, Alan W.:
"CLUSTERGEN: a statistical parametric synthesizer using trajectory modeling",
paper 1394-Wed2A3O.6.
Spoken Dialog Technology R&D
Rieser, Verena / Lemon, Oliver:
"Cluster-based user simulations for learning dialogue strategies",
paper 1127-Wed2WeS.1.
Lewis, Charles / Fabbrizio, Giuseppe Di:
"Prompt selection with reinforcement learning in an AT&T call routing application",
paper 1744-Wed2WeS.2.
Goronzy, Silke / Mochales, Raquel / Beringer, Nicole:
"Developing speech dialogs for multimodal HMIs using finite state machines",
paper 1544-Wed2WeS.3.
Pfleger, Norbert / Schehl, Jan:
"Development of advanced dialog systems with PATE",
paper 1598-Wed2WeS.4.
Subramanian, Rajah Annamalai / Cohen, Philip:
"A joint intention-based dialogue engine",
paper 1843-Wed2WeS.5.
Möller, Sebastian / Englert, Roman / Engelbrecht, Klaus / Hafner, Verena / Jameson, Anthony / Oulasvirta, Antti / Raake, Alexander / Reithinger, Norbert:
"MeMo: towards automatic usability evaluation of spoken dialogue services by user error simulations",
paper 1131-Wed2WeS.6.
Modeling Speaker Emotional State
Matthews, Brett / Bakis, Raimo / Eide, Ellen:
"Synthesizing breathiness in natural speech with sinusoidal modelling",
paper 1087-Wed2BuP.1.
Nicolao, Mauro / Drioli, Carlo / Cosi, Piero:
"Voice GMM modelling for FESTIVAL/MBROLA emotive TTS synthesis",
paper 1597-Wed2BuP.2.
Cabral, João P. / Oliveira, Luís C.:
"Emovoice: a system to generate emotions in speech",
paper 1645-Wed2BuP.3.
Wu, Zhiyong / Zhang, Shen / Cai, Lianhong / Meng, Helen M.:
"Real-time synthesis of Chinese visual speech and facial expressions using MPEG-4 FAP features in a three-dimensional avatar",
paper 1823-Wed2BuP.4.
Yang, Hongwu / Meng, Helen M. / Cai, Lianhong:
"Modeling the acoustic correlates of expressive elements in text genres for expressive text-to-speech synthesis",
paper 1935-Wed2BuP.5.
Zhang, Sheng / Ching, P. C. / Kong, Fanrang:
"Automatic emotion recognition of speech signal in Mandarin",
paper 1128-Wed2BuP.6.
Kao, Yi-hao / Lee, Lin-shan:
"Feature analysis for emotion recognition from Mandarin speech considering the special characteristics of Chinese language",
paper 1504-Wed2BuP.7.
Schuller, Björn / Rigoll, Gerhard:
"Timing levels in segment-based speech emotion recognition",
paper 1695-Wed2BuP.8.
Nisimura, Ryuichi / Omae, Souji / Kawahara, Hideki / Irino, Toshio:
"Analyzing dialogue data for real-world emotional speech classification",
paper 1675-Wed2BuP.9.
Alm, Cecilia Ovesdotter / Llorà, Xavier:
"Evolving emotional prosody",
paper 1741-Wed2BuP.10.
Luo, Xin / Fu, Qian-Jie / Galvin III, John J.:
"Vocal emotion recognition with cochlear implants",
paper 1315-Wed2BuP.11.
Matsunaga, S. / Sakaguchi, S. / Yamashita, M. / Miyahara, S. / Nishitani, S. / Shinohara, K.:
"Emotion detection in infants' cries based on a maximum likelihood approach",
paper 1345-Wed2BuP.12.
Tepperman, Joseph / Traum, David / Narayanan, Shrikanth:
"'yeah right': sarcasm recognition for spoken dialogue systems",
paper 1821-Wed2BuP.13.
Kumar, Rohit / Rosé, Carolyn P. / Litman, Diane J.:
"Identification of confusion and surprise in spoken dialog using prosodic features",
paper 1921-Wed2BuP.14.
Nwe, Tin Lay / Li, Haizhou / Dong, Minghui:
"Analysis and detection of speech under sleep deprivation",
paper 1934-Wed2BuP.15.
Vasilescu, Ioana / Adda-Decker, Martine:
"Language, gender, speaking style and language proficiency as factors influencing the autonomous vocalic filler production in spontaneous speech",
paper 1994-Wed2BuP.16.
Language Modeling and ASR Applications
Lavecchia, Caroline / Smaïli, Kamel / Haton, Jean-Paul:
"How to handle gender and number agreement in statistical language models?",
paper 1362-Wed2CaP.1.
Chan, Oscar / Togneri, Roberto:
"Prosodic features for a maximum entropy language model",
paper 1150-Wed2CaP.2.
Mori, Shinsuke:
"Language model adaptation with a word list and a raw corpus",
paper 1146-Wed2CaP.3.
Wiggers, Pascal / Rothkrantz, Léon J.M.:
"Topic-based language modeling with dynamic Bayesian networks",
paper 1882-Wed2CaP.4.
Yamamoto, Hirofumi / Kikui, Genichiro / Nakamura, Satoshi / Sagisaka, Yoshinori:
"Speech recognition of foreign out-of-vocabulary words using a hierarchical language model",
paper 1692-Wed2CaP.5.
Hu, Xinhui / Yamamoto, Hirofumi / Kikui, Genichiro / Sagisaka, Yoshinori:
"Language modeling of Chinese personal names based on character units for continuous Chinese speech recognition",
paper 1979-Wed2CaP.6.
Lakshmi, A. / Murthy, Hema A.:
"A syllable based continuous speech recognizer for Tamil",
paper 1055-Wed2CaP.7.
Woszczyna, Monika / Charoenpornsawat, Paisarn / Schultz, Tanja:
"Spontaneous Thai speech recognition",
paper 1419-Wed2CaP.8.
Gerosa, M. / Giuliani, D. / Narayanan, Shrikanth:
"Acoustic analysis and automatic recognition of spontaneous children's speech",
paper 1082-Wed2CaP.9.
Vertanen, Keith:
"Speech and speech recognition during dictation corrections",
paper 1094-Wed2CaP.10.
Smídl, Lubos / Psutka, Josef V.:
"Comparison of keyword spotting methods for searching in speech",
paper 1587-Wed2CaP.11.
Balakrishna, Mithun / Cerovic, Cyril / Moldovan, Dan / Cave, Ellis:
"Automatic generation of statistical language models for interactive voice response applications",
paper 1648-Wed2CaP.12.
Ju, Yun-Cheng / Wang, Ye-Yi / Acero, Alex:
"Call analysis with classification using speech and non-speech features",
paper 2011-Wed2CaP.13.
Spoken Language Understanding
Wu, Wei-Lin / Lu, Ru-Zhan / Liu, Hui / Gao, Feng:
"A spoken language understanding approach using successive learners",
paper 1987-Wed2FoP.1.
Stewart, Osamuyimen / Huerta, Juan / Jan, Ea-Ee / Wu, Cheng / Li, Xiang / Lubensky, David:
"Conversational help desk: vague callers and context switch",
paper 1291-Wed2FoP.2.
Rosset, Sophie / Galibert, Olivier / Illouz, Gabriel / Max, Aurélien:
"Integrating spoken dialog and question answering: the Ritel project",
paper 1529-Wed2FoP.3.
Prommer, Thomas / Holzapfel, Hartwig / Waibel, Alex:
"Rapid simulation-driven reinforcement learning of multimodal dialog strategies in human-robot interaction",
paper 1551-Wed2FoP.4.
Aist, Gregory / Allen, James / Campana, Ellen / Galescu, Lucian / Gallo, Carlos A. Gómez / Stoness, Scott C. / Swift, Mary / Tanenhaus, Michael:
"Software architectures for incremental understanding of human speech",
paper 1869-Wed2FoP.5.
Schiel, Florian / Draxler, Christoph / Libossek, Marion:
"Lingua machinae - an unorthodox proposal",
paper 2026-Wed2FoP.6.
Pon-Barry, Heather / Weng, Fuliang / Varges, Sebastian:
"Evaluation of content presentation strategies for an in-car spoken dialogue system",
paper 2044-Wed2FoP.7.
Goel, Vaibhava / Gopinath, Ramesh:
"On designing context sensitive language models for spoken dialog systems",
paper 2052-Wed2FoP.8.
Liu, Yang:
"Using SVM and error-correcting codes for multiclass dialog act classification in meeting corpus",
paper 1306-Wed2FoP.9.
Holzapfel, Hartwig / Waibel, Alex:
"A multilingual expectations model for contextual utterances in mixed-initiative spoken dialogue",
paper 1614-Wed2FoP.10.
Fukubayashi, Yuichiro / Komatani, Kazunori / Ogata, Tetsuya / Okuno, Hiroshi G.:
"Dynamic help generation by estimating user's mental model in spoken dialogue systems",
paper 1750-Wed2FoP.11.
Surendran, Dinoj / Levow, Gina-Anne:
"Dialog act tagging with support vector machines and hidden Markov models",
paper 1831-Wed2FoP.12.
Segmentation and VAD
Torre, Ángel de la / Ramírez, Javier / Benítez, Carmen / Segura, José C. / García, L. / Rubio, Antonio J.:
"Noise robust model-based voice activity detection",
paper 1476-Wed3A1O.1.
Shi, Yu / Soong, Frank K. / Zhou, Jian-Lai:
"Auto-segmentation based VAD for robust ASR",
paper 1749-Wed3A1O.2.
Boakye, Kofi / Stolcke, Andreas:
"Improved speech activity detection using cross-channel features for recognition of multiparty meetings",
paper 1824-Wed3A1O.3.
Kida, Yusuke / Kawahara, Tatsuya:
"Evaluation of voice activity detection by combining multiple features with weight adaptation",
paper 1152-Wed3A1O.4.
Lee, Keansub / Ellis, Daniel P. W.:
"Voice activity detection in personal audio recordings using autocorrelogram compensation",
paper 1753-Wed3A1O.5.
Rifkin, Ryan / Mesgarani, Nima:
"Discriminating speech and non-speech with regularized least squares",
paper 1779-Wed3A1O.6.
Technologies for Specific Populations: Learners and Challenged
Lee, John / Seneff, Stephanie:
"Automatic grammar correction for second-language learners",
paper 1299-Wed3A3O.1.
Neri, Ambra / Cucchiarini, Catia / Strik, Helmer:
"ASR-based corrective feedback on pronunciation: does it really work?",
paper 1372-Wed3A3O.2.
Dong, Minghui / Li, Haizhou / Nwe, Tin Lay:
"Evaluating prosody of Mandarin speech for language learning",
paper 1432-Wed3A3O.3.
Trancoso, Isabel / Duarte, Carlos / Serralheiro, António / Caseiro, Diamantino / Carriço, Luís / Viana, Céu:
"Spoken language technologies applied to digital talking books",
paper 1448-Wed3A3O.4.
Iida, Akemi / Ito, Jun / Kajima, Shimpei / Sugawara, Tsutomu:
"Building an English speech synthesis system from a Japanese ALS patient's voice",
paper 1948-Wed3A3O.5.
Karpov, Alexey / Ronzhin, Andrey / Cadiou, Alexandre:
"Multi-modal system ICANDO: intellectual computer assistant for disabled operators",
paper 1234-Wed3A3O.6.
The Prosody of Turn-Taking and Dialog Acts
Skantze, Gabriel / House, David / Edlund, Jens:
"User responses to prosodic variation in fragmentary grounding utterances in dialog",
paper 1229-Wed3WeS.1.
Ishi, Carlos Toshinori / Ishiguro, Hiroshi / Hagita, Norihiro:
"Analysis of prosodic and linguistic cues of phrase finals for turn-taking and dialog acts",
paper 1961-Wed3WeS.2.
Schlangen, David:
"From reaction to prediction: experiments with computational models of turn-taking",
paper 1200-Wed3WeS.3.
Kolár, Jáchym / Shriberg, Elizabeth / Liu, Yang:
"On speaker-specific prosodic models for automatic dialog act segmentation of multi-party meetings",
paper 1900-Wed3WeS.4.
Ward, Nigel G. / Bayyari, Yaffa Al:
"A case study in the identification of prosodic cues to turn-taking: back-channeling in Arabic",
paper 1257-Wed3WeS.5.
Edlund, Jens / Heldner, Mattias:
"/nailon/ - software for online analysis of prosody",
paper 1557-Wed3WeS.6.
Multichannel Speech Enhancement/Speech Perception
Li, Junfeng / Akagi, Masato / Suzuki, Yôiti:
"Improved hybrid microphone array post-filter by integrating a robust speech absence probability estimator for speech enhancement",
paper 1035-Wed3FoP.1.
Gerkmann, Timo / Martin, Rainer:
"Soft decision combining for dual channel noise reduction",
paper 1059-Wed3FoP.2.
Chen, Guo / Parsa, Vijay:
"An improved affine projection algorithm based crosstalk resistant adaptive noise canceller",
paper 1320-Wed3FoP.3.
Leukimmiatis, Stamatis / Dimitriadis, Dimitrios / Maragos, Petros:
"An optimum microphone array post-filter for speech applications",
paper 1389-Wed3FoP.4.
Flego, Federico / Omologo, Maurizio:
"Multi-microphone periodicity function for robust F0 estimation in real noisy and reverberant environments",
paper 1536-Wed3FoP.5.
Abutalebi, H. R. / Pourahmadi, M. / Aghabozorgi, M.R.:
"A new dual-microphone speech enhancement method for oriented noises",
paper 1973-Wed3FoP.6.
Lovitt, Andrew / Allen, Jont B.:
"50 years late: repeating Miller-Nicely 1955",
paper 1297-Wed3FoP.7.
Sakamoto, Shuichi / Yoshikawa, Tadahiro / Amano, Shigeaki / Suzuki, Yôiti / Kondo, Tadahisa:
"New 20-word lists for word intelligibility test in Japanese",
paper 1517-Wed3FoP.8.
Li, Guoping / Lutman, Mark E.:
"Sparseness and speech perception in noise",
paper 1466-Wed3FoP.9.
Liu, Wei M. / Mason, John S. D. / Evans, Nicholas W. D. / Jellyman, Keith A.:
"An assessment of automatic speech recognition as speech intelligibility estimation in the context of additive noise",
paper 1191-Wed3FoP.10.
Wältermann, Marcel / Scholz, Kirstin / Raake, Alexander / Heute, Ulrich / Möller, Sebastian:
"Underlying quality dimensions of modern telephone connections",
paper 1089-Wed3FoP.11.
Chen, Guo / Parsa, Vijay / Scollie, Susan:
"An ERB loudness pattern based objective speech quality measure",
paper 1318-Wed3FoP.12.
Diarization in ASR
Ning, Huazhong / Liu, Ming / Tang, Hao / Huang, Thomas S.:
"A spectral clustering approach to speaker diarization",
paper 1607-Thu1A1O.1.
Zdansky, Jindrich:
"BINSEG: an efficient speaker-based segmentation technique",
paper 1459-Thu1A1O.2.
Gallardo-Antolín, Ascensión / Anguera, Xavier / Wooters, Chuck:
"Multi-stream speaker diarization systems for the meetings domain",
paper 1620-Thu1A1O.3.
Lopes, Carla / Perdigão, Fernando:
"Improved performance evaluation of speech event detectors",
paper 1615-Thu1A1O.4.
Pardo, Jose M. / Anguera, Xavier / Wooters, Chuck:
"Speaker diarization for multiple distant microphone meetings: mixing acoustic features and inter-channel time differences",
paper 1337-Thu1A1O.5.
Pham, Tuan Van / Kubin, Gernot:
"Low-complexity and efficient classification of voiced/unvoiced/silence for noisy environments",
paper 1400-Thu1A1O.6.
Language Model Adaptation, Refinement, and Evaluation
Suzuki, Motoyuki / Kajiura, Yasutomo / Ito, Akinori / Makino, Shozo:
"Unsupervised language model adaptation based on automatic text collection from WWW",
paper 1806-Thu1A2O.1.
Tam, Yik-Cheung / Schultz, Tanja:
"Unsupervised language model adaptation using latent semantic marginals",
paper 1705-Thu1A2O.2.
Mrva, David / Woodland, Philip C.:
"Unsupervised language model adaptation for Mandarin broadcast conversation transcription",
paper 1549-Thu1A2O.3.
Klakow, Dietrich:
"Language model adaptation for tiny adaptation corpora",
paper 1446-Thu1A2O.4.
Ljolje, Andrej:
"Pronunciation dependent language models",
paper 1991-Thu1A2O.5.
Nanavati, Amit Anil / Rajput, Nitendra:
"Improving perplexity measures to incorporate acoustic confusability",
paper 1940-Thu1A2O.6.
Voice Morphing
Qin, Long / Wu, Yi-Jian / Ling, Zhen-Hua / Wang, Ren-Hua:
"Improving the performance of HMM-based voice conversion using context clustering decision tree and appropriate regression matrix format",
paper 1105-Thu1BuP.1.
Lee, Chung-Han / Wu, Chung-Hsien:
"MAP-based adaptation for speech conversion using adaptation data selection and non-parallel training",
paper 1164-Thu1BuP.2.
Nurminen, Jani / Tian, Jilei / Popa, Victor:
"Novel method for data clustering and mode selection with application in voice conversion",
paper 1463-Thu1BuP.3.
Sündermann, David / Höge, Harald / Bonafonte, Antonio / Ney, Hermann / Hirschberg, Julia:
"Text-independent cross-language voice conversion",
paper 1665-Thu1BuP.4.
Ohtani, Yamato / Toda, Tomoki / Saruwatari, Hiroshi / Shikano, Kiyohiro:
"Maximum likelihood voice conversion based on GMM with STRAIGHT mixed excitation",
paper 1681-Thu1BuP.5.
Nakagiri, Mikihiro / Toda, Tomoki / Kashioka, Hideki / Shikano, Kiyohiro:
"Improving body transmitted unvoiced speech with statistical voice conversion",
paper 1719-Thu1BuP.6.
Saino, Keijiro / Zen, Heiga / Nankaku, Yoshihiko / Lee, Akinobu / Tokuda, Keiichi:
"An HMM-based singing voice synthesis system",
paper 2077-Thu1BuP.7.
Uto, Yosuke / Nankaku, Yoshihiko / Toda, Tomoki / Lee, Akinobu / Tokuda, Keiichi:
"Voice conversion based on mixtures of factor analyzers",
paper 2076-Thu1BuP.8.
Tian, Jilei / Nurminen, Jani / Popa, Victor:
"Efficient Gaussian mixture model evaluation in voice conversion",
paper 1533-Thu1BuP.9.
Nakano, Yuji / Tachibana, Makoto / Yamagishi, Junichi / Kobayashi, Takao:
"Constrained structural maximum a posteriori linear regression for average-voice-based speech synthesis",
paper 1784-Thu1BuP.10.
Shuang, Zhi-Wei / Bakis, Raimo / Shechtman, Slava / Chazan, Dan / Qin, Yong:
"Frequency warping based on mapping formant parameters",
paper 1768-Thu1BuP.11.
Lin, Cheng-Yuan / Jang, J.-S. Roger:
"Automatic phonetic segmentation by using a SPM-based approach for a Mandarin singing voice corpus",
paper 1489-Thu1BuP.12.
Lal, Partha:
"A comparison of singing evaluation algorithms",
paper 1119-Thu1BuP.13.
Prosody
Ng, Raymond W. M. / Lee, Tan / Gu, Wentao:
"Towards automatic parameter extraction of command-response model for Cantonese",
paper 1363-Thu1FoP.1.
Campillo, Francisco / Santen, Jan P. H. van / Banga, Eduardo R.:
"A model for the f0 reset in corpus-based intonation approaches",
paper 1404-Thu1FoP.2.
Bailly, Gérard / Gorisch, Jan:
"Generating German intonation with a trainable prosodic model",
paper 2017-Thu1FoP.3.
Kim, Seungwon / Lee, Jinsik / Kim, Byeongchang / Lee, Gary Geunbae:
"Incorporating second-order information into two-step major phrase break prediction for Korean",
paper 1487-Thu1FoP.4.
Yi, Lifu / Li, Jian / Lou, Xiaoyan / Hao, Jie:
"Totally data-driven duration modeling based on generalized linear model for Mandarin TTS",
paper 1837-Thu1FoP.5.
Özturk, Özlem / Ciloglu, Tolga:
"Segmental duration modeling in Turkish",
paper 2004-Thu1FoP.6.
Dalen, Rogier C. van / Wiggers, Pascal / Rothkrantz, Léon J. M.:
"Lexical stress in continuous speech recognition",
paper 1578-Thu1FoP.7.
Wang, Siwei / Levow, Gina-Anne:
"Improving tone recognition with combined frequency and amplitude modelling",
paper 1651-Thu1FoP.8.
Lin, Che-Kuang / Lee, Lin-shan:
"Latent prosodic modeling (LPM) for speech with applications in recognizing spontaneous Mandarin speech with disfluencies",
paper 1901-Thu1FoP.9.
Hirose, Keikichi / Hu, Hui / Wang, Xiaodong / Minematsu, Nobuaki:
"Tone recognition of continuous speech of standard Chinese using neural network and tone nucleus model",
paper 1929-Thu1FoP.10.
Solorio, Thamar / Fuentes, Olac / Ward, Nigel G. / Bayyari, Yaffa Al:
"Prosodic feature generation for back-channel prediction",
paper 1724-Thu1FoP.11.
Wesseling, Wieneke / Son, Rob J. J. H. van / Pols, Louis C. W.:
"On the sufficiency and redundancy of pitch for TRP projection",
paper 1972-Thu1FoP.12.
Discriminative Training
Gibson, Matthew / Hain, Thomas:
"Hypothesis spaces for minimum Bayes risk training in large vocabulary speech recognition",
paper 1653-Thu2A1O.1.
Du, Jun / Liu, Peng / Soong, Frank K. / Zhou, Jian-Lai / Wang, Ren-Hua:
"Minimum divergence based discriminative training",
paper 1703-Thu2A1O.2.
Li, Xinwei / Jiang, Hui:
"Solving large margin estimation of HMMs via semidefinite programming",
paper 1064-Thu2A1O.3.
Yu, Dong / Deng, Li / He, Xiaodong / Acero, Alex:
"Use of incrementally regulated discriminative margins in MCE training for speech recognition",
paper 1410-Thu2A1O.4.
Li, Jinyu / Yuan, Ming / Lee, Chin-Hui:
"Soft margin estimation of hidden Markov model parameters",
paper 1316-Thu2A1O.5.
Wang, Ye-Yi / Acero, Alex:
"Discriminative models for spoken language understanding",
paper 1766-Thu2A1O.6.
Speech Synthesis
Gibert, G. / Bailly, Gérard / Elisei, F.:
"Evaluating a virtual speech cuer",
paper 1539-Thu2A3O.1.
Tomokiyo, Laura Mayfield / Peterson, Kay / Black, Alan W. / Lenzo, Kevin A.:
"Intelligibility of machine translation output in speech synthesis",
paper 1268-Thu2A3O.2.
Tachibana, Makoto / Nose, Takashi / Yamagishi, Junichi / Kobayashi, Takao:
"A technique for controlling voice quality of synthetic speech using multiple regression HSMM",
paper 1778-Thu2A3O.3.
Polyakova, Tatyana / Bonafonte, Antonio:
"Learning from errors in grapheme-to-phoneme conversion",
paper 1742-Thu2A3O.4.
Toda, Tomoki / Ohtani, Yamato / Shikano, Kiyohiro:
"Eigenvoice conversion based on Gaussian mixture model",
paper 1717-Thu2A3O.5.
Langner, Brian / Kumar, Rohit / Chan, Arthur / Gu, Lingyun / Black, Alan W.:
"Generating time-constrained audio presentations of structured information",
paper 2075-Thu2A3O.6.
Multimodal Processing
Alsaade, F. / Ariyaeeinia, A. / Meng, L. / Malegaonkar, A.:
"Multimodal authentication using qualitative support vector machines",
paper 1364-Thu2WeO.1.
Pitsikalis, Vassilis / Katsamanis, Athanassios / Papandreou, George / Maragos, Petros:
"Adaptive multimodal fusion by uncertainty compensation",
paper 1950-Thu2WeO.2.
Hardison, Debra M.:
"Effects of familiarity with faces and voices on second-language speech processing: components of memory traces",
paper 1097-Thu2WeO.3.
Tamura, Satoshi / Hashimoto, Koji / Zhu, Jiong / Hayamizu, Satoru / Asai, Hirotsugu / Tanahashi, Hideki / Kanagawa, Makoto:
"Automatic metadata generation and video editing based on speech and image recognition for medical education contents",
paper 1132-Thu2WeO.4.
Almajai, Ibrahim / Milner, Ben / Darch, Jonathan:
"Analysis of correlation between audio and visual speech features for clean audio feature prediction in noise",
paper 1634-Thu2WeO.5.
Govokhina, Oxana / Bailly, Gérard / Breton, Gaspard / Bagshaw, Paul:
"TDA: a new trainable trajectory formation system for facial animation",
paper 1274-Thu2WeO.6.
Speech Analysis
Biagetti, Giorgio / Crippa, Paolo / Turchetti, Claudio:
"Modeling of speech signals based on Bessel-like orthogonal transform",
paper 1054-Thu2BuP.1.
Jinachitra, Pamornpol:
"Glottal closure and opening detection for flexible parametric voice coding",
paper 1359-Thu2BuP.2.
Trmal, Jan / Vanek, Jan / Müller, Ludek / Zelinka, Jan:
"Independent components for acoustic modeling",
paper 1526-Thu2BuP.3.
Mehta, Daryush / Quatieri, Thomas F.:
"Pitch-scale modification using the modulated aspiration noise source",
paper 1542-Thu2BuP.4.
Ezzat, Tony / Bouvrie, Jake / Poggio, Tomaso:
"Max-Gabor analysis and synthesis of spectrograms",
paper 1561-Thu2BuP.5.
Quintana-Morales, Pedro J. / Navarro-Mesa, Juan L. / Ravelo-Garcia, Antonio G. / Lorenzo-Garcia, Fernando D.:
"Monitoring of the natural voice variations in open and closed phases with frequency warped ARMA modeling",
paper 1572-Thu2BuP.6.
Kameoka, Hirokazu / Roux, Jonathan Le / Ono, Nobutaka / Sagayama, Shigeki:
"Speech analyzer using a joint estimation model of spectral envelope and fine structure",
paper 1641-Thu2BuP.7.
Errity, Andrew / McKenna, John:
"An investigation of manifold learning for speech analysis",
paper 1667-Thu2BuP.8.
Bouvrie, Jake / Ezzat, Tony:
"An incremental algorithm for signal reconstruction from short-time Fourier transform magnitude",
paper 1691-Thu2BuP.9.
Takahashi, Toru / Nishi, Masashi / Irino, Toshio / Kawahara, Hideki:
"Automatic assignment of anchoring points on vowel templates for defining correspondence between time-frequency representations of speech samples",
paper 1737-Thu2BuP.10.
Prasad, S. / Srinivasan, S. / Pannuri, M. / Lazarou, G. / Picone, Joseph:
"Nonlinear dynamical invariants for speech recognition",
paper 1799-Thu2BuP.11.
Advances in Noisy ASR
Lin, Shih-Hsiang / Yeh, Yao-Ming / Chen, Berlin:
"Exploiting polynomial-fit histogram equalization and temporal average for robust speech recognition",
paper 1195-Thu2CaP.1.
Demange, Sébastien / Cerisara, Christophe / Haton, Jean-Paul:
"Missing data mask models with global frequency and temporal constraints",
paper 1226-Thu2CaP.2.
Misra, Hemant / Vepa, Jithendra / Bourlard, Hervé:
"Multi-stream ASR: an oracle perspective",
paper 1663-Thu2CaP.3.
Iwano, Koji / Kojima, Kaname / Furui, Sadaoki:
"A weight estimation method using LDA for multi-band speech recognition",
paper 1680-Thu2CaP.4.
Hsu, Chang-wen / Lee, Lin-shan:
"Powered cepstral normalization (p-CN) for robust features in speech recognition",
paper 1746-Thu2CaP.5.
Ding, Pei / He, Lei / Yan, Xiang / Hao, Jie:
"Robust automatic speech recognition for accented Mandarin in car environments",
paper 1764-Thu2CaP.6.
Lu, Xugang / Unoki, Masashi / Akagi, Masato:
"A robust feature extraction based on the MTF concept for speech recognition in reverberant environment",
paper 1801-Thu2CaP.7.
Kim, Young Joon / Lim, Woohyung / Kim, Nam Soo:
"Clean speech feature estimation based on soft spectral masking",
paper 1897-Thu2CaP.8.
Vali, Mansoor / Salehi, Seyyed Ali Seyyed / Karimi, Kazem:
"Robust speech recognition by modifying clean and telephone feature vectors using bidirectional neural network",
paper 2072-Thu2CaP.9.
Tai, Chung-fu / Hung, Jeih-weih:
"Silence energy normalization for robust speech recognition in additive noise environment",
paper 1492-Thu2CaP.10.
Segbroeck, Maarten Van / hamme, Hugo Van:
"Handling convolutional noise in missing data automatic speech recognition",
paper 1248-Thu2CaP.11.
Kitaoka, Norihide / Hamaguchi, Souta / Nakagawa, Seiichi:
"Noisy speech recognition based on selection of multiple noise suppression methods using noise GMMs",
paper 1207-Thu2CaP.12.
Aradilla, Guillermo / Vepa, Jithendra / Bourlard, Hervé:
"Using posterior-based features in template matching for speech recognition",
paper 1186-Thu2CaP.13.
Obuchi, Yasunari / Hataoka, Nobuo:
"Hypothesis-based feature combination of multiple speech inputs for robust speech recognition in automotive environments",
paper 1165-Thu2CaP.14.
Source Separation and Localization
Koldovsky, Zbynek / Nouza, Jan / Kolorenc, Jan:
"Continuous time-frequency masking method for blind speech separation with adaptive choice of threshold parameter using ICA",
paper 1224-Thu2FoP.1.
Liang, Yanxue / Hagiwara, Ichiro:
"Multistage convolutive blind source separation for speech mixture",
paper 1369-Thu2FoP.2.
Asano, Futoshi / Ogata, Jun:
"Detection and separation of speech events in meeting recordings",
paper 1098-Thu2FoP.3.
Abad, Alberto / Segura, Carlos / Macho, Dušan / Hernando, Javier / Nadeu, Climent:
"Audio person tracking in a smart-room environment",
paper 1649-Thu2FoP.4.
Gehrig, Tobias / Klee, Ulrich / McDonough, John W. / Ikbal, Shajith / Wölfel, Matthias / Fügen, Christian:
"Tracking and beamforming for multiple simultaneous speakers with probabilistic data association filters",
paper 2038-Thu2FoP.5.
Heckmann, Martin / Rodemann, Tobias / Scholling, Bjorn / Joublin, Frank / Goerick, Christian:
"Modeling the precedence effect for binaural sound source localization in noisy and echoic environments",
paper 1196-Thu2FoP.6.
Talantzis, Fotios / Constantinides, Anthony G. / Polymenakos, Lazaros C.:
"Using a differential microphone array to estimate the direction of arrival of two acoustic sources",
paper 1190-Thu2FoP.7.
Brutti, Alessio / Omologo, Maurizio / Svaizer, Piergiorgio:
"Speaker localization based on oriented global coherence field",
paper 1467-Thu2FoP.8.
Radfar, M. H. / Dansereau, R. M. / Sayadiyan, A.:
"Performance evaluation of three features for model-based single channel speech separation problem",
paper 2005-Thu2FoP.9.
Schmidt, Mikkel N. / Olsson, Rasmus K.:
"Single-channel speech separation using sparse non-negative matrix factorization",
paper 1652-Thu2FoP.10.
Hu, Rong / Zhao, Yunxin:
"Adaptive speech enhancement for speech separation in diffuse noise",
paper 1751-Thu2FoP.11.
Attias, H. T.:
"A probabilistic graphical model for microphone array source separation using rich pre-trained source models",
paper 1946-Thu2FoP.12.
Visser, Erik:
"Geometrically constrained permutation-free source separation in an undercomplete speech unmixing scenario",
paper 1086-Thu2FoP.13.
Olszewski, Dirk / Linhard, Klaus:
"Highly directional multi-beam audio loudspeaker",
paper 1239-Thu2FoP.14.