International Conference on Auditory-Visual Speech Processing 2008

Tangalooma Wild Dolphin Resort, Moreton Island, Queensland, Australia
September 26-29, 2008



Bibliographic Reference

[AVSP-2008] Proceedings, International Conference on Auditory-Visual Speech Processing 2008, Tangalooma Wild Dolphin Resort, Moreton Island, Queensland, Australia, September 26-29, 2008; ed. by Roland Göcke, Patrick Lucey and Simon Lucey. ISCA Archive, http://www.isca-speech.org/archive_open/avsp08



Author Index and Quick Access to Abstracts

Abrahamyan   Abry   Alm   Ambadar   Asthana   Back   Badin   Bailly (43)   Bailly (111)   Bailly (153)   Barbosa (131)   Barbosa (173)   Beautemps (63)   Beautemps (153)   Bégault   Behne (47)   Behne (205)   Belsby   Bothe   Brungart   Burnham   Cathiard (63)   Cathiard (199)   Cavedon   Chaloupka   Chetty   COHN (3)   Cohn (167)   Cosi   Cox   Davis (107)   Davis (127)   Dean   Dohen   Dreves   Edge (185)   Edge (229)   Elisei (111)   Elisei (153)   Erdogan   Fagel (43)   Fagel (59)   Fagel (75)   Fagel (195)   Fang   Fels   Garnier   Giorgolo   Göcke   Gracco   Hagita   Haq   Harđarson   Harvey   Hayamizu   Hill   Hilton   Hodgins   Howlett   Imai   Ishi   Ishiguro   Iyer   Jackson (185)   Jackson (229)   Jesse   Johnson   Kaasa   Kannampuzha   Karabalkan   Kim (107)   Kim (127)   Kitaoka   Kong   Krňoul   Kröger   Kroos (55)   Kroos (107)   Kroos (127)   Kuehnel   Kuratate (127)   Kuratate (191)   Lan   Lewis   Lidestam   Liew (121)   Liew (223)   Lucey, Patrick (69)   Lucey, Patrick (167)   Lucey, Simon   Luerssen   Madany (59)   Madany (195)   MATTHEWS (5)   Matthews (7)   Ménard (63)   Ménard (199)   Miyajima   Moeller   Newman   Nouza   Numahata   Ozgur   Paine   Pan   Pandzic   Potamianos   Powers   Pritchard   Riley   Roštík   Sakamoto   Sato   Simonsen   Simpson   Sridharan (69)   Sridharan (137)   Sridharan (167)   Stelarc   Stevens   Suzuki   Takagi   Takeda   Tamura   Tanaka   Theobald (7)   Theobald (179)   Tisato   Troille (63)   Troille (199)   Unel   VATIKIOTIS-BATESON (1)   Vatikiotis-Bateson (13)   Vatikiotis-Bateson (131)   Vatikiotis-Bateson (173)   Verstraten   Wagner   Wang, S. L.   Wang, Yue   van Wassenhove   Wechsung   Weiss   Wilkinson   Wu, Chun-Huei   Wu, Junru   Yehia (131)   Yehia (173)   Yilmaz   Zdansky   Železný (147)   Železný (215)   Zoric  

Names written in boldface refer to first authors, in CAPITAL letters to keynote and invited papers. Full papers can be accessed from the abstracts (ISCA members only). Please note that each abstract opens in a separate window.



Table of Contents and Access to Abstracts

Invited Papers

Vatikiotis-Bateson, Eric: "Concurrency, synchrony, and temporal organization", 1.

Cohn, Jeffrey F.: "Facial dynamics reveals person identity and communicative intent, regulates person perception and social interaction", 3.

Matthews, Iain: "Active appearance models for facial analysis", 5.

Contributed Papers

Theobald, Barry-John / Wilkinson, Nicholas / Matthews, Iain: "On evaluating synthesised visual speech", 7-12.

Fels, Sidney / Pritchard, Robert / Vatikiotis-Bateson, Eric: "Building a portable gesture-to-audio/visual speech system", 13-18.

Brungart, Douglas S. / Iyer, Nandini / Simpson, Brian D. / Wassenhove, Virginie van: "The effects of temporal asynchrony on the intelligibility of accelerated speech", 19-24.

Chaloupka, Josef / Nouza, Jan / Zdansky, Jindrich: "Audio-visual voice command recognition in noisy conditions", 25-30.

Giorgolo, Gianluca / Verstraten, Frans A. J.: "Perception of ‘speech-and-gesture˛ integration", 31-36.

Ishi, Carlos Toshinori / Ishiguro, Hiroshi / Hagita, Norihiro: "Analysis of inter- and intra-speaker variability of head motions during spoken dialogue", 37-42.

Fagel, Sascha / Bailly, Gérard: "German text-to-audiovisual-speech by 3-d speaker cloning", 43-46.

Behne, Dawn / Wang, Yue / Belsby, Stein-Ove / Kaasa, Solveig / Simonsen, Lisa / Back, Kirsti: "Visual field advantage in the perception of audiovisual speech segments", 47-50.

Tamura, Satoshi / Miyajima, Chiyomi / Kitaoka, Norihide / Hayamizu, Satoru / Takeda, Kazuya: "CENSREC-AV: evaluation frameworks for audio-visual speech recognition", 51-54.

Kroos, Christian / Dreves, Ashlie: "Mcgurk effect persists with a partially removed visual signal", 55-58.

Fagel, Sascha / Madany, Katja: "Guided non-linear model estimation (gnoME)", 59-62.

Troille, Emilie / Cathiard, Marie-Agnčs / Abry, Christian / Ménard, Lucie / Beautemps, Denis: "Multimodal perception of anticipatory behavior - Comparing blind, hearing and cued speech subjects", 63-68.

Lucey, Patrick / Potamianos, Gerasimos / Sridharan, Sridha: "Patch-based analysis of visual speech from multiple views", 69-74.

Fagel, Sascha / Kuehnel, Christine / Weiss, Benjamin / Wechsung, Ina / Moeller, Sebastian: "A comparison of German talking heads in a smart home environment", 75-78.

Sakamoto, Shuichi / Tanaka, Akihiro / Numahata, Shun / Imai, Atsushi / Takagi, Tohru / Suzuki, Yôiti: "Effect of audio-visual asynchrony between time-expanded speech and a moving image of a talker˛s face on detection and tolerance thresholds", 79-82.

Kröger, Bernd J. / Kannampuzha, Jim: "A neurofunctional model of speech production including aspects of auditory and audio-visual speech perception", 83-88.

Dohen, Marion / Wu, Chun-Huei / Hill, Harold: "Auditory-visual perception of prosodic information: inter-linguistic analysis - contrastive focus in French and Japanese", 89-94.

Garnier, Maëva: "May speech modifications in noise contribute to enhance audio-visible cues to segment perception?", 95-100.

Jesse, Alexandra / Johnson, Elizabeth K.: "Audiovisual alignment in child-directed speech facilitates word learning", 101-106.

Kim, Jeesun / Kroos, Christian / Davis, Chris: "Hearing a talking face: an auditory influence on a visual detection task", 107-110.

Bailly, Gérard / Bégault, Antoine / Elisei, Frédéric / Badin, Pierre: "Speaking with smile or disgust: data and models", 111-114.

Chetty, Girija / Wagner, Michael: "A multilevel fusion approach for audiovisual emotion recognition", 115-120.

Wu, Junru / Pan, Xiaosheng / Kong, Jiangping / Liew, Alan Wee-Chung: "Statistical correlation analysis between lip contour parameters and formant parameters for Mandarin monophthongs", 121-126.

Burnham, Denis / Abrahamyan, A. / Cavedon, L. / Davis, Chris / Hodgins, A. / Kim, Jeesun / Kroos, Christian / Kuratate, Takaaki / Lewis, T. / Luerssen, M. / Paine, G. / Powers, D. / Riley, M. / Stelarc, Stelarc / Stevens, K.: "From talking to thinking heads: report 2008", 127-130.

Barbosa, Adriano V. / Yehia, Hani C. / Vatikiotis-Bateson, Eric: "Algorithm for computing spatiotemporal coordination", 131-136.

Dean, David / Sridharan, Sridha: "Fused HMM adaptation of synchronous HMMs for audio-visual speaker verification", 137-141.

Cosi, Piero / Tisato, Graziano: "Describing "INTERFACE" a matlabÉ tool for building talking heads", 143-146.

Železný, Miloš: "Analysis of technologies and resources for multimodal information kiosk for deaf users", 147-152.

Bailly, Gérard / Fang, Yu / Elisei, Frédéric / Beautemps, Denis: "Retargeting cued speech hand gestures for different talking heads and speakers", 153-158.

Lidestam, Björn: "A, v, and AV discrimination of vowel duration", 159-162.

Zoric, Goranka / Pandzic, Igor S.: "Towards real-time speech-based facial animation applications built on HUGE architecture", 163-166.

Lucey, Patrick / Howlett, Jessica / Cohn, Jeffrey F. / Lucey, Simon / Sridharan, Sridha / Ambadar, Zara: "Improving pain recognition through better utilisation of temporal information", 167-172.

Barbosa, Adriano V. / Yehia, Hani C. / Vatikiotis-Bateson, Eric: "Linguistically valid movement behavior measured non-invasively", 173-177.

Cox, Stephen / Harvey, Richard / Lan, Yuxuan / Newman, Jacob / Theobald, Barry-John: "The challenge of multispeaker lip-reading", 179-184.

Haq, Sanaul / Jackson, Philip J. B. / Edge, James D.: "Audio-visual feature selection and reduction for emotion classification", 185-190.

Kuratate, Takaaki: "Text-to-AV synthesis system for Thinking Head Project", 191-194.

Madany, Katja / Fagel, Sascha: "Objective and perceptual evaluation of parameterizations of 3d motion captured speech data", 195-198.

Sato, Marc / Troille, Emilie / Ménard, Lucie / Cathiard, Marie-Agnčs / Gracco, Vincent: "Listening while speaking: new behavioral evidence for articulatory-to-auditory feedback projections", 199-204.

Alm, Magnus / Behne, Dawn: "Age-related experience in audio-visual speech perception", 205-208.

Harđarson, Ţórir / Bothe, Hans-Heinrich: "A model for the dynamics of articulatory lip movements", 209-214.

Krňoul, Zdeněk / Roštík, Patrik / Železný, Miloš: "Evaluation of synthesized sign and visual speech by deaf", 215-218.

Ozgur, Erol / Yilmaz, Berkay / Karabalkan, Harun / Erdogan, Hakan / Unel, Mustafa: "Lip segmentation using adaptive color space training", 219-222.

Wang, S. L. / Liew, Alan Wee-Chung: "Static and dynamic lip feature analysis for speaker verification", 223-227.

Edge, James D. / Hilton, Adrian / Jackson, Philip J. B.: "Parameterisation of 3d speech lip movements", 229-234.

Göcke, Roland / Asthana, Akshay: "A comparative study of 2d and 3d lip tracking methods for AV ASR", 235-240.