<< home

Programme of the 6th ISCA Workshop on Speech Synthesis (SSW6), Bonn, August 22 − 24, 2007
The Blizzard Meeting will take place on Saturday, 25th of August.
The Programme shown below is subject to alterations.

abstracts (pdf-file, 222kb)

Wednesday, 22nd August

Thursday, 23rd August

Friday, 24th August

 

Registration
(9.00-10.00)

 

Session 3: Voice Conversion
(9.00-10.40)

  • Ohta, Kumi / Ohtani, Yamato / Toda, Tomoki / Saruwatari, Hiroshi / Shikano, Kiyohiro: "Regression approaches to voice quality control based on one-to-many eigenvoice conversion"
  • Tani, Daisuke / Ohtani, Yamato / Toda, Tomoki / Saruwatari, Hiroshi / Shikano, Kiyohiro: "An evaluation of many-to-one voice conversion algorithms with pre-stored speaker data sets"
  • Cabral, João P. / Renals, Steve / Richmond, Korin / Yamagishi, Junichi: "Towards an improved modeling of the glottal source in statistical parametric speech synthesis"
  • Mesbahi, Larbi / Barreaud, Vincent / Boeffard, Olivier: "GMM-based speech transformation systems under data reduction"

Session 7: Inventory Construction
(9.00-10.40)

  • Aylett, Matthew P. / King, Simon: "Single speaker segmentation and inventory selection using dynamic time warping self organization and joint multigram mapping"
  • Lambert, Tanya / Braunschweiler, Norbert / Buchholz, Sabine: "How (not) to select your voice corpus: random selection vs. phonologically balanced"
  • Latacz, Lukas / Kong, Yuk On / Verhelst, Werner: "Unit selection synthesis using long non-uniform units and phonemic identity matching"
  • Gruber, Martin / Tihelka, Daniel / Matousek, Jindrich: "Evaluation of various unit types in the unit selection approach for the Czech language using the Festival system"
 

Opening Ceremony
(10.00-10.10)

 
 

Keynote 1
(10.10-11.00)
Bernd Kröger:
”Perspectives for articulatory speech synthesis”

 
Coffee
(10.40-11.00)
Coffee
(10.40-11.00)

Session 1: Various Topics
(11.00-12.40)

  • Govokhina, Oxana / Bailly, Gérard / Breton, Gaspard: "Learning optimal audiovisual phasing for an HMM-based control model for facial animation"
  • Birkholz, Peter / Steiner, Ingmar / Breuer, Stefan: "Control concepts for articulatory speech synthesis"
  • Kain, Alexander B. / Miao, Qi / Santen, Jan P. H. van: "Spectral control in concatenative speech synthesis"
  • Kirkpatrick, Barry / O'Brien, Darragh / Scaife, Ronán: "Feature transformation applied to the detection of discontinuities in concatenated speech"

Session 4: Speech Synthesis by HMM
(11.00-12.40)

  • Yamagishi, Junichi / Kobayashi, Takao / Renals, Steve / King, Simon / Zen, Heiga / Toda, Tomoki / Tokuda, Keiichi: "Improved average-voice-based speech synthesis using gender-mixed modeling and a parameter generation algorithm considering GV"
  • Maia, Ranniery / Toda, Tomoki / Zen, Heiga / Nankaku, Yoshihiko / Tokuda, Keiichi: "An excitation model for HMM-based speech synthesis based on residual modeling"
  • Liang, Hui / Qian, Yao / Soong, Frank K.: "An HMM-based bilingual (Mandarin-English) TTS"
  • Roux, Justus C. / Visagie, Albert S.: "Data-driven approach to rapid prototyping Xhosa speech synthesis"

Keynote 2
(11.00-11.50)
Alan Black:
“The Blizzard Challenge: Evaluating Corpus-based Speech Synthesis Techniques”

Session 8: Applications
(11.50-12.40)

  • Moers, Donata / Wagner, Petra / Breuer, Stefan: "Assessing the adequate treatment of fast speech in unit selection speech synthesis systems for the visually impaired"
  • Wolters, Maria / Campbell, Pauline / DePlacido, Christine / Liddell, Amy / Owens, David: "Making speech synthesis more accessible to older people"
Lunch
(12.40-14.00)
Lunch
(12.40-14.00)
Lunch
(12.40-14.00)

Session 2: Expressive Speech Synthesis
(14.00-16.05)

  • Campbell, Nick: "Towards conversational speech synthesis: lessons learned from the expressive speech processing project"
  • Sakai, Shinsuke / Ni, Jinfu / Maia, Ranniery / Tokuda, Keiichi / Tsuzaki, Minoru / Toda, Tomoki / Kawai, Hisashi / Nakamura, Satoshi: "Communicative speech synthesis with XIMERA: a first step"
  • Fernandez, Raul / Ramabhadran, Bhuvana: "Automatic exploration of corpus-specific properties for expressive text-to-speech: a case study in emphasis"
  • Wollermann, Charlotte / Lasarcyk, Eva: "Modeling and perceiving of (un)certainty in articulatory speech synthesis"
  • Wang, Lijuan / Chu, Min / Peng, Yaya / Zhao, Yong / Soong, Frank K.: "Perceptual annotation of expressive speech"

Session 5: Tone and Tone Accent Languages
(14.00-15.40)

  • Minematsu, Nobuaki / Kuroiwa, Ryo / Hirose, Keikichi / Watanabe, Michiko: "CRF-based statistical learning of Japanese accent Sandhi for developing Japanese text-to-speech synthesis systems"
  • Sun, Qinghua / Hirose, Keikichi / Minematsu, Nobuaki: "Two-step generation of Mandarin F0 contours based on tone nucleus and superpositional models"
  • Chomphan, Suphattharachai / Kobayashi, Takao: "Design of tree-based context clustering for an HMM-based Thai speech synthesis system"
  • Bachmann, Arne / Breuer, Stefan: "Development of a BOSS unit selection module for tone languages"

Session 9: Systems
(14.00-15.40)

  • Zen, Heiga / Nose, Takashi / Yamagishi, Junichi / Sako, Shinji / Masuko, Takashi / Black, Alan W. / Tokuda, Keiichi: "The HMM-based speech synthesis system (HTS) version 2.0"
  • Weiss, Christian / Oliveira, Luis C. / Paulo, Sergio / Mendes, Carlos / Figueira, Luis / Vala, Marco / Sequeira, Pedro / Paiva, Ana / Vogt, Thurid / Andre, Elisabeth: "eCIRCUS: building voices for autonomous speaking agents"
  • Barbisch, Martin / Dogil, Grzegorz / Möbius, Bernd / Säuberlich, Bettina / Schweitzer, Antje: "Unit selection synthesis in the SmartWeb project"
  • Silen, Hanna / Helander, Elina / Koppinen, Konsta / Gabbouj, Moncef: "Building a Finnish unit selection TTS system"

Poster Session 1
(16.05-17.15)

  • Schnell, Karl / Lacroix, Arild: "Joint analysis of speech frames for synthesis based on lossy tube models"
  • Adsett, Connie R. / Marchand, Yannick: "Are rule-based syllabification methods adequate for languages with low syllabic complexity? The case of Italian"
  • Huckvale, Mark / Yanagisawa, Kayoko: "Spoken language conversion with accent morphing"
  • Demenko, Grazyna / Wagner, Agnieszka / Jilka, Matthias / Möbius, Bernd: "Comparative investigation of peak alignment in Polish and German unit selection corpora"
  • Klessa, Katarzyna / Szymanski, Marcin / Breuer, Stefan / Demenko, Grazyna: "Optimization of Polish segmental duration prediction with CART"
  • Hirai, Toshio / Yamagishi, Junichi / Tenpaku, Seiichi: "Utilization of an HMM-based feature generation module in 5-ms-segment concatenative speech synthesis"
  • Lolive, Damien / Barbot, Nelly / Boeffard, Olivier: "Clustering Algorithm for F0 Curves Based on Hidden Markov Models"
  • Kumar, Rohit / Gangadharaiah, Rashmi / Rao, Sharath / Prahallad, Kishore / Rosé, Carolyn P. / Black, Alan W.: "Building a better Indian English voice using "more data""
  • Schröder, Marc / Hunecke, Anna: "Creating German unit selection voices for the MARY TTS platform from the BITS corpora"

Poster Session 2
(15.40-16.45)

  • Kain, Alexander B. / Santen, Jan P. H. van: "Unit-selection text-to-speech synthesis using an asynchronous interpolation model"
  • Hertrich, Ingo / Ackermann, Hermann: "Modelling voiceless speech segments by means of an additive procedure based on the computation of formant sinusoids"
  • Toth, Arthur R. / Black, Alan W.: "Using articulatory position data in voice transformation"
  • Raj, Anand Arokia / Sarkar, Tanuja / Pammi, Satish Chandra / Yuvaraj, Santhosh / Bansal, Mohit / Prahallad, Kishore / Black, Alan W.: "Text processing for text-to-speech systems in Indian languages"
  • Erro, Daniel / Moreno, Asunción / Bonafonte, Antonio: "Flexible harmonic/stochastic speech synthesis"
  • Romportl, Jan / Kala, Jirí: "Prosody modelling in Czech text-to-speech synthesis"
  • Zhao, Yong / Zhang, Chengsuo / Soong, Frank K. / Chu, Min / Xiao, Xi: "Measuring attribute dissimilarity with HMM KL-divergence for speech synthesis"
  • Chevelu, Jonathan / Barbot, Nelly / Boeffard, Olivier / Delhay, Arnaud: "Lagrangian relaxation for optimal corpus design"
  • Krul, Aleksandra / Damnati, Géraldine / Yvon, François / Boidin, Cédric / Moudenc, Thierry: "Adaptive database reduction for domain specific speech synthesis"
  • Adell, Jordi / Bonafonte, Antonio / Escudero, David: "Statistical analysis of filled pauses rhythm for disfluent speech synthesis"
  • Gu, Wentao / Lee, Tan: "Quantitative analysis of F0 contours of emotional speech of Mandarin"

Poster Session 3
(15.40-16.45)

  • Marchand, Yannick / Adsett, Connie R. / Damper, Robert I.: "Evaluating automatic syllabification algorithms for English"
  • Kominek, John / Schultz, Tanja / Black, Alan W.: "Voice building from insufficient data - classroom experiences with web-based language development tools"
  • Cahill, Peter / Macek, Jan / Carson-Berndsen, Julie: "SVM based feature extraction in speech synthesis"
  • Nankaku, Yoshihiko / Nakamura, Kenichi / Toda, Tomoki / Tokuda, Keiichi: "Spectral conversion based on statistical models including time-sequence matching"
  • Klabbers, Esther / Mishra, Taniya / Santen, Jan P. H. van: "Analysis of affective speech recordings using the superpositional intonation model"
  • Beux, Sylvain Le / Rilliard, Albert / d'Alessandro, Christophe: "Calliphony: a real-time intonation controller for expressive speech synthesis"
  • Mandal, Shyamal Kumar Das / Datta, Asoke Kumar: "Epoch synchronous non-overlap-add (ESNOLA) method-based concatenative speech synthesis system for Bangla"
  • Hansakunbuntheung, Chatchawarn / Kato, Hiroaki / Sagisaka, Yoshinori: "Syllable-based Thai duration model using multi-level linear regression and syllable accommodation"
  • Gonzalvo, Xavier / Socoró, Joan Claudi / Iriondo, Ignasi / Monzo, Carlos / Martínez, Elisa: "Linguistic and mixed excitation improvements on a HMM-based speech synthesis for Castilian Spanish"
  • Lyudovyk, Tetyana / Robeiko, Valentyna: "Inventory of intonation contours for text-to-speech synthesis"
Coffee
(16.05-17.15)
Coffee
(15.40-16.50)
Coffee
(15.40-16.50)

Boat trip on the Rhine river
(18.00-22.00)

Session 6: Prosody Modelling
(16.50-18.30)

  • Shechtman, Slava: "Maximum-likelihood dynamic intonation model for concatenative text to speech system"
  • Reichel, Uwe D.: "Data-driven extraction of intonation contour classes"
  • Mishra, Taniya / Tucker Prud'hommeaux, Emily / Santen, Jan P. H. van: "Word accentuation prediction using a neural net classifier"
  • Badino, Leonardo / Clark, Robert A. J.: "Issues of optionality in pitch accent placement"

Session 10: Evaluation
(16.50-18.05)

  • Bunnell, H. Timothy / Lilley, Jason: "Analysis methods for assessing TTS intelligibility"
  • Langner, Brian / Black, Alan W.: "Understandable production of massive synthesis"
  • Hooijdonk, Charlotte van / Commandeur, Edwin / Cozijn, Reinier / Krahmer, Emiel / Marsi, Erwin: "The online evaluation of speech synthesis using eye movements"

Closing Ceremony
(18.05-18.20)

<< home