The Seventh ISCA Tutorial and Research Workshop on Speech Synthesis

Kyoto, Japan
September 22-24, 2010

Kyoto, Japan, September 22-24, 2010,

Bibliographic Reference

[SSW7-2010] The Seventh ISCA Tutorial and Research Workshop on Speech Synthesis, Kyoto, Japan, September 22-24, 2010, ed. by Y. Sagisaka and K. Tokuda; ISSN 1680-8908; ISCA Archive, http://www.isca-speech.org/archive/ssw7


Introduction to the Workshop


Author Index and Quick Access to Abstracts

Abou-Zleikha   Ainon   Alku   Andersson   Anumanchipalli (154)   Anumanchipalli (206)   Arnold   Bellegarda   Beutnagel (179)   Beutnagel (371)   Black (148)   Black (154)   Black (162)   Black (206)   Bonafonte   Braunschweiler   Breen   Buchholz   Bunnell   Byrne   Cabral   Cadic   Cahill   Carson-Berndsen   Cen   Chan   Chao   Charfuelan (114)   Charfuelan (240)   Chen, Chia-Ping   Chen, Sin-Horng   Cheng   Chiang   Cho   Chonavel   Clark (142)   Clark (173)   Conkie (45)   Conkie (179)   Cosi   d'Alessandro   Danieli   Dines (192)   Dines (224)   Dong   Drugman   Dutoit   Fernandez   Gales   Garner (192)   Garner (224)   Gibson   Gil   Godoy   Goldman   Guan (192)   Guan (236)   Han   Hashimoto   Hayashida   Hirose   Hirsimäki   Huang, Dong-Yan (258)   Huang, Dong-Yan (345)   Huang, Xiaohan   Huang, Yi-Chin   Ircing   Isaac   Ito   Janska   Jauk   Kain   Karhila   Kato   KAWAHARA   Kawai   Kenmochi   Kim (179)   Kim (371)   KING (38)   King (192)   King (317)   Klepp   Knill   Kobayashi   Krstulovic   Kurimo   Lanchantin   Langner   Lasarcyk   Latorre   Lee   Leen   Li   Liang (192)   Liang (224)   Maestre   Maia   Mao   Mase   Minematsu   Möbius   Moers   Mustafa   Muthukumar   Muto   Nallasamy   Nankaku (100)   Nankaku (106)   Nankaku (211)   Ni   Nicolao   Nishizawa   Nose   Nurminen   Oertel   Ohtani   Ong (258)   Ong (345)   Oura (192)   Oura (211)   Pammi   Parlikar   Picart   Pollet   Polyákova   Prahallad (148)   Prahallad (162)   Pulakka   Qian   Raghavendra   Rahardja (258)   Rahardja (345)   Raitio   Rajkumar   Renals (136)   Renals (365)   Richmond   Rodet   Roekhaut   Romportl   Rosec   Roux   Saheer (192)   Saheer (224)   Saino   Saito   Santos   Saruwatari   Scholtz   Schröder (114)   Schröder (240)   Shannon   Shao   Shikano   Shiota   Simon   Soong   Speer   Steiner (114)   Steiner (240)   Suni   Suzuki   Syrdal (45)   Syrdal (179)   Tachibana   Takaki   Tamburini   Tesser   Thomson   Tian (192)   Tian (236)   Toda   Toit   Tokuda (100)   Tokuda (106)   Tokuda (192)   Tokuda (211)   Türk   Vainio   Veaux   Villavicencio   Wagner (355)   Wagner (377)   Wang, Lijuan   Wang, Miaomiao   Wang, Yih-Ru   Watts   Wen   Wester   White   Windmann   Wollermann   Wolters   Wu, Chung-Hsien   Wu, Yi-Jian (192)   Wu, Yi-Jian (236)   Yamada   Yamagishi (173)   Yamagishi (192)   Yamagishi (236)   Yamagishi (317)   Yamagishi (365)   Yamamoto   Yamashita   Yang   Young   Yu   Zainuddin   Zen (88)   Zen (186)   Zovato (120)   Zovato (130)  

Names written in boldface refer to first authors, in CAPITAL letters to keynote, tutorial, or invited papers. Full papers can be accessed from the abstracts. Please note that each abstract opens in a separate window.



Table of Contents and Access to Abstracts

Tutorials

Kawahara, Hideki: "Exploration of the other aspect of vocoder revisited: A-Z STRAIGHT, TANDEM-STRAIGHT and morphing", 32-37.

King, Simon: "Speech synthesis without the right data", 38.

Concatenative Speech Synthesis

Bunnell, H. Timothy: "Crafting small databases for unit selection TTS: effects on intelligibility", 40-44.

Conkie, Alistair / Syrdal, Ann K.: "Composite TTS voices", 45-48.

Kain, Alexander / Leen, Todd: "Compression of line spectral frequency parameters using the asynchronous interpolation model", 49-54.

Voice Conversion

Villavicencio, Fernando / Maestre, Esteban: "GMM-PCA based speaker-timbre conversion on full-quality speech", 56-61.

Huang, Yi-Chin / Wu, Chung-Hsien / Lee, Chung-Han / Chao, Yu-Ting: "Voice conversion using precise speech alignment based on spectral property and eigen-codeword distribution", 62-67.

Godoy, Elizabeth / Rosec, Olivier / Chonavel, Thierry: "On transforming spectral peaks in voice conversion", 68-73.

Hayashida, Chie / Toda, Tomoki / Ohtani, Yamato / Saruwatari, Hiroshi / Shikano, Kiyohiro: "Linear transformation approaches to many-to-one voice conversion", 74-79.

Nose, Takashi / Kobayashi, Takao: "HMM-based robust voice conversion using adaptive F0 quantization", 80-85.

Statistical Parametric Speech Synthesis

Maia, Ranniery / Zen, Heiga / Gales, M. J. F.: "Statistical parametric speech synthesis with joint estimation of acoustic and excitation model parameters", 88-93.

Yu, Kai / Thomson, Blaise / Young, Steve: "From discontinuous to continuous F0 modelling in HMM-based speech synthesis", 94-99.

Takaki, Shinji / Nankaku, Yoshihiko / Tokuda, Keiichi: "Spectral modeling with contextual additive structure for HMM-based speech synthesis", 100-105.

Hashimoto, Kei / Nankaku, Yoshihiko / Tokuda, Keiichi: "Bayesian speech synthesis framework integrating training and synthesis processes", 106-111.

Expressive Speech Synthesis

Steiner, Ingmar / Schröder, Marc / Charfuelan, Marcela / Klepp, Annette: "Symbolic vs. acoustics-based style control for expressive unit selection", 114-119.

Romportl, Jan / Zovato, Enrico / Santos, Raúl / Ircing, Pavel / Gil, José Relaño / Danieli, Morena: "Application of expressive TTS synthesis in an advanced ECA system", 120-125.

Yang, Chih-Yung / Chen, Chia-Ping: "A hidden Markov model-based approach for emotional speech synthesis", 126-129.

Tesser, Fabio / Zovato, Enrico / Nicolao, Mauro / Cosi, Piero: "Two vocoder techniques for neutral to emotional timbre conversion", 130-135.

Evaluation and Applications

Wolters, Maria K. / Isaac, Karl B. / Renals, Steve: "Evaluating speech synthesis intelligibility using Amazon Mechanical Turk", 136-141.

Janska, Anna C. / Clark, Robert A. J.: "Further exploration of the possibilities and pitfalls of multidimensional scaling as a tool for the evaluation of the quality of synthesized speech", 142-147.

Prahallad, Kishore / Black, Alan W.: "Handling large audio files in audio books for building synthetic voices", 148-153.

Anumanchipalli, Gopala Krishna / Muthukumar, Prasanna Kumar / Nallasamy, Udhyakumar / Parlikar, Alok / Black, Alan W. / Langner, Brian: "Improving speech synthesis for noisy environments", 154-159.

Prosody and Conversation

Prahallad, Kishore / Raghavendra, E. Veera / Black, Alan W.: "Learning speaker-specific phrase breaks for text-to-speech systems", 162-166.

Nishizawa, Nobuyuki / Kato, Tsuneo: "Substitution of state distributions to reproduce natural prosody on HMM-based speech synthesizers", 167-172.

Andersson, Sebastian / Yamagishi, Junichi / Clark, Robert A. J.: "Utilising spontaneous conversational speech in HMM-based speech synthesis", 173-178.

Syrdal, Ann K. / Conkie, Alistair / Kim, Yeon-Jun / Beutnagel, Mark C.: "Speech acts and dialog TTS", 179-183.

Multi-Lingual Speech Synthesis

Zen, Heiga / Braunschweiler, Norbert / Buchholz, Sabine / Knill, Kate / Krstulovic, Sacha / Latorre, Javier: "HMM-based polyglot speech synthesis by speaker and language adaptive training", 186-191.

Wester, Mirjam / Dines, John / Gibson, Matthew / Liang, Hui / Wu, Yi-Jian / Saheer, Lakshmi / King, Simon / Oura, Keiichiro / Garner, Philip N. / Byrne, William / Guan, Yong / Hirsimäki, Teemu / Karhila, Reima / Kurimo, Mikko / Shannon, Matt / Shiota, Sayaka / Tian, Jilei / Tokuda, Keiichi / Yamagishi, Junichi: "Speaker adaptation and the evaluation of speaker similarity in the EMIME speech-to-speech translation project", 192-197.

Selected Topics

Bellegarda, Jerome R.: "Toward naturally expressive speech synthesis: data–driven emotion detection using latent affective analysis", 200-205.

Anumanchipalli, Gopala Krishna / Cheng, Ying-Chang / Fernandez, oseph / Huang, Xiaohan / Mao, Qi / Black, Alan W.: "KLATTSTAT: knowledge-based parametric speech synthesis", 206-210.

Oura, Keiichiro / Mase, Ayami / Yamada, Tomohiko / Muto, Satoru / Nankaku, Yoshihiko / Tokuda, Keiichi: "Recent development of the HMM-based singing voice synthesis system — Sinsy", 211-216.

Wang, Lijuan / Qian, Xiaojun / Han, Wei / Soong, Frank K.: "Photo-real lips synthesis with trajectory-guided sample selection", 217-222.

Poster Sessions

Saheer, Lakshmi / Dines, John / Garner, Philip N. / Liang, Hui: "Implementation of VTLN for statistical speech synthesis", 224-229.

Lasarcyk, Eva / Wollermann, Charlotte: "Do prosodic cues influence uncertainty perception in articulatory speech synthesis?", 230-235.

Guan, Yong / Tian, Jilei / Wu, Yi-Jian / Yamagishi, Junichi / Nurminen, Jani: "An unified and automatic approach of Mandarin HTS system", 236-239.

Pammi, Sathish / Schröder, Marc / Charfuelan, Marcela / Türk, Oytun / Steiner, Ingmar: "Synthesis of listener vocalisations with imposed intonation contours", 240-245.

Ni, Jinfu / Kawai, Hisashi: "An investigation of the impact of speech transcript errors on HMM voices", 246-251.

Saino, Keijiro / Tachibana, Makoto / Kenmochi, Hideki: "An HMM-based singing style modeling system for singing voice synthesizers", 252-257.

Huang, Dong-Yan / Rahardja, Susanto / Ong, Ee Ping: "Lombard effect mimicking", 258-263.

Chiang, Chen Yu / Chen, Sin-Horng / Wang, Yih-Ru: "Unsupervised prosody labeling for constructing Mandarin TTS", 264-269.

Picart, Benjamin / Drugman, Thomas / Dutoit, Thierry: "Analysis and synthesis of hypo- and hyperarticulated speech", 270-275.

Rajkumar, Rajakrishnan / White, Michael / Speer, Shari R. / Ito, Kiwako: "Evaluating prosody in synthetic speech with online (eye-tracking) and offline (rating) methods", 276-281.

Shao, Xu / Pollet, Vincent / Breen, Andrew: "Refined statistical model tuning for speech synthesis", 284-287.

Cadic, Didier / d'Alessandro, Christophe: "High quality TTS voices within one day", 288-293.

Polyákova, Tatyana / Bonafonte, Antonio: "Nativization of English words in Spanish using analogy", 294-299.

Yamamoto, Asami / Suzuki, Kazuhiro / Cho, Kook / Yamashita, Yoichi: "Automatic prosodic labeling of accent information for Japanese spoken sentences", 300-305.

Abou-Zleikha, Mohamed / Cahill, Peter / Carson-Berndsen, Julie: "An automatic pitch model with distance function", 306-311.

Dong, Minghui / Cen, Ling / Chan, Paul / Li, Haizhou: "Considering readability in text-to-speech recording script design", 312-316.

Watts, Oliver / Yamagishi, Junichi / King, Simon: "Letter-based speech synthesis", 317-322.

Veaux, Christophe / Lanchantin, Pierre / Rodet, Xavier: "Joint prosodic and segmental unit selection for expressive speech synthesis", 323-327.

Scholtz, Pieter E. / Roux, Justus C. / Toit, Jacques P. du: "Speech synthesis in the mobile user interface", 328-331.

Raitio, Tuomo / Suni, Antti / Pulakka, Hannu / Vainio, Martti / Alku, Paavo: "Comparison of formant enhancement methods for HMM-based speech synthesis", 334-339.

Mustafa, Mumtaz B. / Ainon, Raja N. / Zainuddin, Roziati: "EM-HTS: real-time HMM-based Malay emotional speech synthesis", 340-344.

Huang, Dong-Yan / Rahardja, Susanto / Ong, Ee Ping: "High level emotional speech morphing using STRAIGHT", 345-350.

Goldman, Jean-Philippe / Roekhaut, Sophie / Simon, Anne Catherine: "Adding speaking style to a TTS system", 351-354.

Moers, Donata / Jauk, Igor / Möbius, Bernd / Wagner, Petra: "Synthesizing fast speech by implementing multi-phone units in unit selection speech synthesis", 355-358.

Wang, Miaomiao / Wen, Miaomiao / Saito, Daisuke / Hirose, Keikichi / Minematsu, Nobuaki: "Improved generation of prosodic features in HMM-based Mandarin speech synthesis", 359-364.

Cabral, João P. / Renals, Steve / Richmond, Korin / Yamagishi, Junichi: "An HMM-based speech synthesiser using glottal post-filtering", 365-370.

Kim, Yeon-Jun / Beutnagel, Mark C.: "A study of lexical stress patterns in unit selection synthesis", 371-376.

Windmann, Andreas / Wagner, Petra / Tamburini, Fabio / Arnold, Denis / Oertel, Catharine: "Automatic prominence annotation of a German speech synthesis corpus: towards prominence-based prosody generation for unit selection synthesis", 377-382.