Eighth ISCA Workshop on Speech Synthesis

Barcelona, Catalonia, Spain
August 31-September 2, 2013



Bibliographic Reference

[SSW8] Eighth ISCA Workshop on Speech Synthesis (SSW-8), Barcelona, Catalonia, Spain, August 31-September 2, 2013; ISCA Archive, http://www.isca-speech.org/archive/ssw8


Introduction to the Workshop



Author Index and Quick Access to Abstracts

Aalto   Aihara   Akagi   Alghamdi   Alías (171)   Alías (241)   Alkanhal   Alkhairy   Alkhalifa   Alku   Almosallam   Alonso   Anumanchipalli   Ariki   Arnela   Astrinaki (207)   Astrinaki (243)   Astrinaki (245)   Astrinaki (247)   Aylett   Bangalore   Barbot   Barra-Chicote   Baumann   Bell   Bhaskararao   Blaauw   Black (13)   Black (95)   Boeffard   Bonada   Braunschweiler   Brognaux   Calzada Defez   Charfuelan   Chen, John   Chen, Langzhou   Chiu   Chng   Christina   Clark (25)   Clark (41)   Clark (101)   Clark (247)   Conkie (7)   Conkie (255)   Cosi   Csapó   d’Alessandro   Dinh   Drugman   Dutoit (207)   Dutoit (243)   Dutoit (245)   Erro   Ferrer   Gales   Giurgiu (65)   Giurgiu (101)   Golipour   Guasch   Hashimoto, Hiroya   Hashimoto, Kei   Hernaez   Hinterleitner   Hirose   Hojo   Hu   Huang   Ijima   Inukai   Iwata   Kameoka   Karhila   Kato, Tsuneo   Kato, Yumiko O.   Kimura   King (41)   King (65)   King (101)   King (113)   King (119)   King (165)   King (207)   King (243)   King (245)   King (261)   Kinnunen   Kobayashi   Krishnan   Kurimo   Latorre (119)   Latorre (135)   Legát   Li   Lin   Ling (207)   Ling (243)   Liu   Lorenzo-Trueba   Lu   Luong (31)   Luong (279)   MacDonald   Le Maguer   Mamiya (41)   Mamiya (101)   Matoušek   Matsui   Merritt   Minematsu   Miyazaki   Mizuno   Möller   Moinet (207)   Moinet (243)   Montaño   Montero (65)   Montero (159)   Moore   Muresan   Murthy   Nagarajan   Nakamura   Nakatoh   Nandwana   Nankaku   Navas   Németh   Neubig   Nicolao   Nishizawa   Norrenbrock   Oura (247)   Oura (297)   Paci   Pammi   Parlikar (13)   Parlikar (95)   Phan   Phung   Picart   Pidcock   Potard (59)   Potard (217)   Prahalad   Prahallad   Prakash   Pucher (77)   Pucher (83)   Rachel   Raitio   Ramani   Remes   Richmond (135)   Richmond (207)   Richmond (243)   Sagayama   Saheer   Sakti   Samudravijaya   San-Segundo   Schabus (77)   Schabus (83)   Schlangen   SERRA   Serrano   Shanmugam   Sitaram   Socoró Carrié   Solomi   Sommavilla   Sridhar   Stan (41)   Stan (101)   Suni   Syrdal   Takashima   Takiguchi   Ternström   Tesser (107)   Tesser (183)   Tihelka   Toda   Tokuda   Toman (77)   Toman (83)   Umbert   Vadapalli   Vainio   Valentini-Botinhao   Veaux   Vijayalakshmi   Virtanen   Vu   Wan   WARD   Watson   Watts (41)   Watts (101)   Watts (159)   Watts (261)   Wester   Wu, Chung-Hsien   Wu, Zhizheng   Yamagishi (41)   Yamagishi (101)   Yamagishi (113)   Yamagishi (135)   Yamagishi (207)   Yamagishi (243)   Yamagishi (245)   Yamagishi (247)   Yanagisawa   Yoshimura   Yoshizato   ZEN  

Names written in boldface refer to first authors, in CAPITAL letters to keynote and invited papers. Full papers can be accessed from the abstracts. Please note that each abstract opens in a separate window.



Table of Contents and Access to Abstracts

Keynote Papers

Zen, Heiga: "Deep learning in speech synthesis", 309.

Ward, Nigel: "Prosodic patterns in dialog", 311-312.

Serra, Xavier: "Singing voice synthesis in the context of music technology research", 313.

Prosody and Pausing

Braunschweiler, Norbert / Chen, Langzhou: "Automatic detection of inhalation breath pauses for improved pause modelling in HMM-TTS", 1-6.

Sridhar, Vivek Kumar Rangarajan / Chen, John / Bangalore, Srinivas / Conkie, Alistair: "Role of pausing in text-to-speech synthesis for simultaneous interpretation", 7-11.

Parlikar, Alok / Black, Alan W.: "Minimum error rate training for phrasing in speech synthesis", 13-17.

Picart, Benjamin / Brognaux, Sandrine / Drugman, Thomas: "HMM-based speech synthesis of live sports commentaries: integration of a two-layer prosody annotation", 19-24.

Open Challenges in Speech Synthesis

Inukai, Tatsuo / Toda, Tomoki / Neubig, Graham / Sakti, Sakriani / Nakamura, Satoshi: "Investigation of intra-speaker spectral parameter variation and its prediction towards improvement of spectral conversion metric", 89-94.

Sitaram, Sunayana / Anumanchipalli, Gopala Krishna / Chiu, Justin / Parlikar, Alok / Black, Alan W.: "Text to speech in new languages without a standardized orthography", 95-100.

Watts, Oliver / Stan, Adriana / Clark, Robert A. J. / Mamiya, Yoshitaka / Giurgiu, Mircea / Yamagishi, Junichi / King, Simon: "Unsupervised and lightly-supervised learning for rapid construction of TTS systems in multiple languages from ‘found’ data: evaluation and analysis", 101-106.

Robustness in Synthetic Speech

Nicolao, Mauro / Tesser, Fabio / Moore, Roger K.: "A phonetic-contrast motivated adaptation to control the degree-of-articulation on Italian HMM-based synthetic voices", 107-112.

Valentini-Botinhao, Cassia / Wester, Mirjam / Yamagishi, Junichi / King, Simon: "Using neighbourhood density and selective SNR boosting to increase the intelligibility of synthetic speech in noise", 113-118.

Yanagisawa, Kayoko / Latorre, Javier / Wan, Vincent / Gales, Mark J. F. / King, Simon: "Noise robustness in HMM-TTS speaker adaptation", 119-124.

Issues in HMM-based Speech Synthesis

Erro, Daniel / Alonso, Agustin / Serrano, Luis / Navas, Eva / Hernaez, Inma: "New method for rapid vocal tract length adaptation in HMMbased speech synthesis", 125-128.

Hojo, Nobukatsu / Yoshizato, Kota / Kameoka, Hirokazu / , Daisuke Saito / Sagayama, Shigeki: "Text-to-speech synthesizer based on combination of composite wavelet and hidden Markov models", 129-134.

Hu, Qiong / Richmond, Korin / Yamagishi, Junichi / Latorre, Javier: "An experimental comparison of multiple vocoder types", 135-140.

Ijima, Yusuke / Miyazaki, Noboru / Mizuno, Hideyuki: "Statistical model training technique for speech synthesis based on speaker class", 141-145.

Synthetic Singing Voices

Astrinaki, Maria / Moinet, Alexis / Yamagishi, Junichi / Richmond, Korin / Ling, Zhen-Hua / King, Simon / Dutoit, Thierry: "Mage - reactive articulatory feature control of HMM-based parametric speech synthesis", 207-211.

Umbert, Martí / Bonada, Jordi / Blaauw, Merlijn: "Systematic database creation for expressive singing voice synthesis control", 213-216.

Expressive Speech Synthesis

Aylett, Matthew P. / Potard, Blaise / Pidcock, Christopher J.: "Expressive speech synthesis: synthesising ambiguity", 217-221.

Baumann, Timo / Schlangen, David: "Interactional adequacy as a factor in the perception of synthesized speech", 223-227.

Csapó, Tamás Gábor / Németh, Géza: "A novel irregular voice model for HMM-based speech synthesis", 229-234.

Iwata, Kazuhiko / Kobayashi, Tetsunori: "Expression of speaker’s intentions through sentence-final particle/ intonation combinations in Japanese conversational speech synthesis", 235-240.

Demo Session

Guasch, Oriol / Ternström, Sten / Arnela, Marc / Alías, Francesc: "Unified numerical simulation of the physics of voice. the EUNISON project", 241-242.

Astrinaki, Maria / Moinet, Alexis / Yamagishi, Junichi / Richmond, Korin / Ling, Zhen-Hua / King, Simon / Dutoit, Thierry: "Mage - HMM-based speech synthesis reactively controlled by the articulators", 243.

Astrinaki, Maria / Yamagishi, Junichi / King, Simon / d’Alessandro, Nicolas / Dutoit, Thierry: "Reactive accent interpolation through an interactive map application", 245.

Veaux, Christophe / Astrinaki, Maria / Oura, Keiichiro / Clark, Robert A. J. / Yamagishi, Junichi: "Real-time control of expressive speech synthesis using kinect body tracking", 247-248.

General Topics in Speech Synthesis (Poster Sessions)

Calzada Defez, Àngel / Socoró Carrié, Joan Claudi / Clark, Robert A. J.: "Parametric model for vocal effort interpolation with harmonics plus noise models", 25-30.

Dinh, Anh-Tuan / Phan, Thanh-Son / Vu, Tat-Thang / Luong, Chi Mai: "Vietnamese HMM-based speech synthesis with prosody information", 31-34.

Hashimoto, Hiroya / Hirose, Keikichi / Minematsu, Nobuaki: "Context labels based on "bunsetsu" for HMM-based speech synthesis of Japanese", 35-39.

Mamiya, Yoshitaka / Stan, Adriana / Yamagishi, Junichi / Bell, Peter / Watts, Oliver / Clark, Robert A. J. / King, Simon: "Using adaptation to improve speech transcription alignment in noisy and reverberant environments", 41-46.

Nishizawa, Nobuyuki / Kato, Tsuneo: "Speech synthesis using a maximally decimated pseudo QMF bank for embedded devices", 47-52.

Pammi, Sathish / Charfuelan, Marcela: "HMM-based scost quality control for unit selection speech synthesis", 53-57.

Saheer, Lakshmi / Potard, Blaise: "Understanding factors in emotion perception", 59-64.

San-Segundo, Rubén / Montero, Juan Manuel / Giurgiu, Mircea / Muresan, Ioana / King, Simon: "Multilingual number transcription for text-to-speech conversion", 65-69.

Takashima, Ryoichi / Aihara, Ryo / Takiguchi, Tetsuya / Ariki, Yasuo: "Noise-robust voice conversion based on spectral mapping on sparse space", 71-75.

Toman, Markus / Pucher, Michael / Schabus, Dietmar: "Cross-variety speaker transformation in HSMM-based speech synthesis", 77-81.

Toman, Markus / Pucher, Michael / Schabus, Dietmar: "Multi-variety adaptive acoustic modeling in HSMM-based speech synthesis", 83-87.

Hinterleitner, Florian / Norrenbrock, Christoph / Möller, Sebastian: "Is intelligibility still the main problem? a review of perceptual quality dimensions of synthetic speech", 147-151.

Maguer, Sébastien Le / Barbot, Nelly / Boeffard, Olivier: "Evaluation of contextual descriptors for HMM-based speech synthesis in French", 153-158.

Lorenzo-Trueba, Jaime / Barra-Chicote, Roberto / , Junichi Yamagishi / Watts, Oliver / Montero, Juan Manuel: "Towards speaking style transplantation in speech synthesis", 159-163.

Merritt, Thomas / King, Simon: "Investigating the shortcomings of HMM synthesis", 165-170.

Montaño, Raúl / Alías, Francesc / Ferrer, Josep: "Prosodic analysis of storytelling discourse modes and narrative situations oriented to text-to-speech synthesis", 171-176.

Remes, Ulpu / Karhila, Reima / Kurimo, Mikko: "Objective evaluation measures for speaker-adaptive HMM-TTS systems", 177-181.

Tesser, Fabio / Sommavilla, Giacomo / Paci, Giulio / Cosi, Piero: "Experiments with signal-driven symbolic prosody for statistical parametric speech synthesis", 183-187.

Vadapalli, Anandaswarup / Bhaskararao, Peri / Prahallad, Kishore: "Significance of word-terminal syllables for prediction of phrase breaks in text-to-speech systems for Indian languages", 189-194.

Watson, Catherine / Liu, Wei / MacDonald, Bruce: "The effect of age and native speaker status on synthetic speech intelligibility", 195-200.

Wu, Zhizheng / Virtanen, Tuomas / Kinnunen, Tomi / Chng, Eng Siong / Li, Haizhou: "Exemplar-based voice conversion using non-negative spectrogram deconvolution", 201-206.

Almosallam, Ibrahim / Alkhalifa, Atheer / Alghamdi, Mansour / Alkanhal, Mohamed / Alkhairy, Ashraf: "SASSC: a standard Arabic single speaker corpus", 249-253.

Golipour, Ladan / Conkie, Alistair / Syrdal, Ann: "Prosodically modifying speech for unit selection speech synthesis databases", 255-259.

Lu, Heng / King, Simon / Watts, Oliver: "Combining a vector space representation of linguistic context with a deep neural network for text-to-speech synthesis", 261-265.

Matoušek, Jindřich / Tihelka, Daniel / Legát, Milan: "Is unit selection aware of audible artifacts?", 267-271.

Matsui, Kenji / Kimura, Kenta / Nakatoh, Yoshihisa / Kato, Yumiko O.: "Development of electrolarynx with hands-free prosody control", 273-277.

Phung, Trung-Nghia / Luong, Chi Mai / Akagi, Masato: "A hybrid TTS between unit selection and HMM-based TTS under limited data conditions", 279-284.

Suni, Antti / Aalto, Daniel / Raitio, Tuomo / Alku, Paavo / Vainio, Martti: "Wavelets for intonation modeling in HMM speech synthesis", 285-290.

Ramani, B. / Christina, S. Lilly / Rachel, G. Anushiya / Solomi, V. Sherlin / Nandwana, Mahesh Kumar / Prakash, Anusha / Shanmugam, S. Aswin / Krishnan, Raghava / Prahalad, S. Kishore / Samudravijaya, K. / Vijayalakshmi, P. / Nagarajan, T. / Murthy, Hema A.: "A common attribute based unified HTS framework for speech synthesis in Indian languages", 291-296.

Yoshimura, Takenori / Hashimoto, Kei / Oura, Keiichiro / Nankaku, Yoshihiko / Tokuda, Keiichi: "Cross-lingual speaker adaptation based on factor analysis using bilingual speech data for HMM-based speech synthesis", 297-302.

Huang, Yi-Chin / Wu, Chung-Hsien / Lin, Shih-Lun: "Residual compensation based on articulatory feature-based phone clustering for hybrid Mandarin speech synthesis", 303-307.