ISCA Archive SSW 2019 Sessions Website Booklet

10th ISCA Workshop on Speech Synthesis

Vienna, Austria
20-22 September 2019

Chair: Michael Pucher
doi: 10.21437/SSW.2019

keynote 1: Deep learning for speech synthesis - Aäron van den Oord


Deep learning for speech synthesis
Aäron van den Oord




poster 1: Voice conversion and multi-speaker TTS


Multi-Speaker Modeling for DNN-based Speech Synthesis Incorporating Generative Adversarial Networks
Hiroki Kanagawa, Yusuke Ijima

Speaker Adaptation of Acoustic Model using a Few Utterances in DNN-based Speech Synthesis Systems
Ivan Himawan, Sandesh Aryal, Iris Ouyang, Shukhan Ng, Pierre Lanchantin

DNN-based Speaker Embedding Using Subjective Inter-speaker Similarity for Multi-speaker Modeling in Speech Synthesis
Yuki Saito, Shinnosuke Takamichi, Hiroshi Saruwatari

Generalization of Spectrum Differential based Direct Waveform Modification for Voice Conversion
Wen-Chin Huang, Yi-Chiao Wu, Kazuhiro Kobayashi, Yu-Huai Peng, Hsin-Te Hwang, Patrick Lumban Tobing, Yu Tsao, Hsin-Min Wang, Tomoki Toda

Statistical Voice Conversion with Quasi-periodic WaveNet Vocoder
Yi-Chiao Wu, Patrick Lumban Tobing, Tomoki Hayashi, Kazuhiro Kobayashi, Tomoki Toda

Voice Conversion without Explicit Separation of Source and Filter Components Based on Non-negative Matrix Factorization
Hitoshi Suda, Daisuke Saito, Nobuaki Minematsu

Voice conversion based on full-covariance mixture density networks for time-variant linear transformations
Gaku Kotani, Daisuke Saito

Unsupervised Learning of a Disentangled Speech Representation for Voice Conversion
Tobias Gburrek, Thomas Glarner, Janek Ebbers, Reinhold Haeb-Umbach, Petra Wagner

Novel Inception-GAN for Whispered-to-Normal Speech Conversion
Maitreya Patel, Mihir Parmar, Savan Doshi, Nirmesh Shah, Hemant Patil

Implementation of DNN-based real-time voice conversion and its improvements by audio data augmentation and mask-shaped device
Riku Arakawa, Shinnosuke Takamichi, Hiroshi Saruwatari


keynote 2: Synthesizing animal vocalizations and modelling animal speech - Tecumseh Fitch and Bart de Boer


Synthesizing animal vocalizations and modelling animal speech
Tecumseh Fitch, Bart de Boer





keynote 3: Natural Language Generation: Creating Text - Claire Gardent


Natural Language Generation: Creating Text
Claire Gardent






keynote 1: Deep learning for speech synthesis - Aäron van den Oord

oral 1: Neural vocoder

oral 2: Adaptation

poster 1: Voice conversion and multi-speaker TTS

keynote 2: Synthesizing animal vocalizations and modelling animal speech - Tecumseh Fitch and Bart de Boer

oral 3: Evaluation and performance

oral 4: Speech science

poster 2: Applications and practical issues

keynote 3: Natural Language Generation: Creating Text - Claire Gardent

oral 5: Language and dialect varieties

oral 6: Sequence to sequence model

poster 3: Prosody