ISCA Archive
Fifth ISCA ITRW on Speech Synthesis
June 14-16, 2004
Pittsburgh, PA, USA
Bibliographic Reference
[SSW5-2004] Fifth ISCA ITRW on Speech Synthesis (SSW5), Pittsburgh, PA, USA, June 14-16, 2004, ed. by Alan W. Black and Kevin Lenzo, ISCA Archive, http://www.isca-speech.org/archive/ssw5
Author Index and Quick Access to Abstracts
Aaron
Adell
Agüero
Alvarez
Aylett
Badino
Bailly
Baker
Bakis
Bali
Barolo
Bazin
Bellegarda
Black (31)
Black (103)
Black (155)
Black (203)
Black (223)
Black (229)
Black (231)
Bonafonte (67)
Bonafonte (139)
Braunschweiler
Clark (91)
Clark (173)
Collins-Thompson
Conkie
Cosi
Damper
Dijkstra
Dogil
Drioli
Eide
Elisei
Fackrell
Fujisaki
Gibert
Gu
Gustafson
Hamza
Hirai
Hirose (161)
Hirose (227)
Hosom
Ito
Jilka
Kain
Kamoshida
Kang
Kawai
Kim
King (7)
King (19)
King (173)
Kishore
Kitamura
Klabbers (61)
Klabbers (73)
Kominek (155)
Kominek (223)
Krishna (109)
Krishna (197)
Kumar
Langner
Marchand
Mariam
Marseters
Marsi
Miao
Minematsu
Minoru
Mishra
Möbius
Murthy
Nagamatsu
Ni
Niu
Nukaga
Pacchiotti
Picheny
Pitrelli
Pols
Quazza (217)
Quazza (219)
Ramakrishnan
Renato
Richmond
Rutten
Sakai
Sandri
Sangal
van Santen (25)
van Santen (61)
van Santen (73)
Sato
Schweitzer
Segi
Shiga
Sjölander
Skut
Son
Syrdal (49)
Syrdal (127)
Takagi
Talkin
Talukdar
Tao
Tenpaku
Tesser
Tisato
Toda (31)
Toda (179)
Toda (232)
Tokuda (31)
Tokuda (179)
Tokuda (191)
Toth (203)
Toth (225)
Tsuzaki
Vepa
White
Zen
Zhang
Zovato
Table of Contents and Access to Abstracts
Oral Sessions
Schweitzer, Antje / Braunschweiler, Norbert / Dogil, Grzegorz / Möbius, Bernd:
"Assessing the acceptability of the SmartKom speech synthesis voices",
1-6.
Vepa, Jithendra / King, Simon:
"Subjective evaluation of join cost & smoothing methods",
7-12.
Marsi, Erwin:
"Optionality in evaluating prosody prediction",
13-18.
Shiga, Yoshinori / King, Simon:
"Accurate spectral envelope estimation for articulation-to-speech synthesis",
19-24.
Kain, Alexander / Niu, Xiaochuan / Hosom, John-Paul / Miao, Qi / Santen, Jan P. H. van:
"Formant re-synthesis of dysarthric speech",
25-30.
Toda, Tomoki / Black, Alan W. / Tokuda, Keiichi:
"Mapping from articulatory movements to vocal tract spectrum with Gaussian mixture model for articulatory speech synthesis",
31-36.
Hirai, Toshio / Tenpaku, Seiichi:
"Using 5 ms segments in concatenative speech synthesis",
37-42.
Nukaga, Nobuo / Kamoshida, Ryota / Nagamatsu, Kenji:
"Unit selection using pitch synchronous cross correlation for Japanese concatenative speech synthesis",
43-48.
Syrdal, Ann K. / Conkie, Alistair D.:
"Data-driven perceptually based join costs",
49-54.
Aylett, Matthew:
"Merging data driven and rule based prosodic models for unit selection TTS",
55-60.
Santen, Jan P. H. van / Mishra, Taniya / Klabbers, Esther:
"Estimating phrase curves in the general superpositional intonation model",
61-66.
Agüero, Pablo Daniel / Bonafonte, Antonio:
"Intonation modeling for TTS using a joint extraction and prediction approach",
67-72.
Klabbers, Esther / Santen, Jan P. H. van:
"Clustering of foot-based pitch contours in expressive speech",
73-78.
Eide, E. / Aaron, A. / Bakis, R. / Hamza, W. / Picheny, Michael / Pitrelli, J.:
"A corpus-based approach to expressive speech synthesis",
79-84.
Gibert, Guillaume / Bailly, Gérard / Elisei, Frédéric:
"Audiovisual text-to-cued speech synthesis",
85-90.
Poster Sessions
Baker, Rachel / Clark, Robert A. J. / White, Michael:
"Synthesising contextually appropriate intonation in limited domains",
91-96.
Dijkstra, Jelske / Pols, Louis C. W. / Son, Rob J. J. H. van:
"Frisian TTS, an example of bootstrapping TTS for minority languages",
97-102.
Mariam, Sebsibe H. / Kishore, S. P. / Black, Alan W. / Kumar, Rohit / Sangal, Rajeev:
"Unit selection voice for Amharic using Festvox",
103-108.
Bali, Kalika / Talukdar, Partha Pratim / Krishna, N. Sridhar / Ramakrishnan, A.G.:
"Tools for the development of a Hindi speech synthesis system",
109-114.
Segi, Hiroyuki / Takagi, Tohru / Ito, Takayuki:
"A concatenative speech synthesis method using context dependent phoneme sequences with variable length as search units",
115-120.
Fackrell, Justin / Skut, Wojciech:
"Improving pronunciation dictionary coverage of names by modelling spelling variation",
121-126.
Kim, Yeon-Jun / Syrdal, Ann / Jilka, Matthias:
"Improving TTS by higher agreement between predicted versus observed pronunciations",
127-132.
Bellegarda, Jerome R.:
"A novel discontinuity metric for unit selection text-to-speech synthesis",
133-138.
Adell, Jordi / Bonafonte, Antonio:
"Towards phone segmentation for concatenative speech synthesis",
139-144.
Gustafson, Joakim / Sjölander, Kåre:
"Voice creation for conversational fairy-tale characters",
145-150.
Sakai, Shinsuke:
"F0 modeling with multi-layer additive modeling based on a statistical learning technique",
151-154.
Kominek, John / Black, Alan W.:
"Impact of durational outlier removal from unit selection catalogs",
155-160.
Hirose, Keikichi / Sato, Kentaro / Minematsu, Nobuaki:
"Corpus-based synthesis of fundamental frequency contours with various speaking styles from text using F0 contour generation process model",
161-166.
Tao, Jianhua / Kang, Yongguo:
"Multi-source based acoustic model for speech synthesis",
167-172.
Clark, Robert A. J. / Richmond, Korin / King, Simon:
"Festival 2 - build your own general purpose unit selection speech synthesiser",
173-178.
Kawai, Hisashi / Toda, Tomoki / Ni, Jinfu / Tsuzaki, Minoru / Tokuda, Keiichi:
"XIMERA: a new TTS from ATR based on corpus-based technologies",
179-184.
Tesser, Fabio / Cosi, Piero / Drioli, Carlo / Tisato, Graziano:
"Prosodic data driven modelling of a narrative style in Festival TTS",
185-190.
Zen, Heiga / Tokuda, Keiichi / Kitamura, Tadashi:
"An introduction of trajectory model into HMM-based speech synthesis",
191-196.
Krishna, N. Sridhar / Murthy, Hema A.:
"Duration modeling of Indian languages Hindi and Telugu",
197-202.
Zhang, Jason Y. / Toth, Arthur R. / Collins-Thompson, Kevyn / Black, Alan W.:
"Prominence prediction for supersentential prosodic modeling based on a new database",
203-208.
Damper, Robert I. / Marchand, Yannick / Marseters, John-David / Bazin, Alex:
"Aligning letters and phonemes for speech synthesis",
209-214.
Short Contributions
Rutten, Peter / Talkin, David:
"rvoice studio and activeprompts",
215-216.
Badino, Leonardo / Barolo, Claudia / Quazza, Silvia:
"Language independent phoneme mapping for foreign TTS",
217-218.
Zovato, Enrico / Pacchiotti, Alberto / Quazza, Silvia / Sandri, Stefano:
"Towards emotional speech synthesis: a rule based approach",
219-220.
Renato, Alejandro C. / Alvarez, José A.:
"Corpora of Latin American Spanish for research in prosody and synthesis",
221-222.
Kominek, John / Black, Alan W.:
"The CMU Arctic speech databases",
223-224.
Toth, Arthur R.:
"Forced alignment for speech synthesis databases using duration and prosodic phrase breaks",
225-226.
Gu, Wentao / Fujisaki, Hiroya / Hirose, Keikichi:
"Analysis of fundamental frequency contours of Cantonese based on a command-response model",
227-228.
Langner, Brian / Black, Alan W.:
"Creating a database of speech in noise for unit selection synthesis",
229-230.
Black, Alan W.:
"Overview of voice building"
(abstract).
Toda, Tomoki:
"Overview of voice conversion"
(abstract).