ISCA Archive

Fifth ISCA ITRW on Speech Synthesis

June 14-16, 2004
Pittsburgh, PA, USA



Bibliographic Reference

[SSW5-2004] Fifth ISCA ITRW on Speech Synthesis (SSW5), Pittsburgh, PA, USA, June 14-16, 2004, ed. by Alan W. Black and Kevin Lenzo, ISCA Archive, http://www.isca-speech.org/archive_open/ssw5



Author Index and Quick Access to Abstracts

Aaron   Adell   Agüero   Alvarez   Aylett   Badino   Bailly   Baker   Bakis   Bali   Barolo   Bazin   Bellegarda   Black (31)   Black (103)   Black (155)   Black (203)   Black (223)   Black (229)   Black (231)   Bonafonte (67)   Bonafonte (139)   Braunschweiler   Clark (91)   Clark (173)   Collins-Thompson   Conkie   Cosi   Damper   Dijkstra   Dogil   Drioli   Eide   Elisei   Fackrell   Fujisaki   Gibert   Gu   Gustafson   Hamza   Hirai   Hirose (161)   Hirose (227)   Hosom   Ito   Jilka   Kain   Kamoshida   Kang   Kawai   Kim   King (7)   King (19)   King (173)   Kishore   Kitamura   Klabbers (61)   Klabbers (73)   Kominek (155)   Kominek (223)   Krishna (109)   Krishna (197)   Kumar   Langner   Marchand   Mariam   Marseters   Marsi   Miao   Minematsu   Minoru   Mishra   Möbius   Murthy   Nagamatsu   Ni   Niu   Nukaga   Pacchiotti   Picheny   Pitrelli   Pols   Quazza (217)   Quazza (219)   Ramakrishnan   Renato   Richmond   Rutten   Sakai   Sandri   Sangal   van Santen (25)   van Santen (61)   van Santen (73)   Sato   Schweitzer   Segi   Shiga   Sjölander   Skut   Son   Syrdal (49)   Syrdal (127)   Takagi   Talkin   Talukdar   Tao   Tenpaku   Tesser   Tisato   Toda (31)   Toda (179)   Toda (232)   Tokuda (31)   Tokuda (179)   Tokuda (191)   Toth (203)   Toth (225)   Tsuzaki   Vepa   White   Zen   Zhang   Zovato  

Names written in boldface refer to first authors. Full papers can be accessed from the abstracts (ISCA members only).



Table of Contents and Access to Abstracts

Oral Sessions

Schweitzer, Antje / Braunschweiler, Norbert / Dogil, Grzegorz / Möbius, Bernd: "Assessing the acceptability of the Smartkom speech synthesis voices", 1-6.

Vepa, Jithendra / King, Simon: "Subjective evaluation of join cost & smoothing methods", 7-12.

Marsi, Erwin: "Optionality in evaluating prosody prediction", 13-18.

Shiga, Yoshinori / King, Simon: "Accurate spectral envelope estimation for articulation-to-speech synthesis", 19-24.

Kain, Alexander / Niu, Xiaochuan / Hosom, John-Paul / Miao, Qi / Santen, Jan P. H. van: "Formant re-synthesis of dysarthric speech", 25-30.

Toda, Tomoki / Black, Alan W. / Tokuda, Keiichi: "Mapping from articulatory movements to vocal tract spectrum with Gaussian mixture model for articulatory speech synthesis", 31-36.

Hirai, Toshio / Tenpaku, Seiichi: "Using 5 ms segments in concatenative speech synthesis", 37-42.

Nukaga, Nobuo / Kamoshida, Ryota / Nagamatsu, Kenji: "Unit selection using pitch synchronous cross correlation for Japanese concatenative speech synthesis", 43-48.

Syrdal, Ann K. / Conkie, Alistair D.: "Data-driven perceptually based join costs", 49-54.

Aylett, Matthew: "Merging data driven and rule based prosodic models for unit selection TTS", 55-60.

Santen, Jan P. H. van / Mishra, Taniya / Klabbers, Esther: "Estimating phrase curves in the general superpositional intonation model", 61-66.

Agüero, Pablo Daniel / Bonafonte, Antonio: "Intonation modeling for TTS using a joint extraction and prediction approach", 67-72.

Klabbers, Esther / Santen, Jan P. H. van: "Clustering of foot-based pitch contours in expressive speech", 73-78.

Eide, E. / Aaron, A. / Bakis, R. / Hamza, W. / Picheny, Michael / Pitrelli, J.: "A corpus-based approach to expressive speech synthesis", 79-84.

Gibert, Guillaume / Bailly, Gérard / Elisei, Frédéric: "Audiovisual text-to-cued speech synthesis", 85-90.

Poster Sessions

Baker, Rachel / Clark, Robert A. J. / White, Michael: "Synthesising contextually appropriate intonation in limited domains", 91-96.

Dijkstra, Jelske / Pols, Louis C. W. / Son, Rob J. J. H. van: "Frisian TTS, an example of bootstrapping TTS for minority languages", 97-102.

Mariam, Sebsibe H. / Kishore, S. P. / Black, Alan W. / Kumar, Rohit / Sangal, Rajeev: "Unit selection voice for Amharic using Festvox", 103-108.

Bali, Kalika / Talukdar, Partha Pratim / Krishna, N. Sridhar / Ramakrishnan, A.G.: "Tools for the development of a Hindi speech synthesis system", 109-114.

Segi, Hiroyuki / Takagi, Tohru / Ito, Takayuki: "A concatenative speech synthesis method using context dependent phoneme sequences with variable length as search units", 115-120.

Fackrell, Justin / Skut, Wojciech: "Improving pronunciation dictionary coverage of names by modelling spelling variation", 121-126.

Kim, Yeon-Jun / Syrdal, Ann / Jilka, Matthias: "Improving TTS by higher agreement between predicted versus observed pronunciations", 127-132.

Bellegarda, Jerome R.: "A novel discontinuity metric for unit selection text-to-speech synthesis", 133-138.

Adell, Jordi / Bonafonte, Antonio: "Towards phone segmentation for concatenative speech synthesis", 139-144.

Gustafson, Joakim / Sjölander, Kåre: "Voice creation for conversational fairy-tale characters", 145-150.

Sakai, Shinsuke: "F0 modeling with multi-layer additive modeling based on a statistical learning technique", 151-154.

Kominek, John / Black, Alan W.: "Impact of durational outlier removal from unit selection catalogs", 155-160.

Hirose, Keikichi / Sato, Kentaro / Minematsu, Nobuaki: "Corpus-based synthesis of fundamental frequency contours with various speaking styles from text using F0 contour generation process model", 161-166.

Tao, Jianhua / Kang, Yongguo: "Multi-source based acoustic model for speech synthesis", 167-172.

Clark, Robert A. J. / Richmond, Korin / King, Simon: "Festival 2 - build your own general purpose unit selection speech synthesiser", 173-178.

Kawai, Hisashi / Toda, Tomoki / Ni, Jinfu / Tsuzaki, Minoru / Tokuda, Keiichi: "XIMERA: a new TTS from ATR based on corpus-based technologies", 179-184.

Tesser, Fabio / Cosi, Piero / Drioli, Carlo / Tisato, Graziano: "Prosodic data driven modelling of a narrative style in Festival TTS", 185-190.

Zen, Heiga / Tokuda, Keiichi / Kitamura, Tadashi: "An introduction of trajectory model into HMM-based speech synthesis", 191-196.

Krishna, N. Sridhar / Murthy, Hema A.: "Duration modeling of Indian languages Hindi and Telugu", 197-202.

Zhang, Jason Y. / Toth, Arthur R. / Collins-Thompson, Kevyn / Black, Alan W.: "Prominence prediction for supersentential prosodic modeling based on a new database", 203-208.

Damper, Robert I. / Marchand, Yannick / Marseters, John-David / Bazin, Alex: "Aligning letters and phonemes for speech synthesis", 209-214.

Short Contributions

Rutten, Peter / Talkin, David: "rvoice studio and activeprompts", 215-216.

Badino, Leonardo / Barolo, Claudia / Quazza, Silvia: "Language independent phoneme mapping for foreign TTS", 217-218.

Zovato, Enrico / Pacchiotti, Alberto / Quazza, Silvia / Sandri, Stefano: "Towards emotional speech synthesis: a rule based approach", 219-220.

Renato, Alejandro C. / Alvarez, José A.: "Corpora of Latin American Spanish for research in prosody and synthesis", 221-222.

Kominek, John / Black, Alan W.: "The CMU Arctic speech databases", 223-224.

Toth, Arthur R.: "Forced alignment for speech synthesis databases using duration and prosodic phrase breaks", 225-226.

Gu, Wentao / Fujisaki, Hiroya / Hirose, Keikichi: "Analysis of fundamental frequency contours of Cantonese based on a command-response model", 227-228.

Langner, Brian / Black, Alan W.: "Creating a database of speech in noise for unit selection synthesis", 229-230.

Black, Alan W.: "Overview of voice building" (abstract).

Toda, Tomoki: "Overview of voice conversion" (abstract).


Original Workshop Website

The link to the original workshop website remains valid only for as long as that site is maintained.