Sixth International Conference on Spoken Language Processing
(ICSLP 2000)

Beijing, China
October 16-20, 2000

Japanese Intonation Synthesis Using Superposition and Linear Alignment Models

Jennifer J. Venditti (1), Jan P. H. van Santen (2)

(1) Bell Laboratories and Ohio State University, USA
(2) Oregon Graduate Institute Center for Spoken Language Understanding, USA

This paper outlines a new approach to Tokyo Japanese intonation synthesis, in which the F0 contour of an utterance is generated using the superposition of multi-level phrase curves and lexical accent curves, coupled with linear alignment models which determine the precise alignment of the curves with the segmental material. We first discuss the construction of a phrase curve used to model the prosodic domain termed the 'UA-group' (defined below), and describe the alignment of this curve with the syllabic structure of the utterance. Then, we describe a separate accent curve, carrying independent prominence specification, which is added to this UA-curve in the case of accented phrases. The alignment of the accent curve with the segments is determined by linear alignment models.

Full Paper

Bibliographic reference.  Venditti, Jennifer J. / Santen, Jan P. H. van (2000): "Japanese intonation synthesis using superposition and linear alignment models", In ICSLP-2000, vol.1, 605-608.