4th International Conference on Spoken Language Processing
Philadelphia, PA, USA
Neural networks are used widely today for modeling a variety of different aspects of spoken language. We use them to model Finnish lexical prosody with an aim to shed light on the interaction between the main prosodic parameters: segmental durations, loudness and pitch. We have analyzed the performance of a group of networks which were all trained to generate values for a different prosodic parameter given similar input information. The experiments were performed on speech material contained within our Finnish speech database.
Bibliographic reference. Vainio, Martti / Altosaar, Toomas (1996): "Pitch, loudness, and segmental duration correlates: towards a model for the phonetic aspects of finnish prosody", In ICSLP-1996, 2052-2055.