ISCA Archive SpeechProsody 2008
ISCA Archive SpeechProsody 2008

Phoneme dedicated ANN improves segmental duration model

João Paulo Teixeira, Diamantino Freitas

The Phoneme Dedicated Artificial Neural Network (PDANN) segmental duration model consists of a set of ANNs trained specifically for each phoneme segment in order to avoid miscellaneous influence of different types of phoneme segments. Therefore, each ANN is dedicated to predict the duration of a specific phoneme segment. Objective and subjective measurements of the performance of the PDANN model were compared with those of a typical ANN model using the same input features and database. The results indicate a slight, but clear, perceptually perceived preference towards the PDANN.


Cite as: Teixeira, J.P., Freitas, D. (2008) Phoneme dedicated ANN improves segmental duration model. Proc. Speech Prosody 2008, 371-374

@inproceedings{teixeira08_speechprosody,
  author={João Paulo Teixeira and Diamantino Freitas},
  title={{Phoneme dedicated ANN improves segmental duration model}},
  year=2008,
  booktitle={Proc. Speech Prosody 2008},
  pages={371--374}
}