INTERSPEECH 2009
10th Annual Conference of the International Speech Communication Association

Brighton, United Kingdom
September 6-10, 2009

A Comparison of Linear and Nonlinear Dimensionality Reduction Methods Applied to Synthetic Speech

Andrew Errity, John McKenna

Dublin City University, Ireland

In this study a number of linear and nonlinear dimensionality reduction methods are applied to high dimensional representations of synthetic speech to produce corresponding low dimensional embeddings. Several important characteristics of the synthetic speech, such as formant frequencies and f0, are known and controllable prior to dimensionality reduction. The degree to which these characteristics are retained after dimensionality reduction is examined in visualisation and classification experiments. Results of these experiments indicate that each method is capable of discovering meaningful low dimensional representations of synthetic speech and that the nonlinear methods may outperform linear methods in some cases.

Full Paper

Bibliographic reference.  Errity, Andrew / McKenna, John (2009): "A comparison of linear and nonlinear dimensionality reduction methods applied to synthetic speech", In INTERSPEECH-2009, 1095-1098.