September 22-25, 1997
Speaker variability is a major problem in today's state-of- the-art speech recognition systems. Parameterisation of speech in terms of Acoustic Parameters (APs) motivated by phonetic feature theory has shown to be more robustness to speaker variability as compared to cepstral coefficients when tested on the task of broad-class recognition . Also APs has been successfully applied for identification of semivowels [2,3]. The aim of the present study is to investigate the use of APs for phoneme recognition. An extended set of features is used to distinguish between all phonemes in the TIMIT database and APs related to the extended feature set are found in literature. A separability measure is calculated to investigate the importance of the suggested APs for the separation of phonemes and feature classes. Results show that the APs that are the most important for separation of classes of phonetic features are also the most important for separation of phonemes classes. This indicates that phonemes can be recognised on the basis of phonetic features captured by the use of APs. However much work still needs to be done to understand and reliably extract all of the acoustic correlates of the phonetic features applied.
Bibliographic reference. Hansen, Anya Varnich (1997): "Acoustic parameters optimised for recognition of phonetic features", In EUROSPEECH-1997, 397-400.