Sixth European Conference on Speech Communication and Technology
This paper reports on present work, in which a recurrent neural network is trained to estimate ‘velum height’ during continuous speech. Parallel acoustic-articulatory data comprising more than 400 read TIMIT sentences is obtained using electromagnetic articulography (EMA). This data is processed and used as training data for a range of neural network sizes. The network demonstrating the highest accuracy is identified. This performance is then evaluated in detail by analysing the network’s output for each phonetic segment contained in 50 hand-labelled utterances set aside for testing purposes.
Full Paper (PDF) Gnu-Zipped Postscript
Bibliographic reference. Richmond, Korin (1999): "Estimating velum height from acoustics during continuous speech", In EUROSPEECH'99, 149-152.