EUROSPEECH 2003 - INTERSPEECH 2003
8th European Conference on Speech Communication and Technology

Geneva, Switzerland
September 1-4, 2003

        

Time-Domain Based Temporal Processing with Application of Orthogonal Transformations

Petr Motlicek, Jan Cernocký

Brno University of Technology, Czech Republic

In the paper, novel approach that efficiently extracts the temporal information of speech has been proposed. This algorithm is fully employed in time-domain, and the preprocessing blocks are well justified by psychoacoustic studies. The achieved results show the different properties of proposed algorithm compared to the traditional approach. The algorithm is advantageous in terms of possible modifications and computational inexpensiveness. Then, in our experiments, we have focused on different representation of time trajectories. Classical methods that are efficient in conventional feature extraction approaches showed not to be suitable to approximate temporal trajectories of speech. However, the application of some orthogonal transformations, such as discrete Fourier transform or discrete cosine transform, on top of previously derived temporal trajectories outperforms classification in original domain. In addition, these transformed features are very efficient to reduce the dimensionality of data.

Full Paper

Bibliographic reference.  Motlicek, Petr / Cernocký, Jan (2003): "Time-domain based temporal processing with application of orthogonal transformations", In EUROSPEECH-2003, 821-824.