Automatic Glottal Inverse Filtering with Non-Negative Matrix Factorization

Manu Airaksinen, Lauri Juvela, Tom Bäckström, Paavo Alku


This study presents an automatic glottal inverse filtering (GIF) technique based on separating the effect of the glottal main excitation from the impulse response of the vocal tract. The proposed method is based on a non-negative matrix factorization (NMF) based decomposition of an ultra short-term spectrogram of the analyzed signal. Unlike other state-of-the-art GIF techniques, the proposed method does not require estimation of glottal closure instants.

The proposed method was objectively evaluated with two test sets of continuous synthetic speech created with a glottal vocoding analysis/synthesis procedure. When compared to a set of reference GIF methods, the proposed NMF technique shows improved estimation accuracy especially for male voices.


DOI: 10.21437/Interspeech.2016-338

Cite as

Airaksinen, M., Juvela, L., Bäckström, T., Alku, P. (2016) Automatic Glottal Inverse Filtering with Non-Negative Matrix Factorization. Proc. Interspeech 2016, 1039-1043.

Bibtex
@inproceedings{Airaksinen+2016,
author={Manu Airaksinen and Lauri Juvela and Tom Bäckström and Paavo Alku},
title={Automatic Glottal Inverse Filtering with Non-Negative Matrix Factorization},
year=2016,
booktitle={Interspeech 2016},
doi={10.21437/Interspeech.2016-338},
url={http://dx.doi.org/10.21437/Interspeech.2016-338},
pages={1039--1043}
}