For pathological voices, hoarseness is mainly due to jitter and air flow turbulence in the vocal tract. In this paper, speech denoising is performed in time-domain, by means of a fast and reliable subspace approach. A low-order singular value decomposition allows separating the signal and the noise subspace of an appropriate data matrix, obtained from short data frames. The filtered signal is reconstructed along the directions spanned by the eigenvectors associated to the signal subspace eigenvalues, thus disregarding the noise contribution. This approach was found to work well for hoarse voices coming from cordectomised patients, giving enhanced voice quality. Its simple structure allows a real-time implementation, suitable for portable device realisation.
Cite as: Manfredi, C., D'Aniello, M., Bruscaglioni, P. (2001) Comparison between AR and SVD approaches for speech denoising. Proc. Models and Analysis of Vocal Emissions for Biomedical Applications (MAVEBA 2001), 211-216
@inproceedings{manfredi01_maveba, author={Claudia Manfredi and Massimo D'Aniello and Piero Bruscaglioni}, title={{Comparison between AR and SVD approaches for speech denoising}}, year=2001, booktitle={Proc. Models and Analysis of Vocal Emissions for Biomedical Applications (MAVEBA 2001)}, pages={211--216} }