A joint analysis-synthesis framework is developed for the compressive sensing recovery of speech signals. The signal is assumed to be sparse in the residual domain with the linear prediction filter used as the sparse transformation. Importantly this transform is not known apriori, since estimating the predictor filter requires the knowledge of the signal. Two prediction filters, one comb filter for pitch and another all pole formant filter are needed to induce maximum sparsity. An iterative method is proposed for the estimation of both the prediction filters and the signal itself. Formant prediction filter is used as the synthesis transform, while the pitch filter is used to model the periodicity in the residual excitation signal, in the analysis mode. Significant improvement in the LLR measure is seen over the previously reported formant filter estimation.
Index Terms: Linear prediction, Analysis/Synthesis, Compressive Sensing
Bibliographic reference. Chetupally, Srikanth Raj / Sreenivas, Thippur V. (2012): "Joint pitch-analysis formant-synthesis framework for CS recovery of speech", In INTERSPEECH-2012, 947-950.