In this paper, a new cochannel speech separation algorithm using multi-pitch extraction and speaker model based sequential grouping is proposed. After auditory segmentation based on onset and offset analysis, robust multi-pitch estimation algorithm is performed on each segment and the corresponding voiced portions are segregated. Then speaker pair model based on support vector machine (SVM) is employed to determine the optimal sequential grouping alignments and group the speaker homogeneous segments into pure speaker streams. Systematic evaluation on the speech separation challenge database shows significant improvement over the baseline performance.
Bibliographic reference. Li, Ming / Cao, Chuan / Wang, Di / Lu, Ping / Fu, Qiang / Yan, Yonghong (2008): "Cochannel speech separation using multi-pitch estimation and model based voiced sequential grouping", In INTERSPEECH-2008, 151-154.