INTERSPEECH 2008
9th Annual Conference of the International Speech Communication Association

Brisbane, Australia
September 22-26, 2008

Cochannel Speech Separation Using Multi-Pitch Estimation and Model Based Voiced Sequential Grouping

Ming Li, Chuan Cao, Di Wang, Ping Lu, Qiang Fu, Yonghong Yan

Chinese Academy of Sciences, China

In this paper, a new cochannel speech separation algorithm using multi-pitch extraction and speaker model based sequential grouping is proposed. After auditory segmentation based on onset and offset analysis, robust multi-pitch estimation algorithm is performed on each segment and the corresponding voiced portions are segregated. Then speaker pair model based on support vector machine (SVM) is employed to determine the optimal sequential grouping alignments and group the speaker homogeneous segments into pure speaker streams. Systematic evaluation on the speech separation challenge database shows significant improvement over the baseline performance.

Full Paper

Bibliographic reference.  Li, Ming / Cao, Chuan / Wang, Di / Lu, Ping / Fu, Qiang / Yan, Yonghong (2008): "Cochannel speech separation using multi-pitch estimation and model based voiced sequential grouping", In INTERSPEECH-2008, 151-154.