ISCA Archive Interspeech 2009

Monaural segregation of voiced speech using discriminative random fields

Rohit Prabhavalkar, Zhaozhang Jin, Eric Fosler-Lussier

Techniques for separating speech from background noise and other sources of interference have important applications for robust speech recognition and speech enhancement. Many traditional computational auditory scene analysis (CASA) based approaches decompose the input mixture into a time-frequency (T-F) representation and attempt to identify the T-F units where the target energy dominates that of the interference. This is accomplished using a two-stage process of segmentation and grouping. In this pilot study, we explore the use of Discriminative Random Fields (DRFs) for the task of monaural speech segregation. We find that the use of DRFs allows us to effectively combine multiple auditory features into the system, while simultaneously integrating the two CASA stages into one. Our preliminary results suggest that CASA-based approaches may benefit from the DRF framework.


doi: 10.21437/Interspeech.2009-260

Cite as: Prabhavalkar, R., Jin, Z., Fosler-Lussier, E. (2009) Monaural segregation of voiced speech using discriminative random fields. Proc. Interspeech 2009, 856-859, doi: 10.21437/Interspeech.2009-260

@inproceedings{prabhavalkar09_interspeech,
  author={Rohit Prabhavalkar and Zhaozhang Jin and Eric Fosler-Lussier},
  title={{Monaural segregation of voiced speech using discriminative random fields}},
  year=2009,
  booktitle={Proc. Interspeech 2009},
  pages={856--859},
  doi={10.21437/Interspeech.2009-260}
}