Deep Sensing of Breathing Signal During Conversational Speech

Venkata Srikanth Nallanthighal, Aki Härmä, Helmer Strik


In this paper, we show the first results on the estimation of breathing signal from conversational speech using deep learning algorithms. Respiratory diseases such as COPD, asthma, and respiratory infections are common in the elderly population and patients in health care monitoring and medical alert services in general. In this work, we compare algorithms for the estimation of a known respiratory target signal, measured by respiratory belt transducers positioned across the rib cage and abdomen, from conversational speech. We demonstrate the estimation of the respiratory signal from speech using convolutional and recurrent neural networks. The estimated breathing pattern gives respiratory rate, breathing capacity and thus might provide indications of the pathological condition of the speaker. Evaluation of our model on our database of breathing signal and speech yielded a sensitivity of 91.2% for breath event detection and a mean absolute error of 1.01 breaths per minute for breathing rate estimation.


 DOI: 10.21437/Interspeech.2019-1796

Cite as: Nallanthighal, V.S., Härmä, A., Strik, H. (2019) Deep Sensing of Breathing Signal During Conversational Speech. Proc. Interspeech 2019, 4110-4114, DOI: 10.21437/Interspeech.2019-1796.


@inproceedings{Nallanthighal2019,
  author={Venkata Srikanth Nallanthighal and Aki Härmä and Helmer Strik},
  title={{Deep Sensing of Breathing Signal During Conversational Speech}},
  year=2019,
  booktitle={Proc. Interspeech 2019},
  pages={4110--4114},
  doi={10.21437/Interspeech.2019-1796},
  url={http://dx.doi.org/10.21437/Interspeech.2019-1796}
}