ISCA Archive Interspeech 2005
ISCA Archive Interspeech 2005

Artificial bandwidth extension of speech supported by watermark-transmitted side information

Bernd Geiser, Peter Jax, Peter Vary

In recent years research in the area of artificial bandwidth extension (BWE) of speech signals has made significant progress. The intention of BWE is to produce wideband speech with a cut-off frequency of for instance 7 kHz from a narrowband version (e.g., with telephone bandwidth, i.e., 300Hz - 3.4 kHz). The respective algorithms are based on the estimation of parameters of a source model for speech production given the knowledge of the narrowband signal. A theoretical performance bound on this estimation has been formulated in [1]. In order to overcome this boundary we propose the transmission of (compact) side information which can support the parameter estimation. Since a common additional channel would conflict with the requirement of backwards compatibility in narrowband communication systems, the side information is embedded as digital watermark into the narrowband speech.


doi: 10.21437/Interspeech.2005-526

Cite as: Geiser, B., Jax, P., Vary, P. (2005) Artificial bandwidth extension of speech supported by watermark-transmitted side information. Proc. Interspeech 2005, 1497-1500, doi: 10.21437/Interspeech.2005-526

@inproceedings{geiser05_interspeech,
  author={Bernd Geiser and Peter Jax and Peter Vary},
  title={{Artificial bandwidth extension of speech supported by watermark-transmitted side information}},
  year=2005,
  booktitle={Proc. Interspeech 2005},
  pages={1497--1500},
  doi={10.21437/Interspeech.2005-526}
}