ISCA Archive Interspeech 2005
ISCA Archive Interspeech 2005

Distributed ASR using speech coder data for efficient feature vector representation

Trond Skogstad, Torbjørn Svendsen

This paper proposes an alternative approach to distributed speech recognition in scenarios where both reliable feature vectors and the reconstruction of the speech signal are required. By transmitting the difference between speech coded information and the desired feature vectors, this system achieves both excellent quality speech reconstruction and ASR recognition performance. Experiments show that a transparent recognition rate is achieved with as little as 0.6 kbps of additional information supplementing the AMR speech coder operating at 4.75 kbps. The total rate is comparable to the ETSI 202 211 extended front-end standard.


doi: 10.21437/Interspeech.2005-837

Cite as: Skogstad, T., Svendsen, T. (2005) Distributed ASR using speech coder data for efficient feature vector representation. Proc. Interspeech 2005, 2861-2864, doi: 10.21437/Interspeech.2005-837

@inproceedings{skogstad05_interspeech,
  author={Trond Skogstad and Torbjørn Svendsen},
  title={{Distributed ASR using speech coder data for efficient feature vector representation}},
  year=2005,
  booktitle={Proc. Interspeech 2005},
  pages={2861--2864},
  doi={10.21437/Interspeech.2005-837}
}