INTERSPEECH 2004 - ICSLP
Conventional circuit-switched networks are increasingly being replaced by packet-based networks for voice communication applications. Additionally, there has been an increased deployment of services supporting speech based interactions. These trends demand reliable transmission of speech data not just for playback but also to ensure acceptable automatic speech recognition (ASR) performance. In this paper, we present an overview of techniques that have been investigated to improve ASR performance against two major degradation factors in the context of packet networks: (1) information loss due to a low bit-rate codec and (2) packet loss due to channel (network) conditions. In addition, we highlight another key issue, packet loss rate, by showing ASR performance as a function of packet size and channel condition.
Bibliographic reference. Srinivasamurthy, Naveen / Han, Kyu Jeong / Narayanan, Shrikanth (2004): "Robust speech recognition over packet networks: an overview", In INTERSPEECH-2004, 621-624.