ISCA Archive Interspeech 2008

Analysis of drivers' speech in a car environment

Tomoyuki Kato, Jun Okamoto, Makoto Shozakai

To accelerate the adoption of speech recognition systems by the public, understanding the characteristics of speech in real environments is one of the most important issues. This paper reports variations of speech characteristics in a car environment. Analyzing speech characteristics in a specific environment requires a corpus recorded carefully so that utterances and conditions are equal across the whole set of speakers. We created a new corpus named "Drivers' Japanese Speech Corpus in a Car Environment (DJS-C)", composed of utterances of words useful for operating in-vehicle information appliances. Analysis of the DJS-C corpus shows that speech characteristics differ widely among drivers and change with driving conditions. Quantitative analysis and speech recognition experiments show that recognition performance degrades due to Distance between Phonemes, Uniqueness of Speaker's Voice, and SNNR.

doi: 10.21437/Interspeech.2008-454

Cite as: Kato, T., Okamoto, J., Shozakai, M. (2008) Analysis of drivers' speech in a car environment. Proc. Interspeech 2008, 1634-1637, doi: 10.21437/Interspeech.2008-454

  author={Tomoyuki Kato and Jun Okamoto and Makoto Shozakai},
  title={{Analysis of drivers' speech in a car environment}},
  booktitle={Proc. Interspeech 2008},