A large-scale Japanese speech database has been described. The database basically consists of 1) word speech database, 2) continuous speech database, 3) database for large number of speakers, and 4) database for speech synthesis. Multiple transcriptions have been made in five different layers from a simple phonemic descriptions to fine acoustic-phonetic transcriptions. The database has been used to develop algorithms, in speech recognition and synthesis studies and to find acoustic, phonetic and linguistic evidences that will serve as a basic data for speech technologies.
Cite as: Kurematsu, A., Takeda, K., Kuwabara, H., Shikano, K. (1989) ATR Japanese speech database as a tool of speech recognition and synthesis. Proc. Speech Input/Output Assessment and Speech Databases, Vol.2, 43-46
@inproceedings{kurematsu89_sioa, author={Akira Kurematsu and Kazuya Takeda and Hisao Kuwabara and Kiyohiro Shikano}, title={{ATR Japanese speech database as a tool of speech recognition and synthesis}}, year=1989, booktitle={Proc. Speech Input/Output Assessment and Speech Databases}, pages={Vol.2, 43-46} }