Within the context of a multi-speaker corpus of spoken Australian English the impact of speaker characteristics on a simple isolated-word recognition system are analysed. The issue of concern in this study is to measure the effect of the selection of speakers used to train the system, and to devise methods for selecting those speakers likely to comprise the best possible training set for the system. Of all factors affecting performance, the selection of speakers comprising the training set is shown to be the most important. Given a limited amount of speech data from a total population of users, an algorithm is developed to select a sub-set of speakers on whose speech data the system may be trained to obtain optimum performance over all the speakers. A training set that comprises speakers whose characteristics evenly sample the entire speaker space is shown to have attractive properties.
Cite as: Millar, J.B., Hawkins, S.R. (1990) Selecting representative speakers. Proc. ESCA Workshop on Speaker Characterization in Speech Technology, 161-166
@inproceedings{millar90_scst, author={J. Bruce Millar and S. R. Hawkins}, title={{Selecting representative speakers}}, year=1990, booktitle={Proc. ESCA Workshop on Speaker Characterization in Speech Technology}, pages={161--166} }