This paper examines the system combination issue for syllableconfusion-network (SCN) -based Chinese spoken term detection (STD). System combination for STD usually leads to improvements in accuracy but suffers from increased index size or complicated index structure. This paper explores methods for efficient combination of a word-based system and a syllable-based system while keeping the compactness of the indices. First, a composite SCN is generated using two approaches: lattice combination (The SCN is generated from a combined lattice) and confusion network combination (Two SCNs are combined into one). Then a simple compact index is constructed from this composite SCN by merging crosssystem redundant information. The experimental result on a 60-hour corpus shows a relative accuracy improvement of 14.7% is achieved over the baseline syllable-based system. Meanwhile, it reduces the index size by 22.3% compared to the commonly adopted score combination method when achieves comparable accuracy. Index Terms— syllable confusion network, Chinese spoken term detection, system combination, speech indexing
Cite as: Gao, J., Shao, J., Zhao, Q.-W., Yan, Y.-H. (2008) Efficient System Combination for Syllable-confusion-network-based Chinese Spoken Term Detection. Proc. International Symposium on Chinese Spoken Language Processing, 366-369
@inproceedings{gao08b_iscslp, author={Jie Gao and Jian Shao and Qing-Wei Zhao and Yong-Hong Yan}, title={{Efficient System Combination for Syllable-confusion-network-based Chinese Spoken Term Detection}}, year=2008, booktitle={Proc. International Symposium on Chinese Spoken Language Processing}, pages={366--369} }