![]() |
ISCA ArchiveInternational Symposium on Chinese Spoken Language Processing (ISCSLP 2008)Kunming, China
| ![]() |
[ISCSLP 2008] International Symposium on Chinese Spoken Language Processing (ISCSLP 2008), Kunming, China, December 16-19, ISCA Archive, http://www.isca-speech.org/archive_open/iscslp2008/index.html
Tutorial
2 - A Tutorial on How to Construct and Improve
Automatic Pronunciation Proficiency Evaluation System - take PSC test as an
example
Yu Hu, Si Wei and Guoping Hu,
iFLYTEK
PLENARY
Plenary 1 - Speech-To-Speech Translation Technologies
for Real-World Applications
Yuqing Gao
J. Watson Research Center
Plenary 2 - What
Can Speech Researchers Bring to Music Processing?
Plenary 3 - Speech
and Search: Bridging The Gap
Vincent Vanhoucke
Google411
Plenary
4 - Towards Robust Speech Recognition:
Structured Modeling, Irrelevant Variability Normalization and Unsupervised
Online Adaptation
Qiang Huo
Microsoft Research Asia
SPE1
Frontiers of HMM-based TTS
SPE1.1 - Simultaneous Phrasing, Prosody, and Acoustic Model
Training for Text-to-Speech Conversion
Keiichiro Oura, Yoshihiko Nankaku, Tomoki Toda,
Keiichi Tokuda, Rannierry Maia, Shinsuke Sakai and Satoshi Nakamura
pp. 1-4
SPE1.2 - Cross-Stream Dependency Modeling for HMM-based Speech
Synthesis
Zhen-Hua Ling, Wei Zhang and Ren-Hua Wang
pp. 5-8
SPE1.3 - Cross-Lingual
Speaker Adaptation for HMM-based Speech Synthesis
Yi-Jian Wu,
Simon King and Keiichi Tokuda
pp. 9-12
SPE1.4 - HMM-Based Mixed-Language (Mandarin-English) Speech
Synthesis
Yao Qian, Hou-Wei Cao and Frank K. Soong i
pp. 13-16
SPE1.5 - Improving HMM-based Speech Synthesis by Reducing
Over-smoothing Problems
Meng Zhang,
Jian-Hua Tao, Hui-Bin Jia and Xia Wang
pp. 17-20
SPE2 Computer-Assisted Language Learning
SPE2.1 - Pronunciation Space Models for Pronunciation Evaluation
Si Wei,
Yi-Qian Pan, Guo-Ping Hu, Yu Hu and Ren-Hua Wang
pp. 21-24
SPE2.2 - Decision Fusion for Improving Mispronunciation Detection
Using Language Transfer Knowledge and Phoneme-dependent Pronunciation Scoring
W. K. Lo,
Alissa M. Harrison, Helen Meng and Lan Wang
pp. 25-28
SPE2.3 - Mandarin Learning Using Speech and Language Technologies:
A Translation Game in The Travel Domain
Yu-Shi Xu and Stephanie Seneff
pp. 29-32
SPE2.4 - Word Order Correction for Language Transfer Using
Relative Position Language Modeling
Chao-Hong Liu, Chung-Hsien Wu and Matthew Harris
pp. 33-36
SPE2.5 - Improving Automatic Evaluation of Mandarin Pronunciation
with Speaker Adaptive Training (Sat) and MLLR Speaker Adaption
Chao Huang,
Feng Zhang and Frank K. Soong
pp. 37-40
SPE2.6-
Automatic
Assessment of Language Proficiency Through Shadowing
Dean Luo, Nobuaki Minematsu, Yutaka Yamauchi and
Keikichi Hirose
pp. 41
L1 Robust Speech Recognition
L1.1
- Improvements on Mel-frequency Cepstrum Minimum-mean-square-error
Noise Suppressor for Robust Speech Recognition
Dong Yu, Li Deng, Jian Wu, Yi-Fan Gong and Alex Acero
pp. 69
Xiong Xiao, Eng Siong Chng and Hai-Zhou Li
pp. 73
Yuan-Fu Liao, Hung-Hsiang Fang and Chih-Min Yang
pp. 77
Jun Du, Qiang Huo and Yu Hu
pp. 81
Neng-Heng
Zheng, Xia Li, Hou-Wei Cao, Tan Lee and P. C. Ching
pp. 85
Omid Dehzangi,
Bin Ma, Eng Siong Chng and Hai-Zhou Li
pp. 89
L2 Speaker and Language Recognition
L2.1
- Double Gauss Based Unsupervised Score Normalization in Speaker
Verification
Wu Guo,
Li-Rong Dai and Ren-Hua Wang
pp. 165
Yi-Hsiang
Chao, Wei-Ho Tsai and Hsin-Min Wang
pp. 169
Han-Wu Sun,
Bin Ma and Hai-Zhou Li
pp. 173
Chang-Huai
You, Kong-Aik Lee, Bin Ma and Hai-Zhou Li
pp. 177
Han-Wu Sun,
Bin Ma and Hai-Zhou Li
pp. 181
Shuan-Hu
Bai and Hai-Zhou Li
pp. 185
L3 Spoken Language Systems
L3.1
- The Improved TS-base Approaches with Interference
Compensation and Their Evaluations for Speech Enhancement for Speech Enhancement
Jun-Feng
Li, Shuichi Sakamoto, Satoshi Hongo, Masato Akagi and Yoiti Suzuki
pp. 141
S. W. Lee, Frank K. Soong, P. C. Ching and Tan Lee
pp. 145
Hung-Shin
Lee and Berlin Chen
pp. 149
Jing-Jing
Liu, Yu-Shi Xu, Stephanie Seneff and Victor Zue
pp. 153
Yung-Jen
Cheng, Che-Kuang Lin and Lin-Shan Lee
pp. 157
Yong Guan
and Wen-Ju Liu
pp. 161
L4 Speech Analysis and Phonetics
L4.1
- What's in The F0 of Mandarin Speech--Tones, Intonation
and Beyond
Chiu-Yu
Tseng and Zhao-Yu Su
pp. 45
Yu-Jia Li
and Tan Lee
pp. 49
Hong-Lei
Cong, Zhi-Yong Wu, Lian-Hong Cai and Helen M. Meng
pp. 53
Yue-Ning Hu
and Min Chu
pp. 57
Yuan Jia,
Ai-Jun Li and Zi-Yu Xiong
pp. 61
Raymond W.
M. Ng and Tan Lee
pp. 65
L5 Speech Synthesis
L5.1
- Frequency Modulation Technique for Prosodic Modification
Jin-Fu Ni, Shinsuke Sakai, Tohru Shimizu and Satoshi Nakamura
pp. 117
Zhizheng
Wu, Yao Qian and Frank K. Soong
pp. 121
Tao Zhou,
Yuan Dong, De-zhi Huang, Wu Liu and Hai-la Wang
pp. 125
Cheng-Cheng Wang, Zhen-Hua Ling, Bu-Fan Zhang and Li-Rong Dai
pp. 129
Ming-Hui
Dong and Hai-Zhou Li
pp. 133
Heng Lu,
Zhen-Hua Ling, Si Wei, Yu Hu, Li-Rong Dai and Ren-Hua Wang
pp. 137
L6 Speech Recognition
L6.1
- Investigation on Adaptation Using Different
Discriminative Training Criteria Based Linear Regression and Map
Bo Zhu, Zhi-Jie Yan, Yu Hu, Zhi-Guo Wang, Li-Rong Dai and Ren-Hua Wang
pp. 93
Xin-Hui Hu, Hirofumi Yamamoto, Jin-Song Zhang, Keiji Yasuda, You-Zheng Wu
and Hideki Kashioka
pp. 97
Hsuan-Sheng Chiu, Guan-Yu Chen, Chun-Jen Lee and Berlin Chen
pp. 101
I-Fan Chen and Hsin-Min Wang
pp. 105
Yu Hu and Qiang Huo
pp. 109
Yu-Shi Xu, Jing-Jing Liu and Stephanie Seneff
pp. 113-116
P1 Speech Applications
P1.1
- Pronunciation Error Detection for Computer Assisted
Pronunciation Teaching in Mandarin
Min-siong
Liang, Ren-Yuan Lyu, Yuang-Chin Chiang and Jing-Fung Chen
pp. 346-349
P1.2
- A Two-stage Multi-feature Integration Approach to
Unsupervised Speaker Change Detection in Real-time News Broadcasting
Lei Xie and Guang-Sen Wang
pp. 350-353
P1.3 -
Automatic Prosody Boundary Labeling of Mandarin Using Both Text and Acoustic
Information
Chong-Jia
Ni, Wen-Ju Liu and Bo Xu
pp. 354-357
P1.4
- Subword Latent Semantic Analysis for TextTiling-based
Automatic Story Segmentation of Chinese Broadcast News
Yu-Lian Yang, Lei Xie
pp. 358-361
P1.5
- Multipitch Detection Based on Weighted Summary
Correlogram
Xue-Liang Zhang, Wen-Ju liu, Peng Li and Bo Xu
pp. 362-365
P1.6
- Efficient System Combination for Syllable-confusion-network-based Chinese
Spoken Term Detection
Jie Gao, Jian Shao, Qing-Wei Zhao and Yong-Hong Yan
pp. 366-369
P1.7
- The Use of Dynamic Deformable Templates for Lip Tracking
in An Audio-visual Corpus with Large Variations in Head Pose, Face
Illumination and Lip Shapes
Zhi-Yong Wu, Ji-Ying Wu and Helen M. Meng
pp. 370-373
P1.8
- Microphone Array Post-filter Based on Auditory Filtering
Peng Li, Feng-Chai Liao, Ning Cheng, Bo Xu and Wen-Ju Liu
pp. 374-377
P1.9
- Exploring Tone Variations in Chinese Dialects Using
Context Dependent Tone Models
Wei Guo and Min Chu
pp. 378-381
P2 Speech Recognition
P2.1
- A Trellis Based Fast Lattice Generating Algorithm
Wei Li, Ji
Wu and Zhi-Guo Wang
pp. 189-192
P2.2
- Order Adaptation of The Fractional Fourier Transform
Using The Intraframe Pitch Change Rate for Speech Recognition
Hui Yin,
Climent Nadeu, Volker Hohmann, Xiang Xie and Jing-Ming Kuang
pp. 193-196
P2.3
- Large Vocabulary Continuous Speech Recognition in Uyghur: Data
Preparation and Experimental Results
Nasirjan
Tursun and Wushour Silamu
pp. 197-200
P2.4
- A Improvement for Training Efficiency of Semi-tied
Covariance
Si-Bao
Chen, Yu Hu, Bin Luo and Ren-Hua Wang
pp. 201-204
P2.5
- Improved Semi-parametric Mean Trajectory Model Using
Discriminatively Trained Centroids
Ran Xu,
Jie-Lin Pan and Yong-Hong Yan
pp. 205-208
P2.6
- Local Mismatch Phone for Confidence Measure in Standard
and Accented Chinese Speech Recognition
Wen-Xiao
Cao, Yi Liu and Fang Zheng
pp. 209-212
P2.7
- A Combined Task Analysis Method for Data Selection in
Mandarin Isolated Word Recognition System
Zhi-Yang
He, Zhi-Guo Wang, Wei Li and Ji Wu
pp. 213-216
P2.8
- Mandarin Speech Recognition For Nonnative Speakers Based
on Pronunciation Dictionary Adaption
Jian Yang, Pei-Shan
Wu and Dan Xu
pp. 217-220
P2.9
- A New Similarity Measure Between HMMs
Yih-Ru Wang
pp. 221-224
P2.10
- Recognition of Syllable-contracted Words in Spontaneous
Speech Using Word Expansion and Duration Information
Wei-Bin
Liang, Chung-Hsien Wu and Yu-Kai Kang
pp. 225-228
P2.11
- Exploiting Non-target Region Information for Confidence
Measure Based on Bayesian Information Criterion
Cong Liu,
Yu Hu, Xiong-Guo Lei, Zhi-Guo Wang, Li-Rong Dai and Ren-Hua Wang
pp.
229-232
P3 Speaker Recognition
P3.1
-Simplified Deformation Compensation for Emotional Speaker
Recognition
Ying-Chun Yang, Tian Wu and Hong-Bin Lv
pp. 310-313
P3.2
- Interfusing The Confused Region Score of Speaker Verification Systems
Yan-Hua Long, Wu Guo and Li-Rong Dai
pp. 314-317
P3.3
- Parallel Phone Recognizer Based MLLR Speaker Recognition
Eryu Wang, Wu Guo and Li-Rong Dai
pp. 318-321
P3.4
- Eigenchannel Compensation and Symmetric Score for A
Robust Text-independent Speaker Verification
Yuan Dong, Jian Zhao, Xian-Yu Zhao, Liang Lu, Ji-Qing Liu and Hai-La Wang
pp. 322-325
P3.5
- A Sample and Feature Selection Scheme for Gmm-svm Based
Language Recognition
Yan Song and Li-Rong Dai
pp. 326-329
P3.6
- Speaker Recognition Using A Kind of Novel Phonotactic
Information
Xiang Zhang, Xiang Xiao, Hai-Peng Wang, Hong-Bin Suo, Qing-Wei Zhao and
Yong-Hong Yan
pp. 330-333
P3.7
- The Adaptation Schemes in PR-SVM Based Language
Recognition
Bing Xu, Yan Song and Li-Rong Dai
pp. 334-337
P3.8
- Mandarin Tone Perception with Temporal Envelope and
Periodicity Cues from Different Frequency Regions
Meng Yuan,
Tan Lee and Sigfrid D. Soli
pp. 338-341
P3.9
- Prosodic Variation in Cantonese-english Code-mixed Speech
Wen-Tao Gu,
Tan Lee and P. C. Ching
pp. 342-345
P4 Spoken Language Processing
P4.1
- Word Alignment Based on Multi-grain Model
Yan-Qing
He, Yu Zhou and Cheng-Qing Zong
pp. 269-272
P4.2
- Word Reordering Alignment for Combination of Statistical
Machine Translation Systems
Mao-Xi Li
and Cheng-Qing Zong
pp. 273-276
P4.3
- An EMD Based Approach to Transliteration Unit Alignment
Between English and Chinese
Mu-Yun Yang, Shu-Jie Liu, Sheng Li, Ju-Feng Li, Tie-Jun Zhao and Hao-Liang
Qi
pp. 277-280
P4.4
- Analysis and Modeling of Affective Audio Visual Speech
Based on Pad Emotion Space
Shen Zhang, Ying-Jin Xu, Jia Jia and Lian-Hong Cai
pp. 281-284
P4.5
- Noise Reduction Based Random Matrix Theory
XU-Gang Lu, S. Matsuda, T. Shimizu and S. Nakamura
pp. 285-288
P4.6
- Language Model Adaptation for Relevance Feedback in
Information Retrieval
Ying-Lang Chang and Jen-Tzung Chien
pp. 289-292
P4.7
- Predicting and Tagging Dialog-act Using MDP and SVM
Ke-Yan Zhou, Cheng-Qing Zong, Hua Wu and Hai-Feng Wang
pp. 293-296
P4.8
- A Synchronous Method for Automatic Scoring of Language
Learning
Bin Dong and Yong-Hong Yan
pp. 297-301
P4.9
- Using Reference to Tune Language Model for Detection of
Reading Miscues
Chang-Liang Liu, Fu-Ping Pan, Feng-Pei Ge, Bin Dong and Yong-Hong Yan
pp. 302-305
P4.10
- How Syllables Group in Chinese
Mao-Lin
Wang and Yi Xu
pp. 306-309
P5 Speech Processing
P5.1
- Prosodic Modeling for Isolated Mandarin Words and Its
Application
Hung-Kuang Shih, Chen-Yu Chiang, Yih-Ru Wang and Sin-Horng Chen
pp. 233-236
P5.2
- A CSI and Rate-Distortion Based Packet Loss Recovery Algorithm for VoIP
Zhong-Bo Li, Sheng-Hui Zhao, Jing Wang and Jing-Ming Kuang
pp. 237-240
P5.3
- Mandarin Stops Classification Based on Random Forest
Approach
Chi-Yueh
Lin and Hsiao-Chuan Wang
pp. 241-244
P5.4
- A Pitch Synchronous Method for Speech Modification
Chih-Ting Kuo and Hsiao-Chuan Wang
pp. 245-248
P5.5
- Speech Database Compacted for An Embedded Mandarin TTS
System
Qing Guo, Bin Wang and Nobuyuki Katae
pp. 249-252
P5.6
- Prosody Modification on Mixed-language Speech Synthesis
Yi Zhang
and Jian-Hua Tao
pp. 253-256
P5.7
- A Maximum Entropy Based Hierarchical Model for Automatic Prosodic
Boundary Labeling in Mandarin
Fang-Zhou
Liu, Hui-Bin Jia and Jian-Hua Tao
pp. 257-260
P5.8
- Tone Evaluation of Chinese Continuous Speech Based on
Prosodic Words
Yi-Qian Pan, Si Wei and Ren-Hua Wang
pp. 261-264
P5.9
- The Pitch Analysis of Imperative Sentences in Standard
Chinese
Jia Sun,
Ji-Lun Lu, Ai-Jun Li and Yuan Jia
pp. 265-268