Manuel Reyes-Gomez, Nebojsa Jojic, Daniel P. W. Ellis (2004), Towards single-channel unsupervised source separation of speech mixtures: the layered harmonics/formants separation-tracking model, SAPA
Aarthi M. Reddy, Bhiksha Raj (2004), Soft mask estimation for single channel speaker separation, SAPA
Stefan Winter, Hiroshi Sawada, Shoko Araki, Shoji Makino (2004), Hierarchical clustering applied to overcomplete BSS for convolutive mixtures, SAPA
Tuomas Virtanen (2004), Separation of sound sources by convolutive sparse coding, SAPA
Futoshi Asano, Hideki Asoh (2004), Sound source localization and separation based on the EM algorithm, SAPA
Guillaume Lathoud, Iain A. McCowan (2004), A sector-based approach for localization of multiple speakers with microphone arrays, SAPA
Hynek Hermansky (2004), Stochastic techniques in deriving perceptual knowledge, SAPA
Chunghsin Yeh, Axel Röbel (2004), Physical principles driven joint evaluation of multiple f0 hypotheses, SAPA
Tomohiro Nakatani, Keisuke Kinoshita, Masato Miyoshi, Parham S. Zolfaghari (2004), Harmonicity based blind dereverberation with time warping, SAPA
Guoning Hu, DeLiang Wang (2004), Auditory segmentation based on event detection, SAPA
Daniel P. W. Ellis, Keansub Lee (2004), Features for segmenting and classifying long-duration recordings of "personal" audio, SAPA
Marios Athineos, Hynek Hermansky, Daniel P. W. Ellis (2004), PLP-squared: autoregressive modeling of auditory-like 2-d spectro-temporal patterns, SAPA
Werner Hemmert, Marcus Holmberg, David Gelbart (2004), Auditory-based automatic speech recognition, SAPA
John Hershey, Trausti Kristjansson, Zhengyou Zhang (2004), Model-based fusion of bone and air sensors for speech enhancement and robust speech recognition, SAPA
Rajkishore Prasad, Hiroshi Saruwatari, Kiyohiro Shikano (2004), MAP estimation of speech spectral component under GGD a priori, SAPA
Yasunari Obuchi (2004), Multiple-microphone robust speech recognition using decoder-based channel selection, SAPA
Plamen Prodanov, Andrzej Drygajlo (2004), Bayesian networks for error handling through multimodality fusion in spoken dialogues with mobile robots, SAPA
Hugo de Paula, Hani Yehia, Mauricio A. Loureiro (2004), Representation and classification of the timbre space of a single musical instrument, SAPA
Shigeki Sagayama, Keigo Takahashi, Hirokazu Kameoka, Takuya Nishimoto (2004), Specmurt anasylis: a piano-roll-visualization of polyphonic music signal by deconvolution of log-frequency spectrum, SAPA
Kazuyoshi Yoshii, Masataka Goto, Hiroshi G. Okuno (2004), Drum sound identification for polyphonic music using template adaptation and matching methods, SAPA
Matti P. Ryynänen, Anssi P. Klapuri (2004), Modelling of note events for singing transcription, SAPA
Paris Smaragdis (2004), Discovering auditory objects through non-negativity constraints, SAPA
Renana Peres (2001), Beyond the Equal Error Rate - About the inter-relationship between algorithm and application, Odyssey
Christian J. Wellekens (2001), Seamless navigation in audio files, Odyssey
James L. Wayman (2001), Theory, characterization and testing of general biometric technologies, Odyssey
George Doddington (2001), Speaker Recognition Evaluation -- a challenge and an opportunity, Odyssey
Hirotaka Nakasone (2001), Speaker recognition in forensic environment, Odyssey
Regis Quelavoine (2001), Patent: a public disclosure of intellectual property, Odyssey
Josef Confino (2001), Listen to the Customers: Implementation of a speaker verification system in the bank industry, Odyssey
Mark A. Przybocki, Alvin F. Martin (2001), Odyssey text independent evaluation data, Odyssey
Eric G. Hansen, Raymond E. Slyh, Timothy R. Anderson (2001), Formant and F0 features for speaker recognition, Odyssey
A. Higgins, L. Bahler (2001), Password-based voice verification using SpeakerKey, Odyssey
Orith Toledo-Ronen (2001), Speech detection for text-dependent speaker verification, Odyssey
Alvin F. Martin, Mark A. Przybocki (2001), The NIST Speaker Recognition Evaluations: 1996-2001, Odyssey
Ran D. Zilca (2001), Using second order statistics for text independent speaker verification, Odyssey
Jamal Kharroubi, Dijana Petrovska-Delacrétaz, Gérard Chollet (2001), Text-independent speaker verification using support vector machines, Odyssey
Walter D. Andrews, Mary A. Kohler, Joseph P. Campbell, John J. Godfrey (2001), Phonetic, idiolectal and acoustic speaker recognition, Odyssey
Ivan Magrin-Chagnolleau, Guillaume Gravier, Raphael Blouet (2001), Overview of the 2000-2001 ELISA Consortium research activities, Odyssey
Dat Tran, Michael Wagner (2001), A generalised normalisation method for speaker verification, Odyssey
Corinne Fredouille, Jean-Francois Bonastre, Teva Merlin (2001), Bayesian approach based decision in speaker verification, Odyssey
Roland Auckenthaler, John S. Mason (2001), Gaussian selection applied to text-independent speaker verification, Odyssey
William D. Voiers (2001), Evaluating the effects of communication systems on speaker recognizability by human listeners: The Diagnostic Speaker Recognizability Test (DSRT), Odyssey
Lit Ping Wong, Martin J. Russell (2001), Speaker verification under additive noise conditions with non-stationary SNR using parallel model combination (PMC), Odyssey
Iain A. McCowan, Jason Pelecanos, Sridha Sridharan (2001), Robust speaker recognition using microphone arrays, Odyssey
Andrzej Drygajlo, Mounir El-Maliki (2001), Integration and imputation methods for unreliable feature compensation in GMM based speaker verification, Odyssey
Robert B. Dunn, Thomas F. Quatieri, Douglas A. Reynolds, Joseph P. Campbell (2001), Speaker recognition from coded speech in matched and mismatched conditions, Odyssey
Charles C. Broun, William M. Campbell, David Pearce, Holly Kelleher (2001), Speaker recognition and the ETSI Standard Distributed Speech Recognition Front-End, Odyssey
Ran Gazit, Yaakov Metzger, Orith Toledo-Ronen (2001), Speaker verification over cellular networks, Odyssey
Javier Rodriguez Saeta, Christian Koechling, Javier Hernando (2001), A VQ speaker identification system in car environment for personalized infotainment, Odyssey
Joaquin Gonzalez-Rodriguez, Javier Ortega-Garcia, J.J. Lucena-Molina (2001), On the application of the Bayesian approach in real forensic conditions with GMM-based systems, Odyssey
Hirotaka Nakasone, Steven D. Beck (2001), Forensic automatic speaker recognition, Odyssey
Didier Meuwly, Andrzej Drygajlo (2001), Forensic speaker recognition based on a Bayesian framework and Gaussian mixture modelling (GMM), Odyssey
Yosef A. Solewicz (2001), Noise robustness in forensic speaker verification, Odyssey
Yaakov Metzger (2001), Blind segmentation of a multi-speaker conversation using two different sets of features, Odyssey
Mauro Cettolo (2001), Speaker tracking in a broadcast news corpus, Odyssey
Itshak Lapidot, Hugo Guterman (2001), Resolution limitation in speakers clustering and segmentation problems, Odyssey
Sylvain Meignier, Jean-Francois Bonastre, Stephane Igounet (2001), E-HMM approach for learning and adapting sound models for speaker indexing, Odyssey
William M. Campbell, Charles C. Broun (2001), Text-prompted speaker recognition with polynomial classifiers, Odyssey
Marcos Faundez-Zanuy (2001), On the model size selection for speaker identification, Odyssey
Robert Stapert, John S. Mason (2001), Speaker recognition and the acoustic speech space, Odyssey
Sachin S. Kajarekar, Hynek Hermansky (2001), Speaker verification based on broad phonetic categories, Odyssey
Hassan Ezzaidi, Jean Rouat, Douglas O'Shaughnessy (2001), Combining pitch and MFCC for speaker identification systems, Odyssey
Jason Pelecanos, Sridha Sridharan (2001), Feature warping for robust speaker verification, Odyssey
Özgür Devrim Orman, Levent M. Arslan (2001), Frequency analysis of speaker identification, Odyssey
Raphael Blouet, Frédéric Bimbot (2001), A tree-based approach for score computation in speaker verification, Odyssey
Xiaozheng Zhang, Charles C. Broun (2001), Using lip features for multimodal speaker verification, Odyssey
Fabian Monrose, Michael K. Reiter, Qi Li, Susanne Wetzel (2001), Using voice to generate cryptographic keys, Odyssey
Niko Brümmer, Jason Pelecanos (2001), Unsupervised evaluation of speaker verification systems, Odyssey
Larry P. Heck, Dominique Genoud (2001), Integrating speaker and speech recognizers: Automatic identity claim capture for speaker verification, Odyssey
Hideki Kawahara (2010), Exploration of the other aspect of vocoder revisited: A-Z STRAIGHT, TANDEM-STRAIGHT and morphing, SSW
Simon King (2010), Speech synthesis without the right data, SSW
H. Timothy Bunnell (2010), Crafting small databases for unit selection TTS: effects on intelligibility, SSW
Alistair Conkie, Ann K. Syrdal (2010), Composite TTS voices, SSW
Alexander Kain, Todd Leen (2010), Compression of line spectral frequency parameters using the asynchronous interpolation model, SSW
Fernando Villavicencio, Esteban Maestre (2010), GMM-PCA based speaker-timbre conversion on full-quality speech, SSW
Yi-Chin Huang, Chung-Hsien Wu, Chung-Han Lee, Yu-Ting Chao (2010), Voice conversion using precise speech alignment based on spectral property and eigen-codeword distribution, SSW
Elizabeth Godoy, Olivier Rosec, Thierry Chonavel (2010), On transforming spectral peaks in voice conversion, SSW
Chie Hayashida, Tomoki Toda, Yamato Ohtani, Hiroshi Saruwatari, Kiyohiro Shikano (2010), Linear transformation approaches to many-to-one voice conversion, SSW
Takashi Nose, Takao Kobayashi (2010), HMM-based robust voice conversion using adaptive F0 quantization, SSW
Ranniery Maia, Heiga Zen, M. J. F. Gales (2010), Statistical parametric speech synthesis with joint estimation of acoustic and excitation model parameters, SSW
Kai Yu, Blaise Thomson, Steve Young (2010), From discontinuous to continuous F0 modelling in HMM-based speech synthesis, SSW
Shinji Takaki, Yoshihiko Nankaku, Keiichi Tokuda (2010), Spectral modeling with contextual additive structure for HMM-based speech synthesis, SSW
Kei Hashimoto, Yoshihiko Nankaku, Keiichi Tokuda (2010), Bayesian speech synthesis framework integrating training and synthesis processes, SSW
Ingmar Steiner, Marc Schröder, Marcela Charfuelan, Annette Klepp (2010), Symbolic vs. acoustics-based style control for expressive unit selection, SSW
Jan Romportl, Enrico Zovato, Raúl Santos, Pavel Ircing, José Relaño Gil, Morena Danieli (2010), Application of expressive TTS synthesis in an advanced ECA system, SSW
Chih-Yung Yang, Chia-Ping Chen (2010), A hidden Markov model-based approach for emotional speech synthesis, SSW
Fabio Tesser, Enrico Zovato, Mauro Nicolao, Piero Cosi (2010), Two vocoder techniques for neutral to emotional timbre conversion, SSW
Maria K. Wolters, Karl B. Isaac, Steve Renals (2010), Evaluating speech synthesis intelligibility using Amazon Mechanical Turk, SSW
Anna C. Janska, Robert A. J. Clark (2010), Further exploration of the possibilities and pitfalls of multidimensional scaling as a tool for the evaluation of the quality of synthesized speech, SSW
Kishore Prahallad, Alan W. Black (2010), Handling large audio files in audio books for building synthetic voices, SSW
Gopala Krishna Anumanchipalli, Prasanna Kumar Muthukumar, Udhyakumar Nallasamy, Alok Parlikar, Alan W. Black, Brian Langner (2010), Improving speech synthesis for noisy environments, SSW
Kishore Prahallad, E. Veera Raghavendra, Alan W. Black (2010), Learning speaker-specific phrase breaks for text-to-speech systems, SSW
Nobuyuki Nishizawa, Tsuneo Kato (2010), Substitution of state distributions to reproduce natural prosody on HMM-based speech synthesizers, SSW
Sebastian Andersson, Junichi Yamagishi, Robert A. J. Clark (2010), Utilising spontaneous conversational speech in HMM-based speech synthesis, SSW
Ann K. Syrdal, Alistair Conkie, Yeon-Jun Kim, Mark C. Beutnagel (2010), Speech acts and dialog TTS, SSW
Heiga Zen, Norbert Braunschweiler, Sabine Buchholz, Kate Knill, Sacha Krstulovic, Javier Latorre (2010), HMM-based polyglot speech synthesis by speaker and language adaptive training, SSW
Mirjam Wester, John Dines, Matthew Gibson, Hui Liang, Yi-Jian Wu, Lakshmi Saheer, Simon King, Keiichiro Oura, Philip N. Garner, William Byrne, Yong Guan, Teemu Hirsimäki, Reima Karhila, Mikko Kurimo, Matt Shannon, Sayaka Shiota, Jilei Tian, Keiichi Tokuda, Junichi Yamagishi (2010), Speaker adaptation and the evaluation of speaker similarity in the EMIME speech-to-speech translation project, SSW
Jerome R. Bellegarda (2010), Toward naturally expressive speech synthesis: data-driven emotion detection using latent affective analysis, SSW
Gopala Krishna Anumanchipalli, Ying-Chang Cheng, Joseph Fernandez, Xiaohan Huang, Qi Mao, Alan W. Black (2010), KLATTSTAT: knowledge-based parametric speech synthesis, SSW
Keiichiro Oura, Ayami Mase, Tomohiko Yamada, Satoru Muto, Yoshihiko Nankaku, Keiichi Tokuda (2010), Recent development of the HMM-based singing voice synthesis system — Sinsy, SSW
Lijuan Wang, Xiaojun Qian, Wei Han, Frank K. Soong (2010), Photo-real lips synthesis with trajectory-guided sample selection, SSW
Lakshmi Saheer, John Dines, Philip N. Garner, Hui Liang (2010), Implementation of VTLN for statistical speech synthesis, SSW
Eva Lasarcyk, Charlotte Wollermann (2010), Do prosodic cues influence uncertainty perception in articulatory speech synthesis?, SSW
Yong Guan, Jilei Tian, Yi-Jian Wu, Junichi Yamagishi, Jani Nurminen (2010), An unified and automatic approach of Mandarin HTS system, SSW
Sathish Pammi, Marc Schröder, Marcela Charfuelan, Oytun Türk, Ingmar Steiner (2010), Synthesis of listener vocalisations with imposed intonation contours, SSW
Jinfu Ni, Hisashi Kawai (2010), An investigation of the impact of speech transcript errors on HMM voices, SSW
Keijiro Saino, Makoto Tachibana, Hideki Kenmochi (2010), An HMM-based singing style modeling system for singing voice synthesizers, SSW
Dong-Yan Huang, Susanto Rahardja, Ee Ping Ong (2010), Lombard effect mimicking, SSW
Chen Yu Chiang, Sin-Horng Chen, Yih-Ru Wang (2010), Unsupervised prosody labeling for constructing Mandarin TTS, SSW
Benjamin Picart, Thomas Drugman, Thierry Dutoit (2010), Analysis and synthesis of hypo- and hyperarticulated speech, SSW
Rajakrishnan Rajkumar, Michael White, Shari R. Speer, Kiwako Ito (2010), Evaluating prosody in synthetic speech with online (eye-tracking) and offline (rating) methods, SSW
Xu Shao, Vincent Pollet, Andrew Breen (2010), Refined statistical model tuning for speech synthesis, SSW
Didier Cadic, Christophe d'Alessandro (2010), High quality TTS voices within one day, SSW
Tatyana Polyákova, Antonio Bonafonte (2010), Nativization of English words in Spanish using analogy, SSW
Asami Yamamoto, Kazuhiro Suzuki, Kook Cho, Yoichi Yamashita (2010), Automatic prosodic labeling of accent information for Japanese spoken sentences, SSW
Mohamed Abou-Zleikha, Peter Cahill, Julie Carson-Berndsen (2010), An automatic pitch model with distance function, SSW
Minghui Dong, Ling Cen, Paul Chan, Haizhou Li (2010), Considering readability in text-to-speech recording script design, SSW
Oliver Watts, Junichi Yamagishi, Simon King (2010), Letter-based speech synthesis, SSW
Christophe Veaux, Pierre Lanchantin, Xavier Rodet (2010), Joint prosodic and segmental unit selection for expressive speech synthesis, SSW
Pieter E. Scholtz, Justus C. Roux, Jacques P. du Toit (2010), Speech synthesis in the mobile user interface, SSW
Tuomo Raitio, Antti Suni, Hannu Pulakka, Martti Vainio, Paavo Alku (2010), Comparison of formant enhancement methods for HMM-based speech synthesis, SSW
Mumtaz B. Mustafa, Raja N. Ainon, Roziati Zainuddin (2010), EM-HTS: real-time HMM-based Malay emotional speech synthesis, SSW
Dong-Yan Huang, Susanto Rahardja, Ee Ping Ong (2010), High level emotional speech morphing using STRAIGHT, SSW
Jean-Philippe Goldman, Sophie Roekhaut, Anne Catherine Simon (2010), Adding speaking style to a TTS system, SSW
Donata Moers, Igor Jauk, Bernd Möbius, Petra Wagner (2010), Synthesizing fast speech by implementing multi-phone units in unit selection speech synthesis, SSW
Miaomiao Wang, Miaomiao Wen, Daisuke Saito, Keikichi Hirose, Nobuaki Minematsu (2010), Improved generation of prosodic features in HMM-based Mandarin speech synthesis, SSW
João P. Cabral, Steve Renals, Korin Richmond, Junichi Yamagishi (2010), An HMM-based speech synthesiser using glottal post-filtering, SSW
Yeon-Jun Kim, Mark C. Beutnagel (2010), A study of lexical stress patterns in unit selection synthesis, SSW
Andreas Windmann, Petra Wagner, Fabio Tamburini, Denis Arnold, Catharine Oertel (2010), Automatic prominence annotation of a German speech synthesis corpus: towards prominence-based prosody generation for unit selection synthesis, SSW
Matthew Purver, Florin Ratiu, Lawrence Cavedon (2006), Robust interpretation in dialogue by combining confidence scores with contextual features, Interspeech
Hui Ye, Steve Young (2006), A clustering approach to semantic decoding, Interspeech
Teruhisa Misu, Tatsuya Kawahara (2006), A bootstrapping approach for developing language model of new spoken dialogue systems by selecting web texts, Interspeech
Axel Horndasch, Elmar Nöth, Anton Batliner, Volker Warnke (2006), Phoneme-to-grapheme mapping for spoken inquiries to the semantic web, Interspeech
Karl Weilhammer, Matthew N. Stuttle, Steve Young (2006), Bootstrapping language models for dialogue systems, Interspeech
Junlan Feng (2006), Question answering with discriminative learning algorithms, Interspeech
Patrick Kenny, Vishwa Gupta, G. Boulianne, Pierre Ouellet, Pierre Dumouchel (2006), Feature normalization using smoothed mixture transformations, Interspeech
Chia-Hsin Hsieh, Chung-Hsien Wu, Jun-Yu Lin (2006), Stochastic vector mapping-based feature enhancement using prior model and environment adaptation for noisy speech recognition, Interspeech
Babak Nasersharif, Ahmad Akbari (2006), A framework for robust MFCC feature extraction using SNR-dependent compression of enhanced mel filter bank energies, Interspeech
Friedrich Faubel, Matthias Wölfel (2006), Coupling particle filters with automatic speech recognition for speech feature enhancement, Interspeech
Chang-wen Hsu, Lin-shan Lee (2006), Extension and further analysis of higher order cepstral moment normalization (HOCMN) for robust features in speech recognition, Interspeech
Md. Babul Islam, Hiroshi Matsumoto, Kazumasa Yamamoto (2006), An improved mel-Wiener filter for mel-LPC based speech recognition, Interspeech
Lluis F. Hurtado, David Griol, Encarna Segarra, Emilio Sanchis (2006), A stochastic approach for dialog management based on neural networks, Interspeech
Mihai Rotaru, Diane J. Litman (2006), Discourse structure and speech recognition problems, Interspeech
Satanjeev Banerjee, Alexander I. Rudnicky (2006), A TextTiling based approach to topic boundary detection in meetings, Interspeech
Stefan Schulz, Hilko Donker (2006), An user-centered development of an intuitive dialog control for speech-controlled music selection in cars, Interspeech
Antoine Raux, Dan Bohus, Brian Langner, Alan W. Black, Maxine Eskenazi (2006), Doing research on a deployed spoken dialogue system: one year of Let's Go! experience, Interspeech
Jackson Liscombe, Jennifer J. Venditti, Julia Hirschberg (2006), Detecting question-bearing turns in spoken tutorial dialogues, Interspeech
Soundararajan Srinivasan, Yang Shao, Zhaozhang Jin, DeLiang Wang (2006), A computational auditory scene analysis system for robust speech recognition, Interspeech
Runqiang Han, Pei Zhao, Qin Gao, Zhiping Zhang, Hao Wu, Xihong Wu (2006), CASA based speech separation for robust speech recognition, Interspeech
Mark R. Every, Philip J.B. Jackson (2006), Enhancement of harmonic content of speech based on a dynamic programming pitch tracking algorithm, Interspeech
Jon Barker, André Coy, Ning Ma, Martin Cooke (2006), Recent advances in speech fragment decoding techniques, Interspeech
Tuomas Virtanen (2006), Speech recognition using factorial hidden Markov models for separation in the feature space, Interspeech
Ji Ming, Timothy J. Hazen, James R. Glass (2006), Combining missing-feature theory, speech enhancement and speaker-dependent/-independent modeling for speech separation, Interspeech
T. Kristjansson, J. Hershey, P. Olsen, S. Rennie, Ramesh Gopinath (2006), Super-human multi-talker speech recognition: the IBM 2006 speech separation challenge system, Interspeech
Om D. Deshmukh, Carol Y. Espy-Wilson (2006), Modified phase opponency based solution to the speech separation challenge, Interspeech
J. Lööf, M. Bisani, Ch. Gollan, G. Heigold, Björn Hoffmeister, Ch. Plahl, Ralf Schlüter, Hermann Ney (2006), The 2006 RWTH parliamentary speeches transcription system, Interspeech
G. Bouselmi, D. Fohr, I. Illina, Jean-Paul Haton (2006), Multilingual non-native speech recognition using phonetic confusion-based acoustic model modification and graphemic constraints, Interspeech
Joyce Y. C. Chan, P. C. Ching, Tan Lee, Houwei Cao (2006), Automatic speech recognition of Cantonese-English code-mixing utterances, Interspeech
M. Zimmerman, Dilek Hakkani-Tür, J. Fung, N. Mirghafori, L. Gottlieb, Elizabeth Shriberg, Yang Liu (2006), The ICSI+ multilingual sentence segmentation system, Interspeech
Yan Ming Cheng, Changxue Ma, Lynette Melnar (2006), Cross-language evaluation of voice-to-phoneme conversions for voice-tag application in embedded platforms, Interspeech
Huanliang Wang, Yao Qian, Frank K. Soong, Jian-Lai Zhou, Jiqing Han (2006), A multi-space distribution (MSD) approach to speech recognition of tonal languages, Interspeech
Viet Bac Le, Laurent Besacier (2006), Comparison of acoustic modeling techniques for Vietnamese and Khmer ASR, Interspeech
Yi Liu, Pascale Fung (2006), Multi-accent Chinese speech recognition, Interspeech
Seyed Ghorshi, Saeed Vaseghi, Qin Yan (2006), Comparative analysis of formants of British, American and Australian accents, Interspeech
Linquan Liu, Thomas Fang Zheng, Wenhu Wu (2006), Automatic initial/final generation for dialectal Chinese speech recognition, Interspeech
Ruhi Sarikaya, Ossama Emam, Imed Zitouni, Yuqing Gao (2006), Maximum entropy modeling for diacritization of Arabic text, Interspeech
Slavomír Lihan, Jozef Juhár, Anton Cizmár (2006), Comparison of Slovak and Czech speech recognition based on grapheme and phoneme acoustic models, Interspeech
Rhys James Jones, Ambrose Choy, Briony Williams (2006), Integrating Festival and Windows, Interspeech
Cosmin Munteanu, Gerald Penn, Ron Baecker, Elaine Toms, David James (2006), Measuring the acceptable word error rate of machine-generated webcast transcripts, Interspeech
Goshu Nagino, Makoto Shozakai (2006), Analyzing reusability of speech corpus based on statistical multidimensional scaling method, Interspeech
Susan Fitt, Korin Richmond (2006), Redundancy and productivity in the speech technology lexicon - can we do better?, Interspeech
Takeshi Yamada, Masakazu Kumakura, Nobuhiko Kitawaki (2006), Word intelligibility estimation of noise-reduced speech, Interspeech
Christoph Draxler (2006), Exploring the unknown - collecting 1000 speakers over the internet for the ph@ttsessionz database of adolescent speakers, Interspeech
Timothy Murphy, Dorel Picovici, Abdulhussain E. Mahdi (2006), A new single-ended measure for assessment of speech quality, Interspeech
Ailbhe Ní Chasaide, John Wogan, Brian Ó Raghallaigh, Áine Ní Bhriain, Eric Zoerner, Harald Berthelsen, Christer Gobl (2006), Speech technology for minority languages: the case of Irish (Gaelic), Interspeech
Francisco José Fraga, Carlos Alberto Ynoguti, André Godoi Chiovato (2006), Further investigations on the relationship between objective measures of speech quality and speech recognition rates in noisy environments, Interspeech
Volodya Grancharov, David Y. Zhao, Jonas Lindblom, W. Bastiaan Kleijn (2006), Non-intrusive speech quality assessment with low computational complexity, Interspeech
Min-Siong Liang, Ren-Yuan Lyu, Yuang-Chin Chiang (2006), Using speech recognition technique for constructing a phonetically transcribed Taiwanese (Min-Nan) text corpus, Interspeech
Andrej Zgank, Tomas Rotovnik, Matej Grasic, Marko Kos, Damjan Vlaj, Zdravko Kacic (2006), Sloparl - Slovenian parliamentary speech and text corpus for large vocabulary continuous speech recognition, Interspeech
Siew Leng Toh, Fan Yang, Peter A. Heeman (2006), An annotation scheme for agreement analysis, Interspeech
Hitoshi Aoki, Atsuko Kurashima, Akira Takahashi (2006), Conversational quality estimation model for wideband IP-telephony services, Interspeech
Kelley Kilanski, Jonathan Malkin, Xiao Li, Richard Wright, Jeff A. Bilmes (2006), The Vocal Joystick data collection effort and vowel corpus, Interspeech
Dmitry Sityaev, Katherine Knill, Tina Burrows (2006), Comparison of the ITU-T P.85 standard to other methods for the evaluation of text-to-speech systems, Interspeech
Peter A. Heeman, Andy McMillin, J. Scott Yaruss (2006), An annotation scheme for complex disfluencies, Interspeech
Christophe Van Bael, Lou Boves, Henk van den Heuvel, Helmer Strik (2006), Automatic phonetic transcription of large speech corpora: a comparative study, Interspeech
Yongmei Shi, Lina Zhou (2006), Examining knowledge sources for human error correction, Interspeech
Joon-Hyuk Chang, Woohyung Lim, Nam Soo Kim (2006), Signal modification incorporating perceptual weighting filter, Interspeech
Jani Nurminen (2006), Enhanced dynamic codebook reordering for advanced quantizer structures, Interspeech
Chang-Heon Lee, Sung-Kyo Jung, Thomas Eriksson, Won-Suk Jun, Hong-Goo Kang (2006), An efficient segment-based speech compression technique for hand-held TTS systems, Interspeech
V. Ramasubramanian, D. Harish (2006), An unified unit-selection framework for ultra low bit-rate speech coding, Interspeech
Jes Thyssen, Juin-Hwey Chen (2006), Efficient VQ techniques and general noise shaping in noise feedback coding, Interspeech
Yasheng Qian, Wei-Shou Hsu, Peter Kabal (2006), Classified comfort noise generation for efficient voice transmission, Interspeech
Balázs Kövesi, Dominique Massaloux, David Virette, Julien Bensa (2006), Integration of a CELP coder in the ARDOR universal sound codec, Interspeech
Saikat Chatterjee, T. V. Sreenivas (2006), Two stage transform vector quantization of LSFs for wideband speech coding, Interspeech
Saikat Chatterjee, T. V. Sreenivas (2006), Comparison of prediction based LSF quantization methods using split VQ, Interspeech
Konrad Hofbauer, Gernot Kubin (2006), High-rate data embedding in unvoiced speech, Interspeech
Kyle D. Anderson, Philippe Gournay (2006), Pitch resynchronization while recovering from a late frame in a predictive speech decoder, Interspeech
Suhadi Suhadi, Sorel Stan, Tim Fingscheidt (2006), A novel environment-dependent speech enhancement method with optimized memory footprint, Interspeech
Esfandiar Zavarehei, Saeed Vaseghi, Qin Yan (2006), Weighted codebook mapping for noisy speech enhancement using harmonic-noise model, Interspeech
J. Jensen, R. C. Hendriks, J. S. Erkelens, R. Heusdens (2006), MMSE estimation of complex-valued discrete Fourier coefficients with generalized gamma priors, Interspeech
Amarnag Subramanya, Michael L. Seltzer, Alex Acero (2006), Automatic removal of typed keystrokes from speech signals, Interspeech
Erhard Rank, Gernot Kubin (2006), Lattice LP filtering for noise reduction in speech signals, Interspeech
Om D. Deshmukh, Carol Y. Espy-Wilson (2006), Speech enhancement using modified phase opponency model, Interspeech
Wen Jin, Michael Scordilis (2006), Single channel speech enhancement by frequency domain constrained optimization and temporal masking, Interspeech
Jong Won Shin, Seung Yeol Lee, Hwan Sik Yun, Nam Soo Kim (2006), Speech enhancement based on residual noise shaping, Interspeech
Hannu Pulakka, Laura Laaksonen, Paavo Alku (2006), Quality improvement of telephone speech by artificial bandwidth expansion - listening tests in three languages, Interspeech
Benjamin J. Shannon, Kuldip K. Paliwal (2006), Role of phase estimation in speech enhancement, Interspeech
Benjamin J. Shannon, Kuldip K. Paliwal, Climent Nadeu (2006), Speech enhancement based on spectral estimation from higher-lag autocorrelation, Interspeech
Nitish Krishnamurthy, John H. L. Hansen (2006), Noise update modeling for speech enhancement: when do we do enough?, Interspeech
A. Shahina, B. Yegnanarayana (2006), Mapping neural networks for bandwidth extension of narrowband speech, Interspeech
Amit Das, John H. L. Hansen (2006), Decision directed constrained iterative speech enhancement, Interspeech
Takahiro Murakami, Yoshihisa Ishida (2006), Adaptive filtering for attenuating musical noise caused by spectral subtraction, Interspeech
Yi Hu, Philipos C. Loizou (2006), Evaluation of objective measures for speech enhancement, Interspeech
Myung-Suk Song, Chang-Heon Lee, Hong-Goo Kang (2006), Performance analysis of various single channel speech enhancement algorithms for automatic speech recognition, Interspeech
G. Boulianne, J.-F. Beaumont, M. Boisvert, J. Brousseau, P. Cardinal, C. Chapdelaine, M. Comeau, Pierre Ouellet, F. Osterrath (2006), Computer-assisted closed-captioning of live TV broadcasts in French, Interspeech
Mohamed Afify, Ruhi Sarikaya, Hong-Kwang Jeff Kuo, Laurent Besacier, Yuqing Gao (2006), On the use of morphological analysis for dialectal Arabic speech recognition, Interspeech
Isabel Trancoso, Ricardo Nunes, Luís Neves, Céu Viana, Helena Moniz, Diamantino Caseiro, Ana Isabel Mata (2006), Recognition of classroom lectures in European Portuguese, Interspeech
Thomas Pellegrini, Lori Lamel (2006), Investigating automatic decomposition for ASR in less represented languages, Interspeech
Abdillahi Nimaan, Pascal Nocéra, Jean-François Bonastre (2006), Automatic transcription of Somali language, Interspeech
Özgür Çetin, Elizabeth Shriberg (2006), Analysis of overlaps in meetings by dialog factors, hot spots, speakers, and collection site: insights for automatic speech recognition, Interspeech
Ryu Takeda, Shun'ichi Yamamoto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno (2006), Improving speech recognition of two simultaneous speech signals by integrating ICA BSS and automatic missing feature mask generation, Interspeech
Wooil Kim, John H. L. Hansen (2006), Missing-feature reconstruction for band-limited speech recognition in spoken document retrieval, Interspeech
Hahn Koo, Yan Ming Cheng (2006), Incremental learning of MAP context-dependent edit operations for spoken phone number recognition in an embedded platform, Interspeech
Yasunari Obuchi, Nobuo Hataoka (2006), Development and evaluation of speech database in automotive environments for practical speech recognition systems, Interspeech
Dong Yu, Yun-Cheng Ju, Alex Acero (2006), An effective and efficient utterance verification technology using word n-gram filler models, Interspeech
J. M. Górriz, Javier Ramírez, C. G. Puntonet, José C. Segura (2006), An efficient bispectrum phase entropy-based algorithm for VAD, Interspeech
Petr Cerva, Jan Nouza, Jan Silovsky (2006), Two-step unsupervised speaker adaptation based on speaker and gender recognition and HMM combination, Interspeech
Satoshi Nakamura, Masakiyo Fujimoto, Kazuya Takeda (2006), CENSREC2: corpus and evaluation environments for in car continuous digit speech recognition, Interspeech
Cheng-Tao Chu, Yun-Hsuan Sung, Yuan Zhao, Daniel Jurafsky (2006), Detection of word fragments in Mandarin telephone conversation, Interspeech
Qiang Huo, Wei Li (2006), A DTW-based dissimilarity measure for left-to-right hidden Markov models and its application to word confusability analysis, Interspeech
Angel M. Gómez, Juan J. Ramos-Muñoz, Antonio M. Peinado, Victoria Sánchez (2006), Multi-flow block interleaving applied to distributed speech recognition over IP networks, Interspeech
Edward C. Lin, Kai Yu, Rob A. Rutenbar, Tsuhan Chen (2006), Moving speech recognition from software to silicon: the In Silico Vox project, Interspeech
Chengyuan Ma, Yu Tsao, Chin-Hui Lee (2006), A study on detection based automatic speech recognition, Interspeech
Rahul Chitturi, Mark Hasegawa-Johnson (2006), Novel time domain multi-class SVMs for landmark detection, Interspeech
Sankaranarayanan Ananthakrishnan, Shrikanth Narayanan (2006), Combining acoustic, lexical, and syntactic evidence for automatic unsupervised prosody labeling, Interspeech
Andrew Rosenberg, Julia Hirschberg (2006), On the correlation between energy and pitch accent in read English speech, Interspeech
Keikichi Hirose, Yasufumi Asano, Nobuaki Minematsu (2006), Corpus-based generation of fundamental frequency contours using generation process model and considering emotional focuses, Interspeech
Tomás Dubeda (2006), Prosodic boundaries in Czech: an experiment based on delexicalized speech, Interspeech
Lifu Yi, Jian Li, Xiaoyan Lou, Jie Hao (2006), Totally data-driven intonation prediction model using a novel F0 contour parametric representation, Interspeech
Laura Dilley, Mara Breen, Marti Bolivar, John Kraemer, Edward Gibson (2006), A comparison of inter-transcriber reliability for two systems of prosodic annotation: RaP (rhythm and pitch) and ToBI (tones and break indices), Interspeech
Issac Alphonso, Shuangyu Chang (2006), Saliency parsing for automated directory assistance, Interspeech
Kohei Iwata, Yoshiaki Itoh, Kazunori Kojima, Masaaki Ishigame, Kazuyo Tanaka, Shi-wook Lee (2006), Open-vocabulary spoken document retrieval based on new subword models and subword phonetic similarity, Interspeech
Xiang Li, Ea-Ee Jan, Cheng Wu, David Lubensky (2006), Improved topic classification over maximum entropy model using k-norm based new objectives, Interspeech
Yi-cheng Pan, Jia-yu Chen, Yen-shin Lee, Yi-sheng Fu, Lin-shan Lee (2006), Efficient interactive retrieval of spoken documents with key terms ranked by reinforcement learning, Interspeech
Katsuhito Sudoh, Hajime Tsukada, Hideki Isozaki (2006), Discriminative named entity recognition of speech data using speech recognition confidence, Interspeech
Ville T. Turunen, Mikko Kurimo (2006), Using latent semantic indexing for morph-based spoken document retrieval, Interspeech
Ralf Schlüter, András Zolnay, Hermann Ney (2006), Feature combination using linear discriminant analysis and its pitfalls, Interspeech
Fabio Valente, Hynek Hermansky (2006), Discriminant linear processing of time-frequency plane, Interspeech
Esmeralda Uraga, Thomas Hain (2006), Automatic speech recognition experiments with articulatory data, Interspeech
Frederik Stouten, Jean-Pierre Martens (2006), Speech recognition with phonological features: some issues to attend, Interspeech
Matthias Wölfel, Christian Fügen, Shajith Ikbal, John W. McDonough (2006), Multi-source far-distance microphone selection and combination for automatic transcription of lectures, Interspeech
Colin Breithaupt, Rainer Martin (2006), Statistical analysis and performance of DFT domain noise reduction filters for robust speech recognition, Interspeech
L. García, José C. Segura, Carmen Benítez, Javier Ramírez, Ángel de la Torre (2006), Normalization of the inter-frame information using smoothing filtering, Interspeech
Muhammad Ghulam, Junsei Horikawa, Tsuneo Nitta (2006), Comparative study on contributions of pitch-synchronization and peak-amplitude towards robustness issue of ASR, Interspeech
Yasuo Ariki, Shunsuke Kato, Tetsuya Takiguchi (2006), Phoneme recognition based on Fisher weight map to higher-order local auto-correlation, Interspeech
Hynek Boril, Petr Fousek, Petr Pollák (2006), Data-driven design of front-end filter bank for Lombard speech recognition, Interspeech
Andrej Ljolje (2006), Optimization of class weights for LDA feature transformations, Interspeech
Janne Pylkkönen (2006), LDA based feature estimation methods for LVCSR, Interspeech
G. Farahani, S.M. Ahadi, M. Mehdi Homayounpour (2006), Robust feature extraction based on spectral peaks of group delay and autocorrelation function and phase domain analysis, Interspeech
Sankaran Panchapagesan (2006), Frequency warping by linear transformation of standard MFCC, Interspeech
Ana Lilia Reyes-Herrera, Luis Villaseñor-Pineda, Manuel Montes-y-Gómez (2006), Automatic language identification using wavelets, Interspeech
Josef G. Bauer, Ekaterina Timoshenko (2006), Minimum classification error training of hidden Markov models for acoustic language identification, Interspeech
Ekaterina Timoshenko, Josef G. Bauer (2006), Unsupervised adaptation for acoustic language identification, Interspeech
S. V. Basavaraja, T. V. Sreenivas (2006), Low complexity LID using pruned pattern tables of LZW, Interspeech
Xi Yang, Lu-Feng Zhai, Manhung Siu, Herbert Gish (2006), Improved language identification using support vector machines for language modeling, Interspeech
Jiri Navratil (2006), Recent advances in phonotactic language recognition using binary-decision trees, Interspeech
Chi-Yueh Lin, Hsiao-Chuan Wang (2006), Fusion of phonotactic and prosodic knowledge for language identification, Interspeech
Haizhou Li, Bin Ma, Rong Tong (2006), Vector-based spoken language recognition using output coding, Interspeech
Victor G. Guijarrubia, M. Ines Torres (2006), Basque-Spanish language identification using phone-based methods, Interspeech
Ayako Ikeno, John H. L. Hansen (2006), The role of prosody in the perception of US native English accents, Interspeech
Bianca Vieru-Dimulescu, Philippe Boula de Mareüil (2006), Perceptual identification and phonetic analysis of 6 foreign accents in French, Interspeech
Rongqing Huang, John H. L. Hansen (2006), Unsupervised Spanish dialect classification, Interspeech
Petra Gieselmann, Alex Waibel (2006), Dynamic extension of a grammar-based dialogue system: constructing an all-recipes knowing robot, Interspeech
Alexander Gruenstein, Stephanie Seneff, Chao Wang (2006), Scalable and portable web-based multimodal dialogue interaction with geographical databases, Interspeech
Chantal Ackermann, Marion Libossek (2006), System- versus user-initiative dialog strategy for driver information systems, Interspeech
Filip Krsmanovic, Curtis Spencer, Daniel Jurafsky, Andrew Y. Ng (2006), Have we met? MDP based speaker ID for robot dialogue, Interspeech
Rob J. J. H. van Son, Wieneke Wesseling, Louis C. W. Pols (2006), Prominent words as anchors for TRP projection, Interspeech
Heriberto Cuayáhuitl, Steve Renals, Oliver Lemon, Hiroshi Shimodaira (2006), Learning multi-goal dialogue strategies using reinforcement learning with reduced state-action spaces, Interspeech
Jörg Mayer, Ekaterina Jasinskaja, Ulrike Kölsch (2006), Pitch range and pause duration as markers of discourse hierarchy: perception experiments, Interspeech
Antonio Roque, Anton Leuski, Vivek Rangarajan, Susan Robinson, Ashish Vaswani, Shrikanth Narayanan, David Traum (2006), Radiobot-CFF: a spoken dialogue system for military training, Interspeech
Shinya Yamada, Toshihiko Itoh, Kenji Araki (2006), Is voice quality enough? - study on how the situation and user's awareness influence the utterance features, Interspeech
Jozef Juhár, Stanislav Ondas, Anton Cizmár, Milan Rusko, Gregor Rozinaj, Roman Jarina (2006), Development of Slovak GALAXY/VoiceXML based spoken language dialogue system to retrieve information from the internet, Interspeech
Lars Degerstedt, Arne Jönsson (2006), LINTest: a development tool for testing dialogue systems, Interspeech
Akinori Ito, Keisuke Shimada, Motoyuki Suzuki, Shozo Makino (2006), A user simulator based on VoiceXML for evaluation of spoken dialog systems, Interspeech
Kristiina Jokinen, Topi Hurtig (2006), User expectations and real experience on a multimodal interactive system, Interspeech
F. Burkhardt, J. Ajmera, Roman Englert, J. Stegmann, W. Burleson (2006), Detecting anger in automated voice portal dialogs, Interspeech
Markku Turunen, Jaakko Hakulinen, Anssi Kainulainen (2006), Evaluation of a spoken dialogue system with usability tests and long-term pilot studies: similarities and differences, Interspeech
Fuliang Weng, Sebastian Varges, Badri Raghunathan, Florin Ratiu, Heather Pon-Barry, Brian Lathrop, Qi Zhang, Harry Bratt, Tobias Scheideck, Kui Xu, Matthew Purver, Rohit Mishra, Annie Lien, M. Raya, S. Peters, Y. Meng, J. Russell, Lawrence Cavedon, Elizabeth Shriberg, H. Schmidt, R. Prieto (2006), CHAT: a conversational helper for automotive tasks, Interspeech
Kallirroi Georgila, James Henderson, Oliver Lemon (2006), User simulation for spoken dialogue systems: learning and evaluation, Interspeech
Yi-Hsiang Chao, Wei-Ho Tsai, Hsin-Min Wang, Ruei-Chuan Chang (2006), Improving the characterization of the alternative hypothesis via kernel discriminant analysis for likelihood ratio-based speaker verification, Interspeech
Zhenchun Lei, Yingchun Yang, Zhaohui Wu (2006), A discriminative method for speaker verification using the difference information, Interspeech
Nicolas Scheffer, Jean-François Bonastre (2006), A multiclass framework for speaker verification within an acoustic event sequence system, Interspeech
Bin Ma, Donglai Zhu, Rong Tong, Haizhou Li (2006), Speaker cluster based GMM tokenization for speaker recognition, Interspeech
Claudio Garreton, Nestor Becerra Yoma, Carlos Molina, Fernando Huenupan (2006), Intra-speaker variability compensation in speaker verification with limited enrolling data, Interspeech
Girija Chetty, Michael Wagner (2006), Speaking faces for face-voice speaker identity verification, Interspeech
Kishore Prahallad, Varanasi Sudhakar, Veluru Ranganatham, Krishna M. Bharat, S. Roy Debashish (2006), Significance of formants from difference spectrum for speaker identification, Interspeech
Maider Zamalloa, Germán Bordel, Luis Javier Rodríguez, Mikel Penagarikano, Juan Pedro Uribe (2006), Using genetic algorithms to weight acoustic features for speaker recognition, Interspeech
Michael T. Padilla, Thomas F. Quatieri, Douglas A. Reynolds (2006), Missing feature theory with soft spectral subtraction for speaker verification, Interspeech
Leena Mary, B. Yegnanarayana (2006), Prosodic features for speaker verification, Interspeech
Ming Liu, Thomas S. Huang (2006), Unsupervised learning of HMM topology for text-dependent speaker verification, Interspeech
Jan Anguita, Javier Hernando (2006), On the use of Jacobian adaptation in real speaker verification applications, Interspeech
Ming Liu, Huazhong Ning, Thomas S. Huang, Zhengyou Zhang (2006), A novel framework of text-independent speaker verification based on utterance transform and iterative cohort modeling, Interspeech
Vinod Prakash, John H. L. Hansen (2006), A cohort - UBM approach to mitigate data sparseness for in-set/out-of-set speaker recognition, Interspeech
Vaishnevi S. Varadarajan, John H. L. Hansen (2006), Analysis of Lombard effect under different types and levels of noise with application to in-set speaker ID systems, Interspeech
Alan McCree (2006), Reducing speech coding distortion for speaker identification, Interspeech
Tsuneo Kato, Hisashi Kawai (2006), A text-prompted distributed speaker verification system implemented on a cellular phone and a mobile terminal, Interspeech
Srikanth Vishnubhotla, Carol Y. Espy-Wilson (2006), Automatic detection of irregular phonation in continuous speech, Interspeech
V. Ramasubramanian, Deepak Vijaywargiay, Kumar V. Praveen (2006), Highly noise robust text-dependent speaker recognition based on hypothesized Wiener filtering, Interspeech
Hiromasa Fujihara, Tetsuro Kitahara, Masataka Goto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno (2006), Speaker identification under noisy environments by using harmonic structure extraction and reliable frame weighting, Interspeech
Andreas Stergiou, Aristodemos Pnevmatikakis, Lazaros C. Polymenakos (2006), Enhancing the performance of a GMM-based speaker identification system in a multi-microphone setup, Interspeech
C. Longworth, M. J. F. Gales (2006), Discriminative adaptation for speaker verification, Interspeech
Andrew O. Hatch, Sachin Kajarekar, Andreas Stolcke (2006), Within-class covariance normalization for SVM-based speaker recognition, Interspeech
Carol Y. Espy-Wilson, Sandeep Manocha, Srikanth Vishnubhotla (2006), A new set of features for text-independent speaker identification, Interspeech
Uchechukwu O. Ofoegbu, Ananth N. Iyer, Robert E. Yantorno, Stanley J. Wenndt (2006), Detection of a third speaker in telephone conversations, Interspeech
Konstantin Biatov, Joachim Köhler (2006), Improvement speaker clustering using global similarity features, Interspeech
Balakrishnan Narayanaswamy, Rashmi Gangadharaiah, Richard M. Stern (2006), Voting for two speaker segmentation, Interspeech
Alexandre Preti, Jean-François Bonastre (2006), Unsupervised model adaptation for speaker verification, Interspeech
Rong Zheng, Shuwu Zhang, Bo Xu (2006), A quality measure method using Gaussian mixture models and divergence measure for speaker identification, Interspeech
Yushi Zhang, Waleed H. Abdulla (2006), Gammatone auditory filterbank and independent component analysis for speaker identification, Interspeech
Wei Wu, Thomas Fang Zheng, Ming-Xing Xu, Huan-Jun Bao (2006), Study on speaker verification on emotional speech, Interspeech
M. Farrús, A. Garde, P. Ejarque, J. Luque, Javier Hernando (2006), On the fusion of prosody, voice spectrum and face features for multimodal person verification, Interspeech
Tarun Pruthi, Carol Y. Espy-Wilson (2006), An MRI based study of the acoustic effects of sinus cavities and its application to speaker recognition, Interspeech
Mariko Kojima, Tomoko Matsui, Hiromichi Kawanami, Hiroshi Saruwatari, Kiyohiro Shikano (2006), Speaker verification with non-audible murmur segments, Interspeech
Christian Müller (2006), Automatic recognition of speakers' age and gender on the basis of empirical studies, Interspeech
E. J. S. Fox, J. D. Roberts, M. Bennamoun (2006), Text-independent speaker identification in birds, Interspeech
Ilyas Potamitis, Todor Ganchev, Nikos Fakotakis (2006), Automatic acoustic identification of insects inspired by the speaker recognition paradigm, Interspeech
Sabato Marco Siniscalchi, Jinyu Li, Chin-Hui Lee (2006), A study on lattice rescoring with knowledge scores for automatic speech recognition, Interspeech
Sebastian Stüker, Christian Fügen, Susanne Burger, Matthias Wölfel (2006), Cross-system adaptation and combination for continuous speech recognition: the influence of phoneme set and acoustic front-end, Interspeech
C. Breslin, M. J. F. Gales (2006), Generating complementary systems for speech recognition, Interspeech
Rong Zhang, Alexander I. Rudnicky (2006), Investigations of issues for using multiple acoustic models to improve continuous speech recognition, Interspeech
I-Fan Chen, Lin-shan Lee (2006), A new framework for system combination based on integrated hypothesis space, Interspeech
Björn Hoffmeister, Tobias Klein, Ralf Schlüter, Hermann Ney (2006), Frame based system combination and a comparison with weighted ROVER and CNC, Interspeech
Jiahong Yuan, Mark Liberman, Christopher Cieri (2006), Towards an integrated understanding of speaking rate in conversation, Interspeech
Minh Quang Vu, Ðô Ðat Trân, Eric Castelli (2006), Prosody of interrogative and affirmative sentences in Vietnamese language: analysis and perceptive results, Interspeech
Jennifer J. Venditti, Julia Hirschberg, Jackson Liscombe (2006), Intonational cues to student questions in tutoring dialogs, Interspeech
Emiel Krahmer, Marc Swerts (2006), Testing the effect of audiovisual cues to prominence via a reaction-time experiment, Interspeech
Agustín Gravano, Julia Hirschberg (2006), Effect of genre, speaker, and word class on the realization of given and new information, Interspeech
Martti Vainio, Juhani Järvikivi, Stefan Werner (2006), Word order and tonal shape in the production of focus in short Finnish utterances, Interspeech
Bernd J. Kröger, Peter Birkholz, Jim Kannampuzha, Christiane Neuschaefer-Rube (2006), Modeling sensory-to-motor mappings using neural nets and a 3D articulatory speech synthesizer, Interspeech
Julie Fontecave, Frédéric Berthommier (2006), Semi-automatic extraction of vocal tract movements from cineradiographic data, Interspeech
Szu-Chen Jou, Tanja Schultz, Matthias Walliczek, Florian Kraft, Alex Waibel (2006), Towards continuous speech recognition using surface electromyography, Interspeech
Korin Richmond (2006), A trajectory mixture density network for the acoustic-articulatory inversion mapping, Interspeech
Florian Metze (2006), Articulatory features for "meeting" speech recognition, Interspeech
Zdenek Krnoul, Milos Zelezný, Ludek Müller, Jakub Kanis (2006), Training of coarticulation models using dominance functions and visual unit selection methods for audio-visual speech synthesis, Interspeech
Le Zhang, Steve Renals (2006), Phone recognition analysis for trajectory HMM, Interspeech
Joseph Keshet, Shai Shalev-Shwartz, Samy Bengio, Yoram Singer, Dan Chazan (2006), Discriminative kernel-based phoneme sequence recognition, Interspeech
Jeremy Morris, Eric Fosler-Lussier (2006), Combining phonetic attributes using conditional random fields, Interspeech
T. Nagarajan, Douglas O'Shaughnessy (2006), Discriminative MLE training using a product of Gaussian likelihoods, Interspeech
Hao-Zheng Li, Douglas O'Shaughnessy (2006), State-level variable modeling for phoneme classification, Interspeech
Xiaolong Li, Li Deng, Dong Yu, Alex Acero (2006), A time-synchronous phonetic decoder for a long-contextual-span hidden trajectory model, Interspeech
Marta Casar, Jose A. R. Fonollosa (2006), Analysis of HMM temporal evolution for automatic speech recognition and utterance verification, Interspeech
Min Tang, Aravind Ganapathiraju (2006), Improvements to bucket box intersection algorithm for fast GMM computation in embedded speech recognition systems, Interspeech
Konstantin Markov, Satoshi Nakamura (2006), Forward-backwards training of hybrid HMM/BN acoustic models, Interspeech
Dirk Gehrig, Thomas Schaaf (2006), A comparative study of Gaussian selection methods in large vocabulary continuous speech recognition, Interspeech
Soo-Young Suk, Seong-Jun Hahm, Ho-Youl Jung, Hyun-Yeol Chung (2006), A successive state and mixture splitting for optimizing the size of models in speech recognition, Interspeech
Valentin Ion, Reinhold Haeb-Umbach (2006), Improved source modeling and predictive classification for channel robust speech recognition, Interspeech
Marco Kühne, Roberto Togneri (2006), Automatic English stop consonants classification using wavelet analysis and hidden Markov models, Interspeech
Tingyao Wu, Dirk Van Compernolle, Jacques Duchateau, Hugo Van hamme (2006), Single frame selection for phoneme classification, Interspeech
Sorin Dusan, Lawrence Rabiner (2006), On the relation between maximum spectral transition positions and phone boundaries, Interspeech
T. Yingthawornsuk, H. Kaymaz Keskinpala, D. France, D. M. Wilkes, R. G. Shiavi, R. M. Salomon (2006), Objective estimation of suicidal risk using vocal output characteristics, Interspeech
E. Didiot, I. Illina, O. Mella, D. Fohr, Jean-Paul Haton (2006), A wavelet-based parameterization for speech/music segmentation, Interspeech
Goshu Nagino, Makoto Shozakai (2006), Distance measure between Gaussian distributions for discriminating speaking styles, Interspeech
Franz Pernkopf, Tuan Van Pham (2006), Bayesian networks for phonetic classification using time-scale features, Interspeech
Nicole Beringer (2006), Fast and effective retraining on contrastive vocal characteristics with bidirectional long short-term memory nets, Interspeech
Ning Ma, Phil Green, André Coy (2006), Exploiting dendritic autocorrelogram structure to identify spectro-temporal regions dominated by a single sound source, Interspeech
Pairote Leelaphattarakij, Proadpran Punyabukkana, Atiwong Suchato (2006), Locating phone boundaries from acoustic discontinuities using a two-staged approach, Interspeech
Qiang Fu, Biing-Hwang Juang (2006), Investigation on rescoring using minimum verification error (MVE) detectors, Interspeech
Qiang Fu, Antonio Moreno-Daniel, Biing-Hwang Juang, Jian-Lai Zhou, Frank K. Soong (2006), Generalization of the minimum classification error (MCE) training based on maximizing generalized posterior probability (GPP), Interspeech
Michael A. Carlin, Brett Y. Smolenski, Stanley J. Wenndt (2006), Unsupervised detection of whispered speech in the presence of normal phonation, Interspeech
Xavier Anguera, Chuck Wooters, Javier Hernando (2006), Friends and enemies: a novel initialization for speaker diarization, Interspeech
Kushan Surana, Janet Slifka (2006), Acoustic cues for the classification of regular and irregular phonation, Interspeech
Rattima Nitisaroj (2006), Realizations and representations of Thai tones in monomoraic syllables, Interspeech
Irene Jacobi, Louis C. W. Pols, Jan Stroop (2006), Measuring and comparing vowel qualities in a Dutch spontaneous speech corpus, Interspeech
Aijun Li, Qiang Fang, Ziyu Xiong (2006), Phonetic research on accented Chinese in three dialectal regions: Shanghai, Wuhan and Xiamen, Interspeech
Chi Zhang, Ji Wu, Xi Xiao, Zuoying Wang (2006), Pronunciation variation modeling for Mandarin with accent, Interspeech
Kuniko Y. Nielsen (2006), Specificity and generalizability of spontaneous phonetic imitation, Interspeech
Christophe Van Bael, Hans van Halteren (2006), On the sufficiency of automatic phonetic transcriptions for pronunciation variation research, Interspeech
Abe Kazemzadeh, Joseph Tepperman, Jorge Silva, Hong You, Sungbok Lee, Abeer Alwan, Shrikanth Narayanan (2006), Automatic detection of voice onset time contrasts for use in pronunciation assessment, Interspeech
Hiroko Hirano, Goh Kawai, Keikichi Hirose, Nobuaki Minematsu (2006), Unfilled pauses in Japanese sentences read aloud by non-native learners, Interspeech
Ryoji Hamabe, Kiyotaka Uchimoto, Tatsuya Kawahara, Hitoshi Isahara (2006), Detection of quotations and inserted clauses and its application to dependency structure analysis in spontaneous Japanese, Interspeech
Chun-Han Tseng, Chia-Ping Chen (2006), Chinese input method based on reduced Mandarin phonetic alphabet, Interspeech
Yoshimi Suzuki, Fumiyo Fukumoto (2006), Thesaurus expansion using similar word pairs from patent documents, Interspeech
Patrick Schone (2006), Low-resource autodiacritization of abjads for speech keyword search, Interspeech
Susan R. Hertz (2006), A model of the regularities underlying speaker variation: evidence from hybrid synthesis, Interspeech
Augustin Speyer (2006), Pauses as a tool to ensure rhythmic wellformedness, Interspeech
Michiko Watanabe, Yasuharu Den, Keikichi Hirose, Shusaku Miwa, Nobuaki Minematsu (2006), Factors affecting speakers' choice of fillers in Japanese presentations, Interspeech
Marelie Davel, Etienne Barnard (2006), Developing consistent pronunciation models for phonemic variants, Interspeech
Jinsik Lee, Seungwon Kim, Gary Geunbae Lee (2006), Grapheme-to-phoneme conversion using automatically extracted associative rules for Korean TTS system, Interspeech
Paisarn Charoenpornsawat, Tanja Schultz (2006), Example-based grapheme-to-phoneme conversion for Thai, Interspeech
Jason Riesa, Behrang Mohit, Kevin Knight, Daniel Marcu (2006), Building an English-Iraqi Arabic machine translation system for spoken utterances with limited resources, Interspeech
Sameer Maskey, Bowen Zhou, Yuqing Gao (2006), A phrase-level machine translation approach for disfluency detection using weighted finite state transducers, Interspeech
Jonghoon Lee, Donghyeon Lee, Gary Geunbae Lee (2006), Improving phrase-based Korean-English statistical machine translation, Interspeech
David Stallard, Fred Choi, Kriste Krstovski, Prem Natarajan, Rohit Prasad, Shirin Saleem (2006), A hybrid phrase-based/statistical speech translation system, Interspeech
Chao Wang, Stephanie Seneff (2006), High-quality speech translation in the flight domain, Interspeech
Roger Hsiao, Ashish Venugopal, Thilo Köhler, Ying Zhang, Paisarn Charoenpornsawat, Andreas Zollmann, Stephan Vogel, Alan W. Black, Tanja Schultz, Alex Waibel (2006), Optimizing components for handheld two-way speech translation for an English-Iraqi Arabic system, Interspeech
Armin Sehr, Marcus Zeller, Walter Kellermann (2006), Distant-talking continuous speech recognition based on a novel reverberation model in the feature domain, Interspeech
Xin Lei, Jon Hamaker, Xiaodong He (2006), Robust feature space adaptation for telephony speech recognition, Interspeech
Nattanun Thatphithakkul, Boontee Kruatrachue, Chai Wutiwiwatchai, Sanparith Marukatat, Vataya Boonpiam (2006), A simulated-data adaptation technique for robust speech recognition, Interspeech
Hans-Günter Hirsch, Harald Finster (2006), A new HMM adaptation approach for the case of a hands-free speech input in reverberant rooms, Interspeech
Yu Tsao, Chin-Hui Lee (2006), A vector space approach to environment modeling for robust speech recognition, Interspeech
Jen-Tzung Chien, Chuan-Wei Ting (2006), Subspace modeling and selection for noisy speech recognition, Interspeech
Björn Schuller, Niels Köhler, Ronald Müller, Gerhard Rigoll (2006), Recognition of interest in human conversational speech, Interspeech
Hua Ai, Diane J. Litman, Kate Forbes-Riley, Mihai Rotaru, Joel Tetreault, Amruta Purandare (2006), Using system and user performance features to improve emotion detection in spoken tutoring dialogs, Interspeech
Laurence Devillers, Laurence Vidrascu (2006), Real-life emotions detection with lexical and paralinguistic cues on human-human call center dialogs, Interspeech
Janneke Wilting, Emiel Krahmer, Marc Swerts (2006), Real vs. acted emotional speech, Interspeech
Daniel Neiberg, Kjell Elenius, Kornel Laskowski (2006), Emotion recognition in spontaneous speech using GMMs, Interspeech
Frank Enos, Stefan Benus, Robin L. Cautin, Martin Graciarena, Julia Hirschberg, Elizabeth Shriberg (2006), Personality factors in human deception detection: comparing human to machine performance, Interspeech
Leen Cleuren, Jacques Duchateau, Alain Sips, Pol Ghesquière, Hugo Van hamme (2006), Developing an automatic assessment tool for children's oral reading, Interspeech
Christopher Waple, Yasushi Tsubota, Masatake Dantsuji, Tatsuya Kawahara (2006), Prototyping a CALL system for students of Japanese using dynamic diagram generation and interactive hints, Interspeech
Dominic W. Massaro, Ying Liu, Trevor H. Chen, Charles Perfetti (2006), A multilingual embodied conversational agent for tutoring speech and language learning, Interspeech
Michael Heilman, Kevyn Collins-Thompson, Jamie Callan, Maxine Eskenazi (2006), Classroom success of an intelligent tutoring system for lexical practice and reading comprehension, Interspeech
Sarah E. Petersen, Mari Ostendorf (2006), Assessing the reading level of web pages, Interspeech
Jack Mostow (2006), Is ASR accurate enough for automated reading tutors, and how can we tell?, Interspeech
Chiharu Tsurutani, Yutaka Yamauchi, Nobuaki Minematsu, Dean Luo, Kazutaka Maruyama, Keikichi Hirose (2006), Development of a program for self assessment of Japanese pronunciation by English learners, Interspeech
Joseph Tepperman, Jorge Silva, Abe Kazemzadeh, Hong You, Sungbok Lee, Abeer Alwan, Shrikanth Narayanan (2006), Pronunciation verification of children's speech for automatic literacy assessment, Interspeech
Sherif Mahdy Abdou, Salah Eldeen Hamid, Mohsen Rashwan, Abdurrahman Samir, Ossama Abdel-Hamid, Mostafa Shahin, Waleed Nazih (2006), Computer aided pronunciation learning system using speech recognition techniques, Interspeech
Bryce Lobdell, Jont B. Allen (2006), An information theoretic tool for investigating speech perception, Interspeech
Geoffrey Stewart Morrison (2006), An adaptive sampling procedure for speech perception experiments, Interspeech
Navin Viswanathan, James S. Magnuson, Carol A. Fowler (2006), Disentangling gestural and auditory contrast accounts of compensation for coarticulation, Interspeech
Michael C. W. Yip (2006), The role of positional probability in the segmentation of Cantonese speech, Interspeech
Shahina Haque, Tomio Takara (2006), Nasality perception of vowels in different language background, Interspeech
Nao Hodoshima, Dawn Behne, Takayuki Arai (2006), Steady-state suppression in reverberation: a comparison of native and nonnative speech perception, Interspeech
Akiyo Joto (2006), Effect of dynamic information of formants on discrimination of English vowels in consonantal contexts by Japanese listeners, Interspeech
Yue Wang, Dawn Behne, Haisheng Jiang, Chad Danyluck (2006), Native and nonnative audio-visual perception of English fricatives in quiet and cafe-noise backgrounds, Interspeech
Sven Grawunder, Ines Bose, Birgit Hertha, Franziska Trauselt, Lutz Christian Anders (2006), Perceptive and acoustic measurement of average speaking pitch of female and male speakers in German radio news, Interspeech
Peter F. Assmann, Sophia Dembling, Terrance M. Nearey (2006), Effects of frequency shifts on perceived naturalness and gender information in speech, Interspeech
Hitomi Tohyama, Shigeki Matsubara (2006), Influence of pause length on listeners' impressions in simultaneous interpretation, Interspeech
Iris-Corinna Schwarz, Denis Burnham (2006), New measures to chart toddlers' speech perception and language development: a test of the lexical restructuring hypothesis, Interspeech
Ángel de la Torre, Cristina Roldán, Manuel Sainz (2006), Perception of fundamental frequency in cochlear implant patients, Interspeech
Sarah C. Creel, Delphine Dahan, Daniel Swingley (2006), Effects of featural similarity and overlap position on lexical confusions and overt similarity judgments, Interspeech
Hansjörg Mixdorff, Yu Hu (2006), Word structure and tone perception in Mandarin, Interspeech
Cecile Woehrling, Philippe Boula de Mareüil (2006), Identification of regional accents in French: perception and categorization, Interspeech
Sandeep Phatak, Jont B. Allen (2006), Consonant and vowel confusions in speech-weighted noise, Interspeech
Mirjam Broersma (2006), Accident - execute: increased activation in nonnative listening, Interspeech
Kirstin Scholz, Marcel Waltermann, Lu Huo, Alexander Raake, Sebastian Möller, Ulrich Heute (2006), Estimation of the quality dimension "directness/frequency content" for the instrumental assessment of speech quality, Interspeech
Mark Pluymaekers, Mirjam Ernestus, R. Harald Baayen (2006), Effects of word frequency on the acoustic durations of affixes, Interspeech
Xiaochuan Niu, Alexander B. Kain, Jan P. H. van Santen (2006), A noninvasive, low-cost device to study the velopharyngeal port during speech and some preliminary results, Interspeech
Noureddine Aboutabit, Denis Beautemps, Laurent Besacier (2006), Characterization of cued speech vowels from the inner lip contour, Interspeech
Christer Gobl (2006), Modelling aspiration noise during phonation using the LF voice source model, Interspeech
Jianguo Wei, Xugang Lu, Jianwu Dang (2006), A simulation based parameter optimization for a coarticulation model, Interspeech
A. Kacha, Francis Grenez, Jean Schoentgen (2006), Multivariate analysis of frame-based acoustic cues of dysperiodicities in connected speech, Interspeech
Tom Kovacs, Donald S. Finan (2006), Effects of midline tongue piercing on spectral centroid frequencies of sibilants, Interspeech
P. Vijayalakshmi, M. R. Reddy, Douglas O'Shaughnessy (2006), Assessment of articulatory sub-systems of dysarthric speech using an isolated-style phoneme recognition system, Interspeech
Donald S. Finan, Carol A. Boliek (2006), Respiratory/laryngeal interactions during sustained vowel production in children, Interspeech
H. Timothy Bunnell, James B. Polikoff (2006), Acoustic characterization of children with speech delay, Interspeech
Oscar Saz, Antonio Miguel, Eduardo Lleida, Alfonso Ortega, Luis Buera (2006), Study of time and frequency variability in pathological speech and error reduction methods for automatic speech recognition, Interspeech
Markus Iseli, Yen-Liang Shue, Melissa A. Epstein, Patricia Keating, Jody Kreiman, Abeer Alwan (2006), Voice source correlates of prosodic features in American English: a pilot study, Interspeech
Louis ten Bosch, R. Harald Baayen, Mirjam Ernestus (2006), On speech variation and word type differentiation by articulatory feature representations, Interspeech
Sungbok Lee, Erik Bresch, Jason Adams, Abe Kazemzadeh, Shrikanth Narayanan (2006), A study of emotional speech articulation using a fast magnetic resonance imaging technique, Interspeech
Hedvig Kjellström, Olov Engwall, Olle Bälter (2006), Reconstructing tongue movements from audio and video, Interspeech
Gang Feng, Cyril Kotenkoff (2006), New considerations for vowel nasalization based on separate mouth-nose recording, Interspeech
Maeva Garnier, Lucie Bailly, Marion Dohen, Pauline Welby, Helene Loevenbruck (2006), An acoustic and articulatory study of Lombard speech: global effects on the utterance, Interspeech
Laurence Cnockaert, Jean Schoentgen, Pascal Auzou, Canan Ozsancak, Francis Grenez (2006), Tracking of involuntary formant frequency variations and application to parkinsonian speech, Interspeech
Luis Weruaga, Amar Al-Khayat (2006), All-pole model estimation of vocal tract on the frequency domain, Interspeech
Jonathan Darch, Ben Milner (2006), HMM-based MAP prediction of voiced and unvoiced formant frequencies from noisy MFCC vectors, Interspeech
Joseph M. Anand, S. Guruprasad, B. Yegnanarayana (2006), Extracting formants from short segments of speech using group delay functions, Interspeech
I. Yücel Özbek, Mübeccel Demirekler (2006), Tracking of visible vocal tract resonances (VVTR) based on Kalman filtering, Interspeech
Salma Chaari, Kais Ouni, Noureddine Ellouze (2006), Wavelet ridge track interpretation in terms of formants, Interspeech
Mikko Kurimo, Mathias Creutz, Matti Varjokallio, Ebru Arisoy, Murat Saraclar (2006), Unsupervised segmentation of words into morphemes - Morpho Challenge 2005 application to automatic speech recognition, Interspeech
Ebru Arisoy, Murat Saraclar (2006), Lattice extension and rescoring based approaches for LVCSR of Turkish, Interspeech
Catherine Kobus, Geraldine Damnati, Lionel Delphin-Poulat, Renato De Mori (2006), Exploiting semantic relations for a spoken language understanding application, Interspeech
Yuya Akita, Masahiro Saikou, Hiroaki Nanjo, Tatsuya Kawahara (2006), Sentence boundary detection of spontaneous Japanese using statistical language model and support vector machines, Interspeech
Sami Virpioja, Mikko Kurimo (2006), Compact n-gram models by incremental growing and clustering of histories, Interspeech
Nathalie Camelin, Geraldine Damnati, Frederic Bechet, Renato De Mori (2006), Opinion mining in a telephone survey corpus, Interspeech
Antonio M. Peinado, Angel M. Gómez, Victoria Sánchez, José L. Pérez-Córdoba, Antonio J. Rubio (2006), An integrated solution for error concealment in DSR systems over wireless channels, Interspeech
Angel M. Gómez, Antonio M. Peinado, Victoria Sánchez, José L. Carmona, Antonio J. Rubio (2006), Interleaving and MMSE estimation with VQ replicas for distributed speech recognition over lossy packet networks, Interspeech
Gang Chen, Hesham Tolba, Douglas O’Shaughnessy (2006), Noise-robust speech recognition of conversational telephone speech, Interspeech
Shingo Kuroiwa, Satoru Tsuge, Fuji Ren (2006), Lost speech reconstruction method using speech recognition based on missing feature theory and HMM-based speech synthesis, Interspeech
Sid-Ahmed Selouani, Douglas O’Shaughnessy (2006), Speaker adaptation using evolutionary-based linear transform, Interspeech
Jingying Wang, Zuoying Wang (2006), A speaker adaptation algorithm using principal curves in noisy environments, Interspeech
Constance Clarke, Daniel Jurafsky (2006), Limitations of MLLR adaptation with Spanish-accented English: an error analysis, Interspeech
H. Liao, M. J. F. Gales (2006), Issues with uncertainty decoding for noise robust speech recognition, Interspeech
Haitian Xu, Luca Rigazio, David Kryze (2006), Vector Taylor series based joint uncertainty decoding, Interspeech
Qiang Huo, Donglai Zhu (2006), A maximum likelihood training approach to irrelevant variability compensation based on piecewise linear transformations, Interspeech
Arindam Mandal, Mari Ostendorf, Andreas Stolcke (2006), Speaker clustered regression-class trees for MLLR adaptation, Interspeech
Zheng-Hua Tan, Paul Dalsgaard, Børge Lindberg (2006), Robust speech recognition over mobile networks using combined weighted viterbi decoding and subvector based error concealment, Interspeech
Heiga Zen, Yoshihiko Nankaku, Keiichi Tokuda, Tadashi Kitamura (2006), Speaker adaptation of trajectory HMMs using feature-space MLLR, Interspeech
Daniel Povey, George Saon (2006), Feature and model space speaker adaptation with full covariance Gaussians, Interspeech
Adrià de Gispert, José B. Mariño (2006), Linguistic tuple segmentation in n-gram-based statistical machine translation, Interspeech
Takanobu Oba, Takaaki Hori, Atsushi Nakamura (2006), Sentence boundary detection using sequential dependency analysis combined with CRF-based chunking, Interspeech
Srinivas Bangalore, Patrick Haffner, Stephan Kanthak (2006), Sequence classification for machine translation, Interspeech
Yoshiaki Itoh, Takayuki Otake, Kohei Iwata, Kazunori Kojima, Masaaki Ishigame, Kazuyo Tanaka, Shi-wook Lee (2006), Two-stage vocabulary-free spoken document retrieval - subword identification and re-recognition of the identified sections, Interspeech
Mihai Surdeanu, David Dominguez-Sal, Pere R. Comas (2006), Design and performance analysis of a factoid question answering system for spontaneous speech transcriptions, Interspeech
Toshiyuki Takezawa, Tohru Shimizu (2006), Performance improvement of dialog speech translation by rejecting unreliable utterances, Interspeech
Emil Ettelaie, Panayiotis G. Georgiou, Shrikanth Narayanan (2006), Cross-lingual dialog model for speech to speech translation, Interspeech
Murat Akbacak, John H. L. Hansen (2006), A robust fusion method for multilingual spoken document retrieval systems employing tiered resources, Interspeech
Weizhong Zhu, Bowen Zhou, Charles Prosser, Pavel Krbec, Yuqing Gao (2006), Recent advances of IBM’s handheld speech translation system, Interspeech
Svetlana Stenchikova, Dilek Hakkani-Tür, Gokhan Tur (2006), QASR: question answering using semantic roles for speech interface, Interspeech
Jan F. Maas, Britta Wrede, Gerhard Sagerer (2006), Towards a multimodal topic tracking system for a mobile robot, Interspeech
Edward C. Kaiser, Paulo Barthelmess (2006), Edge-splitting in a cumulative multimodal system, for a no-wait temporal threshold on information fusion, combined with an under-specified display, Interspeech
Pui-Yu Hui, Helen M. Meng (2006), Joint interpretation of input speech and pen gestures for multimodal human-computer interaction, Interspeech
David Cournapeau, Tatsuya Kawahara, Kenji Mase, Tomoji Toriyama (2006), Voice activity detector based on enhanced cumulant of LPC residual and on-line EM algorithm, Interspeech
David Huggins-Daines, Alexander I. Rudnicky (2006), A constrained Baum-Welch algorithm for improved phoneme segmentation and efficient training, Interspeech
Fabio Valente (2006), Infinite models for speaker clustering, Interspeech
John Dines, Jithendra Vepa, Thomas Hain (2006), The segmentation of multi-channel meeting recordings for automatic speech recognition, Interspeech
Jen-Wei Kuo, Hsin-Min Wang (2006), Minimum boundary error training for automatic phonetic segmentation, Interspeech
William Schuler, Tim Miller, Stephen Wu, Andrew Exley (2006), Dynamic evidence models in a DBN phone recognizer, Interspeech
B. Ramabhadran, Olivier Siohan, L. Mangu, G. Zweig, M. Westphal, H. Schulz, A. Soneiro (2006), The IBM 2006 speech transcription system for European parliamentary speeches, Interspeech
Christian Fügen, Matthias Wölfel, John W. McDonough, Shajith Ikbal, Florian Kraft, Kornel Laskowski, Mari Ostendorf, Sebastian Stüker, Kenichi Kumatani (2006), Advances in lecture recognition: the ISL RT-06s evaluation system, Interspeech
Mei-Yuh Hwang, Xin Lei, Wen Wang, Takahiro Shinozaki (2006), Investigation on Mandarin broadcast news speech recognition, Interspeech
Xin Lei, Manhung Siu, Mei-Yuh Hwang, Mari Ostendorf, Tan Lee (2006), Improved tone modeling for Mandarin broadcast news speech recognition, Interspeech
Jui-Ting Huang, Lin-shan Lee (2006), Prosodic modeling in large vocabulary Mandarin speech recognition, Interspeech
Ying Sun, Daniel Willett, Raymond Brueckner, Rainer Gruhn, Dirk Bühler (2006), Experiments on Chinese speech recognition with tonal models and pitch estimation using the Mandarin speecon data, Interspeech
Jonas Beskow, Björn Granström, David House (2006), Visual correlates to prominence in several expressive modes, Interspeech
Pashiera Barkhuysen, Emiel Krahmer, Marc Swerts (2006), How auditory and visual prosody is used in end-of-utterance detection, Interspeech
Marc Swerts, Emiel Krahmer (2006), The importance of different facial areas for signalling visual prominence, Interspeech
Josef Chaloupka (2006), Visual speech segmentation and speaker recognition for transcription of TV news, Interspeech
G. Cortés, L. García, Carmen Benítez, José C. Segura (2006), HMM-based continuous sign language recognition using a fast optical flow parameterization of visual information, Interspeech
Xu Shao, Jon Barker (2006), Audio-visual speech recognition in the presence of a competing speaker, Interspeech
Volker Strom, Robert A. J. Clark, Simon King (2006), Expressive prosody for unit-selection speech synthesis, Interspeech
Rolf Carlson, Kjell Gustafson, Eva Strangert (2006), Cues for hesitation in speech synthesis, Interspeech
Francesc Alías, Joan Claudi Socoró, Xavier Sevillano, Ignasi Iriondo, Xavier Gonzalvo (2006), Multi-domain text-to-speech synthesis by automatic text classification, Interspeech
Lifu Yi, Jian Li, Xiaoyan Lou, Jie Hao (2006), Phrase break prediction using logistic generalized linear model, Interspeech
Robert A. J. Clark, Simon King (2006), Joint prosodic and segmental unit selection speech synthesis, Interspeech
Yeon-Jun Kim, Ann K. Syrdal, Alistair Conkie, Mark C. Beutnagel (2006), Phonetically enriched labeling in unit selection TTS synthesis, Interspeech
Jerome R. Bellegarda (2006), Further developments in LSM-based boundary training for unit selection TTS, Interspeech
Takashi Nose, Junichi Yamagishi, Takao Kobayashi (2006), A style control technique for speech synthesis using multiple regression HSMM, Interspeech
Katsumi Ogata, Makoto Tachibana, Junichi Yamagishi, Takao Kobayashi (2006), Acoustic model training based on linear transformation and MAP modification for HSMM-based speech synthesis, Interspeech
Ossama Abdel-Hamid, Sherif Mahdy Abdou, Mohsen Rashwan (2006), Improving Arabic HMM based speech synthesis quality, Interspeech
M. Mehdi Homayounpour, Majid Namnabat (2006), Farsbayan: a unit selection based Farsi speech synthesizer, Interspeech
Tadesse Anberbir, Tomio Takara (2006), Amharic speech synthesis using cepstral method with stress generation rule, Interspeech
Ausdang Thangthai, Chatchawarn Hansakunbuntheung, Rungkarn Siricharoenchai, Chai Wutiwiwatchai (2006), Automatic syllable-pattern induction in statistical Thai text-to-phone transcription, Interspeech
H. J. Oosthuizen, S. T. Phihlela, M. J. D. Manamela (2006), Development of prototype text-to-speech systems for Northern Sotho, Interspeech
Jiali You, Yining Chen, Min Chu, Yong Zhao, Jinlin Wang (2006), Identify language origin of personal names with normalized appearance number of web pages, Interspeech
Christian Weiss, Wolfgang Hess (2006), Conditional random fields for hierarchical segment selection in text-to-speech synthesis, Interspeech
Aleksandra Krul, Géraldine Damnati, François Yvon, Thierry Moudenc (2006), Corpus design based on the Kullback-Leibler divergence for text-to-speech synthesis application, Interspeech
Zhen-Hua Ling, Ren-Hua Wang (2006), HMM-based unit selection using frame sized speech segments, Interspeech
Paul Taylor (2006), The target cost formulation in unit selection speech synthesis, Interspeech
Daniel Tihelka, Jindrich Matousek (2006), Unit selection and its relation to symbolic prosody: a new approach, Interspeech
Yi-Jian Wu, Wu Guo, Ren-Hua Wang (2006), Minimum generation error criterion for tree-based clustering of context dependent HMMs, Interspeech
Heng Kang, Wenju Liu (2006), Selective-LPC based representation of STRAIGHT spectrum and its applications in spectral smoothing, Interspeech
Matthias Jilka, Bernd Möbius (2006), Towards a comprehensive investigation of factors relevant to peak alignment using a unit selection corpus, Interspeech
Robert J. Utama, Ann K. Syrdal, Alistair Conkie (2006), Six approaches to limited domain concatenative speech synthesis, Interspeech
V. Fischer, S. Kunzmann (2006), From pre-recorded prompts to corporate voices: on the migration of interactive voice response applications, Interspeech
Seung Seop Park, Jong Won Shin, Nam Soo Kim (2006), Automatic speech segmentation with multiple statistical models, Interspeech
Kimmo Pärssinen, Marko Moberg (2006), Evaluation of perceptual quality of control point reduction in rule-based synthesis, Interspeech
Geert Coorman (2006), Segment connection networks for corpus-based speech synthesis, Interspeech
Ryo Tsuji, Tomohiko Kasami, Shogo Ishikawa, Shinya Kiriyama, Yoichi Takebayashi, Shigeyoshi Kitazawa (2006), Observations of the spoken language acquisition process based on a multimodal infant behavior corpus, Interspeech
Ellen Marklund, Francisco Lacerda (2006), Infants' ability to extract verbs from continuous speech, Interspeech
Ricardo A.H. Bion, Paola Escudero, Andréia S. Rauber, Barbara O. Baptista (2006), Category formation and the role of spectral quality in the perception and production of English front vowels, Interspeech
Ranka Bijeljac-Babic, Christelle Dodane, Sabine Metta, Claire Gérard (2006), Productions in bilinguism, early foreign language learning and monolinguism: a prosodic comparison, Interspeech
Yukari Hirata, Elizabeth Whitehurst, Emily Cullings, Jacob Whiton, Carol Glenn (2006), Training native English speakers to identify Japanese vowel length with fast rate sentences, Interspeech
Jiang-Chun Chen, Wei-Tang Hsu, J.-S. Roger Jang, Ren-Yuan Lyu, Yuang-Chin Chiang (2006), Formant-based English vowel assessment for Chinese in Taiwan, Interspeech
Jörg Metzner, Marcel Schmittfull, Karl Schnell (2006), Substitute sounds for ventriloquism and speech disorders, Interspeech
Si Wei, Qing-Sheng Liu, Yu Hu, Ren-Hua Wang (2006), Automatic Mandarin pronunciation scoring for native learners with dialect accent, Interspeech
Kengo Fujita, Tsuneo Kato, Hisashi Kawai (2006), Quick individual fitting methods of simplified hearing compensation for elderly people, Interspeech
Xiao Li, Jonathan Malkin, Susumu Harada, Jeff A. Bilmes, Richard Wright, James Landay (2006), An online adaptive filtering algorithm for the vocal joystick, Interspeech
Keigo Nakamura, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano (2006), Speaking aid system for total laryngectomees using voice conversion of body transmitted artificial speech, Interspeech
R. San-Segundo, R. Barra, L. F. D’Haro, J. M. Montero, R. Córdoba, J. Ferreiros (2006), A Spanish speech to sign language translation system for assisting deaf-mute people, Interspeech
Eeva Klintfors, Francisco Lacerda (2006), Potential relevance of audio-visual integration in mammals for computational modeling, Interspeech
C. Anton Rytting (2006), Finding the gaps: applying a connectionist model of word segmentation to noisy phone-recognized speech data, Interspeech
Shizhen Wang, Xiaodong Cui, Abeer Alwan (2006), Rapid speaker adaptation using regression-tree based spectral peak alignment, Interspeech
Chanwoo Kim, Yu-Hsiang Chiu, Richard M. Stern (2006), Physiologically-motivated synchrony-based processing for robust automatic speech recognition, Interspeech
Matthias Walliczek, Florian Kraft, Szu-Chen Jou, Tanja Schultz, Alex Waibel (2006), Sub-word unit based non-audible speech recognition using surface electromyography, Interspeech
Jesús Vicente-Peña, Fernando Díaz-de-María, Bastiaan Kleijn (2006), Individual on-line variance adaptation of frequency filtered parameters for robust ASR, Interspeech
Bing Zhang, Spyros Matsoukas, Richard Schwartz (2006), Recent progress on the discriminative region-dependent transform for speech feature extraction, Interspeech
Jan Rademacher, Matthias Wächter, Alfred Mertins (2006), Improved warping-invariant features for automatic speech recognition, Interspeech
Ani Nenkova (2006), Summarization evaluation for text and speech: issues and approaches, Interspeech
Xiaodan Zhu, Gerald Penn (2006), Summarization of spontaneous conversations, Interspeech
Pierre Chatain, Edward Whittaker, Joanna Mrozinski, Sadaoki Furui (2006), Perplexity based linguistic model adaptation for speech summarisation, Interspeech
Lin-shan Lee, Sheng-yi Kong, Yi-cheng Pan, Yi-sheng Fu, Yu-tsun Huang (2006), Multi-layered summarization of spoken document archives by information extraction and semantic structuring, Interspeech
Sameer Maskey, Julia Hirschberg (2006), Soundbite detection in broadcast news domain, Interspeech
Gabriel Murray, Steve Renals (2006), Dialogue act compression via pitch contour preservation, Interspeech
Toshiaki Kubo, Tetsuji Ogawa, Tetsunori Kobayashi (2006), Manifold HLDA and its application to robust speech recognition, Interspeech
Luis Buera, Eduardo Lleida, Juan A. Nolazco-Flores, Antonio Miguel, Alfonso Ortega (2006), Time-dependent cross-probability model for multi-environment model based linear normalization, Interspeech
Daniel Povey (2006), SPAM and full covariance for speech recognition, Interspeech
Sakriani Sakti, Konstantin Markov, Satoshi Nakamura (2006), The use of Bayesian network for incorporating accent, gender and wide-context dependency information, Interspeech
Yu Wang, Eric Fosler-Lussier (2006), Integrating phonetic boundary discrimination explicitly into HMM systems, Interspeech
Zhimin Xie, Partha Niyogi (2006), Robust acoustic-based syllable detection, Interspeech
Lei He, Jie Hao (2006), A tone recognition framework for continuous Mandarin speech, Interspeech
Annika Hämäläinen, Louis ten Bosch, Lou Boves (2006), Pronunciation variant-based multi-path HMMs for syllables, Interspeech
Junho Park, Hanseok Ko (2006), A new state-dependent phonetic tied-mixture model with head-body-tail structured HMM for real-time continuous phoneme recognition system, Interspeech
Andrej Zgank, Zdravko Kacic (2006), Conversion from phoneme based to grapheme based acoustic models for speech recognition, Interspeech
Bong-Wan Kim, Dae-Lim Choi, Yongnam Um, Yong-Ju Lee (2006), Phone vector DHMM to decode a phone recognizer's output, Interspeech
T. Nagarajan, P. Vijayalakshmi, Douglas O'Shaughnessy (2006), Combining multiple-sized sub-word units in a speech recognition system using baseform selection, Interspeech
Antonio Miguel, Eduardo Lleida, Alfons Juan, Luis Buera, Alfonso Ortega, Oscar Saz (2006), Local transformation models for speech recognition, Interspeech
Toru Imai, Shoei Sato, Akio Kobayashi, Kazuo Onoe, Shinichi Homma (2006), Online speech detection and dual-gender speech recognition for captioning broadcast news, Interspeech
Timothy J. Hazen (2006), Automatic alignment and error correction of human generated transcripts for long speech recordings, Interspeech
Shuangyu Chang (2006), Improving speech recognition accuracy with multi-confidence thresholding, Interspeech
Christophe Servan, Christian Raymond, Frédéric Béchet, Pascal Nocéra (2006), Conceptual decoding from word lattices: application to the spoken dialogue corpus MEDIA, Interspeech
Shilei Huang, Xiang Xie, Jingming Kuang (2006), Improving the performance of out-of-vocabulary word rejection by using support vector machines, Interspeech
Kris Demuynck, Dirk Van Compernolle, Hugo Van hamme (2006), Robust phone lattice decoding, Interspeech
Benjamin Lecouteux, Georges Linarès, Pascal Nocéra, Jean-François Bonastre (2006), Imperfect transcript driven speech recognition, Interspeech
Jian Xue, Rusheng Hu, Yunxin Zhao (2006), New improvements in decoding speed and latency for automatic captioning, Interspeech
Shirin Saleem, Rohit Prasad, Prem Natarajan (2006), Colloquial Iraqi ASR for speech translation, Interspeech
Tomohiro Hakamata, Akinobu Lee, Yoshihiko Nankaku, Keiichi Tokuda (2006), Reducing computation on parallel decoding using frame-wise confidence scores, Interspeech
Hamed Ketabdar, Jithendra Vepa, Samy Bengio, Hervé Bourlard (2006), Posterior based keyword spotting with a priori thresholds, Interspeech
Zhengyu Zhou, Helen M. Meng, Wai Kit Lo (2006), A multi-pass error detection and correction framework for Mandarin LVCSR, Interspeech
Jan Nouza, Jindrich Zdansky, Petr Cerva, Jan Kolorenc (2006), Continual on-line monitoring of Czech spoken broadcast programs, Interspeech
Shilei Zhang, Hongchen Jiang, Shuwu Zhang, Bo Xu (2006), Fast SVM training based on the choice of effective samples for audio classification, Interspeech
Joerg Schmalenstroeer, Reinhold Haeb-Umbach (2006), Online speaker change detection by combining BIC with microphone array beamforming, Interspeech
Javier Ramírez, Pablo Yélamos, J. M. Górriz, José C. Segura, L. García (2006), Speech/non-speech discrimination combining advanced feature extraction and SVM learning, Interspeech
Safaa Jarifi, Dominique Pastor, Olivier Rosec (2006), Cooperation between global and local methods for the automatic segmentation of speech synthesis corpora, Interspeech
Martin Heckmann, Marco Moebus, Frank Joublin, Christian Goerick (2006), Speaker independent voiced-unvoiced detection evaluated in different speaking styles, Interspeech
Xavier Anguera, Chuck Wooters, Jose M. Pardo (2006), Robust speaker diarization for meetings: ICSI RT06s evaluation system, Interspeech
André Coy, Jon Barker (2006), A multipitch tracker for monaural speech segmentation, Interspeech
Rahul Chitturi, Mark Hasegawa-Johnson (2006), Novel entropy based moving average refiners for HMM landmarks, Interspeech
Gibak Kim, Nam Ik Cho (2006), Two-microphone voice activity detection in the presence of coherent interference, Interspeech
Tor André Myrvoll, Tomoko Matsui (2006), On a greedy learning algorithm for dPLRM with applications to phonetic feature detection, Interspeech
Elliot Moore II, Juan Torres (2006), Improving glottal waveform estimation through rank-based glottal quality assessment, Interspeech
Francesc Alías, Carlos Monzo, Joan Claudi Socoró (2006), A pitch marks filtering algorithm based on restricted dynamic programming, Interspeech
Nicolas Malyska, Thomas F. Quatieri (2006), Analysis of nonmodal phonation using minimum entropy deconvolution, Interspeech
Tomoyasu Nakano, Masataka Goto, Yuzuru Hiraga (2006), An automatic singing skill evaluation method for unknown melodies using pitch interval accuracy and vibrato features, Interspeech
Stephen A. Zahorian, Princy Dikshit, Hongbing Hu (2006), A spectral-temporal method for pitch tracking, Interspeech
M. Shahidur Rahman, Hirobumi Tanaka, Tetsuya Shimamura (2006), Pitch determination using aligned AMDF, Interspeech
Yan Han, Lou Boves (2006), Syllable-length path mixture hidden Markov models with trajectory clustering for continuous speech recognition, Interspeech
Tobias Cincarek, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano (2006), Acoustic modeling for spoken dialogue systems based on unsupervised utterance-based selective training, Interspeech
Christophe Lévy, Georges Linarès, Jean-François Bonastre (2006), GMM-based acoustic modeling for embedded speech recognition, Interspeech
Mathias De Wachter, Kris Demuynck, Dirk Van Compernolle (2006), Boosting HMM performance with a memory upgrade, Interspeech
Y. Deng, X. Li, C. Kwan, R. Xu, B. Raj, Richard M. Stern, D. Williamson (2006), An integrated approach to improve speech recognition rate for non-native speakers, Interspeech
Rusheng Hu, Yunxin Zhao (2006), Bayesian decision tree state tying for conversational speech recognition, Interspeech
Barry Kirkpatrick, Darragh O’Brien, Ronán Scaife (2006), Feature extraction for spectral continuity measures in concatenative speech synthesis, Interspeech
Shinsuke Sakai, Tatsuya Kawahara (2006), Decision tree-based training of probabilistic concatenation models for corpus-based speech synthesis, Interspeech
Yong Zhao, Di Peng, Lijuan Wang, Min Chu, Yining Chen, Peng Yu, Jun Guo (2006), Constructing stylistic synthesis databases from audio books, Interspeech
Alistair Conkie, Ann K. Syrdal (2006), Expanding phonetic coverage in unit selection synthesis through unit substitution from a donor voice, Interspeech
Paul Taylor (2006), Unifying unit selection and hidden Markov model speech synthesis, Interspeech
Alan W. Black (2006), CLUSTERGEN: a statistical parametric synthesizer using trajectory modeling, Interspeech
Verena Rieser, Oliver Lemon (2006), Cluster-based user simulations for learning dialogue strategies, Interspeech
Charles Lewis, Giuseppe Di Fabbrizio (2006), Prompt selection with reinforcement learning in an AT&T call routing application, Interspeech
Silke Goronzy, Raquel Mochales, Nicole Beringer (2006), Developing speech dialogs for multimodal HMIs using finite state machines, Interspeech
Norbert Pfleger, Jan Schehl (2006), Development of advanced dialog systems with PATE, Interspeech
Rajah Annamalai Subramanian, Philip Cohen (2006), A joint intention-based dialogue engine, Interspeech
Sebastian Möller, Roman Englert, Klaus Engelbrecht, Verena Hafner, Anthony Jameson, Antti Oulasvirta, Alexander Raake, Norbert Reithinger (2006), Memo: towards automatic usability evaluation of spoken dialogue services by user error simulations, Interspeech
Brett Matthews, Raimo Bakis, Ellen Eide (2006), Synthesizing breathiness in natural speech with sinusoidal modelling, Interspeech
Mauro Nicolao, Carlo Drioli, Piero Cosi (2006), Voice GMM modelling for FESTIVAL/MBROLA emotive TTS synthesis, Interspeech
João P. Cabral, Luís C. Oliveira (2006), Emovoice: a system to generate emotions in speech, Interspeech
Zhiyong Wu, Shen Zhang, Lianhong Cai, Helen M. Meng (2006), Real-time synthesis of Chinese visual speech and facial expressions using MPEG-4 FAP features in a three-dimensional avatar, Interspeech
Hongwu Yang, Helen M. Meng, Lianhong Cai (2006), Modeling the acoustic correlates of expressive elements in text genres for expressive text-to-speech synthesis, Interspeech
Sheng Zhang, P. C. Ching, Fanrang Kong (2006), Automatic emotion recognition of speech signal in Mandarin, Interspeech
Yi-hao Kao, Lin-shan Lee (2006), Feature analysis for emotion recognition from Mandarin speech considering the special characteristics of Chinese language, Interspeech
Björn Schuller, Gerhard Rigoll (2006), Timing levels in segment-based speech emotion recognition, Interspeech
Ryuichi Nisimura, Souji Omae, Hideki Kawahara, Toshio Irino (2006), Analyzing dialogue data for real-world emotional speech classification, Interspeech
Cecilia Ovesdotter Alm, Xavier Llorà (2006), Evolving emotional prosody, Interspeech
Xin Luo, Qian-Jie Fu, John J. Galvin III (2006), Vocal emotion recognition with cochlear implants, Interspeech
S. Matsunaga, S. Sakaguchi, M. Yamashita, S. Miyahara, S. Nishitani, K. Shinohara (2006), Emotion detection in infants' cries based on a maximum likelihood approach, Interspeech
Joseph Tepperman, David Traum, Shrikanth Narayanan (2006), yeah right: sarcasm recognition for spoken dialogue systems, Interspeech
Rohit Kumar, Carolyn P. Rosé, Diane J. Litman (2006), Identification of confusion and surprise in spoken dialog using prosodic features, Interspeech
Tin Lay Nwe, Haizhou Li, Minghui Dong (2006), Analysis and detection of speech under sleep deprivation, Interspeech
Ioana Vasilescu, Martine Adda-Decker (2006), Language, gender, speaking style and language proficiency as factors influencing the autonomous vocalic filler production in spontaneous speech, Interspeech
Caroline Lavecchia, Kamel Smaïli, Jean-Paul Haton (2006), How to handle gender and number agreement in statistical language models?, Interspeech
Oscar Chan, Roberto Togneri (2006), Prosodic features for a maximum entropy language model, Interspeech
Shinsuke Mori (2006), Language model adaptation with a word list and a raw corpus, Interspeech
Pascal Wiggers, Léon J.M. Rothkrantz (2006), Topic-based language modeling with dynamic Bayesian networks, Interspeech
Hirofumi Yamamoto, Genichiro Kikui, Satoshi Nakamura, Yoshinori Sagisaka (2006), Speech recognition of foreign out-of-vocabulary words using a hierarchical language model, Interspeech
Xinhui Hu, Hirofumi Yamamoto, Genichiro Kikui, Yoshinori Sagisaka (2006), Language modeling of Chinese personal names based on character units for continuous Chinese speech recognition, Interspeech
A. Lakshmi, Hema A. Murthy (2006), A syllable based continuous speech recognizer for Tamil, Interspeech
Monika Woszczyna, Paisarn Charoenpornsawat, Tanja Schultz (2006), Spontaneous Thai speech recognition, Interspeech
M. Gerosa, D. Giuliani, Shrikanth Narayanan (2006), Acoustic analysis and automatic recognition of spontaneous children's speech, Interspeech
Keith Vertanen (2006), Speech and speech recognition during dictation corrections, Interspeech
Lubos Smídl, Josef V. Psutka (2006), Comparison of keyword spotting methods for searching in speech, Interspeech
Mithun Balakrishna, Cyril Cerovic, Dan Moldovan, Ellis Cave (2006), Automatic generation of statistical language models for interactive voice response applications, Interspeech
Yun-Cheng Ju, Ye-Yi Wang, Alex Acero (2006), Call analysis with classification using speech and non-speech features, Interspeech
Wei-Lin Wu, Ru-Zhan Lu, Hui Liu, Feng Gao (2006), A spoken language understanding approach using successive learners, Interspeech
Osamuyimen Stewart, Juan Huerta, Ea-Ee Jan, Cheng Wu, Xiang Li, David Lubensky (2006), Conversational help desk: vague callers and context switch, Interspeech
Sophie Rosset, Olivier Galibert, Gabriel Illouz, Aurélien Max (2006), Integrating spoken dialog and question answering: the Ritel project, Interspeech
Thomas Prommer, Hartwig Holzapfel, Alex Waibel (2006), Rapid simulation-driven reinforcement learning of multimodal dialog strategies in human-robot interaction, Interspeech
Gregory Aist, James Allen, Ellen Campana, Lucian Galescu, Carlos A. Gómez Gallo, Scott C. Stoness, Mary Swift, Michael Tanenhaus (2006), Software architectures for incremental understanding of human speech, Interspeech
Florian Schiel, Christoph Draxler, Marion Libossek (2006), Lingua machinae - an unorthodox proposal, Interspeech
Heather Pon-Barry, Fuliang Weng, Sebastian Varges (2006), Evaluation of content presentation strategies for an in-car spoken dialogue system, Interspeech
Vaibhava Goel, Ramesh Gopinath (2006), On designing context sensitive language models for spoken dialog systems, Interspeech
Yang Liu (2006), Using SVM and error-correcting codes for multiclass dialog act classification in meeting corpus, Interspeech
Hartwig Holzapfel, Alex Waibel (2006), A multilingual expectations model for contextual utterances in mixed-initiative spoken dialogue, Interspeech
Yuichiro Fukubayashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno (2006), Dynamic help generation by estimating user's mental model in spoken dialogue systems, Interspeech
Dinoj Surendran, Gina-Anne Levow (2006), Dialog act tagging with support vector machines and hidden Markov models, Interspeech
Ángel de la Torre, Javier Ramírez, Carmen Benítez, José C. Segura, L. García, Antonio J. Rubio (2006), Noise robust model-based voice activity detection, Interspeech
Yu Shi, Frank K. Soong, Jian-Lai Zhou (2006), Auto-segmentation based VAD for robust ASR, Interspeech
Kofi Boakye, Andreas Stolcke (2006), Improved speech activity detection using cross-channel features for recognition of multiparty meetings, Interspeech
Yusuke Kida, Tatsuya Kawahara (2006), Evaluation of voice activity detection by combining multiple features with weight adaptation, Interspeech
Keansub Lee, Daniel P. W. Ellis (2006), Voice activity detection in personal audio recordings using autocorrelogram compensation, Interspeech
Ryan Rifkin, Nima Mesgarani (2006), Discriminating speech and non-speech with regularized least squares, Interspeech
John Lee, Stephanie Seneff (2006), Automatic grammar correction for second-language learners, Interspeech
Ambra Neri, Catia Cucchiarini, Helmer Strik (2006), ASR-based corrective feedback on pronunciation: does it really work?, Interspeech
Minghui Dong, Haizhou Li, Tin Lay Nwe (2006), Evaluating prosody of Mandarin speech for language learning, Interspeech
Isabel Trancoso, Carlos Duarte, António Serralheiro, Diamantino Caseiro, Luís Carriço, Céu Viana (2006), Spoken language technologies applied to digital talking books, Interspeech
Akemi Iida, Jun Ito, Shimpei Kajima, Tsutomu Sugawara (2006), Building an English speech synthesis system from a Japanese ALS patient's voice, Interspeech
Alexey Karpov, Andrey Ronzhin, Alexandre Cadiou (2006), Multi-modal system ICANDO: intellectual computer assistant for disabled operators, Interspeech
Gabriel Skantze, David House, Jens Edlund (2006), User responses to prosodic variation in fragmentary grounding utterances in dialog, Interspeech
Carlos Toshinori Ishi, Hiroshi Ishiguro, Norihiro Hagita (2006), Analysis of prosodic and linguistic cues of phrase finals for turn-taking and dialog acts, Interspeech
David Schlangen (2006), From reaction to prediction: experiments with computational models of turn-taking, Interspeech
Jáchym Kolár, Elizabeth Shriberg, Yang Liu (2006), On speaker-specific prosodic models for automatic dialog act segmentation of multi-party meetings, Interspeech
Nigel G. Ward, Yaffa Al Bayyari (2006), A case study in the identification of prosodic cues to turn-taking: back-channeling in Arabic, Interspeech
Jens Edlund, Mattias Heldner (2006), /nailon/ - software for online analysis of prosody, Interspeech
Junfeng Li, Masato Akagi, Yôiti Suzuki (2006), Improved hybrid microphone array post-filter by integrating a robust speech absence probability estimator for speech enhancement, Interspeech
Timo Gerkmann, Rainer Martin (2006), Soft decision combining for dual channel noise reduction, Interspeech
Guo Chen, Vijay Parsa (2006), An improved affine projection algorithm based crosstalk resistant adaptive noise canceller, Interspeech
Stamatis Leukimmiatis, Dimitrios Dimitriadis, Petros Maragos (2006), An optimum microphone array post-filter for speech applications, Interspeech
Federico Flego, Maurizio Omologo (2006), Multi-microphone periodicity function for robust F0 estimation in real noisy and reverberant environments, Interspeech
H. R. Abutalebi, M. Pourahmadi, M.R. Aghabozorgi (2006), A new dual-microphone speech enhancement method for oriented noises, Interspeech
Andrew Lovitt, Jont B. Allen (2006), 50 years late: repeating Miller-Nicely 1955, Interspeech
Shuichi Sakamoto, Tadahiro Yoshikawa, Shigeaki Amano, Yôiti Suzuki, Tadahisa Kondo (2006), New 20-word lists for word intelligibility test in Japanese, Interspeech
Guoping Li, Mark E. Lutman (2006), Sparseness and speech perception in noise, Interspeech
Wei M. Liu, John S. D. Mason, Nicholas W. D. Evans, Keith A. Jellyman (2006), An assessment of automatic speech recognition as speech intelligibility estimation in the context of additive noise, Interspeech
Marcel Wältermann, Kirstin Scholz, Alexander Raake, Ulrich Heute, Sebastian Möller (2006), Underlying quality dimensions of modern telephone connections, Interspeech
Guo Chen, Vijay Parsa, Susan Scollie (2006), An ERB loudness pattern based objective speech quality measure, Interspeech
Huazhong Ning, Ming Liu, Hao Tang, Thomas S. Huang (2006), A spectral clustering approach to speaker diarization, Interspeech
Jindrich Zdansky (2006), BINSEG: an efficient speaker-based segmentation technique, Interspeech
Ascensión Gallardo-Antolín, Xavier Anguera, Chuck Wooters (2006), Multi-stream speaker diarization systems for the meetings domain, Interspeech
Carla Lopes, Fernando Perdigão (2006), Improved performance evaluation of speech event detectors, Interspeech
Jose M. Pardo, Xavier Anguera, Chuck Wooters (2006), Speaker diarization for multiple distant microphone meetings: mixing acoustic features and inter-channel time differences, Interspeech
Tuan Van Pham, Gernot Kubin (2006), Low-complexity and efficient classification of voiced/unvoiced/silence for noisy environments, Interspeech
Motoyuki Suzuki, Yasutomo Kajiura, Akinori Ito, Shozo Makino (2006), Unsupervised language model adaptation based on automatic text collection from WWW, Interspeech
Yik-Cheung Tam, Tanja Schultz (2006), Unsupervised language model adaptation using latent semantic marginals, Interspeech
David Mrva, Philip C. Woodland (2006), Unsupervised language model adaptation for Mandarin broadcast conversation transcription, Interspeech
Dietrich Klakow (2006), Language model adaptation for tiny adaptation corpora, Interspeech
Andrej Ljolje (2006), Pronunciation dependent language models, Interspeech
Amit Anil Nanavati, Nitendra Rajput (2006), Improving perplexity measures to incorporate acoustic confusability, Interspeech
Long Qin, Yi-Jian Wu, Zhen-Hua Ling, Ren-Hua Wang (2006), Improving the performance of HMM-based voice conversion using context clustering decision tree and appropriate regression matrix format, Interspeech
Chung-Han Lee, Chung-Hsien Wu (2006), MAP-based adaptation for speech conversion using adaptation data selection and non-parallel training, Interspeech
Jani Nurminen, Jilei Tian, Victor Popa (2006), Novel method for data clustering and mode selection with application in voice conversion, Interspeech
David Sündermann, Harald Höge, Antonio Bonafonte, Hermann Ney, Julia Hirschberg (2006), Text-independent cross-language voice conversion, Interspeech
Yamato Ohtani, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano (2006), Maximum likelihood voice conversion based on GMM with STRAIGHT mixed excitation, Interspeech
Mikihiro Nakagiri, Tomoki Toda, Hideki Kashioka, Kiyohiro Shikano (2006), Improving body transmitted unvoiced speech with statistical voice conversion, Interspeech
Keijiro Saino, Heiga Zen, Yoshihiko Nankaku, Akinobu Lee, Keiichi Tokuda (2006), An HMM-based singing voice synthesis system, Interspeech
Yosuke Uto, Yoshihiko Nankaku, Tomoki Toda, Akinobu Lee, Keiichi Tokuda (2006), Voice conversion based on mixtures of factor analyzers, Interspeech
Jilei Tian, Jani Nurminen, Victor Popa (2006), Efficient Gaussian mixture model evaluation in voice conversion, Interspeech
Yuji Nakano, Makoto Tachibana, Junichi Yamagishi, Takao Kobayashi (2006), Constrained structural maximum a posteriori linear regression for average-voice-based speech synthesis, Interspeech
Zhi-Wei Shuang, Raimo Bakis, Slava Shechtman, Dan Chazan, Yong Qin (2006), Frequency warping based on mapping formant parameters, Interspeech
Cheng-Yuan Lin, J.-S. Roger Jang (2006), Automatic phonetic segmentation by using a SPM-based approach for a Mandarin singing voice corpus, Interspeech
Partha Lal (2006), A comparison of singing evaluation algorithms, Interspeech
Raymond W. M. Ng, Tan Lee, Wentao Gu (2006), Towards automatic parameter extraction of command-response model for Cantonese, Interspeech
Francisco Campillo, Jan P. H. van Santen, Eduardo R. Banga (2006), A model for the f0 reset in corpus-based intonation approaches, Interspeech
Gérard Bailly, Jan Gorisch (2006), Generating German intonation with a trainable prosodic model, Interspeech
Seungwon Kim, Jinsik Lee, Byeongchang Kim, Gary Geunbae Lee (2006), Incorporating second-order information into two-step major phrase break prediction for Korean, Interspeech
Lifu Yi, Jian Li, Xiaoyan Lou, Jie Hao (2006), Totally data-driven duration modeling based on generalized linear model for Mandarin TTS, Interspeech
Özlem Özturk, Tolga Ciloglu (2006), Segmental duration modeling in Turkish, Interspeech
Rogier C. van Dalen, Pascal Wiggers, Léon J. M. Rothkrantz (2006), Lexical stress in continuous speech recognition, Interspeech
Siwei Wang, Gina-Anne Levow (2006), Improving tone recognition with combined frequency and amplitude modelling, Interspeech
Che-Kuang Lin, Lin-shan Lee (2006), Latent prosodic modeling (LPM) for speech with applications in recognizing spontaneous Mandarin speech with disfluencies, Interspeech
Keikichi Hirose, Hui Hu, Xiaodong Wang, Nobuaki Minematsu (2006), Tone recognition of continuous speech of standard Chinese using neural network and tone nucleus model, Interspeech
Thamar Solorio, Olac Fuentes, Nigel G. Ward, Yaffa Al Bayyari (2006), Prosodic feature generation for back-channel prediction, Interspeech
Wieneke Wesseling, Rob J. J. H. van Son, Louis C. W. Pols (2006), On the sufficiency and redundancy of pitch for TRP projection, Interspeech
Matthew Gibson, Thomas Hain (2006), Hypothesis spaces for minimum Bayes risk training in large vocabulary speech recognition, Interspeech
Jun Du, Peng Liu, Frank K. Soong, Jian-Lai Zhou, Ren-Hua Wang (2006), Minimum divergence based discriminative training, Interspeech
Xinwei Li, Hui Jiang (2006), Solving large margin estimation of HMMs via semidefinite programming, Interspeech
Dong Yu, Li Deng, Xiaodong He, Alex Acero (2006), Use of incrementally regulated discriminative margins in MCE training for speech recognition, Interspeech
Jinyu Li, Ming Yuan, Chin-Hui Lee (2006), Soft margin estimation of hidden Markov model parameters, Interspeech
Ye-Yi Wang, Alex Acero (2006), Discriminative models for spoken language understanding, Interspeech
G. Gibert, Gérard Bailly, F. Elisei (2006), Evaluating a virtual speech cuer, Interspeech
Laura Mayfield Tomokiyo, Kay Peterson, Alan W. Black, Kevin A. Lenzo (2006), Intelligibility of machine translation output in speech synthesis, Interspeech
Makoto Tachibana, Takashi Nose, Junichi Yamagishi, Takao Kobayashi (2006), A technique for controlling voice quality of synthetic speech using multiple regression HSMM, Interspeech
Tatyana Polyakova, Antonio Bonafonte (2006), Learning from errors in grapheme-to-phoneme conversion, Interspeech
Tomoki Toda, Yamato Ohtani, Kiyohiro Shikano (2006), Eigenvoice conversion based on Gaussian mixture model, Interspeech
Brian Langner, Rohit Kumar, Arthur Chan, Lingyun Gu, Alan W. Black (2006), Generating time-constrained audio presentations of structured information, Interspeech
F. Alsaade, A. Ariyaeeinia, L. Meng, A. Malegaonkar (2006), Multimodal authentication using qualitative support vector machines, Interspeech
Vassilis Pitsikalis, Athanassios Katsamanis, George Papandreou, Petros Maragos (2006), Adaptive multimodal fusion by uncertainty compensation, Interspeech
Debra M. Hardison (2006), Effects of familiarity with faces and voices on second-language speech processing: components of memory traces, Interspeech
Satoshi Tamura, Koji Hashimoto, Jiong Zhu, Satoru Hayamizu, Hirotsugu Asai, Hideki Tanahashi, Makoto Kanagawa (2006), Automatic metadata generation and video editing based on speech and image recognition for medical education contents, Interspeech
Ibrahim Almajai, Ben Milner, Jonathan Darch (2006), Analysis of correlation between audio and visual speech features for clean audio feature prediction in noise, Interspeech
Oxana Govokhina, Gérard Bailly, Gaspard Breton, Paul Bagshaw (2006), TDA: a new trainable trajectory formation system for facial animation, Interspeech
Giorgio Biagetti, Paolo Crippa, Claudio Turchetti (2006), Modeling of speech signals based on Bessel-like orthogonal transform, Interspeech
Pamornpol Jinachitra (2006), Glottal closure and opening detection for flexible parametric voice coding, Interspeech
Jan Trmal, Jan Vanek, Ludek Müller, Jan Zelinka (2006), Independent components for acoustic modeling, Interspeech
Daryush Mehta, Thomas F. Quatieri (2006), Pitch-scale modification using the modulated aspiration noise source, Interspeech
Tony Ezzat, Jake Bouvrie, Tomaso Poggio (2006), Max-Gabor analysis and synthesis of spectrograms, Interspeech
Pedro J. Quintana-Morales, Juan L. Navarro-Mesa, Antonio G. Ravelo-Garcia, Fernando D. Lorenzo-Garcia (2006), Monitoring of the natural voice variations in open and closed phases with frequency warped ARMA modeling, Interspeech
Hirokazu Kameoka, Jonathan Le Roux, Nobutaka Ono, Shigeki Sagayama (2006), Speech analyzer using a joint estimation model of spectral envelope and fine structure, Interspeech
Andrew Errity, John McKenna (2006), An investigation of manifold learning for speech analysis, Interspeech
Jake Bouvrie, Tony Ezzat (2006), An incremental algorithm for signal reconstruction from short-time Fourier transform magnitude, Interspeech
Toru Takahashi, Masashi Nishi, Toshio Irino, Hideki Kawahara (2006), Automatic assignment of anchoring points on vowel templates for defining correspondence between time-frequency representations of speech samples, Interspeech
S. Prasad, S. Srinivasan, M. Pannuri, G. Lazarou, Joseph Picone (2006), Nonlinear dynamical invariants for speech recognition, Interspeech
Shih-Hsiang Lin, Yao-Ming Yeh, Berlin Chen (2006), Exploiting polynomial-fit histogram equalization and temporal average for robust speech recognition, Interspeech
Sébastien Demange, Christophe Cerisara, Jean-Paul Haton (2006), Missing data mask models with global frequency and temporal constraints, Interspeech
Hemant Misra, Jithendra Vepa, Hervé Bourlard (2006), Multi-stream ASR: an oracle perspective, Interspeech
Koji Iwano, Kaname Kojima, Sadaoki Furui (2006), A weight estimation method using LDA for multi-band speech recognition, Interspeech
Chang-wen Hsu, Lin-shan Lee (2006), Powered cepstral normalization (p-CN) for robust features in speech recognition, Interspeech
Pei Ding, Lei He, Xiang Yan, Jie Hao (2006), Robust automatic speech recognition for accented Mandarin in car environments, Interspeech
Xugang Lu, Masashi Unoki, Masato Akagi (2006), A robust feature extraction based on the MTF concept for speech recognition in reverberant environment, Interspeech
Young Joon Kim, Woohyung Lim, Nam Soo Kim (2006), Clean speech feature estimation based on soft spectral masking, Interspeech
Mansoor Vali, Seyyed Ali Seyyed Salehi, Kazem Karimi (2006), Robust speech recognition by modifying clean and telephone feature vectors using bidirectional neural network, Interspeech
Chung-fu Tai, Jeih-weih Hung (2006), Silence energy normalization for robust speech recognition in additive noise environment, Interspeech
Maarten Van Segbroeck, Hugo Van hamme (2006), Handling convolutional noise in missing data automatic speech recognition, Interspeech
Norihide Kitaoka, Souta Hamaguchi, Seiichi Nakagawa (2006), Noisy speech recognition based on selection of multiple noise suppression methods using noise GMMs, Interspeech
Guillermo Aradilla, Jithendra Vepa, Hervé Bourlard (2006), Using posterior-based features in template matching for speech recognition, Interspeech
Yasunari Obuchi, Nobuo Hataoka (2006), Hypothesis-based feature combination of multiple speech inputs for robust speech recognition in automotive environments, Interspeech
Zbynek Koldovsky, Jan Nouza, Jan Kolorenc (2006), Continuous time-frequency masking method for blind speech separation with adaptive choice of threshold parameter using ICA, Interspeech
Yanxue Liang, Ichiro Hagiwara (2006), Multistage convolutive blind source separation for speech mixture, Interspeech
Futoshi Asano, Jun Ogata (2006), Detection and separation of speech events in meeting recordings, Interspeech
Alberto Abad, Carlos Segura, Dušan Macho, Javier Hernando, Climent Nadeu (2006), Audio person tracking in a smart-room environment, Interspeech
Tobias Gehrig, Ulrich Klee, John W. McDonough, Shajith Ikbal, Matthias Wölfel, Christian Fügen (2006), Tracking and beamforming for multiple simultaneous speakers with probabilistic data association filters, Interspeech
Martin Heckmann, Tobias Rodemann, Bjorn Scholling, Frank Joublin, Christian Goerick (2006), Modeling the precedence effect for binaural sound source localization in noisy and echoic environments, Interspeech
Fotios Talantzis, Anthony G. Constantinides, Lazaros C. Polymenakos (2006), Using a differential microphone array to estimate the direction of arrival of two acoustic sources, Interspeech
Alessio Brutti, Maurizio Omologo, Piergiorgio Svaizer (2006), Speaker localization based on oriented global coherence field, Interspeech
M. H. Radfar, R. M. Dansereau, A. Sayadiyan (2006), Performance evaluation of three features for model-based single channel speech separation problem, Interspeech
Mikkel N. Schmidt, Rasmus K. Olsson (2006), Single-channel speech separation using sparse non-negative matrix factorization, Interspeech
Rong Hu, Yunxin Zhao (2006), Adaptive speech enhancement for speech separation in diffuse noise, Interspeech
H. T. Attias (2006), A probabilistic graphical model for microphone array source separation using rich pre-trained source models, Interspeech
Erik Visser (2006), Geometrically constrained permutation-free source separation in an undercomplete speech unmixing scenario, Interspeech
Dirk Olszewski, Klaus Linhard (2006), Highly directional multi-beam audio loudspeaker, Interspeech
Reiko Mazuka (2020), Intonational phonology can shed light on the nature of prosody in Japanese children with ASD: Dissociating linguistic and para-linguistic aspects of intonation, SpeechProsody
Scott Reid Moisik (2020), Modeling the influence of voice quality setting on segmental structure, SpeechProsody
Chia-Yuan Lin, Tamara Rathcke (2020), How to hit that beat: Testing acoustic anchors of rhythmic movement with speech, SpeechProsody
Leendert Plug, Robert Lennon, Rachel Smith (2020), Listeners’ sensitivity to syllable complexity in spontaneous speech tempo perception, SpeechProsody
Donna Erickson, Ting Huang, Caroline Menezes (2020), Temporal organization of spoken utterances from an articulatory point of view, SpeechProsody
Benjamin Schmeiser (2020), Prosodic and Segmental Effects on the Durational Variability of Svarabhakti Vowels in Spanish /Cr/ Clusters, SpeechProsody
Kenji Yoshida, Akira Utsugi, Jia Hui Wu, Tetsuo Nitta, Kiyoe Sakamoto, Yoko Ichimura (2020), Articulatory asymmetry in consonantal sequences: A case from English, Fukui Japanese and Chaozhou Chinese, SpeechProsody
Bagus Tris Atmaja, Masato Akagi (2020), The Effect of Silence Feature in Dimensional Speech Emotion Recognition, SpeechProsody
Hansjörg Mixdorff, Debashis Ghosh, Albert Rilliard, Angelika Hönemann (2020), Perception of Audio-visual Expressions in German and Cantonese by Native Speakers of Hindi, SpeechProsody
Motoko Ueyama, Xinyue Li (2020), An Acoustic Study of Emotional Speech Produced by Italian Learners of Japanese, SpeechProsody
Xinyue Li, Carlos Toshinori Ishi, Ryoko Hayashi (2020), Prosodic and Voice Quality Feature of Japanese Speech Conveying Attitudes: Mandarin Chinese Learners and Japanese Native Speakers, SpeechProsody
Catherine Mathon, Carolyn Fontagnol (2020), A cross-linguistics study on how emotion is perceived in sport commentaries: comparing prosodic cues from Japanese and French, SpeechProsody
Lucy Kind, Victoria Murphy (2020), ‘I think’ is what I mean: Prosody as a signifier of speaker attitude across cultures and communication contexts, SpeechProsody
Hironori Katsuda, Jeremy Steffman (2020), Intonational cues to prosodic boundary influence perception of contrastive vowel length in Tokyo Japanese, SpeechProsody
Misaki Kato, Shigeto Kawahara, Kaori Idemaru (2020), Speaking rate normalization across different talkers in the perception of Japanese stop and vowel length contrasts, SpeechProsody
Kimiko Tsukada, John Hajek (2020), Perception of consonant length in familiar and unfamiliar languages by native speakers of Mandarin, Italian and Japanese, SpeechProsody
Katri Hiovain, Atte Asikainen, Juraj Šimko (2020), The role of duration and pitch in signaling quantity in Finnmark North Sámi, SpeechProsody
Shu-Chen Ou, Zhe-Chen Guo (2020), The Opposite Effects of Vowel and Onset Consonant Lengthening on Speech Segmentation, SpeechProsody
Helen Türk, Pärtel Lippus, Karl Pajusalu, Pire Teras (2020), Temporal patterns of geminates in Inari Saami trisyllabic words, SpeechProsody
Davide Garassino, Francesco Cangemi (2020), "No duration without intonation": The interplay of lexical and post-lexical durational differences, SpeechProsody
Sviatlana Karpava (2020), Lexical stress assignment and reading skills of Russian heritage children, SpeechProsody
Yulia Zuban, Tamara Rathcke, Sabine Zerbian (2020), Intonation of yes-no questions by heritage speakers of Russian, SpeechProsody
Rachel Kan (2020), Suprasegmental and prosodic features contributing to perceived accent in heritage Cantonese, SpeechProsody
Chen Lan, Peggy Mok (2020), A preliminary study on Cantonese tone production by young heritage speakers, SpeechProsody
Fabian Schubö, Sabine Zerbian (2020), Phonetic content and phonological structure affect pre-boundary lengthening in German, SpeechProsody
Shinobu Mizuguchi, Koichi Tateishi (2020), Prominence Is Not Cued Only Acoustically, SpeechProsody
Michael Ashby (2020), The first spoken intonation corpus (1909): a re-assessment of Daniel Jones's 'Intonation Curves', SpeechProsody
Elisabeth Delais-Roussarie, Brechtje Post, Hiyon Yoo (2020), Prosodic Units and Intonational Grammar in French: towards a new Approach, SpeechProsody
Marlene Böttcher, Sabine Zerbian (2020), Stressed Pronouns in Spontaneous English, SpeechProsody
Danfeng Wu, Yadav Gowda (2020), Focus and penultimate vowel lengthening in Zulu, SpeechProsody
Heiko Seeliger, Sophie Repp (2020), Competing prominence requirements in verb-first exclamatives with contrastive and given information, SpeechProsody
Bistra Andreeva, Bernd Möbius, James Whang (2020), Effects of surprisal and boundary strength on phrase-final lengthening, SpeechProsody
Leonardo Lancia, Cristel Portes (2020), Perceptual dynamics in the processing of tonal alignment, SpeechProsody
David Le Gac, Sabrina Bendjaballah (2020), Preboundary lengthening in Somali, SpeechProsody
Stefan Baumann, Janina Kalbertodt, Jane Mertens (2020), The appropriateness of prenuclear accent types – Evidence for information structural effects, SpeechProsody
Tatiana Kachkovskaia, Pavel Skrelin (2020), Prosodic phrasing in Russian spontaneous and read speech: evidence from large speech corpora, SpeechProsody
Enkeleida Kapia, Felicitas Kleber, Jonathan Harrington (2020), An Autosegmental-Metrical Analysis of Rising Contours in Standard Albanian, SpeechProsody
Philippe Martin (2020), Dutch Sentence Intonation Revisited, SpeechProsody
Philippe Martin (2020), ToBI Representations in Intonational Phonology: Time for a (melodic) change?, SpeechProsody
Jonathan Barnes, Alejna Brugos, Stefanie Shattuck-Hufnagel, Nanette Veilleux (2020), How prosodic prominence influences fricative spectra in English, SpeechProsody
Jason Bishop, Darlene Intlekofer (2020), Lower Working Memory Capacity is Associated with Shorter Prosodic Phrases: Implications for Speech Production Planning, SpeechProsody
Sophie Herment, Anne Tortel, Laetitia Leonarduzzi (2020), The British English rising contour: an exception in read speech?, SpeechProsody
Jake Aziz (2020), Intonational Phonology of Malagasy: Pitch Accents Demarcate Syntactic Constituents, SpeechProsody
Caroline Crouch, Argyro Katsika, Ioana Chitoran (2020), The role of sonority profile and order of place of articulation on gestural overlap in Georgian, SpeechProsody
James Kirby, Felicitas Kleber, Jessica Siddins, Jonathan Harrington (2020), Effects of prosodic prominence on obstruent-intrinsic F0 and VOT in German, SpeechProsody
Karen Tsai, Argyro Katsika (2020), Pitch accent and phrase boundaries: Kinematic evidence from Japanese, SpeechProsody
Tracy Conner (2020), Questioning Questions: The Illusion of Variation in African American English Polar Question Intonation, SpeechProsody
Antoin Rodgers (2020), K-Max: a tool for estimating, analysing, and evaluating tonal targets, SpeechProsody
Constantijn Kaland, Nikolaus P. Himmelmann (2020), Time-series analysis of F0 in Papuan Malay contrastive focus, SpeechProsody
Pavel Duryagin (2020), On some factors affecting the choice of tune in Russian wh-questions, SpeechProsody
Danfeng Wu (2020), Durational cues to stress and phrasing are preserved post-focally in English, SpeechProsody
Chunyu Ge, Aijun Li (2020), Tonal variations in Honggu Chinese, SpeechProsody
Wenxi Fei, Mingyu Weng, Albert Lee (2020), Phonetic Realisation of Narrow Focus in Wu-Mandarin Bilinguals, SpeechProsody
Ping Cui, Jianjing Kuang, Yunjia Wang (2020), The Effect of Focus and Prosodic Boundary on the Two T3 sandhi in Northeastern Mandarin, SpeechProsody
Yike Yang, Si Chen (2020), Revisiting focus production in Mandarin Chinese: Some preliminary findings, SpeechProsody
Jia Tian, Jianjing Kuang (2020), The phonetic realization of contrastive focus in Shanghainese, SpeechProsody
Jiyoung Jang, Argyro Katsika (2020), The amount and scope of phrase-final lengthening in Korean, SpeechProsody
Argyro Katsika, Jiyoung Jang, Jelena Krivokapić, Louis Goldstein, Elliot Saltzman (2020), The role of focus in accentual lengthening in American English: Kinematic analyses, SpeechProsody
Arndt Riester, Tobias Schröer, Stefan Baumann (2020), On the Prosody of Contrastive Topics in German Interviews, SpeechProsody
Gaëlle Ferre, Amina Mettouchi (2020), A Cross-Linguistic Study of Open-Palm Hand Gestures and their Prosodic Correlates, SpeechProsody
Juliane T. Zimmermann, Simon Wehrle, Francesco Cangemi, Martine Grice, Kai Vogeley (2020), Listeners and Lookers: Using Pitch Height and Gaze Duration for Inferring Mental States, SpeechProsody
Joy Mills (2020), Delexicalised Auditory Priming of Implicit Prosody, SpeechProsody
Xifan Zhang, Ting Wang (2020), An Eye-tracking Study on Mandarin Tone Perception of Children, SpeechProsody
Pärtel Lippus, Kaidi Lõo (2020), Silent and oral sentence reading in Estonian: investigating the effect of phonetic quantity on eye movements, SpeechProsody
Carlos Ishi, Ryusuke Mikata, Hiroshi Ishiguro (2020), Analysis of the factors involved in person-directed pointing gestures in dialogue speech, SpeechProsody
Gilbert Ambrazaitis, Johan Frid, David House (2020), Word prominence ratings in Swedish television news readings – effects of pitch accents and head movements, SpeechProsody
Yshai Kalmanovitch (2020), Interlocutor-dependent intra-speaker speech rate variability in interaction: a pilot study on four conversations in modern Hebrew, SpeechProsody
Yi Liu, Jinghong Ning (2020), Attention Distribution and Integration of Non-native Segments and Tones by Early Multilingual Speakers, SpeechProsody
Liquan Liu, Varghese Peter, Jia Hoong Ong, Paola Escudero (2020), Revisiting infant distributional learning using event-related potentials: Does Unimodal always inhibit and Bimodal always facilitate?, SpeechProsody
Cristina Name, Juan Manuel Sosa (2020), The Prosody of Questions in Brazilian Portuguese Infant Directed Speech, SpeechProsody
Jill Thorson, Stefanie Shattuck-Hufnagel (2020), Phonological and Phonetic Realizations of Downstepping in Child Speech, SpeechProsody
Bettina Braun, Marieke Einfeldt, Gloria Esposito, Nicole Dehé (2020), The prosodic realization of rhetorical and information-questions in German spontaneous speech, SpeechProsody
Mengzhu Yan, Sasha Calhoun, Paul Warren (2020), Prosody or syntax? The perception of focus by Mandarin speakers, SpeechProsody
Jing Ji (2020), Syntax-prosody Interface in Perception of Right Dislocation in Mandarin, SpeechProsody
Amalia Arvaniti, Mary Baltazani, Stella Gryllia (2020), The contribution of pitch accents and boundary tones to intonation meaning, SpeechProsody
Riccardo Orrico, Mariapaola D'Imperio (2020), Tonal specification of speaker commitment in Salerno Italian wh-questions, SpeechProsody
Christine T. Röhr, Stefan Baumann, Petra B. Schumacher, Martine Grice (2020), Perceptual Prominence of Accent Types and the Role of Expectations, SpeechProsody
Hisao Tokizaki (2020), Verb-Second and Initial-Weak Prosody, SpeechProsody
Yu-Yin Hsu, Anqi Xu (2020), Wh-indeterminates and Prosody in Hong Kong Cantonese, SpeechProsody
Eva Liina Asu, Heete Sahkai, Pärtel Lippus (2020), The prosody of rhetorical and information-seeking questions in Estonian: preliminary results, SpeechProsody
Isabelle Franz, Markus Bader, Frank Domahs, Gerrit Kentner (2020), Influences of rhythm on word order in German, SpeechProsody
Katharina Zahner, Manluolan Xu, Yiya Chen, Nicole Dehé, Bettina Braun (2020), The prosodic marking of rhetorical questions in Standard Chinese, SpeechProsody
Margaret Zellers, Saudah Namyalo, Alena Witzlack-Makarevich (2020), Investigating relationships between intonational and syntactic phrasing in Ruruuli/Lunyala, SpeechProsody
Gouming Martens, Michael Wagner, Francisco Torreira (2020), Hat Contour in Dutch: Form and Function, SpeechProsody
Luma Miranda, João Moraes, Albert Rilliard (2020), Statistical modeling of prosodic contours of four speech acts in Brazilian Portuguese, SpeechProsody
Nelleke Jansen, Aoju Chen (2020), Prosodic encoding of sarcasm at the sentence level in Dutch, SpeechProsody
Bethany Sturman (2020), The sound of quotation marks: Prosodic characteristics of subclausal quotation in English, SpeechProsody
Emma Gibson, Francisco Torreira, Michael Wagner (2020), The High-fall Contour in North American English: A Case Study in Imperatives, SpeechProsody
Puyang Geng, Wentao Gu, Keith Johnson, Donna Erickson (2020), Acoustic-Prosodic and Articulatory Characteristics of the Mandarin Speech Conveying Dominance or Submissiveness, SpeechProsody
Roger Yu-Hsiang Lo, Angelika Kiss (2020), Durational and Pitch Marking of Rhetorical Wh-questions in Mandarin, SpeechProsody
Seung-Eun Kim, Sam Tilsen (2020), Speech rate and syntactically conditioned influences on prosodic boundaries, SpeechProsody
Anders Holmberg, Heete Sahkai, Anne Tamm (2020), Prosody distinguishes Estonian V2 from Finnish and Swedish, SpeechProsody
Daiki Hashimoto (2020), Pitch peak and word predictability: Results from CSJ corpus, SpeechProsody
Anders Eriksson, Juraj Šimko, Antti Suni, Martti Vainio, Rosalba Nodari (2020), Lexical stress perception as a function of acoustic properties and the native language of the listener, SpeechProsody
Constantijn Kaland, Vincent J. van Heuven (2020), Papuan Malay word stress reduces lexical alternatives, SpeechProsody
Zhen Qin, Caicai Zhang (2020), How sleep-mediated memory consolidation modulates the generalization across talkers: evidence from tone identification, SpeechProsody
Tatiana Kachkovskaia, Anna Mamushina, Alyona Portnova (2020), Typical and rare post-nuclear melodic movements in Russian, SpeechProsody
Y. Asano, H. Mitterer (2020), Word goodness affects the L1-dependent ability to store pitch contrasts, SpeechProsody
Jenny Yu, Robert Mailhammer, Anne Cutler (2020), Vocabulary structure affects word recognition: Evidence from German listeners, SpeechProsody
Xiaolin Li, Peggy Pik Ki Mok (2020), The acquisition of tone sandhi of the Xiamen dialect, SpeechProsody
Janice Wing Sze Wong, Takayuki Arai (2020), The effects of tonal experience on the categorization of Cantonese lexical tones into Japanese native pitch accent categories, SpeechProsody
Bin Li, Yihan Guan, Si Chen (2020), Carryover Effects on Tones in Hong Kong Cantonese, SpeechProsody
Hongwei Ding, Yiling Li (2020), Tonal Adaptation of Disyllabic Letter-Character Pattern in Mandarin Alphabetical Words, SpeechProsody
Weijun Zhang, Peggy Pik Ki Mok (2020), A Potential New Sound Change after Tonogenesis: A Preliminary Perceptual Study on the Tonal Contrast of Wenzhou Wu Chinese, SpeechProsody
Priti Raychoudhury, Shakuntala Mahanta (2020), The three way tonal system of Sylheti, SpeechProsody
Chenzi Xu (2020), Revisiting Neutral Tone in Mandarin Broadcast News Speech, SpeechProsody
Yue Chen, Yi Xu (2020), Intermediate features are not useful for tone perception, SpeechProsody
Hohsien Pan, Hsiaotung Huang (2020), Lexical Propensity and Taiwanese Min Tone sandhi Rules, SpeechProsody
Seoyoung Kim, Claudia Matachana, Alex Nyman, Kristine Yu (2020), Creak in the phonetic space of low tones in Beijing Mandarin, Cantonese, and White Hmong, SpeechProsody
Alexander Aldrich (2020), Adult Early-Bilingual Speech Rhythm: Evidence from Spanish and English, SpeechProsody
Hyunsong Chung (2020), Rhythm of East-Asian Speakers of English in English Conversation, SpeechProsody
Yiran Ding, Wang Dai, Kaiqi Fu, Yanlu Xie, Jinsong Zhang (2020), A comparative study of rhythmic patterns in non-native Mandarin speech by Russian, Japanese and Vietnamese learners, SpeechProsody
Farhat Jabeen, Elisabeth Delais-Roussarie (2020), The Accentual Phrase in Urdu/Hindi: A prosodic unit at the interplay between rhythm and intonation, SpeechProsody
Wai Ling Law, Olga Dmitrieva, Alexander Francis (2020), Convergence of L1 and L2 speech rhythm in Cantonese-English bilingual speakers, SpeechProsody
Sumio Kobayashi, Amalia Arvaniti (2020), Linguistic experience and rhythm perception, SpeechProsody
Ian Wilson, Donna Erickson, Tim Vance, Jeff Moore (2020), Jaw dancing American style: A way to teach English rhythm, SpeechProsody
Leena Dihingia, Priyankoo Sarmah (2020), Rhythm and Speaking Rate in Assamese Varieties, SpeechProsody
Marie-Anne Morand, Melissa Bruno, Nora Julmi, Sandra Schwab, Stephan Schmid (2020), Speech rhythm in multiethnolectal Zurich German, SpeechProsody
Francesco Burroni, Sam Tilsen (2020), Prominence clash does not induce rhythmic adjustments in Italian, SpeechProsody
Isadora Reynolds, Olga Maxwell, Gillian Wigglesworth (2020), The “other” Spanish: Methodological issues in the study of speech timing in Chilean Spanish, SpeechProsody
Luke Horo, Priyankoo Sarmah (2020), Rhythm in Sora Trilingual Readers, SpeechProsody
Natalie Boll-Avetisyan, Paul Okyere Omane, Frank Kügler (2020), Speech rhythm in Ghanaian languages: The cases of Akan, Ewe and Ghanaian English, SpeechProsody
Noam Amir, Sharon Bolle Fridman, Ortal Shakeman, Nofar Shuli, Avi Karni (2020), Do Musicians Speak Differently? Preliminary Results of a Production Study, SpeechProsody
Cong Zhang, Xinrong Wang (2020), Segment Duration and Proportion in Mandarin Singing, SpeechProsody
Alexsandro Meireles, Hansjörg Mixdorff (2020), Voice Quality in Low and High Registers in Two Different Styles of Singing, SpeechProsody
Eran Raveh, Maya Twig, Bernd Möbius, Oded Zehavi (2020), Prosodic Alignments in Shadowed Singing of Familiar and Novel Music, SpeechProsody
Li-Hsin Ning (2020), Musical Memory and Pitch Discrimination Abilities as Correlates of Vocal Pitch Control for Speakers with Different Tone and Musical Experiences, SpeechProsody
Chawadon Ketkaew (2020), A Comparison Between Speech and Musical Rhythms: A Case Study of Folk Music in Standard and Northern Thai, SpeechProsody
Urban Bruno Zihlmann (2020), Temporal variability in four Alemannic dialects and its influence on the respective varieties of Swiss Standard German, SpeechProsody
Sofoklis Kakouros, Katri Hiovain, Martti Vainio, Juraj Šimko (2020), Dialect Identification of Spoken North Sámi Language Varieties Using Prosodic Features, SpeechProsody
Zixin Qin, Yi Xu (2020), Lack of Prosodic Focus in Chongqing Dialect and Possible Historical Sources, SpeechProsody
Leah Bradshaw, Vincent Hughes, Eleanor Chodroff (2020), Investigating the Forensic Applications of Global and Local Temporal Representations of Speech for Dialect Discrimination, SpeechProsody
Min Liu, Yiya Chen (2020), The Roles of Segment and Tone in Bi-dialectal Auditory Word Recognition, SpeechProsody
Aijun Li, Xiaoyan Zhang, Zhiqiang Li (2020), Acoustic and Phonological Analyses of Tones in Taifeng Chinese, SpeechProsody
Jörg Peters, Marina Frank, Marina Rohloff (2020), Pitch Range Variation in High German (L1) and Low German (L2), SpeechProsody
Li-Fang Lai, Janet van Hell (2020), Intonation and Voice Quality of Northern Appalachian English: A First Look, SpeechProsody
Nigel Ward, James Jodoin, Anindita Nath, Olac Fuentes (2020), Using Prosody to Find Mentions of Urgent Problems in Radio Broadcasts, SpeechProsody
Miki Ikoma (2020), Prosodic and Phonetic Aspects of Paralinguistic Utterances with the German Modal Particle schon in L1 and L2, SpeechProsody
Nicole Holliday, Jason Bishop, Grace Kuo (2020), Prosody and Political Style: The Case of Barack Obama and the L+H* Pitch Accent, SpeechProsody
Christine Prechtel (2020), Macro-rhythm in English and Spanish: Evidence from Radio Newscaster Speech, SpeechProsody
Shizuka Nakamura, Carlos Toshinori Ishi, Tatsuya Kawahara (2020), Analysis and modeling of between-sentence pauses in news speech by Japanese newscasters, SpeechProsody
Zixiaofan Yang, Jessica Huynh, Riku Tabata, Nishmar Cestero, Tomer Aharoni, Julia Hirschberg (2020), What Makes a Speaker Charismatic? Producing and Perceiving Charismatic Speech, SpeechProsody
Hussein Hussein, Burkhard Meyer-Sickendiek, Timo Baumann (2020), Free Verse and Beyond: How to Classify Post-modern Spoken Poetry, SpeechProsody
Jan Volín, Radek Skarnitzl (2020), Accent-Groups vs. Stress-Groups in Czech Clear and Conversational Speech, SpeechProsody
Jan Michalsky, Oliver Niebuhr, Lars Penke (2020), Do charismatic people produce charismatic speech? On the relationship between the Big Five personality traits and prosodic features of speaker charisma in female speakers, SpeechProsody
George Christodoulides (2020), Speaking Style Prosodic Variation and the Prosody-Syntax Interface: A Large-Scale Corpus Study, SpeechProsody
Jana Neitsch, Oliver Niebuhr (2020), On the role of prosody in the production and evaluation of German hate speech, SpeechProsody
Ambre Davat, Véronique Aubergé, Gang Feng (2020), Can we hear physical and social space together through prosody?, SpeechProsody
Donna Erickson, Shigeto Kawahara, Albert Rilliard, Ryoko Hayashi, Toshiyuki Sadanobu, Yongwei Li, Hayato Daikuhara, João de Moraes, Kerrie Obert (2020), Cross cultural differences in arousal and valence perceptions of voice quality, SpeechProsody
Céleste Guillemot, Shin-ichiro Sano (2020), Gender- and register-biased patterns in f0 use: How does prosody contribute to (in)formality in Japanese?, SpeechProsody
Mary Baltazani, Joanna Przedlacka, John Coleman (2020), Intonation of Greek–Turkish contact: a real-time diachronic study, SpeechProsody
Y. Asano, C. Yuan, A.-K. Grohe, A. Weber, M. Antoniou, A. Cutler (2020), Uptalk interpretation as a function of listening experience, SpeechProsody
Maia Duguine, Aritz Irurtzun (2020), Prosody and Language Contact: An Experimental Investigation of Interrogative Strategies in Navarro-Labourdin Basque, SpeechProsody
Clément Le Moine, Nicolas Obin (2020), Att-HACK: An Expressive Speech Database with Social Attitudes, SpeechProsody
James S. German, Cristel Portes (2020), Continental French, Corsican French, and the interpretation of intonation: The effect of implicit social cues depends on exposure, SpeechProsody
Vered Silber-Varod, Daphna Amit, Anat Lerner (2020), Tracing changes over the course of the conversation: A case study on filled pauses rates, SpeechProsody
Hong Zhang (2020), The Prosody of Fluent Repetitions in Spontaneous Speech, SpeechProsody
Nigel Ward (2020), Ten Prosodic Patterns of Turn-Taking in Japanese Conversation, SpeechProsody
Heike Lehnert-Lehouillier, Susana Terrazas, Steven Sandoval, Rachel Boren (2020), The Relationship between Prosodic Ability and Conversational Prosodic Entrainment, SpeechProsody
Ralph Rose (2020), Fluidity: Real-time Feedback on Acoustic Measures of Second Language Speech Fluency, SpeechProsody
Kätlin Aare, Emer Gilmartin, Marcin Wlodarczak, Pärtel Lippus, Mattias Heldner (2020), Breath Holds in Chat and Chunk Phases of Multiparty Casual Conversation, SpeechProsody
Mariko Kondo, Lionel Fontan, Maxime Le Coz, Takayuki Konishi, Sylvain Detey (2020), Phonetic fluency of Japanese learners of English: automatic vs native and non-native assessment, SpeechProsody
Juergen Trouvain, Raphael Werner, Bernd Möbius (2020), An Acoustic Analysis of Inbreath Noises in Read and Spontaneous Speech, SpeechProsody
Yiyuan Liao, Jue Yu (2020), Discourse Planning Strategies in Chinese L2 English Speech: Chunking Preference and Disfluency Attributes, SpeechProsody
Emer Gilmartin, Kätlin Aare, Maria O'Reilly, Marcin Wlodarczak (2020), Between and Within Speaker Transitions in Multiparty Conversation, SpeechProsody
Yanan Shen, Zulayati Wufuer, Yujun Ren, Ping Tang, Nan Xu Rattanasone, Ivan Yuen, Katherine Demuth (2020), The production of Mandarin tones by early-implanted children with cochlear implants: effects from the length of implantation, SpeechProsody
Simon Wehrle, Francesco Cangemi, Harriet Hanekamp, Kai Vogeley, Martine Grice (2020), Assessing the Intonation Style of Speakers with Autism Spectrum Disorder, SpeechProsody
Sónia Frota, Jovana Pejovic, Cátia Severino, Marina Vigário (2020), Looking for the edge: emerging segmentation abilities in atypical development, SpeechProsody
Hao Zhang, Jing Zhang, Hongwei Ding, Yongqin Li (2020), Efficacy of Multi-Talker Phonetic Training in Mandarin Tone Perception for Native Pediatric Cochlear Implant Users, SpeechProsody
Yu-xin Lin, Jue Yu, Yiyuan Liao (2020), Acoustic Attributes of Mandarin Retroflex Vowel “ɚ” Produced by Prelingually Deaf Children with Cochlear Implants, SpeechProsody
Barbara Gili Fivela, Sonia Immacolata d'Apolito, Giorgia Di Prizio (2020), Labialization and Prosodic Modulation in Italian Dysarthric Speech by Parkinsonian Speakers: A Preliminary Investigation, SpeechProsody
Hsueh Chu Chen, Jingxuan Tian (2020), The Effects of Explicit Rule and Acoustic-perceptual Instructions on Chinese ESL Learners’ Prosodic Acquisition of English Lexical Stress, SpeechProsody
Tomoko Hori, Michiko Toyama, Mari Akatsuka (2020), Perception of English Intonation by Japanese Learners of English, SpeechProsody
Albert Lee, Yi Xu (2020), Focus prosody in Japanese-English early bilinguals: A pilot study, SpeechProsody
Cristina Herrero, Empar Devís (2020), Unintentional impolite intonation in L2 Spanish requests produced by Chinese workers living in Madrid, SpeechProsody
Zenghui Liu, Shufen Liang, Lei Zeng (2020), The Development of Prosodic Focus-marking and Declarative Question Intonation in Thai Learners’ Mandarin, SpeechProsody
Aaron Albin, Ruilai Wang (2020), When does intonational transfer occur? A comparative study of interrogative rises in four groups of L2 Japanese learners, SpeechProsody
Sally Chen, Janice Fon (2020), On Rhythm, Prosodic Grouping, and Declination Pattern of Taiwan Mandarin Learners of English, SpeechProsody
Chie Nakamura, Jesse Harris, Sun-Ah Jun (2020), Learning to Anticipate Contrast with Prosody: A Visual World Study with L2 Learners, SpeechProsody
Michiko Mochizuki Sudo, Takayuki Kagomiya, Tomoko Hori (2020), Present state analysis and measurement of pronunciation training effectiveness in English acquisition: Relationships between production patterns and English proficiencies, SpeechProsody
Aijun Li, Xinyuan Wan, Chenyang Zhao, Lin Zhu (2020), Phonological and Phonetic Realization of Narrow Focus in Declarative Sentences by Jinan L2 English Learners, SpeechProsody
Katrin Leppik, Pärtel Lippus, Eva Liina Asu (2020), The production of Estonian quantity degrees by Spanish L1 speakers, SpeechProsody
Sherry Chien, Janice Fon (2020), On the Learnability of Nuclear and Prenuclear Accents ― Using Taiwan Mandarin Learners of English as an Example, SpeechProsody
Rachel Albar, Hiyon Yoo (2020), The production of French continuation contours at different prosodic boundaries by Japanese learners, SpeechProsody
Lei Xi, Sandrine Wachs, Rachid Ridouane (2020), Production of French final stressed syllables in Accentual Phrase by Chinese learners: A pilot study, SpeechProsody
Kiyoko Yoneyama, Mafuyu Kitahara, Keiichi Tajima (2020), Effects of Japanese Prosody on English Word Production: Interaction between Voicing and Gemination, SpeechProsody
Mariko Sugahara (2020), Lexical Stress Assignment to Base, Inflected and Derived Words in English by Japanese and Seoul Korean Learners of English, SpeechProsody
Juraj Šimko, Martti Vainio, Antti Suni (2020), Analysis of speech prosody using WaveNet embeddings: The Lombard effect, SpeechProsody
Gerardo Cervantes, Nigel Ward (2020), Using Prosody to Spot Location Mentions, SpeechProsody
Takuya Ozuru, Yusuke Ijima, Daisuke Saito, Nobuaki Minematsu (2020), Are you professional?: Analysis of prosodic features between a newscaster and amateur speakers through partial substitution by DNN-TTS, SpeechProsody
Lionel Fontan, Maxime Le Coz, Charlotte Alazard (2020), Using the forward-backward divergence segmentation algorithm and a neural network to predict L2 speech fluency, SpeechProsody
Jian Zhu (2020), Probing the phonetic and phonological knowledge of tones in Mandarin TTS models, SpeechProsody
Aghilas Sini, Sébastien Le Maguer, Damien Lolive, Elisabeth Delais-Roussarie (2020), Introducing Prosodic Speaker Identity for a Better Expressive Speech Synthesis Control, SpeechProsody
Antti Suni, Sofoklis Kakouros, Martti Vainio, Juraj Šimko (2020), Prosodic Prominence and Boundaries in Sequence-to-Sequence Speech Synthesis, SpeechProsody
Vincent Wan, Jonathan Shen, Hanna Siilen, Rob Clark (2020), Modelling Intonation in Spectrograms for Neural Vocoder based Text-to-Speech, SpeechProsody
Andy Murphy, Irena Yanushevskaya, Ailbhe Ní Chasaide, Christer Gobl (2020), Testing the GlórCáil System in a Speaker and Affect Voice Transformation Task, SpeechProsody
Meysam Shamsi, Jonathan Chevelu, Nelly Barbot, Damien Lolive (2020), Corpus design for expressive speech: impact of the utterance length, SpeechProsody
Wei Zhang, Yanlu Xie, Jinsong Zhang (2020), Physiological pitch range estimation from a brief speech input: A study on a bilingual parallel speech corpus, SpeechProsody
Zack Hodari, Catherine Lai, Simon King (2020), Perception of prosodic variation for speech synthesis using an unsupervised discrete representation of F0, SpeechProsody
Timo Baumann (2020), How a Listener Influences the Speaker, SpeechProsody
Nobukatsu Hojo, Yusuke Ijima, Hiroaki Sugiyama, Noboru Miyazaki, Takahito Kawanishi, Kunio Kashino (2020), DNN-based Speech Synthesis considering Dialogue-Act Information and its Evaluation with Respect to Illocutionary Act Naturalness, SpeechProsody
Kohei Kitamura, Tsuneo Kato, Seiichi Yamamoto (2020), Tree-based Clustering of Vowel Duration Ratio Toward Dictionary-based Automatic Assessment of Prosody in L2 English Word Utterances, SpeechProsody
Vincent Martin, Gabrielle Chapouthier, Mathilde Rieant, Jean-Luc Rouas, Pierre Philip (2020), Using reading mistakes as features for sleepiness detection in speech, SpeechProsody
Barbara Schuppler, Bogdan Ludusan (2020), An analysis of prosodic boundary detection in German and Austrian German read speech, SpeechProsody
Bogdan Ludusan, Petra Wagner (2020), Speech, laughter and everything in between: A modulation spectrum-based analysis, SpeechProsody
Julian Linke, Anneliese Kelterer, Markus A. Dabrowski, Dina El Zarka, Barbara Schuppler (2020), Towards automatic annotation of prosodic prominence levels in Austrian German, SpeechProsody
Parismita Gogoi, Moakala Tzudir, Priyankoo Sarmah, S. R. M. Prasanna (2020), Automatic Tone Recognition of Ao Language, SpeechProsody
Jean-Philippe Goldman, Anne Catherine Simon (2020), ProsoBox, a Praat Plugin for Analysing Prosody, SpeechProsody
Hussein Ghaly, Michael Mandel (2020), Using Prosody to Improve Dependency Parsing, SpeechProsody
Hee Hwang, Kristine Yu (2020), Word-based Neural Prosody Modeling with ToBI, SpeechProsody
Jeremy Peckham (1995), Conversational interaction: breaking the usability barrier, SDS
Sadaoki Furui (1995), Prospects for spoken dialogue systems in a multimedia environment, SDS
Lori F. Lamel, S. K. Bennacef, H. Bonneau-Maynard, S. Rosset, Jean-Luc Gauvain (1995), Recent developments in spoken language systems for information retrieval, SDS
Tatsuya Kawahara, Masahiro Araki, Shuji Doshita (1995), Comparison of parsing and spotting approaches for spoken dialogue understanding, SDS
Masaru Hidano, Toshihiko Itoh, Mikio Yamamoto, Seiichi Nakagawa (1995), Spontaneous speech understanding for a dialogue system, SDS
Sheryl R. Young, Wayne H. Ward (1995), The role of higher-level semantic, pragmatic and discourse knowledge in recognizing and understanding new spoken words and phrases, SDS
Norbert Reithinger, Elisabeth Maier, Jan Alexandersson (1995), Treatment of incomplete dialogues in a speech-to-speech translation system, SDS
Latifa Taleb, Daniel Luzzati (1995), Finalized spoken dialogue modelling based on communication failure, SDS
Toshiaki Iwadera, Masato Ishizaki, Tsuyoshi Morimoto (1995), Recognizing an interactional structure and topics of task-oriented dialogues, SDS
Stuart Bird, Sue Browning, Roger Moore, Martin Russell (1995), Dialogue move recognition using topic spotting techniques, SDS
Isabel Trancoso, Carlos Ribeiro, Ricardo Rodrigues, Miguel Rosa (1995), Issues in speech recognition applied to directory listing retrieval, SDS
I. Lewin, S. G. Pulman (1995), Inference in the resolution of ellipsis, SDS
Gerhard Hanrieder, Günther Görz (1995), Robust parsing of spoken dialogue using contextual knowledge and recognition probabilities, SDS
Nadia Bellalem, Laurent Romary, Daniel Schang (1995), Which representation for a proper treatment of referring expressions in a man-machine multimodal dialogue, SDS
K. Pepelnjak, Jerneja Gros, F. Mihelic, N. Pavesic (1995), Linguistic analysis in a Slovenian information retrieval system for flight services, SDS
Laurel Fais, Kyung-ho Loken-Kim (1995), Lexical accommodation in human-interpreted and machine-interpreted dual language interactions, SDS
Yumi Wakita, Harald Singer, Yoshinori Sagisaka (1995), Phoneme candidate re-entry modeling using recognition error characteristics over multiple HMM states, SDS
Mauro Cettolo, Anna Corazza, Renato De Mori (1995), Automatic learning of sentence dependencies in spoken dialogues, SDS
Johan Bertenstam, Jonas Beskow, Mats Blomberg, Rolf Carlson, Kjell Elenius, Björn Granström, Joakim Gustafson, Sheri Hunnicutt, Jesper Högberg, Roger Lindell, Lennart Neovius, Lennart Nord, Antonio de Serpa-Leitao, Nikko Ström (1995), The Waxholm system - a progress report, SDS
Roberto Pieraccini, Esther Levin (1995), A spontaneous-speech understanding system for database query applications, SDS
Anders Baekgaard, Niels Ole Bernsen, T. Brøndsted, Paul Dalsgaard, Hans Dybkjær, Laila Dybkjær, J. Kristiansen, Lars Bo Larsen, Børge Lindberg, B. Maegaard, B. Music, L. Offersgaard, C. Povlsen (1995), The Danish spoken language dialogue project - a general overview, SDS
Laila Dybkjær, Niels Ole Bernsen, Hans Dybkjær (1995), Scenario design for spoken language dialogue systems development, SDS
Brian Mellor, Cian O'Connor (1995), User adaptation to voice input interfaces, SDS
Laurent Chapelier, Christine Fay-Varnier, Azim Roussanaly (1995), Modelling an intelligent help system from a Wizard of Oz experiment, SDS
Anders Baekgaard (1995), A platform for spoken dialogue systems, SDS
J. Eugene Ball, Daniel T. Ling (1995), Spoken language processing in the persona conversational assistant, SDS
Steve Whittaker, David Attwater (1995), Advanced speech applications - the integration of speech technology into complex services, SDS
Brian Mellor, Mike Tomlinson, Nick Coleman (1995), The generic user interface design environment (GUIDE) - overview and features, SDS
Harald Aust, Martin Oerder (1995), Dialogue control in automatic inquiry systems, SDS
Jørn Stern Nielsen (1995), Using automatic speech recognition in supplementary services - experiences and results from an extensive test, SDS
Masaki Naito, Shingo Kuroiwa, Kazuya Takeda, Seiichi Yamamoto, Fumihiro Yato (1995), A real-time speech dialogue system for a voice activated telephone extension service, SDS
Gilles Souvay, Jean-Marie Pierrel (1995), DIAPASON - a development environment for the integration of an oral input in machine control applications, SDS
Rolf Carlson, Sheri Hunnicutt, Joakim Gustafson (1995), Dialog management in the Waxholm system, SDS
Mark Fanty, Stephen Sutton, David G. Novick, Ronald Cole (1995), Automated appointment scheduling, SDS
M. D. Sadek, P. Bretier, V. Cadoret, A. Cozannet, P. Dupont, A. Ferrieux, F. Panaget (1995), A cooperative spoken dialogue system based on a rational agent model: a first implementation on the AGS application, SDS
F. R. McInnes, L. S. White, J. C. Foster, Mervyn A. Jack (1995), An automated style checker for human-computer dialogue engineering, SDS
Marc Guyomard, Didier Le Meur, Sébastien Poignonnec, Jacques Siroux (1995), Experimental work for the dual usage of voice and touch screen for a cartographic application, SDS
Norman M. Fraser (1995), Quality standards for spoken language dialogue systems: a report on progress in EAGLES, SDS
Y. Bellik, S. Ferrari, Françoise Néel, D. Teil (1995), Requirements for multimodal dialogue including vocal interaction, SDS
Peter Wyard, Steve Appleby, Ed Kaneen, Sandra Williams, Keith Preston (1995), A combined speech and visual interface to the BT business catalogue, SDS
Katunobu Itou, Osamu Hasegawa, Takio Kurita, Satoru Hayamizu, Kazuyo Tanaka, Kazuhiko Yamamoto, Nobuyuki Otsu (1995), An active multimodal interaction system, SDS
Allen L. Gorin (1995), Spoken dialog as a feedback control system, SDS
Masahiro Araki, Tatsuya Kawahara, Shuji Doshita (1995), Cooperative spoken dialogue model using Bayesian network and event hierarchy, SDS
Susann LuperFoy (1995), Implementing file change semantics for spoken-language dialogue managers, SDS
Elisabetta Gerbino, Paolo Baggia, Egidio Giachin, Claudio Rullent (1995), Analysis and evaluation of spontaneous speech utterances in focused dialogue contexts, SDS
Julia Hirschberg, Christine H. Nakatani, Barbara J. Grosz (1995), Conveying discourse structure through intonation variation, SDS
Wieland Eckert, Elmar Nöth, Heinrich Niemann, Ernst-Günter Schukat-Talamazzini (1995), Real users behave weird - experiences made collecting large human-machine-dialog corpora, SDS
Alan W. Black, Nick Campbell (1995), Predicting the intonation of discourse segments from examples in dialogue speech, SDS
Gösta Bruce, Björn Granström, Kjell Gustafson, Merle Horne, David House, Paul Touati (1995), Towards an enhanced prosodic model adapted to dialogue applications, SDS
Marc Swerts, Mari Ostendorf (1995), Discourse prosody in human-machine interactions, SDS
K. S. Hone, C. Baber (1995), Using a simulation method to predict the transaction time effects of applying alternative levels of constraint to user utterances within speech interactive dialogues, SDS
Kazuhiro Arai, Osamu Yoshioka, Shigeki Sagayama, Noboru Sugamura (1995), A prototype of an address input system with speech recognition, SDS
Hans G. Tillmann, Bernd Tischer (1995), Collection and exploitation of spontaneous speech produced in negotiation dialogues, SDS
Mark Tatham, Katherine Morton (1995), Speech synthesis in dialogue systems, SDS
John Bateman, Eli Hagen, Adelheit Stein (1995), Dialogue modeling for speech generation in multimodal information systems, SDS
Robert I. Damper, M. A. Tranchant, S. D. Wood (1995), Speech versus keying in the human-computer interface, SDS
Keikichi Hirose, Toru Senoo (1995), A method of generating speech reply with elliptical expressions and prosodic emphases, SDS
S. K. Bennacef, Françoise Néel, H. B. Maynard (1995), An oral dialogue model based on speech acts categorization, SDS
P. C. Woodland, D. Povey (2000), Large scale discriminative training for speech recognition, ASR
Xavier L. Aubert (2000), A brief overview of decoding techniques for large vocabulary continuous speech recognition, ASR
Mehryar Mohri, Fernando Pereira, Michael Riley (2000), Weighted finite-state transducers in speech recognition, ASR
Mukund Padmanabhan, Michael Picheny (2000), Towards super-human speech recognition, ASR
Michael H. Cohen (2000), Surfing the voice web: issues in the design of a voice browser, ASR
Katrin Kirchhoff, Jeff A. Bilmes (2000), Combination and joint training of acoustic classifiers for speech recognition, ASR
Jan Stadermann, Jörg Rottland, Gerhard Rigoll (2000), Tied-Posteriors: A new hybrid speech recognition technology with generic capabilities and high portability, ASR
Peter Stubley (2000), The block-synchronous search algorithm, ASR
Diamantino Caseiro, Isabel Trancoso (2000), A decoder for finite-state structured search spaces, ASR
Stephan Kanthak, Achim Sixtus, Sirko Molau, Hermann Ney (2000), Within-word vs. across-word decoding for online speech recognition, ASR
Holger Schwenk, Jean-Luc Gauvain (2000), Improved ROVER using language model information, ASR
I. Potamitis, Nikos Fakotakis, George Kokkinakis (2000), Reliable ASR based on unreliable features, ASR
Filipp Korkmazskiy, Frank K. Soong, Olivier Siohan (2000), Constrained spectrum normalization for robust speech recognition in noise, ASR
Florian Hilger, Hermann Ney (2000), Noise level normalization and reference adaptation for robust speech recognition, ASR
Sadaoki Furui, Daisuke Itoh (2000), Noise adaptation of HMMs using neural networks, ASR
G. F. Meyer, B. A. Edmonds, D. Yang, William A. Ainsworth (2000), Amplitude modulation maps for robust speech recognition, ASR
Astrid Hagen, Andrew Morris, Hervé Bourlard (2000), From multi-band full combination to multi-stream full combination processing in robust ASR, ASR
Hans-Günter Hirsch, David Pearce (2000), The AURORA experimental framework for the performance evaluation of speech recognition systems under noisy conditions, ASR
A. Varona, I. Torres (2000), Delimited smoothing technique over pruned and not pruned syntactic language models: perplexity and WER, ASR
Harry Printz, Peder Olsen (2000), Theory and practice of acoustic confusability, ASR
Ingunn Amdal, Filipp Korkmazskiy, Arun C. Surendran (2000), Data-driven pronunciation modelling for non-native speakers using association strength between phones, ASR
Yuqing Gao, Bhuvana Ramabhadran, Michael Picheny (2000), New adaptation techniques for large vocabulary continuous speech recognition, ASR
Patrick Kenny, Gilles Boulianne, Pierre Dumouchel (2000), Bayesian adaptation revisited, ASR
Olivier Siohan, Tor André Myrvoll, Chin-Hui Lee (2000), Structural maximum a posteriori linear regression for fast HMM adaptation, ASR
Mukund Padmanabhan, George Saon, Geoffrey Zweig (2000), Lattice-based unsupervised MLLR for speaker adaptation, ASR
Matt Richardson, Jeff Bilmes, Chris Diorio (2000), Hidden-articulator Markov models for speech recognition, ASR
Gang Peng, Bo Zhang, William S-Y. Wang (2000), Performance of Mandarin connected digit recognizer with word duration modeling, ASR
Jing Zheng, Horacio Franco, Andreas Stolcke (2000), Rate-of-speech modeling for large vocabulary conversational speech recognition, ASR
Lori Lamel, Jean-Luc Gauvain, Gilles Adda (2000), Lightly supervised acoustic model training, ASR
Anne-Katrin Kienappel, Dieter Geller, Rolf Bippus (2000), Cross-language transfer of multilingual phoneme models, ASR
Steven Greenberg, Shuangyu Chang (2000), Linguistic dissection of Switchboard-corpus automatic speech recognition systems, ASR
Delphine Charlet (2000), Optimizing confidence measure based on HMM acoustical rescoring, ASR
Silke Goronzy, Krzysztof Marasek, Andreas Haag, Ralf Kompe (2000), Prosodically motivated features for confidence measures, ASR
Timothy J. Hazen, Theresa Burianek, Joseph Polifroni, Stephanie Seneff (2000), Recognition confidence scoring for use in speech understanding systems, ASR
Mauro Cettolo, Marcello Federico (2000), Model Selection Criteria for Acoustic Segmentation, ASR
Yoshihiko Gotoh, Steve Renals (2000), Sentence boundary detection in broadcast speech transcripts, ASR
Mats Ljungqvist (2000), Human language technologies in European Community Research Programmes, current state and future perspectives, ASR
James D. Bass (2000), Breaking the local optima paradigm: DARPA speech research initiatives in multi-modal and other technologies, ASR
Sadaoki Furui, Kikuo Maekawa, Hitoshi Isahara (2000), A Japanese national project on spontaneous speech corpus and processing technology, ASR
Lou Boves, Denis Jouvet, Juergen Sienel, Renato de Mori, Frédéric Béchet, Luciano Fissore, Pietro Laface (2000), ASR for automatic directory assistance: The SMADA project, ASR
Sharon L. Oviatt (1991), Toward multimodal support of interpreted telephone dialogues, SMMD
Harry Bunt (1991), Dynamic interpretation and dialogue performance, SMMD
D. Luzzati (1991), A dynamic dialog model for human machine communication, SMMD
Pierre Falzon (1991), Multi-modal interactions in MMI2 design dialogues, SMMD
Roger K. Moore, Mike J. Tomlinson (1991), Whither the wizard?, SMMD
M. D. Sadek (1991), Dialogue acts are rational plans, SMMD
Marc Guyomard (1991), Very indirect speech acts or how to keep up appearances, SMMD
Anne Vilnat, Lydia Nicaud (1991), Dialogue handling in a written and/or spoken application: STANDIA, SMMD
Tom Wachtel (1991), Uncertainty, multiple modes and multiple sources, SMMD
Sheryl Young (1991), Using semantics to correct parser output for ATIS utterances, SMMD
Eric Bilange (1991), An approach to oral dialogue modelling, SMMD
K. Matrouf, Françoise Néel (1991), Use of upper level knowledge to improve human-machine interaction, SMMD
Frédéric Gavignet, Marc Guyomard, Jacques Siroux (1991), Implementing an oral and graphic multimodal application: the georal project, SMMD
Marie-Françoise Castaing, Françoise Néel (1991), Human factors in speech processing systems: a laboratory study, SMMD
Jean-Claude Junqua (1991), Robustness and cooperative multimodal man-machine communication applications, SMMD
Katherine Morton (1991), Natural-sounding voice output for dialogue systems, SMMD
M. M. Taylor (1991), Multiplexing, diviplexing, and the control of multimodal dialogue, SMMD
Robbert-Jan Beun (1991), A framework for cooperative dialogues, SMMD
A. Murray (1991), Speech interfaces for form-filling tasks: task structure as a constraint on set-switching procedures and system prompts, SMMD
Sylvia Candelaria de Ram (1991), Why to enter into dialogue is to come out with changed speech: cross-linked modalities, emotion, and language shift, SMMD
Jacques Siroux (1991), Time management in multimodal systems, SMMD
Sylvia Candelaria de Ram (1991), Cognitive integration of multi-modal comprehension and response in discourse: dialogue pragmasemantics proposed, SMMD
D. G. Beroule (1991), The adaptive, dynamic and associative memory model: an actual present tool for vocal human-computer communication, SMMD
William Edmondson (1991), A taxonomy of interaction style: towards a theory of multimodal interaction, SMMD
N. Michael Brooke (1991), Processing facial images to enhance speech communication, SMMD
R. M. Taylor, S. J. Selcon (1991), Multiple information sources; a cognitive integrality model, SMMD
Bertrand Gaiffe, Laurent Romary, Jean-Marie Pierrel (1991), Referring in a multimodal environment: from NL to designation, SMMD
John Lee (1991), Graphics and natural language in multi-modal dialogues, SMMD
Christian Benoît (1991), A promising challenge for bimodal machine-man communication: the SYNTHESIS of TALKING FACES, SMMD
Jean Caelen (1991), Multimodal interaction: event management and experiments with ICPdraw, SMMD
Daniel Teil, Yacine Bellik (1991), Multimodal dialog interface on a PC-like work station, SMMD
Jean-Marie Condom, Andre Lozes, Michel Courdesses (1991), Pragmatic aspects in a multimodal dialogue between an operator and a robot, SMMD
Jean-Noel Perbet, Jean-Jacques Favot, Bruno Barbier (1991), Interactive display concept for the next generation cockpit, SMMD
Daniel Teil, Olivier Da Silva (1991), Gesture recognition using a data glove input device, SMMD
Chai Wutiwiwatchai (2012), Toward universal network-based speech translation, IWSLT
Dong Yu (2012), Who can understand your speech better - deep neural network or Gaussian mixture model?, IWSLT
Hideki Isozaki (2012), Head finalization: translation from SVO to SOV, IWSLT
Marcello Federico, Mauro Cettolo, Luisa Bentivogli, Michael Paul, Sebastian Stüker (2012), Overview of the IWSLT 2012 evaluation campaign, IWSLT
Hitoshi Yamamoto, Youzheng Wu, Chien-Lin Huang, Xugang Lu, Paul R. Dixon, Shigeki Matsuda, Chiori Hori, Hideki Kashioka (2012), The NICT ASR system for IWSLT2012, IWSLT
Mohammed Mediani, Yuqi Zhang, Thanh-Le Ha, Jan Niehues, Eunah Cho, Teresa Herrmann, Rainer Kärgel, Alex Waibel (2012), The KIT translation systems for IWSLT 2012, IWSLT
Eva Hasler, Peter Bell, Arnab Ghoshal, Barry Haddow, Philipp Koehn, Fergus McInnes, Steve Renals, Pawel Swietojanski (2012), The UEDIN systems for the IWSLT 2012 evaluation, IWSLT
Graham Neubig, Kevin Duh, Masaya Ogushi, Takatomo Kano, Tetsuo Kiso, Sakriani Sakti, Tomoki Toda, Satoshi Nakamura (2012), The NAIST machine translation system for IWSLT2012, IWSLT
Nicholas Ruiz, Arianna Bisazza, Roldano Cattoni, Marcello Federico (2012), FBK’s machine translation systems for IWSLT 2012’s TED lectures, IWSLT
Stephan Peitz, Saab Mansour, Markus Freitag, Minwei Feng, Matthias Huck, Joern Wuebker, Malte Nuhn, Markus Nußbaum-Thom, Hermann Ney (2012), The RWTH Aachen speech recognition and machine translation system for IWSLT 2012, IWSLT
Xiaoning Zhu, Yiming Cui, Conghui Zhu, Tiejun Zhao, Hailong Cao (2012), The HIT-LTRC machine translation system for IWSLT 2012, IWSLT
Daniele Falavigna, Roberto Gretter, Fabio Brugnara, Diego Giuliani (2012), FBK@IWSLT 2012 - ASR track, IWSLT
Christian Saam, Christian Mohr, Kevin Kilgour, Michael Heck, Matthias Sperber, Keigo Kubo, Sebastian Stüker, Sakriani Sakti, Graham Neubig, Tomoki Toda, Satoshi Nakamura, Alex Waibel (2012), The 2012 KIT and KIT-NAIST English ASR systems for the IWSLT evaluation, IWSLT
Michael Heck, Keigo Kubo, Matthias Sperber, Sakriani Sakti, Sebastian Stüker, Christian Saam, Kevin Kilgour, Christian Mohr, Graham Neubig, Tomoki Toda, Satoshi Nakamura, Alex Waibel (2012), The KIT-NAIST (contrastive) English ASR system for IWSLT 2012, IWSLT
Chenhui Chu, Toshiaki Nakazawa, Sadao Kurohashi (2012), EBMT system of Kyoto University in OLYMPICS task at IWSLT 2012, IWSLT
Laurent Besacier, Benjamin Lecouteux, Marwen Azouzi, Luong Ngoc Quang (2012), The LIG English to French machine translation system for IWSLT 2012, IWSLT
Jennifer Drexler, Wade Shen, Terry Gleason, Tim Anderson, Raymond Slyh, Brian Ore, Eric Hansen (2012), The MIT-LL/AFRL IWSLT-2012 MT system, IWSLT
Hiroaki Shimizu, Masao Utiyama, Eiichiro Sumita, Satoshi Nakamura (2012), Minimum Bayes-Risk decoding extended with similar examples: NAIST-NICT at IWSLT 2012, IWSLT
Andrew Finch, Ohnmar Htun, Eiichiro Sumita (2012), The NICT translation system for IWSLT 2012, IWSLT
Krzysztof Marasek (2012), TED Polish-to-English translation system for the IWSLT 2012, IWSLT
Hwidong Na, Jong-Hyeok Lee (2012), Forest-to-string translation using binarized dependency forest for IWSLT 2012 OLYMPICS task, IWSLT
Ştefan Daniel Dumitrescu, Radu Ion, Dan Ştefǎnescu, Tiberiu Boroş, Dan Tufiş (2012), Romanian to English automatic MT experiments at IWSLT12 (system description paper), IWSLT
Coşkun Mermer, Hamza Kaya, İlknur Durgar El-Kahlout, Mehmet Uğur Doğan (2012), The TÜBİTAK statistical machine translation system for IWSLT 2012, IWSLT
Rohit Prasad, Rohit Kumar, Sankaranarayanan Ananthakrishnan, Wei Chen, Sanjika Hewavitharana, Matthew Roy, Frederick Choi, Aaron Challenner, Enoch Kan, Arvind Neelakantan, Prem Natarajan (2012), Active error detection and resolution for speech-to-speech translation, IWSLT
Takatomo Kano, Sakriani Sakti, Shinnosuke Takamichi, Graham Neubig, Tomoki Toda, Satoshi Nakamura (2012), A method for translation of paralinguistic information, IWSLT
Jan Niehues, Alex Waibel (2012), Continuous space language models using restricted Boltzmann machines, IWSLT
Daniele Falavigna, Roberto Gretter (2012), Focusing language models for automatic speech recognition, IWSLT
Philipp Koehn (2012), Simulating human judgment in machine translation evaluation campaigns, IWSLT
Walid Aransa, Holger Schwenk, Loic Barrault (2012), Semi-supervised transliteration mining from parallel and comparable corpora, IWSLT
Saab Mansour, Hermann Ney (2012), A simple and effective weighted phrase extraction for machine translation adaptation, IWSLT
Amittai Axelrod, QingJun Li, William D. Lewis (2012), Applications of data selection via cross-entropy difference for real-world statistical machine translation, IWSLT
Mei Tu, Yu Zhou, Chengqing Zong (2012), A universal approach to translating numerical and time expressions, IWSLT
Henrich Kolkhorst, Kevin Kilgour, Sebastian Stüker, Alex Waibel (2012), Evaluation of interactive user corrections for lecture transcription, IWSLT
Youzheng Wu, Hitoshi Yamamoto, Xugang Lu, Shigeki Matsuda, Chiori Hori, Hideki Kashioka (2012), Factored recurrent neural network language model in TED lecture transcription, IWSLT
Frédéric Blain, Holger Schwenk, Jean Senellart (2012), Incremental adaptation using translation information and post-editing analysis, IWSLT
Shahram Khadivi, Zeinab Vakil (2012), Interactive-predictive speech-enabled computer-assisted translation, IWSLT
Nicholas Ruiz, Marcello Federico (2012), MDI adaptation for the lazy: avoiding normalization in LM adaptation for lecture translation, IWSLT
Eunah Cho, Jan Niehues, Alex Waibel (2012), Segmentation and punctuation prediction in speech language translation using a monolingual translation system, IWSLT
Minwei Feng, Jan-Thorsten Peter, Hermann Ney (2012), Sequence labeling-based reordering model for phrase-based SMT, IWSLT
Eva Hasler, Barry Haddow, Philipp Koehn (2012), Sparse lexicalised features and topic adaptation for SMT, IWSLT
Stephan Peitz, Simon Wiesler, Markus Nußbaum-Thom, Hermann Ney (2012), Spoken language translation using automatically transcribed text in training, IWSLT
Marion Potet, Laurent Besacier, Hervé Blanchon, Marwen Azouzi (2012), Towards a better understanding of statistical post-edition usefulness, IWSLT
Li Gong, Aurélien Max, François Yvon (2012), Towards contextual adaptation for any-text translation, IWSLT
Kay Berkling, Uwe Reichel (2016), Progression in Materials for Learning to Read and Write - a Cross-Language and Cross-Century Comparison of Readers, WOCCI
Eva Fringi, Jill Fain Lehman, Martin Russell (2016), The role of phonological processes and acoustic confusability in phone errors in children’s ASR, WOCCI
Behnaz Nojavanasghari, Tadas Baltrusaitis, Charles E. Hughes, Louis Philippe Morency (2016), The Future Belongs to the Curious: Towards Automatic Understanding and Recognition of Curiosity in Children, WOCCI
Jaebok Kim, Khiet P. Truong (2016), Automatic analysis of children’s engagement using interactional network features, WOCCI
Jaebok Kim, Khiet Truong, Vanessa Evers (2016), Automatic detection of children's engagement using non-verbal features and ordinal learning, WOCCI
Abualseoud Hanani, Mays Attari, Atta' Farakhna, Aseel Joma'A, Mohammed Hussein, Stephen Taylor (2016), Automatic Identification of Articulation Disorders for Arabic Children Speakers, WOCCI
Yao Qian, Xinhao Wang, Keelan Evanini, David Suendermann-Oeft (2016), Improving DNN-Based Automatic Recognition of Non-native Children Speech with Adult Speech, WOCCI
Sapna Patel, Darin Hughes, Charles Hughes (2016), MeEmo - Using an Avatar to Improve Social Skills in Children with ASD, WOCCI
Jill Fain Lehman, Nikolas Wolfe, André Pereira (2016), G-g-go! Juuump! Online Performance of a Multi-keyword Spotter in a Real-time Game, WOCCI
Maryam Najafian, Dwight Irvin, Ying Luo, Beth Rous, John H. L. Hansen (2016), Automatic measurement and analysis of the child verbal communication using classroom acoustics within a child care center, WOCCI
Sadaoki Furui (2011), Data-intensive approaches for ASR, IWSLT
Daniel Marcu (2011), Meaning-equivalent semantics for understanding, generation, translation, and evaluation, IWSLT
Junichi Tsujii (2011), Resource-rich research on natural language processing and understanding, IWSLT
Marcello Federico, Luisa Bentivogli, Michael Paul, Sebastian Stüker (2011), Overview of the IWSLT 2011 evaluation campaign, IWSLT
Kazuhiko Abe, Youzheng Wu, Chien-lin Huang, Paul R. Dixon, Shigeki Matsuda, Chiori Hori, Hideki Kashioka (2011), The NICT ASR system for IWSLT2011, IWSLT
A. Ryan Aminzadeh, Tim Anderson, Ray Slyh, Brian Ore, Eric Hansen, Wade Shen, Jennifer Drexler, Terry Gleason (2011), The MIT-LL/AFRL IWSLT-2011 MT system, IWSLT
Pratyush Banerjee, Hala Almaghout, Sudip Naskar, Johann Roturier, Jie Jiang, Andy Way, Josef van Genabith (2011), The DCU machine translation systems for IWSLT 2011, IWSLT
Andrew Finch, Chooi-Ling Goh, Graham Neubig, Eiichiro Sumita (2011), The NICT translation system for IWSLT 2011, IWSLT
Xiaodong He, Amittai Axelrod, Li Deng, Alex Acero, Mei-Yuh Hwang, Alisa Nguyen, Andrew Wang, Xiahui Huang (2011), The MSR SYSTEM for IWSLT 2011 evaluation, IWSLT
Thomas Lavergne, Alexandre Allauzen, Hai-Son Le, François Yvon (2011), LIMSI's experiments in domain adaptation for IWSLT11, IWSLT
Benjamin Lecouteux, Laurent Besacier, Hervé Blanchon (2011), LIG English-French spoken language translation system for IWSLT 2011, IWSLT
Mohammed Mediani, Eunah Cho, Jan Niehues, Teresa Herrmann, Alex Waibel (2011), The KIT English-French translation systems for IWSLT 2011, IWSLT
Anthony Rousseau, Fethi Bougares, Paul Deléglise, Holger Schwenk, Yannick Estève (2011), LIUM's systems for the IWSLT 2011 speech translation tasks, IWSLT
Nick Ruiz, Arianna Bisazza, F. Brugnara, D. Falavigna, D. Giuliani, S. Jaber, R. Gretter, Marcello Federico (2011), FBK @ IWSLT 2011, IWSLT
Sebastian Stüker, Kevin Kilgour, Christian Saam, Alex Waibel (2011), The 2011 KIT English ASR system for the IWSLT evaluation, IWSLT
David Vilar, Eleftherios Avramidis, Maja Popović, Sabine Hunsicker (2011), DFKI's SC and MT submissions to IWSLT 2011, IWSLT
Joern Wuebker, Matthias Huck, Saab Mansour, Markus Freitag, Minwei Feng, Stephan Peitz, Christoph Schmidt, Hermann Ney (2011), The RWTH Aachen machine translation system for IWSLT 2011, IWSLT
Karim Boudahmane, Bianka Buschbeck, Eunah Cho, Josep Maria Crego, Markus Freitag, Thomas Lavergne, Hermann Ney, Jan Niehues, Stephan Peitz, Jean Senellart, Artem Sokolov, Alex Waibel, Tonio Wandmacher, Joern Wuebker, François Yvon (2011), Advances on spoken language translation in the Quaero program, IWSLT
Lori Lamel, Sandrine Courcinous, Julien Despres, Jean-Luc Gauvain, Yvan Josse, Kevin Kilgour, Florian Kraft, Viet Bac Le, Hermann Ney, Markus Nußbaum-Thom, Ilya Oparin, Tim Schlippe, Ralf Schlüter, Tanja Schultz, Thiago Fraga da Silva, Sebastian Stüker, Martin Sundermeyer, Bianca Vieru, Ngoc Thang Vu, Alex Waibel, Cécile Woehrling (2011), Speech recognition for machine translation in Quaero, IWSLT
Victoria Arranz, Olivier Hamon, Karim Boudahmane, Martine Garnier-Rizet (2011), Protocol and lessons learnt from the production of parallel corpora for the evaluation of speech translation systems, IWSLT
Arianna Bisazza, Nick Ruiz, Marcello Federico (2011), Fill-up versus interpolation methods for phrase-based SMT adaptation, IWSLT
Boxing Chen, Roland Kuhn, George Foster (2011), Semantic smoothing and fabrication of phrase pairs for SMT, IWSLT
Tagyoung Chung, Licheng Fang, Daniel Gildea (2011), SCFG latent annotation for machine translation, IWSLT
Chenchen Ding, Takashi Inui, Mikio Yamamoto (2011), Long-distance hierarchical structure transformation rules utilizing function words, IWSLT
Paul R. Dixon, Andrew Finch, Chiori Hori, Hideki Kashioka (2011), Investigation on the effects of ASR tuning on speech translation performance, IWSLT
Mridul Gupta, Sanjika Hewavitharana, Stephan Vogel (2011), Extending a probabilistic phrase alignment approach for SMT, IWSLT
Kenneth Heafield, Hieu Hoang, Philipp Koehn, Tetsuo Kiso, Marcello Federico (2011), Left language model state for syntactic machine translation, IWSLT
Matthias Huck, Saab Mansour, Simon Wiesler, Hermann Ney (2011), Lexicon models for hierarchical phrase-based machine translation, IWSLT
Kevin Kilgour, Christian Saam, Christian Mohr, Sebastian Stüker, Alex Waibel (2011), The 2011 KIT QUAERO speech-to-text system for Spanish, IWSLT
Wang Ling, Pável Calado, Bruno Martins, Isabel Trancoso, Alan Black, Luísa Coheur (2011), Named entity translation using anchor texts, IWSLT
Paul Maergner, Kevin Kilgour, Ian Lane, Alex Waibel (2011), Unsupervised vocabulary selection for simultaneous lecture translation, IWSLT
Saab Mansour, Joern Wuebker, Hermann Ney (2011), Combining translation and language model scoring for domain-specific data filtering, IWSLT
Jan Niehues, Alex Waibel (2011), Using Wikipedia to translate domain-specific terms in SMT, IWSLT
Stephan Peitz, Markus Freitag, Arne Mauser, Hermann Ney (2011), Modeling punctuation prediction as machine translation, IWSLT
Jan-Thorsten Peter, Matthias Huck, Hermann Ney, Daniel Stein (2011), Soft string-to-dependency hierarchical machine translation, IWSLT
Anne H. Schneider, Saturnino Luz (2011), Speaker alignment in synthesised, machine translated communication, IWSLT
Nadi Tomeh, Marco Turchi, Guillaume Wisniewski, Alexandre Allauzen, François Yvon (2011), How good are your phrases? assessing phrase quality with single class classification, IWSLT
Keiji Yasuda, Hideo Okuma, Masao Utiyama, Eiichiro Sumita (2011), Annotating data selection for improving machine translation, IWSLT
Tuomas Virtanen (2012), Human sound perception - what can we learn from it when developing audio analysis algorithms?, SAPA
Majid Mirbagheri, Yanbo Xu, Shihab Shamma (2012), Pitch estimation using mutual information, SAPA
Mauro Nicolao, Roger K. Moore (2012), Establishing some principles of human speech production through two-dimensional computational models, SAPA
Tomoyasu Nakano, Masataka Goto (2012), A spectral envelope estimation method based on F0-adaptive multi-frame integration analysis, SAPA
Cong-Thanh Do, Claude Barras (2012), Cochlear implant-like processing of speech signal for speaker verification, SAPA
Cassia Valentini-Botinhao, Junichi Yamagishi, Simon King (2012), Evaluating speech intelligibility enhancement for HMM-based synthetic speech in noise, SAPA
Sunder Ram Krishnan, Chandra Sekhar Seelamantula (2012), A generalized Stein’s estimation approach for speech enhancement based on perceptual criteria, SAPA
Zoltán Tüske, Friedhelm R. Drepper, Ralf Schlüter (2012), Non-stationary signal processing and its application in speech recognition, SAPA
Liang Lu, Arnab Ghoshal, Steve Renals (2012), Joint uncertainty decoding with unscented transform for noise robust subspace Gaussian mixture models, SAPA
M. Ali Basha Shaik, David Rybach, Stefan Hahn, Ralf Schlüter, Hermann Ney (2012), Hierarchical hybrid language models for open vocabulary continuous speech recognition using WFST, SAPA
Serena Soldo, Mathew Magimai-Doss, Hervé Bourlard (2012), Template-based ASR using posterior features and synthetic references: comparing different TTS systems, SAPA
Kalu U. Ogbureke, João P. Cabral, Julie Carson-Berndsen (2012), Explicit duration modelling in HMM-based speech synthesis using a hybrid hidden Markov model-multilayer perceptron, SAPA
Deepu Vijayasenan, Fabio Valente (2012), Dimensionality reduction of large TDOA vectors for speaker diarization, SAPA
Youssef Oualil, Mathew Magimai-Doss, Friedrich Faubel, Dietrich Klakow (2012), Joint detection and localization of multiple speakers using a probabilistic interpretation of the steered response power, SAPA
Afsaneh Asaei, Bhiksha Raj, Hervé Bourlard, Volkan Cevher (2012), Structured sparse coding for microphone array location calibration, SAPA
Takuya Yoshioka, Daichi Sakaue (2012), Log-normal matrix factorization with application to speech-music separation, SAPA
Rahil Mahdian Toroghi, Friedrich Faubel, Dietrich Klakow (2012), Multi-channel speech separation with soft time-frequency masking, SAPA
Heyun Huang, Louis ten Bosch, Bert Cranen, Lou Boves (2012), Smoothing speech trajectories by regularization, SAPA
Joris Driesen, Jort F. Gemmeke, Hugo Van hamme (2012), Data-driven speech representations for NMF-based word learning, SAPA
Samuel K. Ngouoko M, Martin Heckmann, Britta Wrede (2012), Spectro-temporal features with distribution equalization, SAPA
Kamal Sahni, Pranay Dighe, Rita Singh, Bhiksha Raj (2012), Language identification using spectro-temporal patch features, SAPA
Josh H. McDermott, Daniel P. W. Ellis, Hideki Kawahara (2012), Inharmonic speech: a tool for the study of speech perception and separation, SAPA
L. J. Rodríguez, I. Torres, A. Varona (2001), Annotation and analysis of disfluencies in a spontaneous speech corpus in Spanish, DiSS
Robert Eklund (2001), Prolongations: A dark horse in the disfluency stable, DiSS
Peter Howell, James Au-Yeung (2001), Application of EXPLAN theory to spontaneous speech control, DiSS
Nada Vasic, Frank Wijnen (2001), Stuttering and speech monitoring, DiSS
Michiko Yoshida (2001), Repeated phoneme effect in Japanese speech errors, DiSS
Sieb G. Nooteboom (2001), Different sources of lexical bias and overt self-corrections, DiSS
Yasuharu Den (2001), Are word repetitions really intended by the speaker?, DiSS
Mandana Seyfeddinipur, Sotaro Kita (2001), Gesture as an indicator of early error detection in self-monitoring of speech, DiSS
Laura Abou-Haidar (2001), Pauses in speech by French speakers with Down Syndrome, DiSS
Tapio Hokkanen (2001), Prosodic marking of self-repairs, DiSS
Danielle Duez (2001), Acoustico-phonetic characteristics of filled pauses in spontaneous French speech: preliminary results, DiSS
Klaus J. Kohler, Benno Peters, Thomas Wesener (2001), Interruption glottalization in German spontaneous speech, DiSS
Nikolinka Nenova, Gina Joue, Ronan Reilly, Julie Carson-Berndsen (2001), Sound and function regularities in interjections, DiSS
Richard Shillcock, Simon Kirby, Scott McDonald, Chris Brew (2001), Filled pauses and their status in the mental lexicon, DiSS
Mária Gósy (2001), The double function of disfluency phenomena in spontaneous speech, DiSS
Karl G. D. Bailey, Fernanda Ferreira (2001), Do non-word disfluencies affect syntactic parsing?, DiSS
Jan McAllister, Susan Cato-Symonds, Blake Johnson (2001), Listeners' ERP responses to false starts and repetitions in spontaneous speech, DiSS
Jeanne-Marie Debaisieux, José Deulofeu (2001), Grammatically unacceptable utterances are communicatively accepted by native speakers, why are they?, DiSS
Jörg Spilker, Anton Batliner, Elmar Nöth (2001), How to repair speech repairs in an end-to-end system, DiSS
Ben Hutchinson, Cécile Pereira (2001), Um, one large pizza. A preliminary study of disfluency modelling for improving ASR, DiSS
Caroline L. Rieger (2001), Idiosyncratic fillers in the speech of bilinguals, DiSS
Asa Wengelin (2001), Disfluencies in writing - are they like in speaking?, DiSS
Michiko Watanabe (2001), The usage of fillers at discourse segment boundaries in Japanese lecture-style monologues, DiSS
Robin J. Lickley (2001), Dialogue moves and disfluency rates, DiSS
Ellen G. Bard, Robin J. Lickley, Matthew P. Aylett (2001), Is disfluency just difficulty?, DiSS
Hideki Kenmochi (2010), VOCALOID and Hatsune Miku phenomenon in Japan, InterSinging
Satoru Fukayama, Kei Nakatsuma, Shinji Sako, Takuya Nishimoto, Nobutaka Ono, Shigeki Sagayama (2010), Automatic song composition from Japanese lyrics with singing voice synthesizer, InterSinging
Makoto Tachibana, Shin'ichiro Nakaoka, Hideki Kenmochi (2010), A singing robot realized by a collaboration of VOCALOID and cybernetic human HRP-4C, InterSinging
Margarita Mazo, Ken-ichi Sakakibara, Hiroshi Imagawa, Niro Tayama, Donna Erickson (2010), Vocal fold vibration in vocal expression of sadness: lamenting, speaking and singing, InterSinging
Michael I. Proctor, Shrikanth Narayanan, Krishna Nayak (2010), Para-linguistic mechanisms of production in human "beatboxing": a real-time magnetic resonance imaging study, InterSinging
Donna Erickson, Tomoe Suzuki, Kayo Tanosaki, Takeshi Saito, Eri Haneishi, Kuniyo Yahiro, Hiroko Kishimoto (2010), Ah, how sweet the sound: some acoustic characteristics of emotionally sung /ah/, InterSinging
Hideyuki Tachibana, Nobutaka Ono, Shigeki Sagayama (2010), Singing voice enhancement for monaural music signals based on multiple time-frequency analysis, InterSinging
Fernando Villavicencio, Hideki Kenmochi (2010), Resurrecting past singers: non-parallel singing-voice conversion, InterSinging
Graeme M. Clark (1998), Cochlear implants in the second and third millennia, ICSLP
Stephanie Seneff (1998), The use of linguistic hierarchies in speech understanding, ICSLP
Paul C. Bagshaw (1998), Unsupervised training of phone duration and energy models for text-to-speech synthesis, ICSLP
Jerome R. Bellegarda, Kim E. A. Silverman (1998), Improved duration modeling of English phonemes using a root sinusoidal transformation, ICSLP
Chilin Shih, Wentao Gu, Jan P. H. van Santen (1998), Efficient adaptation of TTS duration model to new speakers, ICSLP
Takayoshi Yoshimura, Keiichi Tokuda, Takashi Masuko, Takao Kobayashi, Tadashi Kitamura (1998), Duration modeling for HMM-based speech synthesis, ICSLP
Cameron S. Fordyce, Mari Ostendorf (1998), Prosody prediction for speech synthesis using transformational rule-based learning, ICSLP
Susan Fitt, Stephen Isard (1998), Representing the environments for phonological processes in an accent-independent lexicon for synthesis of English, ICSLP
Daniel Faulkner, Charles Bryant (1998), Efficient lexical retrieval for English text-to-speech synthesis, ICSLP
Robert E. Donovan, Ellen M. Eide (1998), The IBM trainable speech synthesis system, ICSLP
Sarah Hawkins, Jill House, Mark Huckvale, John Local, Richard Ogden (1998), Prosynth: an integrated prosodic approach to device-independent, natural-sounding speech synthesis, ICSLP
Jialu Zhang, Shiwei Dong, Ge Yu (1998), Total quality evaluation of speech synthesis systems, ICSLP
Gerit P. Sonntag, Thomas Portele (1998), Comparative evaluation of synthetic prosody with the PURR method, ICSLP
Richard Sproat, Andrew Hunt, Mari Ostendorf, Paul Taylor, Alan W. Black, Kevin Lenzo, Mike Eddington (1998), SABLE: a standard for TTS markup, ICSLP
H. Timothy Bunnell, Steve R. Hoskins, Debra Yarrington (1998), Prosodic vs. segmental contributions to naturalness in a diphone synthesizer, ICSLP
Alex Acero (1998), A mixed-excitation frequency domain model for time-scale pitch-scale modification of speech, ICSLP
Masami Akamine, Takehiko Kagoshima (1998), Analytic generation of synthesis units by closed loop training for totally speaker driven text to speech system (TOS drive TTS), ICSLP
Martti Vainio, Toomas Altosaar (1998), Modeling the microprosody of pitch and loudness for speech synthesis with neural networks, ICSLP
David T. Chappell, John H. L. Hansen (1998), Spectral smoothing for concatenative speech synthesis, ICSLP
Aimin Chen, Saeed Vaseghi, Charles Ho (1998), MIMIC: a voice-adaptive phonetic-tree speech synthesiser, ICSLP
Jehun Jeon, Sunhwa Cha, Minhwa Chung, Jun Park, Kyuwoong Hwang (1998), Automatic generation of Korean pronunciation variants by multistage applications of phonological rules, ICSLP
Stephen Cox, Richard Brady, Peter Jackson (1998), Techniques for accurate automatic annotation of speech waveforms, ICSLP
Andrew Cronk, Michael W. Macon (1998), Optimized stopping criteria for tree-based unit selection in concatenative synthesis, ICSLP
Stephanie de Tournemire (1998), Automatic transcription of intonation using an identified prosodic alphabet, ICSLP
Ignasi Esquerra, Albert Febrer, Climent Nadeu (1998), Frequency analysis of phonetic units for concatenative synthesis in Catalan, ICSLP
Alex Chengyu Fang, Jill House, Mark Huckvale (1998), Investigating the syntactic characteristics of English tone units, ICSLP
Antonio Bonafonte, Ignasi Esquerra, Albert Febrer, José A.R. Fonollosa, Francesc Vallverdu (1998), The UPC text-to-speech system for Spanish and Catalan, ICSLP
Attila Ferencz, Istvan Nagy, Tunde-Csilla Kovacs, Maria Ferencz, Teodora Ratiu (1998), The new version of the ROMVOX text-to-speech synthesis system based on a hybrid time domain-LPC synthesis technique, ICSLP
Takehiko Kagoshima, Masahiro Morita, Shigenobu Seto, Masami Akamine (1998), An F0 contour control model for totally speaker driven text to speech system, ICSLP
Keikichi Hirose, Hiromichi Kawanami (1998), On the relationship of speech rates with prosodic units in dialogue speech, ICSLP
Esther Klabbers, Raymond Veldhuis (1998), On the reduction of concatenation artefacts in diphone synthesis, ICSLP
Chih-Chung Kuo, Kun-Yuan Ma (1998), Error analysis and confidence measure of Chinese word segmentation, ICSLP
Jungchul Lee, Donggyu Kang, Sanghoon Kim, Koengmo Sung (1998), Energy contour generation for a sentence using a neural network learning method, ICSLP
Yong-Ju Lee, Sook-hyang Lee, Jong-Jin Kim, Hyun-Ju Ko, Young-Il Kim, Sang-Hun Kim, Jung-Cheol Lee (1998), A computational algorithm for F0 contour generation in Korean developed with prosodically labeled databases using K-ToBI system, ICSLP
Kevin Lenzo, Christopher Hogan, Jeffrey Allen (1998), Rapid-deployment text-to-speech in the DIPLOMAT system, ICSLP
Robert H. Mannell (1998), Formant diphone parameter extraction utilising a labelled single-speaker database, ICSLP
Osamu Mizuno, Shin'ya Nakajima (1998), A new synthetic speech/sound control language, ICSLP
Ryo Mochizuki, Yasuhiko Arai, Takashi Honda (1998), A study on the natural-sounding Japanese phonetic word synthesis by using the VCV-balanced word database that consists of the words uttered forcibly in two types of pitch accent, ICSLP
Vincent Pagel, Kevin Lenzo, Alan W. Black (1998), Letter to sound rules for accented lexicon compression, ICSLP
Ze'ev Roth, Judith Rosenhouse (1998), A name announcement algorithm with memory size and computational power constraints, ICSLP
Frederique Sannier, Rabia Belrhali, Véronique Aubergé (1998), How a French TTS system can describe loanwords, ICSLP
Tomaz Sef, Ales Dobnikar, Matjaz Gams (1998), Improvements in Slovene text-to-speech synthesis, ICSLP
Shigenobu Seto, Masahiro Morita, Takehiko Kagoshima, Masami Akamine (1998), Automatic rule generation for linguistic features analysis using inductive learning technique: linguistic features analysis in TOS drive TTS system, ICSLP
Yoshinori Shiga, Hiroshi Matsuura, Tsuneo Nitta (1998), Segmental duration control based on an articulatory model, ICSLP
Evelyne Tzoukermann (1998), Text analysis for the Bell Labs French text-to-speech system, ICSLP
Jennifer J. Venditti, Jan P. H. van Santen (1998), Modeling vowel duration for Japanese text-to-speech synthesis, ICSLP
Ren-Hua Wang, Qinfeng Liu, Yongsheng Teng, Deyu Xia (1998), Towards a Chinese text-to-speech system with higher naturalness, ICSLP
Andrew P. Breen, Peter Jackson (1998), A phonologically motivated method of selecting non-uniform units, ICSLP
Steve Pearson, Nick Kibre, Nancy Niedzielski (1998), A synthesis method based on concatenation of demisyllables and a residual excited vocal tract model, ICSLP
Ann K. Syrdal, Alistair Conkie, Yannis Stylianou (1998), Exploration of acoustic correlates in speaker selection for concatenative synthesis, ICSLP
Johan Wouters, Michael W. Macon (1998), A perceptual evaluation of distance measures for concatenative speech synthesis, ICSLP
Mike Plumpe, Alex Acero, Hsiao-Wuen Hon, Xuedong Huang (1998), HMM-based smoothing for concatenative speech synthesis, ICSLP
Martin Holzapfel, Nick Campbell (1998), A nonlinear unit selection strategy for concatenative speech synthesis based on syllable level features, ICSLP
Robert Eklund, Anders Lindström (1998), How to handle "foreign" sounds in Swedish text-to-speech conversion: approaching the 'xenophone' problem, ICSLP
Nick Campbell (1998), Multi-lingual concatenative speech synthesis, ICSLP
Takashi Saito (1998), On the use of F0 features in automatic segmentation for speech synthesis, ICSLP
Atsuhiro Sakurai, Takashi Natsume, Keikichi Hirose (1998), A linguistic and prosodic database for data-driven Japanese TTS synthesis, ICSLP
Alexander Kain, Michael W. Macon (1998), Text-to-speech voice adaptation from sparse training data, ICSLP
Gregor Möhler (1998), Describing intonation with a parametric model, ICSLP
Joakim Gustafson, Patrik Elmberg, Rolf Carlson, Arne Jonsson (1998), An educational dialogue system with a user controllable dialogue manager, ICSLP
Klaus Failenschmid, J.H. Simon Thornton (1998), End-user driven dialogue system design: the reward experience, ICSLP
Yi-Chung Lin, Tung-Hui Chiang, Heui-Ming Wang, Chung-Ming Peng, Chao-Huang Chang (1998), The design of a multi-domain Mandarin Chinese spoken dialogue system, ICSLP
Kallirroi Georgila, Anastasios Tsopanoglou, Nikos Fakotakis, George Kokkinakis (1998), An integrated dialogue system for the automation of call centre services, ICSLP
Kuansan Wang (1998), An event driven model for dialogue systems, ICSLP
Cosmin Popovici, Paolo Baggia, Pietro Laface, Loreta Moisa (1998), Automatic classification of dialogue contexts for dialogue predictions, ICSLP
Ganesh N. Ramaswamy, Jan Kleindienst (1998), Automatic identification of command boundaries in a conversational natural language user interface, ICSLP
Massimo Poesio, Andrei Mikheev (1998), The predictive power of game structure in dialogue act recognition: experimental results using maximum entropy estimation, ICSLP
Paul C. Constantinides, Scott Hansma, Chris Tchou, Alexander I. Rudnicky (1998), A schema based approach to dialog control, ICSLP
Gregory Aist (1998), Expanding a time-sensitive conversational architecture for turn-taking to handle content-driven interruption, ICSLP
Marc Swerts, Hanae Koiso, Atsushi Shimojima, Yasuhiro Katagiri (1998), On different functions of repetitive utterances, ICSLP
Hiroaki Noguchi, Yasuharu Den (1998), Prosody-based detection of the context of backchannel responses, ICSLP
Lena Strombäck, Arne Jonsson (1998), Robust interpretation for spoken dialogue systems, ICSLP
Yohei Okato, Keiji Kato, Mikio Yamamoto, Shuichi Itahashi (1998), System-user interaction and response strategy in spoken dialogue system, ICSLP
Noriko Suzuki, Kazuo Ishii, Michio Okada (1998), Organizing self-motivated dialogue with autonomous creatures, ICSLP
Gerhard Hanrieder, Paul Heisterkamp, Thomas Brey (1998), Fly with the EAGLES: evaluation of the "ACCeSS" spoken language dialogue system, ICSLP
Maria Aretoulaki, Stefan Harbeck, Florian Gallwitz, Elmar Nöth, Heinrich Niemann, Jozef Ivanecky, Ivo Ipsic, Nikola Pavesic, Vaclav Matousek (1998), SQEL: a multilingual and multifunctional dialogue system, ICSLP
Stefan Kaspar, Achim Hoffmann (1998), Semi-automated incremental prototyping of spoken dialog systems, ICSLP
Peter A. Heeman, Michael Johnston, Justin Denney, Edward Kaiser (1998), Beyond structured dialogues: factoring out grounding, ICSLP
Masahiro Araki, Shuji Doshita (1998), A robust dialogue model for spoken dialogue processing, ICSLP
Tom Brøndsted, Bo Nygaard Bai, Jesper Østergaard Olsen (1998), The REWARD service creation environment: an overview, ICSLP
Matthew Bull, Matthew Aylett (1998), An analysis of the timing of turn-taking in a corpus of goal-oriented dialogue, ICSLP
Sarah Davies, Massimo Poesio (1998), The provision of corrective feedback in a spoken dialogue CALL system, ICSLP
Laurence Devillers, Helene Bonneau-Maynard (1998), Evaluation of dialog strategies for a tourist information retrieval system, ICSLP
Sadaoki Furui, Koh'ichiro Yamaguchi (1998), Designing a multimodal dialogue system for information retrieval, ICSLP
Dinghua Guan, Min Chu, Quan Zhang, Jian Liu, Xiangdong Zhang (1998), The research project of man-computer dialogue system in Chinese, ICSLP
Kate S. Hone, David Golightly (1998), Interfaces for speech recognition systems: the impact of vocabulary constraints and syntax on performance, ICSLP
Tatsuya Iwase, Nigel Ward (1998), Pacing spoken directions to suit the listener, ICSLP
Annika Flycht-Eriksson, Arne Jonsson (1998), A spoken dialogue system utilizing spatial information, ICSLP
Candace A. Kamm, Diane J. Litman, Marilyn A. Walker (1998), From novice to expert: the effect of tutorials on user expertise with spoken dialogue systems, ICSLP
Takeshi Kawabata (1998), Emergent computational dialogue management architecture for task-oriented spoken dialogue systems, ICSLP
Tadahiko Kumamoto, Akira Ito (1998), An analysis of dialogues with our dialogue system through a WWW page, ICSLP
Michael F. McTear (1998), Modelling spoken dialogues with state transition diagrams: experiences with the CSLU toolkit, ICSLP
Michio Okada, Noriko Suzuki, Jacques Terken (1998), Situated dialogue coordination for spoken dialogue systems, ICSLP
Xavier Pouteau, Luis Arevalo (1998), Robust spoken dialogue systems for consumer products: a concrete application, ICSLP
Daniel Willett, Arno Romer, Jörg Rottland, Gerhard Rigoll (1998), A German dialogue system for scheduling dates and meetings by naturally spoken continuous speech, ICSLP
Chung-Hsien Wu, Gwo-Lang Yan, Chien-Liang Lin (1998), Spoken dialogue system using corpus-based hidden Markov model, ICSLP
Peter Wyard, Gavin Churcher (1998), A realistic Wizard of Oz simulation of a multimodal spoken language system, ICSLP
Yen-Ju Yang, Lin-Shan Lee (1998), A syllable-based Chinese spoken dialogue system for telephone directory services primarily trained with a corpus, ICSLP
Hiroyuki Yano, Akira Ito (1998), How disagreement expressions are used in cooperative tasks, ICSLP
Phil Rose (1998), Tones of a tridialectal: acoustic and perceptual data on ten linguistic tonetic contrasts between Lao, Nyo and Standard Thai, ICSLP
Napier Guy Ian Thompson (1998), Tone sandhi between complex tones in a seven-tone southern Thai dialect, ICSLP
Alexander Robertson Coupe (1998), The acoustic and perceptual features of tone in the Tibeto-Burman language Ao Naga, ICSLP
Phil Rose (1998), The differential status of semivowels in the acoustic phonetic realisation of tone, ICSLP
Kai Alter, Karsten Steinhauer, Angela D. Friederici (1998), De-accentuation: linguistic environments and prosodic realizations, ICSLP
N. Amir, S. Ron (1998), Towards an automatic classification of emotions in speech, ICSLP
Marc Schröder, Véronique Aubergé, Marie-Agnes Cathiard (1998), Can we hear smile?, ICSLP
Matthew Aylett, Matthew Bull (1998), The automatic marking of prominence in spontaneous speech using duration and part of speech information, ICSLP
JongDeuk Kim, SeongJoon Baek, MyungJin Bae (1998), On a pitch alteration technique in excited cepstral spectrum for high quality TTS, ICSLP
Jan Buckow, Anton Batliner, Richard Huber, Elmar Nöth, Volker Warnke, Heinrich Niemann (1998), Dovetailing of acoustics and prosody in spontaneous speech recognition, ICSLP
Janet E. Cahn (1998), A computational memory and processing model for prosody, ICSLP
Belinda Collins (1998), Convergence of fundamental frequencies in conversation: if it happens, does it matter?, ICSLP
Hiroya Fujisaki, Sumio Ohno, Takashi Yagi, Takeshi Ono (1998), Analysis and interpretation of fundamental frequency contours of British English in terms of a command-response model, ICSLP
Frode Holm, Kazue Hata (1998), Common patterns in word level prosody, ICSLP
Yasuo Horiuchi, Akira Ichikawa (1998), Prosodic structure in Japanese spontaneous speech, ICSLP
Shunichi Ishihara (1998), An acoustic-phonetic description of word tone in Kagoshima Japanese, ICSLP
Koji Iwano, Keikichi Hirose (1998), Representing prosodic words using statistical models of moraic transition of fundamental frequency contours of Japanese, ICSLP
Tae-Yeoub Jang, Minsuck Song, Kiyeong Lee (1998), Disambiguation of Korean utterances using automatic intonation recognition, ICSLP
Oliver Jokisch, Diane Hirschfeld, Matthias Eichner, Rüdiger Hoffmann (1998), Multi-level rhythm control for speech synthesis using hybrid data driven and rule-based approaches, ICSLP
Jiangping Kong (1998), EGG model of ditoneme in Mandarin, ICSLP
Geetha Krishnan, Wayne Ward (1998), Temporal organization of speech for normal and fast rates, ICSLP
Haruo Kubozono (1998), A syllable-based generalization of Japanese accentuation, ICSLP
Hyuck-Joon Lee (1998), Non-adjacent segmental effects in tonal realization of accentual phrase in Seoul Korean, ICSLP
Eduardo López, Javier Caminero, Ismael Cortazar, Luis A. Hernández (1998), Improvement on connected numbers recognition using prosodic information, ICSLP
Kazuaki Maeda, Jennifer J. Venditti (1998), Phonetic investigation of boundary pitch movements in Japanese, ICSLP
Kikuo Maekawa (1998), Phonetic and phonological characteristics of paralinguistic information in spoken Japanese, ICSLP
Arman Maghbouleh (1998), ToBI accent type recognition, ICSLP
Hansjorg Mixdorff, Hiroya Fujisaki (1998), The influence of syllable structure on the timing of intonational events in German, ICSLP
Osamu Mizuno, Shin'ya Nakajima (1998), New prosodic control rules for expressive synthetic speech, ICSLP
Mitsuru Nakai, Hiroshi Shimodaira (1998), The use of F0 reliability function for prosodic command analysis on F0 contour generation model, ICSLP
Sumio Ohno, Hiroya Fujisaki, Hideyuki Taguchi (1998), Analysis of effects of lexical accent, syntax, and global speech rate upon the local speech rate, ICSLP
Sumio Ohno, Hiroya Fujisaki, Yoshikazu Hara (1998), On the effects of speech rate upon parameters of the command-response model for the fundamental frequency contours of speech, ICSLP
Thomas Portele, Barbara Heuft (1998), The maximum-based description of F0 contours and its application to English, ICSLP
Thomas Portele (1998), Perceived prominence and acoustic parameters in American English, ICSLP
Erhard Rank, Hannes Pirker (1998), Generating emotional speech with a concatenative synthesizer, ICSLP
Albert Rilliard, Véronique Aubergé (1998), A perceptive measure of pure prosody linguistic functions with reiterant sentences, ICSLP
Kazuhito Koike, Hirotaka Suzuki, Hiroaki Saito (1998), Prosodic parameters in emotional speech, ICSLP
Barbertje M. Streefkerk, Louis C. W. Pols, Louis F.M. ten Bosch (1998), Automatic detection of prominence (as defined by listeners' judgements) in read aloud Dutch sentences, ICSLP
Masafumi Tamoto, Takeshi Kawabata (1998), A schema for illocutionary act identification with prosodic feature, ICSLP
Wataru Tsukahara (1998), An algorithm for choosing Japanese acknowledgments using prosodic cues and context, ICSLP
Chao Wang, Stephanie Seneff (1998), A study of tones and tempo in continuous Mandarin digit strings and their application in telephone quality speech recognition, ICSLP
Sandra P. Whiteside (1998), Simulated emotions: an acoustic study of voice and perturbation measures, ICSLP
Jin-song Zhang, Keikichi Hirose (1998), A robust tone recognition method of Chinese based on sub-syllabic F0 contours, ICSLP
Xiaonong Sean Zhu (1998), The microprosodics of tone sandhi in Shanghai disyllabic compounds, ICSLP
Natalija Bolfan-Stosic, Tatjana Prizl (1998), Jitter and shimmer differences between pathological voices of school children, ICSLP
Xiaonong Sean Zhu (1998), What spreads, and how? tonal rightward spreading on Shanghai disyllabic compounds, ICSLP
Sean Zhu, Phil Rose (1998), Tonal complexity as a dialectal feature: 25 different citation tones from four Zhejiang Wu dialects, ICSLP
Juan Manuel Montero, Juana M. Gutierrez-Arriola, Sira Palazuelos, Emilia Enriquez, Santiago Aguilera, José Manuel Pardo (1998), Emotional speech synthesis: from speech database to TTS, ICSLP
Cecile Pereira, Catherine Watson (1998), Some acoustic characteristics of emotion, ICSLP
Marc Swerts (1998), Intonative structure as a determinant of word order variation in Dutch verbal endgroups, ICSLP
Johanneke Caspers (1998), Experiments on the meaning of two pitch accent types: the 'pointed hat' versus the accent-lending fall in Dutch, ICSLP
Sun-Ah Jun, Hyuck-Joon Lee (1998), Phonetic and phonological markers of contrastive focus in Korean, ICSLP
Emiel Krahmer, Marc Swerts (1998), Reconciling two competing views on contrastiveness, ICSLP
Paul Taylor (1998), The tilt intonation model, ICSLP
Hiroya Fujisaki, Sumio Ohno, Seiji Yamada (1998), Analysis of occurrence of pauses and their durations in Japanese text reading, ICSLP
Estelle Campione, Jean Véronis (1998), A statistical study of pitch target points in five languages, ICSLP
Fabrice Malfrère, Thierry Dutoit, Piet Mertens (1998), Fully automatic prosody generator for text-to-speech, ICSLP
Halewijn Vereecken, Jean-Pierre Martens, Cynthia Grover, Justin Fackrell, Bert Van Coile (1998), Automatic prosodic labeling of 6 languages, ICSLP
Helen Wright (1998), Automatic utterance type detection using suprasegmental features, ICSLP
Ee Ling Low, Esther Grabe (1998), A contrastive study of lexical stress placement in Singapore English and British English, ICSLP
Florian Gallwitz, Anton Batliner, Jan Buckow, Richard Huber, Heinrich Niemann, Elmar Nöth (1998), Integrated recognition of words and phrase boundaries, ICSLP
Amalia Arvaniti (1998), Phrase accents revisited: comparative evidence from standard and Cypriot Greek, ICSLP
Grzegorz Dogil, Gregor Möhler (1998), Phonetic invariance and phonological stability: Lithuanian pitch accents, ICSLP
Christel Brindöpke, Gernot A. Fink, Franz Kummert, Gerhard Sagerer (1998), A HMM-based recognition system for perceptive relevant pitch movements of spontaneous German speech, ICSLP
Jean Véronis, Estelle Campione (1998), Towards a reversible symbolic coding of intonation, ICSLP
Xiaoqiang Luo, Frederick Jelinek (1998), Nonreciprocal data sharing in estimating HMM parameters, ICSLP
Jeff A. Bilmes (1998), Data-driven extensions to HMM statistical dependencies, ICSLP
Jiping Sun, Li Deng (1998), Use of high-level linguistic constraints for constructing feature-based phonological model in speech recognition, ICSLP
Steven C. Lee, James R. Glass (1998), Real-time probabilistic segmentation for segment-based speech recognition, ICSLP
Guillaume Gravier, Marc Sigelle, Gérard Chollet (1998), Toward Markov random field modeling of speech, ICSLP
Rukmini Iyer, Herbert Gish, Man-Hung Siu, George Zavaliagkos, Spyros Matsoukas (1998), Hidden Markov models for trajectory modeling, ICSLP
Katsura Aizawa, Chieko Furuichi (1998), A statistical phonemic segment model for speech recognition based on automatic phonemic segmentation, ICSLP
Kris Demuynck, Jacques Duchateau, Dirk Van Compernolle, Patrick Wambacq (1998), Improved feature decorrelation for HMM-based speech recognition, ICSLP
J. A. du Preez, D. M. Weber (1998), Efficient high-order hidden Markov modelling, ICSLP
Ellen M. Eide, Lalit R. Bahl (1998), A time-synchronous, tree-based search strategy in the acoustic fast match of an asynchronous speech recognition system, ICSLP
Jürgen Fritsch, Michael Finke, Alex Waibel (1998), Effective structural adaptation of LVCSR systems to unseen domains using hierarchical connectionist acoustic models, ICSLP
Aravind Ganapathiraju, Jonathan Hamaker, Joseph Picone (1998), Support vector machines for speech recognition, ICSLP
Malan B. Gandhi (1998), Natural number recognition using discriminatively trained inter-word context dependent hidden Markov models, ICSLP
Jonathan Hamaker, Aravind Ganapathiraju, Joseph Picone (1998), Information theoretic approaches to model selection, ICSLP
Kengo Hanai, Kazumasa Yamamoto, Nobuaki Minematsu, Seiichi Nakagawa (1998), Continuous speech recognition using segmental unit input HMMs with a mixture of probability density functions and context dependency, ICSLP
Jacques Simonin, Lionel Delphin-Poulat, Geraldine Damnati (1998), Gaussian density tree structure in a multi-Gaussian HMM-based speech recognition system, ICSLP
Hiroaki Kojima, Kazuyo Tanaka (1998), Generalized phone modeling based on piecewise linear segment lattice, ICSLP
Ryosuke Koshiba, Mitsuyoshi Tachimori, Hiroshi Kanazawa (1998), A flexible method of creating HMM using block-diagonalization of covariance matrices, ICSLP
C. Chesta, Pietro Laface, F. Ravera (1998), HMM topology selection for accurate acoustic and duration modeling, ICSLP
Tan Lee, Rolf Carlson, Björn Granström (1998), Context-dependent duration modelling for continuous speech recognition, ICSLP
Brian Mak, Enrico Bocchieri (1998), Training of context-dependent subspace distribution clustering hidden Markov model, ICSLP
Cesar Martin del Alamo, Luis Villarrubia, Francisco Javier Gonzalez, Luis A. Hernández (1998), Unsupervised training of HMMs with variable number of mixture components per state, ICSLP
Mate Szarvas, Shoichi Matsunaga (1998), Acoustic observation context modeling in segment based speech recognition, ICSLP
Ji Ming, Philip Hanna, Darryl Stewart, Saeed Vaseghi, F. Jack Smith (1998), Capturing discriminative information using multiple modeling techniques, ICSLP
Laurence Molloy, Stephen Isard (1998), Suprasegmental duration modelling with elastic constraints in automatic speech recognition, ICSLP
Albino Nogueiras-Rodriguez, José B. Mariño, Enric Monte (1998), An adaptive gradient-search based algorithm for discriminative training of HMM's, ICSLP
Albino Nogueiras-Rodriguez, José B. Mariño (1998), Task adaptation of sub-lexical unit models using the minimum confusibility criterion on task independent databases, ICSLP
Gordon Ramsay (1998), Stochastic calculus, non-linear filtering, and the internal model principle: implications for articulatory speech recognition, ICSLP
Christian J. Wellekens, Jussi Kangasharju, Cedric Milesi (1998), The use of meta-HMM in multistream HMM training for automatic speech recognition, ICSLP
Christian J. Wellekens (1998), Enhanced ASR by acoustic feature filtering, ICSLP
Christoph Neukirchen, Daniel Willett, Gerhard Rigoll (1998), Soft state-tying for HMM-based speech recognition, ICSLP
Silke Witt, Steve Young (1998), Estimation of models for non-native speech in computer-assisted language learning based on linear model combination, ICSLP
Tae-Young Yang, Ji-Sung Kim, Chungyong Lee, Dae Hee Youn, Il-Whan Cha (1998), Duration modeling using cumulative duration probability and speaking rate compensation, ICSLP
Geoffrey Zweig, Stuart Russell (1998), Probabilistic modeling with Bayesian networks for automatic speech recognition, ICSLP
Perasiriyan Sivakumaran, Aladdin M. Ariyaeeinia, Jill A. Hewitt (1998), Sub-band based speaker verification using dynamic recombination weights, ICSLP
Michael Barlow, Michael Wagner (1998), Measuring the dynamic encoding of speaker identity and dialect in prosodic parameters, ICSLP
Nicole Beringer, Florian Schiel, Peter Regel-Brietzmann (1998), German regional variants - a problem for automatic speech recognition?, ICSLP
Kay Berkling, Marc A. Zissman, Julie Vonwiller, Christopher Cleirigh (1998), Improving accent identification through knowledge of English syllable structure, ICSLP
Z. S. Bond, Donald Fucci, Verna Stockmal, Douglas McColl (1998), Multi-dimensional scaling of listener responses to complex auditory stimuli, ICSLP
Verna Stockmal, Danny R. Moates, Z. S. Bond (1998), Same talker, different language, ICSLP
Susanne Burger, Daniela Oppermann (1998), The impact of regional variety upon specific word categories in spontaneous German, ICSLP
Dominique Genoud, Gérard Chollet (1998), Speech pre-processing against intentional imposture in speaker recognition, ICSLP
Mike Lincoln, Stephen Cox, Simon Ringland (1998), A comparison of two unsupervised approaches to accent identification, ICSLP
Dominik R. Dersch, Christopher Cleirigh, Julie Vonwiller (1998), The influence of accents in Australian English vowels and their relation to articulatory tract parameters, ICSLP
J. A. du Preez, D. M. Weber (1998), Automatic language recognition using high-order HMMs, ICSLP
Marcos Faundez-Zanuy, Daniel Rodriguez-Porcheron (1998), Speaker recognition using residual signal of linear and nonlinear prediction models, ICSLP
Yong Gu, Trevor Thomas (1998), An implementation and evaluation of an on-line speaker verification system for field trials, ICSLP
Javier Hernando, Climent Nadeu (1998), Speaker verification on the Polycost database using frequency filtered spectral energies, ICSLP
Qin Jin, Luo Si, Qixiu Hu (1998), A high-performance text-independent speaker identification system based on BCDM, ICSLP
Hiroshi Kido, Hideki Kasuya (1998), Representation of voice quality features associated with talker individuality, ICSLP
Ji-Hwan Kim, Gil-Jin Jang, Seong-Jin Yun, Yung Hwan Oh (1998), Candidate selection based on significance testing and its use in normalisation and scoring, ICSLP
Yuko Kinoshita (1998), Japanese forensic phonetics: non-contemporaneous within-speaker variation in natural and read-out speech, ICSLP
Filipp Korkmazskiy, Biing-Hwang Juang (1998), Statistical modeling of pronunciation and production variations for speech recognition, ICSLP
Arne Kjell Foldvik, Knut Kvale (1998), Dialect maps and dialect research; useful tools for automatic speech recognition?, ICSLP
Youn-Jeong Kyung, Hwang-Soo Lee (1998), Text independent speaker recognition using micro-prosody, ICSLP
Yoik Cheng, Hong C. Leung (1998), Speaker verification using fundamental frequency, ICSLP
Weijie Liu, Toshihiro Isobe, Naoki Mukawa (1998), On optimum normalization method used for speaker verification, ICSLP
Harvey Lloyd-Thomas, Eluned S. Parris, Jeremy H. Wright (1998), Recurrent substrings and data fusion for language recognition, ICSLP
Konstantin P. Markov, Seiichi Nakagawa (1998), Text-independent speaker recognition using multiple information sources, ICSLP
Konstantin P. Markov, Seiichi Nakagawa (1998), Discriminative training of GMM using a modified EM algorithm for speaker recognition, ICSLP
Driss Matrouf, Martine Adda-Decker, Lori F. Lamel, Jean-Luc Gauvain (1998), Language identification incorporating lexical information, ICSLP
Enric Monte, Ramon Arqué, Xavier Miró (1998), A VQ based speaker recognition system based in histogram distances. text independent and for noisy environments, ICSLP
Asunción Moreno, José B. Mariño (1998), Spanish dialects: phonetic transcription, ICSLP
Mieko Muramatsu (1998), Acoustic analysis of Japanese English prosody: comparison between Fukushima dialect speakers and Tokyo dialect speakers in declarative sentences and yes-no questions, ICSLP
Hideki Noda, Katsuya Harada, Eiji Kawaguchi, Hidefumi Sawai (1998), A context-dependent approach for speaker verification using sequential decision, ICSLP
Javier Ortega-García, Santiago Cruz-Llanas, Joaquin Gonzalez-Rodriguez (1998), Quantitative influence of speech variability factors for automatic speaker verification in forensic tasks, ICSLP
Thilo Pfau, Guenther Ruske (1998), Creating hidden Markov models for fast speech, ICSLP
Tuan Pham, Michael Wagner (1998), Speaker identification using relaxation labeling, ICSLP
Leandro Rodriguez-Linares, Carmen García-Mateo (1998), A novel technique for the combination of utterance and speaker verification systems in a text-dependent speaker verification task, ICSLP
Phil Rose (1998), A forensic phonetic investigation into non-contemporaneous variation in the f-pattern of similar-sounding speakers, ICSLP
Astrid Schmidt-Nielsen, Thomas H. Crystal (1998), Human vs. machine speaker identification with telephone speech, ICSLP
Stefan Slomka, Sridha Sridharan, Vinod Chandran (1998), A comparison of fusion techniques in mel-cepstral based speaker identification, ICSLP
Hagen Soltau, Alex Waibel (1998), On the influence of hyperarticulated speech on recognition performance, ICSLP
Nuala C. Ward, Dominik R. Dersch (1998), Text-independent speaker identification and verification using the TIMIT database, ICSLP
Lisa R. Yanguas, Gerald C. O'Leary, Marc A. Zissman (1998), Incorporating linguistic knowledge into automatic dialect identification of Spanish, ICSLP
Yiying Zhang, Xiaoyan Zhu (1998), A novel text-independent speaker verification method using the global speaker model, ICSLP
Aaron E. Rosenberg, Ivan Magrin-Chagnolleau, S. Parthasarathy, Qian Huang (1998), Speaker detection in broadcast speech databases, ICSLP
Eluned S. Parris, Michael J. Carey (1998), Multilateral techniques for speaker recognition, ICSLP
Masafumi Nishida, Yasuo Ariki (1998), Real time speaker indexing based on subspace method - application to TV news articles and debate, ICSLP
George Doddington, Walter Liggett, Alvin Martin, Mark Przybocki, Douglas A. Reynolds (1998), SHEEP, GOATS, LAMBS and WOLVES: a statistical analysis of speaker performance in the NIST 1998 speaker recognition evaluation, ICSLP
Andres Corrada-Emmanuel, Michael Newman, Barbara Peskin, Lawrence Gillick, Robert Roth (1998), Progress in speaker recognition at Dragon Systems, ICSLP
Tomas Nordström, Haakan Melin, Johan Lindberg (1998), A comparative study of speaker verification systems using the Polycost database, ICSLP
Tomoko Matsui, Kiyoaki Aikawa (1998), Robust speaker verification insensitive to session-dependent utterance variation and handset-dependent distortion, ICSLP
Haakan Melin, Johan W. Koolwaaij, Johan Lindberg, Frédéric Bimbot (1998), A comparative evaluation of variance flooring techniques in HMM-based speaker verification, ICSLP
Dijana Petrovska-Delacretaz, Jan Cernocky, Jean Hennebert, Gérard Chollet (1998), Text-independent speaker verification using automatically labelled acoustic segments, ICSLP
Qi Li (1998), A fast decoding algorithm based on sequential detection of the changes in distribution, ICSLP
Jesper Østergaard Olsen (1998), Speaker verification with ensemble classifiers based on linear speech transforms, ICSLP
Jesper Østergaard Olsen (1998), Speaker recognition based on discriminative projection models, ICSLP
James Moody, Stefan Slomka, Jason Pelecanos, Sridha Sridharan (1998), On the convergence of Gaussian mixture models: improvements through vector quantization, ICSLP
Kemal Sönmez, Elizabeth Shriberg, Larry Heck, Mitchel Weintraub (1998), Modeling dynamic prosodic variation for speaker verification, ICSLP
Douglas A. Reynolds, Elliot Singer, Beth A. Carlson, Gerald C. O'Leary, Jack J. McLaughlin, Marc A. Zissman (1998), Blind clustering of speech utterances based on speaker and language characteristics, ICSLP
Diamantino Caseiro, Isabel M. Trancoso (1998), Spoken language identification using the SpeechDat corpus, ICSLP
Jerome Braun, Haim Levkowitz (1998), Automatic language identification with perceptually guided training and recurrent neural networks, ICSLP
Sarel van Vuuren, Hynek Hermansky (1998), On the importance of components of the modulation spectrum for speaker verification, ICSLP
Andrew P. Breen, O. Gloaguen, P. Stern (1998), A fast method of producing talking head mouth shapes from real speech, ICSLP
Phil R. Cohen, Michael Johnston, David McGee, Sharon L. Oviatt, Joshua Clow, Ira Smith (1998), The efficiency of multimodal interaction: a case study, ICSLP
Laszlo Czap (1998), Audio and audio-visual perception of consonants disturbed by white noise and 'cocktail party', ICSLP
Simon Downey, Andrew P. Breen, Maria Fernández, Edward Kaneen (1998), Overview of the Maya spoken language system, ICSLP
Mauro Cettolo, Daniele Falavigna (1998), Automatic recognition of spontaneous speech dialogues, ICSLP
Georg Fries, Stefan Feldes, Alfred Corbet (1998), Using an animated talking character in a web-based city guide demonstrator, ICSLP
Rika Kanzaki, Takashi Kato (1998), Influence of facial views on the McGurk effect in auditory noise, ICSLP
Tom Brøndsted, Lars Bo Larsen, Michael Manthey, Paul McKevitt, Thomas B. Moeslund, Kristian G. Olesen (1998), The intellimedia workbench - a generic environment for multimodal systems, ICSLP
Joshua Clow, Sharon L. Oviatt (1998), STAMP: a suite of tools for analyzing multimodal system processing, ICSLP
Sumi Shigeno (1998), Cultural similarities and differences in the recognition of audio-visual speech stimuli, ICSLP
Toshiyuki Takezawa, Tsuyoshi Morimoto (1998), A multimodal-input multimedia-output guidance system: MMGS, ICSLP
Oscar Vanegas, Akiji Tanaka, Keiichi Tokuda, Tadashi Kitamura (1998), HMM-based visual speech recognition using intensity and location normalization, ICSLP
Yanjun Xu, Limin Du, Guoqiang Li, Ziqiang Hou (1998), A hierarchy probability-based visual features extraction method for speechreading, ICSLP
Jörn Ostermann, Mark C. Beutnagel, Ariel Fischer, Yao Wang (1998), Integration of talking heads and text-to-speech synthesizers for visual TTS, ICSLP
Levent M. Arslan, David Talkin (1998), Speech driven 3-d face point trajectory synthesis algorithm, ICSLP
Eli Yamamoto, Satoshi Nakamura, Kiyohiro Shikano (1998), Speech-to-lip movement synthesis based on the EM algorithm using audio-visual HMMs, ICSLP
Deb Roy, Alex Pentland (1998), Learning words from natural audio-visual input, ICSLP
Stéphane Dupont, Juergen Luettin (1998), Using the multi-stream approach for continuous audio-visual speech recognition: experiments on the M2VTS database, ICSLP
Sharon L. Oviatt, Karen Kuhn (1998), Referential features and linguistic indirection in multimodal language, ICSLP
Michael Johnston (1998), Multimodal language processing, ICSLP
Jun-ichi Hirasawa, Noboru Miyazaki, Mikio Nakano, Takeshi Kawabata (1998), Implementation of coordinative nodding behavior on spoken dialogue systems, ICSLP
Masao Yokoyama, Kazumi Aoyama, Hideaki Kikuchi, Katsuhiko Shirai (1998), Use of non-verbal information in communication between human and robot, ICSLP
Steve Whittaker, John Choi, Julia Hirschberg, Christine H. Nakatani (1998), What you see is (almost) what you hear: design principles for user interfaces for accessing speech archives, ICSLP
Daniel Azzopardi, Shahram Semnani, Ben Milner, Richard Wiseman (1998), Improving accuracy of telephony-based, speaker-independent speech recognition, ICSLP
Aruna Bayya (1998), Rejection in speech recognition systems with limited training, ICSLP
Ruxin Chen, Miyuki Tanaka, Duanpei Wu, Lex Olorenshaw, Mariscela Amador (1998), A four layer sharing HMM system for very large vocabulary isolated word recognition, ICSLP
Rathinavelu Chengalvarayan (1998), A comparative study of hybrid modelling techniques for improved telephone speech recognition, ICSLP
Jae-Seung Choi, Jong-Seok Lee, Hee-Youn Lee (1998), Smoothing and tying for Korean flexible vocabulary isolated word recognition, ICSLP
Javier Ferreiros, Javier Macias-Guarasa, Ascensión Gallardo, José Colás, Ricardo Córdoba, José Manuel Pardo, Luis Villarrubia (1998), Recent work on a preselection module for a flexible large vocabulary speech recognition system in telephone environment, ICSLP
Masakatsu Hoshimi, Maki Yamada, Katsuyuki Niyada, Shozo Makino (1998), A study of noise robustness for speaker independent speech recognition method using phoneme similarity vector, ICSLP
Fran H. L. Jian (1998), Classification of Taiwanese tones based on pitch and energy movements, ICSLP
Finn Tore Johansen (1998), Phoneme-based recognition for the Norwegian SpeechDat(II) database, ICSLP
Montri Karnjanadecha, Stephen A. Zahorian (1998), Robust feature extraction for alphabet recognition, ICSLP
Hisashi Kawai, Norio Higuchi (1998), Recognition of connected digit speech in Japanese collected over the telephone network, ICSLP
Takuya Koizumi, Shuji Taniguchi, Kazuhiro Kohtoh (1998), Improving the speaker-dependency of subword-unit-based isolated word recognition, ICSLP
Tomohiro Konuma, Tetsu Suzuki, Maki Yamada, Yoshio Ohno, Masakatsu Hoshimi, Katsuyuki Niyada (1998), Speaker independent speech recognition method using constrained time alignment near phoneme discriminative frame, ICSLP
Ki Yong Lee, Joohun Lee (1998), A nonstationary autoregressive HMM with gain adaptation for speech recognition, ICSLP
Ren-yuan Lyu, Yuang-jin Chiang, Wen-ping Hsieh (1998), A large-vocabulary Taiwanese (MIN-NAN) multi-syllabic word recognition system based upon right-context-dependent phones with state clustering by acoustic decision tree, ICSLP
Kazuyo Tanaka, Hiroaki Kojima (1998), Speech recognition based on the distance calculation between intermediate phonetic code sequences in symbolic domain, ICSLP
York Chung-Ho Yang, June-Jei Kuo (1998), High accuracy Chinese speech recognition approach with Chinese input technology for telecommunication use, ICSLP
William J.J. Roberts, Yariv Ephraim (1998), Robust speech recognition using HMM's with Toeplitz state covariance matrices, ICSLP
David Thambiratnam, Sridha Sridharan (1998), Modeling of output probability distribution to improve small vocabulary speech recognition in adverse environments, ICSLP
Philippe Morin, Ted H. Applebaum, Robert Boman, Yi Zhao, Jean-Claude Junqua (1998), Robust and compact multilingual word recognizers using features extracted from a phoneme similarity front-end, ICSLP
Takeshi Yamada, Satoshi Nakamura, Kiyohiro Shikano (1998), An effect of adaptive beamforming on hands-free speech recognition based on 3-d Viterbi search, ICSLP
Joaquin Gonzalez-Rodriguez, Santiago Cruz-Llanas, Javier Ortega-García (1998), Coherence-based subband decomposition for robust speech and speaker recognition in noisy and reverberant rooms, ICSLP
Hui Jiang, Keikichi Hirose, Qiang Huo (1998), A minimax search algorithm for CDHMM based robust continuous speech recognition, ICSLP
Su-Lin Wu, Brian E. D. Kingsbury, Nelson Morgan, Steven Greenberg (1998), Performance improvements through combining phone- and syllable-scale information in automatic speech recognition, ICSLP
Arun C. Surendran, Chin-Hui Lee (1998), Predictive adaptation and compensation for robust speech recognition, ICSLP
Jean-Claude Junqua, Steven Fincke, Ken Field (1998), Influence of the speaking style and the noise spectral tilt on the Lombard reflex and automatic speech recognition, ICSLP
Stefano Crafa, Luciano Fissore, Claudio Vair (1998), Data-driven PMC and Bayesian learning integration for fast model adaptation in noisy conditions, ICSLP
Martin Hunke, Meeran Hyun, Steve Love, Thomas Holton (1998), Improving the noise and spectral robustness of an isolated-word recognizer using an auditory-model front end, ICSLP
Owen P. Kenny, Douglas J. Nelson (1998), A model for speech reverberation and intelligibility restoring filters, ICSLP
Guojun Zhou, John H. L. Hansen, James F. Kaiser (1998), Linear and nonlinear speech feature analysis for stress classification, ICSLP
Sahar E. Bou-Ghazale, John H. L. Hansen (1998), Speech feature modeling for robust stressed speech recognition, ICSLP
Katrin Kirchhoff (1998), Combining articulatory and acoustic information for speech recognition in noisy and reverberant environments, ICSLP
Timothy Wark, Sridha Sridharan (1998), Improving speaker identification performance in reverberant conditions using lip information, ICSLP
Masato Akagi, Mamoru Iwaki, Noriyoshi Sakaguchi (1998), Spectral sequence compensation based on continuity of spectral sequence, ICSLP
Aruna Bayya, B. Yegnanarayana (1998), Robust features for speech recognition systems, ICSLP
Frédéric Berthommier, Hervé Glotin, Emmanuel Tessier, Hervé Bourlard (1998), Interfacing of CASA and partial recognition based on a multistream technique, ICSLP
Sen-Chia Chang, Shih-Chieh Chien, Chih-Chung Kuo (1998), An RNN-based compensation method for Mandarin telephone speech recognition, ICSLP
Stephen M. Chu, Yunxin Zhao (1998), Robust speech recognition using discriminative stream weighting and parameter interpolation, ICSLP
Johan de Veth, Bert Cranen, Louis Boves (1998), Acoustic backing-off in the local distance computation for robust automatic speech recognition, ICSLP
Laura Docio-Fernández, Carmen García-Mateo (1998), Noise model selection for robust speech recognition, ICSLP
Simon Doclo, Ioannis Dologlou, Marc Moonen (1998), A novel iterative signal enhancement algorithm for noise reduction in speech, ICSLP
Stéphane Dupont (1998), Missing data reconstruction for robust automatic speech recognition in the framework of hybrid HMM/ANN systems, ICSLP
Ascensión Gallardo-Antolin, Fernando Diaz-de-Maria, Francisco J. Valverde-Albacete (1998), Recognition from GSM digital speech, ICSLP
Petra Geutner, Matthias Denecke, Uwe Meier, Martin Westphal, Alex Waibel (1998), Conversational speech systems for on-board car navigation and assistance, ICSLP
Laurent Girin, Laurent Varin, Gang Feng, Jean-Luc Schwartz (1998), A signal processing system for having the sound "pop-out" in noise thanks to the image of the speaker's lips: new advances using multi-layer perceptrons, ICSLP
Ruhi Sarikaya, John H. L. Hansen (1998), Robust speech activity detection in the presence of noise, ICSLP
Michel Héon, Hesham Tolba, Douglas O'Shaughnessy (1998), Robust automatic speech recognition by the application of a temporal-correlation-based recurrent multilayer neural network to the mel-based cepstral coefficients, ICSLP
Juan M. Huerta, Richard M. Stern (1998), Speech recognition from GSM codec parameters, ICSLP
Jeih-Weih Hung, Jia-Lin Shen, Lin-Shan Lee (1998), Improved parallel model combination based on better domain transformation for speech recognition under noisy environments, ICSLP
Lamia Karray, Jean Monne (1998), Robust speech/non-speech detection in adverse conditions based on noise and speech statistics, ICSLP
Myung Gyu Song, Hoi In Jung, Kab-Jong Shim, Hyung Soon Kim (1998), Speech recognition in car noise environments using multiple models according to noise masking levels, ICSLP
Klaus Linhard, Tim Haulick (1998), Spectral noise subtraction with recursive gain curves, ICSLP
Shengxi Pan, Jia Liu, Jintao Jiang, Zuoying Wang, Dajin Lu (1998), A novel robust speech recognition algorithm based on multi-models and integrated decision method, ICSLP
Dusan Macho, Climent Nadeu (1998), On the interaction between time and frequency filtering of speech parameters for robust speech recognition, ICSLP
Bhiksha Raj, Rita Singh, Richard M. Stern (1998), Inference of missing spectrographic features for robust speech recognition, ICSLP
Volker Schless, Fritz Class (1998), SNR-dependent flooring and noise overestimation for joint application of spectral subtraction and model combination, ICSLP
Jia-Lin Shen, Jeih-Weih Hung, Lin-Shan Lee (1998), Improved robust speech recognition considering signal correlation approximated by Taylor series, ICSLP
Won-Ho Shin, Weon-Goo Kim, Chungyong Lee, Il-Whan Cha (1998), Speech recognition in noisy environment using weighted projection-based likelihood measure, ICSLP
Tetsuya Takiguchi, Satoshi Nakamura, Kiyohiro Shikano, Masatoshi Morishima, Toshihiro Isobe (1998), Evaluation of model adaptation by HMM decomposition on telephone speech recognition, ICSLP
Hesham Tolba, Douglas O'Shaughnessy (1998), Comparative experiments to evaluate a voiced-unvoiced-based pre-processing approach to robust automatic speech recognition in low-SNR environments, ICSLP
Masashi Unoki, Masato Akagi (1998), Signal extraction from noisy signal based on auditory scene analysis, ICSLP
Tsuyoshi Usagawa, Kenji Sakai, Masanao Ebata (1998), Frequency domain binaural model as the front end of speech recognition system, ICSLP
An-Tzyh Yu, Hsiao-Chuan Wang (1998), A study on the recognition of low bit-rate encoded speech, ICSLP
Tai-Hwei Hwang, Hsiao-Chuan Wang (1998), Weighted parallel model combination for noisy speech recognition, ICSLP
Daniel Woo (1998), Favourable and unfavourable short duration segments of speech in noise, ICSLP
Piero Cosi, Stefano Pasquin, Enrico Zovato (1998), Auditory modeling techniques for robust pitch extraction and noise reduction, ICSLP
Eliathamby Ambikairajah, Graham Tattersall, Andrew Davis (1998), Wavelet transform-based speech enhancement, ICSLP
Beth Logan, Tony Robinson (1998), A practical perceptual frequency autoregressive HMM enhancement system, ICSLP
John H. L. Hansen, Bryan L. Pellom (1998), An effective quality evaluation protocol for speech enhancement algorithms, ICSLP
Jin-Nam Park, Tsuyoshi Usagawa, Masanao Ebata (1998), An adaptive beamforming microphone array system using a blind deconvolution, ICSLP
Latchman Singh, Sridha Sridharan (1998), Speech enhancement using critical band spectral subtraction, ICSLP
Pierre Badin, Gérard Bailly, Monica Raybaudi, Christoph Segebarth (1998), A three-dimensional linear articulatory model based on MRI data, ICSLP
Pascal Perrier, Yohan Payan, Joseph Perkell, Frédéric Jolly, Majid Zandipour, Melanie Matthies (1998), On loops and articulatory biomechanics, ICSLP
Didier Demolin, Véronique Lecuit, Thierry Metens, Bruno Nazarian, Alain Soquet (1998), Magnetic resonance measurements of the velum port opening, ICSLP
Masafumi Matsumura, Takuya Niikawa, Takao Tanabe, Takashi Tachimura, Takeshi Wada (1998), Cantilever-type force-sensor-mounted palatal plate for measuring palatolingual contact stress and pattern during speech phonation, ICSLP
Tokihiko Kaburagi, Masaaki Honda (1998), Determination of the vocal tract spectrum from the articulatory movements based on the search of an articulatory-acoustic database, ICSLP
Kiyoshi Honda, Mark K. Tiede (1998), An MRI study on the relationship between oral cavity shape and larynx position, ICSLP
Frantz Clermont, Parham Mokhtari (1998), Acoustic-articulatory evaluation of the upper vowel-formant region and its presumed speaker-specific potency, ICSLP
Philip Hoole, Christian Kroos (1998), Control of larynx height in vowel production, ICSLP
Paavo Alku, Juha Vintturi, Erkki Vilkman (1998), Analyzing the effect of secondary excitations of the vocal tract on vocal intensity in different loudness conditions, ICSLP
Gordon Ramsay (1998), An analysis of modal coupling effects during the glottal cycle: formant synthesizers from time-domain finite-difference simulations, ICSLP
John H. Esling (1998), Laryngoscopic analysis of pharyngeal articulations and larynx-height voice quality settings, ICSLP
Hiroki Matsuzaki, Kunitoshi Motoki, Nobuhiro Miki (1998), Effects of shapes of radiational aperture on radiation characteristics, ICSLP
Jonathan Harrington, Mary E. Beckman, Janet Fletcher, Sallyanne Palethorpe (1998), An electropalatographic, kinematic, and acoustic analysis of supralaryngeal correlates of word-level prominence contrasts in English, ICSLP
Marija Tabain (1998), Consistencies and inconsistencies between EPG and locus equation data on coarticulation, ICSLP
Gérard Bailly, Pierre Badin, Anne Vilain (1998), Synergy between jaw and lips/tongue movements: consequences in articulatory modelling, ICSLP
Philip Hoole (1998), Modelling tongue configuration in German vowel production, ICSLP
Alan A. Wrench, Alan D. McIntosh, Colin Watson, William J. Hardcastle (1998), Optopalatograph: real-time feedback of tongue movement in 3D, ICSLP
Yohann Meynadier, Michel Pitermann, Alain Marchal (1998), Effects of contrastive focal accent on linguopalatal articulation and coarticulation in the French [kskl] cluster, ICSLP
Christine Kitamura, Denis Burnham (1998), Acoustic and affective qualities of IDS in English, ICSLP
Chayada Thanavisuth, Sudaporn Luksaneeyanawin (1998), Acoustic qualities of IDS and ADS in Thai, ICSLP
Sudaporn Luksaneeyanawin, Chayada Thanavisuth, Suthasinee Sittigasorn, Onwadee Rukkarangsarit (1998), Pragmatic characteristics of infant directed speech, ICSLP
Denis Burnham, Elizabeth Francis, Ute Vollmer-Conna, Christine Kitamura, Vicky Averkiou, Amanda Olley, Mary Nguyen, Cal Paterson (1998), Are you my little pussy-cat? acoustic, phonetic and affective qualities of infant- and pet-directed speech, ICSLP
Denis Burnham (1998), Special speech registers: talking to Australian and Thai infants, and to pets, ICSLP
Takashi Masuko, Keiichi Tokuda, Takao Kobayashi (1998), A very low bit rate speech coder using HMM with speaker adaptation, ICSLP
E. Ekudden, R. Hagen, B. Johansson, S. Hayashi, A. Kataoka, S. Kurihara (1998), ITU-T G.729 extension at 6.4 kbps, ICSLP
Damith J. Mudugamuwa, Alan B. Bradley (1998), Adaptive transformation for segmented parametric speech coding, ICSLP
Julien Epps, W. Harvey Holmes (1998), Speech enhancement using STC-based bandwidth extension, ICSLP
Weihua Zhang, W. Harvey Holmes (1998), Performance and optimization of the SEEVOC algorithm, ICSLP
Wendy J. Holmes (1998), Towards a unified model for low bit-rate speech coding using a recognition-synthesis approach, ICSLP
Jan Skoglund, W. Bastiaan Kleijn (1998), On the significance of temporal masking in speech coding, ICSLP
W. Bastiaan Kleijn, Huimin Yang, Ed F. Deprettere (1998), Waveform interpolation coding with pitch-spaced subbands, ICSLP
Nicola R. Chong, Ian S. Burnett, Joe F. Chicharo (1998), An improved decomposition method for WI using IIR wavelet filter banks, ICSLP
Paavo Alku, Susanna Varho (1998), A new linear predictive method for compression of speech signals, ICSLP
Shahrokh Ghaemmaghami, Mohamed Deriche, Sridha Sridharan (1998), Hierarchical temporal decomposition: a novel approach to efficient compression of spectral characteristics of speech, ICSLP
Susan L. Hura (1998), Speech intelligibility testing for new technologies, ICSLP
Sung Joo Kim, Sangho Lee, Woo Jin Han, Yung Hwan Oh (1998), Efficient quantization of LSF parameters based on temporal decomposition, ICSLP
Minoru Kohata (1998), A sinusoidal harmonic vocoder at 1.2 kbps using auditory perceptual characteristics, ICSLP
Kazuhito Koishida, Gou Hirabayashi, Keiichi Tokuda, Takao Kobayashi (1998), A 16 kbit/s wideband CELP coder using mel-generalized cepstral analysis and its subjective evaluation, ICSLP
D. J. Molyneux, C. I. Parris, X. Q. Sun, B. M. G. Cheetham (1998), Comparison of spectral estimation techniques for low bit-rate speech coding, ICSLP
Yoshihisa Nakatoh, Takeshi Norimatsu, Ah Heng Low, Hiroshi Matsumoto (1998), Low bit rate coding for speech and audio using mel linear predictive coding (MLPC) analysis, ICSLP
Jeng-Shyang Pan, Chin-Shiuh Shieh, Shu-Chuan Chu (1998), Comparison study on VQ codevector index assignment, ICSLP
John J. Parry, Ian S. Burnett, Joe F. Chicharo (1998), Using linguistic knowledge to improve the design of low-bit rate LSF quantisation, ICSLP
Davor Petrinovic (1998), Transform coding of LSF parameters using wavelets, ICSLP
F. Plante, B. M. G. Cheetham, D. Marston, P. A. Barrett (1998), Source controlled variable bit-rate speech coder based on waveform interpolation, ICSLP
Carlos M. Ribeiro, Isabel M. Trancoso (1998), Improving speaker recognisability in phonetic vocoders, ICSLP
Visarut Ahkuputra, Somchai Jitapunkul, Nutthacha Jittiwarangkul, Ekkarit Maneenoi, Sawit Kasuriya (1998), A comparison of Thai speech recognition systems using hidden Markov model, neural network, and fuzzy-neural network, ICSLP
Felix Freitag, Enric Monte (1998), Phoneme recognition with statistical modeling of the prediction error of neural networks, ICSLP
Toshiaki Fukada, Takayoshi Yoshimura, Yoshinori Sagisaka (1998), Neural network based pronunciation modeling with applications to speech recognition, ICSLP
Stephen J. Haskey, Sekharajit Datta (1998), A comparative study of OCON and MLP architectures for phoneme recognition, ICSLP
John-Paul Hosom, Ronald A. Cole, Piero Cosi (1998), Evaluation and integration of neural-network training techniques for continuous digit recognition, ICSLP
Ying Jia, Limin Du, Ziqiang Hou (1998), Hierarchical neural networks (HNN) for Chinese continuous speech recognition, ICSLP
Eric Keller (1998), Neural network motivation for segmental distribution, ICSLP
Nikki Mirghafori, Nelson Morgan (1998), Combining connectionist multi-band and full-band probability streams for speech recognition of natural numbers, ICSLP
Ednaldo B. Pizzolato, T. Jeff Reynolds (1998), Initial speech recognition results using the multinet architecture, ICSLP
Tomio Takara, Yasushi Iha, Itaru Nagayama (1998), Selection of the optimal structure of the continuous HMM using the genetic algorithm, ICSLP
Dat Tran, Michael Wagner, Tu Van Le (1998), A proposed decision rule for speaker recognition based on fuzzy c-means clustering, ICSLP
Dat Tran, Tu Van Le, Michael Wagner (1998), Fuzzy Gaussian mixture models for speaker recognition, ICSLP
Chai Wutiwiwatchai, Somchai Jitapunkul, Visarut Ahkuputra, Ekkarit Maneenoi, Sudaporn Luksaneeyanawin (1998), A new strategy of fuzzy-neural network for Thai numeral speech recognition, ICSLP
Chai Wutiwiwatchai, Somchai Jitapunkul, Visarut Ahkuputra, Ekkarit Maneenoi, Sudaporn Luksaneeyanawin (1998), Thai polysyllabic word recognition using fuzzy-neural network, ICSLP
Axel Glaeser (1998), Modular neural networks for low-complex phoneme recognition, ICSLP
Joao F. G. de Freitas, Sue E. Johnson, Mahesan Niranjan, Andrew H. Gee (1998), Global optimisation of neural network models via sequential sampling-importance resampling, ICSLP
Jörg Rottland, Andre Ludecke, Gerhard Rigoll (1998), Efficient computation of MMI neural networks for large vocabulary speech recognition systems, ICSLP
Sid-Ahmed Selouani, Jean Caelen (1998), Modular connectionist systems for identifying complex Arabic phonetic features, ICSLP
Tuan Pham, Michael Wagner (1998), Fuzzy-integration based normalization for speaker verification, ICSLP
Hiroshi Shimodaira, Jun Rokui, Mitsuru Nakai (1998), Improving the generalization performance of the MCE/GPD learning, ICSLP
Tetsuro Kitazoe, Tomoyuki Ichiki, Sung-Ill Kim (1998), Acoustic speech recognition model by neural net equation with competition and cooperation, ICSLP
Julie Ngan, Aravind Ganapathiraju, Joseph Picone (1998), Improved surname pronunciations using decision trees, ICSLP
M. Carmen Benitez, Antonio Rubio, Pedro García, Jesus Diaz-Verdejo (1998), Word verification using confidence measures in speech recognition, ICSLP
Giulia Bernardis, Hervé Bourlard (1998), Improving posterior based confidence measures in hybrid HMM/ANN speech recognition systems, ICSLP
Javier Caminero, Eduardo López, Luis A. Hernández (1998), Two-pass utterance verification algorithm for long natural numbers recognition, ICSLP
Berlin Chen, Hsin-Min Wang, Lee-Feng Chien, Lin-Shan Lee (1998), A*-admissible key-phrase spotting with sub-syllable level utterance verification, ICSLP
Volker Fischer, Yuqing Gao, Eric Janke (1998), Speaker-independent upfront dialect adaptation in a large vocabulary continuous speech recognizer, ICSLP
Asela Gunawardana, Hsiao-Wuen Hon, Li Jiang (1998), Word-based acoustic confidence measures for large-vocabulary speech recognition, ICSLP
Sunil K. Gupta, Frank K. Soong (1998), Improved utterance rejection using length dependent thresholds, ICSLP
Ching Hsiang Ho, Saeed Vaseghi, Aimin Chen (1998), Bayesian constrained frequency warping HMMs for speaker normalisation, ICSLP
Masaki Ida, Ryuji Yamasaki (1998), An evaluation of keyword spotting performance utilizing false alarm rejection based on prosodic information, ICSLP
Dieu Tran, Ken-ichi Iso (1998), Predictive speaker adaptation and its prior training, ICSLP
Rachida El Méliani, Douglas O'Shaughnessy (1998), Powerful syllabic fillers for general-task keyword-spotting and unlimited-vocabulary continuous-speech recognition, ICSLP
Christine Pao, Philipp Schmid, James R. Glass (1998), Confidence scoring for speech understanding systems, ICSLP
Bhuvana Ramabhadran, Abraham Ittycheriah (1998), Phonological rules for enhancing acoustic enrollment of unknown words, ICSLP
Anand R. Setlur, Rafid A. Sukkar (1998), Recognition-based word counting for reliable barge-in and early endpoint detection in continuous speech recognition, ICSLP
Martin Westphal, Tanja Schultz, Alex Waibel (1998), Linear discriminant - a new criterion for speaker normalization, ICSLP
Gethin Williams, Steve Renals (1998), Confidence measures derived from an acceptor HMM, ICSLP
Chung-Hsien Wu, Yeou-Jiunn Chen, Yu-Chun Hung (1998), Telephone speech multi-keyword spotting using fuzzy search algorithm and prosodic verification, ICSLP
Yoichi Yamashita, Toshikatsu Tsunekawa, Riichiro Mizoguchi (1998), Topic recognition for news speech based on keyword spotting, ICSLP
Sieb G. Nooteboom, Meinou van Dijk (1998), Heads and tails in word perception: evidence for 'early-to-late' processing in listening and reading, ICSLP
Saskia te Riele, Hugo Quené (1998), Evidence for early effects of sentence context on word segmentation, ICSLP
Hugo Quené, Maya van Rossum, Mieke van Wijck (1998), Assimilation and anticipation in word perception, ICSLP
M. Louise Kelly, Ellen Gurman Bard, Catherine Sotillo (1998), Lexical activation by assimilated and reduced tokens, ICSLP
Masato Akagi, Mamoru Iwaki, Tomoya Minakawa (1998), Fundamental frequency fluctuation in continuous vowel utterance and its perception, ICSLP
Shigeaki Amano, Tadahisa Kondo (1998), Estimation of mental lexicon size with word familiarity database, ICSLP
Matthew Aylett, Alice Turk (1998), Vowel quality in spontaneous speech: what makes a good vowel?, ICSLP
Adrian Neagu, Gérard Bailly (1998), Cooperation and competition of burst and formant transitions for the perception and identification of French stops, ICSLP
Anne Bonneau, Yves Laprie (1998), The effect of modifying formant amplitudes on the perception of French vowels generated by copy synthesis, ICSLP
Hsuan-Chih Chen, Michael C. W. Yip, Sum-Yin Wong (1998), Segmental and tonal processing in Cantonese, ICSLP
Michael C. W. Yip, Po-Yee Leung, Hsuan-Chih Chen (1998), Phonological similarity effects in Cantonese spoken-word processing, ICSLP
Bob I. Damper, Steve R. Gunn (1998), On the learnability of the voicing contrast for initial stops, ICSLP
Loredana Cerrato, Mauro Falcone (1998), Acoustic and perceptual characteristic of Italian stop consonants, ICSLP
Santiago Fernández, Sergio Feijóo, Ramon Balsa, Nieves Barros (1998), Acoustic cues for the auditory identification of the Spanish fricative /f/, ICSLP
Santiago Fernández, Sergio Feijóo, Ramon Balsa, Nieves Barros (1998), Recognition of vowels in fricative context, ICSLP
Santiago Fernández, Sergio Feijóo, Plinio Almeida (1998), Voicing affects perceived manner of articulation, ICSLP
Valerie Hazan, Andrew Simpson, Mark Huckvale (1998), Enhancement techniques to improve the intelligibility of consonants in noise: speaker and listener effects, ICSLP
Fran H. L. Jian (1998), Boundaries of perception of long tones in Taiwanese speech, ICSLP
Hiroaki Kato, Minoru Tsuzaki, Yoshinori Sagisaka (1998), Effects of phonetic quality and duration on perceptual acceptability of temporal changes in speech, ICSLP
Michael Kiefte, Terrance M. Nearey (1998), Dynamic vs. static spectral detail in the perception of gated stops, ICSLP
Takashi Otake, Kiyoko Yoneyama (1998), Phonological units in speech segmentation and phonological awareness, ICSLP
Elizabeth Shriberg, Andreas Stolcke (1998), How far do speakers back up in repairs? a quantitative model, ICSLP
Karsten Steinhauer, Kai Alter, Angela D. Friederici (1998), Don't blame it (all) on the pause: further ERP evidence for a prosody-induced garden-path in running speech, ICSLP
Jean Vroomen, Beatrice de Gelder (1998), The role of stress for lexical selection in Dutch, ICSLP
Jyrki Tuomainen, Jean Vroomen, Beatrice de Gelder (1998), The perception of stressed syllables in Finnish, ICSLP
Kimiko Yamakawa, Ryoji Baba (1998), The perception of the morae with devocalized vowels in Japanese language, ICSLP
Dominic W. Massaro (1998), Categorical perception: important phenomenon or lasting myth?, ICSLP
Ellen Gerrits, Bert Schouten (1998), Categorical perception of vowels, ICSLP
Kazuhiko Kakehi, Yuki Hirose (1998), Suprasegmental cues for the segmentation of identical vowel sequences in Japanese, ICSLP
William A. Ainsworth (1998), Perception of concurrent approximant-vowel syllables, ICSLP
Dawn M. Behne, Peter E. Czigler, Kirk P. H. Sullivan (1998), Perceived Swedish vowel quantity: effects of postvocalic consonant duration, ICSLP
Anne Cutler, Rebecca Treiman, Brit van Ooijen (1998), Orthografik inkoncistensy ephekts in foneme detektion?, ICSLP
Bruce L. Derwing, Terrance M. Nearey, Yeo Bom Yoon (1998), The effect of orthographic knowledge on the segmentation of speech, ICSLP
James M. McQueen, Anne Cutler (1998), Spotting (different types of) words in (different types of) context, ICSLP
Manjari Ohala, John J. Ohala (1998), Correlation between consonantal VC transitions and degree of perceptual confusion of place contrast in Hindi, ICSLP
David House, Dik Hermes, Frédéric Beaugendre (1998), Perception of tonal rises and falls for accentuation and phrasing in Swedish, ICSLP
Steven Greenberg, Takayuki Arai, Rosaria Silipo (1998), Speech intelligibility derived from exceedingly sparse spectral information, ICSLP
Mark C. Flynn, Richard C. Dowell, Graeme M. Clark (1998), Adults with a severe-to-profound hearing impairment. investigating the effects of linguistic context on speech perception, ICSLP
Florien J. Koopmans-van Beinum, Caroline E. Schwippert, Cecile T. L. Kuijpers (1998), Speech perception in dyslexia: measurements from birth onwards, ICSLP
Karen Croot (1998), An acoustic analysis of vowel production across tasks in a case of non-fluent progressive aphasia, ICSLP
Jan van Doorn, Sharynne McLeod, Elise Baker, Alison Purcell, William Thorpe (1998), Speech technology in clinical environments, ICSLP
Stephanie Seneff, Ed Hurley, Raymond Lau, Christine Pao, Philipp Schmid, Victor Zue (1998), GALAXY-II: a reference architecture for conversational system development, ICSLP
Grace Chung, Stephanie Seneff (1998), Improvements in speech understanding accuracy through the integration of hierarchical linguistic, prosodic, and phonological constraints in the Jupiter domain, ICSLP
Kenney Ng (1998), Towards robust methods for spoken document retrieval, ICSLP
Richard Sproat, Jan P. H. van Santen (1998), Automatic ambiguity detection, ICSLP
Julia Fischer, Juergen Haas, Elmar Nöth, Heinrich Niemann, Frank Deinzer (1998), Empowering knowledge based speech understanding through statistics, ICSLP
Akito Nagai, Yasushi Ishikawa (1998), Concept-driven speech understanding incorporated with a statistic language model, ICSLP
José Colás, Javier Ferreiros, Juan Manuel Montero, Julio Pastor, Ascensión Gallardo, José Manuel Pardo (1998), On the limitations of stochastic conceptual finite-state language models for speech understanding, ICSLP
Todd Ward, Salim Roukos, Chalapathy Neti, Jerome Gros, Mark Epstein, Satya Dharanipragada (1998), Towards speech understanding across multiple languages, ICSLP
Andreas Stolcke, Elizabeth Shriberg, Rebecca Bates, Mari Ostendorf, Dilek Hakkani, Madelaine Plauche, Gokhan Tur, Yu Lu (1998), Automatic detection of sentence boundaries and disfluencies based on recognized words, ICSLP
Wolfgang Reichl, Bob Carpenter, Jennifer Chu-Carroll, Wu Chou (1998), Language modeling for content extraction in human-computer dialogues, ICSLP
John Gillett, Wayne Ward (1998), A language model combining trigrams and stochastic context-free grammars, ICSLP
Bernd Souvignier, Andreas Kellner (1998), Online adaptation of language models in spoken dialogue systems, ICSLP
Giuseppe Riccardi, Alexandros Potamianos, Shrikanth Narayanan (1998), Language model adaptation for spoken language systems, ICSLP
Brigitte Bigi, Renato De Mori, Marc El-Beze, Thierry Spriet (1998), Detecting topic shifts using a cache memory, ICSLP
Lori Levin, Ann Thyme-Gobbel, Alon Lavie, Klaus Ries, Klaus Zechner (1998), A discourse coding scheme for conversational Spanish, ICSLP
Kazuhiro Arai, Jeremy H. Wright, Giuseppe Riccardi, Allen L. Gorin (1998), Grammar fragment acquisition using syntactic and semantic clustering, ICSLP
Tom Brøndsted (1998), Non-expert access to unification based speech understanding, ICSLP
Bob Carpenter, Jennifer Chu-Carroll (1998), Natural language call routing: a robust, self-organizing approach, ICSLP
Debajit Ghosh, David Goddeau (1998), Automatic grammar induction from semantic parsing, ICSLP
Yasuyuki Kono, Takehide Yano, Munehiko Sasajima (1998), BTH: an efficient parsing algorithm for word-spotting, ICSLP
Susanne Kronenberg, Franz Kummert (1998), Syntax coordination: interaction of discourse and extrapositions, ICSLP
Bor-Shen Lin, Berlin Chen, Hsin-Min Wang, Lin-Shan Lee (1998), Hierarchical tag-graph search for spontaneous speech understanding in spoken dialog systems, ICSLP
Yasuhisa Niimi, Noboru Takinaga, Takuya Nishimoto (1998), Extraction of the dialog act and the topic from utterances in a spoken dialog system, ICSLP
Harry Printz (1998), Fast computation of maximum entropy / minimum divergence feature gain, ICSLP
Giuseppe Riccardi, Allen L. Gorin (1998), Stochastic language models for speech recognition and understanding, ICSLP
Carol Van Ess-Dykema, Klaus Ries (1998), Linguistically engineered tools for speech recognition error analysis, ICSLP
Kazuya Takeda, Atsunori Ogawa, Fumitada Itakura (1998), Estimating entropy of a language from optimal word insertion penalty, ICSLP
Shu-Chuan Tseng (1998), A linguistic analysis of repair signals in co-operative spoken dialogues, ICSLP
Francisco J. Valverde-Albacete, José Manuel Pardo (1998), A hierarchical language model for CSR, ICSLP
Jeremy H. Wright, Allen L. Gorin, Alicia Abella (1998), Spoken language understanding within dialogs using a graphical model of task structure, ICSLP
Yoshimi Suzuki, Fumiyo Fukumoto, Yoshihiro Sekiguchi (1998), Keyword extraction of radio news using domain identification based on categories of an encyclopedia, ICSLP
James Droppo, Alex Acero (1998), Maximum a posteriori pitch tracking, ICSLP
Dekun Yang, Georg F. Meyer, William A. Ainsworth (1998), Vowel separation using the reassigned amplitude-modulation spectrum, ICSLP
Eloi Batlle, Climent Nadeu, José A.R. Fonollosa (1998), Feature decorrelation methods in speech recognition. a comparative study, ICSLP
Marie-José Caraty, Claude Montacié (1998), Multi-resolution for speech analysis, ICSLP
Steve Cassidy, Catherine Watson (1998), Dynamic features in children's vowels, ICSLP
Johan de Veth, Louis Boves (1998), Effectiveness of phase-corrected RASTA for continuous speech recognition, ICSLP
Satya Dharanipragada, Ramesh A. Gopinath, Bhaskar D. Rao (1998), Techniques for capturing temporal variations in speech signals with fixed-rate processing, ICSLP
Limin Du, Kenneth N. Stevens (1998), Automatic detection of landmark for nasal consonants from speech waveform, ICSLP
Thierry Dutoit, Juergen Schroeter (1998), Plug and play software for designing high-level speech processing systems, ICSLP
Alexandre Girardi, Kiyohiro Shikano, Satoshi Nakamura (1998), Creating speaker independent HMM models for restricted database using STRAIGHT-TEMPO morphing, ICSLP
Laure Charonnat, Michel Guitton, Joel Crestel, Gerome Allée (1998), Restoration of hyperbaric speech by correction of the formants and the pitch, ICSLP
Juana M. Gutierrez-Arriola, Yung-Sheng Hsiao, Juan Manuel Montero, José Manuel Pardo, Donald G. Childers (1998), Voice conversion based on parameter transformation, ICSLP
Jilei Tian, Ramalingam Hariharan, Kari Laurila (1998), Noise robust two-stream auditory feature extraction method for speech recognition, ICSLP
Andrew K. Halberstadt, James R. Glass (1998), Heterogeneous measurements and multiple classifiers for speech recognition, ICSLP
Naomi Harte, Saeed Vaseghi, Ben Milner (1998), Joint recognition and segmentation using phonetically derived features and a hybrid phoneme model, ICSLP
Hynek Hermansky, Sangita Sharma (1998), TRAPS - classifiers of temporal patterns, ICSLP
John N. Holmes (1998), Robust measurement of fundamental frequency and degree of voicing, ICSLP
John F. Holzrichter, Gregory C. Burnett, Todd J. Gable, Lawrence C. Ng (1998), Micropower electro-magnetic sensors for speech characterization, recognition, verification, and other applications, ICSLP
Jia-Lin Shen, Jeih-Weih Hung, Lin-Shan Lee (1998), Robust entropy-based endpoint detection for speech recognition in noisy environments, ICSLP
Jia-Lin Shen, Wen-Liang Hwang (1998), Statistical integration of temporal filter banks for robust speech recognition using linear discriminant analysis (LDA), ICSLP
Dorota J. Iskra, William H. Edmondson (1998), Feature-based approach to speech recognition, ICSLP
Hiroyuki Kamata, Akira Kaneko, Yoshihisa Ishida (1998), Periodicity emphasis of voice wave using nonlinear IIR digital filters and its applications, ICSLP
Simon King, Todd Stephenson, Stephen Isard, Paul Taylor, Alex Strachan (1998), Speech recognition via phonetically featured syllables, ICSLP
Jacques Koreman, Bistra Andreeva, William J. Barry (1998), Do phonetic features help to improve consonant identification in ASR?, ICSLP
Hisao Kuwabara (1998), Perceptual and acoustic properties of phonemes in continuous speech for different speaking rate, ICSLP
Joohun Lee, Ki Yong Lee (1998), On robust sequential estimator based on t-distribution with forgetting factor for speech analysis, ICSLP
Christopher John Long, Sekharajit Datta (1998), Discriminant wavelet basis construction for speech recognition, ICSLP
Hiroshi Matsumoto, Yoshihisa Nakatoh, Yoshinori Furuhata (1998), An efficient mel-LPC analysis method for speech recognition, ICSLP
Philip McMahon, Paul McCourt, Saeed Vaseghi (1998), Discriminative weighting of multi-resolution sub-band cepstral features for speech recognition, ICSLP
Yoram Meron, Keikichi Hirose (1998), Separation of singing and piano sounds, ICSLP
Nobuaki Minematsu, Seiichi Nakagawa (1998), Modeling of variations in cepstral coefficients caused by F0 changes and its application to speech processing, ICSLP
Partha Niyogi, Partha Mitra, Man Mohan Sondhi (1998), A detection framework for locating phonetic events, ICSLP
Climent Nadeu, Felix Galindo, Jaume Padrell (1998), On frequency averaging for spectral analysis in speech recognition, ICSLP
Munehiro Namba, Yoshihisa Ishida (1998), Wavelet transform domain blind equalization and its application to speech analysis, ICSLP
Steve Pearson (1998), A novel method of formant analysis and glottal inverse filtering, ICSLP
Antonio J. Araujo, Vitor C. Pera, Marcio N. Souza (1998), Vector quantizer acceleration for an automatic speech recognition application, ICSLP
Hartmut R. Pfitzinger (1998), Local speech rate as a combination of syllable and phone rate, ICSLP
Solange Rossato, Gang Feng, Rafael Laboissiere (1998), Recovering gestures from speech signals: a preliminary study for nasal vowels, ICSLP
Guenther Ruske, Robert Faltlhauser, Thilo Pfau (1998), Extended linear discriminant analysis (ELDA) for speech recognition, ICSLP
Ara Samouelian, Jordi Robert-Ribes, Mike Plumpe (1998), Speech, silence, music and noise classification of TV broadcast material, ICSLP
Jean Schoentgen, Alain Soquet, Véronique Lecuit, Sorin Ciocea (1998), The relation between vocal tract shape and formant frequencies can be described by means of a system of coupled differential equations, ICSLP
Youngjoo Suh, Kyuwoong Hwang, Oh-Wook Kwon, Jun Park (1998), Improving speech recognizer by broader acoustic-phonetic group classification, ICSLP
C. William Thorpe (1998), Separation of speech source and filter by time-domain deconvolution, ICSLP
Hesham Tolba, Douglas O'Shaughnessy (1998), On the application of the AM-FM model for the recovery of missing frequency bands of telephone speech, ICSLP
Chang-Sheng Yang, Hideki Kasuya (1998), Estimation of voice source and vocal tract parameters using combined subspace-based and amplitude spectrum-based algorithm, ICSLP
Fang Zheng, Zhanjiang Song, Ling Li, Wenjian Yu, Fengzhou Zheng, Wenhu Wu (1998), The distance measure for line spectrum pairs applied to speech recognition, ICSLP
William A. Ainsworth, Charles R. Day, Georg F. Meyer (1998), Improving pitch estimation with short duration speech samples, ICSLP
Hideki Kawahara, Alain de Cheveigne, Roy D. Patterson (1998), An instantaneous-frequency-based pitch extraction method for high-quality speech transformation: revised TEMPO in the STRAIGHT-suite, ICSLP
Kiyoaki Aikawa (1998), Speaker-independent speech recognition using micro segment spectrum integration, ICSLP
Keiichi Funaki, Yoshikazu Miyanaga, Koji Tochinai (1998), On robust speech analysis based on time-varying complex AR model, ICSLP
Hynek Hermansky, Narendranath Malayath (1998), Spectral basis functions from discriminant analysis, ICSLP
Shin Suzuki, Takesi Okadome, Masaaki Honda (1998), Determination of articulatory positions from speech acoustics by applying dynamic articulatory constraints, ICSLP
Yang Li, Yunxin Zhao (1998), Recognizing emotions in speech using short-term and long-term features, ICSLP
Arnaud Robert, Jan Eriksson (1998), Periphear: a nonlinear active model of the auditory periphery, ICSLP
Padma Ramesh, Partha Niyogi (1998), The voicing feature for stop consonants: acoustic phonetic analyses and automatic speech recognition experiments, ICSLP
Sankar Basu, Stéphane Maes (1998), Wavelet-based energy binning cepstral features for automatic speech recognition, ICSLP
Carlos Silva, Samir Chennoukh (1998), Articulatory analysis using a codebook for articulatory based low bit-rate speech coding, ICSLP
Chen Fang, Yuan Baozong (1998), The modeling and realization of natural speech generation system, ICSLP
Robert Eklund (1998), Ko tok ples ensin bilong Tok Pisin or the TP-CLE: a first report from a pilot speech-to-speech translation project from Swedish to Tok Pisin, ICSLP
Ismael García-Varea, Francisco Casacuberta, Hermann Ney (1998), An iterative, DP-based search algorithm for statistical machine translation, ICSLP
Barbara Gawronska, David House (1998), Information extraction and text generation of news reports for a Swedish-English bilingual spoken dialogue system, ICSLP
Joris Hulstijn, Arjan van Hessen (1998), Utterance generation for transaction dialogues, ICSLP
Kai Ishikawa, Eiichiro Sumita, Hitoshi Iida (1998), Example-based error recovery method for speech translation: repairing sub-trees according to the semantic distance, ICSLP
Emiel Krahmer, Mariet Theune (1998), Context sensitive generation of descriptions, ICSLP
Lori Levin, Donna Gates, Alon Lavie, Alex Waibel (1998), An interlingua based on domain actions for machine translation of task-oriented dialogues, ICSLP
Sandra Williams (1998), Generating pitch accents in a concept-to-speech system using a knowledge base, ICSLP
Tobias Ruland, C. J. Rupp, Jörg Spilker, Hans Weber, Karsten L. Worm (1998), Making the most of multiplicity: a multi-parser multi-strategy architecture for the robust processing of spoken language, ICSLP
Jon R. W. Yi, James R. Glass (1998), Natural-sounding speech synthesis using variable-length units, ICSLP
Esther Klabbers, Emiel Krahmer, Mariet Theune (1998), A generic algorithm for generating spoken monologues, ICSLP
Janet Hitzeman, Alan W. Black, Paul Taylor, Chris Mellish, Jon Oberlander (1998), On the use of automatically generated discourse-level information in a concept-to-speech synthesis system, ICSLP
Hiyan Alshawi, Srinivas Bangalore, Shona Douglas (1998), Learning phrase-based head transduction models for translation of spoken utterances, ICSLP
Toshiaki Fukada, Detlef Koll, Alex Waibel, Kouichi Tanigaki (1998), Probabilistic dialogue act extraction for concept based multilingual translation systems, ICSLP
Ye-Yi Wang, Alex Waibel (1998), Fast decoding for statistical machine translation, ICSLP
Toshiyuki Takezawa, Tsuyoshi Morimoto, Yoshinori Sagisaka, Nick Campbell, Hitoshi Iida, Fumiaki Sugaya, Akio Yokoo, Seiichi Yamamoto (1998), A Japanese-to-English speech translation system: ATR-MATRIX, ICSLP
Julia Hirschberg, Christine H. Nakatani (1998), Acoustic indicators of topic segmentation, ICSLP
Esther Grabe, Francis Nolan, Kimberley J. Farrar (1998), IViE - a comparative transcription system for intonational variation in English, ICSLP
Fu-Chiang Chou, Chiu-Yu Tseng, Lin-Shan Lee (1998), Automatic segmental and prosodic labeling of Mandarin speech database, ICSLP
Stefan Rapp (1998), Automatic labelling of German prosody, ICSLP
Matti Karjalainen, Toomas Altosaar, Miikka Huttunen (1998), An efficient labeling tool for the Quicksig speech database, ICSLP
Harry Bratt, Leonardo Neumeyer, Elizabeth Shriberg, Horacio Franco (1998), Collection and detailed transcription of a speech database for development of language learning technologies, ICSLP
Neeraj Deshmukh, Aravind Ganapathiraju, Andi Gleeson, Jonathan Hamaker, Joseph Picone (1998), Resegmentation of SWITCHBOARD, ICSLP
Demetrio Aiello, Cristina Delogu, Renato De Mori, Andrea Di Carlo, Marina Nisi, Silvia Tummeacciu (1998), Automatic generation of visual scenarios for spoken corpora acquisition, ICSLP
Mauro Cettolo, Daniele Falavigna (1998), Automatic detection of semantic boundaries based on acoustic and lexical knowledge, ICSLP
Iman Gholampour, Kambiz Nayebi (1998), A new fast algorithm for automatic segmentation of continuous speech, ICSLP
Akemi Iida, Nick Campbell, Soichiro Iga, Fumito Higuchi, Michiaki Yasumura (1998), Acoustic nature and perceptual testing of corpora of emotional speech, ICSLP
Pyungsu Kang, Jiyoung Kang, Jinyoung Kim (1998), Korean prosodic break index labelling by a new mixed method of LDA and VQ, ICSLP
Mark Laws, Richard Kilgour (1998), MOOSE: management of otago speech environment, ICSLP
Fabrice Malfrère, Olivier Deroo, Thierry Dutoit (1998), Phonetic alignment: speech synthesis based vs. hybrid HMM/ANN, ICSLP
J. Bruce Millar (1998), Customisation and quality assessment of spoken language description, ICSLP
Claude Montacié, Marie-José Caraty (1998), A silence/noise/music/speech splitting algorithm, ICSLP
David Pye, Nicholas J. Hollinghurst, Timothy J. Mills, Kenneth R. Wood (1998), Audio-visual segmentation for content-based retrieval, ICSLP
Stefan Rapp, Grzegorz Dogil (1998), Same news is good news: automatically collecting reoccurring radio news stories, ICSLP
Christel Brindöpke, Brigitte Schaffranietz (1998), An annotation system for melodic aspects of German spontaneous speech, ICSLP
Karlheinz Stöber, Wolfgang Hess (1998), Additional use of phoneme duration hypotheses in automatic speech segmentation, ICSLP
Amy Isard, David McKelvie, Henry S. Thompson (1998), Towards a minimal standard for dialogue transcripts: a new SGML architecture for the HCRC Map Task corpus, ICSLP
Pedro J. Moreno, Chris Joerg, Jean-Manuel Van Thong, Oren Glickman (1998), A recursive algorithm for the forced alignment of very long audio segments, ICSLP
Judith M. Kessens, Mirjam Wester, Catia Cucchiarini, Helmer Strik (1998), The selection of pronunciation variants: comparing the performance of man and machine, ICSLP
Jon Barker, Gethin Williams, Steve Renals (1998), Acoustic confidence measures for segmenting broadcast news, ICSLP
Bryan L. Pellom, John H. L. Hansen (1998), A duration-based confidence measure for automatic segmentation of noise corrupted speech, ICSLP
Thomas Hain, Philip C. Woodland (1998), Segmentation and classification of broadcast news audio, ICSLP
Børge Lindberg, Robrecht Comeyne, Christoph Draxler, Francesco Senia (1998), Speaker recruitment methods and speaker coverage - experiences from a large multilingual speech database collection, ICSLP
Estelle Campione, Jean Véronis (1998), A multilingual prosodic database, ICSLP
Ronald A. Cole, Mike Noel, Victoria Noel (1998), The CSLU speaker recognition corpus, ICSLP
Gregory Aist, Peggy Chan, Xuedong Huang, Li Jiang, Rebecca Kennedy, DeWitt Latimer, Jack Mostow, Calvin Yeung (1998), How effective is unsupervised data collection for children's speech recognition?, ICSLP
Jyh-Shing Shyuu, Wang Jhing-Fa (1998), An algorithm for automatic generation of Mandarin phonetic balanced corpus, ICSLP
Steven Bird, Mark Liberman (1998), Towards a formal framework for linguistic annotations, ICSLP
Toomas Altosaar, Martti Vainio (1998), Forming generic models of speech for uniform database access, ICSLP
Gary Cook, Tony Robinson, James Christie (1998), Real-time recognition of broadcast news, ICSLP
Ha-Jin Yu, Hoon Kim, Jae-Seung Choi, Joon-Mo Hong, Kew-Suh Park, Jong-Seok Lee, Hee-Youn Lee (1998), Automatic recognition of Korean broadcast news speech, ICSLP
James R. Glass, Timothy J. Hazen (1998), Telephone-based conversational speech recognition in the JUPITER domain, ICSLP
Hsiao-Wuen Hon, Yun-Cheng Ju, Keiko Otani (1998), Japanese large-vocabulary continuous speech recognition system based on Microsoft Whisper, ICSLP
Jean-Luc Gauvain, Lori F. Lamel, Gilles Adda (1998), Partitioning and transcription of broadcast news data, ICSLP
Hajime Tsukada, Hirofumi Yamamoto, Toshiyuki Takezawa, Yoshinori Sagisaka (1998), Grammatical word graph re-generation for spontaneous speech recognition, ICSLP
Norimichi Yodo, Kiyohiro Shikano, Satoshi Nakamura (1998), Compression algorithm of trigram language models based on maximum likelihood estimation, ICSLP
Ulla Uebler, Heinrich Niemann (1998), Morphological modeling of word classes for language models, ICSLP
Imed Zitouni, Kamel Smaili, Jean-Paul Haton, Sabine Deligne, Frédéric Bimbot (1998), A comparative study between polyclass and multiclass language models, ICSLP
Dietrich Klakow (1998), Log-linear interpolation of language models, ICSLP
Philip Clarkson, Tony Robinson (1998), The applicability of adaptive language modelling for the broadcast news task, ICSLP
Long Nguyen, Richard Schwartz (1998), The BBN single-phonetic-tree fast-match algorithm, ICSLP
Akinobu Lee, Tatsuya Kawahara, Shuji Doshita (1998), An efficient two-pass search algorithm using word trellis index, ICSLP
Mike Schuster (1998), Nozomi - a fast, memory-efficient stack decoder for LVCSR, ICSLP
Thomas Kemp, Alex Waibel (1998), Reducing the OOV rate in broadcast news speech recognition, ICSLP
Michiel Bacchiani, Mari Ostendorf (1998), Using automatically-derived acoustic sub-word units in large vocabulary speech recognition, ICSLP
Don McAllaster, Lawrence Gillick, Francesco Scattone, Michael Newman (1998), Fabricating conversational speech data with acoustic models: a program to examine model-data mismatch, ICSLP
Wu Chou, Wolfgang Reichl (1998), High resolution decision tree based acoustic modeling beyond CART, ICSLP
Thomas Kemp, Alex Waibel (1998), Unsupervised training of a speech recognizer using TV broadcasts, ICSLP
Clark Z. Lee, Douglas O'Shaughnessy (1998), A new method to achieve fast acoustic matching for speech recognition, ICSLP
Jacques Duchateau, Kris Demuynck, Dirk Van Compernolle, Patrick Wambacq (1998), Improved parameter tying for efficient acoustic model evaluation in large vocabulary continuous speech recognition, ICSLP
Ananth Sankar (1998), A new look at HMM parameter tying for large vocabulary speech recognition, ICSLP
Ramesh A. Gopinath, Bhuvana Ramabhadran, Satya Dharanipragada (1998), Factor analysis invariant to linear transformations of data, ICSLP
Akio Ando, Akio Kobayashi, Toru Imai (1998), A thesaurus-based statistical language model for broadcast news transcription, ICSLP
Sreeram V. Balakrishnan (1998), Effect of task complexity on search strategies for the Motorola Lexicus continuous speech recognition system, ICSLP
Dhananjay Bansal, Mosur K. Ravishankar (1998), New features for confidence annotation, ICSLP
Jerome R. Bellegarda (1998), Multi-span statistical language modeling for large vocabulary speech recognition, ICSLP
Rathinavelu Chengalvarayan (1998), Maximum-likelihood updates of HMM duration parameters for discriminative continuous speech recognition, ICSLP
Noah Coccaro, Daniel Jurafsky (1998), Towards better integration of semantic predictors in statistical language modeling, ICSLP
Julio Pastor, José Colás, Ruben San-Segundo, José Manuel Pardo (1998), An asymmetric stochastic language model based on multi-tagged words, ICSLP
Vassilis Digalakis, Leonardo Neumeyer, Manolis Perakakis (1998), Product-code vector quantization of cepstral parameters for speech recognition over the WWW, ICSLP
Bernard Doherty, Saeed Vaseghi, Paul McCourt (1998), Context dependent tree based transforms for phonetic speech recognition, ICSLP
Michael T. Johnson, Mary P. Harper, Leah H. Jamieson (1998), Interfacing acoustic models with natural language processing systems, ICSLP
Photina Jaeyoun Jang, Alexander G. Hauptmann (1998), Hierarchical cluster language modeling with statistical rule extraction for rescoring n-best hypotheses during speech decoding, ICSLP
Atsuhiko Kai, Yoshifumi Hirose, Seiichi Nakagawa (1998), Dealing with out-of-vocabulary words and speech disfluencies in an n-gram based speech understanding system, ICSLP
Tetsunori Kobayashi, Yosuke Wada, Norihiko Kobayashi (1998), Source-extended language model for large vocabulary continuous speech recognition, ICSLP
Akio Kobayashi, Kazuo Onoe, Toru Imai, Akio Ando (1998), Time dependent language model for broadcast news transcription and its post-correction, ICSLP
Jacques Koreman, William J. Barry, Bistra Andreeva (1998), Exploiting transitions and focussing on linguistic properties for ASR, ICSLP
Raymond Lau, Stephanie Seneff (1998), A unified framework for sublexical and linguistic modelling supporting flexible vocabulary speech understanding, ICSLP
Lalit R. Bahl, S. De Gennaro, P. De Souza, E. Epstein, J.M. Le Roux, B. Lewis, C. Waast (1998), A method for modeling liaison in a speech recognition system for French, ICSLP
Fu-Hua Liu, Michael Picheny (1998), On variable sampling frequencies in speech recognition, ICSLP
Kristine Ma, George Zavaliagkos, Rukmini Iyer (1998), Pronunciation modeling for large vocabulary conversational speech recognition, ICSLP
Sankar Basu, Abraham Ittycheriah, Stéphane Maes (1998), Time shift invariant speech recognition, ICSLP
José B. Mariño, Pau Paches-Leal, Albino Nogueiras (1998), The demiphone versus the triphone in a decision-tree state-tying framework, ICSLP
Shinsuke Mori, Masafumi Nishimura, Nobuyasu Itoh (1998), Word clustering for a word bi-gram model, ICSLP
Joao P. Neto, Ciro Martins, Luis B. Almeida (1998), A large vocabulary continuous speech recognition hybrid system for the Portuguese language, ICSLP
Mukund Padmanabhan, Bhuvana Ramabhadran, Sankar Basu (1998), Speech recognition performance on a new voicemail transcription task, ICSLP
Sira Palazuelos, Santiago Aguilera, José Rodrigo, Juan Godino (1998), Grammatical and statistical word prediction system for Spanish integrated in an aid for people with disabilities, ICSLP
Kishore Papineni, Satya Dharanipragada (1998), Segmentation using a maximum entropy approach, ICSLP
Adam Berger, Harry Printz (1998), Recognition performance of a large-scale dependency grammar language model, ICSLP
Ganesh N. Ramaswamy, Harry Printz, Ponani S. Gopalakrishnan (1998), A bootstrap technique for building domain-dependent language models, ICSLP
Joan-Andreu Sanchez, José-Miguel Benedi (1998), Estimation of the probability distributions of stochastic context-free grammars from the k-best derivations, ICSLP
Ananth Sankar (1998), Robust HMM estimation with Gaussian merging-splitting and tied-transform HMMs, ICSLP
Kristie Seymore, Stanley Chen, Ronald Rosenfeld (1998), Nonlinear interpolation of topic models for language model adaptation, ICSLP
Kazuyuki Takagi, Rei Oguro, Kenji Hashimoto, Kazuhiko Ozeki (1998), Performance evaluation of word phrase and noun category language models for broadcast news speech recognition, ICSLP
Hesham Tolba, Douglas O'Shaughnessy (1998), Robust automatic continuous-speech recognition based on a voiced-unvoiced decision, ICSLP
Juan Carlos Torrecilla, Ismael Cortazar, Luis A. Hernández (1998), Double tree beam search using hierarchical subword units, ICSLP
Paul van Mulbregt, Ira Carp, Lawrence Gillick, Steve Lowe, Jon Yamron (1998), Text segmentation and topic tracking on broadcast news via a hidden Markov model approach, ICSLP
Philip O'Neill, Saeed Vaseghi, Bernard Doherty, Wooi Haw Tan, Paul McCourt (1998), Multi-phone strings as subword units for speech recognition, ICSLP
Nanette M. Veilleux, Stefanie Shattuck-Hufnagel (1998), Phonetic modification of the syllable /tu/ in two spontaneous American English dialogues, ICSLP
Fuliang Weng, Andreas Stolcke, Ananth Sankar (1998), Efficient lattice representation and generation, ICSLP
Mirjam Wester, Judith M. Kessens, Helmer Strik (1998), Modeling pronunciation variation for a Dutch CSR: testing three methods, ICSLP
Edward W. D. Whittaker, Philip C. Woodland (1998), Comparison of language modelling techniques for Russian and English, ICSLP
Petra Witschel (1998), Optimized POS-based language models for large vocabulary speech recognition, ICSLP
Mark Wright, Simon Hovell, Simon Ringland (1998), Reducing peak search effort using two-tier pruning, ICSLP
George Zavaliagkos, Man-Hung Siu, Thomas Colthurst, Jayadev Billa (1998), Using untranscribed training data to improve performance, ICSLP
Ea-Ee Jan, Raimo Bakis, Fu-Hua Liu, Michael Picheny (1998), Telephone band LVCSR for hearing-impaired users, ICSLP
Antonio Bonafonte, José B. Mariño (1998), Using x-gram for efficient speech recognition, ICSLP
Tatsuya Kawahara, Tetsunori Kobayashi, Kazuya Takeda, Nobuaki Minematsu, Katsunobu Itou, Mikio Yamamoto, Atsushi Yamada, Takehito Utsuro, Kiyohiro Shikano (1998), Sharable software repository for Japanese large vocabulary continuous speech recognition, ICSLP
Katunobu Itou, Mikio Yamamoto, Kazuya Takeda, Toshiyuki Takezawa, Tatsuo Matsuoka, Tetsunori Kobayashi, Kiyohiro Shikano, Shuichi Itahashi (1998), The design of the newspaper-based Japanese large vocabulary continuous speech recognition corpus, ICSLP
Jun Ogata, Yasuo Ariki (1998), Indexing and classification of TV news articles based on speech dictation using word bigram, ICSLP
Man-Hung Siu, Rukmini Iyer, Herbert Gish, Carl Quillen (1998), Parametric trajectory mixtures for LVCSR, ICSLP
Axel Glaeser, Frédéric Bimbot (1998), Steps toward the integration of speaker recognition in real-world telecom applications, ICSLP
Hyun-Yeol Chung, Cheol-Jun Hwang, Shi-Wook Lee (1998), A bimodal Korean address entry/retrieval system, ICSLP
Cristina Delogu, Andrea Di Carlo, Paolo Rotundi, Danilo Sartori (1998), Usability evaluation of IVR systems with DTMF and ASR, ICSLP
Pascale Fung, Chi Shun Cheung, Kwok Leung Lam, Wai Kat Liu, Yuen Yee Lo (1998), SALSA version 1.0: a speech-based web browser for Hong Kong English, ICSLP
Andrew Pargellis, Qiru Zhou, Antoine Saad, Chin-Hui Lee (1998), A language for creating speech applications, ICSLP
Robert Graham, Chris Carter, Brian Mellor (1998), The use of automatic speech recognition to reduce the interference between concurrent tasks of driving and phoning, ICSLP
Makoto J. Hirayama, Taro Sugahara, Zhiyong Peng, Junichi Yamazaki (1998), Interactive listening to structured speech content on the internet, ICSLP
Cheol-Woo Jo (1998), MSF format for the representation of speech synchronized moving image, ICSLP
Pernilla Qvarfordt, Arne Jonsson (1998), Effects of using speech in timetable information systems for WWW, ICSLP
Thomas Kemp, Petra Geutner, Michael Schmidt, Borislav Tomaz, Manfred Weber, Martin Westphal, Alex Waibel (1998), The Interactive Systems Labs View4You video indexing system, ICSLP
Hyung-Jin Kim, Lee Hetherington (1998), SEMOLE: a robust framework for gathering information from the world wide web, ICSLP
Lau Bakman, Mads Blidegn, Martin Wittrup, Lars Bo Larsen, Thomas B. Moeslund (1998), Enhancing a WIMP based interface with speech, gaze tracking and agents, ICSLP
Christine H. Nakatani, Steve Whittaker, Julia Hirschberg (1998), Now you hear it, now you don't: empirical studies of audio browsing behavior, ICSLP
Rongyu Qiao, Youngkyu Choi, Johnson I. Agbinya (1998), A voice verifier for face/voice based person verification system, ICSLP
Jordi Robert-Ribes (1998), On the use of automatic speech recognition for TV captioning, ICSLP
Ben Serridge (1998), An undergraduate course on speech recognition based on the CSLU toolkit, ICSLP
Ping-Fai Yang, Yannis Stylianou (1998), Real time voice alteration based on linear prediction, ICSLP
Beng Tiong Tan, Yong Gu, Trevor Thomas (1998), Evaluation and implementation of a voice-activated dialing system with utterance verification, ICSLP
Hsin-Min Wang, Bor-Shen Lin, Berlin Chen, Bo-Ren Bai (1998), Towards a Mandarin voice memo system, ICSLP
Tsubasa Shinozaki, Masanobu Abe (1998), Development of CAI system employing synthesized speech responses, ICSLP
Andreas Kellner, Bernhard Rueber, Hauke Schramm (1998), Using combined decisions and confidence measures for name recognition in automatic directory assistance systems, ICSLP
Bruce Buntschuh, Candace A. Kamm, Giuseppe Di Fabbrizio, Alicia Abella, Mehryar Mohri, Shrikanth Narayanan, I. Zeljkovic, R.D. Sharp, Jeremy H. Wright, S. Marcus, J. Shaffer, R. Duncan, J.G. Wilpon (1998), VPQ: a spoken language interface to large scale directory information, ICSLP
John Choi, Don Hindle, Julia Hirschberg, Ivan Magrin-Chagnolleau, Christine H. Nakatani, Fernando Pereira, Amit Singhal, Steve Whittaker (1998), SCAN - speech content based audio navigator: a system overview, ICSLP
Javier Ferreiros, José Colás, Javier Macias-Guarasa, Alejandro Ruiz, José Manuel Pardo (1998), Controlling a HIFI with a continuous speech understanding system, ICSLP
Lori F. Lamel, Samir Bennacef, Jean-Luc Gauvain, Hervé Dartigues, Jean-Noel Temem (1998), User evaluation of the MASK kiosk, ICSLP
Niels Ole Bernsen, Laila Dybkjaer (1998), Is speech the right thing for your application?, ICSLP
Juan Ignacio Godino Llorente, Santiago Aguilera Navarro, Sira Palazuelos Cagigas, Alberto Nieto Altuzarra, Pedro Gomez Vilda (1998), A PC-based tool for helping in diagnosis of pathologic voice, ICSLP
Kaare Sjölander, Jonas Beskow, Joakim Gustafson, Erland Lewin, Rolf Carlson, Björn Granström (1998), Web-based educational tools for speech technology, ICSLP
Stephen Sutton, Ronald A. Cole, Jacques de Villiers, Johan Schalkwyk, Pieter Vermeulen, Michael W. Macon, Yonghong Yan, Edward Kaiser, Brian Rundle, Khaldoun Shobaki, John-Paul Hosom, Alex Kain, Johan Wouters, Dominic W. Massaro, Michael Cohen (1998), Universal speech tools: the CSLU toolkit, ICSLP
Ben Serridge, Alejandro Barbosa, Ronald A. Cole, Nora Munive, Alcira Vargas (1998), Creating a Mexican Spanish version of the CSLU toolkit, ICSLP
Carmen García-Mateo, Qiru Zhou, Chin-Hui Lee, Andrew Pargellis (1998), A voice user interface demonstration system for Mexican Spanish, ICSLP
Yasuyo Minagawa-Kawai, Shigeru Kiritani (1998), Non-native productions of Japanese single stops that are too long for one mora unit, ICSLP
Nobuko Yamada (1998), The process of generation and development of second language Japanese accentuation, ICSLP
Seiya Funatsu, Shigeru Kiritani (1998), Perceptual properties of Russians with Japanese fricatives, ICSLP
Catia Cucchiarini, Febe De Wet, Helmer Strik, Louis Boves (1998), Assessment of Dutch pronunciation by means of automatic speech recognition technology, ICSLP
Philippe Langlais, Anne-Marie Öster, Björn Granström (1998), Phonetic-level mispronunciation detection in non-native Swedish speech, ICSLP
Reiko Akahane-Yamada, Erik McDermott, Takahiro Adachi, Hideki Kawahara, John S. Pruitt (1998), Computer-based second language production training by using spectrographic representation and HMM-based speech recognition scores, ICSLP
Debra M. Hardison (1998), Spoken word identification by native and nonnative speakers of English: effects of training, modality, context and phonetic environment, ICSLP
Michael D. Tyler (1998), The effect of background knowledge on first and second language comprehension difficulty, ICSLP
Kimiko Tsukada (1998), Comparison of cross-language coarticulation: English, Japanese and Japanese-accented English, ICSLP
Satoshi Imaizumi, Hidemi Itoh, Yuji Tamekawa, Toshisada Deguchi, Koichi Mori (1998), Plasticity of non-native phonetic perception and production: a training study, ICSLP
Ian Watson (1998), The relation between perceptual and production categories in acquisition, ICSLP
Valerie Hazan, Sarah Barrett (1998), The development of perceptual cue-weighting in children aged 6 to 12, ICSLP
Anne Cutler, Takashi Otake (1998), Assimilation of place in Japanese and Dutch, ICSLP
Yuko Kondo, Yumiko Arai (1998), Prosodic constraint on v-to-v coarticulation in Japanese, ICSLP
Catia Cucchiarini, Henk van den Heuvel (1998), Postvocalic /r/-deletion in standard Dutch: how experimental phonology can profit from ASR technology, ICSLP
John Hajek, Ian Watson (1998), More evidence for the perceptual basis of sound change? suprasegmental effects in the development of distinctive nasalization, ICSLP
Jianwu Dang, Kiyoshi Honda (1998), Speech production of vowel sequences using a physiological articulatory model, ICSLP
Felicity Cox, Sallyanne Palethorpe (1998), Regional variation in the vowels of female adolescents from Sydney, ICSLP
Catherine Watson, Jonathan Harrington, Sallyanne Palethorpe (1998), A kinematic analysis of New Zealand and Australian English vowel spaces, ICSLP
Noel Nguyen, Sarah Hawkins (1998), Syllable-onset acoustic properties associated with syllable-coda voicing, ICSLP
Noel Nguyen, Alan A. Wrench, Fiona Gibbon, William J. Hardcastle (1998), Articulatory, acoustic and perceptual aspects of fricative-stop coarticulation, ICSLP
Rob J. J. H. van Son, Florien J. Koopmans-van Beinum, Louis C. W. Pols (1998), Efficiency as an organizing principle of natural speech, ICSLP
Inger Karlsson, Tanja Bänziger, Jana Dankovicova, Tom Johnstone, Johan Lindberg, Haakan Melin, Francis Nolan, Klaus R. Scherer (1998), Within-speaker variability due to speaking manners, ICSLP
Roland Kuhn, Patrick Nguyen, Jean-Claude Junqua, Lloyd Goldwasser, Nancy Niedzielski, Steven Fincke, Ken Field, Matteo Contolini (1998), Eigenvoices for speaker adaptation, ICSLP
Sue E. Johnson, Philip C. Woodland (1998), Speaker clustering using direct maximisation of the MLLR-adapted likelihood, ICSLP
Olli Viikki, Kari Laurila (1998), Incremental on-line speaker adaptation in adverse conditions, ICSLP
Mark J. F. Gales (1998), Cluster adaptive training for speech recognition, ICSLP
Jen-Tzung Chien (1998), On-line hierarchical transformation of hidden Markov models for speaker adaptation, ICSLP
Motoyuki Suzuki, Toshiaki Abe, Hiroki Mori, Shozo Makino, Hirotomo Aso (1998), High-speed speaker adaptation using phoneme dependent tree-structured speaker clustering, ICSLP
Tasos Anastasakos, Sreeram V. Balakrishnan (1998), The use of confidence measures in unsupervised adaptation of speech recognizers, ICSLP
John McDonough, William Byrne, Xiaoqiang Luo (1998), Speaker normalization with all-pass transforms, ICSLP
Rong Zheng, Zuoying Wang (1998), Toward on-line learning of Chinese continuous speech recognition system, ICSLP
Sharon L. Oviatt (1998), The CHAM model of hyperarticulate adaptation during human-computer error resolution, ICSLP
Ulla Uebler, Michael Schüssler, Heinrich Niemann (1998), Bilingual and dialectal adaptation and retraining, ICSLP
Tanja Schultz, Alex Waibel (1998), Language independent and language adaptive large vocabulary speech recognition, ICSLP
Goh Kawai, Keikichi Hirose (1998), A method for measuring the intelligibility and nonnativeness of phone quality in foreign language pronunciation training, ICSLP
Peter Blamey, Julia Sarant, Tanya Serry, Roger Wales, Christopher James, Johanna Barry, Graeme M. Clark, M. Wright, R. Tooher, C. Psarros, G. Godwin, M. Rennie, T. Meskin (1998), Speech perception and spoken language in children with impaired hearing, ICSLP
Catia Cucchiarini, Helmer Strik, Louis Boves (1998), Quantitative assessment of second language learners' fluency: an automatic approach, ICSLP
Paul Dalsgaard, Ove Andersen, William J. Barry (1998), Cross-language merged speech units and their descriptive phonetic correlates, ICSLP
Robert Eklund, Elizabeth Shriberg (1998), Crosslinguistic disfluency modelling: a comparative analysis of Swedish and American English human-human and human-machine dialogues, ICSLP
Horacio Franco, Leonardo Neumeyer (1998), Calibration of machine scores for pronunciation grading, ICSLP
Petra Geutner, Michael Finke, Alex Waibel (1998), Phonetic-distance-based hypothesis driven lexical adaptation for transcribing multilingual broadcast news, ICSLP
Chul-Ho Jo, Tatsuya Kawahara, Shuji Doshita, Masatake Dantsuji (1998), Automatic pronunciation error detection and guidance for foreign language learning, ICSLP
Roger Ho-Yin Leung, Hong C. Leung (1998), Lexical access for large-vocabulary speech recognition, ICSLP
Sharlene Liu, Sean Doyle, Allen Morris, Farzad Ehsani (1998), The effect of fundamental frequency on Mandarin speech recognition, ICSLP
Duncan Markham (1998), The perception of nativeness: variable speakers and flexible listeners, ICSLP
Michael F. McTear, Eamonn A. O'Hare (1998), Voice dictation in the secondary school classroom, ICSLP
Kazuo Nakayama, Kaoru Tomita-Nakayama (1998), The importance of the first syllable in English spoken word recognition by adult Japanese speakers, ICSLP
Anne-Marie Öster (1998), Spoken L2 teaching with contrastive visual and auditory feedback, ICSLP
Dominiek Sandra, Steven Gillis (1998), The role of phonological, morphological, and orthographic knowledge in the intuitive syllabification of Dutch words: a longitudinal approach, ICSLP
Ayako Shirose, Haruo Kubozono, Shigeru Kiritani (1998), The acquisition of Japanese compound accent rule, ICSLP
Lydia K. H. So, Zhou Jing (1998), The acquisition of Putonghua phonology, ICSLP
Kaoru Tomita-Nakayama, Kazuo Nakayama, Masayuki Misaki (1998), Enhancing speech processing of Japanese learners of English utilizing time-scale expansion with constant pitch, ICSLP
Volker Warnke, Elmar Nöth, Jan Buckow, Stefan Harbeck, Heinrich Niemann (1998), A bootstrap training approach for language model classifiers, ICSLP
Sandra P. Whiteside, Jeni Marshall (1998), Voice onset time patterns in 7-, 9- and 11-year old children, ICSLP
Sandra P. Whiteside, Carolyn Hodgson (1998), Some developmental patterns in the speech of 6-, 8- and 10-year old children: an acoustic phonetic study, ICSLP
Lisa-Jane Brown, John Locke, Peter Jones, Sandra P. Whiteside (1998), Language development after extreme childhood deprivation: a case study, ICSLP
Geoff Williams, Mark Terry, Jonathan Kaye (1998), Phonological elements as a basis for language-independent ASR, ICSLP
Claudio Zmarich, Roberta Lanni (1998), A phonetic and acoustic study of babbling in an Italian child, ICSLP
Roland Kuhn, Jean-Claude Junqua, Philip D. Martzen (1998), Rescoring multiple pronunciations generated from spelled words, ICSLP
Yolanda Blanco, Maria Cuellar, Arantxa Villanueva, Fernando Lacunza, Rafael Cabeza, Beatriz Marcotegui (1998), SIVHA, visual speech synthesis system, ICSLP
C. G. de Bruijn, Sandra P. Whiteside, P. A. Cudd, D. Syder, K. M. Rosen, L. Nord (1998), Using automatic speech recognition and its possible effects on the voice, ICSLP
Robert Alexander Fearn (1998), The importance of F0 or voice pitch for perception of tonal language: simulations with cochlear implant speech processing strategies, ICSLP
Karin Brunnegaard, Katja Laakso, Lena Hartelius, Elisabeth Ahlsen (1998), Assessing high-level language in individuals with multiple sclerosis: a pilot study, ICSLP
Shizuo Hiki, Kazuya Imaizumi, Yumiko Fukuda (1998), Design of cochlear implant device for transmitting voice pitch information in speech sound of Asian languages, ICSLP
Aileen K. Ho, John L. Bradshaw, Robert Iansek, Robin J. Alfredson (1998), Abnormal volume-duration relationship in parkinsonian speech, ICSLP
Cheol-Woo Jo, Dae-Hyun Kim (1998), Analysis of disordered speech signal using wavelet transform, ICSLP
Shigeyoshi Kitazawa, Hiroyuki Kirihata, Tatsuya Kitamura (1998), Multi-channel pulsation strategy for electric stimulation of cochlea, ICSLP
Eva Agelfors, Jonas Beskow, Martin Dahlquist, Björn Granström, Magnus Lundeberg, Karl-Erik Spens, Tobias Öhman (1998), Synthetic faces as a lipreading support, ICSLP
Lois Martin, John Bench (1998), Predicting language scores from the speech perception scores of hearing-impaired children, ICSLP
Oleg P. Skljarov (1998), Content-independent duration model on categories of voice and unvoice segments, ICSLP
Ali-Asghar Soltani-Farani, Edward H.S. Chilton, Robin Shirley (1998), Dynamical spectrogram, an aid for the deaf, ICSLP
Rosemary A. Varley, Sandra P. Whiteside (1998), Evidence of dual-route phonetic encoding from apraxia of speech: implications for phonetic encoding models, ICSLP
M. F. Cheesman, K. L. Smilsky, T. M. Major, F. Lewis, L. M. Boorman (1998), Speech communication profiles across the adult lifespan: persons without self-identified hearing impairment, ICSLP
William J. Barry (1998), Time as a factor in the acoustic variation of schwa, ICSLP
Hendrik F. V. Boshoff, Elizabeth C. Botha (1998), On the structure of vowel space: a genealogy of general phonetic concepts, ICSLP
Véronique Lecuit, Didier Demolin (1998), The relationship between intensity and subglottal pressure with controlled pitch, ICSLP
Alain Soquet, Véronique Lecuit, Thierry Metens, Bruno Nazarian, Didier Demolin (1998), Segmentation of the airway from the surrounding tissues on magnetic resonance images: a comparative study, ICSLP
Sorin Dusan, Li Deng (1998), Recovering vocal tract shapes from MFCC parameters, ICSLP
John H. Esling, Jocelyn Clayards, Jerold A. Edmondson, Qiu Fuyuan, Jimmy G. Harris (1998), Quantification of pharyngeal articulations using measurements from laryngoscopic images, ICSLP
Janice Fon (1998), Variance and invariance in speech rate as a reflection of conceptual planning, ICSLP
Masako Fujimoto, Emi Murano, Seiji Niimi, Shigeru Kiritani (1998), Correspondence between the glottal gesture overlap pattern and vowel devoicing in Japanese, ICSLP
Yukiko Fujisawa, Nobuaki Minematsu, Seiichi Nakagawa (1998), Evaluation of Japanese manners of generating word accent of English based on a stressed syllable detection technique, ICSLP
Shunichi Ishihara (1998), Independence of consonantal voicing and vocoid F0 perturbation in English and Japanese, ICSLP
Daniel Jurafsky, Alan Bell, Eric Fosler-Lussier, Cynthia Girand, William Raymond (1998), Reduction of English function words in switchboard, ICSLP
Hee-Sun Kim (1998), Duration compensation in non-adjacent consonant and temporal regularity, ICSLP
Keisuke Mori, Yorinobu Sonoda (1998), Relationship between lip shapes and acoustical characteristics during speech, ICSLP
Kunitoshi Motoki, Hiroki Matsuzaki (1998), A model to represent propagation and radiation of higher-order modes for 3-d vocal-tract configuration, ICSLP
Takuya Niikawa, Masafumi Matsumura, Takashi Tachimura, Takeshi Wada (1998), FEM analysis of aspirated air flow in three-dimensional vocal tract during fricative consonant phonation, ICSLP
Takesi Okadome, Tokihiko Kaburagi, Masaaki Honda (1998), Trajectory formation of articulatory movements for a given sequence of phonemes, ICSLP
Chilin Shih, Bernd Möbius (1998), Contextual effects on voicing profiles of German and Mandarin consonants, ICSLP
Andrew J. Lundberg, Maureen Stone (1998), Reconstructing the tongue surface from six cross-sectional contours: ultrasound data, ICSLP
Yasushi Terao, Tadao Murata (1998), Articulability of two consecutive morae in Japanese speech production: evidence from sound exchange errors in spontaneous speech, ICSLP
Anne Vilain, Christian Abry, Pierre Badin (1998), Coarticulation and degrees of freedom in the elaboration of a new articulatory plant: GENTIANE, ICSLP
Masahiko Wakumoto, Shinobu Masaki, Kiyoshi Honda, Toshikazu Ohue (1998), A pressure sensitive palatography: application of new pressure sensitive sheet for measuring tongue-palatal contact pressure, ICSLP
Sandra P. Whiteside, Rosemary A. Varley (1998), Dual-route phonetic encoding: some acoustic evidence, ICSLP
Brigitte Zellner (1998), Fast and slow speech rate: a characterisation for French, ICSLP
Padma Ramesh, Chin-Hui Lee, Biing-Hwang Juang (1998), Context dependent anti subword modeling for utterance verification, ICSLP
J. G. A. Dolfing, Andreas Wendemuth (1998), Combination of confidence measures in isolated word recognition, ICSLP
Daniel Willett, Andreas Worm, Christoph Neukirchen, Gerhard Rigoll (1998), Confidence measures for HMM-based speech recognition, ICSLP
Li Jiang, Xuedong Huang (1998), Vocabulary-independent word confidence measure using subword features, ICSLP
Qiguang Lin, Subrata Das, David Lubensky, Michael Picheny (1998), A new confidence measure based on rank-ordering subphone scores, ICSLP
Tatsuya Kawahara, Kentaro Ishizuka, Shuji Doshita, Chin-Hui Lee (1998), Speaking-style dependent lexicalized filler model for key-phrase detection and verification, ICSLP
Paul Duchnowski, Louis Braida, Maroula Bratakos, David Lum, Matthew Sexton, Jean Krause (1998), A speechreading aid based on phonetic ASR, ICSLP
Jan Nouza (1998), Training speech through visual feedback patterns, ICSLP
Ichiro Maruyama, Yoshiharu Abe, Takahiro Wakao, Eiji Sawamura, Terumasa Ehara, Katsuhiko Shirai (1998), Word sequence pair spotting for synchronization of speech and text in production of closed-caption TV programs for the hearing impaired, ICSLP
Aileen K. Ho, John L. Bradshaw, Robert Iansek, Robin J. Alfredson (1998), Volume regulation in parkinsonian speech, ICSLP
Eva Strangert, Mattias Heldner (1998), On the amount and domain of focal lengthening in Swedish, ICSLP
Daniel Hirst, Corine Astesano, Albert Di Cristo (1998), Differential lengthening of syllabic constituents in French: the effect of accent type and speaking style, ICSLP
Felix C. M. Quimbo, Tatsuya Kawahara, Shuji Doshita (1998), Prosodic analysis of fillers and self-repair in Japanese speech, ICSLP
Jinfu Ni, Goh Kawai, Keikichi Hirose (1998), A synthesis-oriented model of phrasal pitch movements in standard Chinese, ICSLP
Qing Guo, Fang Zheng, Jian Wu, Wenhu Wu (1998), Non-linear probability estimation method used in HMM for modeling frame correlation, ICSLP
Shuri Kumagai (1998), Patterns of linguopalatal contact during Japanese vowel devoicing, ICSLP
Xiao Yu, Guangrui Hu (1998), Speech separation based on the GMM PDF estimation, ICSLP
Xiaoqiang Luo (1998), Growth transform of a sum of rational functions and its application in estimating HMM parameters, ICSLP
Mirjam Wester, Judith M. Kessens, Helmer Strik (1998), Two automatic approaches for analyzing connected speech processes in Dutch, ICSLP
Johan W. Koolwaaij, Johan de Veth (1998), The use of broad phonetic class models in speaker recognition, ICSLP
Jorge Miquélez, Rocio Sesma, Yolanda Blanco (1998), Analysis and treatment of esophageal speech for the enhancement of its comprehension, ICSLP
Fernando Lacunza, Yolanda Blanco (1998), High quality text-to-speech system in Spanish for handicapped people, ICSLP
Corinna Ng, Ross Wilkinson, Justin Zobel (1998), Factors affecting speech retrieval, ICSLP
Johan Frid (1998), Perception of words with vowel reduction, ICSLP
Ingrid Ahmer, Robin W. King (1998), Automated captioning of television programs: development and analysis of a soundtrack corpus, ICSLP
Fabrice Lefèvre, Claude Montacié, Marie-José Caraty (1998), On the influence of the delta coefficients in a HMM-based speech recognition system, ICSLP
Raymond Low, Roberto Togneri (1998), Speech recognition using the probabilistic neural network, ICSLP
Imed Zitouni (1998), A language modeling based on a hierarchical approach: m_n^v, ICSLP
Michiko Watanabe (1998), Temporal variables in lectures in the Japanese language, ICSLP
Matthew Aylett (1998), Building a statistical model of the vowel space for phoneticians, ICSLP
Michelle Minnick Fox (1998), Computer-mediated input and the acquisition of L2 vowels, ICSLP
Najam Malik, W. Harvey Holmes (1998), Speech analysis by subspace methods of spectral line estimation, ICSLP
Petra Hansson (1998), Pausing in Swedish spontaneous speech, ICSLP
Elisabeth Zetterholm (1998), Prosody and voice quality in the expression of emotions, ICSLP
Julie Lunn, Alan A. Wrench, Janet Mackenzie Beck (1998), Acoustic analysis of /l/ in glossectomees, ICSLP
Arne Risberg (1993), The development of speech-processing aids for the deaf - past, present and future, SLTDP
Wolfgang Helmut Döring, Hans-Günter Hirsch (1993), Speech intelligibility improvement for people with cochlear implants, SLTDP
S. Peeters, F. E. Offeciers, L. Moeneclaey (1993), The Laura cochlear implant programmed with the continuous interleaved strategy and phase-locked continuous interleaved, SLTDP
Barry Nevison (1993), New coding strategy for the nucleus speech processor, SLTDP
Ian R. Summers, Ruth Gray (1993), Comparison of speech features for presentation to the profoundly deaf, SLTDP
Donald G. Jamieson, Todd Schneider (1993), Consumer-based electroacoustic hearing aid measures, SLTDP
John Walliker, Julian Daley, Kerensa Smith, Andrew Faulkner, Adrian Fourcin (1993), Speech analytic hearing aids for the profoundly deaf: technical design aspects and user field trial results, SLTDP
Paul Dalsgaard, Ove Andersen, Viggo Moss Hansen (1993), Tactile representation of selected acoustic-phonetic features for use in lipreading, SLTDP
Maxine Eskenazi, E. Vormes, G. Monguillot, B. Frachet (1993), A new training and assessment technique for cochlear implants, SLTDP
A. Maynard Engebretson (1993), Issues of reverberation regarding acoustic prosthetic devices, SLTDP
D. Bauer, J. C. Geiger, R. Beerwerth (1993), Speech signal conditioning communication aids for the impaired with severe auditory sensory damages, SLTDP
Hans Georg Piroth, Thomas Arnhold, Hans Lindow (1993), The execution of tracking experiments with a system for the synthesis of tactile speech equivalents, SLTDP
E. M. Ellis, A. J. Robinson (1993), A tactile system for speech listening based on phonetic representation, SLTDP
J. Crestel, M. Guitton (1993), Alaryngeal female speech: an approach for the restoration of gender, SLTDP
K. Hermansen, F. K. Fink, U. Hartmann (1993), Parametric transformation of speech signals, SLTDP
Claude Hustinx (1993), Computer aided learning in the new educational approach to surdity, SLTDP
Georges Rensonnet (1993), Symbol: multilingual and multicode lexical learning system on CD-i environment, SLTDP
Birgit Cook (1993), A multi-media program exercising the basics in lip-reading, cued-speech and sign-language vocabulary, SLTDP
Yumiko Fukuda, Shizuo Hiki (1993), Design of a system for electronic dictionary of Japanese sign language, SLTDP
Cilia M. Beijk, Ben A. G. Elsendoorn (1993), A comparison of fundamental frequency development of deaf and hearing children aged 4 to 20 years, SLTDP
Magnus Magnusson (1993), Still-picture telephony for an aphasic user, SLTDP
Judy Lariviere, Elizabeth MacKinnon, Nancy Risebrough (1993), Is speech recognition worth it?, SLTDP
M. Zajicek, C. Rose (1993), Evaluation of strategies for replacing mouse action with speech, SLTDP
Corine Bickley, Sheri Hunnicutt, Lori Lamel (1993), Alternative strategies for creating AutoCAD drawings, SLTDP
Georgios Kouroupetroglou, Antonis Anagnostopoulos, Georgios Papakostas, Aris Charoupias (1993), The BLISPHON alternative communication system for the speechless individual, SLTDP
Iain R. Murray, John L. Arnott (1993), A tool for the rapid development of new synthetic voice personalities, SLTDP
Matthew E. J. Wood, Eric Lewis (1993), Grammatical recognition in computer aided conversation, SLTDP
Jane Brodin (1993), Telefax communication for people with mental, SLTDP
Torben Moller-Sorensen (1993), HSP-form training helps children to begin with reading, SLTDP
Harry Levitt (1993), The impact of technology on speech rehabilitation, SLTDP
S. Aguilera, M. A. Berrojo, F. M. Gimenez de los Galanes, J. Colas, J. Macias, J. M. Montero (1993), Impaired persons facilities based on a multi-modality speech processing system, SLTDP
Alan A. Wrench, M. S. Jackson, Mervyn A. Jack, D. S. Soutar, A. G. Robertson, Janet MacKenzie Beck, John Laver (1993), A speech therapy workstation providing visual feedback of segmental quality, SLTDP
Hector Javkin, Norma Antonanzas-Barroso, Amitav Das, Nancy Niedzielski, Yoshinori Yamada, Norio Murata, Harry Levitt, Karen Youdelman (1993), A multi-parameter speech training system, SLTDP
James A. Till (1993), CASPER: computer assisted speech evaluation expert system, SLTDP
Onsy A. Alim, T. Anber, R. Ahmed (1993), Speech rehabilitation programme for mentally disabled children, SLTDP
Brian M. Weiss (1993), Facilitated voice output communication for persons with autism using a ZYGO MACAW, SLTDP
Fred Runge, Ulrich Schultheiß (1993), A voice-controlled telecommunication terminal, SLTDP
Parimala Raghavendra, Elisabet Rosengren (1993), Evaluation of multi-talk II: feedback from users and their partners, SLTDP
Katherine T. Samworth (1993), A method for obtaining control parameters for a parallel formant synthesizer, SLTDP
John L. Arnott, Norman Alm, Iain R. Murray (1993), Enhancing a communication prosthesis with vocal emotion effects, SLTDP
Debra Yarrington, Richard Foulds (1993), Personalizing synthesized voices, SLTDP
R. W. Series (1993), A speech training aid, SLTDP
Klara Vicsi (1993), A product oriented teaching and training system for speech handicapped children, SLTDP
Stefan Grocholewski (1993), PC based speech training environment for deaf children, SLTDP
J. M. Gill (1993), Speech technology for visually disabled persons: research and practical use, SLTDP
Wolf-Joachim Fischer, Wolf Owe, Ulrich Kordon, Diane Hirschfeld (1993), A pocket reading device for the blind, SLTDP
Renee van Bezooijen, Willy Jongenburger (1993), Evaluation of an electronic newspaper for the blind in the Netherlands, SLTDP
G. Epitropakis, N. Yiourgalis, D. Valkaniotis, P. Pierros, C. Kesendes, Nikos Fakotakis, George Kokkinakis (1993), An improved Greek reading machine for visually impaired persons, SLTDP
Joaquim Llisterri, Natividad Fernandez, Francesc Gudayol, Juan Jose Poyatos, Josep Marti (1993), Testing users acceptance of ciber232, a text-to-speech system used by blind persons, SLTDP
Xuedong Huang (2004), Enabling Natural Computing, ISCSLP
Biing-Hwang (Fred) Juang (2004), Speech Research in Telecommunications: A Bell-Centric View, ISCSLP
William Shi-Yuan Wang (2004), Spoken Language Processing: People versus Machines, ISCSLP
Wu Chou (2004), Minimum Classification Error Rate Pattern Recognition Approach for Speech and Language Processing, ISCSLP
Hong-Kwang Jeff Kuo (2004), Maximum Entropy Modeling for Speech Recognition, ISCSLP
MeiYuh Hwang, Xin Lei, Tim Ng, Ivan Bulyko, Mari Ostendorf, Andreas Stolcke, Wen Wang, Jing Zheng, Venkata Ramana Rao Gadde, Martin Graciarena, Yan Huang, Manhung Siu (2004), Progress on Mandarin Conversational Telephone Speech Recognition, ISCSLP
YiCheng Pan, ChiaHsing Yu, LinShan Lee (2004), Large Vocabulary Continuous Mandarin Speech Recognition using Finite State Machine, ISCSLP
Gang Guo, Chao Huang, Hui Jiang, RenHua Wang (2004), A Comparative Study on Various Confidence Measures in Large Vocabulary Speech Recognition, ISCSLP
Wai Kit Lo, Frank K. Soong, Satoshi Nakamura (2004), Generalized Posterior Probability for Minimizing Verification Errors at Subword, Word and Sentence Levels, ISCSLP
Nick JuiChang Wang, ChingHo Tsai, Patrick Huang, JiaLin Shen (2004), Chinese Large-Vocabulary Name Recognition System using Character Description and Syllable Spelling Recognition, ISCSLP
Zhengyu Zhou, Helen Meng (2004), Error Identification for Large Vocabulary Speech Recognition, ISCSLP
Jianwu Dang, Jianguo Wei, Takeharu Suzuki, Kiyoshi Honda, Pascal Perrier, Masaaki Honda (2004), Investigation and Modeling of Coarticulation in Speech Production, ISCSLP
ShuChuan Tseng (2004), Spontaneous Mandarin Production: Results of a Corpus-Based Study, ISCSLP
Pik Ki Peggy Mok, Sarah Hawkins (2004), Effects of Phonemic Vs Allophonic Density and Stress on Vowel-To-Vowel Coarticulation in Cantonese and Beijing Mandarin, ISCSLP
Hongwei Ding, Oliver Jokisch, Ruediger Hoffmann (2004), Glottalization in Inventory Construction: A Cross-Language Study, ISCSLP
Yiya Chen (2004), Focus and Intonational Phrase Boundary in Standard Chinese, ISCSLP
Jiahong Yuan (2004), Perception of Mandarin Intonation, ISCSLP
Boon Pang Lim, Haizhou Li, Yu Chen (2004), Language Identification Through Large Vocabulary Continuous Speech Recognition, ISCSLP
Shizhen Wang, Jia Liu, Runsheng Liu (2004), Language Identification using Discriminative Weighted Language Models, ISCSLP
Guiwen Ou, Dengfeng Ke (2004), Text-Independent Speaker Verification based on Relation of MFCC Components, ISCSLP
KaYee Leung, ManWai Mak, Manhung Siu, SunYuan Kung (2004), Adaptive Conditional Pronunciation Modeling using Articulatory Features for Speaker Verification, ISCSLP
JyhHer Yang, YuanFu Liao (2004), Unseen Handset Mismatch Compensation based on Feature/Model-Speech A Priori Knowledge Interpolation for Robust Speaker Recognition, ISCSLP
Junmei Bai, Rong Zheng, Bo Xu, Shuwu Zhang (2004), Robust Speaker Recognition Integrating Pitch and Wiener Filter, ISCSLP
Zhenhua Ling, Yuping Wang, Yu Hu, RenHua Wang (2004), Modeling Glottal Effect on the Spectral Envelop of STRAIGHT using Mixture of Gaussians, ISCSLP
Fengyan Qi, Changchun Bao, Yan Liu (2004), A Novel Two-Step SVM Classifier for Voiced/Unvoiced/Silence Classification of Speech, ISCSLP
Wentao Gu, Keikichi Hirose, Hiroya Fujisaki (2004), Analysis of Shanghainese F0 Contours based on the Command-Response Model, ISCSLP
Weibin Zhu, Wei Zhang, Qin Shi, Xijun Ma, Liqin Shen (2004), Automatic Detection of Chinese Accent-Index based on Approximation-Ratio, ISCSLP
Jilei Tian, Jani Nurminen (2004), On Analysis of Eigenpitch in Mandarin Chinese, ISCSLP
Jiangping Kong (2004), Acoustical Study on Sub-Harmonic of Glottal Source in Mandarin Tones, ISCSLP
Donglai Zhu, Qiang Huo, Jian Wu (2004), A Study of Switching State Segmentation in Segmental Switching Linear Gaussian Hidden Markov Models for Robust Speech Recognition, ISCSLP
Yi Chen, LinShan Lee (2004), Robust Features for Speech Recognition using Minimum Variance Distortionless Response (MVDR) Spectrum Estimation and Feature Normalization Techniques, ISCSLP
YungSheng Huang, Jeihweih Hung (2004), Data-Driven Temporal Filters based on Maximum Mutual Information for Robust Features in Speech Recognition, ISCSLP
ChihHsien Huang, JenTzung Chien, Hsinmin Wang (2004), A New Eigenvoice Approach to Speaker Adaptation, ISCSLP
XiaoBing Li, LiRong Dai, RenHua Wang (2004), MCE-Based Training of Subspace Distribution Clustering HMM, ISCSLP
Yun Tang, Wenju Liu, Yiyan Zhang, Bo Xu (2004), A Framework for Fast Segment Model by Avoidance of Redundant Computation on Segment, ISCSLP
Akemi Hoshino (2004), Dependence of Correct Pronunciation of Chinese Aspirated Sounds on Power During Voice Onset Time, ISCSLP
Akemi Hoshino, Akio Yassuda (2004), Effect of Japanese Articulation of Stops on Pronunciation of Chinese Aspirated Sounds by Japanese Students, ISCSLP
Huiju Hsu (2004), Taiwan Mandarin -- Does It Remain Homogeneous?, ISCSLP
Xin Luo, QianJie Fu (2004), Contributions of Periodicity Fluctuation Cues in Individual Frequency Channels to Chinese Speech Recognition, ISCSLP
Bin Dong, Qingwei Zhao, Jianping Zhang, Yonghong Yan (2004), Automatic Assessment of Pronunciation Quality, ISCSLP
Jing Li, Changchun Bao (2004), Quantization of SEW and REW Magnitude for 2 kb/s Waveform Interpolation Speech Coding, ISCSLP
Guiping Wang, Changchun Bao (2004), Low Complexity Decomposition for the Characteristic Waveform of Speech Signal, ISCSLP
Changchun Bao, Jason Lukasiak, Christian Ritz (2004), High Quality Harmonic Excitation Linear Predictive Speech Coding at 2 kb/s, ISCSLP
Yanning Bai, Changchun Bao (2004), An Improved 4 kbit/s CELP Speech Coding Algorithm, ISCSLP
GuiLin (2004), An Embedded English Synthesis Approach based on Speech Concatenation and Smoothing, ISCSLP
GuoPing Hu, QingFeng Liu, Yu Hu, RenHua Wang (2004), Hearer Model based Stress Prediction for Chinese TTS System, ISCSLP
Honghui Dong, Jianhua Tao, Bo Xu (2004), Grapheme-To-Phoneme Conversion in Chinese TTS System, ISCSLP
ShaoHuang Pin, Yongcheng Chen, Hsinmin Wang, Chiuyu Tseng (2004), A Mandarin TTS System with an Integrated Prosodic Model, ISCSLP
HuaJui Peng, Chiching Chen, Chiuyu Tseng, Kehjiann Chen (2004), Predicting Prosodic Words from Lexical Words--A First Step Towards Predicting Prosody from Text, ISCSLP
GaoPeng Chen, Gerard Bailly, QingFeng Liu, RenHua Wang (2004), A Superposed Prosodic Model for Chinese Text-To-Speech Synthesis, ISCSLP
Guoyu Zuo, Wenju Liu, Xiaogang Ruan (2004), Improving the Performance of MGM-Based Voice Conversion by Preparing Training Data Method, ISCSLP
Wentao Gu, Keikichi Hirose, Hiroya Fujisaki (2004), Analysis and Synthesis of Cantonese F0 Contours based on the Command-Response Model, ISCSLP
Bei Liu, Limin Du (2004), The Disambiguation Strategies of Semantic Analysis in Chinese Spoken Dialogue System, ISCSLP
Wing Lin Yip, Helen Meng (2004), Bilingual Response Generation using Semi-Automatically-Induced Templates for a Mixed-Initiative Dialog System, ISCSLP
Yi Liu, Pascale Fung, Shudong Huang, Chris Cieri, Lufeng Zhai, Benfeng Chen (2004), Development of A Chinese Telephony Conversational Corpus for Speech Processing, ISCSLP
ChungHsien Wu, ChiChun Hsia, JiunFu Chen, TeHsien Liu (2004), Variable-Length Unit Selection using LSA-Based Syntactic Structure Cost, ISCSLP
HungYan Gu, KuoHsian Wang (2004), An Acoustic and Articulatory Knowledge Integrated Method for Improving Synthetic Mandarin Speech's Fluency, ISCSLP
Tien Ying Fung, Yuk Chi Li, Helen Meng, P.C. Ching (2004), Prosody and Style Controls in CU VOCAL using SSML and SAPI XML Tags, ISCSLP
JianFeng Li, GuoPing Hu, Ming Fan, LiRong Dai (2004), Apply Length Distribution Model to Intonational Phrase Prediction, ISCSLP
Chiuyu Tseng, Yehlin Lee (2004), Intensity in Relation to Prosody Organization, ISCSLP
Jianhua Tao (2004), Rhythm Correlation of Speech Synthesis System, ISCSLP
Yun Tang, Wenju Liu, Bo Xu (2004), Trigram Duration Modeling in Speech Recognition, ISCSLP
Chao Xu, Yi Liu, Yongsheng Yang, Pascale Fung, Zhigang Cao (2004), A System for Mandarin Short Phrase Recognition on Portable Devices, ISCSLP
Gang Peng, Hongying Zheng, William S.Y. Wang (2004), Tone Recognition for Chinese Speech: A Comparative Study of Mandarin and Cantonese, ISCSLP
ShanRuei You, ShihChieh Chien, ChihHsing Hsu, KeShiu Chen, JiaJang Tu, Jeng Shien Lin, SenChia Chang (2004), Chinese-English Mixed-Lingual Keyword Spotting, ISCSLP
Jian Yang, Yuanyuan Pu, Hong Wei (2004), An Acoustic-Phonetic Analysis of Large Vocabulary Continuous Mandarin Speech Recognition for Non-Native Speakers, ISCSLP
Yuezhong Tang, Xia Wang, Yang Cao, Feng Ding (2004), Feature Masking in an Embedded Mandarin Speech Recognition System, ISCSLP
TaiHwei Hwang, SenChia Chang (2004), Energy Contour Enhancement for Noisy Speech Recognition, ISCSLP
Bo Liu, LiRong Dai, JinYu Li, RenHua Wang (2004), Double Gaussian based Feature Normalization for Robust Speech Recognition, ISCSLP
C.L. Chen, Y.R. Wang, S.H. Chen (2004), A Study on Mandarin Broadcast News Speech Recognition, ISCSLP
GuoHong Ding, Bo Xu, Xia Wang, Yang Cao, Feng Ding, Yuezhong Tang (2004), Task-Specific Adaptation in Chinese Name Recognition, ISCSLP
Dongsheng Luo, Xiang Xie, Jingming Kuang (2004), Integrating Tonal Information into Mandarin Name Recognition with Different Strategies, ISCSLP
Gang Guo, RenHua Wang (2004), Discriminative Transform for Confidence Estimation in Mandarin Speech Recognition, ISCSLP
Michael Zhang, Jun Xu (2004), An Investigation into Subspace Rapid Speaker Adaptation, ISCSLP
Chen Yang, Frank K. Soong, Tan Lee (2004), On Noise Robustness of Dynamic and Static Features for Continuous Cantonese Digit Recognition, ISCSLP
Thomas Fang Zheng, Jing Li, Zhanjiang Song, Mingxing Xu (2004), A Two-Step Keyword Spotting Method based on Context-Dependent A Posteriori Probability, ISCSLP
JyhMin Cheng, HsiaoChuan Wang (2004), A Method of Estimating the Equal Error Rate for Automatic Speaker Verification, ISCSLP
Rong Zheng, Shuwu Zhang, Bo Xu (2004), Text-Independent Speaker Identification using GMM-UBM and Frame Level Likelihood Normalization, ISCSLP
Joyce Y.C. Chan, P.C. Ching, Tan Lee, Helen Meng (2004), Detection of Language Boundary in Code-Switching Utterances by Bi-Phone Probabilities, ISCSLP
Chao Qin, Tan Lee (2004), Cantonese Verbal Information Verification System using GMM-Based Anti-Model, ISCSLP
TsangLong Pao, YuTe Chen, JunHeng Yeh (2004), Emotion Recognition from Mandarin Speech Signals, ISCSLP
Shaojun Wang, Shaomin Wang, Russell Greiner, Dale Schuurmans, Li Cheng (2004), Exploiting Syntactic, Semantic and Lexical Regularities in Language Modeling Via Directed Markov Random Fields, ISCSLP
ChuangHua Chueh, JenTzung Chien, Hsinmin Wang (2004), A Maximum Entropy Approach for Integrating Semantic Information in Statistical Language Models, ISCSLP
Berlin Chen, WenHung Tsai, JenWei Kuo (2004), Statistical Language Model Adaptation for Mandarin Broadcast News Transcription, ISCSLP
FuHua Liu, Yuqing Gao (2004), Use of Direct Modeling in Natural Language Generation for Chinese and English Translation, ISCSLP
JhingFa Wang, ShunChieh Lin, HsuehWei Yang (2004), A New Two-Layer Approach for Spoken Language Translation, ISCSLP
Yan Zhang, Hideki Kashioka (2004), Analysis of Paraphrased Corpus and Lexical-Based Approach to Chinese Paraphrasing, ISCSLP
LinShan Lee, ShunChuan Chen, Yuan Ho, JiaFu Chen, MingHan Li, Tehsuan Li (2004), An Initial Prototype System for Chinese Spoken Document Understanding and Organization for Indexing/Browsing and Retrieval Applications, ISCSLP
ChiaHsin Hsieh, ChienLin Huang, ChungHsien Wu (2004), Spoken Document Summarization using Topic-Related Corpus and Semantic Dependency Grammar, ISCSLP
JiangChun Chen, JuiLin Lo, JyhShing Roger Jang (2004), Computer Assisted Spoken English Learning for Chinese in Taiwan, ISCSLP
Stephanie Seneff, Chao Wang, Mitchell Peabody, Victor Zue (2004), Second Language Acquisition Through Human Computer Dialogue, ISCSLP
Haiping Li, Haixin Chai (2004), An Information Gain and Grammar Complexity based Approach to Attribute Selection in Speech Enabled Information Retrieval Dialogs, ISCSLP
David B. Pisoni, Beth G. Greene (1989), Ten years of research on the perceptual evaluation of synthetic speech: a summary and critical interpretation, SIOA
Louis C.W. Pols (1989), Improving synthetic speech quality by systematic evaluation, SIOA
Erland Hjelmquist (1989), Spoken newspaper for the blind, SIOA
Hans W. Zelle (1989), Application and comparative assessment of a formant synthesis chip, SIOA
Roger K. Moore (1989), Assessment of speech input systems, SIOA
David S. Pallett (1989), Speech input assessment using benchmark tests: procedures, advantages, and limitations, SIOA
Helmut Mangold (1989), Assessment of speech recognizers in public information and ordering systems, SIOA
David B. Pisoni, Beth G. Greene, John S. Logan (1989), An overview of ten years of research on the perception of synthetic speech, SIOA
Murray Spiegel, Mary Jo Altom, Marian Macchi, Karen Wallace (1989), A monosyllabic test corpus to evaluate the intelligibility of synthesized and natural speech, SIOA
Rolf Carlson, Björn Granström (1989), Evaluation and development of the KTH text-to-speech system on the segmental level, SIOA
Ute Jekosch (1989), The cluster-based rhyme test: a segmental synthesis test for open vocabulary, SIOA
Martine Grice (1989), Syntactic structures and lexicon requirements for semantically unpredictable sentences in a number of languages, SIOA
Valerie Hazan, Martine Grice (1989), The assessment of synthetic speech intelligibility using semantically unpredictable sentences, SIOA
Christian Benoît (1989), Intelligibility test for the assessment of French synthesisers using semantically unpredictable sentences, SIOA
John E. Clark, Robert H. Mannell (1989), Frequency resolution effects on phonetic level perception of synthesized speech, SIOA
Victor Zue, Stephanie Seneff, James Glass (1989), Speech database development: TIMIT and beyond, SIOA
Akira Kurematsu, Kazuya Takeda, Hisao Kuwabara, Kiyohiro Shikano (1989), ATR Japanese speech database as a tool of speech recognition and synthesis, SIOA
Katsuhiko Shirai, Hiroya Fujisaki, S. Itahashi (1989), Speech database projects in Japan: present and future, SIOA
J. Bruce Millar (1989), Design and use of a national speech database, SIOA
Shyam S. Agrawal (1989), Acoustic phonetic data base for Hindi speech, SIOA
Volker Steinbiss, Hans-Hermann Hamer, Dieter Mergel, Hermann Ney, Andreas Noll, Annedore Paeseler, Herbert Piotrowski, Horst Tomaschewski (1989), The speech database used in SPICOS, SIOA
Bert van Heugten (1989), The speech processing expertise centre SPEX, SIOA
Per Hedelin, Dieter Huber (1989), The CTH - speech database: an integrated multilevel approach, SIOA
G. P. Walker, W. Millar (1989), Database collection: experience at British Telecom Research Laboratories, SIOA
Rolf Carlson, Björn Granström, Lennart Nord (1989), The KTH speech database, SIOA
Peter Howell, Michael Johnson, Karima Kadi-Hanifi, Pippa Bark, Patricia Hanke, Celia Bonnett, Trudie Wingfield (1989), Databases incorporating spontaneous speech from fluent and disfluent speakers, SIOA
N. De Sario, Andrea Di Carlo, A. Paoloni, B. Saverione (1989), An acoustical database design for speaker recognition, SIOA
Tjeerd de Graaf (1989), Reconstruction, signal enhancement and storage of old sound material, SIOA
Chaslav V. Pavlovic, Christel Sorin, Jean Pierre Roumiquiere, Jean Pierre Lucas (1989), A comparative analysis of the magnitude estimation and the pair comparison techniques for use in assessing quality of text-to-speech synthesis, SIOA
Chaslav V. Pavlovic, Mario Rossi, Robert Espesser (1989), Subjective assessment of acceptability, intelligibility and naturalness of text-to-speech synthesis, SIOA
Michel Cartier, Christer Karlsson, Giulio Modena (1989), Standardization of synthetic speech quality for telecommunication purposes, SIOA
Renée van Bezooijen, Louis C. W. Pols (1989), Evaluation of text-to-speech conversion for Dutch: from segment to text, SIOA
R. P. M. W. van Gerwen, Wilhelm H. Vieregge, M. P. A. M. Kerkhof (1989), Evaluation of an automatic text-to-speech conversion system for Spanish, SIOA
Alex I. C. Monaghan, D. Robert Ladd (1989), Evaluating intonation in the CSTR text-to-speech system, SIOA
David S. Pallett (1989), Benchmark tests for DARPA resource management database performance evaluations, SIOA
Jeremy Peckham, Trevor Thomas, E. Frangoulis (1989), Recogniser sensitivity analysis: trial results and future directions, SIOA
Melvyn J. Hunt (1989), Figures of merit for assessing connected-word recognisers, SIOA
Henry S. Thompson (1989), Evaluation of phoneme lattices: four methods compared, SIOA
J. M. E. van de Vegte, M. M. Taylor (1989), Testing the effective vocabulary capacity method of evaluating speech recognizers, SIOA
Jeroen G. van Velden, Herman J. M. Steeneken (1989), RAMOS i: recognizer assessment by manipulation of speech, SIOA
Sabine Crosnier, Mats Blomberg, Kjell Elenius (1989), Speech recognizer sensitivity to the variation of different control parameters in synthetic speech, SIOA
Maxine Eskénazi (1989), On coordinated assessment efforts in France, SIOA
Seiichi Nakagawa (1989), An evaluation method for continuous speech recognition systems, SIOA
Phillip Dermody, Kerrie Mackie (1989), Development of analytical speech input/output assessment procedures, SIOA
C. Bourjot, A. Boyer, Dominique Fohr, Jean-Paul Haton (1989), Tools for phonetic labeling and phonetic assessment, SIOA
Lori F. Lamel (1989), Some perspectives on speech database development, SIOA
Lori F. Lamel, Robert H. Kassel, Stephanie Seneff (1989), Speech database development: design and analysis of the acoustic-phonetic corpus, SIOA
Lou Boves (1989), Linguistic data bases and tests on language models, SIOA
Guy Pérennou, M. de Calmès, J. M. Pécatte, Nadine Vigouroux (1989), Phonetic-string alignment for an automatic labelling of speech corpora, SIOA
C. J. M. van Hoeckel (1989), The reliability of manual labelling of continuous speech, SIOA
A. K. Datta, R. Sridhar (1989), Organisation and access procedure for a large lexicon, SIOA
Jan P. M. Hendriks (1989), An acoustic-phonetic formalism for database access, SIOA
Jean Claude Caerou, Jean Marc Dolmazon, Jean Michel Lunati (1989), SESAM: a low cost workstation for speech assessment, SIOA
Giuseppe Castagneri, Lucia Vacchetta, Andrea Di Carlo (1989), An application of relational database to recognizer testing workstation, SIOA
Alessandro Falaschi (1989), An automated procedure for minimum size phonetically balanced phrases selection, SIOA
Ronald Vendelmans (1989), A structured knowledge bank for syntactic and semantic speech analysis, SIOA
Henry S. Thompson (1989), Linguistic corpora for the language industry: a European Community public utility, SIOA
Christian Benoît (1989), Towards the perceptual quantification of context redundancy in sentences, SIOA
Janet M. Baker (1989), Speech recognition: interactive performance assessment for realistic environments, SIOA
Sheryl R. Young (1989), Evaluation techniques for spoken language systems, SIOA
Renée van Bezooijen (1989), Evaluation of the suitability of Dutch text-to-speech conversion for application in a digital daily newspaper, SIOA
Shizuo Hiki (1989), Test items for evaluating quality of synthetic speech, SIOA
Mats Blomberg, Kjell Elenius (1989), Testing some essential parameters of a word recognizer used in car noise, SIOA
Harry Chang, Alan Smith, George Vysotsky (1989), An automated system for ASR performance evaluation, SIOA
Herman J. M. Steeneken, M. Tomlinson, Jean-Luc Gauvain (1989), Assessment of two commercial recognizers with the SAM workstation and EUROM 0, SIOA
Raymond Descout, Pierre Dumouchel, Pierre Hamel, Louis Vrooment (1989), Design and recording of a large speech database over the local telephone network in English and in French, SIOA
Marek Strelec, Jonas Rohnke, Antonio Bonafonte, Mateusz Lajszczak, Trevor Wood (2022), Discrete Acoustic Space for an Efficient Sampling in Neural Text-To-Speech, IberSPEECH
Marc Arnela, Leonardo Pereira-Vivas, Jorge Egea (2022), An animated realistic head with vocal tract for the finite element simulation of vowel /a/, IberSPEECH
Ander González-Docasal, Aitor Álvarez, Haritz Arzelus (2022), Exploring the limits of neural voice cloning: A case study on two well-known personalities, IberSPEECH
Marc Freixes, Joan Claudi Socoró, Francesc Alías (2022), Analysis of iterative adaptive and quasi closed phase inverse filtering techniques on OPENGLOT synthetic vowels, IberSPEECH
Iván López-Espejo, Zheng-Hua Tan, Jesper Jensen (2022), An Experimental Study on Light Speech Features for Small-Footprint Keyword Spotting, IberSPEECH
Dayana Ribas, Miguel Angel Pastor Yoldi, Antonio Miguel, David Martínez, Alfonso Ortega, Eduardo Lleida (2022), S3prl-Disorder: Open-Source Voice Disorder Detection System based in the Framework of S3PRL-toolkit, IberSPEECH
Filipe Reynaud, Eugénio Ribeiro, David Martins de Matos (2022), Active Learning Improves the Teacher’s Experience: A Case Study in a Language Grounding Scenario, IberSPEECH
Celia García-Ruiz, Angel M. Gomez, Juan M. Martín-Doñas (2022), The role of window length and shift in complex-domain DNN-based speech enhancement, IberSPEECH
Yongjian Chen, Mireia Farrús (2022), Neural Detection of Cross-lingual Syntactic Knowledge, IberSPEECH
Sergio Izquierdo del Alamo, Beltrán Labrador, Alicia Lozano-Diez, Doroteo T. Toledano (2022), Efficient Transformers for End-to-End Neural Speaker Diarization, IberSPEECH
Vinícius G. Santos, Caroline Adriane Alves, Bruno Baldissera Carlotto, Bruno Angelo Papa Dias, Lucas Rafael Stefanel Gris, Renan de Lima Izaias, Maria Luiza Azevedo de Morais, Paula Marin de Oliveira, Rafael Sicoli, Flaviane Romani Fernandes Svartman, Marli Quadros Leite, Sandra Maria Aluísio (2022), CORAA NURC-SP Minimal Corpus: a manually annotated corpus of Brazilian Portuguese spontaneous speech, IberSPEECH
Federico Costa, Miquel India, Javier Hernando (2022), Speaker Characterization by means of Attention Pooling, IberSPEECH
Marina Escobar Planas, Emilia Gómez, Carlos-D Martínez-Hinarejos (2022), Enhancing the Design of a Conversational Agent for an Ethical Interaction with Children, IberSPEECH
Isabel Carvalho, Hugo Gonçalo Oliveira, Catarina Silva (2022), Sentiment Analysis in Portuguese Dialogues, IberSPEECH
Eros Rosello, Alejandro Gomez-Alanis, Manuel Chica, Angel M. Gomez, Jose A. Gonzalez, Antonio M. Peinado (2022), On the application of conformers to logical access voice spoofing attack detection, IberSPEECH
Irune Zubiaga, Raquel Justo, M. Inés Torres, Mikel De Velasco (2022), Speech emotion recognition in Spanish TV Debates, IberSPEECH
Emanuel Matos, Mário Rodrigues, António Teixeira (2022), Assessing Transfer Learning and automatically annotated data in the development of Named Entity Recognizers for new domains, IberSPEECH
Anna Pompili, Tiago Luís, Nuno Monteiro, João Miranda, Carlo Mendes, Sérgio Paulo (2022), On the detection of acoustic events for public security: the challenges of the counter-terrorism domain, IberSPEECH
Manuel Chica, Alejandro Gomez-Alanis, Eros Rosello, Angel M. Gomez, Jose A. Gonzalez, Antonio M. Peinado (2022), Database dependence comparison in detection of physical access voice spoofing attacks, IberSPEECH
Cristina Luna Jiménez, Syaheerah Lebai Lutfi, Manuel Gil-Martín, Ricardo Kleinlein, Juan M. Montero, Fernando Fernández-Martínez (2022), Measuring trust at zero-acquaintance using acted-emotional videos, IberSPEECH
José Manuel Ramírez Sánchez, Laura Docio-Fernandez, Carmen Garcia Mateo (2022), Galician’s Language Technologies in the Digital Age, IberSPEECH
Alejandro Gomez-Alanis, Lukas Drude, Andreas Schwarz, Rupak Vignesh Swaminathan, Simon Wiesler (2022), Contextual-Utterance Training for Automatic Speech Recognition, IberSPEECH
Eder Del Blanco, Inge Salomons, Eva Navas, Inma Hernáez (2022), Phone classification using electromyographic signals, IberSPEECH
Mikel Penagarikano, Amparo Varona, German Bordel, Luis J. Rodriguez-Fuentes (2022), Semisupervised training of a fully bilingual ASR system for Basque and Spanish, IberSPEECH
David Gimeno-Gomez, Carlos David Martinez Hinarejos (2022), Speaker-Adapted End-to-End Visual Speech Recognition for Continuous Spanish, IberSPEECH
Fernando López, Jordi Luque (2022), Iterative pseudo-forced alignment by acoustic CTC loss for self-supervised ASR domain adaptation, IberSPEECH
Wanying Ge, Hemlata Tak, Massimiliano Todisco, Nicholas Evans (2022), On the potential of jointly-optimised solutions to spoofing attack detection and automatic speaker verification, IberSPEECH
Pablo Gimeno, Alfonso Ortega, Antonio Miguel, Eduardo Lleida (2022), A Study on the Use of wav2vec Representations for Multiclass Audio Segmentation, IberSPEECH
Noelia Salor-Burdalo, Ascension Gallardo-Antolin (2022), Respiratory Sound Classification Using an Attention LSTM Model with Mixup Data Augmentation, IberSPEECH
Juan Manuel Martín-Doñas, Iván González Torre, Aitor Álvarez, Joaquin Arellano (2022), The Vicomtech Spoofing-Aware Biometric System for the SASV Challenge, IberSPEECH
John Mendonca, Isabel Trancoso (2022), VoxCeleb-PT – a dataset for a speech processing course, IberSPEECH
Miguel Pastor, Dayana Ribas, Alfonso Ortega, Antonio Miguel, Eduardo Lleida (2022), Cross-Corpus Speech Emotion Recognition with HuBERT Self-Supervised Representation, IberSPEECH
Cristina Luna Jiménez, Ricardo Kleinlein, Syaheerah Lebai Lutfi, Juan M. Montero, Fernando Fernández-Martínez (2022), Analysis of Trustworthiness Recognition models from an aural and emotional perspective, IberSPEECH
Edward L. Campbell, Laura Docío Fernández, Nicholas Cummins, Carmen García Mateo (2022), Speech and Text Processing for Major Depressive Disorder Detection, IberSPEECH
Clara Luis-Mingueza, Esther Rituerto-González, Carmen Peláez-Moreno (2022), Bridging the Semantic Gap with Affective Acoustic Scene Analysis: an Information Retrieval-based Approach, IberSPEECH
Emma Reyner Fuentes, Esther Rituerto González, Clara Luis Mingueza, Carmen Peláez Moreno, Celia López Ongil (2022), Detecting Gender-based Violence aftereffects from Emotional Speech Paralinguistic Features, IberSPEECH
Rodrigo Sousa, Helena Sofia Pinto, Alberto Abad, Daniel Neto, Joaquim Gago (2022), Extraction of structural and semantic features for the identification of Psychosis in European Portuguese, IberSPEECH
José Ángel González, Encarna Segarra, Fernando García-Granada, Emilio Sanchis, Lluis-F Hurtado (2022), An Attentional Extractive Summarization Framework, IberSPEECH
Rui Ribeiro, Luísa Coheur (2022), SUMBot: Summarizing Context in Open-Domain Dialogue Systems, IberSPEECH
Jorge Mira Prats, Marcos Estecha-Garitagoitia, Mario Rodríguez-Cantelar, Luis Fernando D’Haro (2022), Automatic Detection of Inconsistencies in Open-Domain Chatbots, IberSPEECH
Andrés Piñeiro Martín, Carmen García Mateo, Laura Docío Fernández, María del Carmen López Pérez (2022), Ethics Guidelines for the Development of Virtual Assistants for e-Health, IberSPEECH
Asier Gutiérrez-Fandiño, David Pérez-Fernández, Jordi Armengol-Estapé, David Griol, Zoraida Callejas (2022), esCorpius: A Massive Spanish Crawling Corpus, IberSPEECH
Victoria Mingote, Antonio Miguel (2022), Representation and Metric Learning Advances for Deep Neural Network Face and Speaker Biometric Systems, IberSPEECH
Alejandro Gomez-Alanis, Jose Andres Gonzalez-Lopez, Antonio Miguel Peinado Herreros (2022), Voice Biometric Systems based on Deep Neural Networks: A Ph.D. Thesis Overview, IberSPEECH
Juan Manuel Martín-Doñas, Antonio M. Peinado, Angel M. Gomez (2022), Online Multichannel Speech Enhancement combining Statistical Signal Processing and Deep Neural Networks: A Ph.D. Thesis Overview, IberSPEECH
Inma Hernaez, Jose Andres Gonzalez Lopez, Eva Navas, Jose Luis Pérez Córdoba, Ibon Saratxaga, Gonzalo Olivares, Jon Sanchez de la Fuente, Alberto Galdón, Victor Garcia, Jesús del Castillo, Inge Salomons, Eder del Blanco Sierra (2022), ReSSInt project: voice restoration using Silent Speech Interfaces, IberSPEECH
Itziar Aldabe, Aritz Farwell, Eva Navas, Inma Hernaez, German Rigau (2022), ELE Project: an overview of the desk research, IberSPEECH
Mike Rizkalla, Thomas Chan, Emilio Granell, Chara Tsoukala, Aitor Carricondo, Carlos Bailon, María Teresa González, Vicent Alabau (2022), Snorble: An Interactive Children Companion, IberSPEECH
Angel M. Gómez, Victoria E. Sanchez, Antonio M. Peinado, Juan M. Martín-Doñas, Alejandro Gómez-Alanis, Amelia Villegas-Morcillo, Eros Rosello, Manuel Chica, Celia García, Ivan López-Espejo (2022), Fusion of Classical Digital Signal Processing and Deep Learning methods (FTCAPPS), IberSPEECH
Carlos David Martinez Hinarejos, David Gimeno-Gomez, Francisco Casacuberta, Emilio Granell, Roberto Paredes, Moisés Pastor, Enrique Vidal (2022), Spanish Lipreading in Realistic Scenarios: the LLEER project, IberSPEECH
Jose Andres Gonzalez Lopez, Alberto Galdón, Gonzalo Olivares, Sneha Raman, David Murcia, Daniela Paolieri, Pedro Macizo, José L. Pérez-Córdoba, Antonio M. Peinado, Angel Gomez, Victoria E. Sanchez, Ana B. Chica (2022), Clinical Applications of Neuroscience: Locating Language Areas in Epileptic Patients and Restoring Speech in Paralyzed People, IberSPEECH
Juan Alos, Julien Boullié, M. Inés Torres, Eneko Ruiz, Andoni Beristain, Jacobo López Fernández, Iñaki Tellería, Janeth Carolina Carreño, Iker Garay, Arkaitz Carbajo, Amaia Santamaría, Urtzi Zubiate, Jon Ander Arzallus, Francisco Martínez, Adriana Martínez (2022), ORKESTA Comprehensive Solution for the Orchestration of Services and Socio-Sanitary Care at Home, IberSPEECH
Mikel Tainta, Javier Mikel Olaso, M. Inés Torres, Mirian Ecay-Torres, Nekane Balluerka, Naia Ros, Mikel Izquierdo, Mikel Saéz de Asteasu, Usune Etxebarria, Lucía Gayoso, Maider Mateo, Oliver Ibarrondo, Elena Alberdi, Estíbaliz Capetillo-Zárate, Jesus Angel Bravo, Pablo Martínez-Lage (2022), The CITA GO-ON trial: A person-centered, digital, intergenerational, and cost-effective dementia prevention multi-modal intervention model to guide strategic policies facing the demographic challenges of progressive aging, IberSPEECH
Antonio M. Peinado, Alejandro Gomez-Alanis, Jose Andres Gonzalez-Lopez, Angel M. Gomez, Eros Rosello, Manuel Chica-Villar, Jose C. Sanchez-Valera, Jose L. Perez-Cordoba, Victoria Sanchez (2022), The BioVoz Project: Secure Speech Biometrics by Deep Processing Techniques, IberSPEECH
César González-Ferreras, Valentín Cardeñoso-Payo, David Escudero-Mancebo, Carlos Enrique Vivaracho-Pascual, Lourdes Aguilar, Valle Flores-Lucas, Mario Corrales-Astorgano (2022), Automatic evaluation of the pronunciation of people with Down syndrome in an educational video game (EvaProDown), IberSPEECH
Dayana Ribas, Antonio Miguel, Luis Guillen, Jose Javier Castejon, Juan Antonio Navarro, Alfonso Ortega, Luis Benavente (2022), SONOC Platform for Audio and Speech Analytics in Call Centers, IberSPEECH
Haritz Arzelus, Iván G. Torres, Juan Manuel Martín-Doñas, Ander González-Docasal, Aitor Alvarez (2022), The Vicomtech-UPM Speech Transcription Systems for the Albayzín-RTVE 2022 Speech to Text Transcription Challenge, IberSPEECH
Fernando López, Jordi Luque (2022), TID Spanish ASR system for the Albayzin 2022 Speech-to-Text Transcription Challenge, IberSPEECH
Martin Kocour, Jahnavi Umesh, Martin Karafiat, Ján Švec, Fernando López, Jordi Luque, Karel Beneš, Mireia Diez, Igor Szoke, Karel Veselý, Lukáš Burget, Jan Černocký (2022), BCN2BRNO: ASR System Fusion for Albayzin 2022 Speech to Text Challenge, IberSPEECH
Roman Shrestha, Cornelius Glackin, Julie Wall, Nigel Cannings (2022), Intelligent Voice Speaker Recognition and Diarization System for IberSpeech 2022 Albayzin Evaluations Speaker Diarization and Identity Assignment Challenge, IberSPEECH
Antonio Miguel, Alfonso Ortega, Eduardo Lleida (2022), ViVoLAB System Description for the S2TC IberSPEECH-RTVE 2022 challenge, IberSPEECH
Germán Bordel, Luis Javier Rodriguez-Fuentes, Mikel Peñagarikano, Amparo Varona (2022), GTTS Systems for the Albayzin 2022 Speech and Text Alignment Challenge, IberSPEECH
Xin Wang, Junichi Yamagishi (2022), Investigating Self-Supervised Front Ends for Speech Spoofing Countermeasures, Odyssey
Junyi Peng, Chunlei Zhang, Jan "Honza" Černocký, Dong Yu (2022), Progressive Contrastive Learning for Self-Supervised Text-Independent Speaker Verification, Odyssey
Natsuo Yamashita, Shota Horiguchi, Takeshi Homma (2022), Improving the Naturalness of Simulated Conversations for End-to-End Neural Diarization, Odyssey
Woo Hyun Kang, Jahangir Alam, Abderrahim Fathan (2022), Investigation on Mixup Strategies for End-to-End Voice Spoof Detection System, Odyssey
Yijun Gong, Xiao-Lei Zhang (2022), DP-Means: An Efficient Bayesian Nonparametric Model for Speaker Diarization, Odyssey
Diego Castan, Md Hafizur Rahman, Sarah Bakst, Chris Cobo-Kroenke, Mitchell McLaren, Martin Graciarena, Aaron Lawson (2022), Speaker-Targeted Synthetic Speech Detection, Odyssey
Xiaoxiao Miao, Xin Wang, Erica Cooper, Junichi Yamagishi, Natalia Tomashenko (2022), Language-Independent Speaker Anonymization Approach Using Self-Supervised Pre-Trained Models, Odyssey
Yucong Zhang, Qinjian Lin, Weiqing Wang, Lin Yang, Xuyang Wang, Junjie Wang, Ming Li (2022), Low-Latency Online Speaker Diarization with Graph-Based Label Generation, Odyssey
Sandro Cumani, Salvatore Sarni (2022), Impostor Score Statistics as Quality Measures for the Calibration of Speaker Verification Systems, Odyssey
Quan Wang, Yang Yu, Jason Pelecanos, Yiling Huang, Ignacio Lopez Moreno (2022), Attentive Temporal Pooling for Conformer-Based Streaming Language Identification in Long-Form Speech, Odyssey
Rajeev Rikhye, Quan Wang, Qiao Liang, Yanzhang He, Ian McGraw (2022), Closing the Gap Between Single-User and Multi-User VoiceFilter-Lite, Odyssey
Sarala Padi, Seyed Omid Sadjadi, Dinesh Manocha, Ram D. Sriram (2022), Multimodal Emotion Recognition Using Transfer Learning from Speaker Recognition and BERT-Based Models, Odyssey
Jason Pelecanos, Quan Wang, Yiling Huang, Ignacio Lopez Moreno (2022), Parameter-Free Attentive Scoring for Speaker Verification, Odyssey
Wei Liu, Meng Sun, Xiongwei Zhang, Hugo Van hamme, Thomas Fang Zheng (2022), A Multi-Resolution Front-End for End-to-End Speech Anti-Spoofing, Odyssey
Woo Hyun Kang, Jahangir Alam, Abderrahim Fathan (2022), Domain Generalized Speaker Embedding Learning via Mutual Information Minimization, Odyssey
Tanel Alumäe, Kunnar Kukk (2022), Pretraining Approaches for Spoken Language Recognition: TalTech Submission to the OLR 2021 Challenge, Odyssey
Yanxiong Li, Wucheng Wang, Hao Chen, Wenchang Cao, Wei Li, Qianhua He (2022), Few-Shot Speaker Identification Using Depthwise Separable Convolutional Network with Channel Attention, Odyssey
Zuoer Chen, Liang He (2022), A Quick and Effective Speaker Diarization System, Odyssey
Hiroto Kai, Shinnosuke Takamichi, Sayaka Shiota, Hitoshi Kiya (2022), Robustness of Signal Processing-Based Pseudonymization Method Against Decryption Attack, Odyssey
Fuchuan Tong, Siqi Zheng, Haodong Zhou, Xingjia Xie, Qingyang Hong, Lin Li (2022), Deep Representation Decomposition for Rate-Invariant Speaker Verification, Odyssey
Sarah Bakst, Chris Cobo-Kroenke, Aaron Lawson, Mitchell McLaren, Allen Stauffer (2022), Time-Varying Score Reliability Prediction in Speaker Identification, Odyssey
Heinrich Dinkel, Zhiyong Yan, Yongqing Wang, Junbo Zhang, Yujun Wang (2022), An Empirical Study of Weakly Supervised Audio Tagging Embeddings for General Audio Representations, Odyssey
Seyed Omid Sadjadi, Craig Greenberg, Elliot Singer, Lisa Mason, Douglas Reynolds (2022), The NIST CTS Speaker Recognition Challenge, Odyssey
Seyed Omid Sadjadi, Craig Greenberg, Elliot Singer, Lisa Mason, Douglas Reynolds (2022), The 2021 NIST Speaker Recognition Evaluation, Odyssey
David Guennec, Hassan Hajipoor, Gwénolé Lecorvé, Pascal Lintanf, Damien Lolive, Antoine Perquin, Gaëlle Vidal (2022), BreizhCorpus: A Large Breton Language Speech Corpus and Its Use for Text-to-Speech Synthesis, Odyssey
Lantian Li, Di Wang, Wenqiang Du, Dong Wang (2022), C-P Map: A Novel Evaluation Toolkit for Speaker Verification, Odyssey
Hemlata Tak, Massimiliano Todisco, Xin Wang, Jee-weon Jung, Junichi Yamagishi, Nicholas Evans (2022), Automatic Speaker Verification Spoofing and Deepfake Detection Using Wav2vec 2.0 and Data Augmentation, Odyssey
Alexey Sholokhov, Xuechen Liu, Md Sahidullah, Tomi Kinnunen (2022), Baselines and Protocols for Household Speaker Recognition, Odyssey
Wanying Ge, Massimiliano Todisco, Nicholas Evans (2022), Explainable Deepfake and Spoofing Detection: An Attack Analysis Using SHapley Additive exPlanations, Odyssey
Xuechen Liu, Md Sahidullah, Tomi Kinnunen (2022), Spoofing-Aware Speaker Verification with Unsupervised Domain Adaptation, Odyssey
Jesús Villalba, Bengt J. Borgstrom, Saurabh Kataria, Jaejin Cho, Pedro A. Torres-Carrasquillo, Najim Dehak (2022), Advances in Speaker Recognition for Multilingual Conversational Telephone Speech: The JHU-MIT System for NIST SRE20 CTS Challenge, Odyssey
Jesús Villalba, Bengt J. Borgstrom, Saurabh Kataria, Magdalena Rybicka, Carlos D. Castillo, Jaejin Cho, L. Paola García-Perera, Pedro A. Torres-Carrasquillo, Najim Dehak (2022), Advances in Cross-Lingual and Cross-Source Audio-Visual Speaker Recognition: The JHU-MIT System for NIST SRE21, Odyssey
Hexin Liu, Leibny Paola Garcia Perera, Andy W. H. Khong, Justin Dauwels, Suzy J. Styles, Sanjeev Khudanpur (2022), Enhancing Language Identification Using Dual-Mode Model with Knowledge Distillation, Odyssey
Madina Abdrakhmanova, Saniya Abushakimova, Yerbolat Khassanov, Huseyin Atakan Varol (2022), A Study of Multimodal Person Verification Using Audio-Visual-Thermal Data, Odyssey
Jingze Lu, Yuxiang Zhang, Wenchao Wang, Pengyuan Zhang (2022), Robust Cross-SubBand Countermeasure Against Replay Attacks, Odyssey
Joonas Kalda, Tanel Alumäe (2022), Collar-Aware Training for Streaming Speaker Change Detection in Broadcast Speech, Odyssey
Galina Lavrentyeva, Sergey Novoselov, Vladimir Volokhov, Anastasia Avdeeva, Aleksei Gusev, Alisa Vinogradova, Igor Korsunov, Alexander Kozlov, Timur Pekhovsky, Andrey Shulipa, Evgeny Smirnov, Vasily Galyuk (2022), STC Speaker Recognition System for the NIST SRE 2021, Odyssey
Hye-jin Shim, Hemlata Tak, Xuechen Liu, Hee-Soo Heo, Jee-weon Jung, Joon Son Chung, Soo-Whan Chung, Ha-Jin Yu, Bong-Jin Lee, Massimiliano Todisco, Héctor Delgado, Kong Aik Lee, Md Sahidullah, Tomi Kinnunen, Nicholas Evans (2022), Baseline Systems for the First Spoofing-Aware Speaker Verification Challenge: Score and Embedding Fusion, Odyssey
Nikita Kuzmin, Igor Fedorov, Alexey Sholokhov (2022), Magnitude-Aware Probabilistic Speaker Embeddings, Odyssey
Jincheng He, Yuanyuan Bao, Na Xu, Hongfeng Li, Shicong Li, Linzhang Wang, Fei Xiang, Ming Li (2022), Single-Channel Target Speaker Separation Using Joint Training with Target Speaker's Pitch Information, Odyssey
Haibin Wu, Jiawen Kang, Lingwei Meng, Yang Zhang, Xixin Wu, Zhiyong Wu, Hung-yi Lee, Helen Meng (2022), Tackling Spoofing-Aware Speaker Verification with Multi-Model Fusion, Odyssey
Mohammad MohammadAmini, Driss Matrouf, Jean-François Bonastre, Sandipana Dowerah, Romain Serizel, Denis Jouvet (2022), Learning Noise Robust ResNet-Based Speaker Embedding for Speaker Recognition, Odyssey
Zhuo Gong, Daisuke Saito, Longfei Yang, Takahiro Shinozaki, Sheng Li, Hisashi Kawai, Nobuaki Minematsu (2022), Self-Adaptive Multilingual ASR Rescoring with Language Identification and Unified Language Model, Odyssey
Longting Xu, Mianxin Tian, Xing Guo, Zhiyong Shan, Jie Jia, Yiyuan Peng, Jichen Yang, Rohan Kumar Das (2022), A Novel Feature Based on Graph Signal Processing for Detection of Physical Access Attacks, Odyssey
Chenguang Hu, Qingran Zhan, Miao Liu, Xiang Xie (2022), BIT Submission for the Conversational Speaker Diarization Challenge, Odyssey
Anna Silnova, Themos Stafylakis, Ladislav Mošner, Oldřich Plchot, Johan Rohdin, Pavel Matějka, Lukáš Burget, Ondřej Glembek, Niko Brummer (2022), Analyzing Speaker Verification Embedding Extractors and Back-Ends Under Language and Channel Mismatch, Odyssey
Anand Therattil, Priyanka Gupta, Piyushkumar K. Chodingala, Hemant A. Patil (2022), Teager Energy Based-Detection of One-point and Two-point Replay Attacks: Towards Cross-Database Generalization, Odyssey
Sandip Ghimire, Tomi Kinnunen, Rosa González Hautamäki (2022), Gamified Speaker Comparison by Listening, Odyssey
Haoran Sun, Chen Chen, Lantian Li, Dong Wang (2022), Cycleflow: Purify Information Factors by Cycle Loss, Odyssey
You Zhang, Ge Zhu, Zhiyao Duan (2022), A Probabilistic Fusion Framework for Spoofing Aware Speaker Verification, Odyssey
Jahangir Alam, Radek Beneš, Marián Beszédeš, Lukáš Burget, Mohamed Dahmane, Abderrahim Fathan, Hamed Ghodrati, Ondřej Glembek, Woo Hyun Kang, Pavel Matějka, Ladislav Mošner, Oldřich Plchot, Johan Rohdin, Anna Silnova, Themos Stafylakis (2022), Development of ABC Systems for the 2021 Edition of NIST Speaker Recognition Evaluation, Odyssey
Yosef Solewicz, Noa Cohen, Johan Rohdin, Srikanth Madikeri, Jan "Honza" Černocký (2022), Speaker Recognition on Mono-Channel Telephony Recordings, Odyssey
Haoxu Wang, Yan Jia, Zeqing Zhao, Xuyang Wang, Junjie Wang, Ming Li (2022), Generating TTS Based Adversarial Samples for Training Wake-Up Word Detection Systems Against Confusing Words, Odyssey
Jahangir Alam, Woo Hyun Kang, Abderrahim Fathan (2022), Hybrid Neural Network-Based Deep Embedding Extractors for Text-Independent Speaker Verification, Odyssey
Jintao Kang, Aijun Li, Jingyang Li (2022), Formant Dynamics of Chinese Compound Vowels with Implications for Forensic Speaker Identification, Odyssey
YingWei Tan, XueFeng Ding (2022), The Volkswagen-Mobvoi System for CN-Celeb Speaker Recognition Challenge 2022, Odyssey
Woo Hyun Kang, Jahangir Alam (2022), Investigation on Deep Speaker Embedding Extraction Methods for Multi-Genre Speaker Verification, Odyssey
Jialin Zhang, Qinghua Ren, Youcai Qin, Zikai Wan, Qirong Mao (2022), Cross-Scene Speaker Verification Based on Dynamic Convolution for the CNSRC 2022 Challenge, Odyssey
Xinmei Su, Qingran Zhan, Chenguang Hu, Xiang Xie (2022), Combination of Multiple Embeddings for Speaker Retrieval, Odyssey
Mary E. Beckman (2015), The emergence of compositional structure in language evolution and development, Interspeech
Ruhi Sarikaya (2015), The technology powering personal digital assistants, Interspeech
Katrin Amunts (2015), The HBP-atlas — concept, perspectives, and application for language and speech research, Interspeech
Klaus Scherer (2015), Voices of power, passion, and personality, Interspeech
Tara N. Sainath, Ron J. Weiss, Andrew Senior, Kevin W. Wilson, Oriol Vinyals (2015), Learning the speech front-end with raw waveform CLDNNs, Interspeech
Mayank Bhargava, Richard Rose (2015), Architectures for deep neural network based acoustic models defined over windowed speech waveforms, Interspeech
Dimitri Palaz, Mathew Magimai-Doss, Ronan Collobert (2015), Analysis of CNN-based speech recognition system using raw speech as input, Interspeech
Tetsuji Ogawa, Kenshiro Ueda, Kouichi Katsurada, Tetsunori Kobayashi, Tsuneo Nitta (2015), Bilinear map of filter-bank outputs for DNN-based speech recognition, Interspeech
Payton Lin, Dau-Cheng Lyu, Yun-Fan Chang, Yu Tsao (2015), Speech recognition with temporal neural networks, Interspeech
Pavel Golik, Zoltán Tüske, Ralf Schlüter, Hermann Ney (2015), Convolutional neural networks for acoustic modeling of raw time signal in LVCSR, Interspeech
Ulrike Glavitsch, Lei He, Volker Dellwo (2015), Stable and unstable intervals as a basic segmentation procedure of the speech signal, Interspeech
Andreas Windmann, Juraj Šimko, Petra Wagner (2015), Polysyllabic shortening and word-final lengthening in English, Interspeech
Anders Eriksson, Mattias Heldner (2015), The acoustics of word stress in English as a function of stress level and speaking style, Interspeech
Katharina Zahner, Muna Pohl, Bettina Braun (2015), Pitch accent distribution in German infant-directed speech, Interspeech
Hansjörg Mixdorff, Christian Cossio-Mercado, Angelika Hönemann, Jorge Gurlekian, Diego Evin, Humberto Torres (2015), Acoustic correlates of perceived syllable prominence in German, Interspeech
Simone Simonetti, Jeesun Kim, Chris Davis (2015), Cross-modality matching of linguistic and emotional prosody, Interspeech
Jan Michalsky (2015), Pitch scaling as a perceptual cue for questions in German, Interspeech
Uwe D. Reichel, Katalin Mády, Štefan Beňuš (2015), Parameterization of prosodic headedness, Interspeech
Biswajit Dev Sarma, Priyankoo Sarmah, Wendy Lalhminghlui, S. R. Mahadeva Prasanna (2015), Detection of Mizo tones, Interspeech
Sophie Repp, Lena Rosin (2015), The intonation of echo wh-questions, Interspeech
Farhat Jabeen, Tina Bögel, Miriam Butt (2015), Immediately postverbal questions in Urdu, Interspeech
Katalin Mády (2015), Prosodic (non-)realisation of broad, narrow and contrastive focus in Hungarian: a production and a perception study, Interspeech
Štefan Beňuš, Uwe D. Reichel, Juraj Šimko (2015), F0 discontinuity as a marker of prosodic boundary strength in Lombard speech, Interspeech
Cédric Gendrot, Martine Adda-Decker, Yaru Wu (2015), Comparing journalistic and spontaneous speech: prosodic and spectral analysis, Interspeech
Nadja Schauffler, Katrin Schweitzer (2015), Rhythm influences the tonal realisation of focus, Interspeech
Bistra Andreeva, Bernd Möbius, Grazyna Demenko, Frank Zimmerer, Jeanin Jügler (2015), Linguistic measures of pitch range in Slavic and Germanic languages, Interspeech
Chunan Qiu, Jie Liang (2015), The effect of stress on vowel space in Daxi Hakka Chinese, Interspeech
Maria O'Reilly, Ailbhe Ní Chasaide (2015), Declination, peak height and pitch level in declaratives and questions of South Connaught Irish, Interspeech
Priyankoo Sarmah, Leena Dihingia, Wendy Lalhminghlui (2015), Contextual variation of tones in Mizo, Interspeech
Daniela Wochner, Jana Schlegel, Nicole Dehé, Bettina Braun (2015), The prosodic marking of rhetorical questions in German, Interspeech
Tudor-Cătălin Zorilă, Yannis Stylianou (2015), A fast algorithm for improved intelligibility of speech-in-noise based on frequency and time domain energy reallocation, Interspeech
Maria Koutsogiannaki, Petko N. Petkov, Yannis Stylianou (2015), Intelligibility enhancement of casual speech for reverberant environments inspired by clear speech properties, Interspeech
A. Ben Jemaa, N. Mechergui, G. Courtois, A. Mudry, S. Djaziri-Larbi, M. Turki, H. Lissek, M. Jaidane (2015), Intelligibility enhancement of vocal announcements for public address systems: a design for all through a presbycusis pre-compensation filter, Interspeech
Henning Schepker, David Hülsmeier, Jan Rennies, Simon Doclo (2015), Model-based integration of reverberation for noise-adaptive near-end listening enhancement, Interspeech
Sebastian Rottschäfer, Hendrik Buschmeier, Herwin van Welbergen, Stefan Kopp (2015), Online Lombard adaptation in incremental speech synthesis, Interspeech
Emma Jokinen, Ulpu Remes, Paavo Alku (2015), Comparison of Gaussian process regression and Gaussian mixture models in spectral tilt modelling for intelligibility enhancement of telephone speech, Interspeech
Naveen Kumar, Shrikanth S. Narayanan (2015), A discriminative reliability-aware classification model with applications to intelligibility classification in pathological speech, Interspeech
J. R. Orozco-Arroyave, Florian Hönig, J. D. Arias-Londoño, J. F. Vargas-Bonilla, Sabine Skodda, J. Rusz, Elmar Nöth (2015), Voiced/unvoiced transitions in speech as a potential bio-marker to detect Parkinson's disease, Interspeech
T. Villa-Cañas, J. D. Arias-Londoño, J. R. Orozco-Arroyave, J. F. Vargas-Bonilla, Elmar Nöth (2015), Low-frequency components analysis in running speech for the automatic detection of Parkinson's disease, Interspeech
J. C. Vásquez-Correa, T. Arias-Vergara, J. R. Orozco-Arroyave, J. F. Vargas-Bonilla, J. D. Arias-Londoño, Elmar Nöth (2015), Automatic detection of Parkinson's disease from continuous speech recorded in non-controlled noise conditions, Interspeech
Nicholas Cummins, Vidhyasaharan Sethu, Julien Epps, Jarek Krajewski (2015), Relevance vector machine for depression prediction, Interspeech
Erik Marchi, Björn Schuller, Simon Baron-Cohen, Ofer Golan, Sven Bölte, Prerna Arora, Reinhold Häb-Umbach (2015), Typicality and emotion in the voice of children with autism spectrum condition: evidence across three languages, Interspeech
Chunxi Liu, Puyang Xu, Ruhi Sarikaya (2015), Deep contextual language understanding in spoken dialogue systems, Interspeech
Yik-Cheung Tam, Yangyang Shi, Hunk Chen, Mei-Yuh Hwang (2015), RNN-based labeled data generation for spoken language understanding, Interspeech
Vedran Vukotic, Christian Raymond, Guillaume Gravier (2015), Is it time to switch to word embedding and recurrent neural networks for spoken language understanding?, Interspeech
Suman Ravuri, Andreas Stolcke (2015), Recurrent neural network and LSTM models for lexical utterance classification, Interspeech
Hung-tsung Lu, Yuan-ming Liou, Hung-yi Lee, Lin-shan Lee (2015), Semantic retrieval of personal photos using a deep autoencoder fusing visual features with speech annotations represented as word/paragraph vectors, Interspeech
Mohamed Morchid, Richard Dufour, Driss Matrouf (2015), A comparison of normalization techniques applied to latent space representations for speech analytics, Interspeech
Imran Sheikh, Irina Illina, Dominique Fohr (2015), Study of entity-topic models for OOV proper name retrieval, Interspeech
Simon Boutin, Réal Tremblay, Patrick Cardinal, Doug Peters, Pierre Dumouchel (2015), Audio quotation marks for natural language understanding, Interspeech
Xiaohao Yang, Jia Liu (2015), Using word confusion networks for slot filling in spoken language understanding, Interspeech
Justin Chiu, Yajie Miao, Alan W. Black, Alexander I. Rudnicky (2015), Distributed representation-based spoken word sense induction, Interspeech
Sheng-syun Shen, Hung-yi Lee, Shang-wen Li, Victor Zue, Lin-shan Lee (2015), Structuring lectures in massive open online courses (MOOCs) for efficient learning by linking similar sections and predicting prerequisites, Interspeech
Delphine Charlet, Géraldine Damnati, Jérémy Trione (2015), News talk-show chaptering with journalistic genres, Interspeech
Vikram Ramanarayanan, Lei Chen, Chee Wee Leong, Gary Feng, David Suendermann-Oeft (2015), An analysis of time-aggregated and time-series features for scoring different aspects of multimodal presentation data, Interspeech
David N. Racca, Gareth J. F. Jones (2015), Incorporating prosodic prominence evidence into term weights for spoken content retrieval, Interspeech
Kuan-Yu Chen, Shih-Hung Liu, Hsin-Min Wang, Berlin Chen, Hsin-Hsi Chen (2015), Leveraging word embeddings for spoken document summarization, Interspeech
Vincent Renkens, Hugo Van hamme (2015), Mutually exclusive grounding for weakly supervised non-negative matrix factorisation, Interspeech
Emanuele Bastianelli, Danilo Croce, Roberto Basili, Daniele Nardi (2015), Using semantic maps for robust natural language interaction with robots, Interspeech
Yi Luan, Shinji Watanabe, Bret Harsham (2015), Efficient learning for spoken language understanding tasks with word embedding based pre-training, Interspeech
Emmanuel Ferreira, Bassam Jabaian, Fabrice Lefèvre (2015), Zero-shot semantic parser for spoken language understanding, Interspeech
Jeremie Tafforeau, Thierry Artieres, Benoit Favre, Frederic Bechet (2015), Adapting lexical representation and OOV handling from written to spoken language with word embedding, Interspeech
Xiaohao Yang, Jia Liu (2015), Dialog state tracking using long short-term memory neural networks, Interspeech
José Lopes, Giampiero Salvi, Gabriel Skantze, Alberto Abad, Joakim Gustafson, Fernando Batista, Raveesh Meena, Isabel Trancoso (2015), Detecting repetitions in spoken dialogue systems using phonetic distances, Interspeech
Paul A. Crook, Jean-Philippe Robichaud, Ruhi Sarikaya (2015), Multi-language hypotheses ranking and domain tracking for open domain dialogue systems, Interspeech
Vijay Solanki, Alessandro Vinciarelli, Jane Stuart-Smith, Rachel Smith (2015), Measuring mimicry in task-oriented conversations: degree of mimicry is related to task difficulty, Interspeech
Kornel Laskowski (2015), Auto-imputing radial basis functions for neural-network turn-taking models, Interspeech
Quim Llimona, Jordi Luque, Xavier Anguera, Zoraida Hidalgo, Souneil Park, Nuria Oliver (2015), Effect of gender and call duration on customer satisfaction in call center big data, Interspeech
Zoraida Callejas, David Griol (2015), Using profile similarity to measure agreement in personality perception, Interspeech
Shizuka Nakamura, Miki Watanabe, Yuichiro Yoshikawa, Kohei Ogawa, Hiroshi Ishiguro (2015), Relieving mental stress of speakers using a tele-operated robot in foreign language speech education, Interspeech
Agustín Gravano, Štefan Beňuš, Rivka Levitan, Julia Hirschberg (2015), Backward mimicry and forward influence in prosodic contour choice in standard American English, Interspeech
Shammur Absar Chowdhury, Morena Danieli, Giuseppe Riccardi (2015), The role of speakers and context in classifying competition in overlapping speech, Interspeech
George Christodoulides, Mathieu Avanzi (2015), Automatic detection and annotation of disfluencies in spoken French corpora, Interspeech
Dilek Hakkani-Tür, Yun-Cheng Ju, Geoffrey Zweig, Gokhan Tur (2015), Clustering novel intents in a conversational interaction system with semantic parsing, Interspeech
Vladimir Despotovic, Oliver Walter, Reinhold Haeb-Umbach (2015), Semantic analysis of spoken input using Markov logic networks, Interspeech
Jan Švec, Adam Chýlek, Luboš Šmídl (2015), Hierarchical discriminative model for spoken language understanding based on convolutional neural network, Interspeech
Yun-Nung Chen, William Yang Wang, Alexander I. Rudnicky (2015), Learning semantic hierarchy with distributed representations for unsupervised spoken language understanding, Interspeech
Éva Székely, Mark T. Keane, Julie Carson-Berndsen (2015), The effect of soft, modal and loud voice levels on entrainment in noisy conditions, Interspeech
Benjamin R. Cowan, Holly P. Branigan (2015), Does voice anthropomorphism affect lexical alignment in speech-based human-computer dialogue?, Interspeech
Ning Ma, Guy J. Brown, Jose A. Gonzalez (2015), Exploiting top-down source models to improve binaural localisation of multiple sources in reverberant environments, Interspeech
Christopher Schymura, Fiete Winter, Dorothea Kolossa, Sascha Spors (2015), Binaural sound source localisation and tracking using a dynamic spherical head model, Interspeech
Tobias May, Thomas Bentsen, Torsten Dau (2015), The role of temporal resolution in modulation-based speech segregation, Interspeech
Hendrik Kayser, Constantin Spille, Daniel Marquardt, Bernd T. Meyer (2015), Improving automatic speech recognition in spatially-aware hearing aids, Interspeech
Randy Gomez, Levko Ivanchuk, Keisuke Nakamura, Takeshi Mizumoto, Kazuhiro Nakadai (2015), Dereverberation for active human-robot communication robust to speaker's face orientation, Interspeech
Nanxin Chen, Yanmin Qian, Kai Yu (2015), Multi-task learning for text-dependent speaker verification, Interspeech
Themos Stafylakis, Patrick Kenny, Md. Jahangir Alam, Marcel Kockmann (2015), JFA for speaker recognition with random digit strings, Interspeech
Elena Knyazeva, Guillaume Wisniewski, Hervé Bredin, François Yvon (2015), Structured prediction for speaker identification in TV series, Interspeech
Sandro Cumani, Pietro Laface, Farzana Kulsoom (2015), Speaker recognition by means of acoustic and phonetically informed GMMs, Interspeech
Ashish Panda (2015), A fast approach to psychoacoustic model compensation for robust speaker recognition in additive noise, Interspeech
Danila Doroshin, Nikolay Lubimov, Marina Nastasenko, Mikhail Kotov (2015), Blind score normalization method for PLDA based speaker recognition, Interspeech
Sergey Novoselov, Timur Pekhovsky, Oleg Kudashev, Valentin S. Mendelev, Alexey Prudnikov (2015), Non-linear PLDA for i-vector speaker verification, Interspeech
Carlos Vaquero, Patricia Rodríguez (2015), On the need of template protection for voice authentication, Interspeech
Finnian Kelly, John H. L. Hansen (2015), Evaluation and calibration of short-term aging effects in speaker verification, Interspeech
Liping Chen, Kong Aik Lee, Bin Ma, Wu Guo, Haizhou Li, Li-Rong Dai (2015), Phone-centric local variability vector for text-constrained speaker verification, Interspeech
Kuruvachan K. George, C. Santhosh Kumar, K I Ramachandran, Ashish Panda (2015), Cosine distance features for robust speaker verification, Interspeech
Sayaka Shiota, Fernando Villavicencio, Junichi Yamagishi, Nobutaka Ono, Isao Echizen, Tomoko Matsui (2015), Voice liveness detection algorithms based on pop noise caused by human breath for automatic speaker verification, Interspeech
Antti Hurmalainen, Rahim Saeidi, Tuomas Virtanen (2015), Noise robust speaker recognition with convolutive sparse coding, Interspeech
Md. Jahangir Alam, Patrick Kenny, Themos Stafylakis (2015), Combining amplitude and phase-based features for speaker verification with short duration utterances, Interspeech
Kong Aik Lee, Anthony Larcher, Guangsen Wang, Patrick Kenny, Niko Brümmer, David van Leeuwen, Hagai Aronowitz, Marcel Kockmann, Carlos Vaquero, Bin Ma, Haizhou Li, Themos Stafylakis, Md. Jahangir Alam, Albert Swart, Javier Perez (2015), The RedDots data collection for speaker recognition, Interspeech
Yongjun He, Chen Chen, Jiqing Han (2015), Noise-robust speaker recognition based on morphological component analysis, Interspeech
Andreas Nautsch, Rahim Saeidi, Christian Rathgeb, Christoph Busch (2015), Analysis of mutual duration and noise effects in speaker recognition: benefits of condition-matched cohort selection in score normalization, Interspeech
Josué Fredes, José Novoa, Victor Poblete, Simon King, Richard M. Stern, Néstor Becerra Yoma (2015), Robustness to additive noise of locally-normalized cepstral coefficients in speaker verification, Interspeech
Navid Shokouhi, John H. L. Hansen (2015), Probabilistic linear discriminant analysis for robust speaker identification in co-channel speech, Interspeech
Hongcui Wang, Di Jin, Lantian Li, Jianwu Dang (2015), Community detection with manifold learning on speaker i-vector space for Chinese, Interspeech
Sree Harsha Yella, Andreas Stolcke (2015), A comparison of neural network feature transforms for speaker diarization, Interspeech
Ilya Shapiro, Neta Rabin, Irit Opher, Itshak Lapidot (2015), Clustering short push-to-talk segments, Interspeech
Anna Fedorova, Ondřej Glembek, Tomi Kinnunen, Pavel Matějka (2015), Exploring ANN back-ends for i-vector based speaker age estimation, Interspeech
Désiré Bansé, George R. Doddington, Daniel Garcia-Romero, John J. Godfrey, Craig S. Greenberg, Jaime Hernández-Cordero, John M. Howard, Alvin F. Martin, Lisa P. Mason, Alan McCree, Douglas A. Reynolds (2015), Analysis of the second phase of the 2013-2014 i-vector machine learning challenge, Interspeech
Alvin F. Martin, Craig S. Greenberg, John M. Howard, Désiré Bansé, George R. Doddington, Jaime Hernández-Cordero, Lisa P. Mason (2015), NIST language recognition evaluation — plans for 2015, Interspeech
Brecht Desplanques, Kris Demuynck, Jean-Pierre Martens (2015), Factor analysis for speaker segmentation and improved speaker diarization, Interspeech
Koji Inoue, Yukoh Wakabayashi, Hiromasa Yoshimoto, Katsuya Takanashi, Tatsuya Kawahara (2015), Enhanced speaker diarization with detection of backchannels using eye-gaze information in poster conversations, Interspeech
Héctor Delgado, Xavier Anguera, Corinne Fredouille, Javier Serrano (2015), Novel clustering selection criterion for fast binary key speaker diarization, Interspeech
Gregory Sell, Daniel Garcia-Romero, Alan McCree (2015), Speaker diarization with i-vectors from DNN senone posteriors, Interspeech
Abraham Woubie, Jordi Luque, Javier Hernando (2015), Using voice-quality measurements with prosodic and spectral features for speaker diarization, Interspeech
Srikanth Madikeri, Ivan Himawan, Petr Motlicek, Marc Ferras (2015), Integrating online i-vector extractor with information bottleneck based speaker diarization system, Interspeech
Tuomo Raitio, Lauri Juvela, Antti Suni, Martti Vainio, Paavo Alku (2015), Phase perception of the glottal excitation of vocoded speech, Interspeech
Sunayana Sitaram, Serena Jeblee, Alan W. Black (2015), Using acoustics to improve pronunciation for synthesis of low resource languages, Interspeech
Tadashi Inai, Sunao Hara, Masanobu Abe, Yusuke Ijima, Noboru Miyazaki, Hideyuki Mizuno (2015), Sub-band text-to-speech combining sample-based spectrum with statistically generated spectrum, Interspeech
Heng Lu, Wei Zhang, Xu Shao, Quan Zhou, Wenhui Lei, Hongbin Zhou, Andrew Breen (2015), Pruning redundant synthesis units based on static and delta unit appearance frequency, Interspeech
Yamato Ohtani, Yu Nasu, Masahiro Morita, Masami Akamine (2015), Emotional transplant in statistical speech synthesis based on emotion additive model, Interspeech
Xurong Xie, Xunying Liu, Lan Wang, Rongfeng Su (2015), Generalized variable parameter HMMs based acoustic-to-articulatory inversion, Interspeech
Seyed Hamidreza Mohammadi, Alexander Kain (2015), Semi-supervised training of a voice conversion mapping function using a joint-autoencoder, Interspeech
Stefan Huber, Axel Roebel (2015), On glottal source shape parameter transformation using a novel deterministic and stochastic speech analysis and synthesis system, Interspeech
Yi-Chin Huang, Chung-Hsien Wu, Ming-Ge Shie (2015), Fluent personalized speech synthesis with prosodic word-level spontaneous speech generation, Interspeech
Yuji Oshima, Shinnosuke Takamichi, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura (2015), Non-native speech synthesis preserving speaker individuality based on partial correction of prosodic and phonetic characteristics, Interspeech
Markus Toman, Michael Pucher (2015), Evaluation of state mapping based foreign accent conversion, Interspeech
Zhizheng Wu, Simon King (2015), Minimum trajectory error training for deep neural networks, combined with stacked bottleneck features, Interspeech
Yang Wang, Minghao Yang, Zhengqi Wen, Jianhua Tao (2015), Combining extreme learning machine and decision tree for duration prediction in HMM based speech synthesis, Interspeech
Duy Khanh Ninh, Yoichi Yamashita (2015), F0 parameterization of glottalized tones for HMM-based Vietnamese TTS, Interspeech
Thomas Merritt, Junichi Yamagishi, Zhizheng Wu, Oliver Watts, Simon King (2015), Deep neural network context embeddings for model selection in rich-context HMM synthesis, Interspeech
Bo Chen, Zhehuai Chen, Jiachen Xu, Kai Yu (2015), An investigation of context clustering for statistical speech synthesis with deep neural network, Interspeech
Oliver Watts, Zhizheng Wu, Simon King (2015), Sentence-level control vectors for deep neural network speech synthesis, Interspeech
Simon Betz, Petra Wagner, David Schlangen (2015), Micro-structure of disfluencies: basics for conversational speech synthesis, Interspeech
György Szaszák, András Beke, Gábor Olaszy, Bálint Pál Tóth (2015), Using automatic stress extraction from audio for improved prosody modelling in speech synthesis, Interspeech
Pierre Lanchantin, Christophe Veaux, Mark J. F. Gales, Simon King, Junichi Yamagishi (2015), Reconstructing voices within the multiple-average-voice-model framework, Interspeech
Ye Kyaw Thu, Win Pa Pa, Jinfu Ni, Yoshinori Shiga, Andrew Finch, Chiori Hori, Hisashi Kawai, Eiichiro Sumita (2015), HMM based Myanmar text to speech system, Interspeech
Shinji Takaki, SangJin Kim, Junichi Yamagishi, JongJin Kim (2015), Multiple feed-forward deep neural networks for statistical parametric speech synthesis, Interspeech
Kaisheng Yao, Geoffrey Zweig (2015), Sequence-to-sequence neural net models for grapheme-to-phoneme conversion, Interspeech
Rosie Kay, Oliver Watts, Roberto Barra Chicote, Cassie Mayo (2015), Knowledge versus data in TTS: evaluation of a continuum of synthesis systems, Interspeech
Steffen Eger (2015), Improving G2P from Wiktionary and other (web) resources, Interspeech
Chuang Ding, Pengcheng Zhu, Lei Xie (2015), BLSTM neural networks for speech driven head motion synthesis, Interspeech
Patrick Lumban Tobing, Kazuhiro Kobayashi, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura (2015), Articulatory controllable speech modification based on Gaussian mixture models with direct waveform modification using spectrum differential, Interspeech
Thomas Le Cornu, Ben Milner (2015), Reconstructing intelligible audio speech from visual speech features, Interspeech
Sunayana Sitaram, Alok Parlikar, Gopala Krishna Anumanchipalli, Alan W. Black (2015), Universal grapheme-based speech synthesis, Interspeech
Mirjam Wester, Matthew Aylett, Marcus Tomalin, Rasmus Dall (2015), Artificial personality and disfluency, Interspeech
Marc Evrard, Samuel Delalez, Christophe d'Alessandro, Albert Rilliard (2015), Comparison of chironomic stylization versus statistical modeling of prosody for expressive speech synthesis, Interspeech
Luc Ardaillon, Gilles Degottex, Axel Roebel (2015), A multi-layer F0 model for singing voice synthesis using a B-spline representation with intuitive controls, Interspeech
Igor Jauk, Antonio Bonafonte, Paula Lopez-Otero, Laura Docio-Fernandez (2015), Creating expressive synthetic voices by unsupervised clustering of audiobooks, Interspeech
Sandesh Aryal, Ricardo Gutierrez-Osuna (2015), Articulatory-based conversion of foreign accents with deep neural networks, Interspeech
Jindřich Matoušek, Daniel Tihelka (2015), Anomaly-based annotation errors detection in TTS corpora, Interspeech
Katrin Schweitzer, Markus Gärtner, Arndt Riester, Ina Rösiger, Kerstin Eckart, Jonas Kuhn, Grzegorz Dogil (2015), Analysing automatic descriptions of intonation with ICARUS, Interspeech
Nancy F. Chen, Rong Tong, Darren Wee, Peixuan Lee, Bin Ma, Haizhou Li (2015), iCALL corpus: Mandarin Chinese spoken by non-native speakers of European descent, Interspeech
Ka Ho Wong, Yu Ting Yeung, Edwin H. Y. Chan, Patrick C. M. Wong, Gina-Anne Levow, Helen Meng (2015), Development of a Cantonese dysarthric speech corpus, Interspeech
Harish Arsikere, Sonal Patil, Ranjeet Kumar, Kundan Shrivastava, Om Deshmukh (2015), Stylex: a corpus of educational videos for research on speaking styles and their impact on engagement and learning, Interspeech
Doğan Can, David C. Atkins, Shrikanth S. Narayanan (2015), A dialog act tagging approach to behavioral coding: a case study of addiction counseling conversations, Interspeech
Valentina Vapnarsky, Claude Barras, Cédric Becquey, David Doukhan, Martine Adda-Decker, Lori Lamel (2015), Analysing rhythm in ritual discourse in Yucatec Maya using automatic speech alignment, Interspeech
Madina Hasan, Rama Doddipatla, Thomas Hain (2015), Noise-matched training of CRF based sentence end detection models, Interspeech
Jianjing Kuang, Mark Liberman (2015), The effect of spectral slope on pitch perception, Interspeech
Honghao Bao, Wenhuan Lu, Kiyoshi Honda, Jianguo Wei, Qiang Fang, Jianwu Dang (2015), Combined cine- and tagged-MRI for tracking landmarks on the tongue surface, Interspeech
Guillaume Barbier, Louis-Jean Boë, Guillaume Captier, Rafael Laboissière (2015), Human vocal tract growth: a longitudinal study of the development of various anatomical structures, Interspeech
Ganesh Sivaraman, Vikramjit Mitra, Mark K. Tiede, Elliot Saltzman, Louis Goldstein, Carol Espy-Wilson (2015), Analysis of coarticulated speech using estimated articulatory trajectories, Interspeech
Guillaume Barbier, Pascal Perrier, Lucie Ménard, Yohan Payan, Mark K. Tiede, Joseph S. Perkell (2015), Speech planning in 4-year-old children versus adults: acoustic and articulatory analyses, Interspeech
Tokihiko Kaburagi (2015), Morphological and acoustic analysis of the vocal tract using a multi-speaker volumetric MRI dataset, Interspeech
Zisis Iason Skordilis, Vikram Ramanarayanan, Louis Goldstein, Shrikanth S. Narayanan (2015), Experimental assessment of the tongue incompressibility hypothesis during speech production, Interspeech
Radek Fér, Pavel Matějka, František Grézl, Oldřich Plchot, Jan Černocký (2015), Multilingual bottleneck features for language recognition, Interspeech
Alan McCree, Daniel Garcia-Romero (2015), DNN senone MAP multinomial i-vectors for phonotactic language recognition, Interspeech
Yan Song, Xinhai Hong, Bing Jiang, Ruilian Cui, Ian McLoughlin, Li-Rong Dai (2015), Deep bottleneck network based i-vector representation for language identification, Interspeech
Alicia Lozano-Diez, Ruben Zazo-Candil, Javier Gonzalez-Dominguez, Doroteo T. Toledano, Joaquin Gonzalez-Rodriguez (2015), An end-to-end approach to language identification in short utterances using convolutional neural networks, Interspeech
Ville Hautamäki, Sabato Marco Siniscalchi, Hamid Behravan, Valerio Mario Salerno, Ivan Kukanov (2015), Boosting universal speech attributes classification with deep neural network for foreign accent characterization, Interspeech
Wang Geng, Jie Li, Shanshan Zhang, Xinyuan Cai, Bo Xu (2015), Multilingual tandem bottleneck feature for language identification, Interspeech
Afsaneh Asaei, Milos Cernak, Hervé Bourlard (2015), On compressibility of neural network phonological features for low bit rate speech coding, Interspeech
Michał Lenarczyk (2015), Robust and accurate LSF location with Laguerre method, Interspeech
Jochen Issing, Nikolaus Färber, Reinhard German (2015), Interactivity-aware playout adaptation, Interspeech
Jochen Issing, Nikolaus Färber, Reinhard German (2015), Advanced time shrinking using a drop classifier based on codec features, Interspeech
Andrew Hines, Eoin Gillen, Naomi Harte (2015), Measuring and monitoring speech quality for voice over IP with POLQA, ViSQOL and P.563, Interspeech
Laura Fernández Gallardo, Sebastian Möller (2015), Towards the prediction of human speaker identification performance from measured speech quality, Interspeech
M. Levit, Andreas Stolcke, R. Subba, S. Parthasarathy, S. Chang, S. Xie, T. Anastasakos, Benoit Dumoulin (2015), Personalization of word-phrase-entity language models, Interspeech
Akio Kobayashi, Manon Ichiki, Takahiro Oku, Kazuo Onoe, Shoei Sato (2015), Discriminative bilinear language modeling for broadcast transcriptions, Interspeech
Xi Ma, Xiaoxi Wang, Dong Wang, Zhiyong Zhang (2015), Recognize foreign low-frequency words with similar pairs, Interspeech
Ryo Masumura, Taichi Asami, Takanobu Oba, Hirokazu Masataki, Sumitaka Sakauchi, Akinori Ito (2015), Combinations of various language model technologies including data expansion and adaptation in spontaneous speech recognition, Interspeech
Petar Aleksic, Mohammadreza Ghodsi, Assaf Michaely, Cyril Allauzen, Keith Hall, Brian Roark, David Rybach, Pedro Moreno (2015), Bringing contextual information to Google speech recognition, Interspeech
Lucy Vasserman, Vlad Schogol, Keith Hall (2015), Sequence-based class tagging for robust transcription in ASR, Interspeech
Björn Schuller, Stefan Steidl, Anton Batliner, Simone Hantke, Florian Hönig, J. R. Orozco-Arroyave, Elmar Nöth, Yue Zhang, Felix Weninger (2015), The INTERSPEECH 2015 computational paralinguistics challenge: nativeness, Parkinson's & eating condition, Interspeech
Florian Hönig (2015), The degree of nativeness sub-challenge: the data, Interspeech
Claude Montacié, Marie-José Caraty (2015), Phrase accentuation verification and phonetic variation measurement for the degree of nativeness sub-challenge, Interspeech
Eugénio Ribeiro, Jaime Ferreira, Julia Olcoz, Alberto Abad, Helena Moniz, Fernando Batista, Isabel Trancoso (2015), Combining multiple approaches to predict the degree of nativeness, Interspeech
Matthew P. Black, Daniel Bone, Zisis Iason Skordilis, Rahul Gupta, Wei Xia, Pavlos Papadopoulos, Sandeep Nallan Chakravarthula, Bo Xiao, Maarten Van Segbroeck, Jangwon Kim, Panayiotis G. Georgiou, Shrikanth S. Narayanan (2015), Automated evaluation of non-native English pronunciation quality: combining knowledge- and data-driven features at multiple time scales, Interspeech
J. R. Orozco-Arroyave (2015), The Parkinson's condition sub-challenge: the data, Interspeech
Dávid Sztahó, Gábor Kiss, Klára Vicsi (2015), Estimating the severity of Parkinson's disease from speech using linear regression and database partitioning, Interspeech
Alexander Zlotnik, Juan M. Montero, Rubén San-Segundo, Ascensión Gallardo-Antolín (2015), Random forest-based prediction of Parkinson's disease progression using acoustic, ASR and intelligibility features, Interspeech
Guozhen An, David Guy Brizan, Min Ma, Michelle Morales, Ali Raza Syed, Andrew Rosenberg (2015), Automatic recognition of unified Parkinson's disease rating from speech with acoustic, i-vector and phonotactic features, Interspeech
Seongjun Hahm, Jun Wang (2015), Parkinson's condition estimation using speech acoustic and inversely mapped articulatory data, Interspeech
James R. Williamson, Thomas F. Quatieri, Brian S. Helfer, Joseph Perricone, Satrajit S. Ghosh, Gregory Ciccarelli, Daryush D. Mehta (2015), Segment-dependent dynamics in predicting Parkinson's disease, Interspeech
Anton Batliner (2015), The eating condition sub-challenge: the data, Interspeech
Abhay Prasad, Prasanta Kumar Ghosh (2015), Automatic classification of eating conditions from speech using acoustic feature selection and a set of hierarchical support vector machine classifiers, Interspeech
Johannes Wagner, Andreas Seiderer, Florian Lingenfelser, Elisabeth André (2015), Combining hierarchical classification with frequency weighting for the recognition of eating conditions, Interspeech
Dara Pir, Theodore Brown (2015), Acoustic group feature selection using wrapper method for automatic eating condition recognition, Interspeech
Thomas Pellegrini (2015), Comparing SVM, softmax, and shallow neural networks for eating condition classification, Interspeech
Benjamin Milde, Chris Biemann (2015), Using representation learning and out-of-domain data for a paralinguistic speech task, Interspeech
Heysem Kaya, Alexey A. Karpov, Albert Ali Salah (2015), Fisher vectors with cascaded normalization for paralinguistic analysis, Interspeech
Jangwon Kim, Md. Nasir, Rahul Gupta, Maarten Van Segbroeck, Daniel Bone, Matthew P. Black, Zisis Iason Skordilis, Zhaojun Yang, Panayiotis G. Georgiou, Shrikanth S. Narayanan (2015), Automatic estimation of Parkinson's disease severity from diverse speech tasks, Interspeech
Tamás Grósz, Róbert Busa-Fekete, Gábor Gosztolya, László Tóth (2015), Assessing the degree of nativeness and Parkinson's condition using Gaussian processes and deep rectifier neural networks, Interspeech
Stefan Steidl (2015), The INTERSPEECH 2015 computational paralinguistics challenge: a summary of results, Interspeech
Anton Batliner (2015), Wrapping up: the story of the ComParE challenges, what we learned and where to go, Interspeech
S. M. Houghton, Colin J. Champion, Philip Weber (2015), Recognition of voiced sounds with a continuous state HMM, Interspeech
Xiangyu Zeng, Shi Yin, Dong Wang (2015), Learning speech rate in speech recognition, Interspeech
Guoguo Chen, Hainan Xu, Minhua Wu, Daniel Povey, Sanjeev Khudanpur (2015), Pronunciation and silence probability modeling for ASR, Interspeech
Marelie Davel, Etienne Barnard, Charl van Heerden, William Hartmann, Damianos Karakos, Richard Schwartz, Stavros Tsakalidis (2015), Exploring minimal pronunciation modeling for low resource languages, Interspeech
Hao Zheng, Zhanlei Yang, Liwei Qiao, Jianping Li, Wenju Liu (2015), Attribute knowledge integration for speech recognition based on multi-task learning neural networks, Interspeech
Etienne Marcheret, Gerasimos Potamianos, Josef Vopicka, Vaibhava Goel (2015), Detecting audio-visual synchrony using deep neural networks, Interspeech
Shahram Kalantari, David Dean, Houman Ghaemmaghami, Sridha Sridharan, Clinton Fookes (2015), Cross database training of audio-visual hidden Markov models for phone recognition, Interspeech
Shahram Kalantari, David Dean, Sridha Sridharan (2015), Incorporating visual information for spoken term detection, Interspeech
Hiroshi Ninomiya, Norihide Kitaoka, Satoshi Tamura, Yurie Iribe, Kazuya Takeda (2015), Integration of deep bottleneck features for audio-visual speech recognition, Interspeech
Sofoklis Kakouros, Okko Räsänen (2015), Automatic detection of sentence prominence in speech using predictability of word-level acoustic features, Interspeech
Milos Cernak, Pierre-Edouard Honnet (2015), An empirical model of emphatic word detection, Interspeech
Yishuang Ning, Zhiyong Wu, Xiaoyan Lou, Helen Meng, Jia Jia, Lianhong Cai (2015), Using tilt for automatic emphasis detection with Bayesian networks, Interspeech
Linxue Bai, Peter Jančovič, Martin Russell, Philip Weber (2015), Analysis of a low-dimensional bottleneck neural network representation of speech for modelling speech dynamics, Interspeech
Hidetsugu Uchida, Daisuke Saito, Nobuaki Minematsu, Keikichi Hirose (2015), Statistical acoustic-to-articulatory mapping unified with speaker normalization based on voice conversion, Interspeech
Raghavendra Reddy Pappagari, Karthika Vijayan, K. Sri Rama Murty (2015), Analysis of features from analytic representation of speech using MP-ABX measures, Interspeech
Erfan Loweimi, Jon Barker, Thomas Hain (2015), Source-filter separation of speech signal in the phase domain, Interspeech
Ranniery Maia, Yannis Stylianou, Masami Akamine (2015), A maximum likelihood approach to the detection of moments of maximum excitation and its application to high-quality speech parameterization, Interspeech
Christopher Liberatore, Sandesh Aryal, Zelun Wang, Seth Polsley, Ricardo Gutierrez-Osuna (2015), SABR: sparse, anchor-based representation of the speech signal, Interspeech
Tamás Gábor Csapó, Géza Németh (2015), Automatic transformation of irregular to regular voice by residual analysis and synthesis, Interspeech
Simon Preuß, Peter Birkholz (2015), Optical sensor calibration for electro-optical stomatography, Interspeech
Kálmán Abari, Tamás Gábor Csapó, Bálint Pál Tóth, Gábor Olaszy (2015), From text to formants — indirect model for trajectory prediction based on a multi-speaker parallel speech database, Interspeech
Chung-Chien Hsu, Jen-Tzung Chien, Tai-Shih Chi (2015), Layered nonnegative matrix factorization for speech separation, Interspeech
Catherine Laporte, Lucie Ménard (2015), Robust tongue tracking in ultrasound images: a multi-hypothesis approach, Interspeech
Danny Websdale, Thomas Le Cornu, Ben Milner (2015), Objective measures for predicting the intelligibility of spectrally smoothed speech with artificial excitation, Interspeech
Christophe Mertens, Francis Grenez, François Viallet, Alain Ghio, Sabine Skodda, Jean Schoentgen (2015), Vocal tremor analysis via AM-FM decomposition of empirical modes of the glottal cycle length time series, Interspeech
Elizabeth Godoy, Nicolas Malyska, Thomas F. Quatieri (2015), Estimating lower vocal tract features with closed-open phase spectral analyses, Interspeech
S. M. Houghton, Colin J. Champion (2015), Inductive implementation of segmental HMMs as CS-HMMs, Interspeech
G. Nisha Meenakshi, Prasanta Kumar Ghosh (2015), A discriminative analysis within and across voiced and unvoiced consonants in neutral and whispered speech in multiple Indian languages, Interspeech
T. J. Tsai, Andreas Stolcke (2015), Aligning meeting recordings via adaptive fingerprinting, Interspeech
Matthias Zöhrer, Robert Peharz, Franz Pernkopf (2015), On representation learning for artificial bandwidth extension, Interspeech
Dhananjaya Gowda, Rahim Saeidi, Paavo Alku (2015), AM-FM based filter bank analysis for estimation of spectro-temporal envelopes and its application for speaker recognition in noisy reverberant environments, Interspeech
Thomas Drugman, Yannis Stylianou (2015), Fast and accurate phase unwrapping, Interspeech
Xugang Lu, Peng Shen, Yu Tsao, Chiori Hori, Hisashi Kawai (2015), Sparse representation with temporal max-smoothing for acoustic event detection, Interspeech
G Anushiya Rachel, P Vijayalakshmi, T Nagarajan (2015), Estimation of glottal closure instants from telephone speech using a group delay-based approach that considers speech signal as a spectrum, Interspeech
Raúl Montaño, Francesc Alías (2015), The role of prosody and voice quality in text-dependent categories of storytelling across languages, Interspeech
Alexandre Hyafil, Milos Cernak (2015), Neuromorphic based oscillatory device for incremental syllable boundary detection, Interspeech
Ann Lee, James Glass (2015), Mispronunciation detection without nonnative training data, Interspeech
Ramya Rasipuram, Milos Cernak, Alexandre Nachen, Mathew Magimai-Doss (2015), Automatic accentedness evaluation of non-native speech using phonetic and sub-phonetic posterior probabilities, Interspeech
Min Ma, Keelan Evanini, Anastassia Loukina, Xinhao Wang, Klaus Zechner (2015), Using F0 contours to assess nativeness in a sentence repeat task, Interspeech
Rebecca Lunsford, Peter A. Heeman (2015), Using linguistic indicators of difficulty to identify mild cognitive impairment, Interspeech
Lionel Fontan, Jérôme Farinas, Isabelle Ferrané, Julien Pinquier, Xavier Aumont (2015), Automatic intelligibility measures applied to speech signals simulating age-related hearing loss, Interspeech
Sandeep Nallan Chakravarthula, Bo Xiao, Zac E. Imel, David C. Atkins, Panayiotis G. Georgiou (2015), Assessing empathy using static and dynamic behavior models based on therapist's language in addiction counseling, Interspeech
Yuzong Liu, Rishabh Iyer, Katrin Kirchhoff, Jeff Bilmes (2015), SVitchboard II and FiSVer I: high-quality limited-complexity corpora of conversational English speech, Interspeech
Herman Kamper, Aren Jansen, Sharon Goldwater (2015), Fully unsupervised small-vocabulary speech recognition using a segmental Bayesian model, Interspeech
Ottokar Tilk, Tanel Alumäe (2015), LSTM for punctuation restoration in speech transcripts, Interspeech
Emre Yılmaz, Deepak Baby, Hugo Van hamme (2015), Noise robust exemplar matching for speech enhancement: applications to automatic speech recognition, Interspeech
Yingming Gao, Yanlu Xie, Wen Cao, Jinsong Zhang (2015), A study on robust detection of pronunciation erroneous tendency based on deep neural network, Interspeech
Shrikant Joshi, Nachiket Deo, Preeti Rao (2015), Vowel mispronunciation detection using DNN acoustic models with cross-lingual training, Interspeech
Kshitiz Kumar, Ziad Al Bawab, Yong Zhao, Chaojun Liu, Benoit Dumoulin, Yifan Gong (2015), Confidence-features and confidence-scores for ASR applications in arbitration and DNN speaker adaptation, Interspeech
Pengfei Liu, Shoaib Jameel, Wai Lam, Bin Ma, Helen Meng (2015), Topic modeling for conference analytics, Interspeech
Pulkit Sharma, Vinayak Abrol, A. D. Dileep, Anil Kumar Sao (2015), Sparse coding based features for speech units classification, Interspeech
Andreea I. Niculescu, Ngoc Thuy Huong Thai, Chongjia Ni, Boon Pang Lim, Kheng Hui Yeo, Rafael E. Banchs (2015), Smarter driving with IDA, the intelligent driving assistant for Singapore, Interspeech
Kheng Hui Yeo, Rafael E. Banchs (2015), Talk it out: adding speech interaction to support informational and transactional applications on public touch-screen kiosks, Interspeech
Luis Fernando D'Haro, Seokhwan Kim, Rafael E. Banchs (2015), Conversational agent and management tools for conference and tourism domain, Interspeech
Askars Salimbajevs, Jevgenijs Strigins (2015), Latvian speech-to-text transcription service, Interspeech
Jakub Gałka, Joanna Grzybowska, Magdalena Igras, Paweł Jaciów, Kamil Wajda, Marcin Witkowski, Mariusz Ziółko (2015), System supporting speaker identification in emergency call center, Interspeech
Ahmed Abdelali, Ahmed Ali, Francisco Guzmán, Felix Stahlberg, Stephan Vogel, Yifan Zhang (2015), QAT2 — the QCRI advanced transcription and translation system, Interspeech
Michael Stadtschnitzer, Christoph Schmidt (2015), Implementation of a live dialectal media subtitling system, Interspeech
Peter Bell, Catherine Lai, Clare Llewellyn, Alexandra Birch, Mark Sinclair (2015), A system for automatic broadcast news summarisation, geolocation and translation, Interspeech
Artūrs Znotiņš, Kaspars Polis, Roberts Darģis (2015), Media monitoring system for Latvian radio and TV broadcasts, Interspeech
Michel Assayag, Jonathan Huang, Jonathan Mamou, Oren Pereg, Saurav Sahay, Oren Shamir, Georg Stemmer, Moshe Wasserblat (2015), Meeting assistant application, Interspeech
Bartosz Ziółko, Tomasz Jadczyk, Dawid Skurzok, Piotr Żelasko, Jakub Gałka, Tomasz Pędzimąż, Ireneusz Gawlik, Szymon Pałka (2015), SARMATA 2.0 automatic Polish language speech recognition system, Interspeech
Arlo Faria, Korbinian Riedhammer (2015), Remeeting — get more out of meetings, Interspeech
Ikuyo Masuda-Katsuse (2015), Web application system for pronunciation practice by children with disabilities and to support cooperation of teachers and medical workers, Interspeech
Caroline Kaufhold, Vadim Gamidov, Andreas Kiessling, Klaus Reinhard, Elmar Nöth (2015), PATSY — it's all about pronunciation!, Interspeech
Elias Azarov, Maxim Vashkevich, Denis Likhachov, Alexander Petrovsky (2015), Real-time pitch modification system for speech and singing voice, Interspeech
Guillaume Dubuisson Duplessis, Lucile Béchade, Mohamed A. Sehili, Agnès Delaborde, Vincent Letard, Anne-Laure Ligozat, Paul Deléglise, Yannick Estève, Sophie Rosset, Laurence Devillers (2015), Nao is doing humour in the CHIST-ERA JOKER project, Interspeech
Lisa Lange, Bartholomäus Pfeiffer, Daniel Duran (2015), ABIMS — auditory bewildered interaction measurement system, Interspeech
Kay Berkling, Nadine Pflaumer, Alexei Coyplove (2015), Phontasia — a game for training German orthography, Interspeech
Ka Ho Wong, Wai Kim Leung, Helen Meng (2015), E-commu-book: an assistive technology for users with speech impairments, Interspeech
Martina Röthlisberger, Iliana I. Karipidis, Georgette Pleisch, Volker Dellwo, Ulla Richardson, Silvia Brem (2015), Swiss GraphoGame: concept and design presentation of a computerised reading intervention for children with high risk for poor reading outcomes, Interspeech
Jakob Pfab, Hanna Jakob, Mona Späth, Christoph Draxler (2015), Neolexon — a therapy app for patients with aphasia, Interspeech
Sonal Patil, Harish Arsikere, Om Deshmukh (2015), Acoustic stress detection for improved navigation of educational videos, Interspeech
Xavier Anguera (2015), Multimodal read-aloud ebooks for language learning, Interspeech
Laurent Besacier, Elodie Gauthier, Mathieu Mangeot, Philippe Bretier, Paul Bagshaw, Olivier Rosec, Thierry Moudenc, François Pellegrino, Sylvie Voisin, Egidio Marsico, Pascal Nocera (2015), Speech technologies for African languages: example of a multilingual calculator for education, Interspeech
Kong Aik Lee, Guangsen Wang, Kam Pheng Ng, Hanwu Sun, Trung Hieu Nguyen, Ngoc Thuy Huong Thai, Bin Ma, Haizhou Li (2015), The RedDots platform for mobile crowd-sourcing of speech data, Interspeech
Takayuki Arai (2015), Two extensions of Umeda and Teranishi's physical models of the human vocal tract, Interspeech
Matheuz Budnik, Laurent Besacier, Johann Poignant, Hervé Bredin, Claude Barras, Mickael Stefas, Pierrick Bruneau, Thomas Tamisier (2015), Collaborative annotation for person identification in TV shows, Interspeech
Thomas Kisler, Florian Schiel, Uwe D. Reichel, Christoph Draxler (2015), Phonetic/linguistic web services at BAS, Interspeech
Raphael Winkelmann (2015), Managing speech databases with emuR and the EMU-webApp, Interspeech
Sebastian Wankerl, Florian Hönig, Anton Batliner, J. R. Orozco-Arroyave, Elmar Nöth (2015), Visual comparison of speaker groups, Interspeech
Rohit Kumar, Matthew E. Roy, Sanjika Hewavitharana, Dennis N. Mehay, Nina Zinovieva (2015), Tools for rapid customization of S2S systems for emergent domains, Interspeech
Florian Metze, Eric Riebling, Eric Fosler-Lussier, Andrew Plummer, Rebecca Bates (2015), The speech recognition virtual kitchen turns one, Interspeech
Jan Rennies, Andreas Volgenandt, Henning Schepker, Simon Doclo (2015), Model-based adaptive pre-processing of speech for enhanced intelligibility in noise and reverberation, Interspeech
Sebastian Möller, Tilo Westermann (2015), Experiences with and new application ideas for the Interspeech app, Interspeech
Dmitry Sityaev, Praphul Kumar, Rajesh Ramchander (2015), Traditional IVR and visual IVR — killing two birds with one stone, Interspeech
Kousuke Itakura, Izaya Nishimuta, Yoshiaki Bando, Katsutoshi Itoyama, Kazuyoshi Yoshii (2015), Bayesian integration of sound source separation and speech recognition: a new approach to simultaneous speech recognition, Interspeech
Ivan Himawan, Petr Motlicek, Sridha Sridharan, David Dean, Dian Tjondronegoro (2015), Channel selection in the short-time modulation domain for distant speech recognition, Interspeech
Gert Dekkers, Toon van Waterschoot, Bart Vanrumste, Bert Van Den Broeck, Jort F. Gemmeke, Hugo Van hamme, Peter Karsmakers (2015), A multi-channel speech enhancement framework for robust NMF-based speech recognition for speech-impaired users, Interspeech
Chanwoo Kim, Kean K. Chin (2015), Sound source separation algorithm using phase difference and angle distribution modeling near the target, Interspeech
Mirco Ravanelli, Maurizio Omologo (2015), Contaminated speech training methods for robust DNN-HMM distant speech recognition, Interspeech
Yajie Miao, Florian Metze (2015), Distance-aware DNNs for robust speech recognition, Interspeech
Helena Levy (2015), Perception and production of vowel contrasts in German learners of English, Interspeech
Rong Tong, Nancy F. Chen, Bin Ma, Haizhou Li (2015), Goodness of tone (GOT) for non-native Mandarin tone recognition, Interspeech
Jeanin Jügler, Frank Zimmerer, Bernd Möbius, Christoph Draxler (2015), The effect of high-variability training on the perception and production of French stops by German native speakers, Interspeech
Wenfu Bao, Hui Feng, Jianwu Dang, Zhilei Liu, Yang Yu, Siyu Wang (2015), Perception of Mandarin tones by native Tibetan speakers, Interspeech
Shambhu Nath Saha, Shyamal Kr. Das Mandal (2015), Study of acoustic correlates of English lexical stress produced by native (L1) Bengali speakers compared to native (L1) English speakers, Interspeech
Yasuko Nagano-Madsen (2015), Prosodic phrasing unique to the acquisition of L2 intonation — an analysis of L2 Japanese intonation by L1 Swedish learners, Interspeech
Leda Sarı, Batuhan Gündoğdu, Murat Saraçlar (2015), Fusion of LVCSR and posteriorgram based keyword search, Interspeech
Gideon Mendels, Erica Cooper, Victor Soto, Julia Hirschberg, Mark J. F. Gales, Kate M. Knill, Anton Ragni, Haipeng Wang (2015), Improving speech recognition and keyword search for low resource languages using web data, Interspeech
Kentaro Domoto, Takehito Utsuro, Naoki Sawada, Hiromitsu Nishizaki (2015), Two-step spoken term detection using SVM classifier trained with pre-indexed keywords based on ASR result, Interspeech
Le Zhang, Damianos Karakos, William Hartmann, Roger Hsiao, Richard Schwartz, Stavros Tsakalidis (2015), Enhancing low resource keyword spotting with automatically retrieved web documents, Interspeech
Dario Bertero, Linlin Wang, Ho Yin Chan, Pascale Fung (2015), A comparison between a DNN and a CRF disfluency detection and reconstruction system, Interspeech
Julian Hough, David Schlangen (2015), Recurrent neural networks for incremental disfluency detection, Interspeech
Qiong Hu, Zhizheng Wu, Korin Richmond, Junichi Yamagishi, Yannis Stylianou, Ranniery Maia (2015), Fusion of multiple parameterisations for DNN-based sinusoidal speech synthesis with multi-task learning, Interspeech
Sivanand Achanta, Tejas Godambe, Suryakanth V. Gangashetty (2015), An investigation of recurrent neural network architectures for statistical parametric speech synthesis, Interspeech
Yuchen Fan, Yao Qian, Frank K. Soong, Lei He (2015), Sequence generation error (SGE) minimization based deep neural networks training for text-to-speech synthesis, Interspeech
Cassia Valentini-Botinhao, Zhizheng Wu, Simon King (2015), Towards minimum perceptual error training for DNN-based speech synthesis, Interspeech
Eunwoo Song, Hong-Goo Kang (2015), Deep neural network-based statistical parametric speech synthesis system using improved time-frequency trajectory excitation model, Interspeech
Zhizheng Wu, Pawel Swietojanski, Christophe Veaux, Steve Renals, Simon King (2015), A study of speaker adaptation for DNN-based speech synthesis, Interspeech
Yannan Wang, Jun Du, Li-Rong Dai, Chin-Hui Lee (2015), High-resolution acoustic modeling and compact language modeling of language-universal speech attributes for spoken language identification, Interspeech
Saad Irtza, Vidhyasaharan Sethu, Phu Ngoc Le, Eliathamby Ambikairajah, Haizhou Li (2015), Phonemes frequency based PLLR dimensionality reduction for language recognition, Interspeech
Sandro Cumani, Oldřich Plchot, Radek Fér (2015), Exploiting i-vector posterior covariances for short-duration language recognition, Interspeech
Athanasios Lykartsis, Stefan Weinzierl (2015), Using the beat histogram for speech rhythm description and language identification, Interspeech
Rahim Saeidi, Tuija Niemi, Hanna Karppelin, Jouni Pohjalainen, Tomi Kinnunen, Paavo Alku (2015), Speaker recognition for speech under face cover, Interspeech
Md. Hafizur Rahman, Ahilan Kanagasundaram, David Dean, Sridha Sridharan (2015), Dataset-invariant covariance normalization for out-domain PLDA speaker verification, Interspeech
Longting Xu, Kong Aik Lee, Haizhou Li, Zhen Yang (2015), Sparse coding of total variability matrix, Interspeech
Weicheng Cai, Ming Li, Lin Li, QingYang Hong (2015), Duration dependent covariance regularization in PLDA modeling for speaker verification, Interspeech
Hagai Aronowitz (2015), Exploiting supervector structure for speaker recognition trained on a small development set, Interspeech
QingYang Hong, Lin Li, Ming Li, Ling Huang, Lihong Wan, Jun Zhang (2015), Modified-prior PLDA and score calibration for duration mismatch compensation in speaker recognition system, Interspeech
Sarfaraz Jelil, Rohan Kumar Das, Rohit Sinha, S. R. Mahadeva Prasanna (2015), Speaker verification using Gaussian posteriorgrams on fixed phrase short utterances, Interspeech
Laura Fernández Gallardo, Sebastian Möller, Michael Wagner (2015), Importance of intelligible phonemes for human speaker recognition in different channel bandwidths, Interspeech
Hitoshi Yamamoto, Takafumi Koshinaka (2015), Denoising autoencoder-based speaker feature restoration for utterances of short duration, Interspeech
Dayana Ribas, Emmanuel Vincent, José Ramón Calvo (2015), Full multicondition training for robust i-vector based speaker recognition, Interspeech
Zhen Huang, Sabato Marco Siniscalchi, I-Fan Chen, Jinyu Li, Jiadong Wu, Chin-Hui Lee (2015), Maximum a posteriori adaptation of network parameters in deep models, Interspeech
Yan Huang, Yifan Gong (2015), Regularized sequence-level deep neural network model adaptation, Interspeech
Xiangang Li, Xihong Wu (2015), Modeling speaker variability using long short-term memory networks for speech recognition, Interspeech
Kshitiz Kumar, Chaojun Liu, Kaisheng Yao, Yifan Gong (2015), Intermediate-layer DNN adaptation for offline and session-based iterative speaker adaptation, Interspeech
Murali Karthick B., Prateek Kolhar, S. Umesh (2015), Speaker adaptation of convolutional neural network using speaker specific subspace vectors of SGMM, Interspeech
Yajie Miao, Florian Metze (2015), On speaker adaptation of long short-term memory recurrent neural networks, Interspeech
Emilio Parisotto, Youness A. Ghassabeh, Matt J. MacDonald, Adelina Cozma, Elizabeth W. Pang, Frank Rudzicz (2015), Automatic identification of received language in MEG, Interspeech
Laurens van der Werff, Jón Guðnason, Kamilla Rún Jóhannsdóttir (2015), Detection of cardiovascular reactivity in speech, Interspeech
Alex Francois-Nienaber, Jed A. Meltzer, Frank Rudzicz (2015), Lateralization in emotional speech perception following transcranial direct current stimulation, Interspeech
Minda Yang, Sameer A. Sheth, Catherine A. Schevon, Guy M. McKhann, Nima Mesgarani (2015), Speech reconstruction from human auditory cortex with deep neural networks, Interspeech
Jonathan S. Brumberg, Nichol Castro, Akshatha Rao (2015), Temporal dynamics of the speech readiness potential, and its use in a neural decoder of speech-motor intention, Interspeech
Dominic Heger, Christian Herff, Adriana de Pesters, Dominic Telaar, Peter Brunner, Gerwin Schalk, Tanja Schultz (2015), Continuous speech recognition from ECoG, Interspeech
Yu-hsin Chen, Ignacio Lopez-Moreno, Tara N. Sainath, Mirkó Visontai, Raziel Alvarez, Carolina Parada (2015), Locally-connected and convolutional neural networks for small footprint speaker recognition, Interspeech
Daniel Garcia-Romero, Alan McCree (2015), Insights into deep neural networks for speaker recognition, Interspeech
Fred Richardson, Douglas A. Reynolds, Najim Dehak (2015), A unified deep neural network for speaker and language recognition, Interspeech
Yao Tian, Meng Cai, Liang He, Jia Liu (2015), Investigation of bottleneck features and multilingual deep neural networks for speaker verification, Interspeech
Hua Xing, Gang Liu, John H. L. Hansen (2015), Frequency offset correction in single sideband (SSB) speech by deep neural network for speaker verification, Interspeech
Hao Zheng, Shanshan Zhang, Wenju Liu (2015), Exploring robustness of DNN/RNN for extracting speaker Baum-Welch statistics in mismatched conditions, Interspeech
Takenori Yoshimura, Kei Hashimoto, Yoshihiko Nankaku, Keiichi Tokuda (2015), Simultaneous optimization of multiple tree structures for factor analyzed HMM-based speech synthesis, Interspeech
Maël Pouget, Thomas Hueber, Gérard Bailly, Timo Baumann (2015), HMM training strategy for incremental speech synthesis, Interspeech
Shinnosuke Takamichi, Tomoki Toda, Alan W. Black, Satoshi Nakamura (2015), Modulation spectrum-constrained trajectory training algorithm for HMM-based speech synthesis, Interspeech
Alan W. Black, Prasanna Kumar Muthukumar (2015), Random forests for statistical speech synthesis, Interspeech
Doo Hwa Hong, Joun Yeop Lee, Se Young Jang, Nam Soo Kim (2015), Speaker adaptation using relevance vector regression for HMM-based expressive TTS, Interspeech
Vassilis Tsiaras, Ranniery Maia, Vassilis Diakoloukas, Yannis Stylianou, Vassilis Digalakis (2015), Towards a linear dynamical model based speech synthesizer, Interspeech
Céline De Looze, Brian Vaughan, Finnian Kelly, Alison Kay (2015), Providing objective metrics of team communication skills via interpersonal coordination mechanisms, Interspeech
Donghyeon Lee, Jinsik Lee, Eun-Kyoung Kim, Jaewon Lee (2015), Dialog act modeling for virtual personal assistant applications using a small volume of labeled data and domain knowledge, Interspeech
Csaba Zainkó, Mátyás Bartalis, Géza Németh, Gábor Olaszy (2015), A polyglot domain optimised text-to-speech system for railway station announcements, Interspeech
Partho Mandal, Shalini Jain, Gaurav Ojha, Anupam Shukla (2015), Development of Hindi speech recognition system of agricultural commodities using deep neural network, Interspeech
Thomas Fehér, Michael Freitag, Christian Gruber (2015), Real-time audio signal enhancement for hands-free speech applications, Interspeech
D. Erro, Inma Hernaez, Agustin Alonso, D. García-Lorenzo, Eva Navas, J. Ye, H. Arzelus, Igor Jauk, N. Q. Hy, C. Magariños, R. Pérez-Ramón, M. Sulír, Xiaohai Tian, X. Wang (2015), Personalized synthetic voices for speaking impaired: website and app, Interspeech
Reza Sahraeian, Dirk Van Compernolle, Febe de Wet (2015), Under-resourced speech recognition based on the speech manifold, Interspeech
Pavel Golik, Zoltán Tüske, Ralf Schlüter, Hermann Ney (2015), Multilingual features based keyword search for very low-resource languages, Interspeech
Xiaoyun Wang, Seiichi Yamamoto (2015), Second language speech recognition using multiple-pass decoding with lexicon represented by multiple reduced phoneme sets, Interspeech
Sarah Samson Juan, Laurent Besacier, Benjamin Lecouteux, Mohamed Dyab (2015), Using resources from a closely-related language to develop ASR for a very under-resourced language: a case study for Iban, Interspeech
Maxim L. Korenevsky, Andrey B. Smirnov, Valentin S. Mendelev (2015), Prediction of speech recognition accuracy for utterance classification, Interspeech
Eugen Beck, Ralf Schlüter, Hermann Ney (2015), Error bounds for context reduction and feature omission, Interspeech
Nobuyasu Itoh, Gakuto Kurata, Ryuki Tachibana, Masafumi Nishimura (2015), A metric for evaluating speech recognizer output based on human-perception model, Interspeech
Mohamed Ameur Ben Jannet, Olivier Galibert, Martine Adda-Decker, Sophie Rosset (2015), How to evaluate ASR output for named entity recognition?, Interspeech
Hansjörg Mixdorff, Angelika Hönemann, Albert Rilliard (2015), Acoustic-prosodic analysis of attitudinal expressions in German, Interspeech
Hossein Khaki, Engin Erzin (2015), Continuous emotion tracking using total variability space, Interspeech
Chi-Chun Lee, Daniel Bone, Shrikanth S. Narayanan (2015), An analysis of the relationship between signal-derived vocal arousal score and human emotion production and perception, Interspeech
Hiroki Mori (2015), Morphology of vocal affect bursts: exploring expressive interjections in Japanese conversation, Interspeech
Mahnoosh Mehrabani, Ozlem Kalinli, Ruxin Chen (2015), Emotion clustering based on probabilistic linear discriminant analysis, Interspeech
Aaron Albin, Elliot Moore (2015), Objective study of the performance degradation in emotion recognition through the AMR-WB+ codec, Interspeech
Sudarsana Reddy Kadiri, P. Gangamohan, Suryakanth V. Gangashetty, B. Yegnanarayana (2015), Analysis of excitation source features of speech for emotion recognition, Interspeech
Zhaocheng Huang, Julien Epps, Eliathamby Ambikairajah (2015), An investigation of emotion change detection from speech, Interspeech
Wentao Gu, Ping Tang, Keikichi Hirose, Véronique Aubergé (2015), Crosslinguistic comparison on the perception of Mandarin attitudinal speech, Interspeech
Gábor Gosztolya (2015), Conflict intensity estimation from speech using greedy forward-backward feature selection, Interspeech
Chee Seng Chong, Jeesun Kim, Chris Davis (2015), Exploring acoustic differences between Cantonese (tonal) and English (non-tonal) spoken expressions of emotions, Interspeech
Elisavet Palogiannidi, Elias Iosif, Polychronis Koutsakis, Alexandros Potamianos (2015), Valence, arousal and dominance estimation for English, German, Greek, Portuguese and Spanish lexica using semantic models, Interspeech
Xinzhou Xu, Jun Deng, Wenming Zheng, Li Zhao, Björn Schuller (2015), Dimensionality reduction for speech emotion features by multiscale kernels, Interspeech
Jinkyu Lee, Ivan Tashev (2015), High-level feature representation using recurrent neural network for speech emotion recognition, Interspeech
Myung Jong Kim, Joohong Yoo, Younggwan Kim, Hoirin Kim (2015), Speech emotion classification using tree-structured sparse logistic regression, Interspeech
Bogdan Vlasenko, Andreas Wendemuth (2015), Annotators' agreement and spontaneous emotion classification performance, Interspeech
Ebru Arisoy, Murat Saraçlar (2015), Multi-stream long short-term memory neural network language model, Interspeech
Keith Hall, Eunjoon Cho, Cyril Allauzen, Françoise Beaufays, Noah Coccaro, Kaisuke Nakajima, Michael Riley, Brian Roark, David Rybach, Linda Zhang (2015), Composition-based on-the-fly rescoring for salient n-gram biasing, Interspeech
Alex Marin, Mari Ostendorf, Ji He (2015), Learning phrase patterns for ASR name error detection using semantic similarity, Interspeech
Noam Shazeer, Joris Pelemans, Ciprian Chelba (2015), Sparse non-negative matrix language modeling for skip-grams, Interspeech
Joris Pelemans, Noam Shazeer, Ciprian Chelba (2015), Pruning sparse non-negative matrix n-gram language models, Interspeech
Ciprian Chelba, Xuedong Zhang, Keith Hall (2015), Geo-location for voice search language modeling, Interspeech
Rami Botros, Kazuki Irie, Martin Sundermeyer, Hermann Ney (2015), On efficient training of word classes and their application to recurrent neural network language models, Interspeech
Ali Orkan Bayer, Giuseppe Riccardi (2015), Deep semantic encodings for language modeling, Interspeech
Ming Sun, Yun-Nung Chen, Alexander I. Rudnicky (2015), Learning OOV through semantic relatedness in spoken dialog systems, Interspeech
Tze Yuang Chong, Rafael E. Banchs, Eng Siong Chng, Haizhou Li (2015), TDTO language modeling with feedforward neural networks, Interspeech
Matthias Paulik (2015), Improvements to the pruning behavior of DNN acoustic models, Interspeech
Haşim Sak, Andrew Senior, Kanishka Rao, Françoise Beaufays (2015), Fast and accurate recurrent neural network acoustic models for speech recognition, Interspeech
Preetum Nakkiran, Raziel Alvarez, Rohit Prabhavalkar, Carolina Parada (2015), Compressing deep neural networks using a rank-constrained topology, Interspeech
Tara N. Sainath, Carolina Parada (2015), Convolutional neural networks for small-footprint keyword spotting, Interspeech
Ewout van den Berg, Daniel Brand, Rajesh Bordawekar, Leonid Rachevsky, Bhuvana Ramabhadran (2015), Efficient GPU implementation of convolutional neural networks for speech recognition, Interspeech
Nikko Strom (2015), Scalable distributed DNN training using commodity GPU cloud computing, Interspeech
Sachin N. Kalkur, Sandeep Reddy C., Rajesh M. Hegde (2015), Joint source localization and separation in spherical harmonic domain using a sparsity based method, Interspeech
Shaofei Zhang, Dong-Yan Huang, Lei Xie, Eng Siong Chng, Haizhou Li, Minghui Dong (2015), Regularized non-negative matrix factorization using alternating direction method of multipliers and its application to source separation, Interspeech
Shuai Nie, Shan Liang, Wei Xue, Xueliang Zhang, Wenju Liu, Like Dong, Hong Yang (2015), Two-stage multi-target joint learning for monaural speech separation, Interspeech
Yong Xu, Jun Du, Zhen Huang, Li-Rong Dai, Chin-Hui Lee (2015), Multi-objective learning and mask-based post-processing for deep neural network based speech enhancement, Interspeech
Kisoo Kwon, Jong Won Shin, Hyung Yong Kim, Nam Soo Kim (2015), Discriminative nonnegative matrix factorization using cross-reconstruction error for source separation, Interspeech
Faheem Khan, Ben Milner (2015), Using audio and visual information for single channel speaker separation, Interspeech
Harald Höge (2015), On the nature of the features generated in the human auditory pathway for phone recognition, Interspeech
Kodai Yamamoto, Toshio Irino, Ryuichi Nisimura, Hideki Kawahara, Roy D. Patterson (2015), How the slope of the speech spectrum affects the perception of speaker size, Interspeech
Heikki Rasilo, Okko Räsänen (2015), Weakly-supervised word learning is improved by an active online algorithm, Interspeech
Lin Lin, Jon Barker, Guy J. Brown (2015), The effect of cochlear implant processing on speaker intelligibility: a perceptual study and computer model, Interspeech
Mengxue Cao, Aijun Li, Qiang Fang, Bernd J. Kröger (2015), Phonetic-phonological feature emerges by associating phonetic with semantic information — a GSOM-based modeling study, Interspeech
L. ten Bosch, L. Boves, B. Tucker, M. Ernestus (2015), DIANA: towards computational modeling reaction times in lexical decision in North American English, Interspeech
Qian Chen, Zhen-Hua Ling, Chen-Yu Yang, Li-Rong Dai (2015), Automatic phrase boundary labeling of speech synthesis database using context-dependent HMMs and n-gram prior distributions, Interspeech
Manuel Sam Ribeiro, Junichi Yamagishi, Robert A. J. Clark (2015), A perceptual investigation of wavelet-based decomposition of F0 for text-to-speech synthesis, Interspeech
Decha Moungsri, Tomoki Koriyama, Takao Kobayashi (2015), Duration prediction using multi-level model for GPR-based speech synthesis, Interspeech
Mahsa Sadat Elyasi Langarani, Jan van Santen, Seyed Hamidreza Mohammadi, Alexander Kain (2015), Data-driven foot-based intonation generator for text-to-speech synthesis, Interspeech
Branislav Gerazov, Pierre-Edouard Honnet, Aleksandar Gjoreski, Philip N. Garner (2015), Weighted correlation based atom decomposition intonation modelling, Interspeech
Raul Fernandez, Asaf Rendel, Bhuvana Ramabhadran, Ron Hoory (2015), Using deep bidirectional recurrent neural networks for prosodic-target prediction in a unit-selection text-to-speech system, Interspeech
Hank Liao, Golan Pundak, Olivier Siohan, Melissa K. Carroll, Noah Coccaro, Qi-Ming Jiang, Tara N. Sainath, Andrew Senior, Françoise Beaufays, Michiel Bacchiani (2015), Large vocabulary automatic speech recognition for children, Interspeech
Daniel Bone, Matthew P. Black, Anil Ramakrishna, Ruth Grossman, Shrikanth S. Narayanan (2015), Acoustic-prosodic correlates of 'awkward' prosody in story retellings from adolescents with autism, Interspeech
Eva Fringi, Jill Fain Lehman, Martin Russell (2015), Evidence of phonological processes in automatic recognition of children's speech, Interspeech
Michael Pucher, Markus Toman, Dietmar Schabus, Cassia Valentini-Botinhao, Junichi Yamagishi, Bettina Zillinger, Erich Schmid (2015), Influence of speaker familiarity on blind and visually impaired children's perception of synthetic voices in audio games, Interspeech
S. Shahnawazuddin, Rohit Sinha (2015), Low-memory fast on-line adaptation for acoustically mismatched children's speech recognition, Interspeech
Diego Giuliani, Bagher BabaAli (2015), Large vocabulary children's speech recognition with DNN-HMM and SGMM acoustic modeling, Interspeech
Avashna Govender, Febe de Wet, Jules-Raymond Tapamo (2015), HMM adaptation for child speech synthesis, Interspeech
Jaebok Kim, Khiet P. Truong, Vicky Charisi, Cristina Zaga, Manja Lohse, Dirk Heylen, Vanessa Evers (2015), Vocal turn-taking patterns in groups of children performing collaborative tasks: an exploratory study, Interspeech
Roozbeh Sadeghian, Stephen A. Zahorian (2015), Towards an automated screening tool for pediatric speech delay, Interspeech
Jorge Proença, Dirce Celorico, Sara Candeias, Carla Lopes, Fernando Perdigão (2015), Children's reading aloud performance: a database and automatic detection of disfluencies, Interspeech
Harshavardhan Sundar, Jill Fain Lehman, Rita Singh (2015), Keyword spotting in multi-player voice driven games for children, Interspeech
Jinxi Guo, Rohit Paturi, Gary Yeung, Steven M. Lulich, Harish Arsikere, Abeer Alwan (2015), Age-dependent height estimation and speaker normalization for children's speech using the first three subglottal resonances, Interspeech
Adrian Leemann, Camilla Bernardasci, Francis Nolan (2015), The effect of speakers' regional varieties on listeners' decision-making, Interspeech
Robert Fuchs (2015), Word-initial glottal stop insertion, hiatus resolution and linking in British English, Interspeech
Shanpeng Li, Wentao Gu (2015), Acoustic analysis of Mandarin affricates, Interspeech
Hannah Leykum, Sylvia Moosmüller, Wolfgang U. Dressler (2015), Homophonous phonotactic and morphonotactic consonant clusters in word-final position, Interspeech
Mark Gibson, Ana María Fernández Planas, Adamantios Gafos, Emily Remirez (2015), Consonant duration and VOT as a function of syllable complexity and voicing in a sub-set of Spanish clusters, Interspeech
Takayuki Arai (2015), Hands-on tool producing front vowels for phonetic education: aiming for pronunciation training with tactile sensation, Interspeech
Indranil Dutta, Ayushi Pandey (2015), Acoustics of articulatory constraints: vowel classification and nasalization, Interspeech
Janina Kraus (2015), Voice-conditioned allophones of MOUTH and PRICE in Bahamian Creole, Interspeech
Marie-José Kolly, Adrian Leemann, Florian Matter (2015), Analysis of spatial variation with app-based crowdsourced audio data, Interspeech
Mátyás Jani, Catia Cucchiarini, Roeland van Hout, Helmer Strik (2015), Confusability in L2 vowels: analyzing the role of different features, Interspeech
Frank Zimmerer, Jürgen Trouvain (2015), Perception of French speakers' German vowels, Interspeech
Jagoda Bruni, Daniel Duran, Grzegorz Dogil (2015), Unintuitive phonetic behavior in Tswana post-nasal stops, Interspeech
A. P. Prathosh, A. G. Ramakrishnan, T. V. Ananthapadmanabha (2015), Classification of place-of-articulation of stop consonants using temporal analysis, Interspeech
Marissa Barlaz, Maojing Fu, Zhi-Pei Liang, Ryan Shosted, Brad Sutton (2015), The emergence of nasal velar codas in Brazilian Portuguese: an rt-MRI study, Interspeech
Elise Michon, Emmanuel Dupoux, Alejandrina Cristia (2015), Salient dimensions in implicit phonotactic learning, Interspeech
Phil Howson (2015), An acoustic examination of the three-way sibilant contrast in Lower Sorbian, Interspeech
Jiahong Yuan, Mark Liberman (2015), Investigating consonant reduction in Mandarin Chinese with improved forced alignment, Interspeech
Marianne Pouplier, Stefania Marin, Alexei Kochetov (2015), Durational characteristics and timing patterns of Russian onset clusters at two speaking rates, Interspeech
Chun Hoy Wong, Tan Lee, Yu Ting Yeung, P. C. Ching (2015), Modeling temporal dependency for robust estimation of LP model parameters in speech enhancement, Interspeech
Colin Vaz, Shrikanth S. Narayanan (2015), Learning a speech manifold for signal subspace speech denoising, Interspeech
Samy Elshamy, Nilesh Madhu, Wouter Tirry, Tim Fingscheidt (2015), An iterative speech model-based a priori SNR estimator, Interspeech
Xiao-Lei Zhang, DeLiang Wang (2015), Multi-resolution stacking for speech separation based on boosted DNN, Interspeech
Sidsel Marie Nørholm, Martin Krawczyk-Becker, Timo Gerkmann, Steven van de Par, Jesper Rindom Jensen, Mads Græsbøll Christensen (2015), Least squares estimate of the initial phases in STFT based speech enhancement, Interspeech
Sidsel Marie Nørholm, Jesper Rindom Jensen, Mads Græsbøll Christensen (2015), Enhancement of non-stationary speech using harmonic chirp filters, Interspeech
Keisuke Kinoshita, Marc Delcroix, Atsunori Ogawa, Tomohiro Nakatani (2015), Text-informed speech enhancement with deep neural networks, Interspeech
Shogo Masaya, Masashi Unoki (2015), Complex tensor factorization in modulation frequency domain for single-channel speech enhancement, Interspeech
Hyeonjoo Kang, JeeSok Lee, Soonho Baek, Hong-Goo Kang (2015), Systematic integration of acoustic echo canceller and noise reduction modules for voice communication systems, Interspeech
Chul Min Lee, Jong Won Shin, Nam Soo Kim (2015), DNN-based residual echo suppression, Interspeech
Qi He, Changchun Bao, Feng Bao (2015), Codebook-based speech enhancement using Markov process and speech-presence probability, Interspeech
Aleksej Chinaev, Reinhold Haeb-Umbach (2015), On optimal smoothing in minimum statistics based noise tracking, Interspeech
Yue Hao, Changchun Bao, Feng Bao, Feng Deng (2015), A data-driven speech enhancement method based on modeled long-range temporal dynamics, Interspeech
Florian Mayer, Pejman Mowlaee (2015), Improved phase reconstruction in single-channel speech separation, Interspeech
Tuo Zhao, Yunxin Zhao, Xin Chen (2015), Time-frequency kernel-based CNN for speech recognition, Interspeech
Philip Weber, Colin J. Champion, S. M. Houghton, Peter Jančovič, Martin Russell (2015), Consonant recognition with continuous-state hidden Markov models and perceptually-motivated features, Interspeech
Sriram Ganapathy, Samuel Thomas, Dimitrios Dimitriadis, Steven Rennie (2015), Investigating factor analysis features for deep neural networks in noisy speech recognition, Interspeech
Ruchir Travadi, Shrikanth S. Narayanan (2015), Ensemble of Gaussian mixture localized neural networks with application to phone recognition, Interspeech
Jan Pešán, Lukáš Burget, Hynek Hermansky, Karel Veselý (2015), DNN derived filters for processing of modulation spectrum of speech, Interspeech
Tasha Nagamine, Michael L. Seltzer, Nima Mesgarani (2015), Exploring how deep neural networks form phonemic categories, Interspeech
Anastassia Loukina, Melissa Lopez, Keelan Evanini, David Suendermann-Oeft, Alexei V. Ivanov, Klaus Zechner (2015), Pronunciation accuracy and intelligibility of non-native speech, Interspeech
Frank Zimmerer, Jürgen Trouvain (2015), Productions of /h/ in German: French vs. German speakers, Interspeech
Anne Bonneau, Martine Cadot (2015), German non-native realizations of French voiced fricatives in final position of a group of words, Interspeech
Catherine T. Best, Jason A. Shaw, Gerard Docherty, Bronwen G. Evans, Paul Foulkes, Jennifer Hay, Jalal Al-Tamimi, Katharine Mair, Karen E. Mulak, Sophie Wood (2015), From Newcastle MOUTH to Aussie ears: Australians' perceptual assimilation and adaptation for Newcastle UK vowels, Interspeech
Rikke Louise Bundgaard-Nielsen, Brett Baker, Olga Maxwell, Janet Fletcher (2015), Wubuy coronal stop perception by speakers of three dialects of Bangla, Interspeech
Daniel Hirst, Hongwei Ding (2015), Using melody metrics to compare English speech read by native speakers and by L2 Chinese speakers from Shanghai, Interspeech
James Gibson, Nikolaos Malandrakis, Francisco Romero, David C. Atkins, Shrikanth S. Narayanan (2015), Predicting therapist empathy in motivational interviews using language features inspired by psycholinguistic norms, Interspeech
Nikolaos Malandrakis, Shrikanth S. Narayanan (2015), Therapy language analysis using automatically generated psycholinguistic norms, Interspeech
Wei Xia, James Gibson, Bo Xiao, Brian Baucom, Panayiotis G. Georgiou (2015), A dynamic model for behavioral analysis of couple interactions using acoustic features, Interspeech
Rahul Gupta, Theodora Chaspari, Panayiotis G. Georgiou, David C. Atkins, Shrikanth S. Narayanan (2015), Analysis and modeling of the role of laughter in motivational interviewing based psychotherapy conversations, Interspeech
Francesca Bonin, Nick Campbell, Carl Vogel (2015), The discourse value of social signals at topic change moments, Interspeech
Tobias Schrank, Barbara Schuppler (2015), Automatic detection of uncertainty in spontaneous German dialogue, Interspeech
Fabien Ringeval, Erik Marchi, Marc Mehu, Klaus Scherer, Björn Schuller (2015), Face reading from speech — predicting facial action units from audio cues, Interspeech
Mahesh Kumar Nandwana, Hynek Bořil, John H. L. Hansen (2015), A new front-end for classification of non-speech sounds: a study on human whistle, Interspeech
Sri Harsha Dumpala, Bhanu Teja Nellore, Raghu Ram Nevali, Suryakanth V. Gangashetty, B. Yegnanarayana (2015), Robust features for sonorant segmentation in continuous speech, Interspeech
Sebastian Gergen, Anil Nagathil, Rainer Martin (2015), Reduction of reverberation effects in the MFCC modulation spectrum for improved classification of acoustic signals, Interspeech
Jonathan Dennis, Huy Dat Tran, Haizhou Li (2015), Spiking neural networks and the generalised Hough transform for speech pattern detection, Interspeech
Woohyun Choi, Sangwook Park, David K. Han, Hanseok Ko (2015), Acoustic event recognition using dominant spectral basis vectors, Interspeech
Inyoung Hwang, Jaeseong Sim, Sang-Hyeon Kim, Kwang-Sub Song, Joon-Hyuk Chang (2015), A statistical model-based voice activity detection using multiple DNNs and noise awareness, Interspeech
Qing Wang, Jun Du, Xiao Bao, Zi-Rui Wang, Li-Rong Dai, Chin-Hui Lee (2015), A universal VAD based on jointly trained deep neural networks, Interspeech
Ge Zhan, Zhaoqiong Huang, Dongwen Ying, Jielin Pan, Yonghong Yan (2015), Spectrographic speech mask estimation using the time-frequency correlation of speech presence, Interspeech
Houman Ghaemmaghami, David Dean, Shahram Kalantari, Sridha Sridharan, Clinton Fookes (2015), Complete-linkage clustering for voice activity detection in audio and visual speech, Interspeech
Kaavya Sriskandaraja, Vidhyasaharan Sethu, Phu Ngoc Le, Eliathamby Ambikairajah (2015), A model based voice activity detector for noisy environments, Interspeech
Fei Tao, John H. L. Hansen, Carlos Busso (2015), An unsupervised visual-only voice activity detection approach using temporal orofacial features, Interspeech
Ganna Raboshchuk, Peter Jančovič, Climent Nadeu, Alex Peiró Lilja, Münevver Köküer, Blanca Muñoz Mahamud, Ana Riverola de Veciana (2015), Automatic detection of equipment alarms in a neonatal intensive care unit environment: a knowledge-based approach, Interspeech
Jia Dai, Wenju Liu, Chongjia Ni, Like Dong, Hong Yang (2015), “Multilingual” deep neural network for music genre classification, Interspeech
Baiyang Liu, Bjorn Hoffmeister, Ariya Rastrow (2015), Accurate endpointing with expected pause duration, Interspeech
Wenbo Liu, Zhiding Yu, Bhiksha Raj, Ming Li (2015), Locality constrained transitive distance clustering on speech data, Interspeech
Miquel Espi, Masakiyo Fujimoto, Keisuke Kinoshita, Tomohiro Nakatani (2015), Feature extraction strategies in deep learning based acoustic event detection, Interspeech
Peter Transfeld, Simon Receveur, Tim Fingscheidt (2015), An acoustic event detection framework and evaluation metric for surveillance in cars, Interspeech
Abdessalam Bouchekif, Géraldine Damnati, Yannick Estève, Delphine Charlet, Nathalie Camelin (2015), Diachronic semantic cohesion for topic segmentation of TV broadcast news, Interspeech
Ivan Kraljevski, Zheng-Hua Tan, Maria Paola Bissiri (2015), Comparison of forced-alignment speech recognition and humans for generating reference VAD, Interspeech
Bernhard Lehner, Gerhard Widmer, Reinhard Sonnleitner (2015), Improving voice activity detection in movies, Interspeech
Pei-Hao Su, David Vandyke, Milica Gašić, Dongho Kim, Nikola Mrkšić, Tsung-Hsien Wen, Steve Young (2015), Learning from real users: rating dialogue success with neural networks for reinforcement learning in spoken dialogue systems, Interspeech
David Griol, Zoraida Callejas, Ramón López-Cózar (2015), A framework to develop context-aware adaptive dialogue system, Interspeech
David Griol, Zoraida Callejas (2015), A proposal to develop domain and subtask-adaptive dialog management models, Interspeech
Omar Zia Khan, Jean-Philippe Robichaud, Paul A. Crook, Ruhi Sarikaya (2015), Hypotheses ranking and state tracking for a multi-domain dialog system using multiple ASR alternates, Interspeech
Ji Wu, Miao Li, Chin-Hui Lee (2015), An entropy minimization framework for goal-driven dialogue management, Interspeech
Ingrid Zukerman, Andisheh Partovi, Su Nam Kim (2015), Context-dependent error correction of spoken referring expressions, Interspeech
Zhizheng Wu, Tomi Kinnunen (2015), Automatic speaker verification spoofing and countermeasures (ASVspoof 2015): introductory talk by the organizers, Interspeech
Zhizheng Wu, Tomi Kinnunen, Nicholas Evans, Junichi Yamagishi, Cemal Hanilçi, Md. Sahidullah, Aleksandr Sizov (2015), ASVspoof 2015: the first automatic speaker verification spoofing and countermeasures challenge, Interspeech
Jon Sanchez, Ibon Saratxaga, Inma Hernaez, Eva Navas, D. Erro (2015), The AHOLAB RPS SSD spoofing challenge 2015 submission, Interspeech
Mirjam Wester, Zhizheng Wu, Junichi Yamagishi (2015), Human vs machine spoofing detection on wideband and narrowband data, Interspeech
Xiong Xiao, Xiaohai Tian, Steven Du, Haihua Xu, Eng Siong Chng, Haizhou Li (2015), Spoofing speech detection using high dimensional magnitude and phase features: the NTU approach for ASVspoof 2015 challenge, Interspeech
Cemal Hanilçi, Tomi Kinnunen, Md. Sahidullah, Aleksandr Sizov (2015), Classifiers for synthetic speech detection: a comparison, Interspeech
Tanvina B. Patel, Hemant A. Patil (2015), Combining evidences from mel cepstral, cochlear filter cepstral and instantaneous frequency features for detection of natural vs. spoofed speech, Interspeech
Jesús Villalba, Antonio Miguel, Alfonso Ortega, Eduardo Lleida (2015), Spoofing detection with DNN and one-class SVM for the ASVspoof 2015 challenge, Interspeech
Md. Jahangir Alam, Patrick Kenny, Gautam Bhattacharya, Themos Stafylakis (2015), Development of CRIM system for the automatic speaker verification spoofing and countermeasures challenge 2015, Interspeech
Artur Janicki (2015), Spoofing countermeasure based on analysis of linear prediction error, Interspeech
Yi Liu, Yao Tian, Liang He, Jia Liu, Michael T. Johnson (2015), Simultaneous utilization of spectral magnitude and phase information to extract supervectors for speaker verification anti-spoofing, Interspeech
Md. Sahidullah, Tomi Kinnunen, Cemal Hanilçi (2015), A comparison of features for synthetic speech detection, Interspeech
Longbiao Wang, Yohei Yoshida, Yuta Kawakami, Seiichi Nakagawa (2015), Relative phase information for detecting human speech and spoofed speech, Interspeech
Nanxin Chen, Yanmin Qian, Heinrich Dinkel, Bo Chen, Kai Yu (2015), Robust deep feature for spoofing detection — the SJTU system for ASVspoof 2015 challenge, Interspeech
Junichi Yamagishi, Nicholas Evans (2015), Automatic speaker verification spoofing and countermeasures (ASVspoof 2015): open discussion and future plans, Interspeech
Kyungmin Lee, Chiyoun Park, Ilhwan Kim, Namhoon Kim, Jaewon Lee (2015), Applying GPGPU to recurrent neural network language model based fast network search in the real-time LVCSR, Interspeech
Youssef Oualil, Marc Schulder, Hartmut Helmke, Anna Schmidt, Dietrich Klakow (2015), Real-time integration of dynamic context information for improving automatic speech recognition, Interspeech
Cyril Allauzen, Michael Riley (2015), Rapid vocabulary addition to context-dependent decoder graphs, Interspeech
Hainan Xu, Guoguo Chen, Daniel Povey, Sanjeev Khudanpur (2015), Modeling phonetic context with non-random forests for speech recognition, Interspeech
Benjamin Lecouteux, Didier Schwab (2015), Ant colony algorithm applied to automatic speech recognition graph decoding, Interspeech
Christophe Van Gysel, Leonid Velikovich, Ian McGraw, Françoise Beaufays (2015), Garbage modeling for on-device speech recognition, Interspeech
Haihua Xu, Van Hai Do, Xiong Xiao, Eng Siong Chng (2015), A comparative study of BNF and DNN multilingual training on cross-lingual low-resource speech recognition, Interspeech
Martin Ratajczak, Sebastian Tschiatschek, Franz Pernkopf (2015), Neural higher-order factors in conditional random fields for phoneme classification, Interspeech
Shahab Jalalvand, Daniele Falavigna (2015), Stacked auto-encoder for ASR error detection and word error rate prediction, Interspeech
Satyabrata Parida, Ashok Kumar Pattem, Prasanta Kumar Ghosh (2015), Estimation of the air-tissue boundaries of the vocal tract in the mid-sagittal plane from electromagnetic articulograph data, Interspeech
Claudia Canevari, Leonardo Badino, Luciano Fadiga (2015), A new Italian dataset of parallel acoustic and articulatory data, Interspeech
Tamás Gábor Csapó, Steven M. Lulich (2015), Error analysis of extracted tongue contours from 2D ultrasound images, Interspeech
Andrea Bandini, Slim Ouni, Piero Cosi, Silvia Orlandi, Claudia Manfredi (2015), Accuracy of a markerless acquisition technique for studying speech articulators, Interspeech
Yujie Chi, Kiyoshi Honda, Jianguo Wei, Hui Feng, Jianwu Dang (2015), Measuring oral and nasal airflow in production of Chinese plosive, Interspeech
Carlo Drioli, Gian Luca Foresti (2015), Enhanced videokymographic data analysis based on vocal folds dynamics modeling, Interspeech
Andrew J. Kolb, Michael T. Johnson, Jeffrey Berry (2015), Interpolation of tongue fleshpoint kinematics from combined EMA position and orientation data, Interspeech
Gustavo Andrade-Miranda, Nathalie Henrich Bernardoni, Juan Ignacio Godino-Llorente (2015), A new technique for assessing glottal dynamics in speech and singing by means of optical-flow computation, Interspeech
Alexei Kochetov, Phil Howson (2015), On the incompatibility of trilling and palatalization: a single-subject study of sustained apical and uvular trills, Interspeech
Pengcheng Zhu, Lei Xie, Yunlin Chen (2015), Articulatory movement prediction using deep bidirectional long short-term memory based recurrent neural networks and word/phone embeddings, Interspeech
Nicholas Ruiz, Qin Gao, William Lewis, Marcello Federico (2015), Adapting machine translation models toward misrecognized speech with text-to-speech pronunciation rules and acoustic confusability, Interspeech
Frederic Bechet, Benoit Favre, Mickael Rouvier (2015), “Speech is silver, but silence is golden”: improving speech-to-speech translation performance by slashing users input, Interspeech
Raymond W. M. Ng, Kashif Shah, Lucia Specia, Thomas Hain (2015), A study on the stability and effectiveness of features in quality estimation for spoken language translation, Interspeech
Joris Pelemans, Tom Vanallemeersch, Kris Demuynck, Hugo Van hamme, Patrick Wambacq (2015), Efficient language model adaptation for automatic speech recognition of spoken translations, Interspeech
Takashi Mieno, Graham Neubig, Sakriani Sakti, Tomoki Toda, Satoshi Nakamura (2015), Speed or accuracy? a study in evaluation of simultaneous speech translation, Interspeech
Marcin Junczys-Dowmunt, Paweł Przybysz, Arleta Staszuk, Eun-Kyoung Kim, Jaewon Lee (2015), Large scale speech-to-text translation with out-of-domain corpora using better context-based models and domain adaptation, Interspeech
Patrick Kenny, Themos Stafylakis, Md. Jahangir Alam, Marcel Kockmann (2015), An i-vector backend for speaker verification, Interspeech
Joana Correia, Alessio Brutti, Alberto Abad (2015), Multi-channel speaker verification based on total variability modelling, Interspeech
Na Li, Man-Wai Mak (2015), SNR-invariant PLDA modeling for robust speaker verification, Interspeech
Md. Hafizur Rahman, David Dean, Ahilan Kanagasundaram, Sridha Sridharan (2015), Investigating in-domain data requirements for PLDA training, Interspeech
Ondřej Glembek, Pavel Matějka, Oldřich Plchot, Jan Pešán, Lukáš Burget, Petr Schwarz (2015), Migrating i-vectors between speaker recognition systems using regression neural networks, Interspeech
Ahilan Kanagasundaram, David Dean, Sridha Sridharan (2015), Improving PLDA speaker verification using WMFD and linear-weighted approaches in limited microphone data conditions, Interspeech
Christer Gobl, Irena Yanushevskaya, Ailbhe Ní Chasaide (2015), The relationship between voice source parameters and the maxima dispersion quotient (MDQ), Interspeech
Manu Airaksinen, Tom Bäckström, Paavo Alku (2015), Glottal inverse filtering based on quadratic programming, Interspeech
N. P. Narendra, K. Sreenivasa Rao (2015), Automatic detection of creaky voice using epoch parameters, Interspeech
Rikke Louise Bundgaard-Nielsen, Brett Baker (2015), Perception of voicing in the absence of native voicing experience, Interspeech
Jody Kreiman, Soo Jin Park, Patricia A. Keating, Abeer Alwan (2015), The relationship between acoustic and perceived intraspeaker variability in voice quality, Interspeech
Li Jiao, Qiuwu Ma, Ting Wang, Yi Xu (2015), Perceptual cues of whispered tones: are they really special?, Interspeech
Tsuyoshi Morioka, Tomoharu Iwata, Takaaki Hori, Tetsunori Kobayashi (2015), Multiscale recurrent neural network based language model, Interspeech
Kazuki Irie, Ralf Schlüter, Hermann Ney (2015), Bag-of-words input for long history representation in neural network-based language models for speech recognition, Interspeech
Ahmad Emami (2015), Efficient machine translation decoding with slow language models, Interspeech
Ryo Masumura, Taichi Asami, Takanobu Oba, Hirokazu Masataki, Sumitaka Sakauchi, Akinori Ito (2015), Latent words recurrent neural network language models, Interspeech
Vataya Chunwijitra, Ananlada Chotimongkol, Chai Wutiwiwatchai (2015), Combining multiple-type input units using recurrent neural network for LVCSR language modeling, Interspeech
Siva Reddy Gangireddy, Steve Renals, Yoshihiko Nankaku, Akinobu Lee (2015), Prosodically-enhanced recurrent neural network language models, Interspeech
Matthias Janke, Michael Wand (2015), Biosignal-based spoken communication: welcome and introduction, Interspeech
Peter Anderson, Negar M. Harandi, Scott Moisik, Ian Stavness, Sidney Fels (2015), A comprehensive 3D biomechanically-driven vocal tract model including inverse dynamics for speech research, Interspeech
Ian McLoughlin, Yan Song (2015), Low frequency ultrasonic voice activity detection using convolutional neural networks, Interspeech
Florent Bocquelet, Thomas Hueber, Laurent Girin, Christophe Savariaux, Blaise Yvert (2015), Real-time control of a DNN-based articulatory synthesizer for silent speech conversion: a pilot study, Interspeech
Diandra Fabre, Thomas Hueber, Florent Bocquelet, Pierre Badin (2015), Tongue tracking in ultrasound images using eigentongue decomposition and artificial neural networks, Interspeech
Jun Wang, Seongjun Hahm (2015), Speaker-independent silent speech recognition with across-speaker articulatory normalization and speaker adaptive training, Interspeech
Lorenz Diener, Matthias Janke, Tanja Schultz (2015), Codebook clustering for unit selection based EMG-to-speech conversion, Interspeech
Majid Mirbagheri, Bradley Ekin, Les Atlas, Adrian K. C. Lee (2015), Flexible tracking of auditory attention, Interspeech
Matthias Janke, Michael Wand (2015), Biosignal-based spoken communication: panel and discussion, Interspeech
Seyedmahdad Mirsamadi, John H. L. Hansen (2015), A study on deep neural network acoustic model adaptation for robust far-field speech recognition, Interspeech
Masato Mimura, Shinsuke Sakai, Tatsuya Kawahara (2015), Speech dereverberation using long short-term memory, Interspeech
Vijayaditya Peddinti, Guoguo Chen, Daniel Povey, Sanjeev Khudanpur (2015), Reverberation robust acoustic modeling using i-vectors with time delay neural networks, Interspeech
Kshitiz Kumar, Chaojun Liu, Yifan Gong (2015), Delta-melspectra features for noise robustness to DNN-based ASR systems, Interspeech
Vikramjit Mitra, Julien Van Hout, Mitchell McLaren, Wen Wang, Martin Graciarena, Dimitra Vergyri, Horacio Franco (2015), Combating reverberation in large vocabulary continuous speech recognition, Interspeech
Martin Karafiát, František Grézl, Lukáš Burget, Igor Szöke, Jan Černocký (2015), Three ways to adapt a CTS recognizer to unseen reverberated speech in BUT system for the ASpIRE challenge, Interspeech
Mark J. Harvilla, Richard M. Stern (2015), Robust parameter estimation for audio declipping in noise, Interspeech
Bin Huang, Dengfeng Ke, Hao Zheng, Bo Xu, Yanyan Xu, Kaile Su (2015), Multi-task learning deep neural networks for speech feature denoising, Interspeech
Yuxuan Wang, Ananya Misra, Kean K. Chin (2015), Time-frequency masking for large scale robust speech recognition, Interspeech
Rongfeng Su, Xurong Xie, Xunying Liu, Lan Wang (2015), Efficient use of DNN bottleneck features in generalized variable parameter HMMs for noise robust speech recognition, Interspeech
Deepak Baby, Hugo Van hamme (2015), Investigating modulation spectrogram features for deep neural network-based automatic speech recognition, Interspeech
Kun Han, Yanzhang He, Deblin Bagchi, Eric Fosler-Lussier, DeLiang Wang (2015), Deep neural network based spectral feature mapping for robust speech recognition, Interspeech
Bo Xiao, Zac E. Imel, David C. Atkins, Panayiotis G. Georgiou, Shrikanth S. Narayanan (2015), Analyzing speech rate entrainment and its relation to therapist empathy in drug addiction counseling, Interspeech
Atsushi Ando, Taichi Asami, Manabu Okamoto, Hirokazu Masataki, Sumitaka Sakauchi (2015), Agreement and disagreement utterance detection in conversational speech by extracting and integrating local features, Interspeech
Md. Nasir, Wei Xia, Bo Xiao, Brian Baucom, Shrikanth S. Narayanan, Panayiotis G. Georgiou (2015), Still together?: the role of acoustic features in predicting marital outcome, Interspeech
Gábor Gosztolya (2015), On evaluation metrics for social signal detection, Interspeech
Lakshmish Kaushik, Abhijeet Sangwan, John H. L. Hansen (2015), Laughter and filler detection in naturalistic audio, Interspeech
Aasish Pappu, Amanda Stent (2015), Automatic formatted transcripts for videos, Interspeech
Lucas Azaïs, Adrien Payan, Tianjiao Sun, Guillaume Vidal, Tina Zhang, Eduardo Coutinho, Florian Eyben, Björn Schuller (2015), Does my speech rock? automatic assessment of public speaking skills, Interspeech
Roman Sergienko, Alexander Schmitt (2015), Verbal intelligence identification based on text classification, Interspeech
Shan-Wen Hsiao, Hung-Ching Sun, Ming-Chuan Hsieh, Ming-Hsueh Tsai, Hsin-Chih Lin, Chi-Chun Lee (2015), A multimodal approach for automatic assessment of school principals' oral presentation during pre-service training program, Interspeech
T. J. Tsai (2015), Are you TED talk material? comparing prosody in professors and TED speakers, Interspeech
Hayakawa Akira, Fasih Haider, Loredana Cerrato, Nick Campbell, Saturnino Luz (2015), Detection of cognitive states and their correlation to speech recognition performance in speech-to-speech machine translation systems, Interspeech
Friedemann Köster, Sebastian Möller (2015), Perceptual speech quality dimensions in a conversational situation, Interspeech
Jens Berger, Anna Llagostera (2015), Multidimensional evaluation and predicting overall speech quality, Interspeech
Andreas Gaich, Pejman Mowlaee (2015), On speech intelligibility estimation of phase-aware single-channel speech enhancement, Interspeech
Ricard Marxer, Martin Cooke, Jon Barker (2015), A framework for the evaluation of microscopic intelligibility models, Interspeech
Asger Heidemann Andersen, Jan Mark de Haan, Zheng-Hua Tan, Jesper Jensen (2015), A binaural short time objective intelligibility measure for noisy and enhanced speech, Interspeech
Yan Tang, Martin Cooke, Bruno M. Fazenda, Trevor J. Cox (2015), A glimpse-based approach for predicting binaural intelligibility with single and multiple maskers in anechoic conditions, Interspeech
Fei Chen (2015), Improving the prediction power of the speech transmission index to account for non-linear distortions introduced by noise-reduction algorithms, Interspeech
Kehuang Li, Zhen Huang, Yong Xu, Chin-Hui Lee (2015), DNN-based speech bandwidth expansion and its application to adding high-frequency missing features for automatic speech recognition of narrowband speech, Interspeech
Hannu Pulakka, Ville Myllylä, Anssi Rämö, Paavo Alku (2015), Speech quality evaluation of artificial bandwidth extension: comparing subjective judgments and instrumental predictions, Interspeech
M. A. Tuğtekin Turan, Engin Erzin (2015), Synchronous overlap and add of spectra for enhancement of excitation in artificial bandwidth extension of speech, Interspeech
Yingxue Wang, Shenghui Zhao, Wenbo Liu, Ming Li, Jingming Kuang (2015), Speech bandwidth expansion based on deep neural networks, Interspeech
Bin Liu, Jianhua Tao, Zhengqi Wen, Ya Li, Danish Bukhari (2015), A novel method of artificial bandwidth extension using deep architecture, Interspeech
Rogier C. van Dalen, Mark J. F. Gales (2015), Annotating large lattices with the exact word error, Interspeech
Vimal Manohar, Daniel Povey, Sanjeev Khudanpur (2015), Semi-supervised maximum mutual information training of deep neural network acoustic models, Interspeech
Shiliang Zhang, Hui Jiang, Si Wei, Li-Rong Dai (2015), Rectified linear neural networks with tied-scalar regularization for LVCSR, Interspeech
Yanzhang He, Eric Fosler-Lussier (2015), Segmental conditional random fields with deep neural networks as acoustic models for first-pass word recognition, Interspeech
Dongpeng Chen, Brian Mak (2015), Distinct triphone acoustic modeling using deep neural networks, Interspeech
Gregory Gelly, Jean-Luc Gauvain (2015), Minimum word error training of RNN-based voice activity detection, Interspeech
Thomas F. Quatieri, James R. Williamson, Christopher J. Smalt, Tejash Patel, Joseph Perricone, Daryush D. Mehta, Brian S. Helfer, Gregory Ciccarelli, Darrell Ricke, Nicolas Malyska, Jeff Palmer, Kristin Heaton, Marianna Eddy, Joseph Moran (2015), Vocal biomarkers to discriminate cognitive load in a working memory task, Interspeech
Chunlei Zhang, Gang Liu, Chengzhu Yu, John H. L. Hansen (2015), I-vector based physical task stress detection with different fusion strategies, Interspeech
László Tóth, Gábor Gosztolya, Veronika Vincze, Ildikó Hoffmann, Gréta Szatlóczki, Edit Biró, Fruzsina Zsura, Magdolna Pákáski, János Kálmán (2015), Automatic detection of mild cognitive impairment from spontaneous speech using ASR, Interspeech
Maxim Sidorov, Christina Brester, Alexander Schmitt (2015), Contemporary stochastic feature selection algorithms for speech-based emotion recognition, Interspeech
Carlos A. Ferrer, Diana Torres, Eduardo González, José Ramón Calvo, Eduardo Castillo (2015), Effect of different jitter-induced glottal pulse shape changes in periodicity perturbation measures, Interspeech
Lakshmish Kaushik, Abhijeet Sangwan, John H. L. Hansen (2015), Automatic audio sentiment extraction using keyword spotting, Interspeech
Panupong Pasupat, Dilek Hakkani-Tür (2015), Unsupervised relation detection using automatic alignment of query patterns extracted from knowledge graphs and query click logs, Interspeech
The Tung Nguyen, Graham Neubig, Hiroyuki Shindo, Sakriani Sakti, Tomoki Toda, Satoshi Nakamura (2015), A latent variable model for joint pause prediction and dependency parsing, Interspeech
Mohammad Hadi Bokaei, Hossein Sameti, Yang Liu (2015), Extractive meeting summarization through speaker zone detection, Interspeech
Shih-Hung Liu, Kuan-Yu Chen, Berlin Chen, Hsin-Min Wang, Hsu-Chun Yen, Wen-Lian Hsu (2015), Positional language modeling for extractive broadcast news speech summarization, Interspeech
Saeid Mokaram, Roger K. Moore (2015), Speech-based location estimation of first responders in a simulated search and rescue scenario, Interspeech
Tahir Sousa, Lucie Flekova, Margot Mieskes, Iryna Gurevych (2015), Constructive feedback, thinking process and cooperation: assessing the quality of classroom interaction, Interspeech
Dong-Yan Huang, Minghui Dong, Haizhou Li (2015), A real-time variable-Q non-stationary Gabor transform for pitch shifting, Interspeech
Ryo Aihara, Tetsuya Takiguchi, Yasuo Ariki (2015), Many-to-many voice conversion based on multiple non-negative matrix factorization, Interspeech
Kazuhiro Kobayashi, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura (2015), Statistical singing voice conversion based on direct waveform modification with global variance, Interspeech
Xiaohai Tian, Zhizheng Wu, Siu Wa Lee, Quy Hy Nguyen, Minghui Dong, Eng Siong Chng (2015), System fusion for high-performance voice conversion, Interspeech
Agustin Alonso, D. Erro, Eva Navas, Inma Hernaez (2015), Speaker adaptation using only vocalic segments via frequency warping, Interspeech
Yusuke Tajiri, Kou Tanaka, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura (2015), Non-audible murmur enhancement based on statistical conversion using air- and body-conductive microphones in noisy environments, Interspeech
Tim Polzehl, Gina-Anne Levow (2015), Advanced crowdsourcing for speech and beyond: introduction by the organizers, Interspeech
Preethi Jyothi, Mark Hasegawa-Johnson (2015), Transcribing continuous speech using mismatched crowdsourcing, Interspeech
Shammur Absar Chowdhury, Marcos Calvo, Arindam Ghosh, Evgeny A. Stepanov, Ali Orkan Bayer, Giuseppe Riccardi, Fernando García, Emilio Sanchis (2015), Selection and aggregation techniques for crowdsourced semantic annotation task, Interspeech
Spencer Rothwell, Ahmad Elshenawy, Steele Carter, Daniela Braga, Faraz Romani, Michael Kennewick, Bob Kennewick (2015), Controlling quality and handling fraud in large scale crowdsourcing speech data collections, Interspeech
Spencer Rothwell, Steele Carter, Ahmad Elshenawy, Vladislavs Dovgalecs, Safiyyah Saleem, Daniela Braga, Bob Kennewick (2015), Data collection and annotation for state-of-the-art NER using unmanaged crowds, Interspeech
Tim Polzehl, Babak Naderi, Friedemann Köster, Sebastian Möller (2015), Robustness in speech quality assessment and temporal training expiry in mobile crowdsourcing environments, Interspeech
Babak Naderi, Tim Polzehl, Ina Wechsung, Friedemann Köster, Sebastian Möller (2015), Effect of trapping questions on the reliability of speech quality judgments in a crowdsourcing paradigm, Interspeech
Adrian Leemann, Marie-José Kolly, Jean-Philippe Goldman, Volker Dellwo, Ingrid Hove, Ibrahim Almajai, Sarah Grimm, Sylvain Robert, Daniel Wanitsch (2015), Voice Äpp: a mobile app for crowdsourcing Swiss German dialect data, Interspeech
Anastassia Loukina, Melissa Lopez, Keelan Evanini, David Suendermann-Oeft, Klaus Zechner (2015), Expert and crowdsourced annotation of pronunciation errors for automatic scoring systems, Interspeech
Hernisa Kacorri, Kaoru Shinkawa, Shin Saito (2015), CapCap: an output-agreement game for video captioning, Interspeech
Pepi Burgos, Eric Sanders, Catia Cucchiarini, Roeland van Hout, Helmer Strik (2015), Auris populi: crowdsourced native transcriptions of Dutch vowels spoken by adult Spanish learners, Interspeech
Samantha Wray, Ahmed Ali (2015), Crowdsource a little to label a lot: labeling a speech corpus of dialectal Arabic, Interspeech
Yashesh Gaur, Florian Metze, Yajie Miao, Jeffrey P. Bigham (2015), Using keyword spotting to help humans correct captioning faster, Interspeech
Tara McAllister Byun, Elaine Hitchcock, Daphna Harel (2015), Validating and optimizing a crowdsourced method for gradient measures of child speech, Interspeech
Zhong-Qiu Wang, DeLiang Wang (2015), Joint training of speech separation, filterbank and acoustic model for robust automatic speech recognition, Interspeech
Shakti Rath, Sunil Sivadas, Bin Ma (2015), Joint environment and speaker normalization using factored front-end CMLLR, Interspeech
Akihiro Abe, Kazumasa Yamamoto, Seiichi Nakagawa (2015), Robust speech recognition using DNN-HMM acoustic model combining noise-aware training with spectral subtraction, Interspeech
Chengzhu Yu, Atsunori Ogawa, Marc Delcroix, Takuya Yoshioka, Tomohiro Nakatani, John H. L. Hansen (2015), Robust i-vector extraction for neural network adaptation in noisy environment, Interspeech
Michal Borsky, Petr Mizera, Petr Pollak (2015), Spectrally selective dithering for distorted speech recognition, Interspeech
Liang Lu, Steve Renals (2015), Feature-space speaker adaptation for probabilistic linear discriminant analysis acoustic models, Interspeech
Patrick Cardinal, Najim Dehak, Yu Zhang, James Glass (2015), Speaker adaptation using the i-vector technique for bottleneck features, Interspeech
Penny Karanasou, Mark J. F. Gales, Philip C. Woodland (2015), I-vector estimation using informative priors for adaptation of deep neural networks, Interspeech
Sri Garimella, Arindam Mandal, Nikko Strom, Bjorn Hoffmeister, Spyros Matsoukas, Sree Hari Krishnan Parthasarathi (2015), Robust i-vector based adaptation of DNN acoustic model for speech recognition, Interspeech
Natalia Tomashenko, Yuri Khokhlov (2015), GMM-derived features for effective unsupervised adaptation of deep neural network acoustic models, Interspeech
Roger Hsiao, Tim Ng, Stavros Tsakalidis, Long Nguyen, Richard Schwartz (2015), Unsupervised adaptation for deep neural network using linear least square method, Interspeech
Sheng Li, Xugang Lu, Yuya Akita, Tatsuya Kawahara (2015), Ensemble speaker modeling using speaker adaptive training deep neural network for speaker adaptation, Interspeech
Mortaza Doulaty, Oscar Saz, Thomas Hain (2015), Data-selective transfer learning for multi-domain speech recognition, Interspeech
Tomas Lustyk, Petr Bergl, Tino Haderlein, Elmar Nöth, Roman Cmejla (2015), Language-independent method for analysis of German stuttering recordings, Interspeech
Ahmed Al-nasheri, Zulfiqar Ali, Ghulam Muhammad, Mansour Alsulaiman (2015), An investigation of MDVP parameters for voice pathology detection on three different databases, Interspeech
Jiantao Wu, Ping Yu, Nan Yan, Lan Wang, Xiaohui Yang, Manwa L. Ng (2015), Energy distribution analysis and nonlinear dynamical analysis of adductor spasmodic dysphonia, Interspeech
Benjawan Kasisopa, Nittayapa Klangpornkun, Denis Burnham (2015), Auditory-visual tone perception in hearing impaired Thai listeners, Interspeech
Panying Rong, Yana Yunusova, Jordan R. Green (2015), Speech intelligibility decline in individuals with fast and slow rates of ALS progression, Interspeech
Rong Na A, Koichi Mori, Naomi Sakai (2015), Latency analysis of speech shadowing reveals processing differences in Japanese adults who do and do not stutter, Interspeech
Brigitte Bigi, Katarzyna Klessa, Laurianne Georgeton, Christine Meunier (2015), A syllable-based analysis of speech temporal organization: a comparison between speaking styles in dysarthric and healthy populations, Interspeech
Bernd T. Meyer, Birger Kollmeier, Jasper Ooster (2015), Autonomous measurement of speech intelligibility utilizing automatic speech recognition, Interspeech
Monja Angelika Knoll, Melissa Johnstone, Charlene Blakely (2015), Can you hear me? acoustic modifications in speech directed to foreigners and hearing-impaired people, Interspeech
Yu Ting Yeung, Ka Ho Wong, Helen Meng (2015), Improving automatic forced alignment for dysarthric speech transcription, Interspeech
Marcin Włodarczak, Mattias Heldner, Jens Edlund (2015), Communicative needs and respiratory constraints, Interspeech
Uwe D. Reichel, Nina Pörner, Dianne Nowack, Jennifer Cole (2015), Analysis and classification of cooperative and competitive dialogs, Interspeech
Alessandra Cervone, Catherine Lai, Silvia Pareti, Peter Bell (2015), Towards automatic detection of reported speech in dialogue using prosodic cues, Interspeech
Andrew Rosenberg, Raul Fernandez, Bhuvana Ramabhadran (2015), Modeling phrasing and prominence using deep recurrent learning, Interspeech
Céline De Looze, Irena Yanushevskaya, Andy Murphy, Eoghan O'Connor, Christer Gobl (2015), Pitch declination and reset as a function of utterance duration in conversational speech data, Interspeech
Valerie Freeman, Gina-Anne Levow, Richard Wright, Mari Ostendorf (2015), Investigating the role of 'yeah' in stance-dense conversation, Interspeech
Jiyoun Choi, Mirjam Broersma, Anne Cutler (2015), Enhanced processing of a lost language: linguistic knowledge or linguistic skill?, Interspeech
Ann-Kathrin Grohe, Gregory J. Poarch, Adriana Hanulíková, Andrea Weber (2015), Production inconsistencies delay adaptation to foreign accents, Interspeech
Mikhail Ordin, Leona Polyanskaya (2015), Acquisition of English speech rhythm by monolingual children, Interspeech
Odette Scharenborg (2015), Durational information in word-initial lexical embeddings in spoken Dutch, Interspeech
Fei Chen, Nan Yan, Lan Wang, Tao Yang, Jiantao Wu, Han Zhao, Gang Peng (2015), The development of categorical perception of lexical tones in Mandarin-speaking preschoolers, Interspeech
Tomohiko Ooigawa (2015), Perception of Italian liquids by Japanese listeners: comparisons to Spanish liquids, Interspeech
George Saon, Hong-Kwang J. Kuo, Steven Rennie, Michael Picheny (2015), The IBM 2015 English conversational telephone speech recognition system, Interspeech
Xunying Liu, Federico Flego, Linlin Wang, C. Zhang, Mark J. F. Gales, Philip C. Woodland (2015), The Cambridge University 2014 BOLT conversational telephone Mandarin Chinese LVCSR system for speech translation, Interspeech
Samuel Thomas, George Saon, Hong-Kwang J. Kuo, Lidia Mangu (2015), The IBM BOLT speech transcription system, Interspeech
M. Ali Basha Shaik, Zoltán Tüske, M. Ali Tahir, Markus Nußbaum-Thom, Ralf Schlüter, Hermann Ney (2015), Improvements in RWTH LVCSR evaluation systems for Polish, Portuguese, English, Urdu, and Arabic, Interspeech
Thiago Fraga-Silva, Jean-Luc Gauvain, Lori Lamel, Antoine Laurent, Viet-Bac Le, Abdel Messaoudi (2015), Active learning based data selection for limited resource STT and KWS, Interspeech
Preethi Jyothi, Mark Hasegawa-Johnson (2015), Improved Hindi broadcast ASR by adapting the language model and pronunciation model using a priori syntactic and morphophonemic knowledge, Interspeech
Maarten Versteegh, Roland Thiollière, Thomas Schatz, Xuan Nga Cao, Xavier Anguera, Aren Jansen, Emmanuel Dupoux (2015), The zero resource speech challenge 2015, Interspeech
Leonardo Badino, Alessio Mereta, Lorenzo Rosasco (2015), Discovering discrete subword units with binarized autoencoders and hidden-Markov-model encoders, Interspeech
Roland Thiollière, Ewan Dunbar, Gabriel Synnaeve, Maarten Versteegh, Emmanuel Dupoux (2015), A hybrid dynamic time warping-deep neural network architecture for unsupervised acoustic modeling, Interspeech
Wiehan Agenbag, Thomas Niesler (2015), Automatic segmentation and clustering of speech using sparse coding and metaheuristic search, Interspeech
Hongjie Chen, Cheung-Chi Leung, Lei Xie, Bin Ma, Haizhou Li (2015), Parallel inference of Dirichlet process Gaussian mixture models for unsupervised acoustic modeling: a feasibility study, Interspeech
Pallavi Baljekar, Sunayana Sitaram, Prasanna Kumar Muthukumar, Alan W. Black (2015), Using articulatory features and inferred phonological segments in zero resource speech processing, Interspeech
Daniel Renshaw, Herman Kamper, Aren Jansen, Sharon Goldwater (2015), A comparison of neural network methods for unsupervised representation learning on the zero resource speech challenge, Interspeech
Okko Räsänen, Gabriel Doyle, Michael C. Frank (2015), Unsupervised word discovery from speech using automatic segmentation into syllable-like units, Interspeech
Vince Lyzinski, Gregory Sell, Aren Jansen (2015), An evaluation of graph clustering methods for unsupervised term discovery, Interspeech
Vijayaditya Peddinti, Daniel Povey, Sanjeev Khudanpur (2015), A time delay neural network architecture for efficient modeling of long temporal contexts, Interspeech
Xiangang Li, Xihong Wu (2015), Long short-term memory based convolutional recurrent neural networks for large vocabulary speech recognition, Interspeech
C. Zhang, Philip C. Woodland (2015), Parameterised sigmoid and ReLU hidden activation functions for DNN acoustic modelling, Interspeech
Chiyuan Zhang, Stephen Voinea, Georgios Evangelopoulos, Lorenzo Rosasco, Tomaso Poggio (2015), Discriminative template learning in group-convolutional networks for invariant speech representations, Interspeech
Sunil Sivadas, Zhenzhou Wu, Ma Bin (2015), Investigation of parametric rectified linear units for noise robust speech recognition, Interspeech
Hang Su, Haihua Xu (2015), Multi-softmax deep neural network for semi-supervised training, Interspeech
Jia Cui, George Saon, Bhuvana Ramabhadran, Brian Kingsbury (2015), A multi-region deep neural network model in speech recognition, Interspeech
Liang Lu, Xingxing Zhang, Kyunghyun Cho, Steve Renals (2015), A study of the recurrent neural network encoder-decoder for large vocabulary speech recognition, Interspeech
Linchen Zhu, Kevin Kilgour, Sebastian Stüker, Alex Waibel (2015), Gaussian free cluster tree construction using deep neural network, Interspeech
Mengxiao Bi, Yanmin Qian, Kai Yu (2015), Very deep convolutional neural networks for LVCSR, Interspeech
William Chan, Nan Rosemary Ke, Ian Lane (2015), Transferring knowledge from a RNN to a DNN, Interspeech
Changliang Liu, Jinyu Li, Yifan Gong (2015), SVD-based universal DNN modeling for multiple scenarios, Interspeech
Zhuo Chen, Shinji Watanabe, Hakan Erdogan, John R. Hershey (2015), Speech enhancement and recognition using multi-task learning of long short-term memory recurrent neural networks, Interspeech
Yuzhou Liu, DeLiang Wang (2015), Speaker-dependent multipitch tracking using deep neural networks, Interspeech
Sujith P., A. P. Prathosh, A. G. Ramakrishnan, Prasanta Kumar Ghosh (2015), An error correction scheme for GCI detection algorithms using pitch smoothness criterion, Interspeech
RaviShankar Prasad, B. Yegnanarayana (2015), Robust pitch estimation in noisy speech using ZTW and group delay function, Interspeech
Zhaoqiong Huang, Ge Zhan, Dongwen Ying, Yonghong Yan (2015), Robust localization of single sound source based on phase difference regression, Interspeech
Daniele Salvati, Carlo Drioli, Gian Luca Foresti (2015), Frequency map selection using a RBFN-based classifier in the MVDR beamformer for speaker localization in reverberant rooms, Interspeech
Ning Ma, Guy J. Brown, Tobias May (2015), Exploiting deep neural networks and head movements for binaural localisation of multiple speakers in reverberant conditions, Interspeech
Shuai Nie, Wei Xue, Shan Liang, Xueliang Zhang, Wenju Liu, Liwei Qiao, Jianping Li (2015), Joint optimization of recurrent networks exploiting source auto-regression for source separation, Interspeech
Rong Gong, Philippe Cuvillier, Nicolas Obin, Arshia Cont (2015), Real-time audio-to-score alignment of singing voice based on melody and lyric information, Interspeech
Jun-Yong Lee, Hye-Seung Cho, Hyoung-Gook Kim (2015), Vocal separation from monaural music using adaptive auditory filtering based on kernel back-fitting, Interspeech
Frederick Z. Yen, Mao-Chang Huang, Tai-Shih Chi (2015), A two-stage singing voice separation algorithm using spectro-temporal modulation features, Interspeech
Hyungjun Lim, Myung Jong Kim, Hoirin Kim (2015), Robust sound event classification using LBP-HOG based bag-of-audio-words feature representation, Interspeech
Mikko Tiainen, Lari Vainio, Kaisa Tiippana, Naeem Komeilipoor, Martti Vainio (2015), Action planning and congruency effect between articulation and grasping, Interspeech
Ron M. Hecht, Aharon Bar-Hillel, Stas Tiomkin, Hadar Levi, Omer Tsimhoni, Naftali Tishby (2015), Cognitive workload and vocabulary sparseness: theory and practice, Interspeech
Valentin Andrei, Horia Cucu, Andi Buzo, Corneliu Burileanu (2015), Counting competing speakers in a timeframe — human versus computer, Interspeech
Fei Chen, Alexander Siu Tai Kwok (2015), Segmental contribution to the intelligibility of ideal binary-masked sentences, Interspeech
Mako Ishida, Takayuki Arai (2015), Perception of an existing and non-existing L2 English phoneme behind noise by Japanese native speakers, Interspeech
Chitralekha Bhat, Sunil Kopparapu (2015), Viseme comparison based on phonetic cues for varying speech accents, Interspeech
Colm O'Reilly, Nicola M. Marples, David J. Kelly, Naomi Harte (2015), Quantifying difference in vocalizations of bird populations, Interspeech
Jae Choi, Jeunghun Kim, Shin Jae Kang, Nam Soo Kim (2015), Reverberation-robust acoustic indoor localization, Interspeech
Huaiping Ming, Dong-Yan Huang, Lei Xie, Haizhou Li, Minghui Dong (2015), An alternating optimization approach for phase retrieval, Interspeech
Xiong Xiao, Shengkui Zhao, Xionghu Zhong, Douglas L. Jones, Eng Siong Chng, Haizhou Li (2015), Learning to estimate reverberation time in noisy and reverberant rooms, Interspeech
Cheng Pang, Jie Zhang, Hong Liu (2015), Direction of arrival estimation based on reverberation weighting and noise error estimator, Interspeech
Huy Phan, Lars Hertel, Marco Maass, Radoslaw Mazur, Alfred Mertins (2015), Representing nonspeech audio signals through speech classification models, Interspeech
Luciana Ferrer, Mitchell McLaren, Aaron Lawson, Martin Graciarena (2015), Mitigating the effects of non-stationary unseen noises on language recognition performance, Interspeech
Moez Ajili, Jean-François Bonastre, Solange Rossato, Juliette Kahn, Itshak Lapidot (2015), An information theory based data-homogeneity measure for voice comparison, Interspeech
David Dean, Ahilan Kanagasundaram, Houman Ghaemmaghami, Md. Hafizur Rahman, Sridha Sridharan (2015), The QUT-NOISE-SRE protocol for the evaluation of noisy speaker recognition, Interspeech
Hagai Aronowitz (2015), Score stabilization for speaker recognition trained on a small development set, Interspeech
Abhinav Misra, Shivesh Ranjan, Chunlei Zhang, John H. L. Hansen (2015), Anti-spoofing system: an investigation of measures to detect synthetic and human speech, Interspeech
Michael J. Carne (2015), A likelihood ratio-based forensic voice comparison in microphone vs. mobile mismatched conditions using Japanese /ai/, Interspeech
Mirjam Wester, Cassia Valentini-Botinhao, Gustav Eje Henter (2015), Are we using enough listeners? no! - an empirically-supported critique of Interspeech 2014 TTS evaluations, Interspeech
Jonathan Chevelu, Damien Lolive, Sébastien Le Maguer, David Guennec (2015), How to compare TTS systems: a new subjective evaluation methodology focused on differences, Interspeech
Lukas Latacz, Werner Verhelst (2015), Double-ended prediction of the naturalness ratings of the Blizzard Challenge 2008-2013, Interspeech
Takashi Nose, Yusuke Arao, Takao Kobayashi, Komei Sugiura, Yoshinori Shiga, Akinori Ito (2015), Entropy-based sentence selection for speech synthesis using phonetic and prosodic contexts, Interspeech
Tomoki Koriyama, Takao Kobayashi (2015), A comparison of speech synthesis systems based on GPR, HMM, and DNN with a small amount of training data, Interspeech
Raphael Ullmann, Ramya Rasipuram, Mathew Magimai-Doss, Hervé Bourlard (2015), Objective intelligibility assessment of text-to-speech systems through utterance verification, Interspeech
Dominique Fohr, Irina Illina (2015), Continuous word representation using neural networks for proper name retrieval from diachronic documents, Interspeech
X. Chen, T. Tan, Xunying Liu, Pierre Lanchantin, M. Wan, Mark J. F. Gales, Philip C. Woodland (2015), Recurrent neural network language model adaptation for multi-genre broadcast speech recognition, Interspeech
Wengong Jin, Tianxing He, Yanmin Qian, Kai Yu (2015), Paragraph vector based topic model for language model adaptation, Interspeech
Ching-Feng Yeh, Yuan-ming Liou, Hung-yi Lee, Lin-shan Lee (2015), Personalized speech recognizer with keyword-based personalized lexicon and language model using word vector representations, Interspeech
Sheng Li, Yuya Akita, Tatsuya Kawahara (2015), Discriminative data selection for lightly supervised training of acoustic model using closed caption texts, Interspeech
Amit Das, Mark Hasegawa-Johnson (2015), Cross-lingual transfer learning during supervised training in low resource scenarios, Interspeech
Ramón F. Astudillo, Shinji Watanabe, Ahmed Hussen Abdelaziz, Dorothea Kolossa (2015), Robust speech processing using observation uncertainty and uncertainty propagation: session and paper overview, Interspeech
Dayana Ribas, Emmanuel Vincent, José Ramón Calvo (2015), Uncertainty propagation for noise robust speaker recognition: the case of NIST-SRE, Interspeech
Yuuki Tachioka, Shinji Watanabe (2015), Uncertainty training and decoding methods of deep neural networks based on stochastic representation of enhanced features, Interspeech
Rahim Saeidi, Paavo Alku (2015), Accounting for uncertainty of i-vectors in speaker recognition using uncertainty propagation and modified imputation, Interspeech
Sri Harish Mallidi, Tetsuji Ogawa, Karel Veselý, Phani S. Nidadavolu, Hynek Hermansky (2015), Autoencoder based multi-stream combination for noise robust speech recognition, Interspeech
Christian Huemmer, Roland Maas, Andreas Schwarz, Ramón F. Astudillo, Walter Kellermann (2015), Uncertainty decoding for DNN-HMM hybrid systems based on numerical sampling, Interspeech
Ahmed Hussen Abdelaziz, Shinji Watanabe, John R. Hershey, Emmanuel Vincent, Dorothea Kolossa (2015), Uncertainty propagation through deep neural networks, Interspeech
Marco Kühne (2015), Handling derivative filterbank features in bounded-marginalization-based missing data automatic speech recognition, Interspeech
Arun Narayanan, Ananya Misra, Kean K. Chin (2015), Large-scale, sequence-discriminative, joint adaptive training for masking-based robust ASR, Interspeech
Ramón F. Astudillo, Joana Correia, Isabel Trancoso (2015), Integration of DNN based speech enhancement and ASR, Interspeech
C. Zhang, Philip C. Woodland (2015), A general artificial neural network extension for HTK, Interspeech
Tom Ko, Vijayaditya Peddinti, Daniel Povey, Sanjeev Khudanpur (2015), Audio augmentation for speech recognition, Interspeech
Xiaohui Zhang, Daniel Povey, Sanjeev Khudanpur (2015), A diversity-penalizing ensemble training method for deep learning, Interspeech
Gakuto Kurata, Daniel Willett (2015), Deep neural network training emphasizing central frames, Interspeech
Kai Chen, Zhi-Jie Yan, Qiang Huo (2015), Training deep bidirectional LSTM acoustic model for LVCSR by a context-sensitive-chunk BPTT approach, Interspeech
Pawel Swietojanski, Peter Bell, Steve Renals (2015), Structured output layer with auxiliary targets for context-dependent acoustic modelling, Interspeech
Peter Bell, Steve Renals (2015), Complementary tasks for context-dependent deep neural network acoustic models, Interspeech
Jie Li, Heng Zhang, Xinyuan Cai, Bo Xu (2015), Towards end-to-end speech recognition for Chinese Mandarin using long short-term memory recurrent neural networks, Interspeech
Mingming Chen, Zhanlei Yang, Jizhong Liang, Yanpeng Li, Wenju Liu (2015), Improving deep neural networks based multi-accent Mandarin speech recognition using i-vectors and accent-specific top layer, Interspeech
Zhen Huang, Jinyu Li, Sabato Marco Siniscalchi, I-Fan Chen, Ji Wu, Chin-Hui Lee (2015), Rapid adaptation for deep neural networks through multi-task learning, Interspeech
Sree Hari Krishnan Parthasarathi, Bjorn Hoffmeister, Spyros Matsoukas, Arindam Mandal, Nikko Strom, Sri Garimella (2015), fMLLR based feature-space speaker adaptation of DNN acoustic models, Interspeech
Xiangang Li, Xihong Wu (2015), I-vector dependent feature space transformations for adaptive speech recognition, Interspeech
Mortaza Doulaty, Oscar Saz, Thomas Hain (2015), Unsupervised domain discovery using latent Dirichlet allocation for acoustic modelling in speech recognition, Interspeech
Taichi Asami, Ryo Masumura, Hirokazu Masataki, Manabu Okamoto, Sumitaka Sakauchi (2015), Training data selection for acoustic modeling via submodular optimization of joint Kullback-Leibler divergence, Interspeech
Eunah Cho, Kevin Kilgour, Jan Niehues, Alex Waibel (2015), Combination of NN and CRF models for joint detection of punctuation and disfluencies, Interspeech
Tze Siong Lau, I-Fan Chen, Chin-Hui Lee (2015), Tunable keyword-aware language modeling and context dependent fillers for LVCSR-based spoken keyword search, Interspeech
Haipeng Wang, Anton Ragni, Mark J. F. Gales, Kate M. Knill, Philip C. Woodland, C. Zhang (2015), Joint decoding of tandem and hybrid systems for improved keyword spotting on low resource languages, Interspeech
Quoc Truong Do, Shinnosuke Takamichi, Sakriani Sakti, Graham Neubig, Tomoki Toda, Satoshi Nakamura (2015), Preserving word-level emphasis in speech-to-speech translation using linear regression HSMMs, Interspeech
Hoang Gia Ngo, Nancy F. Chen, Binh Minh Nguyen, Bin Ma, Haizhou Li (2015), Phonology-augmented statistical transliteration for low-resource languages, Interspeech
Kazuki Oouchi, Ryota Konno, Takahiro Akyu, Kazuma Konno, Kazunori Kojima, Kazuyo Tanaka, Shi-wook Lee, Yoshiaki Itoh (2015), Evaluation of re-ranking by prioritizing highly ranked documents in spoken term detection, Interspeech
Abhijeet Saxena, B. Yegnanarayana (2015), Distinctive feature based representation of speech for query-by-example spoken term detection, Interspeech
Shi-wook Lee, Kazuyo Tanaka, Yoshiaki Itoh (2015), Combination of diverse subword units in spoken term detection, Interspeech
Dhananjay Ram, Afsaneh Asaei, Pranay Dighe, Hervé Bourlard (2015), Sparse modeling of posterior exemplars for keyword detection, Interspeech
Tin Lay Nwe, Qianli Xu, Cuntai Guan, Bin Ma (2015), Stress level detection using double-layer subband filter, Interspeech
Jürgen Trouvain, Khiet P. Truong (2015), Prosodic characteristics of read speech before and after treadmill running, Interspeech
Khiet P. Truong, Arne Nieuwenhuys, Peter Beek, Vanessa Evers (2015), A database for analysis of speech under physical stress: detection of exercise intensity while running and talking, Interspeech
Will Paul, Cecilia Ovesdotter Alm, Reynold Bailey, Joe Geigel, Linwei Wang (2015), Stressed out: what speech tells us about stress, Interspeech
Andreas Tsiartas, Andreas Kathol, Elizabeth Shriberg, Massimiliano de Zambotti, Adrian Willoughby (2015), Prediction of heart rate changes from speech features during interaction with a misbehaving dialog system, Interspeech
Mary Pietrowicz, Mark Hasegawa-Johnson, Karrie Karahalios (2015), Acoustic correlates for perceived effort levels in expressive speech, Interspeech
Khalid Daoudi, Ashwini Jaya Kumar (2015), Pitch-based speech perturbation measures using a novel GCI detection algorithm: application to pathological voice classification, Interspeech
Dimitra Vergyri, Bruce Knoth, Elizabeth Shriberg, Vikramjit Mitra, Mitchell McLaren, Luciana Ferrer, Pablo Garcia, Charles Marmar (2015), Speech-based assessment of PTSD in a military population using diverse feature classes, Interspeech
Bea Yu, Thomas F. Quatieri, James R. Williamson, James C. Mundt (2015), Cognitive impairment prediction in the elderly based on vocal biomarkers, Interspeech
J.-A. Gómez-García, L. Moro-Velázquez, Juan Ignacio Godino-Llorente, G. Castellanos-Domínguez (2015), Automatic age detection in normal and pathological voice, Interspeech
Sylvie Mozziconacci (2002), Prosody and emotions, SpeechProsody
Anne Wichmann (2002), Attitudinal intonation and the inferential process, SpeechProsody
Branka Zei Pollermann (2002), A place for prosody in a unified model of cognition and emotion, SpeechProsody
Colin W. Wightman (2002), ToBI or not ToBI?, SpeechProsody
Hansjörg Mixdorff (2002), Speech technology, ToBI, and making sense of prosody, SpeechProsody
Aijun Li (2002), Chinese prosody and prosodic labeling of spontaneous speech, SpeechProsody
Carlos Gussenhoven (2002), Intonation and interpretation: phonetics and phonology, SpeechProsody
Thorstein Fretheim (2002), Intonation as a constraint on inferential processing, SpeechProsody
Julia Hirschberg (2002), The pragmatics of intonational meaning, SpeechProsody
Ulrike Toepel, Kai Alter (2002), Cerebral strategies in the segmentation and interpretation of speech, SpeechProsody
Marc D. Pell (2002), Surveying emotional prosody in the brain, SpeechProsody
Janet Dean Fodor (2002), Psycholinguistics cannot escape prosody, SpeechProsody
Yi Xu (2002), Articulatory constraints and tonal alignment, SpeechProsody
Mariapaola D’Imperio (2002), Language-specific and universal constraints on tonal alignment: the nature of targets and "anchors", SpeechProsody
Jan P. H. van Santen (2002), Quantitative modeling of pitch accent alignment, SpeechProsody
Franck Ramus (2002), Acoustic correlates of linguistic rhythm: perspectives, SpeechProsody
Fred Cummins (2002), Speech rhythm and rhythmic taxonomy, SpeechProsody
Esther Grabe (2002), Variation adds to prosodic typology, SpeechProsody
Karen M. Arnold, Peter W. Jusczyk (2002), Text-to-tune alignment in speech and song, SpeechProsody
Corine Astésano, Ellen Gurman Bard, Alice Turk (2002), Functions of the French initial accent: a preliminary study, SpeechProsody
Eva Liina Asu (2002), Downtrends in different types of question in Estonian, SpeechProsody
Michaela Atterer (2002), Assigning prosodic structure for speech synthesis: a rule-based approach, SpeechProsody
Véronique Aubergé (2002), A Gestalt morphology of prosody directed by functions: the example of a step by step model developed at ICP, SpeechProsody
Margit Aufterbeck (2002), Aspects of prehead and onset: the onset onglide phenomenon, SpeechProsody
Odile Bagou, Cécile Fougeron, Uli H. Frauenfelder (2002), Contribution of prosody to the segmentation and storage of "words" in the acquisition of a new mini-language, SpeechProsody
Plinio A. Barbosa (2002), Explaining cross-linguistic rhythmic variability via a coupled-oscillator model of rhythm production, SpeechProsody
Robert Batúsek (2002), A duration model for Czech text-to-speech synthesis, SpeechProsody
Roxane Bertrand, R. Espesser (2002), Voice diversity in conversation: a case study, SpeechProsody
Bruce Birch (2002), The IP as the domain of syllabification, SpeechProsody
Judith Bishop (2002), ‘stress accent’ without phonetic stress: accent type and distribution in Bininj Gun-wok, SpeechProsody
Antonis Botinis, Robert Bannert, Marios Fourakis, Stamatia Pagoni-Tetlow (2002), Crosslinguistic segmental durations and prosodic typology, SpeechProsody
Philippe Boula de Mareüil, Philippe Célérier, Jacques Toen (2002), Generation of emotions by a morphing technique in English, French and Spanish, SpeechProsody
Caroline Bouzon, Daniel Hirst (2002), The influence of prosodic factors on the duration of words in British English, SpeechProsody
Geneviève Caelen-Haumont (2002), Perlocutory values and functions of melisms in spontaneous dialogue, SpeechProsody
Estelle Campione, Jean Véronis (2002), A large-scale multilingual study of silent pause duration, SpeechProsody
Jianfen Cao, Weibin Zhu (2002), Syntactic and lexical constraint in prosodic segmentation and grouping, SpeechProsody
V. Cardenoso-Payo, D. Escudero-Mancebo (2002), Statistical modelling of stress groups in Spanish, SpeechProsody
Yiya Chen (2002), Accentual lengthening of monosyllabic-constituents in Beijing Mandarin, SpeechProsody
Aoju Chen, Carlos Gussenhoven, Toni Rietveld (2002), Language-specific uses of the effort code, SpeechProsody
Hyunsong Chung (2002), Duration models and the perceptual evaluation of spoken Korean, SpeechProsody
Kathrin Claßen (2002), Realisations of nuclear pitch accents in Swabian dialect and Parkinson’s dysarthria: a preliminary report, SpeechProsody
Michel Contini, Jean-Pierre Lai, Antonio Romano, Stefania Roullet, Lurdes de Castro Moutinho, Rosa Lídia Coimbra, Urbana Pereira Bendiha, Suzana Secca Ruivo (2002), Un projet d’atlas multimédia prosodique de l’espace roman, SpeechProsody
Francesco Cutugno, L. D’Anna, M. Petrillo, E. Zovato (2002), APA: towards an automatic tool for prosodic analysis, SpeechProsody
Audra Dainora (2002), Does intonational meaning come from tones or tunes? evidence against a compositional approach, SpeechProsody
Elisabeth Delais-Roussarie, A. Rialland, Jenny Doetjes, J. M. Marandin (2002), The prosody of post-focus sequences in French, SpeechProsody
Rodolfo Delmonte (2002), A prosodic module for self-learning activities, SpeechProsody
Jenny Doetjes, Elisabeth Delais-Roussarie, Petra Sleeman (2002), The prosody of left detached constituents in French, SpeechProsody
Marie Dohalská-Zichová (2002), Rythme - le "barrage" pour la perception des noms propres, SpeechProsody
P. Durand, A. Durand-Deska, Ryszard Gubrynowicz, B. Marek (2002), Polish: prosodic aspects of "czy" questions, SpeechProsody
T. Ehrette, N. Chateau, Christophe d’Alessandro, V. Maffiolo (2002), Prosodic parameters of perceived emotions in vocal server voices, SpeechProsody
Gorka Elordieta, Magdalena Romera (2002), Prosody and meaning in interaction: the case of the Spanish discourse functional unit entonces 'then', SpeechProsody
Caglayan Erdem, Hans Georg Zimmermann (2002), Soft input feature selection within neural prosody generation, SpeechProsody
Caglayan Erdem, Hans Georg Zimmermann (2002), Duration control by asymmetric causal retro-causal neural networks, SpeechProsody
Anders Eriksson, Esther Grabe, Hartmut Traunmüller (2002), Perception of syllable prominence by listeners with and without competence in the tested language, SpeechProsody
Zsuzsanna Fagyal (2002), Tonal template for background information: the scaling of pitch in utterance-medial parentheticals in French, SpeechProsody
Gunnar Fant, Anita Kruckenberg, Kjell Gustafson, Johan Liljencrants (2002), A new approach to intonation analysis and synthesis of Swedish, SpeechProsody
Martine Faraco, Tsuyoshi Kida, Marie-Laure Barbier, Annie Piolat (2002), Didactic prosody and notetaking in L1 and L2, SpeechProsody
Raul Fernandez, Rosalind W. Picard (2002), Dialog act classification from prosodic features using support vector machines, SpeechProsody
Janet Fletcher, Nicholas Evans, Erich Round (2002), Left-edge tonal events in Kayardild (Australian) - a typological perspective, SpeechProsody
Janet Fletcher, Roger Wales, Lesley Stirling, Ilana Mushin (2002), A dialogue act analysis of rises in Australian English map task dialogues, SpeechProsody
Katarzyna Francusik, Maciej Karpinski, Janusz Klesta (2002), A preliminary study of the intonational phrase, nuclear melody and pauses in Polish semi-spontaneous narration, SpeechProsody
Thorstein Fretheim, Wim A. van Dommelen (2002), Norwegian intonation and the resolution of concessive anaphora, SpeechProsody
Claudia K. Friedrich, Sonja A. Kotz, Angela D. Friederici, Kai Alter (2002), Pitch contour guides spoken word recognition, SpeechProsody
Sónia Frota (2002), The prosody of focus: a case-study with cross-linguistic implications, SpeechProsody
Sónia Frota, Marina Vigário, Fernando Martins (2002), Language discrimination and rhythm classes: evidence from Portuguese, SpeechProsody
Antonio Galves, Jesus Garcia, Denise Duarte, Charlotte Galves (2002), Sonority as a basis for rhythmic class discrimination, SpeechProsody
Jesus Garcia, Ulrike Gut, Antonio Galves (2002), Vocale - a semi-automatic annotation tool for prosodic research, SpeechProsody
Salem Ghazali, Rym Hamdi, Melissa Barkat (2002), Speech rhythm variation in Arabic dialects, SpeechProsody
Dafydd Gibbon (2002), Prosodic information in an integrated lexicon, SpeechProsody
Barbara Gili Fivela (2002), Tonal alignment in two Pisa Italian peak accents, SpeechProsody
Esther Grabe, Brechtje Post (2002), Intonational variation in the British Isles, SpeechProsody
Björn Granström, David House, Marc Swerts (2002), Multimodal feedback cues in human-machine interactions, SpeechProsody
Steven Greenberg, Hannah Carvey, Leah Hitchcock (2002), The relation between stress accent and pronunciation variation in spontaneous American English discourse, SpeechProsody
Nina Grønnum, Hans Basbøll (2002), Stød and length: acoustic and cognitive reality?, SpeechProsody
Ryszard Gubrynowicz (2002), A study of speech prosody of subjects with profound hearing loss recorded at child age and 20 years later, SpeechProsody
Sofia Gustafson-Capková, Beáta Megyesi (2002), Silence and discourse context in read speech and dialogues in Swedish, SpeechProsody
Ulrike Gut, Jan-Torsten Milde (2002), The prosody of Nigerian English, SpeechProsody
Petra Hansson (2002), Articulation rate variation in south Swedish phrases, SpeechProsody
Nancy Hedberg, Juan M. Sosa (2002), The prosody of questions in natural discourse, SpeechProsody
Sophie Herment-Dujardin, Daniel Hirst (2002), Emphasis in English: a perceptual study based on modified synthetic speech, SpeechProsody
Nadine Herry, Daniel Hirst (2002), Subjective and objective evaluation of the prosody of English spoken by French speakers: the contribution of computer assisted learning, SpeechProsody
Anthony Hind (2002), Metrical patterns and melodicity in English contrasted with French, SpeechProsody
Keikichi Hirose, Nobuaki Minematsu, Masaya Eto (2002), Data-driven synthesis of fundamental frequency contours for TTS systems based on a generation process model, SpeechProsody
Keikichi Hirose, Nobuaki Minematsu, Makoto Terao (2002), N-gram language modeling of Japanese using prosodic boundaries, SpeechProsody
B. Holm, Gérard Bailly (2002), Learning the hidden structure of intonation: implementing various functions of prosody, SpeechProsody
Fang Hu (2002), A prosodic analysis of wh-words in standard Chinese, SpeechProsody
Carlos Toshinori Ishi, Keikichi Hirose, Nobuaki Minematsu (2002), Using perceptually-related f0- and power-based parameters to identify accent types of accentual phrases, SpeechProsody
Kiwako Ito (2002), Ambiguity in broad focus and narrow focus interpretation in Japanese, SpeechProsody
Mika Ito (2002), Japanese politeness and suprasegmentals - a study based on natural speech materials, SpeechProsody
Soyoung Kang, Shari Speer (2002), Prosody and clause boundaries in Korean, SpeechProsody
Roland Kehrein (2002), The prosody of authentic emotions, SpeechProsody
Tsuyoshi Kida (2002), Prosody - a laughing matter? a crosscultural comparison of a humour phenomenon (rakugo) in France, Tokyo, and Osaka, SpeechProsody
Shinya Kiriyama, Keikichi Hirose, Nobuaki Minematsu (2002), Control of prosodic focuses for reply speech generation in a spoken dialogue system of information retrieval on academic documents, SpeechProsody
Shigeyoshi Kitazawa, Tatsuya Kitamura, Kazuya Mochizuki, Toshihiko Itoh (2002), Periodicity of Japanese accent in continuous speech, SpeechProsody
Rachael-Anne Knight (2002), The effect of pitch span on intonational plateaux, SpeechProsody
Emiel Krahmer, Zsófia Ruttkay, Marc Swerts, Wieger Wesselink (2002), Pitch, eyebrows and the perception of focus, SpeechProsody
Anne Lacheret-Dujour (2002), The intonational marking of topical salience in spontaneous speech: evidence from spoken French, SpeechProsody
Wai-Sum Lee, Fangxin Chen, K. K. Luke, Liqin Shen (2002), The prosody of bisyllabic and polysyllabic words in Hong Kong Cantonese, SpeechProsody
David Le Gac (2002), Tonal alternations and prosodic structure in Somali, SpeechProsody
Lisa Lim, Umberto Ansaldo (2002), Prosodic erosion as a diagnostic of grammaticalisation in isolating languages: tone and stress in Sinitic, SpeechProsody
Naïma Louali, Amina Mettouchi (2002), Structures intonatives en berbère: l'énoncé prédicatif, SpeechProsody
Vicky C. H. Man (2002), Focus effects on Cantonese tones: an acoustic study, SpeechProsody
J. M. Marandin, Claire Beyssade, Elisabeth Delais-Roussarie, A. Rialland (2002), Discourse marking in French: C accents and discourse moves, SpeechProsody
Giovanna Marotta (2002), L'intonation des énoncés interrogatifs ouverts dans l'Italien toscan, SpeechProsody
Karine Martel (2002), Mutual knowledge and prosody in young children, SpeechProsody
Philippe Martin (2002), Regional variations of sentence intonation in French - the continuation contour in Parisian French, SpeechProsody
Jörg Mayer, Dirk Wildgruber, Axel Riecker, Grzegorz Dogil, Hermann Ackermann, Wolfgang Grodd (2002), Prosody production and perception: converging evidence from fMRI studies, SpeechProsody
Jana Mejvaldová, Petr Horák (2002), Synonymie et homonymie attitudinale en tchèque et en français, SpeechProsody
C. Menezes, Donna Erickson, Osamu Fujimura (2002), Contrastive emphasis: comparison of pitch accents with syllable magnitudes, SpeechProsody
Piet Mertens (2002), Synthesizing elaborate intonation contours in text-to-speech for French, SpeechProsody
Jan-Torsten Milde, Ulrike Gut (2002), A prosodic corpus of non-native speech, SpeechProsody
Nobuaki Minematsu, Mariko Sekiguchi, Keikichi Hirose (2002), Performance improvement in estimating subjective agedness with prosodic features, SpeechProsody
Hansjörg Mixdorff, Noam Ami (2002), The prosody of modern Hebrew - a quantitative study, SpeechProsody
Hansjörg Mixdorff, Martti Vainio, Stefan Werner, Juhani Järvikivi (2002), The manifestation of linguistic information in prosodic features of Finnish, SpeechProsody
Tadao Miyamoto, Crystal Johnson (2002), Accentual phrasing in Japanese: the significance of underlying accents, SpeechProsody
Bernd Möbius, Grzegorz Dogil (2002), Phonemic and postural effects on the production of prosody, SpeechProsody
Eva Navas, Inmaculada Hernáez, Nerea Ezeiza (2002), Assigning phrase breaks using CARTs for Basque TTS, SpeechProsody
Irina Nesterenko (2002), French listeners counting syllables in read French and Russian: implications for the cognitive reality of syllable, SpeechProsody
Marie Nilsenová (2002), A game-theoretical approach to the meaning of intonation in rising declaratives and negative polar questions, SpeechProsody
Miguel Oliveira (2002), Pausing strategies as means of information processing in spontaneous narratives, SpeechProsody
Hanny den Ouden, Leo Noordman, Jacques Terken (2002), The prosodic realization of organizational features of texts, SpeechProsody
Pierre-Yves Oudeyer (2002), Novel useful features and algorithms for the recognition of emotions in human speech, SpeechProsody
Pierre-Yves Oudeyer (2002), The synthesis of cartoon emotional speech, SpeechProsody
Ho-Hsien Pan (2002), The location of f0 offset targets for Taiwanese long tones, SpeechProsody
Marta Payà Canals (2002), Incidental clauses in spoken Catalan: prosodic characteristics and pragmatic function, SpeechProsody
François Pellegrino, Jean-Hugues Chauchat, Ricco Rakotomalala, Jérôme Farinas (2002), Can automatically extracted rhythmic units discriminate among languages?, SpeechProsody
Jörg Peters (2002), Tonal effects on rhythm in West Middle German, SpeechProsody
Lourdes Pietrosemoli, Elsa Mora (2002), Dysprosody in three patients with vascular cerebral damage, SpeechProsody
Olivier Piot, Mehdi Lyaghat (2002), Expression et reconnaissance de onze attitudes assertives et interrogatives en persan standard, SpeechProsody
Cristel Portes, Emmanuelle Rami, C. Auran, Albert Di Cristo (2002), Prosody and discourse: a multi-linear analysis, SpeechProsody
Brechtje Post (2002), French tonal structures, SpeechProsody
Pilar Prieto (2002), Coarticulation and stability effects in tonal clash contexts in Catalan, SpeechProsody
Yao Qian, Wuyun Pan (2002), Prosodic word: the lowest constituent in the Mandarin prosody processing, SpeechProsody
A. Rialland, Jenny Doetjes, G. Rebuschi (2002), What is focused in c'est XP qui/que cleft sentences in French?, SpeechProsody
Axel Riecker, Dirk Wildgruber, Grzegorz Dogil, Jörg Mayer, Hermann Ackermann, Wolfgang Grodd (2002), Hemispheric lateralization effects of rhythm implementation during syllable repetitions: a fMRI study, SpeechProsody
Toni Rietveld, Joop Kerkhoff (2002), The temporal alignment of L*H accents, SpeechProsody
Albert Rilliard, Véronique Aubergé (2002), Towards a linguistic validation of a prosodic generation model, SpeechProsody
Guillaume Rolland, Hélène Loevenbruck (2002), Characteristics of the accentual phrase in French: an acoustic, articulatory and perceptual study, SpeechProsody
Pierluigi Salvo Rossi, Francesco Palmieri, Francesco Cutugno (2002), A method for automatic extraction of Fujisaki-model parameters, SpeechProsody
Kerstin Sander, Patricia Roth, Henning Scheich (2002), The identification of sad prosodies differentiates between high and low repressive women, SpeechProsody
Uli Sauerland, Oliver Bott (2002), Prosody and scope in German inverse linking constructions, SpeechProsody
Felix Schaeffler, Pär Wretling, Eva Strangert (2002), On the development of a quantity typology for Swedish dialects, SpeechProsody
Annett Schirmer, Sonja A. Kotz (2002), Sex differentiates the STROOP-effect in emotional speech: ERP evidence, SpeechProsody
D. Schön, C. Magne, M. Schrooten, M. Besson (2002), The music of speech: electrophysiological approach, SpeechProsody
Antje Schweitzer, Norbert Braunschweiler, Edmilson Morais (2002), Prosody generation in the Smartkom project, SpeechProsody
Elisabeth Selkirk (2002), Contrastive FOCUS vs. presentational focus: prosodic evidence from right node raising in English, SpeechProsody
Anne Catherine Simon, Anne Grobet (2002), Intégration ou autonomisation prosodique des connecteurs, SpeechProsody
Natalia Smirnova (2002), On the phonological status of the HL*H vs. H*LH timing-related tonal opposition in Dutch, SpeechProsody
Mariko Sugahara (2002), Conditions on post-FOCUS dephrasing in Tokyo Japanese, SpeechProsody
Ela Thurgood (2002), The recognition of geminates in ambiguous contexts in Polish, SpeechProsody
Richard Todd (2002), Speaker-ethnicity: attributions based on the use of prosodic cues, SpeechProsody
Chiu-yu Tseng (2002), The prosodic status of breaks in running speech: examination and evaluation, SpeechProsody
Christiane Ulbrich (2002), A comparative study of intonation in three standard varieties of German, SpeechProsody
Jennifer J. Venditti, Matthew Stone, Preetham Nanda, Paul Tepper (2002), Discourse constraints on the interpretation of nuclear-accented pronouns, SpeechProsody
François Viallet, Bernard Teston, Ludovic Jankowski, Alain Purson, Jean Claude Peragut, Jean Regis, Tatiana Witjas (2002), Effects of pharmacological versus electrophysiological treatments on Parkinsonian dysprosody, SpeechProsody
Francisco Vizcaino Ortega (2002), A preliminary analysis of yes/no questions in Glasgow English, SpeechProsody
Petra Wagner, Eva Fischenbeck (2002), Stress perception and production in German stress clash environments, SpeechProsody
Michiko Watanabe (2002), Fillers as indicators of discourse segment boundaries in Japanese monologues, SpeechProsody
Pauline Welby (2002), The realization of early and late rises in French intonation: a production study, SpeechProsody
Beate Wendt, Henning Scheich (2002), The "Magdeburger Prosodie-Korpus", SpeechProsody
Pär Wretling, Eva Strangert, Felix Schaeffler (2002), Quantity and preaspiration in Northern Swedish dialects, SpeechProsody
Yufang Yang, Bei Wang (2002), Acoustic correlates of hierarchical prosodic boundary in Mandarin, SpeechProsody
Jiahong Yuan, Chilin Shih, Greg P. Kochanski (2002), Comparison of declarative and interrogative intonation in Chinese, SpeechProsody
Ivan Yuen (2002), Tonal invariance and downtrend in Cantonese, SpeechProsody
A. Zaki, A. Rajouani, Z. Luxey, M. Najim (2002), Rules based model for automatic synthesis of f0 variation for declarative Arabic sentences, SpeechProsody
Eric Zee (2002), The effect of speech rate on the temporal organization of syllable production in Cantonese, SpeechProsody
Brigitte Zellner Keller (2002), Revisiting the status of speech rhythm, SpeechProsody
Elisabeth Zetterholm (2002), Intonation pattern and duration differences in imitated speech, SpeechProsody
Natalia N. Zharkova (2002), Acquisition of prosody in Russian, SpeechProsody
Yiqing Zu, Hong Zheng (2002), Effect of prosodic structure on segmental variants, SpeechProsody
Phil Rose (2004), Technical forensic speaker identification from a Bayesian linguist's perspective, Odyssey
Didier Meuwly (2004), Forensic speaker recognition: an evidence odyssey, Odyssey
Mark Przybocki, Alvin F. Martin (2004), NIST speaker recognition evaluation chronicles, Odyssey
Daniel Moraru, Sylvain Meignier, Corinne Fredouille, Laurent Besacier, Jean-François Bonastre (2004), ELISA NIST RT03 broadcast news speaker diarization experiments, Odyssey
Joseph P. Campbell, Hirotaka Nakasone, Christopher Cieri, David Miller, Kevin Walker, Alvin F. Martin, Mark A. Przybocki (2004), The MMSR bilingual and crosschannel corpora for speaker recognition research and evaluation, Odyssey
Niko Brümmer (2004), Application-independent evaluation of speaker detection, Odyssey
William M. Campbell, Douglas A. Reynolds, Joseph P. Campbell (2004), Fusing discriminative and generative methods for speaker recognition: experiments on Switchboard and NFI/TNO field data, Odyssey
Ying Liu, Martin Russell, Michael Carey (2004), Speaker recognition using a trajectory-based segmental HMM, Odyssey
Sachin Kajarekar, Luciana Ferrer, Kemal Sönmez, Jing Zheng, Elizabeth Shriberg, Andreas Stolcke (2004), Modeling NERFs for speaker recognition, Odyssey
Alex Solomonoff, Carl Quillen, William M. Campbell (2004), Channel compensation for SVM speaker recognition, Odyssey
Filippo Botti, Anil Alexander, Andrzej Drygajlo (2004), An interpretation framework for the evaluation of evidence in forensic automatic speaker recognition with limited suspect data, Odyssey
Anil Alexander, Filippo Botti, Andrzej Drygajlo (2004), Handling mismatch in corpus-based forensic speaker recognition, Odyssey
David A. van Leeuwen, Jos S. Bouten (2004), Results of the 2003 NFI-TNO forensic speaker recognition evaluation, Odyssey
Joaquin Gonzalez-Rodriguez, Daniel Ramos-Castro, Marta Garcia-Gomar, Javier Ortega-Garcia (2004), On robust estimation of likelihood ratios: the ATVS-UPM system at 2003 NFI/TNO forensic evaluation, Odyssey
Brendan Baker, Robbie Vogt, Michael Mason, Sridha Sridharan (2004), Improved phonetic and lexical speaker recognition through MAP adaptation, Odyssey
David Klusácek (2004), Optimal detection in case of the sparse training data, Odyssey
D. Garcia-Romero, J. Fierrez-Aguilar, Joaquin Gonzalez-Rodriguez, Javier Ortega-Garcia (2004), On the use of quality measures for text-independent speaker recognition, Odyssey
Asmaa El Hannani, Dijana Petrovska-Delacrétaz, Gérard Chollet (2004), Linear and non-linear fusion of ALISP-based and GMM systems for text-independent speaker verification, Odyssey
Claude Barras, Sylvain Meignier, Jean-Luc Gauvain (2004), Unsupervised online adaptation for speaker verification over the telephone, Odyssey
Jason Pelecanos, Upendra Chaudhari, Ganesh Ramaswamy (2004), Compensation of utterance length for speaker verification, Odyssey
Mathieu Ben, Frédéric Bimbot, Guillaume Gravier (2004), Enhancing the robustness of Bayesian methods for text-independent automatic speaker verification, Odyssey
Robbie Vogt, Sridha Sridharan (2004), Bayes factor scoring of GMMs for speaker verification, Odyssey
William M. Campbell, Elliot Singer, Pedro A. Torres-Carrasquillo, Douglas A. Reynolds (2004), Language recognition with support vector machines, Odyssey
Terrence Martin, Eddie Wong, Brendan Baker, Michael Mason, Sridha Sridharan (2004), Pitch and energy trajectory modelling in a syllable length temporal framework for language identification, Odyssey
Pedro A. Torres-Carrasquillo, Terry P. Gleason, Douglas A. Reynolds (2004), Dialect identification using Gaussian mixture models, Odyssey
Elliot Singer, Douglas A. Reynolds (2004), Analysis of multitarget detection for speaker and language recognition, Odyssey
M. Chetouani, M. Faundez-Zanuy, B. Gas, J. L. Zarader (2004), A new nonlinear speaker parameterization algorithm for speaker identification, Odyssey
Raymond E. Slyh, Eric G. Hansen, Timothy R. Anderson (2004), Glottal modeling and closed-phase analysis for speaker recognition, Odyssey
Leena Mary, K. Sri Rama Murty, S.R. Mahadeva Prasanna, Bayya Yegnanarayana (2004), Features for speaker and language identification, Odyssey
Yaniv Zigel, Arnon Cohen (2004), Text-dependent speaker verification using feature selection with recognition related criterion, Odyssey
S. E. Tranter, Douglas A. Reynolds (2004), Speaker diarisation for broadcast news, Odyssey
Jia-Hsin Hsieh, Chung-Hsien Wu (2004), Unsupervised speaker segmentation of broadcast news using MDL-based Gaussian model, Odyssey
Marie Roch, Yanliang Cheng (2004), Speaker segmentation using the MAP-adapted Bayesian information criterion, Odyssey
Daniel Moraru, Laurent Besacier, Eric Castelli (2004), Using a priori information for speaker diarization, Odyssey
Tomoko Matsui, Kunio Tanabe (2004), Speaker identification with dual penalized logistic regression machine, Odyssey
J. Fortuna, P. Sivakumaran, A. M. Ariyaeeinia, A. Malegaonkar (2004), Relative effectiveness of score normalisation methods in open-set speaker identification, Odyssey
Ran Gazit, Yaakov Metzger (2004), Voice mining with multiple target speakers, Odyssey
Javier R. Saeta, Javier Hernando, Oscar Manso, Manel Medina (2004), Applying speaker verification to certificate revocation, Odyssey
Suhadi, Stephan Grashey, Sorel Stan, Tim Fingscheidt (2004), Evaluation of a small-footprint text and language independent speaker recognition system on forensic data, Odyssey
Ran D. Zilca, Jason W. Pelecanos, Upendra V. Chaudhari, Ganesh N. Ramaswamy (2004), Real time robust speech detection for text independent speaker recognition, Odyssey
Kofi Boakye, Barbara Peskin (2004), Text-constrained speaker recognition on a text-independent task, Odyssey
Daniel Boies, Matthieu Hébert, Larry P. Heck (2004), Study on the effect of lexical mismatch in text-dependent speaker verification, Odyssey
Enrique Argones-Rúa, Elisardo González-Agulla, Carmen García-Mateo, Óscar William Márquez-Flórez (2004), User verification in a BioVXML framework, Odyssey
Raphael Blouet, Chafic Mokbel, Hoda Mokbel, Eduardo Sánchez Soto, Gérard Chollet, Hanna Greige (2004), BECARS: a free software for speaker verification, Odyssey
Yaakov Metzger, Ran Gazit (2004), Text-prompted without text: a language-independent voice-prompted speaker recognition system, Odyssey
Hermann J. Künzel, Joaquín Gonzalez-Rodriguez, Javier Ortega-García (2004), Effect of voice disguise on the performance of a forensic automatic speaker recognition system, Odyssey
Eric G. Hansen, Raymond E. Slyh, Timothy R. Anderson (2004), Speaker recognition using phoneme-specific GMMs, Odyssey
Gernot A. Fink, Thomas Plötz (2004), Integrating speaker identification and learning with adaptive speech recognition, Odyssey
Dimitrios Rentzos, Saeed Vaseghi, Qin Yan (2004), Voice profile: a structured probability model with application to voice morphing, Odyssey
Norman Poh, Samy Bengio (2004), Noise-robust multi-stream fusion for text-independent speaker authentication, Odyssey
Fabio Valente, Christian Wellekens (2004), Variational Bayesian speaker clustering, Odyssey
Javier R. Saeta, Javier Hernando (2004), On the use of score pruning in speaker verification for speaker dependent threshold estimation, Odyssey
Patrick Kenny, Pierre Dumouchel (2004), Experiments in speaker verification using factor analysis likelihood ratios, Odyssey
Delphine Charlet (2004), Neighborhood-adapted GMM for speaker recognition, Odyssey
Yongxin Zhang, Michael S. Scordilis (2004), Optimization of GMM training for speaker verification, Odyssey
Samy Bengio, Johnny Mariéthoz (2004), A statistical significance test for person authentication, Odyssey
Daryl Ning, Vinod Chandran (2004), The effectiveness of higher order spectral phase features in speaker identification, Odyssey
Hirotaka Nakasone, Maria Mimikopoulos, Steven D. Beck, Somit Mathur (2004), Pitch synchronized speech processing (PSSP) for speaker recognition, Odyssey
Mihalis Siafarikas, Todor Ganchev, Nikos Fakotakis (2004), Wavelet packet based speaker verification, Odyssey
Steven D. Beck, Reva Schwartz, Hirotaka Nakasone (2004), A bilingual multi-modal voice corpus for language and speaker recognition (LASR) services, Odyssey
Carlos Lino Rengifo, Diego Andrés Alvarez, Ricardo Henao, Germán Castellanos, Jorge Eduardo Hurtado (2004), Active learning on the classification of voice pathologies, Odyssey
Hyoung-Gook Kim, Martin Haller, Thomas Sikora (2004), Comparison of MPEG-7 basis projection features and MFCC applied to robust speaker recognition, Odyssey
Samy Bengio, Johnny Mariéthoz (2004), The expected performance curve: a new assessment measure for person authentication, Odyssey
Joseph P. Campbell (2014), Speaker Recognition for Forensic Applications, Odyssey
Alvin F. Martin, Craig S. Greenberg, John M. Howard, George R. Doddington, John J. Godfrey, Vincent M. Stanford (2014), Effects of the New Testing Paradigm of the 2012 NIST Speaker Recognition Evaluation, Odyssey
David van der Vloed, Jos Bouten, David van Leeuwen (2014), NFI-FRITS: A forensic speaker recognition database and some first experiments, Odyssey
David van Leeuwen, Niko Brümmer, Albert Swart (2014), A comparison of linear and non-linear calibrations for speaker recognition, Odyssey
Yun Lei, Luciana Ferrer, Aaron Lawson, Mitchell McLaren, Nicolas Scheffer (2014), Trial-based Calibration for Speaker Recognition in Unseen Conditions, Odyssey
Johan Rohdin, Sangeeta Biswas, Koichi Shinoda (2014), Discriminative PLDA training with application-specific loss functions for speaker verification, Odyssey
Joaquin Gonzalez-Rodriguez, Juana Gil, Rubén Pérez, Javier Franco-Pedroso (2014), What are we missing with i-vectors? A perceptual analysis of i-vector-based falsely accepted trials, Odyssey
Pierre-Michel Bousquet, Jean-François Bonastre, Driss Matrouf (2014), Exploring some limits of Gaussian PLDA modeling for i-vector distributions, Odyssey
Najim Dehak, Oldrich Plchot, Mohamad Hasan Bahari, Lukas Burget, Hugo Van Hamme, Reda Dehak (2014), GMM Weights Adaptation Based on Subspace Approaches for Speaker Verification, Odyssey
Andreas Nautsch, Christian Rathgeb, Christoph Busch, Herbert Reininger, Klaus Kasper (2014), Towards Duration Invariance of i-Vector-based Adaptive Score Normalization, Odyssey
Zhi-Yi Li, Wei-Qiang Zhang, Wei-Wei Liu, Yao Tian, Jia Liu (2014), Text-Independent Speaker Verification via State Alignment, Odyssey
Kong Aik Lee, Bin Ma, Haizhou Li, Liping Chen, Wu Guo, Lirong Dai (2014), Local Variability Modeling for Text-Independent Speaker Verification, Odyssey
Yusuf Ziya Isik, Hakan Erdogan, Ruhi Sarikaya (2014), A Latent Dirichlet Allocation Based Front-End for Speaker Verification, Odyssey
Ville Hautamäki, Rosa Gonzalez Hautamäki, Tomi Kinnunen, Anne-Maria Laukkanen (2014), Comparison of human listeners and speaker verification systems using voice mimicry data, Odyssey
Patrick Kenny, Themos Stafylakis, Pierre Ouellet, Md Jahangir Alam, Pierre Dumouchel (2014), Supervised/Unsupervised Voice Activity Detectors for Text-dependent Speaker Recognition on the RSR2015 Corpus, Odyssey
Johan Rohdin, Sangeeta Biswas, Koichi Shinoda (2014), i-Vector Selection for Effective PLDA Modeling in Speaker Recognition, Odyssey
Brecht Desplanques, Kris Demuynck, Jean-Pierre Martens (2014), Combining Joint Factor Analysis and iVectors for Robust Language Recognition, Odyssey
Alexandros Lazaridis, Elie Khoury, Jean-Philippe Goldman, Mathieu Avanzi, Sébastien Marcel, Philip N. Garner (2014), Swiss French Regional Accent Identification, Odyssey
Laura Fernandez Gallardo, Michael Wagner, Sebastian Möller (2014), Spectral Sub-band Analysis of Speaker Verification Employing Narrowband and Wideband Speech, Odyssey
Gang Liu, John H.L. Hansen (2014), Supra-Segmental Feature Based Speaker Trait Detection, Odyssey
Karthika Vijayan, Vinay Kumar, K. Sri Rama Murty (2014), Allpass modelling of Fourier phase for speaker verification, Odyssey
Jinghua Zhong, Weiwu Jiang, Helen Meng, Na Li, Zhifeng Li (2014), An Integration of Random Subspace Sampling and Fishervoice for Speaker Verification, Odyssey
Gang Liu, John Hansen, Chengzhu Yu, Abhinav Misra, Navid Shokouhi (2014), Investigating State-of-the-Art Speaker Verification in the case of Unlabeled Development Data, Odyssey
Alvin F. Martin, Craig S. Greenberg, John M. Howard, George R. Doddington, John J. Godfrey (2014), NIST Language Recognition Evaluation - Past and Future, Odyssey
Gang Liu, Qian Zhang, John Hansen (2014), Robust Language Recognition Based on Diverse Features, Odyssey
Nobuaki Minematsu, Shun Kasahara, Takehiko Makino, Daisuke Saito, Keikichi Hirose (2014), Speaker-basis Accent Clustering Using Invariant Structure Analysis and the Speech Accent Archive, Odyssey
Alan McCree (2014), Multiclass Discriminative Training of i-vector Language Recognition, Odyssey
Jean-François Bonastre, Itshak Lapidot, Samy Bengio (2014), Telephone Conversation Speaker Diarization Using Mealy-HMMs, Odyssey
Hervé Bredin, Antoine Laurent, Achintya Sarkar, Viet-Bac Le, Sophie Rosset, Claude Barras (2014), Person Instance Graphs for Named Speaker Identification in TV Broadcast, Odyssey
Grégor Dupuy, Sylvain Meignier, Paul Deléglise, Yannick Estève (2014), Recent Improvements on ILP-based Clustering for Broadcast News Speaker Diarization, Odyssey
Pranay Dighe, Marc Ferras, Herve Bourlard (2014), Modeling Overlapping Speech using Vector Taylor Series, Odyssey
Martin Cooke (2014), Speaking in adverse conditions: from behavioural observations to intelligibility-enhancing speech modifications, Odyssey
Patrick Kenny, Themos Stafylakis, Jahangir Alam, Pierre Ouellet, Marcel Kockmann (2014), Joint Factor Analysis for Text-Dependent Speaker Verification, Odyssey
Giovanni Soldi, Simon Bozonnet, Federico Alegre, Christophe Beaugeant, Nicholas Evans (2014), Short-Duration Speaker Modelling with Phone Adaptive Training, Odyssey
Changhuai You, Kong Aik Lee, Bin Ma, Haizhou Li (2014), Text-Dependent Speaker Verification System in VHF Communication Channel, Odyssey
Alan McCree, Douglas Reynolds, Daniel Garcia-Romero, Tomi Kinnunen, Craig Greenberg, Désiré Bansé, George Doddington, John Godfrey, Alvin Martin, Mark Przybocki (2014), The NIST 2014 Speaker Recognition i-vector Machine Learning Challenge, Odyssey
Sergey Novoselov, Timur Pekhovsky, Konstantin Simonchik (2014), STC Speaker Recognition System for the NIST i-Vector Challenge, Odyssey
Bostjan Vesnicer, Jerneja Zganec-Gros, Simon Dobrisek, Vitomir Struc (2014), Incorporating Duration Information into I-Vector-Based Speaker Recognition Systems, Odyssey
Abbas Khosravani, Mohammad Mahdi Homayounpour (2014), Linearly Constrained Minimum Variance for Robust I-vector Based Speaker Recognition, Odyssey
Marc Ferras, Elie Khoury, Sébastien Marcel, Laurent El Shafey (2014), Hierarchical speaker clustering methods for the NIST i-vector Challenge, Odyssey
Samy Bengio (2014), Large Scale Learning of a Joint Embedding Space, Odyssey
Niko Brümmer, Alan McCree, Stephen Shum, Daniel Garcia-Romero, Carlos Vaquero (2014), Unsupervised Domain Adaptation for I-Vector Speaker Recognition, Odyssey
Alan McCree, Stephen Shum, Douglas Reynolds, Daniel Garcia-Romero (2014), Unsupervised Clustering Approaches for Domain Adaptation in Speaker Recognition Systems, Odyssey
Sandro Cumani, Pietro Laface (2014), Generative pairwise models for speaker recognition, Odyssey
Hagai Aronowitz (2014), Compensating Inter-Dataset Variability in PLDA Hyper-Parameters for Robust Speaker Recognition, Odyssey
Yun Lei, Luciana Ferrer, Aaron Lawson, Mitchell McLaren, Nicolas Scheffer (2014), Application of Convolutional Neural Networks to Language Identification in Noisy Conditions, Odyssey
Patrick Kenny, Themos Stafylakis, Pierre Ouellet, Vishwa Gupta, Jahangir Alam (2014), Deep Neural Networks for extracting Baum-Welch statistics for Speaker Recognition, Odyssey
Pavel Matejka, Le Zhang, Tim Ng, Ondrej Glembek, Jeff Ma, Bing Zhang, Sri Harish Mallidi (2014), Neural Network Bottleneck Features for Language Identification, Odyssey
Omid Ghahabi, Javier Hernando (2014), i-Vector Modeling with Deep Belief Networks for Multi-Session Speaker Recognition, Odyssey
Christophe d'Alessandro (2007), Phase-based methods for voice source analysis, NOLISP
Danilo Mandic (2007), Exploiting nonlinearity in signal processing: qualitative assessment of adaptive filtering algorithms and signal modality characterisation, NOLISP
Xavi Gonzalvo, Ignasi Iriondo, Joan Claudi Socoró, Francesc Alías, Carlos Monzo (2007), HMM-based Spanish speech synthesis using CBR as F0 estimator, NOLISP
Mehmet Atas, Süleyman Baykut, Tayfun Akgül (2007), A wavelet-based technique towards a more natural sounding synthesized speech, NOLISP
Ignasi Iriondo, Santiago Planet, Joan-Claudi Socoró, Francesc Alías (2007), Objective and subjective evaluation of an expressive speech corpus, NOLISP
Marcos Faundez-Zanuy (2007), On the usefulness of linear and nonlinear prediction residual signals for speaker recognition, NOLISP
Christophe Charbuillet, Bruno Gas, Mohamed Chetouani, Jean Luc Zarader (2007), Multi-filter-bank approach for speaker verification based on genetic algorithm, NOLISP
Lara Stoll, Joe Frankel, Nikki Mirghafori (2007), Speaker recognition via nonlinear discriminant features, NOLISP
Ufuk Ülüg, Tolga Esat Özkurt, Tayfun Akgül (2007), Bispectrum mel-frequency cepstrum coefficients for robust speaker identification, NOLISP
Michael Gerber, Tobias Kaufmann, Beat Pfister (2007), Perceptron-based class verification, NOLISP
Andrew Errity, John McKenna, Barry Kirkpatrick (2007), Manifold learning-based feature transformation for phone classification, NOLISP
Xavier Domont, Martin Heckmann, Heiko Wersing, Frank Joublin, Stefan Menzel, Bernhard Sendhoff, Christian Goerick (2007), Word recognition with a hierarchical neural network, NOLISP
Joseph Keshet, David Grangier, Samy Bengio (2007), Discriminative keyword spotting, NOLISP
Ana I. García-Moral, Rubén Solera-Urena, Carmen Peláez-Moreno, Fernando Díaz-de-María (2007), Hybrid models for automatic speech recognition: a comparison of classical ANN and kernel-based methods, NOLISP
Guillaume Gravier, Daniel Moraru (2007), Towards phonetically-driven hidden Markov models: can we incorporate phonetic landmarks in HMM-based ASR?, NOLISP
Sid-Ahmed Selouani, Habib Hamam, Douglas O’Shaughnessy (2007), A hybrid genetic-neural front-end extension for robust speech recognition over telephone lines, NOLISP
P. Gómez, A. Álvarez, L. M. Mazaira, R. Fernández, V. Rodellar (2007), Estimating the stability and dispersion of the biometric glottal fingerprint in continuous speech, NOLISP
Korin Richmond (2007), Trajectory mixture density network with multiple mixtures for acoustic-articulatory inversion, NOLISP
Aitor Álvarez, Idoia Cearreta, Juan Miguel López, Andoni Arruti, Elena Lazkano, Basilio Sierra, Nestor Garay (2007), Application of feature subset selection based on evolutionary algorithms for automatic emotion recognition in speech, NOLISP
Friedhelm R. Drepper (2007), Non-stationary self-consistent acoustic objects as atoms of voiced speech, NOLISP
I. Paraskevas, E. Chilton, M. Rangoussi (2007), The Hartley phase cepstrum as a tool for signal analysis, NOLISP
Anis Ben Aicha, Sofia Ben Jebara (2007), Quantitative perceptual separation of two kinds of degradation in speech denoising applications, NOLISP
Neda Faraji, S. M. Ahadi, S. Saloomeh Shariati (2007), Threshold reduction for improving sparse coding shrinkage performance in speech enhancement, NOLISP
S. España-Boquera, M.J. Castro-Bleda, F. Zamora-Martínez, J. Gorbe-Moya (2007), Efficient Viterbi algorithms for lexical tree based models, NOLISP
Lin Yang, Jianping Zhang, Yonghong Yan (2007), Acoustic units selection in Chinese-English bilingual speech recognition, NOLISP
Zhaojie Liu, Pengyuan Zhang, Jian Shao, Qingwei Zhao, Yonghong Yan, Ji Feng (2007), Tone recognition in Mandarin spontaneous speech, NOLISP
Neda Faraji, S. M. Ahadi (2007), Evaluation of a feature selection scheme on ICA-based filter-bank for speech recognition, NOLISP
Denilson C. Silva (2007), A robust endpoint detection algorithm based on identification of the noise nature, NOLISP
Aïcha Bouzid, Noureddine Ellouze (2007), EMD analysis of speech signal in voiced mode, NOLISP
Karl Schnell, Arild Lacroix (2007), Estimation of speech features of glottal excitation by nonlinear prediction, NOLISP
O. Pernía, J. M. Górriz, J. Ramírez, C. G. Puntonet, I. Turias (2007), An efficient VAD based on a generalized Gaussian PDF, NOLISP
Heiga Zen (2013), Deep learning in speech synthesis, SSW
Nigel Ward (2013), Prosodic patterns in dialog, SSW
Xavier Serra (2013), Singing voice synthesis in the context of music technology research, SSW
Norbert Braunschweiler, Langzhou Chen (2013), Automatic detection of inhalation breath pauses for improved pause modelling in HMM-TTS, SSW
Vivek Kumar Rangarajan Sridhar, John Chen, Srinivas Bangalore, Alistair Conkie (2013), Role of pausing in text-to-speech synthesis for simultaneous interpretation, SSW
Alok Parlikar, Alan W. Black (2013), Minimum error rate training for phrasing in speech synthesis, SSW
Benjamin Picart, Sandrine Brognaux, Thomas Drugman (2013), HMM-based speech synthesis of live sports commentaries: integration of a two-layer prosody annotation, SSW
Tatsuo Inukai, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura (2013), Investigation of intra-speaker spectral parameter variation and its prediction towards improvement of spectral conversion metric, SSW
Sunayana Sitaram, Gopala Krishna Anumanchipalli, Justin Chiu, Alok Parlikar, Alan W. Black (2013), Text to speech in new languages without a standardized orthography, SSW
Oliver Watts, Adriana Stan, Robert A. J. Clark, Yoshitaka Mamiya, Mircea Giurgiu, Junichi Yamagishi, Simon King (2013), Unsupervised and lightly-supervised learning for rapid construction of TTS systems in multiple languages from ‘found’ data: evaluation and analysis, SSW
Mauro Nicolao, Fabio Tesser, Roger K. Moore (2013), A phonetic-contrast motivated adaptation to control the degree-of-articulation on Italian HMM-based synthetic voices, SSW
Cassia Valentini-Botinhao, Mirjam Wester, Junichi Yamagishi, Simon King (2013), Using neighbourhood density and selective SNR boosting to increase the intelligibility of synthetic speech in noise, SSW
Kayoko Yanagisawa, Javier Latorre, Vincent Wan, Mark J. F. Gales, Simon King (2013), Noise robustness in HMM-TTS speaker adaptation, SSW
Daniel Erro, Agustin Alonso, Luis Serrano, Eva Navas, Inma Hernaez (2013), New method for rapid vocal tract length adaptation in HMM-based speech synthesis, SSW
Nobukatsu Hojo, Kota Yoshizato, Hirokazu Kameoka, Daisuke Saito, Shigeki Sagayama (2013), Text-to-speech synthesizer based on combination of composite wavelet and hidden Markov models, SSW
Qiong Hu, Korin Richmond, Junichi Yamagishi, Javier Latorre (2013), An experimental comparison of multiple vocoder types, SSW
Yusuke Ijima, Noboru Miyazaki, Hideyuki Mizuno (2013), Statistical model training technique for speech synthesis based on speaker class, SSW
Maria Astrinaki, Alexis Moinet, Junichi Yamagishi, Korin Richmond, Zhen-Hua Ling, Simon King, Thierry Dutoit (2013), Mage - reactive articulatory feature control of HMM-based parametric speech synthesis, SSW
Martí Umbert, Jordi Bonada, Merlijn Blaauw (2013), Systematic database creation for expressive singing voice synthesis control, SSW
Matthew P. Aylett, Blaise Potard, Christopher J. Pidcock (2013), Expressive speech synthesis: synthesising ambiguity, SSW
Timo Baumann, David Schlangen (2013), Interactional adequacy as a factor in the perception of synthesized speech, SSW
Tamás Gábor Csapó, Géza Németh (2013), A novel irregular voice model for HMM-based speech synthesis, SSW
Kazuhiko Iwata, Tetsunori Kobayashi (2013), Expression of speaker’s intentions through sentence-final particle/intonation combinations in Japanese conversational speech synthesis, SSW
Oriol Guasch, Sten Ternström, Marc Arnela, Francesc Alías (2013), Unified numerical simulation of the physics of voice. The EUNISON project, SSW
Maria Astrinaki, Alexis Moinet, Junichi Yamagishi, Korin Richmond, Zhen-Hua Ling, Simon King, Thierry Dutoit (2013), Mage - HMM-based speech synthesis reactively controlled by the articulators, SSW
Maria Astrinaki, Junichi Yamagishi, Simon King, Nicolas d’Alessandro, Thierry Dutoit (2013), Reactive accent interpolation through an interactive map application, SSW
Christophe Veaux, Maria Astrinaki, Keiichiro Oura, Robert A. J. Clark, Junichi Yamagishi (2013), Real-time control of expressive speech synthesis using kinect body tracking, SSW
Àngel Calzada Defez, Joan Claudi Socoró Carrié, Robert A. J. Clark (2013), Parametric model for vocal effort interpolation with harmonics plus noise models, SSW
Anh-Tuan Dinh, Thanh-Son Phan, Tat-Thang Vu, Chi Mai Luong (2013), Vietnamese HMM-based speech synthesis with prosody information, SSW
Hiroya Hashimoto, Keikichi Hirose, Nobuaki Minematsu (2013), Context labels based on "bunsetsu" for HMM-based speech synthesis of Japanese, SSW
Yoshitaka Mamiya, Adriana Stan, Junichi Yamagishi, Peter Bell, Oliver Watts, Robert A. J. Clark, Simon King (2013), Using adaptation to improve speech transcription alignment in noisy and reverberant environments, SSW
Nobuyuki Nishizawa, Tsuneo Kato (2013), Speech synthesis using a maximally decimated pseudo QMF bank for embedded devices, SSW
Sathish Pammi, Marcela Charfuelan (2013), HMM-based scost quality control for unit selection speech synthesis, SSW
Lakshmi Saheer, Blaise Potard (2013), Understanding factors in emotion perception, SSW
Rubén San-Segundo, Juan Manuel Montero, Mircea Giurgiu, Ioana Muresan, Simon King (2013), Multilingual number transcription for text-to-speech conversion, SSW
Ryoichi Takashima, Ryo Aihara, Tetsuya Takiguchi, Yasuo Ariki (2013), Noise-robust voice conversion based on spectral mapping on sparse space, SSW
Markus Toman, Michael Pucher, Dietmar Schabus (2013), Cross-variety speaker transformation in HSMM-based speech synthesis, SSW
Markus Toman, Michael Pucher, Dietmar Schabus (2013), Multi-variety adaptive acoustic modeling in HSMM-based speech synthesis, SSW
Florian Hinterleitner, Christoph Norrenbrock, Sebastian Möller (2013), Is intelligibility still the main problem? a review of perceptual quality dimensions of synthetic speech, SSW
Sébastien Le Maguer, Nelly Barbot, Olivier Boeffard (2013), Evaluation of contextual descriptors for HMM-based speech synthesis in French, SSW
Jaime Lorenzo-Trueba, Roberto Barra-Chicote, Junichi Yamagishi, Oliver Watts, Juan Manuel Montero (2013), Towards speaking style transplantation in speech synthesis, SSW
Thomas Merritt, Simon King (2013), Investigating the shortcomings of HMM synthesis, SSW
Raúl Montaño, Francesc Alías, Josep Ferrer (2013), Prosodic analysis of storytelling discourse modes and narrative situations oriented to text-to-speech synthesis, SSW
Ulpu Remes, Reima Karhila, Mikko Kurimo (2013), Objective evaluation measures for speaker-adaptive HMM-TTS systems, SSW
Fabio Tesser, Giacomo Sommavilla, Giulio Paci, Piero Cosi (2013), Experiments with signal-driven symbolic prosody for statistical parametric speech synthesis, SSW
Anandaswarup Vadapalli, Peri Bhaskararao, Kishore Prahallad (2013), Significance of word-terminal syllables for prediction of phrase breaks in text-to-speech systems for Indian languages, SSW
Catherine Watson, Wei Liu, Bruce MacDonald (2013), The effect of age and native speaker status on synthetic speech intelligibility, SSW
Zhizheng Wu, Tuomas Virtanen, Tomi Kinnunen, Eng Siong Chng, Haizhou Li (2013), Exemplar-based voice conversion using non-negative spectrogram deconvolution, SSW
Ibrahim Almosallam, Atheer Alkhalifa, Mansour Alghamdi, Mohamed Alkanhal, Ashraf Alkhairy (2013), SASSC: a standard Arabic single speaker corpus, SSW
Ladan Golipour, Alistair Conkie, Ann Syrdal (2013), Prosodically modifying speech for unit selection speech synthesis databases, SSW
Heng Lu, Simon King, Oliver Watts (2013), Combining a vector space representation of linguistic context with a deep neural network for text-to-speech synthesis, SSW
Jindřich Matoušek, Daniel Tihelka, Milan Legát (2013), Is unit selection aware of audible artifacts?, SSW
Kenji Matsui, Kenta Kimura, Yoshihisa Nakatoh, Yumiko O. Kato (2013), Development of electrolarynx with hands-free prosody control, SSW
Trung-Nghia Phung, Chi Mai Luong, Masato Akagi (2013), A hybrid TTS between unit selection and HMM-based TTS under limited data conditions, SSW
Antti Suni, Daniel Aalto, Tuomo Raitio, Paavo Alku, Martti Vainio (2013), Wavelets for intonation modeling in HMM speech synthesis, SSW
B. Ramani, S. Lilly Christina, G. Anushiya Rachel, V. Sherlin Solomi, Mahesh Kumar Nandwana, Anusha Prakash, S. Aswin Shanmugam, Raghava Krishnan, S. Kishore Prahallad, K. Samudravijaya, P. Vijayalakshmi, T. Nagarajan, Hema A. Murthy (2013), A common attribute based unified HTS framework for speech synthesis in Indian languages, SSW
Takenori Yoshimura, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda (2013), Cross-lingual speaker adaptation based on factor analysis using bilingual speech data for HMM-based speech synthesis, SSW
Yi-Chin Huang, Chung-Hsien Wu, Shih-Lun Lin (2013), Residual compensation based on articulatory feature-based phone clustering for hybrid Mandarin speech synthesis, SSW
Daniel Hirst, Albert Rilliard, Véronique Aubergé (1998), Comparison of subjective evaluation and an objective evaluation metric for prosody in text-to-speech synthesis, SSW
Gerit P. Sonntag, Thomas Portele, Felicitas Haas (1998), Comparing the comprehensibility of different synthetic voices in a dual task experiment, SSW
Christophe d'Alessandro (1998), Joint evaluation of text-to-speech synthesis in French within the AUPELF ARC-B3 project, SSW
Nick Campbell (1998), Where is the information in speech? (and to what extent can it be modelled in synthesis?), SSW
Osamu Mizuno, Shin'ya Nakajima (1998), Synthetic speech/sound control language: MSCL, SSW
Richard Sproat, Andrew Hunt, Mari Ostendorf, Paul Taylor, Alan W. Black, Kevin Lenzo, Mike Edgington (1998), SABLE: A standard for TTS markup, SSW
Jennifer J. Venditti, Jan P. H. van Santen (1998), Modeling segmental durations for Japanese text-to-speech synthesis, SSW
Li-Chiung Yang (1998), Contextual Effects on Syllable Duration, SSW
Albert Febrer, Jaume Padrell, Antonio Bonafonte (1998), Modeling phone duration: application to Catalan TTS, SSW
Jürgen Trouvain, William J. Barry, Claus Nielsen, Ove Andersen (1998), Implications of energy declination for speech synthesis, SSW
Robert I. Damper, Y. Marchand, M. J. Adamson, Kjell Gustafson (1998), Comparative evaluation of letter-to-sound conversion techniques for English text-to-speech synthesis, SSW
Bernd Möbius (1998), Word and syllable models for German text-to-speech synthesis, SSW
Robert I. Damper, Y. Marchand (1998), Improving pronunciation by analogy for text-to-speech applications, SSW
George Anton Kiraz, Bernd Möbius (1998), Multilingual syllabification using weighted finite-state transducers, SSW
Alan W. Black, Kevin Lenzo, Vincent Pagel (1998), Issues in building general letter to sound rules, SSW
Chilin Shih, Bernd Möbius (1998), Contextual effects on voicing profiles of German and Mandarin consonants, SSW
Albert Rilliard, Véronique Aubergé (1998), Reiterant speech for the evaluation of natural vs. synthetic prosody, SSW
C. Grover, J. Fackrell, H. Vereecken, Jean-Pierre Martens, Bert Van Coile (1998), Designing prosodic databases for automatic modelling in 6 languages, SSW
Frédérique Sannier, Véronique Aubergé, Rabia Belrhali (1998), How a French text-to-speech system can describe loanwords, SSW
Chilin Shih, Wentao Gu, Jan P. H. van Santen (1998), Efficient adaptation of TTS duration model to new speakers, SSW
Arthur Dirksen, Ludmila Menert (1998), Prosody control in fluent Dutch text-to-speech, SSW
Oliver Jokisch, Diane Hirschfeld, Matthias Eichner, Rüdiger Hoffmann (1998), Creating an individual speech rhythm: a data driven approach, SSW
Janet E. Cahn (1998), Generating pitch accent distributions that show individual and stylistic differences, SSW
Philippe Boula de Mareüil, Christophe d'Alessandro (1998), Text chunking for prosodic phrasing in French, SSW
Corey Miller (1998), Individuation of postlexical phonology for speech synthesis, SSW
Eric Keller, Brigitte Zellner (1998), Motivations for the prosodic predictive chain, SSW
Brigitte Zellner (1998), Temporal structures for fast and slow speech rate, SSW
Paul Taylor, Alan W. Black, Richard Caley (1998), The architecture of the Festival speech synthesis system, SSW
Thomas Portele (1998), JUst CONcatenation - A Corpus-based Approach and its Limits, SSW
Pedro M. Carvalho, Luís C. Oliveira, Isabel M. Trancoso, M. Céu Viana (1998), Concatenative speech synthesis for European Portuguese, SSW
Ove Andersen, N.-J. Dyhr, I. S. Engberg, C. Nielsen (1998), Synthesising short vowels from their long counterparts in a concatenative based text-to-speech system, SSW
H. Timothy Bunnell, Steven R. Hoskins, Debra M. Yarrington (1998), A biphone constrained concatenation method for diphone synthesis, SSW
Nick Campbell (1998), Foreign Language Speech System, SSW
Ken Fujisawa, Nick Campbell (1998), Prosody-based unit selection for Japanese speech synthesis, SSW
Mark Beutnagel, Alistair Conkie, Ann K. Syrdal (1998), Diphone synthesis using unit selection, SSW
Wen Ding, Ken Fujisawa, Nick Campbell (1998), Improving speech synthesis of CHATR using a perceptual discontinuity function and constraints of prosodic modification, SSW
Michael W. Macon, Andrew E. Cronk, Johan Wouters (1998), Generalization and discrimination in tree-structured unit selection, SSW
Andrew P. Breen, Peter Jackson (1998), Non-uniform unit selection and the similarity metric within BT’s Laureate TTS system, SSW
D. Torre Toledano, M. A. Rodríguez Crespo, J. G. Escalada Sardina (1998), Trying to mimic human segmentation of speech using HMM and fuzzy logic post-correction rules, SSW
Agaath Sluijter, E. Bosgoed, J. Kerkhoff, E. Meier, Toni Rietveld, A. Sanderman, Marc Swerts, Jacques Terken (1998), Evaluation of speech synthesis systems for Dutch in telecommunication applications, SSW
Sebastian Heid, Sarah Hawkins (1998), PROCSY: A hybrid approach to high-quality formant synthesis using HLSyn, SSW
Alexander Kain, Mike Macon (1998), Personalizing a speech synthesizer by voice adaptation, SSW
M. Plumpe, S. Meredith (1998), Which is more important in a concatenative text to speech system - pitch, duration, or spectral discontinuity?, SSW
C. Silva, S. Chennoukh (1998), Estimation of articulatory parameter trajectory from speech acoustic dynamics, SSW
L. Charonnat, G. Ó-Néill, Guy Mercier (1998), An Irish speech synthesiser, SSW
Pierre Badin, Gérard Bailly, M. Raybaudi, C. Segebarth (1998), A three-dimensional linear articulatory model based on MRI data, SSW
Keiichi Funaki, Yoshikazu Miyanaga, Koji Tochinai (1998), On subband analysis based on glottal-ARMAX speech model, SSW
Yannis Stylianou (1998), Concatenative speech synthesis using a harmonic plus noise model, SSW
Yannis Stylianou (1998), Removing phase mismatches in concatenative speech synthesis, SSW
Masatsune Tamura, Takashi Masuko, Keiichi Tokuda, Takao Kobayashi (1998), Speaker adaptation for HMM-based speech synthesis system using MLLR, SSW
Christophe d’Alessandro, Boris Doval (1998), Experiments in voice quality modification of natural speech signals: the spectral approach, SSW
Martin Holzapfel, Rüdiger Hoffmann, Harald Höge (1998), A wavelet-domain PSOLA approach, SSW
Jialin Zhong, Joseph Olive (1998), Cloning synthetic talking heads, SSW
Jan P. H. van Santen, Bernd Möbius, Jennifer J. Venditti, Chilin Shih (1998), Description of the Bell Labs Intonation System, SSW
Hiroya Fujisaki, Sumio Ohno, Changfu Wang (1998), A command-response model for F0 contour generation in multilingual speech synthesis, SSW
Ann K. Syrdal, Gregor Möhler, Kurt Dusterhoff, Alistair Conkie, Alan W. Black (1998), Three methods of intonation modeling, SSW
Gregor Möhler, Alistair Conkie (1998), Parametric modeling of intonation using vector quantization, SSW
Jennifer J. Venditti, Kazuaki Maeda, Jan P. H. van Santen (1998), Modeling Japanese boundary pitch movements for speech synthesis, SSW
F. Malfrère, Thierry Dutoit, Piet Mertens (1998), Automatic prosody generation using suprasegmental unit selection, SSW
Jan P. H. van Santen, Louis C. W. Pols, Masanobu Abe, Dan Kahn, Eric Keller, Julie Vonwiller (1998), Report on the Third ESCA TTS Workshop evaluation procedure, SSW
Corine Astésano (2022), De la supramodalité du rythme : Implications pour la description prosodique, la remédiation linguistique et l’apprentissage des langues, JEP
Alexis Pierrard, Philippe Boula de Mareüil (2022), Étude acoustique préliminaire des /r/ fricatifs en espagnol des hautes terres de Bolivie, JEP
Yohann Meynadier, Alain Ghio, Alexendra Césari-Liétard (2022), Phonologie des voyelles nasales du français méridional à la lumière de l’aérophonométrie, JEP
Tsiky Rakotomalala, Pierre Baraduc, Pascal Perrier (2022), Optimalité dans la production de parole: quel rôle sur la formation de trajectoires ?, JEP
Hang Le, Sina Alisamir, Marco Dinarelli, Fabien Ringeval, Solène Evain, Ha Nguyen, Marcely Zanon Boito, Salima Mdhaffar, Ziyi Tong, Natalia Tomashenko, Titouan Parcollet, Alexandre Allauzen, Yannick Estève, Benjamin Lecouteux, François Portet, Solange Rossato, Didier Schwab, Laurent Besacier (2022), LeBenchmark, un référentiel d'évaluation pour le français oral, JEP
Salima Mdhaffar, Jean-François Bonastre, Marc Tommasi, Natalia Tomashenko, Yannick Estève (2022), Extraction d'informations liées au locuteur depuis un modèle acoustique personnalisé, JEP
Sebastião Quintas, Alberto Abad, Julie Mauclair, Virginie Woisard, Julien Pinquier (2022), Utilisation de réseaux de neurones profonds avec attention pour la prédiction de l’intelligibilité de la parole de patients atteints de cancers ORL, JEP
Mohamed Embarki, Jonathan Owens (2022), Sonorité, resyllabation et ‘syndrome gahawa’ en arabe, JEP
Erwan Pépiot, Aron Arnold (2022), Différences acoustiques inter-genres chez des bilingues Anglais/Français : une étude du Voice Onset Time, JEP
Thalassio Briand, Camille Fauth, Hélène Vassiliadou (2022), Marques de l’émotion dans la fluence d’un patient cérébrolésé : Étude préliminaire de faisabilité, JEP
Mélanie Lancien (2022), Documenter la variabilité en français québécois : une description acoustique des allophones lénifiés de /R/, JEP
Pierre Champion, Anthony Larcher, Denis Jouvet (2022), Anonymisation de parole par quantification vectorielle, JEP
Vincent Roger, Jérôme Farinas, Virginie Woisard, Julien Pinquier (2022), Création d’une mesure entropique de la parole pour évaluer l’intelligibilité de patients atteints de cancers des voies aérodigestives supérieures, JEP
Mohammad Abuoudeh, Olivier Crouzet (2022), L’influence des variations de débit de parole sur les informations temporelles et spectrales associées à la longueur vocalique en arabe jordanien, JEP
Elisabeth Delais-Roussarie, Cyrille Granget (2022), La prosodie de la L1 contraint-elle l’acquisition de la morphologie verbale en français L2 ?, JEP
Cécile Fougeron, Nicolas Audibert (2022), Variabilité intra-individuelle en parole lue, JEP
Julie Glikman, Camille Fauth (2022), Un nouvel accès à la parole spontanée : les vocaux, JEP
Fabrice Hirsch, Ivana Didirková, Slim Ouni, Shakeel Ahmad Sheikh, Yves Laprie, Marie-Claude Monfrais-Pfauwadel, Eleonor Burkhardt (2022), La vélocité des mouvements labiaux et mandibulaires : un indice pour différencier les disfluences typiques du bégaiement et les disfluences normales? Une étude pilote, JEP
Lei Xi, Rachid Ridouane (2022), Quand la syntaxe a besoin de la prosodie : comment les indices prosodiques en français aident des apprenants sinophones à traiter l’information syntaxique – une étude perceptive, JEP
Xuejing Chen, Rachid Ridouane, Pierre André Hallé (2022), Perception des clusters selon leur profil de sonorité : le cas des auditeurs du mandarin confrontés à des clusters russes, JEP
Théo Mariotte, Anthony Larcher, Silvio Montrésor, Jean-Hugh Thomas (2022), Détection de Parole Superposée Multicanal à l'aide de Mécanismes d'Auto-Attention, JEP
Kübra Bodur, Christine Meunier, Corinne Fredouille (2022), Formes réduites en conversation: caractéristiques des séquences et des locuteurs, JEP
Solène Evain, Solange Rossato, Benjamin Lecouteux, François Portet (2022), Typologie de la parole spontanée à des fins d'analyse linguistique et de développement de systèmes de reconnaissance automatique de la parole, JEP
Delphine Charuau, Béatrice Vaxelaire, Rudolph Sock (2022), La variation des patterns respiratoires en parole chez l'enfant, JEP
Dayeon Yoon, Nicolas Audibert, Cécile Fougeron (2022), Différences mélodiques et spectrales entre sexes comparées chez les locuteurs coréens et français, JEP
Mohammad Mohammadamini, Driss Matrouf, Sandipana Dowerah, Romain Serizel, Denis Jouvet, Jean-François Bonastre (2022), Le comportement des systèmes de reconnaissance du locuteur de l'état de l'art face aux variabilités acoustiques, JEP
Lucie Van Bogaert, Laura Machart, Anne Vilain, Hélène Loevenbruck (2022), Perception de parole chez l’enfant porteur d’implant(s) cochléaire(s) : Étude sur l’Auditory Verbal Therapy et la Langue française Parlée Complétée, JEP
Estelle Chardenon, Christine Meunier, Cécile Fougeron (2022), Variations de débit articulatoire sur un corpus conversationnel avec différents types d'interactions, JEP
Sara Larotonda, Véronique Delvaux, Bernard Harmegnies, Myriam Piccaluga, Sophie Van Malderen, Kathy Huet (2022), Étude du rôle de l’incongruité contextuelle et de la prosodie dans la compréhension de l’ironie chez des adultes tout venants, JEP
Sandrine Ferré, Océane Mahé, Marta Manenti, Heglyn Pimenta, Philippe Prévost (2022), Détecter un déficit phonologique chez les adultes - un module adulte pour LITMUS-QU-NWR-FR, JEP
Véronique Delvaux, Bernard Harmegnies, Kathy Huet, Myriam Piccaluga, Virginie Roland, Clémence Verhaegen (2022), Les personnes atteintes de la maladie de Parkinson font preuve de flexibilité phonétique, JEP
Marie Rebourg, Muriel Lalain, Alain Ghio, Corinne Fredouille, Laura Monestier, Nicolas Fakhry, Virginie Woisard (2022), Évaluation de l’intelligibilité des segments vocaliques de patients traités pour un cancer des VADS et de locuteurs contrôles, JEP
Robin Vaysse, Alain Ghio, Corine Astésano, Jérôme Farinas, François Viallet (2022), Analyse macroscopique des variations et modulations de F0 en lecture dans la maladie de Parkinson : données sur 320 locuteurs, JEP
Marion Blondel (2022), Prosodie et langue(s) des signes : un aperçu poétique, JEP
Benjamin Bigot, Hélène Devulder, Matthieu Puigt (2022), Amélioration de l’intelligibilité de la parole dans des enregistrements de « boîtes noires aéronautiques » à l’aide de méthodes de Séparation Aveugle de Sources, JEP
Paula Alejandra Cano-Cordoba, Thi Thuy Hien Tran, Nathalie Vallée, Christophe Savariaux, Silvain Gerber, Nicha Yamlamai (2022), Caractérisation des consonnes plosives non relâchées en thaï : Une étude glottographique, JEP
Laurence Gallitre, Solange Rossato (2022), Organisation temporelle des silences dans le discours en situation de crise : une étude de cas dans l’aéronautique, JEP
Amélie Richard, Karen Reilly, Sophie Jacquin-Courtois (2022), Que révèlent les disfluences sur le manque du mot rapporté par les patientes ayant un cancer du sein, JEP
Caroline Smith (2022), Réalisation de syntagmes accentuels avec différents nombres de syllabes : Différences entre locutrices L1 et L2, JEP
Caihong Weng, Alexander Martin, Ioana Chitoran (2022), Perceptual assimilation of Mandarin non-sibilant fricatives by speakers of Quanzhou Southern Min, JEP
Badreddine Hamma, Lassâad Oueslati (2022), Valeurs aspectuelles et modales du verbe finir dans ses emplois comme semi-auxiliaire et dans ses emplois libres, JEP
Jane Wottawa, Martine Adda-Decker, Frédéric Isel (2022), La perception de [h] et [ʔ] chez des bilingues français-allemand tardifs. Et si c'était juste du souffle ?, JEP
Anais Tran Ngoc, Julien Meyer, Fanny Meunier (2022), Bénéfices de la pratique musicale sur la catégorisation de la parole sifflée : analyse des processus de transferts, JEP
Clémence Guieu-Grandsire, Naomi Yamaguchi, Shigeko Shinohara (2022), Perception des rhotiques chez des enfants monolingues et bilingues franco-grecs, JEP
Solène Evain, Ha Nguyen, Hang Le, Marcely Zanon Boito, Salima Mdhaffar, Sina Alisamir, Ziyi Tong, Natalia Tomashenko, Marco Dinarelli, Titouan Parcollet, Alexandre Allauzen, Yannick Estève, Benjamin Lecouteux, François Portet, Solange Rossato, Fabien Ringeval, Didier Schwab, Laurent Besacier (2022), Modèles neuronaux pré-appris par auto-supervision sur des enregistrements de parole en français, JEP
Xiao Xiao, Nicolas Audibert, Christophe d'Alessandro, Grégoire Locqueville, Barbara Kuhnert, Rébecca Kleinberger, Claire Pillot-Loiseau (2022), Évaluation de la stylisation chironomique pour l'apprentissage de l'intonation du français L2, JEP
Martin Lebourdais, Marie Tahon, Antoine Laurent, Anthony Larcher, Sylvain Meignier (2022), Parole superposée et genre, étude des annotations pour les médias audiovisuels, JEP
Mathilde Hutin, Caihong Weng, Martine Adda-Decker, Ioana Vasilescu, Lori Lamel (2022), Disfluences et erreurs d’alignement au niveau du phonème : le cas des consonnes de liaison en français, JEP
Sondes Abderrazek, Corinne Fredouille, Alain Ghio, Muriel Lalain, Christine Meunier, Virginie Woisard (2022), Interprétation des représentations profondes des traits phonétiques via l'approche NCD - Neuro-based Concept Detector : Application aux troubles de la parole, JEP
Qianwen Guan, Yaru Wu, Ioana Chitoran (2022), Rôle du contexte prosodique et segmental dans la variation phonétique des séquences consonantiques en /ʁ/ en français, JEP
Gaëlle Laperrière, Valentin Pelloin, Antoine Caubrière, Salima Mdhaffar, Nathalie Camelin, Sahar Ghannay, Bassam Jabaian, Yannick Estève (2022), Le benchmark MEDIA revisité : données, outils et évaluation dans un contexte d’apprentissage profond, JEP
Severine Guillaume, Guillaume Wisniewski, Cécile Macaire, Guillaume Jacques, Alexis Michaud, Benjamin Galliot, Maximin Coavoux, Solange Rossato, Minh-Châu Nguyên, Maxime Fily (2022), Les modèles pré-entraînés à l'épreuve des langues rares : expériences de reconnaissance de mots sur la langue japhug (sino-tibétain), JEP
Thibault Cordier, Fabrice Lefèvre, Tanguy Urvoy, Lina Maria Rojas-Barahona (2022), Et la robustesse ?... bordel ! Comment les stratégies de dialogue par apprentissage structuré résistent aux bruits des entrées ?, JEP
Mathilde Degand, Véronique Delvaux, Bernard Harmegnies, Kathy Huet, Myriam Piccaluga, Clémence Verhaegen, Virginie Roland (2022), Activités de réminiscence dans la maladie d’Alzheimer : analyse des gestes en situation d’échanges dyadiques, JEP
Lucile Gelin, Thomas Pellegrini, Julien Pinquier, Morgane Daniel (2022), Améliorations d’un système Transformer de reconnaissance de phonèmes appliqué à la parole d'enfants apprenants lecteurs, JEP
Paul-Gauthier Noé, Andreas Nautsch, Driss Matrouf, Pierre-Michel Bousquet, Jean-François Bonastre (2022), Faire le pont entre l’observation et la preuve : Application au respect de la vie privée, JEP
Aline Marchand (2022), Acquisition de la prosodie en L2 : une étude acoustique de l'accentuation en français par des adultes turcophones, JEP
Muriel Lalain, Laura Monestier, Alain Ghio, Corinne Fredouille, Marie Rebourg, Nicolas Fakhry, Virginie Woisard (2022), Prédiction du degré d'altération de l'intelligibilité chez des patients traités pour un cancer de la cavité buccale ou de l'oropharynx, JEP
Jinyu Li, Leonardo Lancia (2022), Restructuration rythmique de la parole produite sous feedback auditif retardé, JEP
Gaëlle Ferré (2022), Les pauses gestuelles correspondent-elles à des dysfluences verbales chez les personnes aphasiques ?, JEP
Leonardo Contreras Roa, Paolo Mairano, Caroline Moreau, Anahita Basirat (2022), Impact de l’amorçage rythmique sur la production de la parole chez des personnes atteintes de la maladie de Parkinson : Étude pilote, JEP
Patrick Louis Rohrer, Pilar Prieto, Elisabeth Delais-Roussarie (2022), Le rythme prosodique guide le rythme gestuel, JEP
Emmanuel Ferragne, Anne Guyot-Talbot, Margaux Cecchini, Martine Beugnet, Emmanuelle Delanoë-Brun, Laurianne Georgeton, Christophe Stécoli, Jean-François Bonastre, Corinne Fredouille (2022), Représentations de l’expertise vocale dans les séries policières : quand la fiction s’invite dans les enquêtes et au tribunal, JEP
Camille Joseph, Ivana Didirková, Claudia Schweitzer, Christelle Dodane (2022), Les premières archives de la parole (1890-1940), JEP
Melissa Barkat-Defradas, Camille Fauth, Didier Demolin, Alexandre Suire (2022), Évolution de la voix humaine (France, période 1940 – 2019), JEP
Christophe d'Alessandro (2022), Une nouvelle organologie de la voix : chironomie et prosodie de la parole et du chant, JEP
Hubert Nourtel, Pierre Champion, Denis Jouvet, Anthony Larcher, Marie Tahon (2022), Analyse de l'anonymisation du locuteur sur de la parole émotionnelle, JEP
Jean-François Bonastre, Imen Ben Amor (2022), BA-LR : une approche transparente de comparaison de voix en criminalistique, JEP
Laurent Girin, Xiaoyu Bie, Simon Leglaive, Thomas Hueber, Xavier Alameda-Pineda (2022), Les auto-encodeurs variationnels dynamiques et leur application à la modélisation de spectrogrammes de parole, JEP
Eva Goeseels, Bernard Harmegnies, Kathy Huet, Myriam Piccaluga, Virginie Roland, Clémence Verhaegen, Véronique Delvaux (2022), Etude du phénomène de convergence phonétique : comparaison enfants adultes, JEP
Boram Lee, Naomi Yamaguchi, Cécile Fougeron (2022), Perception des occlusives du coréen L2 : réorganisation du cue weighting au cours du temps, JEP
Christelle Dodane, Corine Astésano (2022), Mise en place de l’accentuation initiale et finale chez deux enfants monolingues français âgés de 18 à 36 mois, JEP
Verdiana De Fino, Lionel Fontan, Julien Pinquier, Corentin Barcat, Isabelle Ferrané, Sylvain Detey (2022), Mesures automatiques de parole non-native : exploration pilote d’un corpus d’apprenants japonais de français et différenciation de niveaux, JEP
Vincent P. Martin, Jean-Luc Rouas, Agathe Basse, Benoît Caudron, Marie Huillet, Pierre Philip (2022), Est-il possible d’annoter la naturalité des pauses lors de la lecture d’un texte à haute voix ?, JEP
Olivier Crouzet, Agnieszka Duniec, Elisabeth Delais-Roussarie (2022), Analyse Factorielle de signaux musicaux : comparaison avec les données de parole dans la perspective de l'hypothèse de codage efficace, JEP
Oralie Cattan, Sahar Ghannay, Christophe Servan, Sophie Rosset (2022), Etude comparative de modèles Transformers en compréhension de la parole en Français, JEP
Charlotte Alazard, Leonardo Contreras Roa, Marie Philippart de Foy, Lionel Fontan, Nadia Yassine-Diab, Julien Pinquier, Isabelle Ferrané (2022), Apport du geste dans l'acquisition de la prononciation en L2 : quand la réalité ne correspond pas aux attentes, JEP
Coline Caillol, Emmanuel Ferragne (2022), Américanisation de la prononciation de l'anglais en voix chantée: le cas des voyelles de TRAP-BATH et LOT, JEP
Andres Felipe Lara, Claire Pillot-Loiseau (2022), L’interférence interlinguistique : étude comparant le VOT chez des apprenants tardifs, des bilingues simultanés et des monolingues, JEP
Daria D'Alessandro, Alice Yildiz, Cécile Fougeron (2022), Variation individuelle de la coarticulation en fonction de la frontière prosodique, JEP
Grégory Miras, Claire Pillot-Loiseau (2022), Discrimination des voyelles par des apprenants du français comme langue étrangère : effets de l’exposition et de la pratique de la musique instrumentale, JEP
Lila Kim, Cédric Gendrot (2022), Classification automatique de voyelles nasales pour une caractérisation de la qualité de voix des locuteurs par des réseaux de neurones convolutifs, JEP
Martin Lenglet, Olivier Perrotin, Gérard Bailly (2022), Modélisation de la Parole avec Tacotron2 : Analyse acoustique et phonétique des plongements de caractère, JEP
Jenifer Vega Rodriguez, Nathalie Vallée, Thiago Chacón (2022), Glottalisation en korebaju : la question d’un trait mixte segmental et suprasegmental, JEP
Qianwen Guan, Pierrick Philippe (2022), Influence de la similarité acoustique entre L1 et L2 dans la production des voyelles anglaises par les natifs français, JEP
Lily Wadoux, Nelly Barbot, Jonathan Chevelu, Damien Lolive (2022), Impact du contenu phonétique sur les plongements de locuteurs pour le clonage de voix : vers l'application aux pathologies vocales, JEP
Valentin Pelloin, Nathalie Camelin, Antoine Laurent, Renato De Mori, Sylvain Meignier (2022), Architectures neuronales bout-en-bout pour la compréhension de la parole, JEP
Sophie Fagniart, Brigitte Charlier, Véronique Delvaux, Bernard Harmegnies, Anne Huberlant, Myriam Piccaluga, Kathy Huet (2022), Développement langagier d’enfants porteurs d’implants cochléaires et normo-entendants : lien avec différents indices acoustiques de nasalité vocalique, JEP
Nathalie Henrich Bernardoni, Raphaël Girault, Hamid Yousefi-Mashouf, Paul Luizard, Laurent Orgéas, Lucie Bailly (2022), Développement d’un banc in vitro pour la caractérisation vibratoire de plis vocaux biomimétiques et pré-déformés, JEP
Bianca Maria De Paolis, Fabian Santiago, Cecilia Andorno (2022), Syntaxe ou prosodie? Une étude préliminaire sur l'expression de la focalisation étroite par les apprenants italophones de français L2, JEP
Natalia Tomashenko, Salima Mdhaffar, Marc Tommasi, Yannick Estève, Jean-François Bonastre (2022), Sur la vérification du locuteur à partir de traces d'exécution de modèles acoustiques personnalisés, JEP
Chakir Zeroual, Abdelatif Belkadi, Laura Abou-Haidar (2022), Effet de l’anxiété et de la motivation langagières sur le niveau de prononciation du FLE chez des universitaires marocains, JEP
Makoto Ito (2022), La perception du schwa en français par les apprenants japonophones et les natifs francophones, JEP
Cedric Gendrot, Emmanuel Ferragne, Anaïs Chanclu (2022), Analyse phonétique de la variation inter-locuteurs au moyen de réseaux de neurones convolutifs : voyelles seules et séquences courtes de parole, JEP
Yaru Wu, Ivana Didirková, Anne-Catherine Simon (2022), Disfluences en parole continue en français : paramètres prosodiques des pauses pleines et des allongements vocaliques, JEP
Lucie Judkins, Charlotte Alazard, Corine Astésano (2022), Phénomènes de groupement et pause en parole native vs non-native, JEP
Djegdjiga Amazouz, Martine Adda-Decker, Lori Lamel (2022), Variation du voisement des occlusives orales en code-switching: analyses par ABX automatique et mesures acoustiques, JEP
Barbara Tillman (2022), Musique et langage : Effets d’amorçage rythmique sur le traitement du langage, JEP
Ian Maddieson (2022), Le ton, est-il originel ? – Le lien musique-langage, JEP
Christelle Veger-Paganelli, Claire Pillot-Loiseau (2022), Caractéristiques acoustiques des voyelles parlées et chantées en langue corse : comparaison de textes interprétés en 1916 et 2020, JEP
Annalisa Paroni, Nathalie Henrich Bernardoni, Hélène Loevenbruck, Silvain Gerber, Pierre Baraduc, Christophe Savariaux (2022), Etude comparative acoustique et articulatoire de la plosion entre parole et beatbox, JEP
Yishan Jin, Damien Chabanal (2022), Perception et production des consonnes occlusives orales du français par des sinophones de groupes dialectaux wu et non wu, JEP
Jean-Luc Schwartz, Pierre Bessière, Pascal Perrier, Marc-Antoine Georges, Mamady Nabé, Julien Diard, Marie-Lou Barnaud, Raphaël Laurent, Jean-François Patri, Clément Moulin-Frier (2022), COSMO : un modèle bayésien des fondements sensorimoteurs de la perception et de la production de la parole, JEP
Pierre Baraduc, Coriandre Vilain (2022), Retours acoustiques de la production de parole : caractérisation des différences informationnelles entre le son aérien et le son par conduction osseuse, JEP
Sadaoki Furui (1991), Recent advances in speech recognition, Eurospeech
Frank Fallside (1991), On the acquisition of speech by machines, ASM, Eurospeech
P. Ramesh, Jay G. Wilpon, M. A. McGee, David B. Roe, Chin-Hui Lee, Lawrence R. Rabiner (1991), Speaker independent recognition of spontaneously spoken connected digits, Eurospeech
P. S. Gopalakrishnan, David Nahamoo (1991), Immediate recognition of embedded command words, Eurospeech
Lynn D. Wilcox, Marcia A. Bush (1991), HMM-based wordspotting for voice editing and indexing, Eurospeech
Janet M. Baker (1991), Large vocabulary speaker-adaptive continuous speech recognition research overview at Dragon Systems, Eurospeech
Victoria Sgardoni, Dimitrios A. Gaganelis, Eleftherios D. Frangoulis (1991), Continuous density HMM context dependent phones for speech recognition over the telephone, Eurospeech
Katsuhiko Shirai, K. Hashimoto, T. Kobayashi (1991), Text-to-speech synthesizer using superposition of sinusoidal waves generated by synchronized oscillators, Eurospeech
M. Guerti, G. Bailly (1991), Synthesis-by-rule using compost: modelling resonance trajectories, Eurospeech
Yasushi Ishikawa, Kunio Nakajima (1991), Neural network based spectral interpolation method for speech synthesis by rule, Eurospeech
Marine Garnier-Rizet (1991), A rule-based segmental synthesis module for French, Eurospeech
Norman M. Fraser, G. Nigel Gilbert (1991), Effects of system voice quality on user utterances in speech dialogue systems, Eurospeech
P. Day, A. Grünupp, K.-P. Muthig (1991), A human factors study of speech-to-text technology: consequences of discrete speech, Eurospeech
Iain R. Murray, John L. Arnott, Alan F. Newell (1991), A comparison of document composition using a listening typewriter and conventional office systems, Eurospeech
Paulus H. Vossen (1991), Evaluating speech input and output in a CAD-system using the hidden-operator method, Eurospeech
M. Zajicek, J. Hewitt (1991), Mixed mode input for a standard wordprocessor. investigating links between input mode, speech and keyboard, and specific task areas, Eurospeech
P. Lockwood, J. Boudy (1991), Experiments with a non-linear spectral subtractor (NSS), hidden Markov models and the projection, for robust speech recognition in cars, Eurospeech
P. Lockwood, C. Baillargeat, J. M. Gillot, J. Boudy, G. Faucon (1991), Noise reduction for speech enhancement in cars: non-linear spectral subtraction / Kalman filtering, Eurospeech
Klaus Fellbaum, Dieter Becker (1991), Isolated word recognition with integrated noise reduction, Eurospeech
Javier Hernando, Climent Nadeu (1991), A comparative study of parameters and distances for noisy speech recognition, Eurospeech
Jorma T. Laaksonen (1991), A new reliability-based phoneme segmentation method for the "neural" phonetic typewriter, Eurospeech
Bruno Apolloni, Francesco Pazienti, Vincenzo Trotta (1991), Isolated word adaptive recognizer based on neural networks, Eurospeech
Nobuo Hataoka, Alex H. Waibel (1991), Evaluation of speaker-independent phoneme recognition on TIMIT database using TDNNs, Eurospeech
Nelson Morgan, Hervé Bourlard, C. Wooters, Phil Kohn, M. Cohen (1991), Phonetic context in hybrid HMM/MLP continuous speech recognition, Eurospeech
E. C. Andrews, J. S. Mason (1991), Neural network classification of complex-valued speech features, Eurospeech
Dennis Norris (1991), Rewiring lexical networks on the fly, Eurospeech
K. Elenius, G. Takacs (1991), Phoneme recognition with an artificial neural network, Eurospeech
Jiang Jianxin, Yi Kechu, Hu Zheng (1991), A new self-organization algorithm of forming a phoneme map, Eurospeech
Shuping Ran, J. Bruce Millar (1991), Phoneme classification using neural networks based on acoustic-phonetic structure, Eurospeech
Nigel Dodd, Donald Macfarlane, Chris Marland (1991), Networks for speech recognition structurally optimised by genetic techniques implemented on parallel hardware, Eurospeech
J. Pittam, J. Ingram (1991), Influence of Vietnamese tone and prosody on the acquisition of English stress patterns, Eurospeech
Walter F. Sendlmeier (1991), The voiced/unvoiced distinction of initial stops by normal and hearing impaired listeners, Eurospeech
Krishna S. Nathan (1991), Comparison of formant transition based stop classifiers: time-varying and time-invariant signal models, Eurospeech
Christian Benoît, Christian Abry, L. J. Roe (1991), The effect of context on labiality in French, Eurospeech
A. K. Datta, N. R. Ganguli, B. Mukherjee (1991), Nasalisation in Bengali speech sounds acoustic-phonetic study, Eurospeech
N. R. Ganguli (1991), Vowel formant frequency distribution of a major Indian language, Eurospeech
Bernard Harmegnies, M. Bruyninckx, Joaquim Llisterri, Dolors Poch (1991), Effects of language change on voice quality in bilingual speakers, corpus content effect, Eurospeech
T. I. Shevchenko, T. S. Skopintseva (1991), Effects of social and regional backgrounds on LTAS in British English, Eurospeech
Henk van den Heuvel, Bert Cranen, Toni Rietveld (1991), Speaker related variability in the durations of Dutch speech segments, Eurospeech
Johan Liljencrants (1991), Numerical simulations of glottal flow, Eurospeech
Joop Jansen, Bert Cranen, Louis Boves (1991), Modelling of source characteristics of speech sounds by means of the LF-model, Eurospeech
H. Herzel, J. Wendler (1991), Evidence of chaos in phonatory samples, Eurospeech
L. Trinh Van, Bernard Guérin, E. Castelli (1991), Source-tract coupling and the subglottal system in an articulatory synthesizer, Eurospeech
Paul Bamberg, Anne Demedts, John Elder, Caroline Huang, Charles Ingold, Mark Mandel, Linda Manganaro, Stijn van Even (1991), Phoneme-based training for large-vocabulary recognition in six European languages, Eurospeech
Helene Cerf-Danon, Steven DeGennaro, Marco Ferretti, Jorge Gonzalez, Eric Keppel (1991), 1. 0 TANGORA - a large vocabulary speech recognition system for five languages, Eurospeech
Hermann Ney, Roberto Billi (1991), Prototype systems for large-vocabulary speech recognition: polyglot and spicos, Eurospeech
J. H. Wright (1991), Adaptation of grammar-based language models for continuous speech recognition, Eurospeech
Keh-Yih Su, Tung-Hui Chiang, Yi-Chung Lin (1991), A robustness and discrimination oriented score function for integrating speech and language processing, Eurospeech
Paolo Baggia, Lorenzo Fissore, E. Gerbino, Egidio P. Giachin, C. Rullent (1991), Improving speech understanding performance through feedback verification, Eurospeech
A. Corazza, Renato De Mori, R. Gretter, G. Satta (1991), Computation of upper-bounds for island-driven stochastic parsers, Eurospeech
Francois Andry, Simon Thornton (1991), A parser for speech lattices using a UCG grammar, Eurospeech
Sheryl Young, Michael Matessa (1991), Using pragmatic and semantic knowledge to correct parsing of spoken language utterances, Eurospeech
A. J. Abrantes, J. S. Marques, Isabel M. Trancoso (1991), Hybrid sinusoidal modeling of speech without voicing decision, Eurospeech
J. S. Marques, Isabel M. Trancoso, A. J. Abrantes (1991), Harmonic coding of speech: an experimental study, Eurospeech
David Rowe, William Cowley, Andrew Perkis (1991), A multiband excitation linear predictive speech coder, Eurospeech
S. H. Leung, K. L. Lai, O. Y. Wong, A. Luk (1991), A new coded excitation model using multifrequency decomposition, Eurospeech
Daniele Sereno (1991), Frame substitution and adaptive post-filtering in speech coding, Eurospeech
S. A. Atungsiri, R. Soheili, A. M. Kondoz, B. G. Evans (1991), Effective lost speech frame reconstruction for CELP coders, Eurospeech
Hiromi Nagabuchi, Nobuhiko Kitawaki (1991), Evaluation and improvement of coded speech quality degraded by cell loss in ATM networks, Eurospeech
Alain J. Vigier (1991), Combined source-channel coding for a very noisy channel, Eurospeech
G. Rosina, M. Sant' Agostino, E. Turco, L. Vetrano (1991), Testing and quality enhancement of the GSM full rate voice channel, Eurospeech
U. Kipper, Herbert Reininger, Dietrich Wolf (1991), Low bit rate speech coding using CELP with adaptive excitation codebook, Eurospeech
A. Fuldseth, E. Harborg, F. T. Johansen, J. E. Knudsen (1991), A real-time implementable 7 kHz speech coder at 16 kbit/s, Eurospeech
D. J. Zarkadis (1991), Adaptive spectral weighting for vector predictive coding of the LPC-spectra, Eurospeech
Samir Saoudi, J. Marc Boucher, Alain Le Guyader (1991), Medium band speech coding using optimal scalar quantization of LSP, Eurospeech
Philip Seeker, Andrew Perkis (1991), Joint source and channel coding of line spectrum pairs, Eurospeech
C. F. Chan, K. W. Law (1991), An algorithm for computing LSP frequencies directly from the reflection coefficients, Eurospeech
Peter Meyer, W. Peters, J. Paulus (1991), Variable rate speech coding using perceptive thresholds and adaptive VUS detection, Eurospeech
M. R. Suddle, S. A. Atungsiri, A. M. Kondoz, B. G. Evans (1991), A secure and robust CELP coder for land and satellite mobile systems, Eurospeech
C. M. Ribeiro, Isabel M. Trancoso (1991), A 4.8 kbps CELP coder with post-processing, Eurospeech
K. W. Law, O. Y. Wong, C. F. Chan (1991), A real-time high quality joint-excitation linear predictive coder at 8 kbps, Eurospeech
Rosario Drogo Deiacovo, Roberto Montagna (1991), Some experiments in perceptual masking of quantizing noise in analysis-by-synthesis speech coders, Eurospeech
Gao Yang, Kenri Leich, René Boite (1991), A very high-quality CELP coder at the rate of 2400 bps, Eurospeech
Z. Yong Liu (1991), An effective pulse adaptive code-excited linear predictive coder at 4 kb/s, Eurospeech
C. F. Chan, S. H. Leung (1991), A vocoder using high-order LPC filter with very few non-zero coefficients, Eurospeech
Mario Rossi, Robert Espesser, Chaslav Pavlovic (1991), The effects of an internal reference system and cross-modality matching on the subjective rating of speech synthesisers, Eurospeech
H. A. Sydeserff, R. J. Caley, Stephen D. Isard, Mervyn A. Jack, Alex I. C. Monaghan, J. Verhoeven (1991), Evaluation of speech synthesis techniques in a comprehension task, Eurospeech
P. A. Howard-Jones (1991), 'SOAP' - a speech output assessment package for controlled multilingual evaluation of synthetic speech, Eurospeech
Tammo Houtgast, Jan A. Verhave (1991), A physical approach to speech quality assessment: correlation patterns in the speech spectrogram, Eurospeech
H. Miyata, Tammo Houtgast (1991), Weighted MTF for predicting speech intelligibility in reverberant sound fields, Eurospeech
Ute Jekosch (1991), Speech intelligibility studies for the European Hermes spaceplane, Eurospeech
Jianing Wei, Andrew Faulkner, Adrian Fourcin (1991), An application of speech processing and encoding scheme for Chinese lexical tone and consonant perception by hearing impaired listeners, Eurospeech
D. Kanevsky, P. Gopalakrishnan, C. Danis, G. Daggett, E. Epstein, David Nahamoo (1991), On the development of a phone communication aid for the hearing impaired, Eurospeech
Yolande Anglade, Jean-Marie Pierrel, Jean-Claude Junqua (1991), A spoken language interface for a telephone switchboard operator center, Eurospeech
Iain R. Murray, John L. Arnott, Norman Alm, Alan F. Newell (1991), A communication system for the disabled with emotional synthetic speech produced by rule, Eurospeech
Thomas Portele, Birgit Steffan, Rainer Preuß, Wolfgang Hess (1991), German speech synthesis by concatenation of non-parametric units, Eurospeech
Giuseppe Abbattista, Antonello Riccio, Enzo Mumolo (1991), Automatic document reader with speech output capabilities, Eurospeech
R. W. King (1991), Tools and processes for developing low-cost and high-quality text-to-speech synthesis for communication aids, Eurospeech
Hynek Hermansky, Louis Anthony Cox Jr. (1991), Perceptual linear predictive (PLP) analysis-resynthesis technique, Eurospeech
Reinhold Greisbach, Bernd J. Kröger, O. Esser, G. Plaßmann (1991), A display technique for measurements of natural and synthetic articulatory dynamics, Eurospeech
Yueh-Chin Chang, Yi-Fan Lee, Bang-Er Shia, Hsiao-Chuan Wang (1991), Statistical models for the Chinese text-to-speech system, Eurospeech
P. A. Taylor, I. A. Nairn, A. M. Sutherland, Mervyn A. Jack (1991), A realtime speech synthesis system, Eurospeech
H. Valbret, E. Moulines, Jean-Pierre Tubach (1991), Voice transformation using PSOLA technique, Eurospeech
M. Giustiniani, Piero Pierucci (1991), Phonetic ergodic HMM for speech synthesis, Eurospeech
C. Delogu, P. Paoloni, P. Pocci, C. Sementina (1991), Quality evaluation of text-to-speech synthesizers using magnitude estimation, categorical estimation, pair comparison and reaction time methods, Eurospeech
H. Zingte, Cl. Hennebois (1991), Helping young children to associate sounds and letters through speech synthesis, Eurospeech
Hervé Bourlard (1991), Neural nets and hidden Markov models: review and generalizations, Eurospeech
N. S. Jayant, J. D. Johnston, Y. Shoham (1991), Coding of wideband speech, Eurospeech
Roberto Pieraccini, Esther Levin (1991), Stochastic representation of semantic structure for speech understanding, Eurospeech
Colin Matheson, Fergus R. McInnes (1991), Incorporating probabilities into the dualgram language model, Eurospeech
Egidio P. Giachin (1991), A dynamic programming based framework for stochastic spoken language understanding, Eurospeech
Natividad Prieto, Enrique Vidal (1991), Learning language models through the ECGI method, Eurospeech
R. Cremonini, M. Ferretti, M. C. Galimberti, Giulio Maltese, Federico Mancini (1991), Using a generative grammar to train a probabilistic language model for speaker-independent speech recognition, Eurospeech
Katsuhiko Shirai, E. Kitagawa, T. Endo (1991), Optimal construction of context sensitive quantizer for phoneme recognition in continuous speech, Eurospeech
M. O'Kane, P. Kenne, D. Landy, S. Atkins (1991), Generalising from single-speaker recognition in a feature-based recogniser, Eurospeech
H. G. Hirsch, Peter Meyer, Hans-Wilhelm Ruehl (1991), Improved speech recognition using high-pass filtering of subband envelopes, Eurospeech
Yifan Gong, Jean-Paul Haton (1991), Comparing two phoneme identification methods using a continuous speech recognizer, Eurospeech
D. Ederveen, Louis Boves (1991), Knowledge-based phoneme recognition, Eurospeech
J. Kraayeveld, A. C. M. Rietveld, V. J. van Heuven (1991), Speaker characterization in Dutch using prosodic parameters, Eurospeech
Alan K. Hunt (1991), New commercial applications of telephone-network-based speech recognition and speaker verification, Eurospeech
Jean-Francois Bonastre, Henri Meloni, Philippe Langlais (1991), Analytical strategy for speaker identification, Eurospeech
L. Xu, J. S. Mason (1991), Optimization of perceptually-based spectral transforms in speaker identification, Eurospeech
Alain de Cheveigne (1991), A mixed speech F0 estimation algorithm, Eurospeech
Edward Jones, Eliathamby Ambikairajah (1991), A perceptually-based pitch extractor for band-limited speech, Eurospeech
Yu Hua Gu (1991), A robust pseudo perceptual pitch estimator, Eurospeech
N. Dal Degan, M. Fratti (1991), Pitch estimation based on a "narrowed" autocorrelation function, Eurospeech
Seiichi Nakagawa, Yoshimitsu Hirata, Isao Murase (1991), The syntax-oriented spoken Japanese understanding system SPOJOS-SYNO II, Eurospeech
H. Bergmann, H.-H. Hamer, A. Noll, A. Paeseler, H. Tomaschewski (1991), An adaptable man-machine interface using connected-word recognition, Eurospeech
M. J. Poza, C. de la Torre, D. Tapias, L. Villarrubia (1991), An approach to automatic recognition of keywords in unconstrained speech using parametric models, Eurospeech
I. Lee Hetherington, Hong C. Leung, Victor W. Zue (1991), Toward vocabulary-independent recognition of telephone speech, Eurospeech
Ronald Cole, Krist Roginski, Mark Fanty (1991), English alphabet recognition with telephone speech, Eurospeech
J.-Y. Fiset, J.-M. Robert, Raymond Descout (1991), Evolutionary language models in air traffic control training, Eurospeech
G. J. F. Jones, J. H. Wright, E. N. Wrigley, M. J. Carey, Eluned S. Parris (1991), Isolated-word sentence recognition using probabilistic context-free grammar, Eurospeech
Mitchell Hood (1991), Lexical access in a speech understanding and dialogue system, Eurospeech
Reinhold Haeb-Umbach, Hermann Ney (1991), A look-ahead search technique for large vocabulary continuous speech recognition, Eurospeech
Carlos J. Teixeira, Isabel M. Trancoso (1991), Spectral subtraction for front-end noise reduction in a speech recognizer, Eurospeech
Lori F. Lamel, Jean-Luc Gauvain, Maxine Eskenazi (1991), BREF, a large vocabulary spoken corpus for French, Eurospeech
Luc Mathan, Dominique Morin (1991), Speech field databases: development and analysis, Eurospeech
Shuichi Itahashi (1991), Large scale Japanese dialect speech corpora, Eurospeech
Paulus H. Vossen (1991), Outline of a design-oriented evaluation framework for speech-driven applications, Eurospeech
Richard Winski, Kamran Kordi (1991), Assessment of continuous speech recognisers using recogniser sensitivity analysis, Eurospeech
C. Bourjot, A. Boyer, D. Fohr (1991), A tool for assessment of acoustic phonetic lattices, Eurospeech
Herman J. M. Steeneken, Jeroen G. van Velden (1991), Ramos - recognizer assessment by means of manipulation of speech applied to connected speech recognition, Eurospeech
Paul van Alphen, Louis C. W. Pols (1991), Comparing various feature vectors in automatic speech recognition, Eurospeech
Victor W. Zue, James Glass, David Goodine, Lynette Hirschman, Hong C. Leung, Michael Phillips, Joseph Polifroni, Stephanie Seneff (1991), The MIT ATIS system; preliminary development, spontaneous speech data collection, and performance evaluation, Eurospeech
S. Benaouicha, A. Rajouani, M. Zyoute (1991), Construction of an Arabic speech data base - duration model of Arabic vowels, Eurospeech
P. N. Denbigh, J. Zhao (1991), Pitch extraction and separation of overlapping speech, Eurospeech
Yoshua Bengio, Renato De Mori, Giovanni Flammia, Ralf Kompe (1991), Phonetically motivated acoustic parameters for continuous speech recognition using artificial neural networks, Eurospeech
Michael J. Carey, Eluned S. Parris (1991), Adapting input transformations using alpha-nets for whole word speech recognition, Eurospeech
Les T. Niles (1991), TIMIT phoneme recognition using an HMM-derived recurrent neural network, Eurospeech
P. O. Husoy, T. Svendsen (1991), ANN-based speech recognition using a preprocessor for non-linear time compression, Eurospeech
Helge B. D. Sorensen, Uwe Hartmann (1991), A self-structuring neural noise reduction model, Eurospeech
Bojan Petek, Alex H. Waibel, Joseph M. Tebelskis (1991), Integrated phoneme-function word architecture of hidden control neural networks for continuous speech recognition, Eurospeech
X. Zhang, J. S. Mason, E. C. Andrews (1991), Multiple dynamic features to enhance neural net based speaker verification, Eurospeech
Patrick Haffner, Alex H. Waibel (1991), Time-delay neural networks embedding time alignment: a performance analysis, Eurospeech
Yohji Fukuda, Haruya Matsumoto (1991), Phoneme recognition using recurrent neural networks, Eurospeech
Yasuhiro Komori, Kaichiro Hatazaki (1991), An integration of knowledge and neural networks toward a phoneme typewriter without a language model, Eurospeech
Junko Hosaka, Toshiyuki Takezawa, Terumasa Ehara (1991), Utilizing empirical data for postposition classification toward spoken Japanese speech recognition, Eurospeech
Michael Phillips, James Glass, Victor W. Zue (1991), Automatic learning of lexical representations for sub-word unit based speech recognition systems, Eurospeech
Roxane Lacouture, Renato De Mori (1991), Lexical tree compression, Eurospeech
Michael D. Riley, Andrej Ljolje (1991), Lexical access with a statistically-derived phonetic network, Eurospeech
G. Antoniol, F. Brugnara, D. Giuliani (1991), Admissible strategies for acoustic matching with a large vocabulary, Eurospeech
Alejandro Macarron, Gregorio Escalada, Miguel Angel Rodriguez (1991), Generation of duration rules for a Spanish text-to-speech synthesizer, Eurospeech
L. Mortamet (1991), Implementing duration expert rules into a text-to-speech synthesis system, Eurospeech
Nobuyoshi Kaiki, Katsuhiko Mimura, Yoshinori Sagisaka (1991), Statistical modeling of segmental duration and power control for Japanese, Eurospeech
W. Nick Campbell (1991), Phrase-level factors affecting timing in speech, Eurospeech
Matti Karjalainen, Toomas Altosaar (1991), Phoneme duration rules for speech synthesis by neural networks, Eurospeech
Fergus R. McInnes (1991), Context-sensitive phoneme lattice generation using interpolated demi-diphone and triphone models, Eurospeech
J. M. Song, T. Thomas, M. Patel (1991), Experiments of 991-word speaker independent continuous speech recognition on DARPA RM task, Eurospeech
Henri Meloni, F. Bechet, P. Gilles (1991), Bottom-up acoustic-phonetic decoding for the selection of word cohorts from a large vocabulary, Eurospeech
Antonio M. Peinado, Ramon Roman, Jose C. Segura, Antonio J. Rubio, Pedro Garcia, Jesus E. Diaz (1991), Entropic training for HMM speech recognition, Eurospeech
P. Kenny, S. Parthasarathy, V. N. Gupta, Matthew Lennig, Paul Mermelstein, Douglas O'Shaughnessy (1991), Energy, duration and Markov models, Eurospeech
J. J. Nijtmans (1991), A new recursive Markov model with a new state pruning approach for large vocabulary continuous speech recognition, Eurospeech
Peter Nowell, Henry S. Thompson (1991), An efficient implementation of the n-best algorithm for lexical access, Eurospeech
Alessandro Falaschi, Massimo Pucci (1991), Automatic derivation of HMM alternative pronunciation network topologies, Eurospeech
Isabel Galiano, Francisco Casacuberta, Emilio Sanchis (1991), On the structure of subword units for a speaker independent continuous speech task, Eurospeech
Yunxin Zhao, Hisashi Wakita, Xinhua Zhuang (1991), Generate word transcription dictionary from sentence utterances and evaluate its effect on speaker-independent continuous speech recognition, Eurospeech
A. P. Varga, Roger K. Moore (1991), Simultaneous recognition of concurrent speech signals using hidden Markov model decomposition, Eurospeech
I. A. Ballantyne, A. M. Sutherland, J. M. Hannah, Mervyn A. Jack (1991), A large vocabulary parallel processing continuous speech recognition system, Eurospeech
Richard C. Rose, Edward M. Hofstetter (1991), Techniques for robust word spotting in continuous speech messages, Eurospeech
Alessandro Falaschi, Alfredo Micozzi (1991), Word spotting by CSR through vector quantized background models, Eurospeech
Jean-Claude Junqua, Hisashi Wakita (1991), Towards an artificial laboratory for the design and simulation of cooperative speech processing algorithms, Eurospeech
K. Edwards, Fergus R. McInnes, Mervyn A. Jack (1991), Accent specific modifications for continuous speech recognition based on a sub-word lattice approach, Eurospeech
Eduardo Lleida, Jose B. Marino, Climent Nadeu, Albert Oliveras (1991), Two level continuous speech recognition using demisyllable-based HMM word spotting, Eurospeech
Ted H. Applebaum, Brian A. Hanson (1991), Tradeoffs in the design of regression features for word recognition, Eurospeech
Lalit R. Bahl, Peter F. Brown, Peter V. de Souza, Robert L. Mercer, David Nahamoo (1991), A fast algorithm for deleted interpolation, Eurospeech
Michael A. Franzini, Alex H. Waibel, Kai-Fu Lee (1991), Recent work in continuous speech recognition using the connectionist Viterbi training procedure, Eurospeech
Volker Steinbiss (1991), A search organization for large-vocabulary recognition based on n-best decoding, Eurospeech
Yifan Gong, Jean-Paul Haton (1991), VINICS: a continuous speech recognizer based on a new robust formulation, Eurospeech
Shigeki Sagayama (1991), A matrix representation of HMM-based speech recognition algorithms, Eurospeech
Paul Dalsgaard, Ove Andersen, William Barry (1991), Multi-lingual acoustic-phonetic features for a number of European languages, Eurospeech
H. Kabre, Guy Pérénnou, Nadine Vigouroux (1991), A non-linear filtering method applied to automatic segmentation of multilingual speech corpora, Eurospeech
Piero Cosi, Daniele Falavigna, Maurizio Omologo (1991), A preliminary statistical evaluation of manual and automatic segmentation discrepancies, Eurospeech
J. M. McQueen, E. J. Briscoe (1991), A computational tool for examining lexical segmentation in continuous speech, Eurospeech
M. S. Schmidt, G. S. Watson (1991), The evaluation and optimization of automatic speech segmentation, Eurospeech
G. Feng, N. Achab, R. Combescure (1991), On-line speech segmentation using adaptive models: application to variable rate speech coding, Eurospeech
P. A. Taylor, Stephen D. Isard (1991), Automatic diphone segmentation, Eurospeech
Georg E. Ottesen (1991), An automatic diphone segmentation system, Eurospeech
R. A. Brierton, B. M. G. Cheetham (1991), An evaluation of spectral transitivity functions for speech segmentation in variable frame-rate speech vocoding, Eurospeech
Dirk Van Compernolle, J. Smolders, P. Jaspers, T. Hellemans (1991), Speaker clustering for dialectic robustness in speaker independent recognition, Eurospeech
Dina Yashchin, William C. G. Ortel (1991), Experience with speech recognition in automating telephone operator functions, Eurospeech
F. Canavesio, Lorenzo Fissore, M. Oreglia, P. Ruscitti (1991), HMM modeling in the public telephone network environment: experiments and results, Eurospeech
Dominique Morin (1991), Influence of field data in HMM training for a vocal server, Eurospeech
A. Ciaramella, Lorenzo Fissore, A. Pacchiotti, R. Pacifici (1991), An isolated word speech recognizer prototype for mobile-radio applications, Eurospeech
James Monaghan, Christine Cheepen (1991), Linguistic modelling for a speech interface in the office context, Eurospeech
Andrea Di Carlo, Rino Falcone (1991), Ill-formedness problem in the spoken language processing, Eurospeech
Giulio Maltese, Federico Mancini (1991), A technique to automatically assign parts-of-speech to words taking into account word-ending information through a probabilistic model, Eurospeech
Marcello Pelillo, Mario Refice (1991), Syntactic category disambiguation through relaxation processes, Eurospeech
E. N. Wrigley, J. H. Wright (1991), Computational requirements of probabilistic LR parsing for speech recognition using a natural language grammar, Eurospeech
J. Tiboni, G. Perennon (1991), Phonotypical transcription through the GEPH expert system, Eurospeech
Briony Williams, Franziska Maier (1991), A spelling corrector for use in text-to-speech synthesis for English, Eurospeech
Thomas Russi (1991), Robust and efficient parsing for applications such as text-to-speech conversion, Eurospeech
Robert W. P. Luk, Robert I. Damper (1991), Stochastic transduction for English text-to-phoneme conversion, Eurospeech
M. Y. Hwang, X. D. Huang (1991), Acoustic distribution clustering in phonetic hidden Markov models, Eurospeech
M. Blomberg (1991), Modelling articulatory inter-timing variation in a speech recognition system based on synthetic references, Eurospeech
Kari Torkkola, Mikko Kokkonen, Mikko Kurimo, Pekka Utela (1991), Improving short-time speech frame recognition results by using context, Eurospeech
P. A. Rentzepopoulos, George K. Kokkinakis (1991), Phoneme to grapheme conversion using HMM, Eurospeech
S. H. Parfitt, R. A. Sharman (1991), A bi-directional model of English pronunciation, Eurospeech
David Goodine, Stephanie Seneff, Lynette Hirschman, Michael Phillips (1991), Full integration of speech and language understanding in the MIT spoken language system, Eurospeech
Takayuki Yamaoka, Hitoshi Iida (1991), Dialogue interpretation model and its application to next utterance prediction for spoken language processing, Eurospeech
W. Boogers (1991), Dialogue construction by compilation, Eurospeech
Izuru Nogaito, Masahiko Takahashi, Shingo Kuroiwa, Fumihiro Yato (1991), Dialogue management in an extension number guidance system, Eurospeech
Encarna Segarra, Pedro Garcia (1991), Automatic learning of acoustic and syntactic-semantic levels in continuous speech understanding, Eurospeech
Paolo Baggia, A. Ciaramella, D. Clementino, Lorenzo Fissore, E. Gerbino, Egidio P. Giachin, G. Micca, L. Nebbia, R. Pacifici, G. Pirani, C. Rullent (1991), A man-machine dialogue system for speech access to e-mail information using the telephone: implementation and first results, Eurospeech
Renee van Bezooijen, Louis C. W. Pols (1991), Performance of text-to-speech conversion for Dutch: a comparative evaluation of allophone and diphone based synthesis at the level of the segment, the word, and the paragraph, Eurospeech
Christian Benoît, Francoise Emerard, Betina Schnabel, A. Tseva (1991), Quality comparisons of prosodic and of acoustic components of various synthesisers, Eurospeech
Martine Grice, Kiki Vagges, Daniel Hirst (1991), Assessment of intonation in text-to-speech synthesis systems - a pilot test in English and Italian, Eurospeech
Alex I. C. Monaghan (1991), Evaluation of the naturalness of prosody generated by the CSTR TTS system, Eurospeech
Ulrich Halka (1991), Speech-model processes for objective quality measurements of speech-coding systems, Eurospeech
S. Euler (1991), Adaptation techniques in tied density hidden Markov models, Eurospeech
D. Jouvet, K. Bartkova, Jean Monné (1991), On the modelization of allophones in an HMM based speech recognition system, Eurospeech
D. Jouvet, L. Mauuary, Jean Monné (1991), Automatic adjustments of the structure of Markov models for speech recognition applications, Eurospeech
Hong C. Leung, I. Lee Hetherington, Victor W. Zue (1991), Speech recognition using stochastic explicit-segment modeling, Eurospeech
D. Dubois (1991), Comparison of time-dependent acoustic features for a speaker-independent speech recognition system, Eurospeech
Jean-Luc Gauvain, Chin-Hui Lee (1991), Bayesian learning for hidden Markov model with Gaussian mixture state observation densities, Eurospeech
Hans-Wilhelm Ruehl (1991), Voice controlled mail ordering via telephone using SPREIN, Eurospeech
Stefan Dobler, Werner Armbruester, Peter Meyer, Hans-Wilhelm Ruehl (1991), A voice dialling device for mobile radio, Eurospeech
K. Smaili, F. Charpillet, Jean-Marie Pierrel, Jean-Paul Haton (1991), A continuous speech recognition approach for the design of a dictation machine, Eurospeech
David L. Thomson, Jay G. Wilpon, Rafid A. Sukkar, Dimitrios P. Prezas (1991), Automatic speech recognition in the Spanish telephone network, Eurospeech
Roberto Billi, P. Buttafava, P. De Stefani, M. Gamba, D. Voltolini (1991), Computer-aided, voice-based, medical report preparation: an application to radiology, Eurospeech
Filipe N. Carlos, Jose P. Carmona, Pedro M. Chagas, Luis C. Oliveira, Antonio J. Serralheiro, Isabel M. Trancoso (1991), A recognition/synthesis system applied to database access through the telephone network, Eurospeech
Seppo Helle (1991), An experiment in using a hypertext system in phonetics and speech processing education, Eurospeech
G. Antoniol, F. Brugnara, F. Dalla Palma, G. Lazzari, E. Moser (1991), A.RE.S.: an interface for automatic reporting by speech, Eurospeech
U. Schultheiß, B. Lochschmidt (1991), COGNITO - an experimental voice-controlled telecommunication system, Eurospeech
Jared Bernstein, Dimitry Rtischev (1991), A voice interactive language instruction system, Eurospeech
Edmund Rooney, Steven Hiller, John Laver, Maria-Gabriella Di Benedetto (1991), Macro and micro features for automated pronunciation improvement in the spell system, Eurospeech
Laurence Devillers, Christian Dugast (1991), Comparison of continuous mixture densities and TDNN in a Viterbi-framework: experiments on speaker dependent DARPA RM1+, Eurospeech
Peter Thurston, Dennis Norris (1991), A comparison of two compression functions used for noisy vowel detection with back-propagation networks, Eurospeech
J. Ferreiros, A. Castro, J. M. Pardo (1991), Comparison between two different approaches in speaker-independent isolated digit recognition, Eurospeech
Franck Poirier (1991), DVQ: dynamic vector quantization application to speech processing, Eurospeech
Yoshua Bengio, Renato De Mori, Giovanni Flammia, Ralf Kompe (1991), A comparative study on hybrid acoustic phonetic decoders based on artificial neural networks, Eurospeech
Hidefumi Sawai, Satoru Nakamura (1991), Time-delay neural network architectures for high-performance speaker-independent recognition, Eurospeech
P. Wittenburg, R. Couwenberg (1991), Recurrent neural nets as building blocks for human word recognition, Eurospeech
Fisseha Mekuria, Tore Fjällbrant (1991), A neural net model for vector quantization, Eurospeech
N. H. Russell, Frank Fallside, A. J. Robinson, R. W. Prager (1991), Lexical access using a recurrent error propagation network, Eurospeech
Peter Brauer, Per Hedelin, Dieter Huber, Petter Knagenhjelm, Johan Molno (1991), Model or non-model based classifiers, Eurospeech
Toomas Altosaar, Matti Karjalainen (1991), Event-based recognition and analysis of speech by neural networks, Eurospeech
Frederick Jelinek (1991), Up from trigrams! - the struggle for improved language models, Eurospeech
Rolf Carlson (1991), Synthesis: modelling variability and constraints, Eurospeech
M. Guyomard, J. Siroux, A. Cozannet (1991), The role of dialogue in speech recognition the case of the yellow, Eurospeech
E. Gerbino, Paolo Baggia (1991), Interpretation of context-dependent utterances in man-machine dialogue, Eurospeech
S. Eggins, Julie P. Vonwiller, C. M. I. Matthiessen, P. Sefton (1991), The description of minor clauses in information-seeking telephone dialogues, Eurospeech
David B. Roe, Fernando Pereira, Richard W. Sproat, Michael D. Riley, Pedro J. Moreno, Alejandro Macarron (1991), Toward a spoken language translator for restricted-domain context-free languages, Eurospeech
N. Venkata Subramaniam, N. Alwar, G. Mallikarjuna, P. Prabhakar Rao, S. Raman (1991), Bidirectional machine translation in Indian languages, Eurospeech
C. Papaodysseus, E. Koukoutsis, C. Triantafyllou, C. Vasilatos (1991), Exact monitoring of the numerical error in various speech algorithms, Eurospeech
Jacques Koreman, Bert Cranen, Louis Boves (1991), Automatic computation and comparison of dynamically varying voice source parameters, Eurospeech
Paavo Alku (1991), Glottal wave analysis with pitch synchronous iterative adaptive inverse filtering, Eurospeech
Thierry Galas, Xavier Rodet (1991), Generalized functional approximation for source-filter system modeling, Eurospeech
Frederic Bimbot, Bishnu S. Atal (1991), An evaluation of temporal decomposition, Eurospeech
Klaus Zünkler (1991), A discriminative recognizer for isolated and continuous speech using statistical separability measures, Eurospeech
O. Schmidbauer, H. Höge (1991), Speaker adaptation based on articulatory features, Eurospeech
F. Brugnara, Renato De Mori, D. Giuliani, M. Omologo (1991), A parallel HMM approach to speech recognition, Eurospeech
Tsuneo Nitta, Jun'ichi Iwasaki, Hiroshi Matsu'ura (1991), Speaker independent word recognition using HMMs with an orthogonalized phonetic segment codebook, Eurospeech
Pascale Fung, Tatsuya Kawahara, Shuji Doshita (1991), Unsupervised speaker normalization by speaker Markov model converter for speaker-independent speech recognition, Eurospeech
Rob J. J. H. van Son, Louis C. W. Pols (1991), The influence of formant track shape on the perception of synthetic vowels, Eurospeech
P. A. Howard-Jones (1991), Fluctuation of noise background: measurement and significance in relation to speech masking, Eurospeech
C. Ma, L. F. Willems (1991), The audibility of narrow band noise in flat spectral complex sounds, Eurospeech
Gitta P. M. Laan, Dick R. van Bergem, Florien J. Koopmans-van Beinum (1991), The importance of spectral quality of vowels for the intelligibility of sentences, Eurospeech
Herman J. M. Steeneken, Tammo Houtgast (1991), On the mutual dependency of octave-band-specific contributions to speech intelligibility, Eurospeech
Brit van Ooyen, Anne Cutler, Dennis Norris (1991), Detection times for vowels versus consonants, Eurospeech
Dick R. van Bergem (1991), The influence of sentence accent, word stress, and word class on the quality of vowels, Eurospeech
Florien J. Koopmans-van Beinum (1991), A peak-and-level model for focus words in read and spontaneous natural speech and in synthetic speech, Eurospeech
J. Ingram, J. Pittam (1991), Connected speech processes in second language learning, Eurospeech
Rodmonga K. Potapova (1991), Modification of acoustic features in Russian connected speech, Eurospeech
Helmer Strik, Louis Boves (1991), On the relation between voice source characteristics and prosody, Eurospeech
Sverre Stensby (1991), Prosody in a rule-based Norwegian text-to-speech system, Eurospeech
A. S. Madhukumar, S. Rajendran, C. Chandra Sekhar, B. Yegnanarayana (1991), Synthesizing intonation for speech in Hindi, Eurospeech
James L. Hieronymus, Briony J. Williams (1991), An investigation of the relation between perceived pitch accent and automatically-located accent in British English, Eurospeech
S. Quazza (1991), Modelling Italian intonation in a text-to-speech system, Eurospeech
Michael H. O'Malley, Howard Resnick, Michelle Caisse (1991), An analysis of strategies for finding prosodic clues in text, Eurospeech
Marcello Balestri (1991), A coded dictionary for stress assignment rules in Italian, Eurospeech
Enrico te Lindert, Hugo C. van Leeuwen (1991), Speech maker: text-to-speech conversion based on a multi-level, synchronized data structure, Eurospeech
E. Lewis, Mark A. A. Tatham (1991), A new text-to-speech synthesis system, Eurospeech
Luis C. Oliveira, M. Ceu Viana, Isabel M. Trancoso (1991), DIXI - Portuguese text-to-speech system, Eurospeech
P. Molbaek Hansen, N. Reinholt Petersen, J. Rischel, C. Henriksen (1991), Higher-level linguistic information in a text-to-speech system for Danish, Eurospeech
Gabor Olaszy (1991), Adaptation of the multivox text-to-speech system to Italian, Eurospeech
Partha Niyogi, Victor W. Zue (1991), Correlation analysis of vowels and their application to speech recognition, Eurospeech
John N. Holmes (1991), Use of phonetic knowledge when designing and training stochastic models for speech recognition, Eurospeech
B. Kaspar, K. Schuhmacher (1991), Modelling phones by microsegments in a phonetically oriented recognition system, Eurospeech
Il K. Kim, H. S. Lee (1991), An extended LVQ2 algorithm and its application to phoneme classification, Eurospeech
P. J. Dix, G. J. Vernooij, G. Bloothooft (1991), A hierarchical broad phonetic classification scheme, Eurospeech
Julia Hirschberg (1991), Using text analysis to predict intonational boundaries, Eurospeech
Merle Horne (1991), Why do speakers accent 'given' information?, Eurospeech
Julie P. Vonwiller, R. W. King, R. W. T. Lloyd (1991), Automatic prosody assignment for interactive synthesized dialogue systems, Eurospeech
Nick Youd, Jill House (1991), Generating intonation in a voice dialogue system, Eurospeech
Rodolfo Delmonte, Roberto Dolci (1991), Computing linguistic knowledge for text-to-speech systems with PROSO, Eurospeech
C. Acker, Peter Vary, H. Ostendarp (1991), Acoustic echo cancellation using prediction residual signals, Eurospeech
H. S. Dabis, Alan A. Wrench (1991), An evaluation of adaptive noise cancelling for speech recognition, Eurospeech
Enzo Mumolo, Antonello Riccio, Giuseppe Abbattista (1991), An efficient algorithm for real-time voiced/unvoiced decision, Eurospeech
Tim Aarset, Ben Gold (1991), Models of pitch perception, Eurospeech
P. Corney, J. S. Mason (1991), A new perspective on LPC excitation using singular value decomposition, Eurospeech
Werner Verhelst, Marcel Borger (1991), Intra-speaker transplantation of speech characteristics: an application of waveform vocoding techniques and DTW, Eurospeech
S. H. Leung, O. Y. Wong, K. L. Lai (1991), Decomposition of the LPC excitation using wavelet functions, Eurospeech
Eliathamby Ambikairajah, Liam Kilmartin (1991), An adaptive cochlear model for speech recognition, Eurospeech
Gianni Jacovitti, Piero Pierucci, Alessandro Falaschi (1991), Speech segmentation and classification using higher order moments, Eurospeech
A. Ciaramella, D. Clementino, R. Pacifici (1991), A PC-housed speaker independent large vocabulary continuous telephonic speech recognizer, Eurospeech
Abdulmesih Aktas, Klaus Zünkler (1991), Speaker independent continuous HMM-based recognition of isolated words on a real-time multi-DSP system, Eurospeech
A. Tsopanoglou, E. D. Kyriakis-Bitzaros, J. Mourjopoulos, George K. Kokkinakis (1991), A real time speech decoder using instantaneous frequency and energy, Eurospeech
M. Schultheiß, A. Lacroix (1991), Fast hardware for efficient parallel processing of speech signals, Eurospeech
Jan Sedivy, Jiri Filcev, Jan Uhlir, Tomas Vanek, Vaclav Hanzl, Zdenek Oliva, Petr Kotek (1991), The one chip speech recognition system, Eurospeech
L. Villarrubia, M. J. Poza, C. Crespo (1991), Influence of the telephone line on automatic speech recognition, Eurospeech
Hynek Hermansky, Nelson Morgan, Aruna Bayya, Phil Kohn (1991), Compensation for the effect of the communication channel in auditory-like analysis of speech (RASTA-PLP), Eurospeech
Jean-Claude Junqua, Ben Reaves, Brian Mak (1991), A study of endpoint detection algorithms in adverse conditions: incidence on a DTW and HMM recognizer, Eurospeech
Susanne Dvorak, Thomas Hormann (1991), High-performance speech recognition in noise by continuously updated reference templates, Eurospeech
Klara Vicsi (1991), Speech enhancement in the case of speech recognizers, Eurospeech
Juan Gomez-Mena, J. Santos-Suarez, Ramón Garcia-Gomez (1991), A robust feature extraction method for automatic speech recognition in noisy environments, Eurospeech
Lorenzo Fissore, Egidio P. Giachin, P. Laface, G. Micca (1991), Selection of speech units for a speaker-independent CSR task, Eurospeech
Egidio P. Giachin, Chin-Hui Lee, Lawrence R. Rabiner, Aaron E. Rosenberg, Roberto Pieraccini (1991), Word juncture modeling using inter-word context-dependent phone-like units, Eurospeech
Akito Nagai, Shigeki Sagayama, Kenji Kita (1991), Phoneme-context-dependent LR parsing algorithms for HMM-based continuous speech recognition, Eurospeech
H. Drexler, R. Roddeman, Louis Boves, Helmer Strik (1991), Optimizing lexical fast search in a large vocabulary isolated word speech recognition system, Eurospeech
Tore Fjällbrant, Fisseha Mekuria (1991), Signal processing using an auditory filter bank with side-lobes and phase-jumps, Eurospeech
J. S. C. van Dijk (1991), Notes on auditive coding of sophisticated signals, Eurospeech
Manfred Beham (1991), An auditorily based spectral transformation of speech signals, Eurospeech
Andrew C. Morris, Pierre Escudier, Jean-Luc Schwartz (1991), On and off units detect information bottle-necks for speech recognition, Eurospeech
Jose A. Pozas-Alvarez (1991), A new logic operator-based auditory system model, Eurospeech
Jeremy Peckham (1991), Speech understanding and dialogue over the telephone: an overview of progress in the sundial project, Eurospeech
Jean-Pierre Tubach, P. Doignon (1991), A system for natural spoken language queries design, implementation and assessment, Eurospeech
G. Deville, P. Mousel (1991), Operational validation of syntactic-semantic models in a spoken man-machine dialogue system, Eurospeech
B. Gaiffe, L. Romary, Jean-Marie Pierrel (1991), References in a multimodal dialogue: towards a unified processing, Eurospeech
P. Lefebvre, G. Duncan, F. Poirier (1991), The user-unix dialogue: a novel integrated approach to enhancing the operating system interface, Eurospeech
Bodo Arndt (1991), Adoption of verbal and visual dialogue behaviour in document handling systems, Eurospeech
P. M. T. Smeele, A. C. Sittig (1991), The contribution of vision to speech perception, Eurospeech
R. J. Lickley, R. C. Shillcock, E. G. Bard (1991), Processing disfluent speech: how and when are disfluencies found?, Eurospeech
A. Chointere, J.-M. Robert, Raymond Descout (1991), Building a user interface for a speech recognition-based telephone application system, Eurospeech
A. C. Murray, C. R. Frankish, D. M. Jones (1991), System design and human factors in auditory interfaces, Eurospeech
Lian Zheng, Jianhua Tao, Zhengqi Wen, Rongxiu Zhong (2020), CASIA Voice Conversion System for the Voice Conversion Challenge 2020, VCCBC
Zhiba Su, Wendi He, Yang Sun (2020), The Ximalaya TTS System for Blizzard Challenge 2020, VCCBC
Yi Zhou, Xiaohai Tian, Xuehao Zhou, Mingyang Zhang, Grandee Lee, Rui Liu, Berrak Sisman, Haizhou Li (2020), NUS-HLT System for Blizzard Challenge 2020, VCCBC
Jian Lu, Zeru Lu, Ting He, Peng Zhang, Xinhui Hu, Xinkang Xu (2020), The RoyalFlush Synthesis System for Blizzard Challenge 2020, VCCBC
Laipeng He, Qiang Shi, Lang Wu, Jianqing Sun, Renke He, Yanhua Long, Jiaen Liang (2020), The SHNU System for Blizzard Challenge 2020, VCCBC
Qiao Tian, Zewang Zhang, Ling-Hui Chen, Heng Lu, Chengzhu Yu, Chao Weng, Dong Yu (2020), The Tencent speech synthesis system for Blizzard Challenge 2020, VCCBC
Jing-Xuan Zhang, Li-Juan Liu, Yan-Nian Chen, Ya-Jun Hu, Yuan Jiang, Zhen-Hua Ling, Li-Rong Dai (2020), Voice Conversion by Cascading Automatic Speech Recognition and Text-to-Speech Synthesis with Prosody Transfer, VCCBC
Wen-Chin Huang, Tomoki Hayashi, Shinji Watanabe, Tomoki Toda (2020), The Sequence-to-Sequence Baseline for the Voice Conversion Challenge 2020: Cascading ASR and TTS, VCCBC
Yang Song, Min Liang, Guilin Yang, Kun Xie, Jie Hao (2020), The OPPO System for the Blizzard Challenge 2020, VCCBC
Zhao Yi, Wen-Chin Huang, Xiaohai Tian, Junichi Yamagishi, Rohan Kumar Das, Tomi Kinnunen, Zhen-Hua Ling, Tomoki Toda (2020), Voice Conversion Challenge 2020: Intra-lingual semi-parallel and cross-lingual voice conversion, VCCBC
Huhao Fu, Yiben Zhang, Kailong Liu, Chao Liu (2020), The HITSZ TTS system for Blizzard Challenge 2020, VCCBC
Zexin Cai, Ming Li (2020), The Duke Entry for 2020 Blizzard Challenge, VCCBC
Li-Juan Liu, Yan-Nian Chen, Jing-Xuan Zhang, Yuan Jiang, Ya-Jun Hu, Zhen-Hua Ling, Li-Rong Dai (2020), Non-Parallel Voice Conversion with Autoregressive Conversion Model and Duration Adjustment, VCCBC
Xiao Zhou, Zhen-Hua Ling, Simon King (2020), The Blizzard Challenge 2020, VCCBC
Yitao Yang, Jinghui Zhong, Shehui Bu (2020), Submission from SCUT for Blizzard Challenge 2020, VCCBC
Tuan Vu Ho, Masato Akagi (2020), Non-parallel Voice Conversion based on Hierarchical Latent Embedding Vector Quantized Variational Autoencoder, VCCBC
Tao Wang, Jianhua Tao, Ruibo Fu, Zhengqi Wen, Chunyu Qiang (2020), The NLPR Speech Synthesis entry for Blizzard Challenge 2020, VCCBC
Beibei Hu, Zilong Bai, Qiang Li (2020), The Ajmide Text-To-Speech System for Blizzard Challenge 2020, VCCBC
Patrick Lumban Tobing, Yi-Chiao Wu, Tomoki Toda (2020), Baseline System of Voice Conversion Challenge 2020 with Cyclic Variational Autoencoder and Parallel WaveGAN, VCCBC
Fanbo Meng, Ruimin Wang, Peng Fang, Shuangyuan Zou, Wenjun Duan, Ming Zhou, Kai Liu, Wei Chen (2020), The Sogou System for Blizzard Challenge 2020, VCCBC
Haitong Zhang (2020), The NeteaseGames System for Voice Conversion Challenge 2020 with Vector-quantization Variational Autoencoder and WaveNet, VCCBC
Oriol Barbany, Milos Cernak (2020), FastVC: Fast Voice Conversion with non-parallel data, VCCBC
Xiaohai Tian, Zhichao Wang, Shan Yang, Xinyong Zhou, Hongqiang Du, Yi Zhou, Mingyang Zhang, Kun Zhou, Berrak Sisman, Lei Xie, Haizhou Li (2020), The NUS & NWPU system for Voice Conversion Challenge 2020, VCCBC
Rohan Kumar Das, Tomi Kinnunen, Wen-Chin Huang, Zhen-Hua Ling, Junichi Yamagishi, Zhao Yi, Xiaohai Tian, Tomoki Toda (2020), Predictions of Subjective Ratings and Spoofing Assessments of Voice Conversion Challenge 2020 Submissions, VCCBC
Wen-Chin Huang, Patrick Lumban Tobing, Yi-Chiao Wu, Kazuhiro Kobayashi, Tomoki Toda (2020), The NU Voice Conversion System for the Voice Conversion Challenge 2020: On the Effectiveness of Sequence-to-sequence Models and Autoregressive Neural Vocoders, VCCBC
Victor P. da Costa, Ranniery Maia, Igor M. Quintanilha, Sergio L. Netto, Luiz W. P. Biscainho (2020), The UFRJ Entry for the Voice Conversion Challenge 2020, VCCBC
YuHuai Peng, Cheng-Hung Hu, Alexander Kang, Hung-Shin Lee, Pin-Yuan Chen, Yu Tsao, Hsin-Min Wang (2020), The Academia Sinica Systems of Voice Conversion for VCC2020, VCCBC
Hieu-Thi Luong, Junichi Yamagishi (2020), Latent linguistic embedding for cross-lingual text-to-speech and voice conversion, VCCBC
Qiuyue Ma, Ruolan Liu, Xue Wen, Chunhui Lu, Xiao Chen (2020), Submission from SRCB for Voice Conversion Challenge 2020, VCCBC
Gaurav Bhatt, Akshita Gupta, Aditya Arora, Balasubramanian Raman (2018), Acoustic features fusion using attentive multi-channel deep architecture, CHiME
Christoph Boeddeker, Jens Heitkaemper, Joerg Schmalenstroeer, Lukas Drude, Jahn Heymann, Reinhold Haeb-Umbach (2018), Front-end processing for the CHiME-5 dinner party scenario, CHiME
Rama Doddipatla, Takehiko Kagoshima, Cong-Thanh Do, Petko Petkov, Catalin-Tudor Zorila, Euihyun Kim, Daichi Hayakawa, Hiroshi Fujimura, Yannis Stylianou (2018), The Toshiba entry to the CHiME 2018 Challenge, CHiME
Jun Du, Tian Gao, Lei Sun, Feng Ma, Yi Fang, Di-Yuan Liu, Qiang Zhang, Xiang Zhang, Hai-Kun Wang, Jia Pan, Jian-Qing Gao, Chin-Hui Lee, Jing-Dong Chen (2018), The USTC-iFlytek systems for CHiME-5 Challenge, CHiME
Sonal Joshi, Ashish Panda, Meet Soni, Rupayan Chakraborty, Sunilkumar Kopparapu, Nikhil Mohanan, Premanand Nayak, Rajbabu Velmurugan, Preeti Rao (2018), CHiME 2018 Workshop: Enhancing beamformed audio using time delay neural network denoising autoencoder, CHiME
Naoyuki Kanda, Rintaro Ikeshita, Shota Horiguchi, Yusuke Fujita, Kenji Nagamatsu, Xiaofei Wang, Vimal Manohar, Nelson Enrique Yalta Soplin, Matthew Maciejewski, Szu-Jui Chen, Aswin Shanmugam Subramanian, Ruizhi Li, Zhiqi Wang, Jason Naradowsky, L. Paola Garcia-Perera, Gregory Sell (2018), The Hitachi/JHU CHiME-5 system: Advances in speech recognition for everyday home environments using multiple microphone arrays, CHiME
Gil Keren, Jing Han, Björn Schuller (2018), Scaling speech enhancement in unseen environments with noise embeddings, CHiME
Siddharth Dalmia, Suyoun Kim, Florian Metze (2018), Situation informed end-to-end ASR for noisy environments, CHiME
Markus Kitza, Wilfried Michel, Christoph Boeddeker, Jens Heitkaemper, Tobias Menne, Ralf Schlüter, Hermann Ney, Joerg Schmalenstroeer, Lukas Drude, Jahn Heymann, Reinhold Haeb-Umbach (2018), The RWTH/UPB system combination for the CHiME 2018 Workshop, CHiME
Chenxing Li, Tieqiang Wang (2018), The ZTSpeech system for CHiME-5 Challenge: A far-field speech recognition system with front-end and robust back-end, CHiME
Yanhua Long, Renke He (2018), The SHNU system for the CHiME-5 Challenge, CHiME
Ivan Medennikov, Ivan Sorokin, Aleksei Romanenko, Dmitry Popov, Yuri Khokhlov, Tatiana Prisyach, Nikolay Malkovskiy, Vladimir Bataev, Sergei Astapov, Maxim Korenevsky, Alexander Zatvornitskiy (2018), The STC System for the CHiME 2018 Challenge, CHiME
Alim Misbullah (2018), Robust network structures for acoustic model on CHiME5 Challenge dataset, CHiME
Nikhil Mohanan, Premanand Nayak, Rajbabu Velmurugan, Preeti Rao, Sonal Joshi, Ashish Panda, Meet Soni, Rupayan Chakraborty, Sunilkumar Kopparapu (2018), NMF based front-end processing in multi-channel distant speech recognition, CHiME
Ankur Patil, Maddala V. Siva Krishna, Mehak Piplani, Pulikonda Aditya Sai, Hardik B. Sailor, Hemant A. Patil (2018), DA-IICT/IIITV system for the 5th CHiME 2018 Challenge, CHiME
Dan Qu, Cheng-Ran Liu, Xu-Kui Yang, Wen-lin Zhang (2018), The NDSC transcription system for the 2018 CHiME-5 Challenge, CHiME
Sining Sun, Yangyang Shi, Ching-Feng Yeh, Suliang Bu, Mei-Yuh Hwang, Lei Xie (2018), Multiple beamformers with ROVER for the CHiME-5 Challenge, CHiME
Hannes Unterholzner, Lukas Pfeifenberger, Franz Pernkopf, Marco Matassoni, Alessio Brutti, Daniele Falavigna (2018), Channel-selection for distant-speech recognition on CHiME-5 dataset, CHiME
Feifei Xiong, Jisi Zhang, Bernd Meyer, Heidi Christensen, Jon Barker (2018), Channel selection using neural network posterior probability for speech recognition with distributed microphone arrays in everyday environments, CHiME
Zhiwei Zhao, Jian Wu, Lei Xie (2018), The NWPU System for CHiME-5 Challenge, CHiME
John H. L. Hansen (2018), Robust speaker diarization and recognition in naturalistic data streams: Challenges for multi-speaker tasks & learning spaces, CHiME
Florian Metze (2018), Open-domain audiovisual speech recognition and video summarization, CHiME
Paul Taylor (2002), Pronunciation issues in text-to-speech synthesis, PMLA
James Flege (2002), Factors affecting the pronunciation of a second language, PMLA
Thomas Hain (2002), Implicit pronunciation modelling in ASR, PMLA
Alan Bell, Michelle L. Gregory, Jason M. Brenier, Daniel Jurafsky, Ayako Ikeno, Cynthia Girand (2002), Which predictability measures affect content word durations?, PMLA
Diamantino Caseiro, F. M. Silva, Isabel Trancoso, C. Viana (2002), Automatic alignment of map task dialogs using WFSTs, PMLA
Patgi Kam, Tan Lee (2002), Modeling pronunciation variation for Cantonese speech recognition, PMLA
J. M. Kessens, Helmer Strik, Catia Cucchiarini (2002), Modeling pronunciation variation for ASR: Comparing criteria for rule selection, PMLA
Kyung-Tak Lee, Lynette Melnar, Jim Talley (2002), Symbolic speaker adaptation for pronunciation modeling, PMLA
Abhinav Sethy, Shrikanth Narayanan, S. Parthasarathy (2002), A syllable based approach for improved recognition of spoken names, PMLA
Daniel Willett, Erik McDermott, Shigeru Katagiri (2002), Unsupervised pronunciation adaptation for off-line transcription of Japanese lecture speeches, PMLA
Mari Ostendorf, Rebecca Bates (2002), Modeling pronunciation variation in conversational speech using prosody, PMLA
Maxine Eskenazi, Gary Pelton (2002), Pinpointing pronunciation errors in children's speech: examining the role of the speech recognizer, PMLA
Eric Fosler-Lussier, Ingunn Amdal, Hong-Kwang Jeff Kuo (2002), On the road to improved lexical confusability metrics, PMLA
Serguei Koval, Natalia Smirnova, Mikhail Khitrov (2002), Modelling pronunciation variability for ASR tasks, PMLA
Odette Scharenborg, Lou Boves (2002), Pronunciation variation modelling in a model of human word recognition, PMLA
Stephanie Seneff, Chao Wang (2002), Modelling phonological rules through linguistic hierarchies, PMLA
Ming-yi Tsai, Fu-chiang Chou, Lin-shan Lee (2002), Improved pronunciation modeling by properly integrating better approaches for baseform generation, ranking and pruning, PMLA
Wayne Ward, Holly Krech, Xiuyang Yu, Keith Herold, George Figgs, Ayako Ikeno, Dan Jurafsky, William Byrne (2002), Lexicon adaptation for LVCSR: Speaker idiosyncracies, non-native speakers, and pronunciation choice, PMLA
Martine Adda-Decker, Philippe Boula de Mareüil, Gilles Adda, Lori Lamel (2002), Investigating syllabic structure and its variation in speech from French radio interviews, PMLA
Jerome R. Bellegarda (2002), A novel approach to unsupervised grapheme-to-phoneme conversion, PMLA
Timothy J. Hazen, I. Lee Hetherington, Han Shu, Karen Livescu (2002), Pronunciation modeling using a finite-state transducer representation, PMLA
Hauke Schramm, Peter Beyerlein (2002), Discriminative optimization of the lexical model, PMLA
Louis ten Bosch, Nick Cremelie (2002), Pronunciation modeling and lexical adaptation using small training sets, PMLA
Matthias Wolff, Matthias Eichner, Rüdiger Hoffmann (2002), Measuring the quality of pronunciation dictionaries, PMLA
Qian Yang, Jean-Pierre Martens, Pieter-Jan Ghesquiere, Dirk Van Compernolle (2002), Pronunciation variation modeling for ASR: large improvements are possible but small ones are likely, PMLA
Chiranjeevi Yarra, Prasanta Kumar Ghosh (2019), voisTUTOR: Virtual Operator for Interactive Spoken English TUTORing, SLaTE
Fred Richardson, John Steinberg, Gordon Vidaver, Steve Feinstein, Ray Budd, Jennifer Melot, Paul Gatewood, Douglas Jones (2019), Corpora Design and Score Calibration for Text Dependent Pronunciation Proficiency Recognition, SLaTE
Ray Budd, Tamas Marius, Paul Gatewood, Doug Jones (2019), Using K-Means in SVR-Based Text Difficulty Estimation, SLaTE
Volodymyr Sokhatskyi, Olga Zvyeryeva, Ievgen Karaulov, Dmytro Tkanov (2019), Embedding-based system for the Text part of CALL v3 shared task, SLaTE
Elham Akhlaghi, Branislav Bédi, Matthias Butterweck, Cathy Chua, Johanna Gerlach, Hanieh Habibi, Junta Ikeda, Manny Rayner, Sabina Sestigiani, Ghil'ad Zuckermann (2019), Demonstration of LARA: A Learning and Reading Assistant, SLaTE
Claudia Baur, Andrew Caines, Cathy Chua, Johanna Gerlach, Mengjie Qian, Manny Rayner, Martin Russell, Helmer Strik, Xizi Wei (2019), Overview of the 2019 Spoken CALL Shared Task, SLaTE
Elham Akhlaghi, Branislav Bédi, Matt Butterweck, Cathy Chua, Johanna Gerlach, Hanieh Habibi, Junta Ikeda, Manny Rayner, Sabina Sestigiani, Ghil'ad Zuckermann (2019), Overview of LARA: A Learning and Reading Assistant, SLaTE
Helmer Strik, Anna Ovchinnikova, Camilla Giannini, Angela Pantazi, Catia Cucchiarini (2019), Students' acceptance of MySpeechTrainer to improve spoken Academic English, SLaTE
Roberto Gretter, Marco Matassoni, Daniele Falavigna (2019), The FBK system for the 2019 Spoken CALL Shared Task, SLaTE
Wei Xue, Catia Cucchiarini, Roeland van Hout, Helmer Strik (2019), Acoustic correlates of speech intelligibility: the usability of the eGeMAPS feature set for atypical speech, SLaTE
Ralph L. Rose (2019), Fluidity: Developing second language fluency with real-time feedback during speech practice, SLaTE
Gary Yeung, Alison L. Bailey, Amber Afshan, Morgan Tinkler, Marlen Q. Pérez, Alejandra Martin, Anahit A. Pogossian, Samuel Spaulding, Hae Won Park, Manushaqe Muco, Abeer Alwan, Cynthia Breazeal (2019), A robotic interface for the administration of language, literacy, and speech pathology assessments for children, SLaTE
Satoshi Kobashikawa, Atushi Odakura, Takao Nakamura, Takeshi Mori, Kimitaka Endo, Takafumi Moriya, Ryo Masumura, Yushi Aono, Nobuaki Minematsu (2019), Does Speaking Training Application with Speech Recognition Motivate Junior High School Students in Actual Classroom? -- A Case Study, SLaTE
Prasanna V. Kothalkar, Dwight Irvin, Ying Luo, Joanne Rojas, John Nash, Beth Rous, John H. L. Hansen (2019), Tagging child-adult interactions in naturalistic, noisy, daylong school environments using i-vector based diarization system, SLaTE
Johanna Dobbriner, Oliver Jokisch (2019), Implementing and Evaluating Methods of Dialect Classification on Read and Spontaneous German Speech, SLaTE
Sweekar Sudhakara, Manoj Kumar Ramanathi, Chiranjeevi Yarra, Anurag Das, Prasanta Kumar Ghosh (2019), Noise robust goodness of pronunciation measures using teacher's utterance, SLaTE
Chiranjeevi Yarra, Manoj Kumar Ramanathi, Prasanta Kumar Ghosh (2019), Comparison of automatic syllable stress detection quality with time-aligned boundaries and context dependencies, SLaTE
Erika Godde, Gérard Bailly, Marie-Line Bosse (2019), Reading Prosody Development: Automatic Assessment for a Longitudinal Study, SLaTE
Zhenchao Lin, Yusuke Inoue, Tasavat Trisitichoke, Shintaro Ando, Daisuke Saito, Nobuaki Minematsu (2019), Native Listeners' Shadowing of Non-native Utterances as Spoken Annotation Representing Comprehensibility of the Utterances, SLaTE
Aparna Srinivasan, Chiranjeevi Yarra, Prasanta Kumar Ghosh (2019), Automatic assessment of pronunciation and its dependent factors by exploring their interdependencies using DNN and LSTM, SLaTE
Mohamed El Hajji, Morgane Daniel, Lucile Gelin (2019), Transfer Learning based Audio Classification for a noisy and speechless recordings detection task, in a classroom context, SLaTE
Jorge Proença, Ganna Raboshchuk, Ângela Costa, Paula Lopez-Otero, Xavier Anguera (2019), Teaching American English pronunciation using a TTS service, SLaTE
Mengjie Qian, Peter Jančovič, Martin Russell (2019), The University of Birmingham 2019 Spoken CALL Shared Task Systems: Exploring the importance of word order in text processing, SLaTE
Neasa Ní Chiaráin, Ailbhe Ní Chasaide (2019), An Scéalaí: autonomous learners harnessing speech and language technologies, SLaTE
Yiting Lu, Mark J. F. Gales, Katherine M. Knill, Potsawee Manakul, Yu Wang (2019), Disfluency Detection for Spoken Learner English, SLaTE
Adriana Guevara-Rukoz, Alexander Martin, Yutaka Yamauchi, Nobuaki Minematsu (2019), Prototyping a web-based phonetic training game to improve /r/-/l/ identification by Japanese learners of English, SLaTE
Lei Chen, Qianyong Gao, Qiubing Liang, Jiahong Yuan, Yang Liu (2019), Automatic Scoring Minimal-Pair Pronunciation Drills by Using Recognition Likelihood Scores and Phonological Features, SLaTE
Antje Schweitzer, Norbert Braunschweiler, Grzegorz Dogil, Bernd Möbius (2004), Assessing the acceptability of the Smartkom speech synthesis voices, SSW
Jithendra Vepa, Simon King (2004), Subjective evaluation of join cost & smoothing methods, SSW
Erwin Marsi (2004), Optionality in evaluating prosody prediction, SSW
Yoshinori Shiga, Simon King (2004), Accurate spectral envelope estimation for articulation-to-speech synthesis, SSW
Alexander Kain, Xiaochuan Niu, John-Paul Hosom, Qi Miao, Jan P. H. van Santen (2004), Formant re-synthesis of dysarthric speech, SSW
Tomoki Toda, Alan W. Black, Keiichi Tokuda (2004), Mapping from articulatory movements to vocal tract spectrum with Gaussian mixture model for articulatory speech synthesis, SSW
Toshio Hirai, Seiichi Tenpaku (2004), Using 5 ms segments in concatenative speech synthesis, SSW
Nobuo Nukaga, Ryota Kamoshida, Kenji Nagamatsu (2004), Unit selection using pitch synchronous cross correlation for Japanese concatenative speech synthesis, SSW
Ann K. Syrdal, Alistair D. Conkie (2004), Data-driven perceptually based join costs, SSW
Matthew Aylett (2004), Merging data driven and rule based prosodic models for unit selection TTS, SSW
Jan P. H. van Santen, Taniya Mishra, Esther Klabbers (2004), Estimating phrase curves in the general superpositional intonation model, SSW
Pablo Daniel Agüero, Antonio Bonafonte (2004), Intonation modeling for TTS using a joint extraction and prediction approach, SSW
Esther Klabbers, Jan P. H. van Santen (2004), Clustering of foot-based pitch contours in expressive speech, SSW
E. Eide, A. Aaron, R. Bakis, W. Hamza, Michael Picheny, J. Pitrelli (2004), A corpus-based approach to expressive speech synthesis, SSW
Guillaume Gibert, Gérard Bailly, Frédéric Elisei (2004), Audiovisual text-to-cued speech synthesis, SSW
Rachel Baker, Robert A. J. Clark, Michael White (2004), Synthesising contextually appropriate intonation in limited domains, SSW
Jelske Dijkstra, Louis C. W. Pols, Rob J. J. H. van Son (2004), Frisian TTS, an example of bootstrapping TTS for minority languages, SSW
Sebsibe H. Mariam, S. P. Kishore, Alan W. Black, Rohit Kumar, Rajeev Sangal (2004), Unit selection voice for Amharic using Festvox, SSW
Kalika Bali, Partha Pratim Talukdar, N. Sridhar Krishna, A.G. Ramakrishnan (2004), Tools for the development of a Hindi speech synthesis system, SSW
Hiroyuki Segi, Tohru Takagi, Takayuki Ito (2004), A concatenative speech synthesis method using context dependent phoneme sequences with variable length as search units, SSW
Justin Fackrell, Wojciech Skut (2004), Improving pronunciation dictionary coverage of names by modelling spelling variation, SSW
Yeon-Jun Kim, Ann Syrdal, Matthias Jilka (2004), Improving TTS by higher agreement between predicted versus observed pronunciations, SSW
Jerome R. Bellegarda (2004), A novel discontinuity metric for unit selection text-to-speech synthesis, SSW
Jordi Adell, Antonio Bonafonte (2004), Towards phone segmentation for concatenative speech synthesis, SSW
Joakim Gustafson, Kåre Sjölander (2004), Voice creation for conversational fairy-tale characters, SSW
Shinsuke Sakai (2004), F0 modeling with multi-layer additive modeling based on a statistical learning technique, SSW
John Kominek, Alan W. Black (2004), Impact of durational outlier removal from unit selection catalogs, SSW
Keikichi Hirose, Kentaro Sato, Nobuaki Minematsu (2004), Corpus-based synthesis of fundamental frequency contours with various speaking styles from text using F0 contour generation process model, SSW
Jianhua Tao, Yongguo Kang (2004), Multi-source based acoustic model for speech synthesis, SSW
Robert A. J. Clark, Korin Richmond, Simon King (2004), Festival 2 - build your own general purpose unit selection speech synthesiser, SSW
Hisashi Kawai, Tomoki Toda, Jinfu Ni, Minoru Tsuzaki, Keiichi Tokuda (2004), XIMERA: a new TTS from ATR based on corpus-based technologies, SSW
Fabio Tesser, Piero Cosi, Carlo Drioli, Graziano Tisato (2004), Prosodic data driven modelling of a narrative style in Festival TTS, SSW
Heiga Zen, Keiichi Tokuda, Tadashi Kitamura (2004), An introduction of trajectory model into HMM-based speech synthesis, SSW
N. Sridhar Krishna, Hema A. Murthy (2004), Duration modeling of Indian languages Hindi and Telugu, SSW
Jason Y. Zhang, Arthur R. Toth, Kevyn Collins-Thompson, Alan W. Black (2004), Prominence prediction for supersentential prosodic modeling based on a new database, SSW
Robert I. Damper, Yannick Marchand, John-David Marseters, Alex Bazin (2004), Aligning letters and phonemes for speech synthesis, SSW
Peter Rutten, David Talkin (2004), rvoice studio and activeprompts, SSW
Leonardo Badino, Claudia Barolo, Silvia Quazza (2004), Language independent phoneme mapping for foreign TTS, SSW
Enrico Zovato, Alberto Pacchiotti, Silvia Quazza, Stefano Sandri (2004), Towards emotional speech synthesis: a rule based approach, SSW
Alejandro C. Renato, José A. Alvarez (2004), Corpora of Latin American Spanish for research in prosody and synthesis, SSW
John Kominek, Alan W. Black (2004), The CMU Arctic speech databases, SSW
Arthur R. Toth (2004), Forced alignment for speech synthesis databases using duration and prosodic phrase breaks, SSW
Wentao Gu, Hiroya Fujisaki, Keikichi Hirose (2004), Analysis of fundamental frequency contours of Cantonese based on a command-response model, SSW
Brian Langner, Alan W. Black (2004), Creating a database of speech in noise for unit selection synthesis, SSW
Alan W. Black (2004), Overview of voice building, SSW
Tomoki Toda (2004), Overview of voice conversion, SSW
Loll Rolling (1989), Speech ninety-two - new horizons for the European Community, Eurospeech
Gunnar Fant (1989), Speech research in perspective, Eurospeech
Mei-Yuh Hwang, Hsiao-Wuen Hon, Kai-Fu Lee (1989), Modeling between-word coarticulation in continuous speech recognition, Eurospeech
Kari Torkkola, Kimmo Raivio (1989), Comparison of symbolic and connectionist approaches to eliminate coarticulation effects in phonemic speech recognition, Eurospeech
Alessandro Falaschi (1989), A functional based phonetic units definition for statistical speech recognizers, Eurospeech
Walter Weigel, Günther Ruske (1989), Continuous speech recognition using syllabic segmentation and demisyllable hidden Markov models, Eurospeech
Alain J. Vigier, Harvey F. Silverman (1989), Disambiguation of the e-set for connected-alphadigit recognition, Eurospeech
S. R. Young (1989), Use of dialogue, pragmatics and semantics to enhance speech recognition: applicability and limitations of dynamically reducing perplexity, Eurospeech
G. Th. Niedermair (1989), The use of a semantic network in speech dialogue, Eurospeech
Herbert S. Tropf (1989), Syntax in the spoken dialogue system spicos-II, Eurospeech
Brigitte Biebow, Pascal Coupey, Sylvie Szulman (1989), Using exceptions in a semantic network for a natural language application, Eurospeech
Michael Streit (1989), Presuppositions and anaphora in a question answering speech system, Eurospeech
Patti Price, Robert Moore, Hy Murveit, Fernando Pereira, Jared Bernstein, Mary Dalrymple (1989), The integration of speech and natural language in interactive spoken language systems, Eurospeech
P. Mousel, Jean-Marie Pierrel, A. Roussanaly (1989), Cooperation and representation of syntactic-semantic and pragmatic knowledge in a natural language task oriented spoken dialogue system, Eurospeech
K. Matrouf, Francoise Néel, Jean-Luc Gauvain, Joseph Mariani (1989), Adaptive syntax representation in an oral task-oriented dialogue for air-traffic controller training, Eurospeech
Sheri Hunnicutt (1989), Using syntactic and semantic information in a word prediction aid, Eurospeech
Naomi Inoue, Tsuyoshi Morimoto, Kentarou Ogura (1989), A linguistic knowledge base for applying semantic information to a speech understanding system, Eurospeech
Hiroaki Kitano, Hideto Tomabechi, Teruko Mitamura, Hitoshi Iida (1989), A massively parallel model of speech-to-speech dialog translation: a step toward interpreting telephony, Eurospeech
René Collier (1989), Intonation analysis: the perception of speech melody in relation to acoustics and production, Eurospeech
C. Avesani (1989), Towards a model of Italian intonation, Eurospeech
P. Mertens (1989), Automatic recognition of intonation in French and Dutch, Eurospeech
Ingo Hertrich, R. D. Gartenberg (1989), A new method in intonation research using partly controlled, simulated dialogues, Eurospeech
Jacqueline Vaissière (1989), On automatic extraction of prosodic information for automatic speech recognition system, Eurospeech
I. S. Howard, J. R. Walliker (1989), The implementation of a portable real-time multilayer-perceptron speech fundamental period estimator, Eurospeech
Anton Batliner, Elmar Nöth (1989), The prediction of focus, Eurospeech
Hugo Quené, René Kager (1989), Automatic accentuation and prosodic phrasing for Dutch text-to-speech conversion, Eurospeech
Renée van Bezooijen, Louis C. W. Pols (1989), Evaluation of a sentence accentuation algorithm for a Dutch text-to-speech system, Eurospeech
Philippe Martin (1989), Automatic assignment of lexical stress in Italian, Eurospeech
James L. Hieronymus (1989), Automatic sentential vowel stress labelling, Eurospeech
Willem J. M. Peeters, William J. Barry (1989), Diphthong dynamics: production and perception in Southern British English, Eurospeech
W. Datscheweit (1989), Quantitative measurement of the influence of acoustic cues on the perception of voiced plosives, Eurospeech
Jean-Luc Schwartz, Louis Jean Boe, Pascal Perrier, Bernard Guérin, Pierre Escudier (1989), Perceptual contrast and stability in vowel systems: a 3-d simulation study, Eurospeech
Ian M. C. Watson, Marianne McCormick, Franz Seitz, Anthony Bladon, Rosalind Temple (1989), The use of perceptually scaled spectra in across-talker algorithmic classification of British English stop consonants, Eurospeech
Ton Broeders, Toni Rietveld (1989), Segmental marking as a cue in auditory voice identification of telephone speech, Eurospeech
Peter Fesseler, Heidi Hackbarth, Marianne Kugler, Arnd Boehm (1989), Automatic vocabulary extension for a speaker-adaptive speech recognition system based on CVC units, Eurospeech
Carlo Scagliola, Cesare Vicenzi, Angelo Carossino, Donatella Sciarra (1989), Iterative optimization of sub-word templates for speech recognition, Eurospeech
F. Monnet, S. Jousset, A. Demour, P. Richard (1989), The IKAROS continuous speech understanding system: first demonstrator, Eurospeech
D. Fournol, C. Godin, Y. Guidon, P. Richard (1989), The spin continuous-speech decoding system, Eurospeech
E. Dermatas, Nikos Fanotakis, George Kokkinakis (1989), Improved speaker independent IWRS for small vocabularies, Eurospeech
P. Buttafava, Roberto Billi, W. Digiampietro, G. Massia, V. Vittorelli (1989), Architecture and implementation of the olivetti PC-based very large vocabulary isolated word speech recognition system, Eurospeech
Martine Adda-Decker (1989), Continuous speech recognition using phone-based anchor point detection and diphone-based dp-matching, Eurospeech
Michael Bundgaard (1989), Statistical analysis of large-scale lexical corpuses in the context of continuous speech recognition systems (CSR systems), Eurospeech
Jose B. Marino, Climent Nadeu, Asunción Moreno, Eduardo Lleida, Enric Monte (1989), Recognition of numbers and strings of numbers by using demisyllables: one speaker experiment, Eurospeech
Giulio Colangeli, Filippo Ardito (1989), A transputer based system for parallel dynamic time warping, Eurospeech
J. M. Noyes, Clive R. Frankish (1989), Gender differences in speech recognition performance, Eurospeech
Rolf Carlson, Björn Granström, Anders Lindstrom (1989), Predicting name pronunciation for a reverse directory service, Eurospeech
Murray F. Spiegel, Marian J. Macchi, Kurt D. Gollhardt (1989), Synthesis of names by a demisyllable-based speech synthesizer (SPOKESMAN), Eurospeech
Robin W. King (1989), Layout processing, user control and prosody insertion in an on-line synthetic speech system, Eurospeech
William A. Ainsworth, B. Pell (1989), Connectionist architectures for a text-to-speech system, Eurospeech
L. F. M. ten Bosch, René Collier, Louis Boves (1989), From diphones to allophones: from data to rules, Eurospeech
H. Loman, Renée van Bezooijen, J. Kerkhoff, Louis Boves (1989), A working environment and procedure for the development of speech synthesis rules, Eurospeech
Gérard Bailly, A. Tran (1989), Compost: a rule-compiler for speech synthesis, Eurospeech
Tomohisa Hirokawa (1989), Speech synthesis using a waveform dictionary, Eurospeech
Marina Bäckström, Ken Ceder, Bertil Lyberg (1989), PROPHON - an interactive environment for text-to-speech conversion, Eurospeech
Kai-Fu Lee (1989), Hidden Markov models: past, present, and future, Eurospeech
L. R. Bahl, S. V. De Gennaro, P. S. Gopalakrishnan, R. L. Mercer (1989), A fast approximate acoustic match for large vocabulary speech recognition, Eurospeech
A. J. Serralheiro, Y. Ephraim, Lawrence R. Rabiner (1989), On nonstationary hidden Markov modeling of speech signals, Eurospeech
X. D. Huang, Hsiao-Wuen Hon, Kai-Fu Lee (1989), Large-vocabulary speaker-independent continuous speech recognition with semi-continuous hidden Markov models, Eurospeech
Andrew Varga, Keith Ponting (1989), Control experiments on noise compensation in hidden Markov model based continuous word recognition, Eurospeech
Akihiro Imamura, Hiroshi Hamada, Ryohei Nakatsu (1989), Speaker-independent word recognition through telephone networks using hidden Markov models, Eurospeech
Hsiao-Wuen Hon, Kai-Fu Lee, Robert Weide (1989), Towards speech recognition without vocabulary-specific training, Eurospeech
Peter Davies (1989), Hidden Markov modelling of modern standard Chinese tones in connected speech, Eurospeech
J. H. Wright, E. N. Wrigley, M. J. Carey (1989), Probabilistic multilevel language analysis for speech recognition, Eurospeech
Kai-Fu Lee, Sanjoy Mahajan (1989), Corrective and reinforcement learning for speaker-independent continuous speech recognition, Eurospeech
Gerhard Rigoll (1989), An information theory approach to speaker adaptation, Eurospeech
C. J. Darwin (1989), Speech perception seen through the ear, Eurospeech
Zong Liang Wu, Jean Luc Schwartz, Pierre Escudier (1989), A theoretical study of neural mechanisms specialized in the detection of articulatory-acoustic events, Eurospeech
M. J. Pont, Robert I. Damper (1989), A possible neural basis for the categorical perception of the English voiced/voiceless contrast, Eurospeech
R. W. Hukin, Robert I. Damper (1989), Testing an auditory model by resynthesis, Eurospeech
F. Berthommier, Jean-Luc Schwartz, Pierre Escudier (1989), Auditory processing in a post-cochlear neural network: vowel spectrum processing based on spike synchrony, Eurospeech
Mariken ter Keurs, Reinier Plomp, Joost M. Festen (1989), Effects of spectral smearing on speech reception, Eurospeech
Y. Fujihashi, A. Fukui (1989), Telephone speech recognition system with high noise immunity, Eurospeech
Gu Yong, John S. Mason (1989), Speaker normalization via a linear transformation on a perceptual feature space and its benefits in ASR adaptation, Eurospeech
Hans-Wilhelm Ruehl, S. Dobler, J. Weith, Peter Meyer, A. Noll, H. H. Hamer, H. Piotrowski (1989), Speech recognition in the noisy car environment, Eurospeech
Michael Carey, Amanda Howe, Roger Tucker (1989), On the recognition of key words in unconstrained conversation, Eurospeech
John S. Mason, J. Oglesby, L. Xu (1989), Codebooks to optimise speaker recognition, Eurospeech
L. Xu, John S. Mason (1989), Instantaneous and transitional perceptually-based features in speaker identification, Eurospeech
Claudio Rocchi, Enzo Mumolo (1989), A new method for performing weighted distances for speaker authentication, Eurospeech
A. Federico, G. Ibba, A. Paoloni, N. De Sario, B. Saverione (1989), Comparison between automatic methods and human listeners in speaker recognition tasks, Eurospeech
Antonella Giannini, Massimo Pettorino, Umberto Cinque (1989), Speaker's identification by voice, Eurospeech
Janusz Zalewski (1989), Text dependent speaker recognition in noise, Eurospeech
Yoshimitsu Hirata, Seiich Nakagawa (1989), A 100 bit/s speech coding using a speech recognition technique, Eurospeech
Jean-Paul Lefevre, Roberto Viola (1989), Real-time multirate speech codec for manned spacecraft communications, Eurospeech
R. Soheili, A. M. Kondoz, B. G. Evans (1989), New innovations in multi-pulse speech coding for bit rates below 8 kb/s, Eurospeech
I. Boyd, C. B. Southcott, P. J. Bolingbroke (1989), A speech coder for aeronautical telecommunications, Eurospeech
Kazunori Ozawa (1989), A 4.8 kb/s high-quality speech coding using various types of excitation signals, Eurospeech
M. Delprat, M. Lever, C. Gruet (1989), Efficient excitation model and fast selection in CELP coding of speech, Eurospeech
S. A. Atungsiri, A. M. Kondoz, B. G. Evans (1989), A low bit rate speech coder optimized for forward error control, Eurospeech
Kazumi Satoh, Hideaki Kurihara, Shigeyuki Unagami, Masanori Kajihara, Yoshihiro Tomita (1989), 8- and 16-kb/s APC-AB voice codec using a single chip DSP, Eurospeech
N. Moreau, P. Dymarski (1989), Mixed excitation CELP coder, Eurospeech
B. R. Wery, A. Leroux, H. Ph. Delbrouck, J. Leclerc (1989), A new parametric speech analysis and synthesis technique in the frequency domain, Eurospeech
Pascal Blanchet (1989), Multilayer perceptron architectures for data compression tasks, Eurospeech
Steve Renals, Jonathan Dalby (1989), Analysis of a neural network model for speech recognition, Eurospeech
Stefano Patarnello, Stefano Scarei (1989), Self-organizing boolean networks for speech recognition, Eurospeech
Enric Monte, Eduardo Lleida, Jose B. Marino (1989), New backpropagation algorithm using quadratic potential functions, and an experiment on isolated word recognition, Eurospeech
Jari Kangas, Teuvo Kohonen (1989), Transient map method in stop consonant discrimination, Eurospeech
M.-H. Caharel, Laurent Miclet (1989), Filtering a phonetic lattice with a connexionnist network, Eurospeech
R. Bulot, P. Nocera (1989), Explicit knowledge and neural networks for speech recognition, Eurospeech
L. Bottou, F. Fogelman Soulie, Pascal Blanchet, Jean-Sylvain Lienard (1989), Experiments with time delay networks and dynamic time warping for speaker independent isolated digits recognition, Eurospeech
Paul Dalsgaard (1989), Semi-automatic phonemic labelling of speech data using a self-organising neural network, Eurospeech
Anders Baekgaard, Paul Dalsgaard (1989), Recognition of continuous speech using neural nets and expert system, Eurospeech
Yasuhiro Komori, Kaichiro Hatazaki, Takaharu Tanaka, Takeshi Kawabata, Kiyohiro Shikano (1989), Phoneme recognition expert system using spectrogram reading knowledge and neural networks, Eurospeech
P. Haffner, Alex Waibel, H. Sawai, Kiyohiro Shikano (1989), Fast back-propagation learning methods for large phonemic neural networks, Eurospeech
Richard Rohwer, David Cressy (1989), Phoneme classification by boolean networks, Eurospeech
Mikko Kokkonen, Kari Torkkola (1989), Using self-organizing maps and multi-layered feed-forward nets to obtain phonemic transcriptions of spoken utterances, Eurospeech
Mark A. Huckvale, I. S. Howard, William J. Barry (1989), Automatic phonetic feature labelling of continuous speech, Eurospeech
Inger Karlsson (1989), A female voice for a text-to-speech system, Eurospeech
William J. Barry, Martine Grice, Valerie Hazan, Adrian J. Fourcin (1989), Excitation distributions for synthesised speech, Eurospeech
Jacques M. B. Terken, René Collier (1989), Automatic synthesis of natural-sounding intonation for text-to-speech conversion in Dutch, Eurospeech
P. J. Moreno, M. Martinez, José M. Pardo, J. A. Vallejo (1989), Improving naturalness in a text-to-speech system with a new fundamental frequency algorithm, Eurospeech
T. Brovchenko, V. Voloshin (1989), Discourse intonation and expressive synthetic speech, Eurospeech
Gunnar Fant, Anita Kruckenberg, Lennart Nord (1989), Rhythmical structures in text reading - a language contrasting study, Eurospeech
Alex I. C. Monaghan (1989), Phonological domains for intonation in speech synthesis, Eurospeech
S. Quazza, G. Varese, E. Vivalda (1989), Syntactic pre-processing for high quality text-to-speech, Eurospeech
Danielle Larreur, Francoise Emerard, F. Marty (1989), Linguistic and prosodic processing for a text-to-speech synthesis system, Eurospeech
N. J. Youd, Frank Fallside (1989), Driving a speech synthesizer from conceptual input in the context of a voice dialogue system, Eurospeech
Art Blokland, Henry S. Thompson (1989), A parser for feature-based speech recognition, Eurospeech
Noel Nguyen-Trong (1989), A recent advance in factorial analysis, related to phonetic feature extraction, Eurospeech
Katsuhiko Shirai, Noriyuki Aoki, Naoki Hosaka (1989), Phoneme recognition in continuous speech using feature selection based on mutual information, Eurospeech
Franck Poirier (1989), Automatic labelling of continuous speech based on hierarchical representation of the energy, Eurospeech
Henry S. Thompson (1989), A chart parsing realisation of dynamic programming, with best-first enumeration of paths in a lattice, Eurospeech
E. Dermatas, George Kokkinakis (1989), A system for automatic text labelling, Eurospeech
D. Cericola, M. Danieli, M. J. Mollo, D. Voltolini (1989), Morpho-syntactic tools for speech processing, Eurospeech
A. Mastrolonardo, M. Refice (1989), Measuring the power of self-organized linguistic models, Eurospeech
Christophe Fouquere (1989), Is nonmonotonic grammar a solution to natural language processing?, Eurospeech
Emmanuel Reynier, Jean Caelen (1989), ATN compiler and parser for an ASR system, Eurospeech
Kjell Elenius, Rolf Carlson (1989), Assigning parts-of-speech to words from their orthography using a connectionist model, Eurospeech
Mike McAllister (1989), The problems of punctuation ambiguity in fully automatic text-to-speech conversion, Eurospeech
M. Refice, M. Savino (1989), Word endings analysis of European languages, Eurospeech
M. Prakash, G. V. Ramana Rao, C. Chandra Sekhar, B. Yegnanarayana (1989), Parsing spoken utterances in an inflectional language, Eurospeech
J. B. Berthelin, J. P. Fournier, B. Grau (1989), Processing non-expected language, Eurospeech
K. E. P. Carter, S. Gookson, A. F. Newell, J. L. Arnott, R. Dye (1989), The effect of feedback on composition rate using a simulated listening typewriter, Eurospeech
Benjamin Chigier, Erik Urdang, Judith Spitz (1989), Analysis of two algorithms for telephone speech recognition, Eurospeech
T. Thomas, J. Peckham, E. Frangoulis, J. Cove (1989), The sensitivity of speech recognisers to speaker variability and speaker variation, Eurospeech
V. V. Vu, R. A. King (1989), Automatic diagnostic and assessment procedures for the comparison and optimisation of time encoded speech (TES) DVI systems, Eurospeech
R. D. Hughes, R. A. King (1989), A comparison of the performance of "normal" and "whispered" speech with simple time encoded digital speech (TES) direct voice input (DVI) systems in a tactical military environment, Eurospeech
Hiroshi Hamada, Satoshi Miki, Ryohei Nakatsu (1989), Automatic evaluation of English pronunciation based on speech recognition techniques, Eurospeech
Mark J. Bakkum, Reinier Plomp, Louis C. W. Pols (1989), Objective evaluation of word pronunciation by filter-band analysis, Eurospeech