doi: 10.21437/ICSLP.1996
New developments in the INRS continuous speech recognition system
Z. Li, M. Heon, Douglas O'Shaughnessy
On designing pronunciation lexicons for large vocabulary, continuous speech recognition
Lori Lamel, Gilles Adda
Word graph rescoring using confidence measures
Pablo Fetter, Frédéric Dandurand, Peter Regel-Brietzmann
A bottom-up approach for handling unseen triphones in large vocabulary continuous speech recognition
X. L. Aubert, Peter Beyerlein, Meinhard Ullrich
Discriminative optimisation of large vocabulary recognition systems
V. Valtchev, P. C. Woodland, S. J. Young
Japanese large-vocabulary continuous-speech recognition using a business-newspaper corpus
Tatsuo Matsuoka, Katsutoshi Ohtsuki, Takeshi Mori, Sadaoki Furui, Katsuhiko Shirai
Handling compound nouns in a Swedish speech-understanding system
David Carter, Jaan Kaja, Leonardo Neumeyer, Manny Rayner, Fuliang Weng, Mats Wiren
Initial evaluation of a preselection module for a flexible large vocabulary speech recognition system in
J. Macias-Guarasa, A. Gallardo, J. Ferreiros, José M. Pardo, L. Villarrubia
Asynchronous integration of visual information in an automatic speech recognition system
Mamoun Alissali, Paul Deleglise, Alexandrina Rogozan
Audiovisual speech recognition using multiscale nonlinear image decomposition
I. A. Matthews, J. Bangham, S. J. Cox
Robust audiovisual integration using semicontinuous hidden Markov models
Qin Su, Peter L. Silsbee
The effect of visual information on word initial consonant perception of dysarthric speech
Richard P. Schumeyer, Kenneth E. Barner
A multiple deformable template approach for visual speech recognition
Devi Chandramohan, Peter L. Silsbee
Speaker independent bimodal phonetic recognition experiments
Piero Cosi, E. Magno Caldognetto, Franco Ferrero, M. Dugatto, K. Vagges
Speechreading using shape and intensity information
Juergen Luettin, Neil A. Thacker, Steve W. Beet
Speaker identification by lipreading
Juergen Luettin, Neil A. Thacker, Steve W. Beet
How word onsets drive lexical access and segmentation: evidence from acoustics, phonology and processing
David W. Gow Jr., Janis Melvold, Sharon Manuel
RAW: a real-speech model for human word recognition
David van Kuijk, Peter Wittenburg, Ton Dijkstra
How facilitatory can lexical information be during word recognition? evidence from Moroccan Arabic
Mehdi Meftah, Sami Boudelaa
Effects of frequency on the auditory perception of open- versus closed-class words
Alette P. Haveman
Phonotactic and metrical influences on adult ratings of spoken nonsense words
Michael S. Vitevitch, Paul A. Luce, Jan Charles-Luce, David Kemmerer
Lipreading supplemented by voice fundamental frequency: to what extent does the addition of voicing increase lexical uniqueness for the lipreader?
Edward T. Auer Jr., Lynne E. Bernstein
Strategies used in rhyme-monitoring
S. te Riele, Sieb G. Nooteboom, H. Quené
How do Dutch listeners process words with epenthetic schwa?
Wilma van Donselaar, Cecile Kuijpers, Anne Cutler
Whole-word phonetic distances and the PGPfone alphabet
Patrick Juola, Philip Zimmermann
Automatic vowel quality description using a variable mapping to an eight cardinal vowel reference set
Shuping Ran, J. Bruce Millar, Phil Rose
Automatic detection and segmentation of pronunciation variants in German speech corpora
Andreas Kipp, Maria-Barbara Wesenick, Florian Schiel
ANGIE: a new framework for speech analysis based on morpho-phonological modelling
Stephanie Seneff, Raymond Lau, Helen Meng
Perceptual contrast in the Korean and English vowel system normalized
Byunggon Yang
On phonetic characteristics of pause in the Korean read speech
Yong-Ju Lee, Sook-Hyang Lee
Cross-language effects of lexical stress in word recognition: the case of Arabic English bilinguals
Sami Boudelaa, Mehdi Meftah
Automatic generation of German pronunciation variants
Maria-Barbara Wesenick
Estimating the quality of phonetic transcriptions and segmentations of speech signals
Maria-Barbara Wesenick, Andreas Kipp
An acoustic analysis of contemporary vowels of the standard Slovenian language
Bojan Petek, Rastislav Sustarsic, Smiljana Komar
Using decision trees to construct optimal acoustic cues
Sandrine Robbe, Anne Bonneau, Sylvie Coste, Yves Laprie
Maximum jaw displacement in contrastive emphasis
Donna Erickson, Osamu Fujimura
Subglottal pressure and final lowering in English
Rebecca Herman, Mary Beckman, Kiyoshi Honda
Phonological variation: epenthesis and deletion of schwa in Dutch
Cecile Kuijpers, Wilma van Donselaar, Anne Cutler
Feedback considerations for speech training systems
James J. Mahshie
Clinical applications of computer-based speech training for children with hearing impairment
Anne-Marie Öster
Enhancing information-rich regions of natural VCV and sentence materials presented in noise
Valerie Hazan, Andrew Simpson
Speech perceptual abilities of children with specific reading difficulty (dyslexia)
Valerie Hazan, Alan Adlard
Bimodal perception of spectrum compressed speech
Larry D. Paarmann, Michael K. Wynne
Effect of sentential context on syllabic stress perception by hearing-impaired listeners
Dragana Barac-Cikoja, Sally Revoile
Applications of automatic speech recognition to speech and language development in young children
Martin Russell, Catherine Brown, Adrian Skilling, Rob Series, Julie Wallace, Bill Bohnam, Paul Barker
Sub-band adaptive speech enhancement for hearing aids
D. R. Campbell
Adapting a TTS system to a reading machine for the blind
Thomas Portele, Jürgen Krämer
Modeling of spoken dialogue with and without visual information
Katsuhiko Shirai
Multimodal discourse modelling in a multi-user multi-domain environment
Stephanie Seneff, David Goddeau, Christine Pao, Joseph Polifroni
Automatic acquisition of probabilistic dialogue models
Kenji Kita, Yoshikazu Fukui, Masaaki Nagata, Tsuyoshi Morimoto
Units of dialogue management: an example
Paul Heisterkamp, Scott McGlashan
Error resolution during multimodal human-computer interaction
Sharon Oviatt, Robert VanGent
Improved spontaneous dialogue recognition using dialogue and utterance triggers by adaptive probability boosting
Ramesh R. Sarukkai, Dana H. Ballard
Speech recognition for spontaneously spoken German dialogues
Kai Hübener, Uwe Jost, Henrik Heine
Using prosodic information to constrain language models for spoken dialogue
Paul Taylor, Hiroshi Shimodaira, Stephen Isard, Simon King, Jacqueline Kowtko
Combining the detection and correction of speech repairs
Peter A. Heeman, Kyung-ho Loken-Kim, James F. Allen
Generating spontaneous elliptical utterance
Yuji Sagawa, Wataru Sugimoto, Noboru Ohnishi
Developing the modelling of Swedish prosody in spontaneous dialogue
Gösta Bruce, Marcus Filipsson, Johan Frid, Björn Granström, Kjell Gustafson, Merle Horne, David House, Birgitta Lastow, Paul Touati
Spoken language generation in a multimedia system
Shimei Pan, Kathleen R. McKeown
Synthesizing dialogue speech of Japanese based on the quantitative analysis of prosodic features
Keikichi Hirose, Mayumi Sakata, Hiromichi Kawanami
Spoken dialogue interface in a dual task situation
Shuichi Tanaka, Shu Nakazato, Keiichiro Hoashi, Katsuhiko Shirai
A dialogue control strategy based on the reliability of speech recognition
Yasuhisa Niimi, Yutaka Kobayashi
Speechwear: a mobile speech system
Alexander I. Rudnicky, Stephen Reed, Eric H. Thayer
WHEELS: a conversational system in the automobile classifieds domain
Helen Meng, Senis Busayapongchai, James Glass, David Goddeau, Lee Hetherington, Edward Hurley, Christine Pao, Joseph Polifroni, Stephanie Seneff, Victor Zue
Effective human-computer cooperative spoken dialogue: the AGS demonstrator
M. D. Sadek, A. Ferrieux, A. Cozannet, P. Bretier, F. Panaget, J. Simonin
Dialog in the RAILTEL telephone-based system
S. K. Bennacef, L. Devillers, S. Rosset, Lori Lamel
Dialogue processing in a conversational speech translation system
Alon Lavie, Lori Levin, Yan Qu, Alex Waibel, Donna Gates, Marsal Gavaldà, Laura Mayfield, Maite Taboada
Combination of word-based and category-based language models
T. R. Niesler, P. C. Woodland
A multi-level lexical-semantics based language model design for guided integrated continuous speech recognition
Francisco J. Valverde-Albacete, José M. Pardo
A category based approach for recognition of out-of-vocabulary words
Florian Gallwitz, Elmar Nöth, Heinrich Niemann
Scalable backoff language models
Kristie Seymore, Ronald Rosenfeld
Modeling long distance dependence in language: topic mixtures vs. dynamic cache models
R. Iyer, Mari Ostendorf
Bayesian estimation methods for n-gram language model adaptation
Marcello Federico
Modeling disfluencies in conversational speech
Man-hung Siu, Mari Ostendorf
Evaluation of a language model using a clustered model backoff
John Miller, Fil Alleva
Language modeling using x-grams
Antonio Bonafonte, José B. Mariño
Class phrase models for language modelling
Klaus Ries, Finn Dag Buo, Alex Waibel
Introducing linguistic constraints into statistical language modeling
Petra Geutner
Language modeling with stochastic automata
Jianying Hu, William Turin, Michael K. Brown
Feature dimension reduction using reduced-rank maximum likelihood estimation for hidden Markov models
Don X. Sun
Using multi-level segmentation coefficients to improve HMM speech recognition
Kai Hübener
A comparative study of linear feature transformation techniques for automatic speech recognition
T. Eisele, Reinhold Haeb-Umbach, D. Langmann
Inclusion of temporal information into features for speech recognition
Ben Milner
New cepstral representation using wavelet analysis and spectral transformation for robust speech recognition
Hubert Wassner, Gérard Chollet
Wavelet based feature extraction for phoneme recognition
C. J. Long, S. Datta
New fast wavelet packet transform algorithms for frame synchronized speech processing
Andrzej Drygajlo
Frequency-warping in speech
S. Umesh, L. Cohen, N. Marinovic, D. Nelson
Extracting speech features from human speech-like noise
Daisuke Kobayashi, Shoji Kajita, Kazuya Takeda, Fumitada Itakura
Subband-crosscorrelation analysis for robust speech recognition
Shoji Kajita, Kazuya Takeda, Fumitada Itakura
A new ASR approach based on independent processing and recombination of partial frequency bands
Hervé Bourlard, Stéphane Dupont
Frequency and time filtering of filter-bank energies for HMM speech recognition
Climent Nadeu, José B. Mariño, Javier Hernando, Albino Nogueiras
Extraction of tongue contours in x-ray images with minimal user interaction
Yves Laprie, Marie-Odile Berger
Three-dimensional measurement of the vocal tract by MRI
Didier Demolin, Thierry Metens, Alain Soquet
Syllable affiliation of final consonant clusters undergoes a phase transition over speaking rates
Philip Gleason, Betty Tuller, J. A. Scott Kelso
Towards a biomechanical model of the larynx
Arthur Lobo, Michael O'Malley
Generating intonation by superposing gestures
Yann Morlec, Gérard Bailly, Véronique Aubergé
Effects of auditory feedback on F0 trajectory generation
Hideki Kawahara, Hiroko Kato, J. C. Williams
On the effects of accent and language on low rate speech coders
I. S. Burnett, J. J. Parry
VQ codevector index assignment using genetic algorithms for noisy channels
J. S. Pan, Fergus R. McInnes, Mervyn A. Jack
An improved vector quantization algorithm for speech transmission over noisy channels
Gavin C. Cawley
Very low delay and high quality coding of 20 Hz-15 kHz speech signals at 64 kbit/s
C. Murgia, G. Feng, A. Le Guyader, C. Quinquis
Application of speaker modification techniques to phonetic vocoding
Carlos M. Ribeiro, Isabel M. Trancoso
Entropy coded vector quantization with hidden Markov models
Tadashi Yonezaki, Kiyohiro Shikano
An application of recurrent neural networks to low bit rate speech coding
Minoru Kohata
CELP coding system based on mel-generalized cepstral analysis
Kazuhito Koishida, Keiichi Tokuda, Takao Kobayashi, Satoshi Imai
Wideband re-synthesis of narrowband CELP-coded speech using multiband excitation model
Cheung-Fat Chan, Wai-Kwong Hui
Recurrent neural networks for phoneme recognition
Takuya Koizumi, Mikio Mori, Shuji Taniguchi, Mitsutoshi Maruya
A model for the acoustic phonetic structure of Arabic language using a single ergodic hidden Markov model
M. A. Mokhtar, A. Zein-el-Abddin
Modelling long term variability information in mixture stochastic trajectory framework
Yifan Gong, Irina Illina, Jean-Paul Haton
Segmental phonetic features recognition by means of neural-fuzzy networks and integration in an n-best solutions post-processing
T. Moudenc, R. Sokol, Guy Mercier
Stochastic trajectory model with state-mixture for continuous speech recognition
Irina Illina, Yifan Gong
Recognition of spelled names over the telephone
Hermann Hild, Alex Waibel
Optimal tying of HMM mixture densities using decision trees
Gilles Boulianne, Patrick Kenny
Speech recognition using an enhanced FVQ based on a codeword dependent distribution normalization and codeword weighting by fuzzy objective function
Hwan Jin Choi, Yung Hwan Oh
Using the self-organizing map to speed up the probability density estimation for speech recognition with mixture density HMMs
Mikko Kurimo, Panu Somervuo
Temporal cues for vowels and universals of vowel inventories
Carrie E. Lang, John J. Ohala
Acoustic variability in spontaneous conversational speech of American English talkers
Ann K. Syrdal
Cross-language speech perception: Swedish, English, and Spanish speakers' perception of front rounded vowels
Raquel Willerman, Patricia K. Kuhl
Inter-language vowel perception and production by Korean and Japanese listeners
John C. L. Ingram, See-Gyoon Park
Intelligibility and acoustic correlates of Japanese accented English vowels
Diane Kewley-Port, Reiko Akahane-Yamada, Kiyoaki Aikawa
Segmentation strategies for spoken language recognition: evidence from semi-bilingual Japanese speakers of English
Kiyoko Yoneyama
Integrating connectionist, statistical and symbolic approaches for continuous spoken Korean processing
Geunbae Lee, Jong-Hyeok Lee, Kyubong Park, Byung-Chang Kim
Towards ASR on partially corrupted speech
Hynek Hermansky, Sangita Timberwala, Misha Pavel
Parametric trajectory models for speech recognition
Herbert Gish, Kenney Ng
Use of Gaussian selection in large vocabulary continuous speech recognition using HMMs
K. M. Knill, M. J. F. Gales, S. J. Young
Cross phone state clustering using lexical stress and context
J. Hogberg, Kare Sjölander
Likelihood ratio decoding and confidence measures for continuous speech recognition
Eduardo Lleida-Solano, Richard C. Rose
A study on continuous Chinese speech recognition based on stochastic trajectory models
Xiaohui Ma, Yifan Gong, Yuqing Fu, Jiren Lu, Jean-Paul Haton
A proposal for a new algorithm of reference interval-free continuous DP for real-time speech or text retrieval
Yoshiaki Itoh, Jiro Kiyama, Hiroshi Kojima, Susumu Seki, Ryuichi Oka
Language modeling by string pattern n-gram for Japanese speech recognition
Akinori Ito, Masaki Kohda
Statistical language modeling using a variable context length
Reinhard Kneser
A comparison of hybrid HMM architectures using global discriminative training
Finn Tore Johansen
Improved probability estimation with neural network models
Wei Wei, Etienne Barnard, Mark Fanty
A neural network using acoustic sub-word units for continuous speech recognition
Ha-Jin Yu, Yung-Hwan Oh
On the error criteria in neural networks as a tool for human classification modelling
Louis F. M. ten Bosch, Roel Smits
A non-linear filtering approach to stochastic training of the articulatory-acoustic mapping using the EM algorithm
Gordon Ramsay
A tool for automated design of language models
Y. P. Yang, J. R. Deller Jr.
Acoustic-phonetic decoding based on Elman predictive neural networks
F. Freitag, E. Monte
On improving discrimination capability of an RNN based recognizer
Tan Lee, P. C. Ching
An evaluation of statistical language modeling for speech recognition using a mixed category of both words and parts-of-speech
Yumi Wakita, Jun Kawai, Hitoshi Iida
Novel speech processing mechanism derived from auditory neocortical circuit analysis
Boris Aleksandrovsky, James Whitson, Gretchen Andes, Gary Lynch, Richard Granger
Modeling neurons in the anteroventral cochlear nucleus for amplitude modulation (AM) processing: application to speech sound
Ping Tang, Jean Rouat
Noise suppression and loudness normalization in an auditory model-based acoustic front-end
Halewijn Vereecken, Jean-Pierre Martens
A psychoacoustic model for the noise masking of voiceless plosive bursts
Jim Hant, Brian Strope, Abeer Alwan
Training machine classifiers to match the performance of human listeners in a natural vowel classification task
Martin Hunke, Thomas Holton
A neural matrix model for active tracking of frequency-modulated tones
Kiyoaki Aikawa, Hideki Kawahara, Minoru Tsuzaki
A user-configurable system for voice label recognition
Richard C. Rose, Eduardo Lleida-Solano, G. W. Erhart, R. V. Grubbe
Keyword spotting enhancement for video soundtrack indexing
Philippe Gelin, Chris J. Wellekens
New efficient fillers for unlimited word recognition and keyword spotting
Rachida El Méliani, Douglas O'Shaughnessy
Automatic transcription of general audio data: preliminary analyses
Michelle S. Spina, Victor Zue
Transcribing radio news
Francis Kubala, Tasos Anastasakos, Hubert Jin, Long Nguyen, Richard Schwartz
Correcting recognition errors via discriminative utterance verification
Anand R. Setlur, Rafid A. Sukkar, John Jacob
Does training in speech perception modify speech production?
Reiko Akahane-Yamada, Yoh'ichi Tohkura, Ann R. Bradlow, David B. Pisoni
Phrase-final lengthening and stress-timed shortening in the speech of native speakers and Japanese learners of English
Motoko Ueyama
Japanese accentuations by foreign students and Japanese speakers of non-Tokyo dialect
Nobuko Yamada
Devoicing of Japanese vowels by Taiwanese learners of Japanese
J. Kevin Varden, Tsutomu Sato
Fluency and use of segmental dialect features in the acquisition of a second language (French) by English speakers
Danièle Archambault, Catherine Foucher, Blagovesta Maneva
Estimating child and adolescent formant frequency values from adult data
P. Martland, Sandra P. Whiteside, Steve W. Beet, L. Baghai-Ravary
Acoustic correlates of linguistic stress and accent in Dutch and American English
Agaath M. C. Sluijter, Vincent J. van Heuven
On the levels of accentuation in spoken Japanese
Hiroya Fujisaki, Sumio Ohno, Osamu Tomita
Tonal distinctions between emphatic stress and pretonic lengthening in Quebec French
Linda Thibault, Marise Ouellet
Distinction between 'normal' focus and 'contrastive/emphatic' focus
Anja (Petzold) Elsner
Perception of tonal accent by Americans learning Japanese
Yukihiro Nishinuma, Masako Arai, Takako Ayusawa
Modeling intra-speaker pitch range variation: predicting F0 targets when "speaking up"
Elizabeth Shriberg, D. Robert Ladd, Jacques Terken
Predicting dialogue acts for a speech-to-speech translation system
Norbert Reithinger, Ralf Engel, Michael Kipp, Martin Klesen
Automatic speech translation based on the semantic structure
Johannes Müller, Holger Stahl, Manfred Lang
A methodology for application development for spoken language systems
Lewis M. Norton, Carl E. Weir, K. W. Scholz, Deborah A. Dahl, Ahmed Bouzid
A new restaurant guide conversational system: issues in rapid prototyping for specialized domains
Stephanie Seneff, Joseph Polifroni
Semantic interpretation of a Japanese complex sentence in an advisory dialogue - focused on the postpositional word "KEDO," which works as a conjunction between clauses
Tadahiko Kumamoto, Akira Ito
A Korean morphological analyzer for speech translation system
Youngkuk Hong, Myoung-Wan Koo, Gijoo Yang
Generic and domain-specific aspects of the Waxholm NLP and dialog modules
Rolf Carlson, Sheri Hunnicutt
A real-time system for summarizing human-human spontaneous spoken dialogues
Megumi Kameyama, Goh Kawai, Isao Arima
Evaluation of spoken language understanding and dialogue systems
Bernd Hildebrandt, Heike Rautenstrauch, Gerhard Sagerer
Inter-speaker interaction of F0 in dialogs
Kuniko Kakita
A robust dialogue system for making an appointment
Hans Brandt-Pook, Gernot A. Fink, Bernd Hildebrandt, Franz Kummert, Gerhard Sagerer
Segmentation of spoken dialogue by interjections, disfluent utterances and pauses
Kazuyuki Takagi, Shuichi Itahashi
A form-based dialogue manager for spoken language applications
David Goddeau, Helen Meng, Joseph Polifroni, Stephanie Seneff, Senis Busayapongchai
The design of complex telephony applications using large vocabulary speech technology
S. J. Whittaker, D. J. Attwater
Building 10,000 spoken dialogue systems
Stephen Sutton, David G. Novick, Ronald A. Cole, Pieter Vermeulen, Jacques de Villiers, Johan Schalkwyk, Mark Fanty
Speaker intention modeling for large vocabulary Mandarin spoken dialogues
Yen-Ju Yang, Lee-Feng Chien, Lin-Shan Lee
Hybrid language models and spontaneous legal discourse
P. E. Kenne, Mary O'Kane
Topic change and local perplexity in spoken legal dialogue
P. E. Kenne, Mary O'Kane
Intonational cues to discourse structure in Japanese
Jennifer J. Venditti, Marc Swerts
Principles for the design of cooperative spoken human-machine dialogue
Niels Ole Bernsen, Hans Dybkjær, Laila Dybkjær
Development and comparison of three syllable stress classifiers
Karen L. Jenkin, Michael S. Scordilis
Interaction of speech disorders with speech coders: effects on speech intelligibility
D. G. Jamieson, Li Deng, M. Price, Vijay Parsa, J. Till
Detecting arytenoid cartilage misplacement through acoustic and electroglottographic jitter analysis
Maurílio N. Vieira, Arnold G. D. Maran, Fergus R. McInnes, Mervyn A. Jack
Robust F0 and jitter estimation in pathological voices
Maurílio N. Vieira, Fergus R. McInnes, Mervyn A. Jack
Speech monitoring of infective laryngitis
F. Plante, H. Kessler, B. M. G. Cheetham, J. Earis
Searching for nonlinear relations in whitened jitter time series
Jean Schoentgen, Raoul de Guchteneere
Vocal fold pathology assessment using AM autocorrelation analysis of the Teager energy operator
Liliana Gavidia-Ceballos, John H. L. Hansen, James F. Kaiser
Continuous positive airway pressure (CPAP) in the treatment of hypernasality
David P. Kuehn
Enhancement of alaryngeal speech by adaptive filtering
Carol Y. Espy-Wilson, Venkatesh R. Chari, Caroline B. Huang
Simulation of disordered speech using a frequency-domain vocal tract model
Li Deng, Xuemin Shen, D. G. Jamieson, J. Till
A stochastic model of fundamental period perturbation and its application to perception of pathological voice quality
Yasuo Endo, Hideki Kasuya
A screening test for speech pathology assessment using objective quality measures
Eric J. Wallen, John H. L. Hansen
Recent advances in hypernasal speech detection using the nonlinear Teager energy operator
Douglas A. Cairns, John H. L. Hansen, James F. Kaiser
Human palate and related structures: their articulatory consequences
Kiyoshi Honda, Shinji Maeda, Michiko Hashi, Jim Dembowski, John R. Westbury
A continuum mechanics representation of tongue deformation
Edward P. Davis, Andrew Douglas, Maureen Stone
From MRI and acoustic data to articulatory synthesis: a case study of the lateral approximants in American English
Philbert Bangayan, Abeer Alwan, Shrikanth Narayanan
Liquids in Tamil
Shrikanth Narayanan, Abigail Kaun, Dani Byrd, Peter Ladefoged, Abeer Alwan
Speaker individualities of vocal tract shapes of Japanese vowels measured by magnetic resonance images
Chang-Sheng Yang, Hideki Kasuya
Vocal tract acoustics using the transmission line matrix (TLM) method
S. El-Masri, X. Pelorson, P. Saguet, Pierre Badin
Building sensori-motor prototypes from audiovisual exemplars
Gérard Bailly
Parameterized VT area function inversion
Mats Båvegård, Gunnar Fant
An improved vocal tract model of vowel production implementing piriform resonance and transvelar nasal coupling
Jianwu Dang, Kiyoshi Honda
Pseudo-articulatory speech synthesis for recognition using automatic feature extraction from x-ray data
C. S. Blackburn, S. J. Young
Modeling hyperarticulate speech during human-computer error resolution
Sharon Oviatt, Gina-Anne Levow, Margaret MacEachern, Karen Kuhn
Using stress to disambiguate spoken Thai sentences containing syntactic ambiguity
Siripong Potisuk, Mary P. Harper, Jackson T. Gandour
Use of prosodic information to integrate acoustic and linguistic knowledge in continuous Mandarin speech recognition with very large vocabulary
Hung-yun Hsieh, Ren-yuan Lyu, Lin-shan Lee
Word boundary detection using pitch variations
G. V. Ramana Rao, J. Srichand
Detection of phrase boundaries in Japanese by low-pass filtering of fundamental frequency contours
Atsuhiro Sakurai, Keikichi Hirose
A new method for speech delexicalization, and its application to the perception of French prosody
Vincent Pagel, Noelle Carbonell, Yves Laprie
Task adaptation for dialogues via telephone lines
Udo Bub
The influence of bigram constraints on word recognition by humans: implications for computer speech recognition
Ronald A. Cole, Yonghong Yan, Troy Bailey
ALICE: acquisition of language in conversational environment - an approach to weakly supervised training of spoken language system for language porting
Tetsunori Kobayashi
Pitch pattern clustering of user utterances in human-machine dialogue
Takashi Yoshimura, Satoru Hayamizu, Hiroshi Ohmura, Kazuyo Tanaka
Simplifying language through error-correcting decoding
J. C. Amengual, Enrique Vidal, J. M. Benedí
A mixed approach to speech understanding
Mauro Cettolo, Anna Corazza, Renato De Mori
Speech recognition for an information kiosk
Jean-Luc Gauvain, J. J. Gangolf, Lori Lamel
Localizing an automatic inquiry system for public transport information
Helmer Strik, Albert Russel, Henk van den Heuvel, Catia Cucchiarini, Louis Boves
Prompt constrained natural language - evolving the next generation of telephony services
Stephen M. Marcus, Deborah W. Brown, Randy G. Goldberg, Max S. Schoeffler, William R. Wetzel, Richard R. Rosinski
Key-phrase detection and verification for flexible speech understanding
Tatsuya Kawahara, Chin-Hui Lee, Biing-Hwang Juang
Interactive recovery from speech recognition errors in speech user interfaces
Bernhard Suhm, Brad Myers, Alex Waibel
Estimation of language models for new spoken language applications
Sunil Issar
H-infinity filtering for speech enhancement
Xuemin Shen, Li Deng, Anisa Yasmin
A comparative analysis of channel-robust features and channel equalization methods for speech recognition
Saeed V. Vaseghi, Ben Milner
Robust speech recognition features based on temporal trajectory filtering of frequency band spectrum
Jia-lin Shen, Wen-liang Hwang, Lin-shan Lee
Durational modelling for improved connected digit recognition
Kevin Power
Study on the dereverberation of speech based on temporal envelope filtering
Carlos Avendano, Hynek Hermansky
Estimating Markov model structures
Thorsten Brants
A fertility channel model for post-correction of continuous speech recognition
Eric K. Ringger, James F. Allen
Restoration of wide band signal from telephone speech using linear prediction error processing
Hiroshi Yasukawa
Smoothed spectral subtraction for a frequency-weighted HMM in noisy speech recognition
Hiroshi Matsumoto, Noboru Naitoh
A simple architecture for using multiple cues in sound separation
William S. Woods, Martin Hansen, Thomas Wittkop, Birger Kollmeier
On the robust automatic segmentation of spontaneous speech
Bojan Petek, Ove Andersen, Paul Dalsgaard
Bayesian adaptation of speech recognizers to field speech data
C. G. Miglietta, C. Mokbel, D. Jouvet, J. Monné
Sub-band adaptive filtering applied to speech enhancement
A. J. Darlington, D. J. Campbell
Noise robust estimate of speech dynamics for speaker recognition
J. P. Openshaw, John S. Mason
Overview of speech enhancement techniques for automatic speaker recognition
Javier Ortega-García, Joaquín González-Rodríguez
Dynamic features for segmental speech recognition
Naomi Harte, Saeed V. Vaseghi, Ben Milner
Speech recognition based on a model of human auditory system
Takuya Koizumi, Mikio Mori, Shuji Taniguchi
APVQ encoder applied to wideband speech coding
J. M. Salavedra, E. Masgrau
Simple fast vector quantization of the line spectral frequencies
Jin Zhou, Yair Shoham, Ali Akansu
N-best-based instantaneous speaker adaptation method for speech recognition
Tomoko Matsui, Sadaoki Furui
Mixture splitting technic and temporal control in a HMM-based recognition system
C. Montacié, M.-J. Caraty, C. Barras
A unified spectral transformation adaptation approach for robust speech recognition
Lei Yao, Dong Yu, Taiyi Huang
On-line adaptive learning of the correlated continuous density hidden Markov models for speech recognition
Qiang Huo, Chin-Hui Lee
Speaker adaptation by modeling the speaker variation in a continuous speech recognition system
Nikko Ström
An enquiring system of unknown words in TV news by spontaneous repetition (application of speaker normalization by speaker subspace projection)
Yasuo Ariki, Shigeaki Tagashira
Adaptive recognition method based on posterior use of distribution pattern of output probabilities
Jin-Song Zhang, Beiqian Dai, Changfu Wang, Hingkeung Kwan, Keikichi Hirose
Iterative unsupervised adaptation using maximum likelihood linear regression
P. C. Woodland, D. Pye, M. J. F. Gales
A compact model for speaker-adaptive training
Tasos Anastasakos, John McDonough, Richard Schwartz, John Makhoul
Iterative unsupervised speaker adaptation for batch dictation
Shigeru Homma, Jun-ichi Takahashi, Shigeki Sagayama
Rapid unsupervised adaptation to children's speech on a connected-digit task
Daniel C. Burnett, Mark Fanty
Speaker adaptation using tree structured shared-state HMMs
Jun Ishii, Masahiro Tonomura, Shoichi Matsunaga
Language understanding using hidden understanding models
Richard Schwartz, Scott Miller, David Stallard, John Makhoul
Processing of semantic information in fluently spoken language
Allen L. Gorin
Automatic linguistic segmentation of conversational speech
Andreas Stolcke, Elizabeth Shriberg
Towards understanding spontaneous speech: word accuracy vs. concept accuracy
M. Boros, W. Eckert, Florian Gallwitz, Günther Görz, G. Hanrieder, Heinrich Niemann
A stochastic case frame approach for natural language understanding
Wolfgang Minker, S. K. Bennacef, Jean-Luc Gauvain
Improving speech understanding by incorporating database constraints and dialogue history
Frank Seide, Bernhard Rüber, Andreas Kellner
Learning to parse spontaneous speech
Finn Dag Buo, Alex Waibel
Spontaneous speech and natural language processing ALPES: a robust semantic-led parser
Jean-Yves Antoine
The natural language processing module for a voice assisted operator at Telefónica I+D
J. Alvarez-Cercadillo, J. Caminero-Gil, C. Crespo-Casas, D. Tapias-Merino
Compound words in large-vocabulary German speech recognition systems
André Berton, Pablo Fetter, Peter Regel-Brietzmann
Prosody, empty categories and parsing - a success story
Anton Batliner, A. Feldhaus, S. Geissler, T. Kiss, Ralf Kompe, Elmar Nöth
Almost parsing technique for language modeling
B. Srinivas
A new discourse structure model for spontaneous spoken dialogue
Tetsuro Chino, Hiroyuki Tsuboi
An architecture for spoken dialogue management
David Duff, Barbara Gates, Susann LuperFoy
Pausing strategies in discourse in Dutch
Monique E. van Donzel, Florien J. Koopmans-van Beinum
Filled pauses as markers of discourse structure
Marc Swerts, Anne Wichmann, Robbert-Jan Beun
The prosodic analysis of Korean dialogue speech - through a comparative study with read speech
Cheol-jae Seong, Minsoo Hahn
Changing the topic: how long does it take?
Mary O'Kane, P. E. Kenne
Learning pronunciation dictionary from speech data
Christian-Michael Westendorf, Jens Jelitto
The trended HMM with discriminative training for phonetic classification
C. Rathinavelu, Li Deng
Improving decision trees for acoustic modeling
Ariane Lazaridès, Yves Normandin, Roland Kuhn
An improved training algorithm in HMM-based speech recognition
Gongjun Li, Taiyi Huang
Speech recognition using a strong correlation assumption for the instantaneous spectra
J. Ming, P. O'Boyle, J. McMahon, F. J. Smith
On parameter filtering in continuous subword-unit-based speech recognition
Pau Pachès-Leal, Climent Nadeu
Estimation of statistical phoneme center considering phonemic environments
Shigeki Okawa, Katsuhiko Shirai
Integration of context-dependent durational knowledge into HMM-based speech recognition
Xue Wang, Louis F. M. ten Bosch, Louis C. W. Pols
Speech recognition based on acoustically derived segment units
T. Fukada, M. Bacchiani, Kuldip K. Paliwal, Yoshinori Sagisaka
Robust gender-dependent acoustic-phonetic modelling in continuous speech recognition based on a new automatic male/female classification
Rivarol Vergin, Azarshid Farhat, Douglas O'Shaughnessy
A codebook adaptation algorithm for SCHMM using formant distribution
Tae Young Yang, Won Ho Shin, Weon Goo Kim, Dae Hee Youn
Parameter tying for flexible speech recognition
J. Simonin, S. Bodin, D. Jouvet, K. Bartkova
Word-spotting based on inter-word and intra-word diphone models
Tsuneo Nitta, Shin'ichi Tanaka, Yasuyuki Masai, Hiroshi Matsu'ura
Duration modeling with expanded HMM applied to speech recognition
Antonio Bonafonte, Josep Vidal, Albino Nogueiras
Different strategies for distribution clustering using discrete, semicontinuous and continuous HMMs in CSR
Ricardo de Córdoba, José M. Pardo
Improved HMM phone and triphone models for realtime ASR telephony applications
Ilija Zeljkovic, Shrikanth Narayanan
Improved extended HMM composition by incorporating power variance
Yasuhiro Minami, Sadaoki Furui
Optimal filtering and smoothing for speech recognition using a stochastic target model
Gordon Ramsay, Li Deng
Speech recognition using syllable-like units
Zhihong Hu, Johan Schalkwyk, Etienne Barnard, Ronald A. Cole
Context modeling and clustering in continuous speech recognition
Jean-Claude Junqua, Lorenzo Vassallo
Hierarchical partition of the articulatory state space for overlapping-feature based speech recognition
Li Deng, Jim Jian-Xiong Wu
A fuzzy acoustic-phonetic decoder for speech recognition
Olivier Oppizzi, David Fournier, Philippe Gilles, Henri Méloni
Syllable-level desynchronisation of phonetic features for speech recognition
Katrin Kirchhoff
A probabilistic framework for feature-based speech recognition
James Glass, Jane Chang, Michael McCandless
Modeling context-dependent phonetic units in a continuous speech recognition system for Mandarin Chinese
Jim Jian-Xiong Wu, Li Deng, Jacky Chan
Search for unexplored effects in speech production
Cecil H. Coker, M. H. Krane, B. Y. Reis, R. A. Kubli
Articulatory synthesis from x-rays and inversion for an adaptive speech robot
Pierre Badin, Christian Abry
Analysis of acoustic properties of the nasal tract using 3-D FEM
Hisayoshi Suzuki, Takayoshi Nakai, Hirosi Sakakibara
Experiments with analysis by synthesis of glottal airflow
Johan Liljencrants
From segmental duration properties to rhythmic structure: a study of interactions between high and low level
Marise Ouellet, Benoît Tardif
Analysis of context-dependent segmental duration for automatic speech recognition
Xue Wang, Louis C. W. Pols, Louis F. M. ten Bosch
The role of the rhythmic groups in the segmentation of continuous French speech
Delphine Dahan
The implications of temporal patterns for the prosody of boundary signaling in connected speech
Zita McRobbie-Utasi
Experimental phonetic study of the syllable duration of Korean with respect to the positional effect
Hyunbok Lee, Cheol-jae Seong
Timing of pitch movements and accentuation of syllables
Dik J. Hermes
A probabilistic approach to AMDF pitch detection
Goangshiuan S. Ying, Leah H. Jamieson, Carl D. Mitchell
From sagittal cut to area function: an RMI investigation
Alain Soquet, Véronique Lecuit, Thierry Metens, Didier Demolin
Pitch detection and voiced/unvoiced decision algorithm based on wavelet transforms
Léonard Janer, Juan José Bonet, Eduardo Lleida-Solano
Decomposition of speech signals into a deterministic and a stochastic part
Yannis Stylianou
Improved glottal closure instant detector based on linear prediction and standard pitch concept
Cheol-Woo Jo, Ho-Gyun Bang, William A. Ainsworth
Analysis of speech segments using variable spectral/temporal resolution
Xihong Wang, Stephen A. Zahorian, Stefan Auberg
Time-based clustering for phonetic segmentation
Brian Eberman, William Goldenthal
Formant analysis using mixtures of Gaussians
Parham Zolfaghari, Tony Robinson
Deriving articulatory representations from speech with various excitation modes
Hywel B. Richards, John S. Mason, Melvyn J. Hunt, John S. Bridle
Blind speech segmentation: automatic segmentation of speech without linguistic knowledge
Manish Sharma, Richard J. Mammone
Speech synthesis using a nonlinear energy damping model for the vocal folds vibration effect
Hiroshi Ohmura, Kazuyo Tanaka
Neural networks learning with L1 criteria and its efficiency in linear prediction of speech signals
Munehiro Namba, Hiroyuki Kamata, Yoshihisa Ishida
Preprocessing and neural classification of English stop consonants [b,d,g,p,t,k]
Anna Esposito, C. E. Ezin, M. Ceccarelli
A comparison of modified k-means (MKM) and NN based real time adaptive clustering algorithms for articulatory space codebook formation
K. S. Ananthakrishnan
A novel approach to the estimation of voice source and vocal tract parameters from speech signals
Wen Ding, Hideki Kasuya
Syllable detection in read and spontaneous speech
Hartmut R. Pfitzinger, Susanne Burger, Sebastian Heid
Maximum likelihood learning of auditory feature maps for stationary vowels
Kuansan Wang, Chin-Hui Lee, Biing-Hwang Juang
Explicit segmentation of speech using Gaussian models
Antonio Bonafonte, Albino Nogueiras, Antonio Rodriguez-Garrido
A comparison of several recent methods of fundamental frequency and voicing decision estimation
E. Mousset, William A. Ainsworth, José A. R. Fonollosa
Robust pitch estimation with harmonics enhancement in noisy environments based on instantaneous frequency
Toshihiko Abe, Takao Kobayashi, Satoshi Imai
Integrated polispectrum on speech recognition
Asunción Moreno, Miquel Rutllán
An incremental speaker-adaptation technique for hybrid HMM-MLP recognizer
Joao P. Neto, Ciro A. Martins, Luís B. Almeida
Phoneme segmentation of continuous speech using multi-layer perceptron
Youngjoo Suh, Youngjik Lee
Stochastic perceptual speech models with durational dependence
Jeff Bilmes, Nelson Morgan, Su-Lin Wu, Hervé Bourlard
Boosting the performance of connectionist large vocabulary speech recognition
G. D. Cook, A. J. Robinson
HMMs and OWE neural network for continuous speech recognition
Nicolas Pican, Dominique Fohr, Jean-François Mari
Smoothed local adaptation of connectionist systems
Steve Waterhouse, Dan Kershaw, Tony Robinson
Robust speech recognition with speaker localization by a microphone array
Takeshi Yamada, Satoshi Nakamura, Kiyohiro Shikano
Sound source localization in reverberant environments using an outlier elimination algorithm
Ea-Ee Jan, James L. Flanagan
The 1995 ABBOT LVCSR system for multiple unknown microphones
Dan Kershaw, Tony Robinson, Steve Renals
Experiments of speech recognition in a noisy and reverberant environment using a microphone array and HMM
D. Giuliani, Maurizio Omologo, P. Svaizer
Increasing robustness in GMM speaker recognition systems for noisy and reverberant speech with low complexity microphone arrays
Joaquín González-Rodríguez, Javier Ortega-García, César Martin, Luis Hernández
Robust automatic speech recognition using a multi-channel signal separation front-end
Kuan-Chieh Yen, Yunxin Zhao
Prosody generation in text-to-speech conversion using dependency graphs
Anders Lindström, Ivan Bretan, Mats Ljungqvist
Extraction method of non-restrictive modification in Japanese as a marked factor of prosody
Hisako Asano, Hisashi Ohara, Yoshifumi Ooyama
Modeling contrast in the generation and synthesis of spoken language
Scott Prevost
A left-to-right processing model of pausing in Japanese based on limited syntactic information
Hajime Tsukada
Modeling of intonation bearing emphasis for TTS-synthesis of Greek dialogues
D. Galanis, V. Darsinos, George Kokkinakis
Synthesizing prosody: a prominence-based approach
Barbara Heuft, Thomas Portele
Multilingual text analysis for text-to-speech synthesis
Richard Sproat
Spoken-style explanation generator for Japanese kanji using a text-to-speech system
Yoshifumi Ooyama, Hisako Asano, Koji Matsuoka
A method for estimating prosodic symbol from text for Japanese text-to-speech synthesis
Ken-ichi Magata, Tomoki Hamagami, Mitsuo Komura
Statistical methods in data-driven modeling of Spanish prosody for text to speech
E. López-Gonzalo, J. M. Rodríguez-García
Intonation processing for TTS using stylization and neural network learning method
Jung-Chul Lee, Youngjik Lee, Sang-Hun Kim, Minsoo Hahn
Generating F0 contours from ToBI labels using linear regression
Alan W. Black, Andrew J. Hunt
The broad study of homograph disambiguity for Mandarin speech synthesis
Wern-Jun Wang, Shaw-Hwa Hwang, Sin-Horng Chen
The MBROLA project: towards a set of high quality speech synthesizers free of use for non commercial purposes
Thierry Dutoit, Vincent Pagel, N. Pierret, F. Bataille, O. Van der Vrecken
Training data selection for voice conversion using speaker selection and vector field smoothing
Makoto Hashimoto, Norio Higuchi
A new voice transformation method based on both linear and nonlinear prediction analysis
Ki Seung Lee, Dae Hee Youn, Il Whan Cha
On the transformation of the speech spectrum for voice conversion
G. Baudoin, Yannis Stylianou
Spectral analysis of synthetic speech and natural speech with noise over the telephone line
Cristina Delogu, Andrea Paoloni, Susanna Ragazzini, Paola Ridolfi
A new speech synthesis system based on the ARX speech production model
Weizhong Zhu, Hideki Kasuya
Speech synthesis using the CELP algorithm
Geraldo Lino de Campos, Evandro Bacci Gouvêa
A Mandarin text-to-speech system
Shaw-Hwa Hwang, Sin-Horng Chen, Yih-Ru Wang
Residual-based speech modification algorithms for text-to-speech synthesis
Mike D. Edgington, A. Lowry
A generalized LR parser for text-to-speech synthesis
Per Olav Heggtveit
Enhanced shape-invariant pitch and time-scale modification for concatenative speech synthesis
M. P. Pollard, B. M. G. Cheetham, C. C. Goodyear, Mike D. Edgington, A. Lowry
An excitation synchronous pitch waveform extraction method and its application to the VCV-concatenation synthesis of Japanese spoken words
Yasuhiko Arai, Ryo Mochizuki, Hirofumi Nishimura, Takashi Honda
A new Chinese text-to-speech system with high naturalness
Ren-Hua Wang, Qinfeng Liu, Difei Tang
Voice conversion based on topological feature maps and time-variant filtering
Ansgar Rinscheid
Language training system utilizing speech modification
Meron Yoram, Keikichi Hirose
Perception of English /r/ and /l/ speech contrasts by native Korean listeners with extensive English-language experience
D. G. Jamieson, K. Yu
Automatic text-independent pronunciation scoring of foreign language student speech
Leonardo Neumeyer, Horacio Franco, Mitchel Weintraub, Patti Price
Assessing the contribution of instructional technology in the teaching of pronunciation
Antônio Simoes
Detection of foreign speakers' pronunciation errors for second language training - preliminary results
Maxine Eskenazi
Foreign accent in intonation patterns - a contrastive study applying a quantitative model of the F0 contour
Hansjörg Mixdorff
Input modality effects in foreign accent
Duncan J. Markham, Yasuko Nagano-Madsen
For speech perception by humans or machines, three senses are better than one
Lynne E. Bernstein, Christian Benoît
A few factors which affect the degree of incorporating lip-read information into speech perception
Kaoru Sekiyama, Yoh'ichi Tohkura, Michio Umeda
Characterizing audiovisual information during speech
E. Vatikiotis-Bateson, K. G. Munhall, Y. Kasahara, F. Garcia, H. Yehia
The implications of the Tadoma method of speechreading for spoken language processing
Charlotte M. Reed
Seeing speech in space and time: psychological and neurological findings
Ruth Campbell
Studies of the McGurk effect: implications for theories of speech perception
Kerry P. Green
Using the visual component in automatic speech recognition
N. M. Brooke
Perceptual organization of speech in one and several modalities: common functions, common resources
Robert E. Remez
Multi-modal encoding of speech in memory: a first report
David B. Pisoni, Helena M. Saldaña, Sonya M. Sheffert
What's in the "pure" prosody?
Volker Strom, Christina Widera
F0 declination in read-aloud and spontaneous speech
Marc Swerts, Eva Strangert, Mattias Heldner
Prediction of prosodic phrase boundaries considering variable speaking rate
Yeon-jun Kim, Yung-hwan Oh
Prediction of F0 parameter of contextualized utterances in dialogue
Yoichi Yamashita, Riichiro Mizoguchi
The production and perception of potentially ambiguous intonation contours by speakers of Russian and Japanese
V. Makarova, J. Matsui
What is invariant and what is optional in the realization of a FOCUSED word? a cross-dialectal study of Swedish sentences with moving focus
Robert Eklund
Quantifying spectral characteristics of fricatives
Christine H. Shadle, Sheila J. Mair
Acoustic characteristics of ejectives in Ingush
Natasha Warner
An acoustic profile of consonant reduction
Rob J. J. H. van Son, Louis C. W. Pols
Devoicing in post-vocalic Canadian-French obstruents
Danièle Archambault, Blagovesta Maneva
Paying attention to speaking rate
Alexander L. Francis, Howard C. Nusbaum
The lack of invariance problem and the goal of speech perception
Irene Appelbaum
The acoustic structure of vowels in mothers' speech to infants and adults
Jean E. Andruski, Patricia K. Kuhl
Acoustical characteristics of sound production of deaf and normally hearing infants
Chris J. Clement, Florien J. Koopmans-van Beinum, Louis C. W. Pols
Word recognition by Japanese infants
P. A. Halle, Toshisada Deguchi, Yuji Tamekawa, B. Boysson-Bardies, Shigeru Kiritani
Investigations of the word segmentation abilities of infants
Peter W. Jusczyk
Developmental change in perception of clause boundaries by 6- and 10-month-old Japanese infants
Akiko Hayashi, Yuji Tamekawa, Toshisada Deguchi, Shigeru Kiritani
A frequency domain method for parametrization of the voice source
Paavo Alku, Erkki Vilkman
Glottal correlates of the word stress and the tense/lax opposition in German
Krzysztof Marasek
Coarticulatory stability in American English /r/
Suzanne Boyce, Carol Y. Espy-Wilson
An MRI-based analysis of the English /r/ and /l/ articulations
Shinobu Masaki, Reiko Akahane-Yamada, Mark K. Tiede, Yasuhiro Shimada, Ichiro Fujimoto
Does lexical stress or metrical stress better predict word boundaries in Dutch?
David van Kuijk
Optopalatograph (OPG): a new apparatus for speech production analysis
Alan A. Wrench, A. D. McIntosh, William J. Hardcastle
Prediction of vowel systems using a deductive approach
René Carré
Distinctions between [t] and [tch] using electropalatography data
Sheila J. Mair, Celia Scully, Christine H. Shadle
Relating formants and articulation in intelligibility test words
Michiko Hashi, Raymond D. Kent, John R. Westbury, Mary J. Lindstrom
The role of coarticulation in the perception of vowel quality in modern standard Arabic
Imad Znagui, Mohamed Yeou
Updating the Reading EPG
Simon Arnfield, Wilf Jones
Lexical stress detection on stress-minimal word pairs
Goangshiuan S. Ying, Leah H. Jamieson, Ruxin Chen, Carl D. Mitchell
An acoustic study of the interaction between stressed and unstressed syllables in spoken Mandarin
Jing Wang
Automatic detection of accent nuclei at the head of words for speech recognition
Nobuaki Minematsu, Seiichi Nakagawa
Automatic generation of prosodic structure for high quality Mandarin speech synthesis
Fu-chiang Chou, Chiu-yu Tseng, Lin-shan Lee
A study on Japanese prosodic pattern and its modeling in restricted speech
Tomoki Hamagami, Ken-ichi Magata, Mitsuo Komura
A phonetic study of focus in intransitive verb sentences
Steve Hoskins
Goethe for prosody
Stefan Rapp
Prosodic cues in syntactically ambiguous strings; an interactive speech planning mechanism
K. A. Straub
A functional model for generation of the local components of F0 contours in Chinese
Jinfu Ni, Ren-Hua Wang, Deyu Xia
The acquisition of voiceless stops in the interlanguage of second language learners of English and Spanish
Marie Fellbaum
Evaluating automatic speech recognition as a component of a multi-input device human-computer interface
B. A. Mellor, C. Baber, C. Tunley
Data collection for the MASK kiosk: WOz vs prototype system
A. Life, I. Salter, J. N. Temem, F. Bernard, S. Rosset, S. K. Bennacef, Lori Lamel
An experimental Japanese/English interpreting video phone system
M. Karaorman, T. H. Applebaum, T. Itoh, M. Endo, Y. Ohno, M. Hoshimi, T. Kamai, K. Matsui, K. Hata, S. Pearson, Jean-Claude Junqua
User participation and compliance in speech automated telecommunications applications
Sara Basson, Stephen Springer, Cynthia Fong, Hong Leung, Ed Man, Michele Olson, John Pitrelli, Ranvir Singh, Suk Wong
Embedding speech in web interfaces
Samuel Bayer
Voice-activated home banking system and its field trial
Toshihiro Isobe, Masatoshi Morishima, Fuminori Yoshitani, Nobuo Koizumi, Ken'ya Murakami
A text analyzer for Korean text-to-speech systems
Sangho Lee, Yung-Hwan Oh
Design and evaluation of a phonological phrase parser for Spanish text-to-speech
Helen E. Karn
Comparison of two tree-structured approaches for grapheme-to-phoneme conversion
Ove Andersen, Roland Kuhn, Ariane Lazaridès, Paul Dalsgaard, Jürgen Haas, Elmar Nöth
A recurrent network that learns to pronounce English text
M. J. Adamson, Robert I. Damper
Archisegment-based letter-to-phone conversion for concatenative speech synthesis in Portuguese
Eleonora Cavalcante Albano, Agnaldo Antonio Moreira
A new method of generating speech synthesis units based on phonological knowledge and clustering technique
Yuki Yoshida, Shin'ya Nakajima, Kazuo Hakoda, Tomohisa Hirokawa
Consistency in transcription and labelling of German intonation with GToBI
Martine Grice, Matthias Reyelt, Ralf Benzmüller, Jörg Mayer, Anton Batliner
Syntactic-prosodic labeling of large spontaneous speech data-bases
Anton Batliner, Ralf Kompe, Andreas Kiessling, Heinrich Niemann, Elmar Nöth
Relationship between discourse structure and dynamic speech rate
Florien J. Koopmans-van Beinum, Monique E. van Donzel
Using prosodic clues to decide when to produce back-channel utterances
Nigel Ward
Dialog act classification with the help of prosody
Marion Mast, Ralf Kompe, Stefan Harbeck, Andreas Kiessling, Heinrich Niemann, Elmar Nöth, Ernst G. Schukat-Talamazzini, Volker Warnke
Using lexical stress in continuous speech recognition for Dutch
David van Kuijk, Henk van den Heuvel, Louis Boves
Automatic accent classification of foreign accented Australian English speech
Karsten Kumpf, Robin W. King
Discriminative adaptation for speaker verification
F. Korkmazskiy, Biing-Hwang Juang
Perceptual features of unknown foreign languages as revealed by multi-dimensional scaling
V. Stockmal, D. Muljani, Z. S. Bond
On-line incremental adaptation for speaker verification using maximum likelihood estimates of CDHMM parameters
Kin Yu, John S. Mason
Combining methods to improve speaker verification decision
Dominique Genoud, Frédéric Bimbot, Guillaume Gravier, Gérard Chollet
Incremental speaker adaptation with minimum error discriminative training for speaker identification
Cesar Martín del Alamo, J. Alvarez, C. de la Torre, F. J. Poyatos, Luis Hernández
Frame level likelihood normalization for text-independent speaker identification using Gaussian mixture models
Konstantin P. Markov, Seiichi Nakagawa
On using prosodic cues in automatic language identification
Ann E. Thymé-Gobbel, Sandra E. Hutchins
Speaker recognition model using two-dimensional mel-cepstrum and predictive neural network
Tadashi Kitamura, Shinsai Takei
Unknown language rejection in language identification system
Hingkeung Kwan, Keikichi Hirose
Spoken language identification using large vocabulary speech recognition
James L. Hieronymus, Shubha Kadambe
Accent identification
Carlos Teixeira, Isabel M. Trancoso, António Serralheiro
Comparison of text-independent speaker recognition methods on telephone speech with acoustic mismatch
Sarel van Vuuren
On the sources of inter- and intra-speaker variability in the acoustic dynamics of speech
Xue Yang, J. Bruce Millar, Iain Macleod
Language identification with inaccurate string matching
Kay M. Berkling, Etienne Barnard
Robust prosodic features for speaker identification
M. J. Carey, E. S. Parris, H. Lloyd-Thomas, S. J. Bennett
Text independent speaker identification on noisy environments by means of self organizing maps
E. Monte, J. Hernando, X. Miró, A. Adolf
Language identification using language-dependent phonemes and language-independent speech units
Paul Dalsgaard, Ove Andersen, Hanne Hesselager, Bojan Petek
Adding the affective dimension: a new look in speech analysis and synthesis
Klaus R. Scherer
Ethological theory and the expression of emotion in the voice
John J. Ohala
Synthesizing emotions in speech: is it time to get excited?
Iain R. Murray, John L. Arnott
Recognizing emotion in speech
Frank Dellaert, Thomas Polzin, Alex Waibel
Emotions in time domain synthesis
Barbara Heuft, Thomas Portele, Monika Rauth
Word class driven synthesis of prosodic annotations
Simon Arnfield
Dynamical modelling of vowel sounds as a synthesis tool
M. Banbrook, S. McLaughlin
Emotional speech elicited using computer games
Tom Johnstone
Automatic statistical analysis of the signal and prosodic signs of emotion in speech
Roddy Cowie, Ellen Douglas-Cowie
A study on task-independent subword selection and modeling for speech recognition
Chin-Hui Lee, Biing-Hwang Juang, Wu Chou, J. J. Molina-Perez
Simultaneous ANN feature and HMM recognizer design using string-based minimum classification error (MCE) training
Mazin G. Rahim, Chin-Hui Lee
Quantizing mixture-weights in a tied-mixture HMM
Sunil K. Gupta, Frank K. Soong, Raziel Haimi-Cohen
Variance compensation within the MLLR framework for robust speech recognition and speaker adaptation
M. J. F. Gales, D. Pye, P. C. Woodland
Maximum-likelihood stochastic matching approach to non-linear equalization for robust speech recognition
A. C. Surendran, Chin-Hui Lee, Mazin G. Rahim
Estimation of channel bias for telephone speech recognition
Jen-Tzung Chien, Hsiao-Chuan Wang, Lee-Min Lee
Synthesis of English intonation using explicit models of reading and spontaneous speech
M. E. Johnson
Implementation and evaluation of a model for synthesis of Swedish intonation
Merle Horne, Marcus Filipsson
Natural prosody generation for domain specific text-to-speech systems
Nobuyuki Katae, Shinta Kimura
Improving text-to-speech synthesis
Mark Tatham, Eric Lewis
Synthesis of stressed speech from isolated neutral speech using HMM-based models
Sahar E. Bou-Ghazale, John H. L. Hansen
Modeling segment intonation for Slovene TTS system
Ales Dobnikar
Word predictability after hesitations: a corpus-based study
Elizabeth Shriberg, Andreas Stolcke
Interruptions and intonation
Li-chiung Yang
On not recognizing disfluencies in dialogue
Robin J. Lickley, Ellen Gurman Bard
A theory of word frequencies and its application to dialogue move recognition
Phil Garner, Sue Browning, Roger Moore, Martin Russell
Utterance units and grounding in spoken dialogue
David R. Traum, Peter A. Heeman
Coordinating turn-taking with gaze
David G. Novick, Brian Hansen, Karen Ward
BABEL: an Eastern European multi-language database
Peter Roach, Simon Arnfield, William J. Barry, J. Baltova, Marian Boldea, Adrian Fourcin, W. Gonet, Ryszard Gubrynowicz, E. Hallum, Lori Lamel, Krzysztof Marasek, Alain Marchal, E. Meister, Klára Vicsi
USTC95 - a Putonghua corpus
Ren-Hua Wang, Deyu Xia, Jinfu Ni, Bicheng Liu
Telephone data collection using the World Wide Web
Edward Hurley, Joseph Polifroni, James Glass
The "SIVA" speech database for speaker verification: description and evaluation
M. Falcone, A. Gallo
A multi-level description of date expressions in German telephone speech
Christoph Draxler
Viterbi search visualization using Vista: a generic performance visualization tool
Robert H. Halstead Jr., Ben Serridge, Jean-Manuel Van Thong, William Goldenthal
A multilingual phonetic representation and analysis system for different speech databases
Toomas Altosaar, Matti Karjalainen, Martti Vainio
FRESCO: the French telephone speech data collection - part of the European SpeechDat(M) project
D. Langmann, Reinhold Haeb-Umbach, Louis Boves, E. den Os
Predicting the out-of-vocabulary rate and the required vocabulary size for speech processing applications
Johannes Müller, Holger Stahl, Manfred Lang
AMULET: automatic MUltisensor speech labelling and event tracking: study of the spatio-temporal correlations in voiceless plosive production
Nathalie Parlangeau, Alain Marchal
Constructing multi-level speech database for spontaneous speech processing
Minsoo Hahn, Sanghun Kim, Jung-Chul Lee, Yong-Ju Lee
Preliminaries to a Romanian speech database
Marian Boldea, Alin Doroga, Tiberiu Dumitrescu, Maria Pescaru
Labelled data bank of spoken standard German: the Kiel corpus of read/spontaneous speech
Klaus J. Kohler
SAPPHIRE: an extensible speech analysis and recognition tool based on Tcl/Tk
Lee Hetherington, Michael McCandless
Automatic detection of topic boundaries and keywords in arbitrary speech using incremental reference interval-free continuous DP
Jiro Kiyama, Yoshiaki Itoh, Ryuichi Oka
Very-large-vocabulary Mandarin voice message file retrieval using speech queries
Bo-Ren Bai, Lee-Feng Chien, Lin-Shan Lee
Gandalf - a Swedish telephone speaker verification database
H. Melin
The DCIEM map task corpus: spontaneous dialogue under sleep deprivation and drug treatment
Ellen Gurman Bard, C. Sotillo, A. H. Anderson, M. M. Taylor
The Nemours database of dysarthric speech
Xavier Menéndez-Pidal, James B. Polikoff, Shirley M. Peters, Jennie E. Leonzio, H. T. Bunnell
POST: parallel object-oriented speech toolkit
Jean Hennebert, Dijana Petrovska Delacrétaz
Channel and noise normalization using affine transformed cepstrum
Xiaoyu Zhang, Richard J. Mammone
Spectral estimation and normalisation for robust speech recognition
Tom Claes, Fei Xie, Dirk van Compernolle
Trellis encoded vector quantization for robust speech recognition
Wu Chou, Nambi Seshadri, Mazin G. Rahim
Phone clustering using the Bhattacharyya distance
Brian Mak, Etienne Barnard
Variability of Lombard effects under different noise conditions
Atsushi Wakao, Kazuya Takeda, Fumitada Itakura
Lombard effect compensation and noise suppression for noisy Lombard speech recognition
Sang-mun Chi, Yung-Hwan Oh
The use of shibboleth words for automatically classifying speakers by dialect
A. W. F. Huggins, Yogen Patel
Data collection of Japanese dialects and its influence into speech recognition
Ikuo Kudo, Takao Nakama, Tomoko Watanabe, Reiko Kameyama
Statistical dialect classification based on mean phonetic features
David R. Miller, James Trischitta
Norwegian numerals: a challenge to automatic speech recognition
Knut Kvale
Evaluation of the Telefónica I+D natural numbers recognizer over different dialects of Spanish from Spain and America
C. de la Torre, J. Caminero-Gil, J. Alvarez, Cesar Martín del Alamo, Luis Hernández-Gómez
Rhythmic constraints on English stress timing
Fred Cummins, Robert F. Port
On the interaction of clash, focus and phonological phrasing
Irene Vogel, Steve Hoskins
On the quantal nature of speech timing
Gunnar Fant, Anita Kruckenberg
Differential perception of tonal contours through the syllable
David House
Pitch, loudness, and segmental duration correlates: towards a model for the phonetic aspects of Finnish prosody
Martti Vainio, Toomas Altosaar
Prosodic manipulation system of speech material for perceptual experiments
Nobuaki Minematsu, Seiichi Nakagawa, Keikichi Hirose
Clustered language models with context-equivalent states
J. P. Ueberla, I. R. Gransden
Modeling of contextual effects and its application to word spotting
Yuji Yonezawa, Masato Akagi
A new keyword spotting algorithm with pre-calculated optimal thresholds
J. Junkawitsch, L. Neubauer, Harald Höge, Günther Ruske
Detection of ambiguous portions of signal corresponding to OOV words or misrecognized portions of input
Roxane Lacouture, Yves Normandin
Techniques for approximating a trigram language model
Fabio Brugnara, Marcello Federico
Unsupervised and incremental speaker adaptation under adverse environmental conditions
Keizaburo Takagi, Koichi Shinoda, Hiroaki Hattori, Takao Watanabe
An adaptive-beam pruning technique for continuous speech recognition
Hugo Van hamme, Filip Van Aelten
Data based filter design for RASTA-like channel normalization in ASR
Carlos Avendano, Sarel van Vuuren, Hynek Hermansky
A comparison of time conditioned and word conditioned search techniques for large vocabulary speech recognition
S. Ortmanns, Hermann Ney, Frank Seide, I. Lindam
Language-model look-ahead for large vocabulary speech recognition
S. Ortmanns, Hermann Ney, A. Eiden
A new search algorithm in segmentation lattices of speech signals
Jean-Luc Husson, Yves Laprie
LR-parser-driven Viterbi search with hypotheses merging mechanism using context-dependent phone models
Tomokazu Yamada, Shigeki Sagayama
Discrete-utterance recognition with a fast match based on total data reduction
Jan Nouza
On-line garbage modeling with discriminant analysis for utterance verification
J. Caminero-Gil, C. de la Torre, L. Villarrubia, Cesar Martín del Alamo, Luis Hernández
Cheating with imperfect transcripts
Paul Placeway, John Lafferty
Novel training method for classifiers used in speaker adaptation
Naoto Iwahashi
Large vocabulary word recognition based on a graph-structured dictionary
Katsuki Minamino
A word graph based n-best search in continuous speech recognition
Bach-Hiep Tran, Frank Seide, Volker Steinbiss
Viterbi beam search with layered bigrams
David M. Goblirsch
A wave decoder for continuous speech recognition
Eric Buhrke, Wu Chou, Qiru Zhou
Long term on-line speaker adaptation for large vocabulary dictation
Eric Thelen
Incremental generation of word graphs
Gerhard Sagerer, Heike Rautenstrauch, Gernot A. Fink, Bernd Hildebrandt, A. Jusek, Franz Kummert
Improvement in n-best search for continuous speech recognition
Irina Illina, Yifan Gong
Sethos: the UPC speech understanding system
Antonio Bonafonte, José B. Mariño, Albino Nogueiras
Segmental search for continuous speech recognition
Pietro Laface, Luciano Fissore, A. Maro, Franco Ravera
An investigation into the generation of mouth shapes for a talking head
A. P. Breen, E. Bowers, W. Welsh
A text-to-audiovisual-speech synthesizer for French
Bertrand Le Goff, Christian Benoît
Analysis of head movements and its role in spoken dialogue
Yuri Iwano, Shioya Kageyama, Emi Morikawa, Shu Nakazato, Katsuhiko Shirai
RWC multimodal database for interactions by integration of spoken language and visual information
Satoru Hayamizu, Osamu Hasegawa, Katunobu Itou, Katuhiko Sakaue, Kazuyo Tanaka, Shigeki Nagaya, Masayuki Nakazawa, T. Endoh, Fumio Togawa, Kenji Sakamoto, Kazuhiko Yamamoto
About the relationship between eyebrow movements and F0 variations
Christian Cavé, Isabelle Guaïtella, Roxane Bertrand, Serge Santi, Françoise Harlay, Robert Espesser
How many words is a picture really worth?
Laurel Fais, Kyung-ho Loken-Kim, Tsuyoshi Morimoto
Visual synthesis of source acoustic speech through Kohonen neural networks
A. Lagana, F. Lavagetto, A. Storace
Audio-visual speech perception without speech cues
Helena M. Saldaña, David B. Pisoni, Jennifer M. Fellowes, Robert E. Remez
Multilingual speech recognition at Dragon Systems
Jim Barnett, A. Corrada, G. Gao, Larry Gillick, Yoshiko Ito, S. Lowe, L. Manganaro, Barbara Peskin
Multi-lingual phoneme recognition exploiting acoustic-phonetic similarities of sounds
Joachim Köhler
Japanese speech databases for robust speech recognition
Atsushi Nakamura, Shoichi Matsunaga, Tohru Shimizu, Masahiro Tonomura, Yoshinori Sagisaka
Spoken language processing in a multilingual context
Lori Lamel, Martine Adda-Decker, Jean-Luc Gauvain, Gilles Adda
Multilingual human-computer interactions: from information access to language learning
Victor Zue, Stephanie Seneff, Joseph Polifroni, Helen Meng, James Glass
Speedata: multilingual spoken data entry
U. Ackermann, B. Angelini, Fabio Brugnara, Marcello Federico, D. Giuliani, R. Gretter, G. Lazzari, H. Niemann
Head automata for speech translation
Hiyan Alshawi
Word clustering with parallel spoken language corpora
Ye-Yi Wang, John Lafferty, Alex Waibel
Toward translating Korean speech into other languages
Jae-Woo Yang, Youngjik Lee
VERBMOBIL: the evolution of a complex large speech-to-speech translation system
Thomas Bub, Johannes Schwinn
Translation of conversational speech with JANUS-II
Alon Lavie, Alex Waibel, Lori Levin, Donna Gates, Marsal Gavaldà, Torsten Zeppenfeld, Puming Zhan, Oren Glickman
Pseudo-articulatory representations in speech synthesis and recognition
William H. Edmondson, Jon P. Iles, Dorota J. Iskra
Synthesis of initial (/s/-) stop-liquid clusters using HLsyn
David R. Williams
Synthesis of trill
Chilin Shih
Phone-based speech synthesis with neural network and articulatory control
W. K. Lo, P. C. Ching
Analysis of ten vowel sounds across gender and regional/cultural accent
P. Martland, Sandra P. Whiteside, Steve W. Beet, L. Baghai-Ravary
Speech morphing by gradually changing spectrum parameter and fundamental frequency
Masanobu Abe
The multi-lag-window method for robust extended-range F0 determination
Edouard Geoffrois
Nonlinear estimation of DEGG signals with applications to speech pitch detection
Kenneth E. Barner
Pitch analysis methods for cross-speaker comparison
John A. Maidment, M. Luisa Garcia-Lecumberri
Continuous adaptation of linear models with impulsive excitation
Steve W. Beet, L. Baghai-Ravary
Quantitative analysis of the local speech rate and its application to speech synthesis
Sumio Ohno, Masamichi Fukumiya, Hiroya Fujisaki
A fast and reliable rate of speech detector
Jan P. Verhasselt, Jean-Pierre Martens
JANUS-II: towards spontaneous Spanish speech recognition
Puming Zhan, Klaus Ries, Marsal Gavaldà, Donna Gates, Alon Lavie, Alex Waibel
Reduced semi-continuous models for large vocabulary continuous speech recognition in Dutch
Kris Demuynck, Jacques Duchateau, Dirk van Compernolle
Validating different flexible vocabulary approaches on the Swiss French Polyphone and Polyvar databases
Andrei Constantinescu, Olivier Bornet, Gilles Caloz, Gérard Chollet
Use of a reliability coefficient in noise cancelling by neural net and weighted matching algorithms
Nestor Becerra Yoma, Fergus R. McInnes, Mervyn A. Jack
Likelihood normalization using an ergodic HMM for continuous speech recognition
Kazuhiko Ozeki
Dynamic control of a production model
Laurence Candille, Henri Méloni
Speech recognition using sub-word units dependent on phonetic contexts of both training and recognition vocabularies
Hiroaki Hattori, Eiko Yamada
Hidden Markov models merging acoustic and articulatory information to automatic speech recognition
Bruno Jacob, Christine Senac
Creation of unseen triphones from diphones and monophones using a speech production approach
Mats Blomberg, Kjell Elenius
Speaker-independent dictation of Chinese speech with 32k vocabulary
Bo Xu, Bing Ma, Shuwu Zhang, Fei Qu, Taiyi Huang
Using accent-specific pronunciation modelling for robust speech recognition
J. J. Humphries, P. C. Woodland, D. Pearce
Dictionary learning for spontaneous speech recognition
Tilo Sloboda, Alex Waibel
Comparison of channel normalisation techniques for automatic speech recognition over the phone
Johan de Veth, Louis Boves
Anchor point detection for continuous speech recognition in Spanish: the spotting of phonetic events
Manuel A. Leandro, José M. Pardo
Cepstral compensation by polynomial approximation for environment-independent speech recognition
Bhiksha Raj, Evandro Bacci Gouvêa, Pedro J. Moreno, Richard M. Stern
Effect of speech coders on speech recognition performance
B. T. Lilly, Kuldip K. Paliwal
Wavelet transforms for non-uniform speech recognition systems
Léonard Janer, Josep Martí, Climent Nadeu, Eduardo Lleida-Solano
A binaural model as a front-end for isolated word recognition
Tsuyoshi Usagawa, Markus Bodden, Klaus Rateitschek
A new speech enhancement: speech stream segregation
Hiroshi G. Okuno, Tomohiro Nakatani, Takeshi Kawabata
Non-segmental analysis and synthesis based on a speech database
Andrew Slater, John Coleman
Microsegment synthesis - economic principles in a low-cost solution
Ralf Benzmüller, William J. Barry
Whistler: a trainable text-to-speech system
X. D. Huang, Alex Acero, J. Adcock, H. W. Hon, J. Goldsmith, J. Liu, Mike Plumpe
Generation of multiple synthesis inventories by a bootstrapping procedure
Thomas Portele, Karl-Heinz Stöber, Horst Meyer, Wolfgang Hess
Modeling segmental duration in German text-to-speech synthesis
Bernd Möbius, Jan P. H. van Santen
Autolabelling Japanese ToBI
Nick Campbell
General phrase speaker verification using sub-word background models and likelihood-ratio scoring
S. Parthasarathy, Aaron E. Rosenberg
Unknown-multiple signal source clustering problem using ergodic HMM and applied to speaker classification
J. Murakami, M. Sugiyama, H. Watanabe
GMM and ARVM cooperation and competition for text-independent speaker recognition on telephone speech
J.-L. Le Floch, C. Montacié, M.-J. Caraty
Selective use of the speech spectrum and a VQGMM method for speaker identification
Qiguang Lin, Ea-Ee Jan, ChiWei Che, Dong-Suk Yuk, James L. Flanagan
Speaker verification through large vocabulary continuous speech recognition
Michael Newman, Larry Gillick, Yoshiko Ito, Don McAllaster, Barbara Peskin
Predictive neural networks in text independent speaker verification: an evaluation on the SIVA database
Andrea Paoloni, Susanna Ragazzini, G. Ravaioli
Durational characteristics of Hindi consonant clusters
Nisheeth Shrotriya, Rajesh Verma, Sunil K. Gupta, S. S. Agrawal
The use of wavelet transforms in phoneme recognition
Beng T. Tan, Minyue Fu, Andrew Spray, Phillip Dermody
Acoustic properties of phonemes in continuous speech for different speaking rate
Hisao Kuwabara
Prosodic parameterization of spoken Japanese based on a model of the generation process of F0 contours
Hiroya Fujisaki, Sumio Ohno
A logistic regression model for detecting prominences
Arman Maghbouleh
High-quality prosodic modification of speech signals
Beat Pfister
On the syllable structures of Chinese relating to speech recognition
Jialu Zhang
Can a moraic nasal occur word-initially in Japanese?
Takashi Otake, Kiyoko Yoneyama
Perceptual assimilation of American English vowels by Japanese listeners
Winifred Strange, Reiko Akahane-Yamada, B. H. Fitzgerald, R. Kubo
Context and speaker effects in the perceptual assimilation of German vowels by American listeners
Winifred Strange, Ocke-Schwen Bohn, S. A. Trent, M. C. McNair, K. C. Bielec
Examination of a perceptual non-native speech contrast: pharyngealized/non-pharyngealized discrimination by French-speaking adults
Mohamed Zahid
Context-dependent relevance of burst and transitions for perceived place in stops: it's in production, not perception
Roel Smits
The perception of morae in long vowels: comparison among Japanese, Korean and English speakers
Ryoji Baba, Kaori Omuro, Hiromitsu Miyazono, Tsuyoshi Usagawa, Masahiko Higuchi
Juncture cues to disfluency
Robin J. Lickley
Effects of duration and formant movement on vowel perception
James R. Sawusch
Benchmarking human performance for continuous speech recognition
N. Deshmukh, R. J. Duncan, A. Ganapathiraju, J. Picone
Intelligibility of speech with filtered time trajectories of spectral envelopes
Takayuki Arai, Misha Pavel, Hynek Hermansky, Carlos Avendano
Perceptual use of vowel and speaker information in breath sounds
Douglas H. Whalen, Sonya M. Sheffert
The role of neighborhood relative frequency in spoken word recognition
Philippe Mousty, Monique Radeau, Ronald Peereman, Paul Bertelson
Transitional probability and phoneme monitoring
James M. McQueen, Mark A. Pitt
Identification of vowel features from French stop bursts
Anne Bonneau
Listening in a second language
Z. S. Bond, Thomas J. Moore, Beverley Gable
Perception of lexical tone across languages: evidence for a linguistic mode of processing
Denis Burnham, Elizabeth Francis, Di Webster, Sudaporn Luksaneeyanawin, Chayada Attapaiboon, Francisco Lacerda, Peter Keller
Acoustic correlates to the effects of talker variability on the perception of English /r/ and /l/ by Japanese listeners
James S. Magnuson, Reiko Akahane-Yamada