[INTERSPEECH-2015] INTERSPEECH 2015, 16th Annual Conference of the International Speech Communication Association, Dresden, Germany, September 6-10, 2015; ISSN: 1990-9770; ISCA Archive, http://www.isca-speech.org/archive/interspeech_2015
Names in boldface refer to first authors; names in CAPITAL letters refer to keynote and invited papers. Full papers can be accessed from the abstracts. Some papers are accompanied by audio examples and other multimedia files, which can also be accessed from the respective abstracts.
Active Perception in Human and Machine Speech Communication
Advanced Crowdsourcing for Speech and Beyond
Automatic Speaker Verification Spoofing and Countermeasures (ASVspoof 2015)
Biosignal-based Spoken Communication
Interspeech 2015 Computational Paralinguistics ChallengE (ComParE): Degree of Nativeness, Parkinson's & Eating Condition
Robust Speech Processing Using Observation Uncertainty and Uncertainty Propagation
Speech and Language Processing of Children's Speech
Speech Science in End-user Applications
Zero Resource Speech Technologies: Unsupervised Discovery of Linguistic Units
Show and Tell Session 1-4
Acoustic Model Adaptation and Training
Acoustic Modeling and Decoding Methods for Speech Recognition
Adaptive Methods for LVCSR
Advances in iVector-based Speaker Verification
Audio Signal Analysis and Representation
Bandwidth Extension, Quality and Intelligibility Measures
Brain- and Other Biosignal-based Spoken Communication
Computational Models of Human Speech Perception
Conversational Interaction
Deep Neural Networks for Speech Synthesis
Deep Neural Networks in Language and Accent Recognition
Deep Neural Networks in Speaker Recognition
Detecting and Predicting Mental and Social Disorders
Dialogue and Discourse
Discriminative Acoustic Training Methods for ASR
Distant and Reverberant Speech Recognition
Emotion 1, 2
Evaluation of Speech Synthesis
Fast Efficient and Scalable Computing for Neural Nets
Feature Extraction and Modeling with Neural Networks
Information and Metadata Extraction from Speech
L1/L2 Speech Perception and Acquisition
L2 Speech Perception and Production
Language Modeling for Conversational Speech
Language Modeling for Speech Recognition
LVCSR Systems and Applications
Mining and Annotation of Spoken and Multimodal Resources
Neural Networks: Novel Architectures for LVCSR
Neural Networks and Speaker Adaptation
Neural Networks for Language Modeling
Phonetic Recognition: Novel Approaches and Understanding
Pronunciation, Prosody and Audiovisual Features and Models
Prosody 1-3
Prosody Modeling for Speech Synthesis
Robust Speech Recognition: Adaptation
Robust Speech Recognition: Features, Far-field and Reverberation
Robustness in Speaker Recognition
Social Signals, Assessment and Paralinguistics
Source Separation and Computational Auditory Scene Analysis
Speaker and Language Recognition
Speaker Recognition and Diarization 1-3
Speech Analysis and Representation 1-3
Speech and Audio Segmentation and Classification; Voice Activity Detection 1-3
Speech and Cognition in Adverse Conditions
Speech and Hearing Disorders
Speech and Music Analysis
Speech Enhancement
Speech Intelligibility Enhancement
Speech Production Data and Models
Speech Production Measurements and Analyses
Speech Recognition: Evaluation and Low-resource Languages
Speech Recognition: Technologies and Systems for New Applications
Speech Synthesis 1-3
Speech Transmission
Spoken Dialogue Systems
Spoken Language Processing Spoken Language Understanding 1-3 Spoken Term Detection, Spoken MT & Transliteration Spoken Translation & Speech-to-speech
Statistical Parametric Speech Synthesis
Stress, Load, and Pathologies
Syllables and Segments 1, 2
Topics in Paralinguistics
Varieties of Speech
Voice Conversion
Voice Quality
Beckman, Mary E.: "The emergence of compositional structure in language evolution and development" (abstract).
Sarikaya, Ruhi: "The technology powering personal digital assistants" (abstract).
Amunts, Katrin: "The HBP-atlas — concept, perspectives, and application for language and speech research" (abstract).
Scherer, Klaus: "Voices of power, passion, and personality" (abstract).
Sainath, Tara N. / Weiss, Ron J. / Senior, Andrew / Wilson, Kevin W. / Vinyals, Oriol: "Learning the speech front-end with raw waveform CLDNNs", 1-5.
Bhargava, Mayank / Rose, Richard: "Architectures for deep neural network based acoustic models defined over windowed speech waveforms", 6-10.
Palaz, Dimitri / Magimai-Doss, Mathew / Collobert, Ronan: "Analysis of CNN-based speech recognition system using raw speech as input", 11-15.
Ogawa, Tetsuji / Ueda, Kenshiro / Katsurada, Kouichi / Kobayashi, Tetsunori / Nitta, Tsuneo: "Bilinear map of filter-bank outputs for DNN-based speech recognition", 16-20.
Lin, Payton / Lyu, Dau-Cheng / Chang, Yun-Fan / Tsao, Yu: "Speech recognition with temporal neural networks", 21-25.
Golik, Pavel / Tüske, Zoltán / Schlüter, Ralf / Ney, Hermann: "Convolutional neural networks for acoustic modeling of raw time signal in LVCSR", 26-30.
Glavitsch, Ulrike / He, Lei / Dellwo, Volker: "Stable and unstable intervals as a basic segmentation procedure of the speech signal", 31-35.
Windmann, Andreas / Šimko, Juraj / Wagner, Petra: "Polysyllabic shortening and word-final lengthening in English", 36-40.
Eriksson, Anders / Heldner, Mattias: "The acoustics of word stress in English as a function of stress level and speaking style", 41-45.
Zahner, Katharina / Pohl, Muna / Braun, Bettina: "Pitch accent distribution in German infant-directed speech", 46-50.
Mixdorff, Hansjörg / Cossio-Mercado, Christian / Hönemann, Angelika / Gurlekian, Jorge / Evin, Diego / Torres, Humberto: "Acoustic correlates of perceived syllable prominence in German", 51-55.
Simonetti, Simone / Kim, Jeesun / Davis, Chris: "Cross-modality matching of linguistic and emotional prosody", 56-59.
Michalsky, Jan: "Pitch scaling as a perceptual cue for questions in German", 924-928.
Reichel, Uwe D. / Mády, Katalin / Beňuš, Štefan: "Parameterization of prosodic headedness", 929-933.
Sarma, Biswajit Dev / Sarmah, Priyankoo / Lalhminghlui, Wendy / Prasanna, S. R. Mahadeva: "Detection of Mizo tones", 934-937.
Repp, Sophie / Rosin, Lena: "The intonation of echo wh-questions", 938-942.
Jabeen, Farhat / Bögel, Tina / Butt, Miriam: "Immediately postverbal questions in Urdu", 943-947.
Mády, Katalin: "Prosodic (non-)realisation of broad, narrow and contrastive focus in Hungarian: a production and a perception study", 948-952.
Beňuš, Štefan / Reichel, Uwe D. / Šimko, Juraj: "F0 discontinuity as a marker of prosodic boundary strength in Lombard speech", 953-957.
Gendrot, Cédric / Adda-Decker, Martine / Wu, Yaru: "Comparing journalistic and spontaneous speech: prosodic and spectral analysis", 958-962.
Schauffler, Nadja / Schweitzer, Katrin: "Rhythm influences the tonal realisation of focus", 963-967.
Andreeva, Bistra / Möbius, Bernd / Demenko, Grazyna / Zimmerer, Frank / Jügler, Jeanin: "Linguistic measures of pitch range in Slavic and Germanic languages", 968-972.
Qiu, Chunan / Liang, Jie: "The effect of stress on vowel space in Daxi Hakka Chinese", 973-977.
O'Reilly, Maria / Chasaide, Ailbhe Ní: "Declination, peak height and pitch level in declaratives and questions of South Connaught Irish", 978-982.
Sarmah, Priyankoo / Dihingia, Leena / Lalhminghlui, Wendy: "Contextual variation of tones in Mizo", 983-986.
Wochner, Daniela / Schlegel, Jana / Dehé, Nicole / Braun, Bettina: "The prosodic marking of rhetorical questions in German", 987-991.
Zorilă, Tudor-Cătălin / Stylianou, Yannis: "A fast algorithm for improved intelligibility of speech-in-noise based on frequency and time domain energy reallocation", 60-64.
Koutsogiannaki, Maria / Petkov, Petko N. / Stylianou, Yannis: "Intelligibility enhancement of casual speech for reverberant environments inspired by clear speech properties", 65-69.
Jemaa, A. Ben / Mechergui, N. / Courtois, G. / Mudry, A. / Djaziri-Larbi, S. / Turki, M. / Lissek, H. / Jaidane, M.: "Intelligibility enhancement of vocal announcements for public address systems: a design for all through a presbycusis pre-compensation filter", 70-74.
Schepker, Henning / Hülsmeier, David / Rennies, Jan / Doclo, Simon: "Model-based integration of reverberation for noise-adaptive near-end listening enhancement", 75-79.
Rottschäfer, Sebastian / Buschmeier, Hendrik / Welbergen, Herwin van / Kopp, Stefan: "Online Lombard adaptation in incremental speech synthesis", 80-84.
Jokinen, Emma / Remes, Ulpu / Alku, Paavo: "Comparison of Gaussian process regression and Gaussian mixture models in spectral tilt modelling for intelligibility enhancement of telephone speech", 85-89.
Kumar, Naveen / Narayanan, Shrikanth S.: "A discriminative reliability-aware classification model with applications to intelligibility classification in pathological speech", 90-94.
Orozco-Arroyave, J. R. / Hönig, Florian / Arias-Londoño, J. D. / Vargas-Bonilla, J. F. / Skodda, Sabine / Rusz, J. / Nöth, Elmar: "Voiced/unvoiced transitions in speech as a potential bio-marker to detect Parkinson's disease", 95-99.
Villa-Cañas, T. / Arias-Londoño, J. D. / Orozco-Arroyave, J. R. / Vargas-Bonilla, J. F. / Nöth, Elmar: "Low-frequency components analysis in running speech for the automatic detection of Parkinson's disease", 100-104.
Vásquez-Correa, J. C. / Arias-Vergara, T. / Orozco-Arroyave, J. R. / Vargas-Bonilla, J. F. / Arias-Londoño, J. D. / Nöth, Elmar: "Automatic detection of Parkinson's disease from continuous speech recorded in non-controlled noise conditions", 105-109.
Cummins, Nicholas / Sethu, Vidhyasaharan / Epps, Julien / Krajewski, Jarek: "Relevance vector machine for depression prediction", 110-114.
Marchi, Erik / Schuller, Björn / Baron-Cohen, Simon / Golan, Ofer / Bölte, Sven / Arora, Prerna / Häb-Umbach, Reinhold: "Typicality and emotion in the voice of children with autism spectrum condition: evidence across three languages", 115-119.
Liu, Chunxi / Xu, Puyang / Sarikaya, Ruhi: "Deep contextual language understanding in spoken dialogue systems", 120-124.
Tam, Yik-Cheung / Shi, Yangyang / Chen, Hunk / Hwang, Mei-Yuh: "RNN-based labeled data generation for spoken language understanding", 125-129.
Vukotic, Vedran / Raymond, Christian / Gravier, Guillaume: "Is it time to switch to word embedding and recurrent neural networks for spoken language understanding?", 130-134.
Ravuri, Suman / Stolcke, Andreas: "Recurrent neural network and LSTM models for lexical utterance classification", 135-139.
Lu, Hung-tsung / Liou, Yuan-ming / Lee, Hung-yi / Lee, Lin-shan: "Semantic retrieval of personal photos using a deep autoencoder fusing visual features with speech annotations represented as word/paragraph vectors", 140-144.
Morchid, Mohamed / Dufour, Richard / Matrouf, Driss: "A comparison of normalization techniques applied to latent space representations for speech analytics", 145-149.
Sheikh, Imran / Illina, Irina / Fohr, Dominique: "Study of entity-topic models for OOV proper name retrieval", 1344-1348.
Boutin, Simon / Tremblay, Réal / Cardinal, Patrick / Peters, Doug / Dumouchel, Pierre: "Audio quotation marks for natural language understanding", 1349-1352.
Yang, Xiaohao / Liu, Jia: "Using word confusion networks for slot filling in spoken language understanding", 1353-1357.
Chiu, Justin / Miao, Yajie / Black, Alan W. / Rudnicky, Alexander I.: "Distributed representation-based spoken word sense induction", 1358-1362.
Shen, Sheng-syun / Lee, Hung-yi / Li, Shang-wen / Zue, Victor / Lee, Lin-shan: "Structuring lectures in massive open online courses (MOOCs) for efficient learning by linking similar sections and predicting prerequisites", 1363-1367.
Charlet, Delphine / Damnati, Géraldine / Trione, Jérémy: "News talk-show chaptering with journalistic genres", 1368-1372.
Ramanarayanan, Vikram / Chen, Lei / Leong, Chee Wee / Feng, Gary / Suendermann-Oeft, David: "An analysis of time-aggregated and time-series features for scoring different aspects of multimodal presentation data", 1373-1377.
Racca, David N. / Jones, Gareth J. F.: "Incorporating prosodic prominence evidence into term weights for spoken content retrieval", 1378-1382.
Chen, Kuan-Yu / Liu, Shih-Hung / Wang, Hsin-Min / Chen, Berlin / Chen, Hsin-Hsi: "Leveraging word embeddings for spoken document summarization", 1383-1387.
Renkens, Vincent / hamme, Hugo Van: "Mutually exclusive grounding for weakly supervised non-negative matrix factorisation", 1388-1392.
Bastianelli, Emanuele / Croce, Danilo / Basili, Roberto / Nardi, Daniele: "Using semantic maps for robust natural language interaction with robots", 1393-1397.
Luan, Yi / Watanabe, Shinji / Harsham, Bret: "Efficient learning for spoken language understanding tasks with word embedding based pre-training", 1398-1402.
Ferreira, Emmanuel / Jabaian, Bassam / Lefèvre, Fabrice: "Zero-shot semantic parser for spoken language understanding", 1403-1407.
Tafforeau, Jeremie / Artieres, Thierry / Favre, Benoit / Bechet, Frederic: "Adapting lexical representation and OOV handling from written to spoken language with word embedding", 1408-1412.
Yang, Xiaohao / Liu, Jia: "Dialog state tracking using long short-term memory neural networks", 1800-1804.
Lopes, José / Salvi, Giampiero / Skantze, Gabriel / Abad, Alberto / Gustafson, Joakim / Batista, Fernando / Meena, Raveesh / Trancoso, Isabel: "Detecting repetitions in spoken dialogue systems using phonetic distances", 1805-1809.
Crook, Paul A. / Robichaud, Jean-Philippe / Sarikaya, Ruhi: "Multi-language hypotheses ranking and domain tracking for open domain dialogue systems", 1810-1814.
Solanki, Vijay / Vinciarelli, Alessandro / Stuart-Smith, Jane / Smith, Rachel: "Measuring mimicry in task-oriented conversations: degree of mimicry is related to task difficulty", 1815-1819.
Laskowski, Kornel: "Auto-imputing radial basis functions for neural-network turn-taking models", 1820-1824.
Llimona, Quim / Luque, Jordi / Anguera, Xavier / Hidalgo, Zoraida / Park, Souneil / Oliver, Nuria: "Effect of gender and call duration on customer satisfaction in call center big data", 1825-1829.
Callejas, Zoraida / Griol, David: "Using profile similarity to measure agreement in personality perception", 1830-1834.
Nakamura, Shizuka / Watanabe, Miki / Yoshikawa, Yuichiro / Ogawa, Kohei / Ishiguro, Hiroshi: "Relieving mental stress of speakers using a tele-operated robot in foreign language speech education", 1835-1838.
Gravano, Agustín / Beňuš, Štefan / Levitan, Rivka / Hirschberg, Julia: "Backward mimicry and forward influence in prosodic contour choice in standard American English", 1839-1843.
Chowdhury, Shammur Absar / Danieli, Morena / Riccardi, Giuseppe: "The role of speakers and context in classifying competition in overlapping speech", 1844-1848.
Christodoulides, George / Avanzi, Mathieu: "Automatic detection and annotation of disfluencies in spoken French corpora", 1849-1853.
Hakkani-Tür, Dilek / Ju, Yun-Cheng / Zweig, Geoffrey / Tur, Gokhan: "Clustering novel intents in a conversational interaction system with semantic parsing", 1854-1858.
Despotovic, Vladimir / Walter, Oliver / Haeb-Umbach, Reinhold: "Semantic analysis of spoken input using Markov logic networks", 1859-1863.
Švec, Jan / Chýlek, Adam / Šmídl, Luboš: "Hierarchical discriminative model for spoken language understanding based on convolutional neural network", 1864-1868.
Chen, Yun-Nung / Wang, William Yang / Rudnicky, Alexander I.: "Learning semantic hierarchy with distributed representations for unsupervised spoken language understanding", 1869-1873.
Székely, Éva / Keane, Mark T. / Carson-Berndsen, Julie: "The effect of soft, modal and loud voice levels on entrainment in noisy conditions", 150-154.
Cowan, Benjamin R. / Branigan, Holly P.: "Does voice anthropomorphism affect lexical alignment in speech-based human-computer dialogue?", 155-159.
Ma, Ning / Brown, Guy J. / Gonzalez, Jose A.: "Exploiting top-down source models to improve binaural localisation of multiple sources in reverberant environments", 160-164.
Schymura, Christopher / Winter, Fiete / Kolossa, Dorothea / Spors, Sascha: "Binaural sound source localisation and tracking using a dynamic spherical head model", 165-169.
May, Tobias / Bentsen, Thomas / Dau, Torsten: "The role of temporal resolution in modulation-based speech segregation", 170-174.
Kayser, Hendrik / Spille, Constantin / Marquardt, Daniel / Meyer, Bernd T.: "Improving automatic speech recognition in spatially-aware hearing aids", 175-179.
Gomez, Randy / Ivanchuk, Levko / Nakamura, Keisuke / Mizumoto, Takeshi / Nakadai, Kazuhiro: "Dereverberation for active human-robot communication robust to speaker's face orientation", 180-184.
Chen, Nanxin / Qian, Yanmin / Yu, Kai: "Multi-task learning for text-dependent speaker verification", 185-189.
Stafylakis, Themos / Kenny, Patrick / Alam, Md. Jahangir / Kockmann, Marcel: "JFA for speaker recognition with random digit strings", 190-194.
Knyazeva, Elena / Wisniewski, Guillaume / Bredin, Hervé / Yvon, François: "Structured prediction for speaker identification in TV series", 195-199.
Cumani, Sandro / Laface, Pietro / Kulsoom, Farzana: "Speaker recognition by means of acoustic and phonetically informed GMMs", 200-204.
Panda, Ashish: "A fast approach to psychoacoustic model compensation for robust speaker recognition in additive noise", 205-209.
Doroshin, Danila / Lubimov, Nikolay / Nastasenko, Marina / Kotov, Mikhail: "Blind score normalization method for PLDA based speaker recognition", 210-213.
Novoselov, Sergey / Pekhovsky, Timur / Kudashev, Oleg / Mendelev, Valentin S. / Prudnikov, Alexey: "Non-linear PLDA for i-vector speaker verification", 214-218.
Vaquero, Carlos / Rodríguez, Patricia: "On the need of template protection for voice authentication", 219-223.
Kelly, Finnian / Hansen, John H. L.: "Evaluation and calibration of short-term aging effects in speaker verification", 224-228.
Chen, Liping / Lee, Kong Aik / Ma, Bin / Guo, Wu / Li, Haizhou / Dai, Li-Rong: "Phone-centric local variability vector for text-constrained speaker verification", 229-233.
George, Kuruvachan K. / Kumar, C. Santhosh / Ramachandran, K. I. / Panda, Ashish: "Cosine distance features for robust speaker verification", 234-238.
Shiota, Sayaka / Villavicencio, Fernando / Yamagishi, Junichi / Ono, Nobutaka / Echizen, Isao / Matsui, Tomoko: "Voice liveness detection algorithms based on pop noise caused by human breath for automatic speaker verification", 239-243.
Hurmalainen, Antti / Saeidi, Rahim / Virtanen, Tuomas: "Noise robust speaker recognition with convolutive sparse coding", 244-248.
Alam, Md. Jahangir / Kenny, Patrick / Stafylakis, Themos: "Combining amplitude and phase-based features for speaker verification with short duration utterances", 249-253.
Lee, Kong Aik / Larcher, Anthony / Wang, Guangsen / Kenny, Patrick / Brümmer, Niko / Leeuwen, David van / Aronowitz, Hagai / Kockmann, Marcel / Vaquero, Carlos / Ma, Bin / Li, Haizhou / Stafylakis, Themos / Alam, Md. Jahangir / Swart, Albert / Perez, Javier: "The RedDots data collection for speaker recognition", 2996-3000.
He, Yongjun / Chen, Chen / Han, Jiqing: "Noise-robust speaker recognition based on morphological component analysis", 3001-3005.
Nautsch, Andreas / Saeidi, Rahim / Rathgeb, Christian / Busch, Christoph: "Analysis of mutual duration and noise effects in speaker recognition: benefits of condition-matched cohort selection in score normalization", 3006-3010.
Fredes, Josué / Novoa, José / Poblete, Victor / King, Simon / Stern, Richard M. / Yoma, Néstor Becerra: "Robustness to additive noise of locally-normalized cepstral coefficients in speaker verification", 3011-3015.
Shokouhi, Navid / Hansen, John H. L.: "Probabilistic linear discriminant analysis for robust speaker identification in co-channel speech", 3016-3020.
Wang, Hongcui / Jin, Di / Li, Lantian / Dang, Jianwu: "Community detection with manifold learning on speaker i-vector space for Chinese", 3021-3025.
Yella, Sree Harsha / Stolcke, Andreas: "A comparison of neural network feature transforms for speaker diarization", 3026-3030.
Shapiro, Ilya / Rabin, Neta / Opher, Irit / Lapidot, Itshak: "Clustering short push-to-talk segments", 3031-3035.
Fedorova, Anna / Glembek, Ondřej / Kinnunen, Tomi / Matějka, Pavel: "Exploring ANN back-ends for i-vector based speaker age estimation", 3036-3040.
Bansé, Désiré / Doddington, George R. / Garcia-Romero, Daniel / Godfrey, John J. / Greenberg, Craig S. / Hernández-Cordero, Jaime / Howard, John M. / Martin, Alvin F. / Mason, Lisa P. / McCree, Alan / Reynolds, Douglas A.: "Analysis of the second phase of the 2013-2014 i-vector machine learning challenge", 3041-3045.
Martin, Alvin F. / Greenberg, Craig S. / Howard, John M. / Bansé, Désiré / Doddington, George R. / Hernández-Cordero, Jaime / Mason, Lisa P.: "NIST language recognition evaluation — plans for 2015", 3046-3050.
Desplanques, Brecht / Demuynck, Kris / Martens, Jean-Pierre: "Factor analysis for speaker segmentation and improved speaker diarization", 3081-3085.
Inoue, Koji / Wakabayashi, Yukoh / Yoshimoto, Hiromasa / Takanashi, Katsuya / Kawahara, Tatsuya: "Enhanced speaker diarization with detection of backchannels using eye-gaze information in poster conversations", 3086-3090.
Delgado, Héctor / Anguera, Xavier / Fredouille, Corinne / Serrano, Javier: "Novel clustering selection criterion for fast binary key speaker diarization", 3091-3095.
Sell, Gregory / Garcia-Romero, Daniel / McCree, Alan: "Speaker diarization with i-vectors from DNN senone posteriors", 3096-3099.
Woubie, Abraham / Luque, Jordi / Hernando, Javier: "Using voice-quality measurements with prosodic and spectral features for speaker diarization", 3100-3104.
Madikeri, Srikanth / Himawan, Ivan / Motlicek, Petr / Ferras, Marc: "Integrating online i-vector extractor with information bottleneck based speaker diarization system", 3105-3109.
Raitio, Tuomo / Juvela, Lauri / Suni, Antti / Vainio, Martti / Alku, Paavo: "Phase perception of the glottal excitation of vocoded speech", 254-258.
Sitaram, Sunayana / Jeblee, Serena / Black, Alan W.: "Using acoustics to improve pronunciation for synthesis of low resource languages", 259-263.
Inai, Tadashi / Hara, Sunao / Abe, Masanobu / Ijima, Yusuke / Miyazaki, Noboru / Mizuno, Hideyuki: "Sub-band text-to-speech combining sample-based spectrum with statistically generated spectrum", 264-268.
Lu, Heng / Zhang, Wei / Shao, Xu / Zhou, Quan / Lei, Wenhui / Zhou, Hongbin / Breen, Andrew: "Pruning redundant synthesis units based on static and delta unit appearance frequency", 269-273.
Ohtani, Yamato / Nasu, Yu / Morita, Masahiro / Akamine, Masami: "Emotional transplant in statistical speech synthesis based on emotion additive model", 274-278.
Xie, Xurong / Liu, Xunying / Wang, Lan / Su, Rongfeng: "Generalized variable parameter HMMs based acoustic-to-articulatory inversion", 279-283.
Mohammadi, Seyed Hamidreza / Kain, Alexander: "Semi-supervised training of a voice conversion mapping function using a joint-autoencoder", 284-288.
Huber, Stefan / Roebel, Axel: "On glottal source shape parameter transformation using a novel deterministic and stochastic speech analysis and synthesis system", 289-293.
Huang, Yi-Chin / Wu, Chung-Hsien / Shie, Ming-Ge: "Fluent personalized speech synthesis with prosodic word-level spontaneous speech generation", 294-298.
Oshima, Yuji / Takamichi, Shinnosuke / Toda, Tomoki / Neubig, Graham / Sakti, Sakriani / Nakamura, Satoshi: "Non-native speech synthesis preserving speaker individuality based on partial correction of prosodic and phonetic characteristics", 299-303.
Toman, Markus / Pucher, Michael: "Evaluation of state mapping based foreign accent conversion", 304-308.
Wu, Zhizheng / King, Simon: "Minimum trajectory error training for deep neural networks, combined with stacked bottleneck features", 309-313.
Wang, Yang / Yang, Minghao / Wen, Zhengqi / Tao, Jianhua: "Combining extreme learning machine and decision tree for duration prediction in HMM based speech synthesis", 2197-2201.
Ninh, Duy Khanh / Yamashita, Yoichi: "F0 parameterization of glottalized tones for HMM-based Vietnamese TTS", 2202-2206.
Merritt, Thomas / Yamagishi, Junichi / Wu, Zhizheng / Watts, Oliver / King, Simon: "Deep neural network context embeddings for model selection in rich-context HMM synthesis", 2207-2211.
Chen, Bo / Chen, Zhehuai / Xu, Jiachen / Yu, Kai: "An investigation of context clustering for statistical speech synthesis with deep neural network", 2212-2216.
Watts, Oliver / Wu, Zhizheng / King, Simon: "Sentence-level control vectors for deep neural network speech synthesis", 2217-2221.
Betz, Simon / Wagner, Petra / Schlangen, David: "Micro-structure of disfluencies: basics for conversational speech synthesis", 2222-2226.
Szaszák, György / Beke, András / Olaszy, Gábor / Tóth, Bálint Pál: "Using automatic stress extraction from audio for improved prosody modelling in speech synthesis", 2227-2231.
Lanchantin, Pierre / Veaux, Christophe / Gales, Mark J. F. / King, Simon / Yamagishi, Junichi: "Reconstructing voices within the multiple-average-voice-model framework", 2232-2236.
Thu, Ye Kyaw / Pa, Win Pa / Ni, Jinfu / Shiga, Yoshinori / Finch, Andrew / Hori, Chiori / Kawai, Hisashi / Sumita, Eiichiro: "HMM based Myanmar text to speech system", 2237-2241.
Takaki, Shinji / Kim, SangJin / Yamagishi, Junichi / Kim, JongJin: "Multiple feed-forward deep neural networks for statistical parametric speech synthesis", 2242-2246.
Yao, Kaisheng / Zweig, Geoffrey: "Sequence-to-sequence neural net models for grapheme-to-phoneme conversion", 3330-3334.
Kay, Rosie / Watts, Oliver / Chicote, Roberto Barra / Mayo, Cassie: "Knowledge versus data in TTS: evaluation of a continuum of synthesis systems", 3335-3339.
Eger, Steffen: "Improving G2P from Wiktionary and other (web) resources", 3340-3344.
Ding, Chuang / Zhu, Pengcheng / Xie, Lei: "BLSTM neural networks for speech driven head motion synthesis", 3345-3349.
Tobing, Patrick Lumban / Kobayashi, Kazuhiro / Toda, Tomoki / Neubig, Graham / Sakti, Sakriani / Nakamura, Satoshi: "Articulatory controllable speech modification based on Gaussian mixture models with direct waveform modification using spectrum differential", 3350-3354.
Cornu, Thomas Le / Milner, Ben: "Reconstructing intelligible audio speech from visual speech features", 3355-3359.
Sitaram, Sunayana / Parlikar, Alok / Anumanchipalli, Gopala Krishna / Black, Alan W.: "Universal grapheme-based speech synthesis", 3360-3364.
Wester, Mirjam / Aylett, Matthew / Tomalin, Marcus / Dall, Rasmus: "Artificial personality and disfluency", 3365-3369.
Evrard, Marc / Delalez, Samuel / d'Alessandro, Christophe / Rilliard, Albert: "Comparison of chironomic stylization versus statistical modeling of prosody for expressive speech synthesis", 3370-3374.
Ardaillon, Luc / Degottex, Gilles / Roebel, Axel: "A multi-layer F0 model for singing voice synthesis using a B-spline representation with intuitive controls", 3375-3379.
Jauk, Igor / Bonafonte, Antonio / Lopez-Otero, Paula / Docio-Fernandez, Laura: "Creating expressive synthetic voices by unsupervised clustering of audiobooks", 3380-3384.
Aryal, Sandesh / Gutierrez-Osuna, Ricardo: "Articulatory-based conversion of foreign accents with deep neural networks", 3385-3389.
Matoušek, Jindřich / Tihelka, Daniel: "Anomaly-based annotation errors detection in TTS corpora", 314-318.
Schweitzer, Katrin / Gärtner, Markus / Riester, Arndt / Rösiger, Ina / Eckart, Kerstin / Kuhn, Jonas / Dogil, Grzegorz: "Analysing automatic descriptions of intonation with ICARUS", 319-323.
Chen, Nancy F. / Tong, Rong / Wee, Darren / Lee, Peixuan / Ma, Bin / Li, Haizhou: "iCALL corpus: Mandarin Chinese spoken by non-native speakers of European descent", 324-328.
Wong, Ka Ho / Yeung, Yu Ting / Chan, Edwin H. Y. / Wong, Patrick C. M. / Levow, Gina-Anne / Meng, Helen: "Development of a Cantonese dysarthric speech corpus", 329-333.
Arsikere, Harish / Patil, Sonal / Kumar, Ranjeet / Shrivastava, Kundan / Deshmukh, Om: "Stylex: a corpus of educational videos for research on speaking styles and their impact on engagement and learning", 334-338.
Can, Doğan / Atkins, David C. / Narayanan, Shrikanth S.: "A dialog act tagging approach to behavioral coding: a case study of addiction counseling conversations", 339-343.
Vapnarsky, Valentina / Barras, Claude / Becquey, Cédric / Doukhan, David / Adda-Decker, Martine / Lamel, Lori: "Analysing rhythm in ritual discourse in Yucatec Maya using automatic speech alignment", 344-348.
Hasan, Madina / Doddipatla, Rama / Hain, Thomas: "Noise-matched training of CRF based sentence end detection models", 349-353.
Kuang, Jianjing / Liberman, Mark: "The effect of spectral slope on pitch perception", 354-358.
Bao, Honghao / Lu, Wenhuan / Honda, Kiyoshi / Wei, Jianguo / Fang, Qiang / Dang, Jianwu: "Combined cine- and tagged-MRI for tracking landmarks on the tongue surface", 359-363.
Barbier, Guillaume / Boë, Louis-Jean / Captier, Guillaume / Laboissière, Rafael: "Human vocal tract growth: a longitudinal study of the development of various anatomical structures", 364-368.
Sivaraman, Ganesh / Mitra, Vikramjit / Tiede, Mark K. / Saltzman, Elliot / Goldstein, Louis / Espy-Wilson, Carol: "Analysis of coarticulated speech using estimated articulatory trajectories", 369-373.
Barbier, Guillaume / Perrier, Pascal / Ménard, Lucie / Payan, Yohan / Tiede, Mark K. / Perkell, Joseph S.: "Speech planning in 4-year-old children versus adults: acoustic and articulatory analyses", 374-378.
Kaburagi, Tokihiko: "Morphological and acoustic analysis of the vocal tract using a multi-speaker volumetric MRI dataset", 379-383.
Skordilis, Zisis Iason / Ramanarayanan, Vikram / Goldstein, Louis / Narayanan, Shrikanth S.: "Experimental assessment of the tongue incompressibility hypothesis during speech production", 384-388.
Fér, Radek / Matějka, Pavel / Grézl, František / Plchot, Oldřich / Černocký, Jan: "Multilingual bottleneck features for language recognition", 389-393.
McCree, Alan / Garcia-Romero, Daniel: "DNN senone MAP multinomial i-vectors for phonotactic language recognition", 394-397.
Song, Yan / Hong, Xinhai / Jiang, Bing / Cui, Ruilian / McLoughlin, Ian / Dai, Li-Rong: "Deep bottleneck network based i-vector representation for language identification", 398-402.
Lozano-Diez, Alicia / Zazo-Candil, Ruben / Gonzalez-Dominguez, Javier / Toledano, Doroteo T. / Gonzalez-Rodriguez, Joaquin: "An end-to-end approach to language identification in short utterances using convolutional neural networks", 403-407.
Hautamäki, Ville / Siniscalchi, Sabato Marco / Behravan, Hamid / Salerno, Valerio Mario / Kukanov, Ivan: "Boosting universal speech attributes classification with deep neural network for foreign accent characterization", 408-412.
Geng, Wang / Li, Jie / Zhang, Shanshan / Cai, Xinyuan / Xu, Bo: "Multilingual tandem bottleneck feature for language identification", 413-417.
Asaei, Afsaneh / Cernak, Milos / Bourlard, Hervé: "On compressibility of neural network phonological features for low bit rate speech coding", 418-422.
Lenarczyk, Michał: "Robust and accurate LSF location with Laguerre method", 423-427.
Issing, Jochen / Färber, Nikolaus / German, Reinhard: "Interactivity-aware playout adaptation", 428-432.
Issing, Jochen / Färber, Nikolaus / German, Reinhard: "Advanced time shrinking using a drop classifier based on codec features", 433-437.
Hines, Andrew / Gillen, Eoin / Harte, Naomi: "Measuring and monitoring speech quality for voice over IP with POLQA, ViSQOL and P.563", 438-442.
Gallardo, Laura Fernández / Möller, Sebastian: "Towards the prediction of human speaker identification performance from measured speech quality", 443-447.
Levit, M. / Stolcke, Andreas / Subba, R. / Parthasarathy, S. / Chang, S. / Xie, S. / Anastasakos, T. / Dumoulin, Benoit: "Personalization of word-phrase-entity language models", 448-452.
Kobayashi, Akio / Ichiki, Manon / Oku, Takahiro / Onoe, Kazuo / Sato, Shoei: "Discriminative bilinear language modeling for broadcast transcriptions", 453-457.
Ma, Xi / Wang, Xiaoxi / Wang, Dong / Zhang, Zhiyong: "Recognize foreign low-frequency words with similar pairs", 458-462.
Masumura, Ryo / Asami, Taichi / Oba, Takanobu / Masataki, Hirokazu / Sakauchi, Sumitaka / Ito, Akinori: "Combinations of various language model technologies including data expansion and adaptation in spontaneous speech recognition", 463-467.
Aleksic, Petar / Ghodsi, Mohammadreza / Michaely, Assaf / Allauzen, Cyril / Hall, Keith / Roark, Brian / Rybach, David / Moreno, Pedro: "Bringing contextual information to Google speech recognition", 468-472.
Vasserman, Lucy / Schogol, Vlad / Hall, Keith: "Sequence-based class tagging for robust transcription in ASR", 473-477.
Schuller, Björn / Steidl, Stefan / Batliner, Anton / Hantke, Simone / Hönig, Florian / Orozco-Arroyave, J. R. / Nöth, Elmar / Zhang, Yue / Weninger, Felix: "The INTERSPEECH 2015 computational paralinguistics challenge: nativeness, Parkinson's & eating condition", 478-482.
Hönig, Florian: "The degree of nativeness sub-challenge: the data" (abstract).
Montacié, Claude / Caraty, Marie-José: "Phrase accentuation verification and phonetic variation measurement for the degree of nativeness sub-challenge", 483-487.
Ribeiro, Eugénio / Ferreira, Jaime / Olcoz, Julia / Abad, Alberto / Moniz, Helena / Batista, Fernando / Trancoso, Isabel: "Combining multiple approaches to predict the degree of nativeness", 488-492.
Black, Matthew P. / Bone, Daniel / Skordilis, Zisis Iason / Gupta, Rahul / Xia, Wei / Papadopoulos, Pavlos / Chakravarthula, Sandeep Nallan / Xiao, Bo / Segbroeck, Maarten Van / Kim, Jangwon / Georgiou, Panayiotis G. / Narayanan, Shrikanth S.: "Automated evaluation of non-native English pronunciation quality: combining knowledge- and data-driven features at multiple time scales", 493-497.
Orozco-Arroyave, J. R.: "The Parkinson's condition sub-challenge: the data" (abstract).
Sztahó, Dávid / Kiss, Gábor / Vicsi, Klára: "Estimating the severity of Parkinson's disease from speech using linear regression and database partitioning", 498-502.
Zlotnik, Alexander / Montero, Juan M. / San-Segundo, Rubén / Gallardo-Antolín, Ascensión: "Random forest-based prediction of Parkinson's disease progression using acoustic, ASR and intelligibility features", 503-507.
An, Guozhen / Brizan, David Guy / Ma, Min / Morales, Michelle / Syed, Ali Raza / Rosenberg, Andrew: "Automatic recognition of unified Parkinson's disease rating from speech with acoustic, i-vector and phonotactic features", 508-512.
Hahm, Seongjun / Wang, Jun: "Parkinson's condition estimation using speech acoustic and inversely mapped articulatory data", 513-517.
Williamson, James R. / Quatieri, Thomas F. / Helfer, Brian S. / Perricone, Joseph / Ghosh, Satrajit S. / Ciccarelli, Gregory / Mehta, Daryush D.: "Segment-dependent dynamics in predicting Parkinson's disease", 518-522.
Batliner, Anton: "The eating condition sub-challenge: the data" (abstract).
Prasad, Abhay / Ghosh, Prasanta Kumar: "Automatic classification of eating conditions from speech using acoustic feature selection and a set of hierarchical support vector machine classifiers", 884-888.
Wagner, Johannes / Seiderer, Andreas / Lingenfelser, Florian / André, Elisabeth: "Combining hierarchical classification with frequency weighting for the recognition of eating conditions", 889-893.
Pir, Dara / Brown, Theodore: "Acoustic group feature selection using wrapper method for automatic eating condition recognition", 894-898.
Pellegrini, Thomas: "Comparing SVM, softmax, and shallow neural networks for eating condition classification", 899-903.
Milde, Benjamin / Biemann, Chris: "Using representation learning and out-of-domain data for a paralinguistic speech task", 904-908.
Kaya, Heysem / Karpov, Alexey A. / Salah, Albert Ali: "Fisher vectors with cascaded normalization for paralinguistic analysis", 909-913.
Kim, Jangwon / Nasir, Md. / Gupta, Rahul / Segbroeck, Maarten Van / Bone, Daniel / Black, Matthew P. / Skordilis, Zisis Iason / Yang, Zhaojun / Georgiou, Panayiotis G. / Narayanan, Shrikanth S.: "Automatic estimation of Parkinson's disease severity from diverse speech tasks", 914-918.
Grósz, Tamás / Busa-Fekete, Róbert / Gosztolya, Gábor / Tóth, László: "Assessing the degree of nativeness and Parkinson's condition using Gaussian processes and deep rectifier neural networks", 919-923.
Steidl, Stefan: "The INTERSPEECH 2015 computational paralinguistics challenge: a summary of results" (abstract).
Batliner, Anton: "Wrapping up: the story of the ComParE challenges, what we learned and where to go", 4105.
Houghton, S. M. / Champion, Colin J. / Weber, Philip: "Recognition of voiced sounds with a continuous state HMM", 523-527.
Zeng, Xiangyu / Yin, Shi / Wang, Dong: "Learning speech rate in speech recognition", 528-532.
Chen, Guoguo / Xu, Hainan / Wu, Minhua / Povey, Daniel / Khudanpur, Sanjeev: "Pronunciation and silence probability modeling for ASR", 533-537.
Davel, Marelie / Barnard, Etienne / Heerden, Charl van / Hartmann, William / Karakos, Damianos / Schwartz, Richard / Tsakalidis, Stavros: "Exploring minimal pronunciation modeling for low resource languages", 538-542.
Zheng, Hao / Yang, Zhanlei / Qiao, Liwei / Li, Jianping / Liu, Wenju: "Attribute knowledge integration for speech recognition based on multi-task learning neural networks", 543-547.
Marcheret, Etienne / Potamianos, Gerasimos / Vopicka, Josef / Goel, Vaibhava: "Detecting audio-visual synchrony using deep neural networks", 548-552.
Kalantari, Shahram / Dean, David / Ghaemmaghami, Houman / Sridharan, Sridha / Fookes, Clinton: "Cross database training of audio-visual hidden Markov models for phone recognition", 553-557.
Kalantari, Shahram / Dean, David / Sridharan, Sridha: "Incorporating visual information for spoken term detection", 558-562.
Ninomiya, Hiroshi / Kitaoka, Norihide / Tamura, Satoshi / Iribe, Yurie / Takeda, Kazuya: "Integration of deep bottleneck features for audio-visual speech recognition", 563-567.
Kakouros, Sofoklis / Räsänen, Okko: "Automatic detection of sentence prominence in speech using predictability of word-level acoustic features", 568-572.
Cernak, Milos / Honnet, Pierre-Edouard: "An empirical model of emphatic word detection", 573-577.
Ning, Yishuang / Wu, Zhiyong / Lou, Xiaoyan / Meng, Helen / Jia, Jia / Cai, Lianhong: "Using tilt for automatic emphasis detection with Bayesian networks", 578-582.
Bai, Linxue / Jančovič, Peter / Russell, Martin / Weber, Philip: "Analysis of a low-dimensional bottleneck neural network representation of speech for modelling speech dynamics", 583-587.
Uchida, Hidetsugu / Saito, Daisuke / Minematsu, Nobuaki / Hirose, Keikichi: "Statistical acoustic-to-articulatory mapping unified with speaker normalization based on voice conversion", 588-592.
Pappagari, Raghavendra Reddy / Vijayan, Karthika / Murty, K. Sri Rama: "Analysis of features from analytic representation of speech using MP-ABX measures", 593-597.
Loweimi, Erfan / Barker, Jon / Hain, Thomas: "Source-filter separation of speech signal in the phase domain", 598-602.
Maia, Ranniery / Stylianou, Yannis / Akamine, Masami: "A maximum likelihood approach to the detection of moments of maximum excitation and its application to high-quality speech parameterization", 603-607.
Liberatore, Christopher / Aryal, Sandesh / Wang, Zelun / Polsley, Seth / Gutierrez-Osuna, Ricardo: "SABR: sparse, anchor-based representation of the speech signal", 608-612.
Csapó, Tamás Gábor / Németh, Géza: "Automatic transformation of irregular to regular voice by residual analysis and synthesis", 613-617.
Preuß, Simon / Birkholz, Peter: "Optical sensor calibration for electro-optical stomatography", 618-622.
Abari, Kálmán / Csapó, Tamás Gábor / Tóth, Bálint Pál / Olaszy, Gábor: "From text to formants — indirect model for trajectory prediction based on a multi-speaker parallel speech database", 623-627.
Hsu, Chung-Chien / Chien, Jen-Tzung / Chi, Tai-Shih: "Layered nonnegative matrix factorization for speech separation", 628-632.
Laporte, Catherine / Ménard, Lucie: "Robust tongue tracking in ultrasound images: a multi-hypothesis approach", 633-637.
Websdale, Danny / Cornu, Thomas Le / Milner, Ben: "Objective measures for predicting the intelligibility of spectrally smoothed speech with artificial excitation", 638-642.
Mertens, Christophe / Grenez, Francis / Viallet, François / Ghio, Alain / Skodda, Sabine / Schoentgen, Jean: "Vocal tremor analysis via AM-FM decomposition of empirical modes of the glottal cycle length time series", 766-770.
Godoy, Elizabeth / Malyska, Nicolas / Quatieri, Thomas F.: "Estimating lower vocal tract features with closed-open phase spectral analyses", 771-775.
Houghton, S. M. / Champion, Colin J.: "Inductive implementation of segmental HMMs as CS-HMMs", 776-780.
Meenakshi, G. Nisha / Ghosh, Prasanta Kumar: "A discriminative analysis within and across voiced and unvoiced consonants in neutral and whispered speech in multiple Indian languages", 781-785.
Tsai, T. J. / Stolcke, Andreas: "Aligning meeting recordings via adaptive fingerprinting", 786-790.
Zöhrer, Matthias / Peharz, Robert / Pernkopf, Franz: "On representation learning for artificial bandwidth extension", 791-795.
Gowda, Dhananjaya / Saeidi, Rahim / Alku, Paavo: "AM-FM based filter bank analysis for estimation of spectro-temporal envelopes and its application for speaker recognition in noisy reverberant environments", 1166-1170.
Drugman, Thomas / Stylianou, Yannis: "Fast and accurate phase unwrapping", 1171-1175.
Lu, Xugang / Shen, Peng / Tsao, Yu / Hori, Chiori / Kawai, Hisashi: "Sparse representation with temporal max-smoothing for acoustic event detection", 1176-1180.
Rachel, G. Anushiya / Vijayalakshmi, P. / Nagarajan, T.: "Estimation of glottal closure instants from telephone speech using a group delay-based approach that considers speech signal as a spectrum", 1181-1185.
Montaño, Raúl / Alías, Francesc: "The role of prosody and voice quality in text-dependent categories of storytelling across languages", 1186-1190.
Hyafil, Alexandre / Cernak, Milos: "Neuromorphic based oscillatory device for incremental syllable boundary detection", 1191-1195.
Lee, Ann / Glass, James: "Mispronunciation detection without nonnative training data", 643-647.
Rasipuram, Ramya / Cernak, Milos / Nachen, Alexandre / Magimai-Doss, Mathew: "Automatic accentedness evaluation of non-native speech using phonetic and sub-phonetic posterior probabilities", 648-652.
Ma, Min / Evanini, Keelan / Loukina, Anastassia / Wang, Xinhao / Zechner, Klaus: "Using F0 contours to assess nativeness in a sentence repeat task", 653-657.
Lunsford, Rebecca / Heeman, Peter A.: "Using linguistic indicators of difficulty to identify mild cognitive impairment", 658-662.
Fontan, Lionel / Farinas, Jérôme / Ferrané, Isabelle / Pinquier, Julien / Aumont, Xavier: "Automatic intelligibility measures applied to speech signals simulating age-related hearing loss", 663-667.
Chakravarthula, Sandeep Nallan / Xiao, Bo / Imel, Zac E. / Atkins, David C. / Georgiou, Panayiotis G.: "Assessing empathy using static and dynamic behavior models based on therapist's language in addiction counseling", 668-672.
Liu, Yuzong / Iyer, Rishabh / Kirchhoff, Katrin / Bilmes, Jeff: "SVitchboard II and FiSVer I: high-quality limited-complexity corpora of conversational English speech", 673-677.
Kamper, Herman / Jansen, Aren / Goldwater, Sharon: "Fully unsupervised small-vocabulary speech recognition using a segmental Bayesian model", 678-682.
Tilk, Ottokar / Alumäe, Tanel: "LSTM for punctuation restoration in speech transcripts", 683-687.
Yılmaz, Emre / Baby, Deepak / hamme, Hugo Van: "Noise robust exemplar matching for speech enhancement: applications to automatic speech recognition", 688-692.
Gao, Yingming / Xie, Yanlu / Cao, Wen / Zhang, Jinsong: "A study on robust detection of pronunciation erroneous tendency based on deep neural network", 693-696.
Joshi, Shrikant / Deo, Nachiket / Rao, Preeti: "Vowel mispronunciation detection using DNN acoustic models with cross-lingual training", 697-701.
Kumar, Kshitiz / Bawab, Ziad Al / Zhao, Yong / Liu, Chaojun / Dumoulin, Benoit / Gong, Yifan: "Confidence-features and confidence-scores for ASR applications in arbitration and DNN speaker adaptation", 702-706.
Liu, Pengfei / Jameel, Shoaib / Lam, Wai / Ma, Bin / Meng, Helen: "Topic modeling for conference analytics", 707-711.
Sharma, Pulkit / Abrol, Vinayak / Dileep, A. D. / Sao, Anil Kumar: "Sparse coding based features for speech units classification", 712-715.
Niculescu, Andreea I. / Thai, Ngoc Thuy Huong / Ni, Chongjia / Lim, Boon Pang / Yeo, Kheng Hui / Banchs, Rafael E.: "Smarter driving with IDA, the intelligent driving assistant for Singapore", 716-717.
Yeo, Kheng Hui / Banchs, Rafael E.: "Talk it out: adding speech interaction to support informational and transactional applications on public touch-screen kiosks", 718-719.
D'Haro, Luis Fernando / Kim, Seokhwan / Banchs, Rafael E.: "Conversational agent and management tools for conference and tourism domain", 720-721.
Salimbajevs, Askars / Strigins, Jevgenijs: "Latvian speech-to-text transcription service", 722-723.
Gałka, Jakub / Grzybowska, Joanna / Igras, Magdalena / Jaciów, Paweł / Wajda, Kamil / Witkowski, Marcin / Ziółko, Mariusz: "System supporting speaker identification in emergency call center", 724-725.
Abdelali, Ahmed / Ali, Ahmed / Guzmán, Francisco / Stahlberg, Felix / Vogel, Stephan / Zhang, Yifan: "QAT2 — the QCRI advanced transcription and translation system", 726-727.
Stadtschnitzer, Michael / Schmidt, Christoph: "Implementation of a live dialectal media subtitling system", 728-729.
Bell, Peter / Lai, Catherine / Llewellyn, Clare / Birch, Alexandra / Sinclair, Mark: "A system for automatic broadcast news summarisation, geolocation and translation", 730-731.
Znotiņš, Artūrs / Polis, Kaspars / Darģis, Roberts: "Media monitoring system for Latvian radio and TV broadcasts", 732-733.
Assayag, Michel / Huang, Jonathan / Mamou, Jonathan / Pereg, Oren / Sahay, Saurav / Shamir, Oren / Stemmer, Georg / Wasserblat, Moshe: "Meeting assistant application", 734-735.
Ziółko, Bartosz / Jadczyk, Tomasz / Skurzok, Dawid / Żelasko, Piotr / Gałka, Jakub / Pędzimąż, Tomasz / Gawlik, Ireneusz / Pałka, Szymon: "SARMATA 2.0 automatic Polish language speech recognition system", 1062-1063.
Faria, Arlo / Riedhammer, Korbinian: "Remeeting — get more out of meetings", 1064-1065.
Masuda-Katsuse, Ikuyo: "Web application system for pronunciation practice by children with disabilities and to support cooperation of teachers and medical workers", 1066-1067.
Kaufhold, Caroline / Gamidov, Vadim / Kiessling, Andreas / Reinhard, Klaus / Nöth, Elmar: "PATSY — it's all about pronunciation!", 1068-1069.
Azarov, Elias / Vashkevich, Maxim / Likhachov, Denis / Petrovsky, Alexander: "Real-time pitch modification system for speech and singing voice", 1070-1071.
Duplessis, Guillaume Dubuisson / Béchade, Lucile / Sehili, Mohamed A. / Delaborde, Agnès / Letard, Vincent / Ligozat, Anne-Laure / Deléglise, Paul / Estève, Yannick / Rosset, Sophie / Devillers, Laurence: "Nao is doing humour in the CHIST-ERA joker project", 1072-1073.
Lange, Lisa / Pfeiffer, Bartholomäus / Duran, Daniel: "ABIMS — auditory bewildered interaction measurement system", 1074-1075.
Berkling, Kay / Pflaumer, Nadine / Coyplove, Alexei: "Phontasia — a game for training German orthography", 1874-1875.
Wong, Ka Ho / Leung, Wai Kim / Meng, Helen: "E-commu-book: an assistive technology for users with speech impairments", 1876-1877.
Röthlisberger, Martina / Karipidis, Iliana I. / Pleisch, Georgette / Dellwo, Volker / Richardson, Ulla / Brem, Silvia: "Swiss graphogame: concept and design presentation of a computerised reading intervention for children with high risk for poor reading outcomes", 1878-1879.
Pfab, Jakob / Jakob, Hanna / Späth, Mona / Draxler, Christoph: "Neolexon — a therapy app for patients with aphasia", 1880-1881.
Patil, Sonal / Arsikere, Harish / Deshmukh, Om: "Acoustic stress detection for improved navigation of educational videos", 1882-1883.
Anguera, Xavier: "Multimodal read-aloud ebooks for language learning", 1884-1885.
Besacier, Laurent / Gauthier, Elodie / Mangeot, Mathieu / Bretier, Philippe / Bagshaw, Paul / Rosec, Olivier / Moudenc, Thierry / Pellegrino, François / Voisin, Sylvie / Marsico, Egidio / Nocera, Pascal: "Speech technologies for African languages: example of a multilingual calculator for education", 1886-1887.
Lee, Kong Aik / Wang, Guangsen / Ng, Kam Pheng / Sun, Hanwu / Nguyen, Trung Hieu / Thai, Ngoc Thuy Huong / Ma, Bin / Li, Haizhou: "The RedDots platform for mobile crowd-sourcing of speech data", 2603-2604.
Arai, Takayuki: "Two extensions of Umeda and Teranishi's physical models of the human vocal tract", 2605-2606.
Budnik, Mateusz / Besacier, Laurent / Poignant, Johann / Bredin, Hervé / Barras, Claude / Stefas, Mickael / Bruneau, Pierrick / Tamisier, Thomas: "Collaborative annotation for person identification in TV shows", 2607-2608.
Kisler, Thomas / Schiel, Florian / Reichel, Uwe D. / Draxler, Christoph: "Phonetic/linguistic web services at BAS", 2609-2610.
Winkelmann, Raphael: "Managing speech databases with emuR and the EMU-webApp", 2611-2612.
Wankerl, Sebastian / Hönig, Florian / Batliner, Anton / Orozco-Arroyave, J. R. / Nöth, Elmar: "Visual comparison of speaker groups", 2613-2614.
Kumar, Rohit / Roy, Matthew E. / Hewavitharana, Sanjika / Mehay, Dennis N. / Zinovieva, Nina: "Tools for rapid customization of S2S systems for emergent domains", 2615-2616.
Metze, Florian / Riebling, Eric / Fosler-Lussier, Eric / Plummer, Andrew / Bates, Rebecca: "The speech recognition virtual kitchen turns one", 2617-2618.
Rennies, Jan / Volgenandt, Andreas / Schepker, Henning / Doclo, Simon: "Model-based adaptive pre-processing of speech for enhanced intelligibility in noise and reverberation", 2619-2620.
Möller, Sebastian / Westermann, Tilo: "Experiences with and new application ideas for the Interspeech app", 2621-2622.
Sityaev, Dmitry / Kumar, Praphul / Ramchander, Rajesh: "Traditional IVR and visual IVR — killing two birds with one stone", 2623-2624.
Itakura, Kousuke / Nishimuta, Izaya / Bando, Yoshiaki / Itoyama, Katsutoshi / Yoshii, Kazuyoshi: "Bayesian integration of sound source separation and speech recognition: a new approach to simultaneous speech recognition", 736-740.
Himawan, Ivan / Motlicek, Petr / Sridharan, Sridha / Dean, David / Tjondronegoro, Dian: "Channel selection in the short-time modulation domain for distant speech recognition", 741-745.
Dekkers, Gert / Waterschoot, Toon van / Vanrumste, Bart / Broeck, Bert Van Den / Gemmeke, Jort F. / hamme, Hugo Van / Karsmakers, Peter: "A multi-channel speech enhancement framework for robust NMF-based speech recognition for speech-impaired users", 746-750.
Kim, Chanwoo / Chin, Kean K.: "Sound source separation algorithm using phase difference and angle distribution modeling near the target", 751-755.
Ravanelli, Mirco / Omologo, Maurizio: "Contaminated speech training methods for robust DNN-HMM distant speech recognition", 756-760.
Miao, Yajie / Metze, Florian: "Distance-aware DNNs for robust speech recognition", 761-765.
Levy, Helena: "Perception and production of vowel contrasts in German learners of English", 796-800.
Tong, Rong / Chen, Nancy F. / Ma, Bin / Li, Haizhou: "Goodness of tone (GOT) for non-native Mandarin tone recognition", 801-805.
Jügler, Jeanin / Zimmerer, Frank / Möbius, Bernd / Draxler, Christoph: "The effect of high-variability training on the perception and production of French stops by German native speakers", 806-810.
Bao, Wenfu / Feng, Hui / Dang, Jianwu / Liu, Zhilei / Yu, Yang / Wang, Siyu: "Perception of Mandarin tones by native Tibetan speakers", 811-814.
Saha, Shambhu Nath / Mandal, Shyamal Kr. Das: "Study of acoustic correlates of English lexical stress produced by native (L1) Bengali speakers compared to native (L1) English speakers", 815-819.
Nagano-Madsen, Yasuko: "Prosodic phrasing unique to the acquisition of L2 intonation — an analysis of L2 Japanese intonation by L1 Swedish learners", 820-823.
Sarı, Leda / Gündoğdu, Batuhan / Saraçlar, Murat: "Fusion of LVCSR and posteriorgram based keyword search", 824-828.
Mendels, Gideon / Cooper, Erica / Soto, Victor / Hirschberg, Julia / Gales, Mark J. F. / Knill, Kate M. / Ragni, Anton / Wang, Haipeng: "Improving speech recognition and keyword search for low resource languages using web data", 829-833.
Domoto, Kentaro / Utsuro, Takehito / Sawada, Naoki / Nishizaki, Hiromitsu: "Two-step spoken term detection using SVM classifier trained with pre-indexed keywords based on ASR result", 834-838.
Zhang, Le / Karakos, Damianos / Hartmann, William / Hsiao, Roger / Schwartz, Richard / Tsakalidis, Stavros: "Enhancing low resource keyword spotting with automatically retrieved web documents", 839-843.
Bertero, Dario / Wang, Linlin / Chan, Ho Yin / Fung, Pascale: "A comparison between a DNN and a CRF disfluency detection and reconstruction system", 844-848.
Hough, Julian / Schlangen, David: "Recurrent neural networks for incremental disfluency detection", 849-853.
Hu, Qiong / Wu, Zhizheng / Richmond, Korin / Yamagishi, Junichi / Stylianou, Yannis / Maia, Ranniery: "Fusion of multiple parameterisations for DNN-based sinusoidal speech synthesis with multi-task learning", 854-858.
Achanta, Sivanand / Godambe, Tejas / Gangashetty, Suryakanth V.: "An investigation of recurrent neural network architectures for statistical parametric speech synthesis", 859-863.
Fan, Yuchen / Qian, Yao / Soong, Frank K. / He, Lei: "Sequence generation error (SGE) minimization based deep neural networks training for text-to-speech synthesis", 864-868.
Valentini-Botinhao, Cassia / Wu, Zhizheng / King, Simon: "Towards minimum perceptual error training for DNN-based speech synthesis", 869-873.
Song, Eunwoo / Kang, Hong-Goo: "Deep neural network-based statistical parametric speech synthesis system using improved time-frequency trajectory excitation model", 874-878.
Wu, Zhizheng / Swietojanski, Pawel / Veaux, Christophe / Renals, Steve / King, Simon: "A study of speaker adaptation for DNN-based speech synthesis", 879-883.
Wang, Yannan / Du, Jun / Dai, Li-Rong / Lee, Chin-Hui: "High-resolution acoustic modeling and compact language modeling of language-universal speech attributes for spoken language identification", 992-996.
Irtza, Saad / Sethu, Vidhyasaharan / Le, Phu Ngoc / Ambikairajah, Eliathamby / Li, Haizhou: "Phonemes frequency based PLLR dimensionality reduction for language recognition", 997-1001.
Cumani, Sandro / Plchot, Oldřich / Fér, Radek: "Exploiting i-vector posterior covariances for short-duration language recognition", 1002-1006.
Lykartsis, Athanasios / Weinzierl, Stefan: "Using the beat histogram for speech rhythm description and language identification", 1007-1011.
Saeidi, Rahim / Niemi, Tuija / Karppelin, Hanna / Pohjalainen, Jouni / Kinnunen, Tomi / Alku, Paavo: "Speaker recognition for speech under face cover", 1012-1016.
Rahman, Md. Hafizur / Kanagasundaram, Ahilan / Dean, David / Sridharan, Sridha: "Dataset-invariant covariance normalization for out-domain PLDA speaker verification", 1017-1021.
Xu, Longting / Lee, Kong Aik / Li, Haizhou / Yang, Zhen: "Sparse coding of total variability matrix", 1022-1026.
Cai, Weicheng / Li, Ming / Li, Lin / Hong, QingYang: "Duration dependent covariance regularization in PLDA modeling for speaker verification", 1027-1031.
Aronowitz, Hagai: "Exploiting supervector structure for speaker recognition trained on a small development set", 1032-1036.
Hong, QingYang / Li, Lin / Li, Ming / Huang, Ling / Wan, Lihong / Zhang, Jun: "Modified-prior PLDA and score calibration for duration mismatch compensation in speaker recognition system", 1037-1041.
Jelil, Sarfaraz / Das, Rohan Kumar / Sinha, Rohit / Prasanna, S. R. Mahadeva: "Speaker verification using Gaussian posteriorgrams on fixed phrase short utterances", 1042-1046.
Gallardo, Laura Fernández / Möller, Sebastian / Wagner, Michael: "Importance of intelligible phonemes for human speaker recognition in different channel bandwidths", 1047-1051.
Yamamoto, Hitoshi / Koshinaka, Takafumi: "Denoising autoencoder-based speaker feature restoration for utterances of short duration", 1052-1056.
Ribas, Dayana / Vincent, Emmanuel / Calvo, José Ramón: "Full multicondition training for robust i-vector based speaker recognition", 1057-1061.
Huang, Zhen / Siniscalchi, Sabato Marco / Chen, I-Fan / Li, Jinyu / Wu, Jiadong / Lee, Chin-Hui: "Maximum a posteriori adaptation of network parameters in deep models", 1076-1080.
Huang, Yan / Gong, Yifan: "Regularized sequence-level deep neural network model adaptation", 1081-1085.
Li, Xiangang / Wu, Xihong: "Modeling speaker variability using long short-term memory networks for speech recognition", 1086-1090.
Kumar, Kshitiz / Liu, Chaojun / Yao, Kaisheng / Gong, Yifan: "Intermediate-layer DNN adaptation for offline and session-based iterative speaker adaptation", 1091-1095.
Karthick B., Murali / Kolhar, Prateek / Umesh, S.: "Speaker adaptation of convolutional neural network using speaker specific subspace vectors of SGMM", 1096-1100.
Miao, Yajie / Metze, Florian: "On speaker adaptation of long short-term memory recurrent neural networks", 1101-1105.
Parisotto, Emilio / Ghassabeh, Youness A. / MacDonald, Matt J. / Cozma, Adelina / Pang, Elizabeth W. / Rudzicz, Frank: "Automatic identification of received language in MEG", 1106-1110.
Werff, Laurens van der / Guðnason, Jón / Jóhannsdóttir, Kamilla Rún: "Detection of cardiovascular reactivity in speech", 1111-1115.
Francois-Nienaber, Alex / Meltzer, Jed A. / Rudzicz, Frank: "Lateralization in emotional speech perception following transcranial direct current stimulation", 1116-1120.
Yang, Minda / Sheth, Sameer A. / Schevon, Catherine A. / McKhann II, Guy M. / Mesgarani, Nima: "Speech reconstruction from human auditory cortex with deep neural networks", 1121-1125.
Brumberg, Jonathan S. / Castro, Nichol / Rao, Akshatha: "Temporal dynamics of the speech readiness potential, and its use in a neural decoder of speech-motor intention", 1126-1130.
Heger, Dominic / Herff, Christian / Pesters, Adriana de / Telaar, Dominic / Brunner, Peter / Schalk, Gerwin / Schultz, Tanja: "Continuous speech recognition from ECoG", 1131-1135.
Chen, Yu-hsin / Lopez-Moreno, Ignacio / Sainath, Tara N. / Visontai, Mirkó / Alvarez, Raziel / Parada, Carolina: "Locally-connected and convolutional neural networks for small footprint speaker recognition", 1136-1140.
Garcia-Romero, Daniel / McCree, Alan: "Insights into deep neural networks for speaker recognition", 1141-1145.
Richardson, Fred / Reynolds, Douglas A. / Dehak, Najim: "A unified deep neural network for speaker and language recognition", 1146-1150.
Tian, Yao / Cai, Meng / He, Liang / Liu, Jia: "Investigation of bottleneck features and multilingual deep neural networks for speaker verification", 1151-1155.
Xing, Hua / Liu, Gang / Hansen, John H. L.: "Frequency offset correction in single sideband (SSB) speech by deep neural network for speaker verification", 1156-1160.
Zheng, Hao / Zhang, Shanshan / Liu, Wenju: "Exploring robustness of DNN/RNN for extracting speaker Baum-Welch statistics in mismatched conditions", 1161-1165.
Yoshimura, Takenori / Hashimoto, Kei / Nankaku, Yoshihiko / Tokuda, Keiichi: "Simultaneous optimization of multiple tree structures for factor analyzed HMM-based speech synthesis", 1196-1200.
Pouget, Maël / Hueber, Thomas / Bailly, Gérard / Baumann, Timo: "HMM training strategy for incremental speech synthesis", 1201-1205.
Takamichi, Shinnosuke / Toda, Tomoki / Black, Alan W. / Nakamura, Satoshi: "Modulation spectrum-constrained trajectory training algorithm for HMM-based speech synthesis", 1206-1210.
Black, Alan W. / Muthukumar, Prasanna Kumar: "Random forests for statistical speech synthesis", 1211-1215.
Hong, Doo Hwa / Lee, Joun Yeop / Jang, Se Young / Kim, Nam Soo: "Speaker adaptation using relevance vector regression for HMM-based expressive TTS", 1216-1220.
Tsiaras, Vassilis / Maia, Ranniery / Diakoloukas, Vassilis / Stylianou, Yannis / Digalakis, Vassilis: "Towards a linear dynamical model based speech synthesizer", 1221-1225.
Looze, Céline De / Vaughan, Brian / Kelly, Finnian / Kay, Alison: "Providing objective metrics of team communication skills via interpersonal coordination mechanisms", 1226-1230.
Lee, Donghyeon / Lee, Jinsik / Kim, Eun-Kyoung / Lee, Jaewon: "Dialog act modeling for virtual personal assistant applications using a small volume of labeled data and domain knowledge", 1231-1235.
Zainkó, Csaba / Bartalis, Mátyás / Németh, Géza / Olaszy, Gábor: "A polyglot domain optimised text-to-speech system for railway station announcements", 1236-1240.
Mandal, Partho / Jain, Shalini / Ojha, Gaurav / Shukla, Anupam: "Development of Hindi speech recognition system of agricultural commodities using deep neural network", 1241-1245.
Fehér, Thomas / Freitag, Michael / Gruber, Christian: "Real-time audio signal enhancement for hands-free speech applications", 1246-1250.
Erro, D. / Hernaez, Inma / Alonso, Agustin / García-Lorenzo, D. / Navas, Eva / Ye, J. / Arzelus, H. / Jauk, Igor / Hy, N. Q. / Magariños, C. / Pérez-Ramón, R. / Sulír, M. / Tian, Xiaohai / Wang, X.: "Personalized synthetic voices for speaking impaired: website and app", 1251-1254.
Sahraeian, Reza / Compernolle, Dirk Van / Wet, Febe de: "Under-resourced speech recognition based on the speech manifold", 1255-1259.
Golik, Pavel / Tüske, Zoltán / Schlüter, Ralf / Ney, Hermann: "Multilingual features based keyword search for very low-resource languages", 1260-1264.
Wang, Xiaoyun / Yamamoto, Seiichi: "Second language speech recognition using multiple-pass decoding with lexicon represented by multiple reduced phoneme sets", 1265-1269.
Juan, Sarah Samson / Besacier, Laurent / Lecouteux, Benjamin / Dyab, Mohamed: "Using resources from a closely-related language to develop ASR for a very under-resourced language: a case study for Iban", 1270-1274.
Korenevsky, Maxim L. / Smirnov, Andrey B. / Mendelev, Valentin S.: "Prediction of speech recognition accuracy for utterance classification", 1275-1279.
Beck, Eugen / Schlüter, Ralf / Ney, Hermann: "Error bounds for context reduction and feature omission", 1280-1284.
Itoh, Nobuyasu / Kurata, Gakuto / Tachibana, Ryuki / Nishimura, Masafumi: "A metric for evaluating speech recognizer output based on human-perception model", 1285-1288.
Jannet, Mohamed Ameur Ben / Galibert, Olivier / Adda-Decker, Martine / Rosset, Sophie: "How to evaluate ASR output for named entity recognition?", 1289-1293.
Mixdorff, Hansjörg / Hönemann, Angelika / Rilliard, Albert: "Acoustic-prosodic analysis of attitudinal expressions in German", 1294-1298.
Khaki, Hossein / Erzin, Engin: "Continuous emotion tracking using total variability space", 1299-1303.
Lee, Chi-Chun / Bone, Daniel / Narayanan, Shrikanth S.: "An analysis of the relationship between signal-derived vocal arousal score and human emotion production and perception", 1304-1308.
Mori, Hiroki: "Morphology of vocal affect bursts: exploring expressive interjections in Japanese conversation", 1309-1313.
Mehrabani, Mahnoosh / Kalinli, Ozlem / Chen, Ruxin: "Emotion clustering based on probabilistic linear discriminant analysis", 1314-1318.
Albin, Aaron / Moore, Elliot: "Objective study of the performance degradation in emotion recognition through the AMR-WB+ codec", 1319-1323.
Kadiri, Sudarsana Reddy / Gangamohan, P. / Gangashetty, Suryakanth V. / Yegnanarayana, B.: "Analysis of excitation source features of speech for emotion recognition", 1324-1328.
Huang, Zhaocheng / Epps, Julien / Ambikairajah, Eliathamby: "An investigation of emotion change detection from speech", 1329-1333.
Gu, Wentao / Tang, Ping / Hirose, Keikichi / Aubergé, Véronique: "Crosslinguistic comparison on the perception of Mandarin attitudinal speech", 1334-1338.
Gosztolya, Gábor: "Conflict intensity estimation from speech using greedy forward-backward feature selection", 1339-1343.
Chong, Chee Seng / Kim, Jeesun / Davis, Chris: "Exploring acoustic differences between Cantonese (tonal) and English (non-tonal) spoken expressions of emotions", 1522-1526.
Palogiannidi, Elisavet / Iosif, Elias / Koutsakis, Polychronis / Potamianos, Alexandros: "Valence, arousal and dominance estimation for English, German, Greek, Portuguese and Spanish lexica using semantic models", 1527-1531.
Xu, Xinzhou / Deng, Jun / Zheng, Wenming / Zhao, Li / Schuller, Björn: "Dimensionality reduction for speech emotion features by multiscale kernels", 1532-1536.
Lee, Jinkyu / Tashev, Ivan: "High-level feature representation using recurrent neural network for speech emotion recognition", 1537-1540.
Kim, Myung Jong / Yoo, Joohong / Kim, Younggwan / Kim, Hoirin: "Speech emotion classification using tree-structured sparse logistic regression", 1541-1545.
Vlasenko, Bogdan / Wendemuth, Andreas: "Annotators' agreement and spontaneous emotion classification performance", 1546-1550.
Arisoy, Ebru / Saraçlar, Murat: "Multi-stream long short-term memory neural network language model", 1413-1417.
Hall, Keith / Cho, Eunjoon / Allauzen, Cyril / Beaufays, Françoise / Coccaro, Noah / Nakajima, Kaisuke / Riley, Michael / Roark, Brian / Rybach, David / Zhang, Linda: "Composition-based on-the-fly rescoring for salient n-gram biasing", 1418-1422.
Marin, Alex / Ostendorf, Mari / He, Ji: "Learning phrase patterns for ASR name error detection using semantic similarity", 1423-1427.
Shazeer, Noam / Pelemans, Joris / Chelba, Ciprian: "Sparse non-negative matrix language modeling for skip-grams", 1428-1432.
Pelemans, Joris / Shazeer, Noam / Chelba, Ciprian: "Pruning sparse non-negative matrix n-gram language models", 1433-1437.
Chelba, Ciprian / Zhang, Xuedong / Hall, Keith: "Geo-location for voice search language modeling", 1438-1442.
Botros, Rami / Irie, Kazuki / Sundermeyer, Martin / Ney, Hermann: "On efficient training of word classes and their application to recurrent neural network language models", 1443-1447.
Bayer, Ali Orkan / Riccardi, Giuseppe: "Deep semantic encodings for language modeling", 1448-1452.
Sun, Ming / Chen, Yun-Nung / Rudnicky, Alexander I.: "Learning OOV through semantic relatedness in spoken dialog systems", 1453-1457.
Chong, Tze Yuang / Banchs, Rafael E. / Chng, Eng Siong / Li, Haizhou: "TDTO language modeling with feedforward neural networks", 1458-1462.
Paulik, Matthias: "Improvements to the pruning behavior of DNN acoustic models", 1463-1467.
Sak, Haşim / Senior, Andrew / Rao, Kanishka / Beaufays, Françoise: "Fast and accurate recurrent neural network acoustic models for speech recognition", 1468-1472.
Nakkiran, Preetum / Alvarez, Raziel / Prabhavalkar, Rohit / Parada, Carolina: "Compressing deep neural networks using a rank-constrained topology", 1473-1477.
Sainath, Tara N. / Parada, Carolina: "Convolutional neural networks for small-footprint keyword spotting", 1478-1482.
Berg, Ewout van den / Brand, Daniel / Bordawekar, Rajesh / Rachevsky, Leonid / Ramabhadran, Bhuvana: "Efficient GPU implementation of convolutional neural networks for speech recognition", 1483-1487.
Strom, Nikko: "Scalable distributed DNN training using commodity GPU cloud computing", 1488-1492.
Kalkur, Sachin N. / Reddy C., Sandeep / Hegde, Rajesh M.: "Joint source localization and separation in spherical harmonic domain using a sparsity based method", 1493-1497.
Zhang, Shaofei / Huang, Dong-Yan / Xie, Lei / Chng, Eng Siong / Li, Haizhou / Dong, Minghui: "Regularized non-negative matrix factorization using alternating direction method of multipliers and its application to source separation", 1498-1502.
Nie, Shuai / Liang, Shan / Xue, Wei / Zhang, Xueliang / Liu, Wenju / Dong, Like / Yang, Hong: "Two-stage multi-target joint learning for monaural speech separation", 1503-1507.
Xu, Yong / Du, Jun / Huang, Zhen / Dai, Li-Rong / Lee, Chin-Hui: "Multi-objective learning and mask-based post-processing for deep neural network based speech enhancement", 1508-1512.
Kwon, Kisoo / Shin, Jong Won / Kim, Hyung Yong / Kim, Nam Soo: "Discriminative nonnegative matrix factorization using cross-reconstruction error for source separation", 1513-1516.
Khan, Faheem / Milner, Ben: "Using audio and visual information for single channel speaker separation", 1517-1521.
Höge, Harald: "On the nature of the features generated in the human auditory pathway for phone recognition", 1551-1555.
Yamamoto, Kodai / Irino, Toshio / Nisimura, Ryuichi / Kawahara, Hideki / Patterson, Roy D.: "How the slope of the speech spectrum affects the perception of speaker size", 1556-1560.
Rasilo, Heikki / Räsänen, Okko: "Weakly-supervised word learning is improved by an active online algorithm", 1561-1565.
Lin, Lin / Barker, Jon / Brown, Guy J.: "The effect of cochlear implant processing on speaker intelligibility: a perceptual study and computer model", 1566-1570.
Cao, Mengxue / Li, Aijun / Fang, Qiang / Kröger, Bernd J.: "Phonetic-phonological feature emerges by associating phonetic with semantic information — a GSOM-based modeling study", 1571-1575.
Bosch, L. ten / Boves, L. / Tucker, B. / Ernestus, M.: "DIANA: towards computational modeling reaction times in lexical decision in North American English", 1576-1580.
Chen, Qian / Ling, Zhen-Hua / Yang, Chen-Yu / Dai, Li-Rong: "Automatic phrase boundary labeling of speech synthesis database using context-dependent HMMs and n-gram prior distributions", 1581-1585.
Ribeiro, Manuel Sam / Yamagishi, Junichi / Clark, Robert A. J.: "A perceptual investigation of wavelet-based decomposition of f0 for text-to-speech synthesis", 1586-1590.
Moungsri, Decha / Koriyama, Tomoki / Kobayashi, Takao: "Duration prediction using multi-level model for GPR-based speech synthesis", 1591-1595.
Langarani, Mahsa Sadat Elyasi / Santen, Jan van / Mohammadi, Seyed Hamidreza / Kain, Alexander: "Data-driven foot-based intonation generator for text-to-speech synthesis", 1596-1600.
Gerazov, Branislav / Honnet, Pierre-Edouard / Gjoreski, Aleksandar / Garner, Philip N.: "Weighted correlation based atom decomposition intonation modelling", 1601-1605.
Fernandez, Raul / Rendel, Asaf / Ramabhadran, Bhuvana / Hoory, Ron: "Using deep bidirectional recurrent neural networks for prosodic-target prediction in a unit-selection text-to-speech system", 1606-1610.
Liao, Hank / Pundak, Golan / Siohan, Olivier / Carroll, Melissa K. / Coccaro, Noah / Jiang, Qi-Ming / Sainath, Tara N. / Senior, Andrew / Beaufays, Françoise / Bacchiani, Michiel: "Large vocabulary automatic speech recognition for children", 1611-1615.
Bone, Daniel / Black, Matthew P. / Ramakrishna, Anil / Grossman, Ruth / Narayanan, Shrikanth S.: "Acoustic-prosodic correlates of 'awkward' prosody in story retellings from adolescents with autism", 1616-1620.
Fringi, Eva / Lehman, Jill Fain / Russell, Martin: "Evidence of phonological processes in automatic recognition of children's speech", 1621-1624.
Pucher, Michael / Toman, Markus / Schabus, Dietmar / Valentini-Botinhao, Cassia / Yamagishi, Junichi / Zillinger, Bettina / Schmid, Erich: "Influence of speaker familiarity on blind and visually impaired children's perception of synthetic voices in audio games", 1625-1629.
Shahnawazuddin, S. / Sinha, Rohit: "Low-memory fast on-line adaptation for acoustically mismatched children's speech recognition", 1630-1634.
Giuliani, Diego / BabaAli, Bagher: "Large vocabulary children's speech recognition with DNN-HMM and SGMM acoustic modeling", 1635-1639.
Govender, Avashna / Wet, Febe de / Tapamo, Jules-Raymond: "HMM adaptation for child speech synthesis", 1640-1644.
Kim, Jaebok / Truong, Khiet P. / Charisi, Vicky / Zaga, Cristina / Lohse, Manja / Heylen, Dirk / Evers, Vanessa: "Vocal turn-taking patterns in groups of children performing collaborative tasks: an exploratory study", 1645-1649.
Sadeghian, Roozbeh / Zahorian, Stephen A.: "Towards an automated screening tool for pediatric speech delay", 1650-1654.
Proença, Jorge / Celorico, Dirce / Candeias, Sara / Lopes, Carla / Perdigão, Fernando: "Children's reading aloud performance: a database and automatic detection of disfluencies", 1655-1659.
Sundar, Harshavardhan / Lehman, Jill Fain / Singh, Rita: "Keyword spotting in multi-player voice driven games for children", 1660-1664.
Guo, Jinxi / Paturi, Rohit / Yeung, Gary / Lulich, Steven M. / Arsikere, Harish / Alwan, Abeer: "Age-dependent height estimation and speaker normalization for children's speech using the first three subglottal resonances", 1665-1669.
Leemann, Adrian / Bernardasci, Camilla / Nolan, Francis: "The effect of speakers' regional varieties on listeners' decision-making", 1670-1674.
Fuchs, Robert: "Word-initial glottal stop insertion, hiatus resolution and linking in British English", 1675-1679.
Li, Shanpeng / Gu, Wentao: "Acoustic analysis of Mandarin affricates", 1680-1684.
Leykum, Hannah / Moosmüller, Sylvia / Dressler, Wolfgang U.: "Homophonous phonotactic and morphonotactic consonant clusters in word-final position", 1685-1689.
Gibson, Mark / Planas, Ana María Fernández / Gafos, Adamantios / Remirez, Emily: "Consonant duration and VOT as a function of syllable complexity and voicing in a sub-set of Spanish clusters", 1690-1694.
Arai, Takayuki: "Hands-on tool producing front vowels for phonetic education: aiming for pronunciation training with tactile sensation", 1695-1699.
Dutta, Indranil / Pandey, Ayushi: "Acoustics of articulatory constraints: vowel classification and nasalization", 1700-1704.
Kraus, Janina: "Voice-conditioned allophones of MOUTH and PRICE in Bahamian Creole", 1705-1709.
Kolly, Marie-José / Leemann, Adrian / Matter, Florian: "Analysis of spatial variation with app-based crowdsourced audio data", 1710-1714.
Jani, Mátyás / Cucchiarini, Catia / Hout, Roeland van / Strik, Helmer: "Confusability in L2 vowels: analyzing the role of different features", 1715-1719.
Zimmerer, Frank / Trouvain, Jürgen: "Perception of French speakers' German vowels", 1720-1724.
Bruni, Jagoda / Duran, Daniel / Dogil, Grzegorz: "Unintuitive phonetic behavior in Tswana post-nasal stops", 1725-1729.
Prathosh, A. P. / Ramakrishnan, A. G. / Ananthapadmanabha, T. V.: "Classification of place-of-articulation of stop consonants using temporal analysis", 2655-2659.
Barlaz, Marissa / Fu, Maojing / Liang, Zhi-Pei / Shosted, Ryan / Sutton, Brad: "The emergence of nasal velar codas in Brazilian Portuguese: an rt-MRI study", 2660-2664.
Michon, Elise / Dupoux, Emmanuel / Cristia, Alejandrina: "Salient dimensions in implicit phonotactic learning", 2665-2669.
Howson, Phil: "An acoustic examination of the three-way sibilant contrast in Lower Sorbian", 2670-2674.
Yuan, Jiahong / Liberman, Mark: "Investigating consonant reduction in Mandarin Chinese with improved forced alignment", 2675-2678.
Pouplier, Marianne / Marin, Stefania / Kochetov, Alexei: "Durational characteristics and timing patterns of Russian onset clusters at two speaking rates", 2679-2683.
Wong, Chun Hoy / Lee, Tan / Yeung, Yu Ting / Ching, P. C.: "Modeling temporal dependency for robust estimation of LP model parameters in speech enhancement", 1730-1734.
Vaz, Colin / Narayanan, Shrikanth S.: "Learning a speech manifold for signal subspace speech denoising", 1735-1739.
Elshamy, Samy / Madhu, Nilesh / Tirry, Wouter / Fingscheidt, Tim: "An iterative speech model-based a priori SNR estimator", 1740-1744.
Zhang, Xiao-Lei / Wang, DeLiang: "Multi-resolution stacking for speech separation based on boosted DNN", 1745-1749.
Nørholm, Sidsel Marie / Krawczyk-Becker, Martin / Gerkmann, Timo / Par, Steven van de / Jensen, Jesper Rindom / Christensen, Mads Græsbøll: "Least squares estimate of the initial phases in STFT based speech enhancement", 1750-1754.
Nørholm, Sidsel Marie / Jensen, Jesper Rindom / Christensen, Mads Græsbøll: "Enhancement of non-stationary speech using harmonic chirp filters", 1755-1759.
Kinoshita, Keisuke / Delcroix, Marc / Ogawa, Atsunori / Nakatani, Tomohiro: "Text-informed speech enhancement with deep neural networks", 1760-1764.
Masaya, Shogo / Unoki, Masashi: "Complex tensor factorization in modulation frequency domain for single-channel speech enhancement", 1765-1769.
Kang, Hyeonjoo / Lee, JeeSok / Baek, Soonho / Kang, Hong-Goo: "Systematic integration of acoustic echo canceller and noise reduction modules for voice communication systems", 1770-1774.
Lee, Chul Min / Shin, Jong Won / Kim, Nam Soo: "DNN-based residual echo suppression", 1775-1779.
He, Qi / Bao, Changchun / Bao, Feng: "Codebook-based speech enhancement using Markov process and speech-presence probability", 1780-1784.
Chinaev, Aleksej / Haeb-Umbach, Reinhold: "On optimal smoothing in minimum statistics based noise tracking", 1785-1789.
Hao, Yue / Bao, Changchun / Bao, Feng / Deng, Feng: "A data-driven speech enhancement method based on modeled long-range temporal dynamics", 1790-1794.
Mayer, Florian / Mowlaee, Pejman: "Improved phase reconstruction in single-channel speech separation", 1795-1799.
Zhao, Tuo / Zhao, Yunxin / Chen, Xin: "Time-frequency kernel-based CNN for speech recognition", 1888-1892.
Weber, Philip / Champion, Colin J. / Houghton, S. M. / Jančovič, Peter / Russell, Martin: "Consonant recognition with continuous-state hidden Markov models and perceptually-motivated features", 1893-1897.
Ganapathy, Sriram / Thomas, Samuel / Dimitriadis, Dimitrios / Rennie, Steven: "Investigating factor analysis features for deep neural networks in noisy speech recognition", 1898-1902.
Travadi, Ruchir / Narayanan, Shrikanth S.: "Ensemble of Gaussian mixture localized neural networks with application to phone recognition", 1903-1907.
Pešán, Jan / Burget, Lukáš / Hermansky, Hynek / Veselý, Karel: "DNN derived filters for processing of modulation spectrum of speech", 1908-1911.
Nagamine, Tasha / Seltzer, Michael L. / Mesgarani, Nima: "Exploring how deep neural networks form phonemic categories", 1912-1916.
Loukina, Anastassia / Lopez, Melissa / Evanini, Keelan / Suendermann-Oeft, David / Ivanov, Alexei V. / Zechner, Klaus: "Pronunciation accuracy and intelligibility of non-native speech", 1917-1921.
Zimmerer, Frank / Trouvain, Jürgen: "Productions of /h/ in German: French vs. German speakers", 1922-1926.
Bonneau, Anne / Cadot, Martine: "German non-native realizations of French voiced fricatives in final position of a group of words", 1927-1931.
Best, Catherine T. / Shaw, Jason A. / Docherty, Gerard / Evans, Bronwen G. / Foulkes, Paul / Hay, Jennifer / Al-Tamimi, Jalal / Mair, Katharine / Mulak, Karen E. / Wood, Sophie: "From Newcastle MOUTH to Aussie ears: Australians' perceptual assimilation and adaptation for Newcastle UK vowels", 1932-1936.
Bundgaard-Nielsen, Rikke Louise / Baker, Brett / Maxwell, Olga / Fletcher, Janet: "Wubuy coronal stop perception by speakers of three dialects of Bangla", 1937-1941.
Hirst, Daniel / Ding, Hongwei: "Using melody metrics to compare English speech read by native speakers and by L2 Chinese speakers from Shanghai", 1942-1946.
Gibson, James / Malandrakis, Nikolaos / Romero, Francisco / Atkins, David C. / Narayanan, Shrikanth S.: "Predicting therapist empathy in motivational interviews using language features inspired by psycholinguistic norms", 1947-1951.
Malandrakis, Nikolaos / Narayanan, Shrikanth S.: "Therapy language analysis using automatically generated psycholinguistic norms", 1952-1956.
Xia, Wei / Gibson, James / Xiao, Bo / Baucom, Brian / Georgiou, Panayiotis G.: "A dynamic model for behavioral analysis of couple interactions using acoustic features", 1957-1961.
Gupta, Rahul / Chaspari, Theodora / Georgiou, Panayiotis G. / Atkins, David C. / Narayanan, Shrikanth S.: "Analysis and modeling of the role of laughter in motivational interviewing based psychotherapy conversations", 1962-1966.
Bonin, Francesca / Campbell, Nick / Vogel, Carl: "The discourse value of social signals at topic change moments", 1967-1971.
Schrank, Tobias / Schuppler, Barbara: "Automatic detection of uncertainty in spontaneous German dialogue", 1972-1976.
Ringeval, Fabien / Marchi, Erik / Mehu, Marc / Scherer, Klaus / Schuller, Björn: "Face reading from speech — predicting facial action units from audio cues", 1977-1981.
Nandwana, Mahesh Kumar / Bořil, Hynek / Hansen, John H. L.: "A new front-end for classification of non-speech sounds: a study on human whistle", 1982-1986.
Dumpala, Sri Harsha / Nellore, Bhanu Teja / Nevali, Raghu Ram / Gangashetty, Suryakanth V. / Yegnanarayana, B.: "Robust features for sonorant segmentation in continuous speech", 1987-1991.
Gergen, Sebastian / Nagathil, Anil / Martin, Rainer: "Reduction of reverberation effects in the MFCC modulation spectrum for improved classification of acoustic signals", 1992-1996.
Dennis, Jonathan / Tran, Huy Dat / Li, Haizhou: "Spiking neural networks and the generalised Hough transform for speech pattern detection", 1997-2001.
Choi, Woohyun / Park, Sangwook / Han, David K. / Ko, Hanseok: "Acoustic event recognition using dominant spectral basis vectors", 2002-2006.
Hwang, Inyoung / Sim, Jaeseong / Kim, Sang-Hyeon / Song, Kwang-Sub / Chang, Joon-Hyuk: "A statistical model-based voice activity detection using multiple DNNs and noise awareness", 2277-2281.
Wang, Qing / Du, Jun / Bao, Xiao / Wang, Zi-Rui / Dai, Li-Rong / Lee, Chin-Hui: "A universal VAD based on jointly trained deep neural networks", 2282-2286.
Zhan, Ge / Huang, Zhaoqiong / Ying, Dongwen / Pan, Jielin / Yan, Yonghong: "Spectrographic speech mask estimation using the time-frequency correlation of speech presence", 2287-2291.
Ghaemmaghami, Houman / Dean, David / Kalantari, Shahram / Sridharan, Sridha / Fookes, Clinton: "Complete-linkage clustering for voice activity detection in audio and visual speech", 2292-2296.
Sriskandaraja, Kaavya / Sethu, Vidhyasaharan / Le, Phu Ngoc / Ambikairajah, Eliathamby: "A model based voice activity detector for noisy environments", 2297-2301.
Tao, Fei / Hansen, John H. L. / Busso, Carlos: "An unsupervised visual-only voice activity detection approach using temporal orofacial features", 2302-2306.
Raboshchuk, Ganna / Jančovič, Peter / Nadeu, Climent / Lilja, Alex Peiró / Köküer, Münevver / Mahamud, Blanca Muñoz / Veciana, Ana Riverola de: "Automatic detection of equipment alarms in a neonatal intensive care unit environment: a knowledge-based approach", 2902-2906.
Dai, Jia / Liu, Wenju / Ni, Chongjia / Dong, Like / Yang, Hong: "“Multilingual” deep neural network for music genre classification", 2907-2911.
Liu, Baiyang / Hoffmeister, Bjorn / Rastrow, Ariya: "Accurate endpointing with expected pause duration", 2912-2916.
Liu, Wenbo / Yu, Zhiding / Raj, Bhiksha / Li, Ming: "Locality constrained transitive distance clustering on speech data", 2917-2921.
Espi, Miquel / Fujimoto, Masakiyo / Kinoshita, Keisuke / Nakatani, Tomohiro: "Feature extraction strategies in deep learning based acoustic event detection", 2922-2926.
Transfeld, Peter / Receveur, Simon / Fingscheidt, Tim: "An acoustic event detection framework and evaluation metric for surveillance in cars", 2927-2931.
Bouchekif, Abdessalam / Damnati, Géraldine / Estève, Yannick / Charlet, Delphine / Camelin, Nathalie: "Diachronic semantic cohesion for topic segmentation of TV broadcast news", 2932-2936.
Kraljevski, Ivan / Tan, Zheng-Hua / Bissiri, Maria Paola: "Comparison of forced-alignment speech recognition and humans for generating reference VAD", 2937-2941.
Lehner, Bernhard / Widmer, Gerhard / Sonnleitner, Reinhard: "Improving voice activity detection in movies", 2942-2946.
Su, Pei-Hao / Vandyke, David / Gašić, Milica / Kim, Dongho / Mrkšić, Nikola / Wen, Tsung-Hsien / Young, Steve: "Learning from real users: rating dialogue success with neural networks for reinforcement learning in spoken dialogue systems", 2007-2011.
Griol, David / Callejas, Zoraida / López-Cózar, Ramón: "A framework to develop context-aware adaptive dialogue system", 2012-2016.
Griol, David / Callejas, Zoraida: "A proposal to develop domain and subtask-adaptive dialog management models", 2017-2021.
Khan, Omar Zia / Robichaud, Jean-Philippe / Crook, Paul A. / Sarikaya, Ruhi: "Hypotheses ranking and state tracking for a multi-domain dialog system using multiple ASR alternates", 2022-2026.
Wu, Ji / Li, Miao / Lee, Chin-Hui: "An entropy minimization framework for goal-driven dialogue management", 2027-2031.
Zukerman, Ingrid / Partovi, Andisheh / Kim, Su Nam: "Context-dependent error correction of spoken referring expressions", 2032-2036.
Wu, Zhizheng / Kinnunen, Tomi: "Automatic speaker verification spoofing and countermeasures (ASVspoof 2015): introductory talk by the organizers" (abstract).
Wu, Zhizheng / Kinnunen, Tomi / Evans, Nicholas / Yamagishi, Junichi / Hanilçi, Cemal / Sahidullah, Md. / Sizov, Aleksandr: "ASVspoof 2015: the first automatic speaker verification spoofing and countermeasures challenge", 2037-2041.
Sanchez, Jon / Saratxaga, Ibon / Hernaez, Inma / Navas, Eva / Erro, D.: "The AHOLAB RPS SSD spoofing challenge 2015 submission", 2042-2046.
Wester, Mirjam / Wu, Zhizheng / Yamagishi, Junichi: "Human vs machine spoofing detection on wideband and narrowband data", 2047-2051.
Xiao, Xiong / Tian, Xiaohai / Du, Steven / Xu, Haihua / Chng, Eng Siong / Li, Haizhou: "Spoofing speech detection using high dimensional magnitude and phase features: the NTU approach for ASVspoof 2015 challenge", 2052-2056.
Hanilçi, Cemal / Kinnunen, Tomi / Sahidullah, Md. / Sizov, Aleksandr: "Classifiers for synthetic speech detection: a comparison", 2057-2061.
Patel, Tanvina B. / Patil, Hemant A.: "Combining evidences from mel cepstral, cochlear filter cepstral and instantaneous frequency features for detection of natural vs. spoofed speech", 2062-2066.
Villalba, Jesús / Miguel, Antonio / Ortega, Alfonso / Lleida, Eduardo: "Spoofing detection with DNN and one-class SVM for the ASVspoof 2015 challenge", 2067-2071.
Alam, Md. Jahangir / Kenny, Patrick / Bhattacharya, Gautam / Stafylakis, Themos: "Development of CRIM system for the automatic speaker verification spoofing and countermeasures challenge 2015", 2072-2076.
Janicki, Artur: "Spoofing countermeasure based on analysis of linear prediction error", 2077-2081.
Liu, Yi / Tian, Yao / He, Liang / Liu, Jia / Johnson, Michael T.: "Simultaneous utilization of spectral magnitude and phase information to extract supervectors for speaker verification anti-spoofing", 2082-2086.
Sahidullah, Md. / Kinnunen, Tomi / Hanilçi, Cemal: "A comparison of features for synthetic speech detection", 2087-2091.
Wang, Longbiao / Yoshida, Yohei / Kawakami, Yuta / Nakagawa, Seiichi: "Relative phase information for detecting human speech and spoofed speech", 2092-2096.
Chen, Nanxin / Qian, Yanmin / Dinkel, Heinrich / Chen, Bo / Yu, Kai: "Robust deep feature for spoofing detection — the SJTU system for ASVspoof 2015 challenge", 2097-2101.
Yamagishi, Junichi / Evans, Nicholas: "Automatic speaker verification spoofing and countermeasures (ASVspoof 2015): open discussion and future plans" (abstract).
Lee, Kyungmin / Park, Chiyoun / Kim, Ilhwan / Kim, Namhoon / Lee, Jaewon: "Applying GPGPU to recurrent neural network language model based fast network search in the real-time LVCSR", 2102-2106.
Oualil, Youssef / Schulder, Marc / Helmke, Hartmut / Schmidt, Anna / Klakow, Dietrich: "Real-time integration of dynamic context information for improving automatic speech recognition", 2107-2111.
Allauzen, Cyril / Riley, Michael: "Rapid vocabulary addition to context-dependent decoder graphs", 2112-2116.
Xu, Hainan / Chen, Guoguo / Povey, Daniel / Khudanpur, Sanjeev: "Modeling phonetic context with non-random forests for speech recognition", 2117-2121.
Lecouteux, Benjamin / Schwab, Didier: "Ant colony algorithm applied to automatic speech recognition graph decoding", 2122-2126.
Gysel, Christophe Van / Velikovich, Leonid / McGraw, Ian / Beaufays, Françoise: "Garbage modeling for on-device speech recognition", 2127-2131.
Xu, Haihua / Do, Van Hai / Xiao, Xiong / Chng, Eng Siong: "A comparative study of BNF and DNN multilingual training on cross-lingual low-resource speech recognition", 2132-2136.
Ratajczak, Martin / Tschiatschek, Sebastian / Pernkopf, Franz: "Neural higher-order factors in conditional random fields for phoneme classification", 2137-2141.
Jalalvand, Shahab / Falavigna, Daniele: "Stacked auto-encoder for ASR error detection and word error rate prediction", 2142-2146.
Parida, Satyabrata / Pattem, Ashok Kumar / Ghosh, Prasanta Kumar: "Estimation of the air-tissue boundaries of the vocal tract in the mid-sagittal plane from electromagnetic articulograph data", 2147-2151.
Canevari, Claudia / Badino, Leonardo / Fadiga, Luciano: "A new Italian dataset of parallel acoustic and articulatory data", 2152-2156.
Csapó, Tamás Gábor / Lulich, Steven M.: "Error analysis of extracted tongue contours from 2D ultrasound images", 2157-2161.
Bandini, Andrea / Ouni, Slim / Cosi, Piero / Orlandi, Silvia / Manfredi, Claudia: "Accuracy of a markerless acquisition technique for studying speech articulators", 2162-2166.
Chi, Yujie / Honda, Kiyoshi / Wei, Jianguo / Feng, Hui / Dang, Jianwu: "Measuring oral and nasal airflow in production of Chinese plosive", 2167-2171.
Drioli, Carlo / Foresti, Gian Luca: "Enhanced videokymographic data analysis based on vocal folds dynamics modeling", 2172-2176.
Kolb, Andrew J. / Johnson, Michael T. / Berry, Jeffrey: "Interpolation of tongue fleshpoint kinematics from combined EMA position and orientation data", 2177-2181.
Andrade-Miranda, Gustavo / Bernardoni, Nathalie Henrich / Godino-Llorente, Juan Ignacio: "A new technique for assessing glottal dynamics in speech and singing by means of optical-flow computation", 2182-2186.
Kochetov, Alexei / Howson, Phil: "On the incompatibility of trilling and palatalization: a single-subject study of sustained apical and uvular trills", 2187-2191.
Zhu, Pengcheng / Xie, Lei / Chen, Yunlin: "Articulatory movement prediction using deep bidirectional long short-term memory based recurrent neural networks and word/phone embeddings", 2192-2196.
Ruiz, Nicholas / Gao, Qin / Lewis, William / Federico, Marcello: "Adapting machine translation models toward misrecognized speech with text-to-speech pronunciation rules and acoustic confusability", 2247-2251.
Bechet, Frederic / Favre, Benoit / Rouvier, Mickael: "“Speech is silver, but silence is golden”: improving speech-to-speech translation performance by slashing users input", 2252-2256.
Ng, Raymond W. M. / Shah, Kashif / Specia, Lucia / Hain, Thomas: "A study on the stability and effectiveness of features in quality estimation for spoken language translation", 2257-2261.
Pelemans, Joris / Vanallemeersch, Tom / Demuynck, Kris / hamme, Hugo Van / Wambacq, Patrick: "Efficient language model adaptation for automatic speech recognition of spoken translations", 2262-2266.
Mieno, Takashi / Neubig, Graham / Sakti, Sakriani / Toda, Tomoki / Nakamura, Satoshi: "Speed or accuracy? a study in evaluation of simultaneous speech translation", 2267-2271.
Junczys-Dowmunt, Marcin / Przybysz, Paweł / Staszuk, Arleta / Kim, Eun-Kyoung / Lee, Jaewon: "Large scale speech-to-text translation with out-of-domain corpora using better context-based models and domain adaptation", 2272-2276.
Kenny, Patrick / Stafylakis, Themos / Alam, Md. Jahangir / Kockmann, Marcel: "An i-vector backend for speaker verification", 2307-2311.
Correia, Joana / Brutti, Alessio / Abad, Alberto: "Multi-channel speaker verification based on total variability modelling", 2312-2316.
Li, Na / Mak, Man-Wai: "SNR-invariant PLDA modeling for robust speaker verification", 2317-2321.
Rahman, Md. Hafizur / Dean, David / Kanagasundaram, Ahilan / Sridharan, Sridha: "Investigating in-domain data requirements for PLDA training", 2322-2326.
Glembek, Ondřej / Matějka, Pavel / Plchot, Oldřich / Pešán, Jan / Burget, Lukáš / Schwarz, Petr: "Migrating i-vectors between speaker recognition systems using regression neural networks", 2327-2331.
Kanagasundaram, Ahilan / Dean, David / Sridharan, Sridha: "Improving PLDA speaker verification using WMFD and linear-weighted approaches in limited microphone data conditions", 2332-2336.
Gobl, Christer / Yanushevskaya, Irena / Chasaide, Ailbhe Ní: "The relationship between voice source parameters and the maxima dispersion quotient (MDQ)", 2337-2341.
Airaksinen, Manu / Bäckström, Tom / Alku, Paavo: "Glottal inverse filtering based on quadratic programming", 2342-2346.
Narendra, N. P. / Rao, K. Sreenivasa: "Automatic detection of creaky voice using epoch parameters", 2347-2351.
Bundgaard-Nielsen, Rikke Louise / Baker, Brett: "Perception of voicing in the absence of native voicing experience", 2352-2356.
Kreiman, Jody / Park, Soo Jin / Keating, Patricia A. / Alwan, Abeer: "The relationship between acoustic and perceived intraspeaker variability in voice quality", 2357-2360.
Jiao, Li / Ma, Qiuwu / Wang, Ting / Xu, Yi: "Perceptual cues of whispered tones: are they really special?", 2361-2365.
Morioka, Tsuyoshi / Iwata, Tomoharu / Hori, Takaaki / Kobayashi, Tetsunori: "Multiscale recurrent neural network based language model", 2366-2370.
Irie, Kazuki / Schlüter, Ralf / Ney, Hermann: "Bag-of-words input for long history representation in neural network-based language models for speech recognition", 2371-2375.
Emami, Ahmad: "Efficient machine translation decoding with slow language models", 2376-2379.
Masumura, Ryo / Asami, Taichi / Oba, Takanobu / Masataki, Hirokazu / Sakauchi, Sumitaka / Ito, Akinori: "Latent words recurrent neural network language models", 2380-2384.
Chunwijitra, Vataya / Chotimongkol, Ananlada / Wutiwiwatchai, Chai: "Combining multiple-type input units using recurrent neural network for LVCSR language modeling", 2385-2389.
Gangireddy, Siva Reddy / Renals, Steve / Nankaku, Yoshihiko / Lee, Akinobu: "Prosodically-enhanced recurrent neural network language models", 2390-2394.
Janke, Matthias / Wand, Michael: "Biosignal-based spoken communication: welcome and introduction" (abstract).
Anderson, Peter / Harandi, Negar M. / Moisik, Scott / Stavness, Ian / Fels, Sidney: "A comprehensive 3D biomechanically-driven vocal tract model including inverse dynamics for speech research", 2395-2399.
McLoughlin, Ian / Song, Yan: "Low frequency ultrasonic voice activity detection using convolutional neural networks", 2400-2404.
Bocquelet, Florent / Hueber, Thomas / Girin, Laurent / Savariaux, Christophe / Yvert, Blaise: "Real-time control of a DNN-based articulatory synthesizer for silent speech conversion: a pilot study", 2405-2409.
Fabre, Diandra / Hueber, Thomas / Bocquelet, Florent / Badin, Pierre: "Tongue tracking in ultrasound images using eigentongue decomposition and artificial neural networks", 2410-2414.
Wang, Jun / Hahm, Seongjun: "Speaker-independent silent speech recognition with across-speaker articulatory normalization and speaker adaptive training", 2415-2419.
Diener, Lorenz / Janke, Matthias / Schultz, Tanja: "Codebook clustering for unit selection based EMG-to-speech conversion", 2420-2424.
Mirbagheri, Majid / Ekin, Bradley / Atlas, Les / Lee, Adrian K. C.: "Flexible tracking of auditory attention", 2425-2429.
Janke, Matthias / Wand, Michael: "Biosignal-based spoken communication: panel and discussion" (abstract).
Mirsamadi, Seyedmahdad / Hansen, John H. L.: "A study on deep neural network acoustic model adaptation for robust far-field speech recognition", 2430-2434.
Mimura, Masato / Sakai, Shinsuke / Kawahara, Tatsuya: "Speech dereverberation using long short-term memory", 2435-2439.
Peddinti, Vijayaditya / Chen, Guoguo / Povey, Daniel / Khudanpur, Sanjeev: "Reverberation robust acoustic modeling using i-vectors with time delay neural networks", 2440-2444.
Kumar, Kshitiz / Liu, Chaojun / Gong, Yifan: "Delta-melspectra features for noise robustness to DNN-based ASR systems", 2445-2448.
Mitra, Vikramjit / Hout, Julien Van / McLaren, Mitchell / Wang, Wen / Graciarena, Martin / Vergyri, Dimitra / Franco, Horacio: "Combating reverberation in large vocabulary continuous speech recognition", 2449-2453.
Karafiát, Martin / Grézl, František / Burget, Lukáš / Szöke, Igor / Černocký, Jan: "Three ways to adapt a CTS recognizer to unseen reverberated speech in BUT system for the ASpIRE challenge", 2454-2458.
Harvilla, Mark J. / Stern, Richard M.: "Robust parameter estimation for audio declipping in noise", 2459-2463.
Huang, Bin / Ke, Dengfeng / Zheng, Hao / Xu, Bo / Xu, Yanyan / Su, Kaile: "Multi-task learning deep neural networks for speech feature denoising", 2464-2468.
Wang, Yuxuan / Misra, Ananya / Chin, Kean K.: "Time-frequency masking for large scale robust speech recognition", 2469-2473.
Su, Rongfeng / Xie, Xurong / Liu, Xunying / Wang, Lan: "Efficient use of DNN bottleneck features in generalized variable parameter HMMs for noise robust speech recognition", 2474-2478.
Baby, Deepak / hamme, Hugo Van: "Investigating modulation spectrogram features for deep neural network-based automatic speech recognition", 2479-2483.
Han, Kun / He, Yanzhang / Bagchi, Deblin / Fosler-Lussier, Eric / Wang, DeLiang: "Deep neural network based spectral feature mapping for robust speech recognition", 2484-2488.
Xiao, Bo / Imel, Zac E. / Atkins, David C. / Georgiou, Panayiotis G. / Narayanan, Shrikanth S.: "Analyzing speech rate entrainment and its relation to therapist empathy in drug addiction counseling", 2489-2493.
Ando, Atsushi / Asami, Taichi / Okamoto, Manabu / Masataki, Hirokazu / Sakauchi, Sumitaka: "Agreement and disagreement utterance detection in conversational speech by extracting and integrating local features", 2494-2498.
Nasir, Md. / Xia, Wei / Xiao, Bo / Baucom, Brian / Narayanan, Shrikanth S. / Georgiou, Panayiotis G.: "Still together?: the role of acoustic features in predicting marital outcome", 2499-2503.
Gosztolya, Gábor: "On evaluation metrics for social signal detection", 2504-2508.
Kaushik, Lakshmish / Sangwan, Abhijeet / Hansen, John H. L.: "Laughter and filler detection in naturalistic audio", 2509-2513.
Pappu, Aasish / Stent, Amanda: "Automatic formatted transcripts for videos", 2514-2518.
Azaïs, Lucas / Payan, Adrien / Sun, Tianjiao / Vidal, Guillaume / Zhang, Tina / Coutinho, Eduardo / Eyben, Florian / Schuller, Björn: "Does my speech rock? automatic assessment of public speaking skills", 2519-2523.
Sergienko, Roman / Schmitt, Alexander: "Verbal intelligence identification based on text classification", 2524-2528.
Hsiao, Shan-Wen / Sun, Hung-Ching / Hsieh, Ming-Chuan / Tsai, Ming-Hsueh / Lin, Hsin-Chih / Lee, Chi-Chun: "A multimodal approach for automatic assessment of school principals' oral presentation during pre-service training program", 2529-2533.
Tsai, T. J.: "Are you TED talk material? comparing prosody in professors and TED speakers", 2534-2538.
Hayakawa, Akira / Haider, Fasih / Cerrato, Loredana / Campbell, Nick / Luz, Saturnino: "Detection of cognitive states and their correlation to speech recognition performance in speech-to-speech machine translation systems", 2539-2543.
Köster, Friedemann / Möller, Sebastian: "Perceptual speech quality dimensions in a conversational situation", 2544-2548.
Berger, Jens / Llagostera, Anna: "Multidimensional evaluation and predicting overall speech quality", 2549-2552.
Gaich, Andreas / Mowlaee, Pejman: "On speech intelligibility estimation of phase-aware single-channel speech enhancement", 2553-2557.
Marxer, Ricard / Cooke, Martin / Barker, Jon: "A framework for the evaluation of microscopic intelligibility models", 2558-2562.
Andersen, Asger Heidemann / Haan, Jan Mark de / Tan, Zheng-Hua / Jensen, Jesper: "A binaural short time objective intelligibility measure for noisy and enhanced speech", 2563-2567.
Tang, Yan / Cooke, Martin / Fazenda, Bruno M. / Cox, Trevor J.: "A glimpse-based approach for predicting binaural intelligibility with single and multiple maskers in anechoic conditions", 2568-2572.
Chen, Fei: "Improving the prediction power of the speech transmission index to account for non-linear distortions introduced by noise-reduction algorithms", 2573-2577.
Li, Kehuang / Huang, Zhen / Xu, Yong / Lee, Chin-Hui: "DNN-based speech bandwidth expansion and its application to adding high-frequency missing features for automatic speech recognition of narrowband speech", 2578-2582.
Pulakka, Hannu / Myllylä, Ville / Rämö, Anssi / Alku, Paavo: "Speech quality evaluation of artificial bandwidth extension: comparing subjective judgments and instrumental predictions", 2583-2587.
Turan, M. A. Tuğtekin / Erzin, Engin: "Synchronous overlap and add of spectra for enhancement of excitation in artificial bandwidth extension of speech", 2588-2592.
Wang, Yingxue / Zhao, Shenghui / Liu, Wenbo / Li, Ming / Kuang, Jingming: "Speech bandwidth expansion based on deep neural networks", 2593-2597.
Liu, Bin / Tao, Jianhua / Wen, Zhengqi / Li, Ya / Bukhari, Danish: "A novel method of artificial bandwidth extension using deep architecture", 2598-2602.
Dalen, Rogier C. van / Gales, Mark J. F.: "Annotating large lattices with the exact word error", 2625-2629.
Manohar, Vimal / Povey, Daniel / Khudanpur, Sanjeev: "Semi-supervised maximum mutual information training of deep neural network acoustic models", 2630-2634.
Zhang, Shiliang / Jiang, Hui / Wei, Si / Dai, Li-Rong: "Rectified linear neural networks with tied-scalar regularization for LVCSR", 2635-2639.
He, Yanzhang / Fosler-Lussier, Eric: "Segmental conditional random fields with deep neural networks as acoustic models for first-pass word recognition", 2640-2644.
Chen, Dongpeng / Mak, Brian: "Distinct triphone acoustic modeling using deep neural networks", 2645-2649.
Gelly, Gregory / Gauvain, Jean-Luc: "Minimum word error training of RNN-based voice activity detection", 2650-2654.
Quatieri, Thomas F. / Williamson, James R. / Smalt, Christopher J. / Patel, Tejash / Perricone, Joseph / Mehta, Daryush D. / Helfer, Brian S. / Ciccarelli, Gregory / Ricke, Darrell / Malyska, Nicolas / Palmer, Jeff / Heaton, Kristin / Eddy, Marianna / Moran, Joseph: "Vocal biomarkers to discriminate cognitive load in a working memory task", 2684-2688.
Zhang, Chunlei / Liu, Gang / Yu, Chengzhu / Hansen, John H. L.: "I-vector based physical task stress detection with different fusion strategies", 2689-2693.
Tóth, László / Gosztolya, Gábor / Vincze, Veronika / Hoffmann, Ildikó / Szatlóczki, Gréta / Biró, Edit / Zsura, Fruzsina / Pákáski, Magdolna / Kálmán, János: "Automatic detection of mild cognitive impairment from spontaneous speech using ASR", 2694-2698.
Sidorov, Maxim / Brester, Christina / Schmitt, Alexander: "Contemporary stochastic feature selection algorithms for speech-based emotion recognition", 2699-2703.
Ferrer, Carlos A. / Torres, Diana / González, Eduardo / Calvo, José Ramón / Castillo, Eduardo: "Effect of different jitter-induced glottal pulse shape changes in periodicity perturbation measures", 2704-2708.
Kaushik, Lakshmish / Sangwan, Abhijeet / Hansen, John H. L.: "Automatic audio sentiment extraction using keyword spotting", 2709-2713.
Pasupat, Panupong / Hakkani-Tür, Dilek: "Unsupervised relation detection using automatic alignment of query patterns extracted from knowledge graphs and query click logs", 2714-2718.
Nguyen, The Tung / Neubig, Graham / Shindo, Hiroyuki / Sakti, Sakriani / Toda, Tomoki / Nakamura, Satoshi: "A latent variable model for joint pause prediction and dependency parsing", 2719-2723.
Bokaei, Mohammad Hadi / Sameti, Hossein / Liu, Yang: "Extractive meeting summarization through speaker zone detection", 2724-2728.
Liu, Shih-Hung / Chen, Kuan-Yu / Chen, Berlin / Wang, Hsin-Min / Yen, Hsu-Chun / Hsu, Wen-Lian: "Positional language modeling for extractive broadcast news speech summarization", 2729-2733.
Mokaram, Saeid / Moore, Roger K.: "Speech-based location estimation of first responders in a simulated search and rescue scenario", 2734-2738.
Sousa, Tahir / Flekova, Lucie / Mieskes, Margot / Gurevych, Iryna: "Constructive feedback, thinking process and cooperation: assessing the quality of classroom interaction", 2739-2743.
Huang, Dong-Yan / Dong, Minghui / Li, Haizhou: "A real-time variable-Q non-stationary Gabor transform for pitch shifting", 2744-2748.
Aihara, Ryo / Takiguchi, Tetsuya / Ariki, Yasuo: "Many-to-many voice conversion based on multiple non-negative matrix factorization", 2749-2753.
Kobayashi, Kazuhiro / Toda, Tomoki / Neubig, Graham / Sakti, Sakriani / Nakamura, Satoshi: "Statistical singing voice conversion based on direct waveform modification with global variance", 2754-2758.
Tian, Xiaohai / Wu, Zhizheng / Lee, Siu Wa / Nguyen, Quy Hy / Dong, Minghui / Chng, Eng Siong: "System fusion for high-performance voice conversion", 2759-2763.
Alonso, Agustin / Erro, D. / Navas, Eva / Hernaez, Inma: "Speaker adaptation using only vocalic segments via frequency warping", 2764-2768.
Tajiri, Yusuke / Tanaka, Kou / Toda, Tomoki / Neubig, Graham / Sakti, Sakriani / Nakamura, Satoshi: "Non-audible murmur enhancement based on statistical conversion using air- and body-conductive microphones in noisy environments", 2769-2773.
Polzehl, Tim / Levow, Gina-Anne: "Advanced crowdsourcing for speech and beyond: introduction by the organizers" (abstract).
Jyothi, Preethi / Hasegawa-Johnson, Mark: "Transcribing continuous speech using mismatched crowdsourcing", 2774-2778.
Chowdhury, Shammur Absar / Calvo, Marcos / Ghosh, Arindam / Stepanov, Evgeny A. / Bayer, Ali Orkan / Riccardi, Giuseppe / García, Fernando / Sanchis, Emilio: "Selection and aggregation techniques for crowdsourced semantic annotation task", 2779-2783.
Rothwell, Spencer / Elshenawy, Ahmad / Carter, Steele / Braga, Daniela / Romani, Faraz / Kennewick, Michael / Kennewick, Bob: "Controlling quality and handling fraud in large scale crowdsourcing speech data collections", 2784-2788.
Rothwell, Spencer / Carter, Steele / Elshenawy, Ahmad / Dovgalecs, Vladislavs / Saleem, Safiyyah / Braga, Daniela / Kennewick, Bob: "Data collection and annotation for state-of-the-art NER using unmanaged crowds", 2789-2793.
Polzehl, Tim / Naderi, Babak / Köster, Friedemann / Möller, Sebastian: "Robustness in speech quality assessment and temporal training expiry in mobile crowdsourcing environments", 2794-2798.
Naderi, Babak / Polzehl, Tim / Wechsung, Ina / Köster, Friedemann / Möller, Sebastian: "Effect of trapping questions on the reliability of speech quality judgments in a crowdsourcing paradigm", 2799-2803.
Leemann, Adrian / Kolly, Marie-José / Goldman, Jean-Philippe / Dellwo, Volker / Hove, Ingrid / Almajai, Ibrahim / Grimm, Sarah / Robert, Sylvain / Wanitsch, Daniel: "Voice Äpp: a mobile app for crowdsourcing Swiss German dialect data", 2804-2808.
Loukina, Anastassia / Lopez, Melissa / Evanini, Keelan / Suendermann-Oeft, David / Zechner, Klaus: "Expert and crowdsourced annotation of pronunciation errors for automatic scoring systems", 2809-2813.
Kacorri, Hernisa / Shinkawa, Kaoru / Saito, Shin: "Capcap: an output-agreement game for video captioning", 2814-2818.
Burgos, Pepi / Sanders, Eric / Cucchiarini, Catia / Hout, Roeland van / Strik, Helmer: "Auris populi: crowdsourced native transcriptions of Dutch vowels spoken by adult Spanish learners", 2819-2823.
Wray, Samantha / Ali, Ahmed: "Crowdsource a little to label a lot: labeling a speech corpus of dialectal Arabic", 2824-2828.
Gaur, Yashesh / Metze, Florian / Miao, Yajie / Bigham, Jeffrey P.: "Using keyword spotting to help humans correct captioning faster", 2829-2833.
Byun, Tara McAllister / Hitchcock, Elaine / Harel, Daphna: "Validating and optimizing a crowdsourced method for gradient measures of child speech", 2834-2838.
Wang, Zhong-Qiu / Wang, DeLiang: "Joint training of speech separation, filterbank and acoustic model for robust automatic speech recognition", 2839-2843.
Rath, Shakti / Sivadas, Sunil / Ma, Bin: "Joint environment and speaker normalization using factored front-end CMLLR", 2844-2848.
Abe, Akihiro / Yamamoto, Kazumasa / Nakagawa, Seiichi: "Robust speech recognition using DNN-HMM acoustic model combining noise-aware training with spectral subtraction", 2849-2853.
Yu, Chengzhu / Ogawa, Atsunori / Delcroix, Marc / Yoshioka, Takuya / Nakatani, Tomohiro / Hansen, John H. L.: "Robust i-vector extraction for neural network adaptation in noisy environment", 2854-2857.
Borsky, Michal / Mizera, Petr / Pollak, Petr: "Spectrally selective dithering for distorted speech recognition", 2858-2861.
Lu, Liang / Renals, Steve: "Feature-space speaker adaptation for probabilistic linear discriminant analysis acoustic models", 2862-2866.
Cardinal, Patrick / Dehak, Najim / Zhang, Yu / Glass, James: "Speaker adaptation using the i-vector technique for bottleneck features", 2867-2871.
Karanasou, Penny / Gales, Mark J. F. / Woodland, Philip C.: "I-vector estimation using informative priors for adaptation of deep neural networks", 2872-2876.
Garimella, Sri / Mandal, Arindam / Strom, Nikko / Hoffmeister, Bjorn / Matsoukas, Spyros / Parthasarathi, Sree Hari Krishnan: "Robust i-vector based adaptation of DNN acoustic model for speech recognition", 2877-2881.
Tomashenko, Natalia / Khokhlov, Yuri: "GMM-derived features for effective unsupervised adaptation of deep neural network acoustic models", 2882-2886.
Hsiao, Roger / Ng, Tim / Tsakalidis, Stavros / Nguyen, Long / Schwartz, Richard: "Unsupervised adaptation for deep neural network using linear least square method", 2887-2891.
Li, Sheng / Lu, Xugang / Akita, Yuya / Kawahara, Tatsuya: "Ensemble speaker modeling using speaker adaptive training deep neural network for speaker adaptation", 2892-2896.
Doulaty, Mortaza / Saz, Oscar / Hain, Thomas: "Data-selective transfer learning for multi-domain speech recognition", 2897-2901.
Lustyk, Tomas / Bergl, Petr / Haderlein, Tino / Nöth, Elmar / Cmejla, Roman: "Language-independent method for analysis of German stuttering recordings", 2947-2951.
Al-nasheri, Ahmed / Ali, Zulfiqar / Muhammad, Ghulam / Alsulaiman, Mansour: "An investigation of MDVP parameters for voice pathology detection on three different databases", 2952-2956.
Wu, Jiantao / Yu, Ping / Yan, Nan / Wang, Lan / Yang, Xiaohui / Ng, Manwa L.: "Energy distribution analysis and nonlinear dynamical analysis of adductor spasmodic dysphonia", 2957-2961.
Kasisopa, Benjawan / Klangpornkun, Nittayapa / Burnham, Denis: "Auditory-visual tone perception in hearing impaired Thai listeners", 2962-2966.
Rong, Panying / Yunusova, Yana / Green, Jordan R.: "Speech intelligibility decline in individuals with fast and slow rates of ALS progression", 2967-2971.
A, Rong Na / Mori, Koichi / Sakai, Naomi: "Latency analysis of speech shadowing reveals processing differences in Japanese adults who do and do not stutter", 2972-2976.
Bigi, Brigitte / Klessa, Katarzyna / Georgeton, Laurianne / Meunier, Christine: "A syllable-based analysis of speech temporal organization: a comparison between speaking styles in dysarthric and healthy populations", 2977-2981.
Meyer, Bernd T. / Kollmeier, Birger / Ooster, Jasper: "Autonomous measurement of speech intelligibility utilizing automatic speech recognition", 2982-2986.
Knoll, Monja Angelika / Johnstone, Melissa / Blakely, Charlene: "Can you hear me? acoustic modifications in speech directed to foreigners and hearing-impaired people", 2987-2990.
Yeung, Yu Ting / Wong, Ka Ho / Meng, Helen: "Improving automatic forced alignment for dysarthric speech transcription", 2991-2995.
Włodarczak, Marcin / Heldner, Mattias / Edlund, Jens: "Communicative needs and respiratory constraints", 3051-3055.
Reichel, Uwe D. / Pörner, Nina / Nowack, Dianne / Cole, Jennifer: "Analysis and classification of cooperative and competitive dialogs", 3056-3060.
Cervone, Alessandra / Lai, Catherine / Pareti, Silvia / Bell, Peter: "Towards automatic detection of reported speech in dialogue using prosodic cues", 3061-3065.
Rosenberg, Andrew / Fernandez, Raul / Ramabhadran, Bhuvana: "Modeling phrasing and prominence using deep recurrent learning", 3066-3070.
Looze, Céline De / Yanushevskaya, Irena / Murphy, Andy / O'Connor, Eoghan / Gobl, Christer: "Pitch declination and reset as a function of utterance duration in conversational speech data", 3071-3075.
Freeman, Valerie / Levow, Gina-Anne / Wright, Richard / Ostendorf, Mari: "Investigating the role of 'yeah' in stance-dense conversation", 3076-3080.
Choi, Jiyoun / Broersma, Mirjam / Cutler, Anne: "Enhanced processing of a lost language: linguistic knowledge or linguistic skill?", 3110-3114.
Grohe, Ann-Kathrin / Poarch, Gregory J. / Hanulíková, Adriana / Weber, Andrea: "Production inconsistencies delay adaptation to foreign accents", 3115-3119.
Ordin, Mikhail / Polyanskaya, Leona: "Acquisition of English speech rhythm by monolingual children", 3120-3124.
Scharenborg, Odette: "Durational information in word-initial lexical embeddings in spoken Dutch", 3125-3129.
Chen, Fei / Yan, Nan / Wang, Lan / Yang, Tao / Wu, Jiantao / Zhao, Han / Peng, Gang: "The development of categorical perception of lexical tones in Mandarin-speaking preschoolers", 3130-3134.
Ooigawa, Tomohiko: "Perception of Italian liquids by Japanese listeners: comparisons to Spanish liquids", 3135-3139.
Saon, George / Kuo, Hong-Kwang J. / Rennie, Steven / Picheny, Michael: "The IBM 2015 English conversational telephone speech recognition system", 3140-3144.
Liu, Xunying / Flego, Federico / Wang, Linlin / Zhang, C. / Gales, Mark J. F. / Woodland, Philip C.: "The Cambridge University 2014 BOLT conversational telephone Mandarin Chinese LVCSR system for speech translation", 3145-3149.
Thomas, Samuel / Saon, George / Kuo, Hong-Kwang J. / Mangu, Lidia: "The IBM BOLT speech transcription system", 3150-3153.
Shaik, M. Ali Basha / Tüske, Zoltán / Tahir, M. Ali / Nußbaum-Thom, Markus / Schlüter, Ralf / Ney, Hermann: "Improvements in RWTH LVCSR evaluation systems for Polish, Portuguese, English, Urdu, and Arabic", 3154-3158.
Fraga-Silva, Thiago / Gauvain, Jean-Luc / Lamel, Lori / Laurent, Antoine / Le, Viet-Bac / Messaoudi, Abdel: "Active learning based data selection for limited resource STT and KWS", 3159-3163.
Jyothi, Preethi / Hasegawa-Johnson, Mark: "Improved Hindi broadcast ASR by adapting the language model and pronunciation model using a priori syntactic and morphophonemic knowledge", 3164-3168.
Versteegh, Maarten / Thiollière, Roland / Schatz, Thomas / Cao, Xuan Nga / Anguera, Xavier / Jansen, Aren / Dupoux, Emmanuel: "The Zero Resource Speech Challenge 2015", 3169-3173.
Badino, Leonardo / Mereta, Alessio / Rosasco, Lorenzo: "Discovering discrete subword units with binarized autoencoders and hidden-Markov-model encoders", 3174-3178.
Thiollière, Roland / Dunbar, Ewan / Synnaeve, Gabriel / Versteegh, Maarten / Dupoux, Emmanuel: "A hybrid dynamic time warping-deep neural network architecture for unsupervised acoustic modeling", 3179-3183.
Agenbag, Wiehan / Niesler, Thomas: "Automatic segmentation and clustering of speech using sparse coding and metaheuristic search", 3184-3188.
Chen, Hongjie / Leung, Cheung-Chi / Xie, Lei / Ma, Bin / Li, Haizhou: "Parallel inference of Dirichlet process Gaussian mixture models for unsupervised acoustic modeling: a feasibility study", 3189-3193.
Baljekar, Pallavi / Sitaram, Sunayana / Muthukumar, Prasanna Kumar / Black, Alan W.: "Using articulatory features and inferred phonological segments in zero resource speech processing", 3194-3198.
Renshaw, Daniel / Kamper, Herman / Jansen, Aren / Goldwater, Sharon: "A comparison of neural network methods for unsupervised representation learning on the Zero Resource Speech Challenge", 3199-3203.
Räsänen, Okko / Doyle, Gabriel / Frank, Michael C.: "Unsupervised word discovery from speech using automatic segmentation into syllable-like units", 3204-3208.
Lyzinski, Vince / Sell, Gregory / Jansen, Aren: "An evaluation of graph clustering methods for unsupervised term discovery", 3209-3213.
Peddinti, Vijayaditya / Povey, Daniel / Khudanpur, Sanjeev: "A time delay neural network architecture for efficient modeling of long temporal contexts", 3214-3218.
Li, Xiangang / Wu, Xihong: "Long short-term memory based convolutional recurrent neural networks for large vocabulary speech recognition", 3219-3223.
Zhang, C. / Woodland, Philip C.: "Parameterised sigmoid and ReLU hidden activation functions for DNN acoustic modelling", 3224-3228.
Zhang, Chiyuan / Voinea, Stephen / Evangelopoulos, Georgios / Rosasco, Lorenzo / Poggio, Tomaso: "Discriminative template learning in group-convolutional networks for invariant speech representations", 3229-3233.
Sivadas, Sunil / Wu, Zhenzhou / Ma, Bin: "Investigation of parametric rectified linear units for noise robust speech recognition", 3234-3238.
Su, Hang / Xu, Haihua: "Multi-softmax deep neural network for semi-supervised training", 3239-3243.
Cui, Jia / Saon, George / Ramabhadran, Bhuvana / Kingsbury, Brian: "A multi-region deep neural network model in speech recognition", 3244-3248.
Lu, Liang / Zhang, Xingxing / Cho, Kyunghyun / Renals, Steve: "A study of the recurrent neural network encoder-decoder for large vocabulary speech recognition", 3249-3253.
Zhu, Linchen / Kilgour, Kevin / Stüker, Sebastian / Waibel, Alex: "Gaussian free cluster tree construction using deep neural network", 3254-3258.
Bi, Mengxiao / Qian, Yanmin / Yu, Kai: "Very deep convolutional neural networks for LVCSR", 3259-3263.
Chan, William / Ke, Nan Rosemary / Lane, Ian: "Transferring knowledge from a RNN to a DNN", 3264-3268.
Liu, Changliang / Li, Jinyu / Gong, Yifan: "SVD-based universal DNN modeling for multiple scenarios", 3269-3273.
Chen, Zhuo / Watanabe, Shinji / Erdogan, Hakan / Hershey, John R.: "Speech enhancement and recognition using multi-task learning of long short-term memory recurrent neural networks", 3274-3278.
Liu, Yuzhou / Wang, DeLiang: "Speaker-dependent multipitch tracking using deep neural networks", 3279-3283.
Sujith P. / Prathosh, A. P. / Ramakrishnan, A. G. / Ghosh, Prasanta Kumar: "An error correction scheme for GCI detection algorithms using pitch smoothness criterion", 3284-3288.
Prasad, RaviShankar / Yegnanarayana, B.: "Robust pitch estimation in noisy speech using ZTW and group delay function", 3289-3292.
Huang, Zhaoqiong / Zhan, Ge / Ying, Dongwen / Yan, Yonghong: "Robust localization of single sound source based on phase difference regression", 3293-3297.
Salvati, Daniele / Drioli, Carlo / Foresti, Gian Luca: "Frequency map selection using a RBFN-based classifier in the MVDR beamformer for speaker localization in reverberant rooms", 3298-3301.
Ma, Ning / Brown, Guy J. / May, Tobias: "Exploiting deep neural networks and head movements for binaural localisation of multiple speakers in reverberant conditions", 3302-3306.
Nie, Shuai / Xue, Wei / Liang, Shan / Zhang, Xueliang / Liu, Wenju / Qiao, Liwei / Li, Jianping: "Joint optimization of recurrent networks exploiting source auto-regression for source separation", 3307-3311.
Gong, Rong / Cuvillier, Philippe / Obin, Nicolas / Cont, Arshia: "Real-time audio-to-score alignment of singing voice based on melody and lyric information", 3312-3316.
Lee, Jun-Yong / Cho, Hye-Seung / Kim, Hyoung-Gook: "Vocal separation from monaural music using adaptive auditory filtering based on kernel back-fitting", 3317-3320.
Yen, Frederick Z. / Huang, Mao-Chang / Chi, Tai-Shih: "A two-stage singing voice separation algorithm using spectro-temporal modulation features", 3321-3324.
Lim, Hyungjun / Kim, Myung Jong / Kim, Hoirin: "Robust sound event classification using LBP-HOG based bag-of-audio-words feature representation", 3325-3329.
Tiainen, Mikko / Vainio, Lari / Tiippana, Kaisa / Komeilipoor, Naeem / Vainio, Martti: "Action planning and congruency effect between articulation and grasping", 3390-3393.
Hecht, Ron M. / Bar-Hillel, Aharon / Tiomkin, Stas / Levi, Hadar / Tsimhoni, Omer / Tishby, Naftali: "Cognitive workload and vocabulary sparseness: theory and practice", 3394-3398.
Andrei, Valentin / Cucu, Horia / Buzo, Andi / Burileanu, Corneliu: "Counting competing speakers in a timeframe — human versus computer", 3399-3403.
Chen, Fei / Kwok, Alexander Siu Tai: "Segmental contribution to the intelligibility of ideal binary-masked sentences", 3404-3407.
Ishida, Mako / Arai, Takayuki: "Perception of an existing and non-existing L2 English phoneme behind noise by Japanese native speakers", 3408-3411.
Bhat, Chitralekha / Kopparapu, Sunil: "Viseme comparison based on phonetic cues for varying speech accents", 3412-3416.
O'Reilly, Colm / Marples, Nicola M. / Kelly, David J. / Harte, Naomi: "Quantifying difference in vocalizations of bird populations", 3417-3421.
Choi, Jae / Kim, Jeunghun / Kang, Shin Jae / Kim, Nam Soo: "Reverberation-robust acoustic indoor localization", 3422-3425.
Ming, Huaiping / Huang, Dong-Yan / Xie, Lei / Li, Haizhou / Dong, Minghui: "An alternating optimization approach for phase retrieval", 3426-3430.
Xiao, Xiong / Zhao, Shengkui / Zhong, Xionghu / Jones, Douglas L. / Chng, Eng Siong / Li, Haizhou: "Learning to estimate reverberation time in noisy and reverberant rooms", 3431-3435.
Pang, Cheng / Zhang, Jie / Liu, Hong: "Direction of arrival estimation based on reverberation weighting and noise error estimator", 3436-3440.
Phan, Huy / Hertel, Lars / Maass, Marco / Mazur, Radoslaw / Mertins, Alfred: "Representing nonspeech audio signals through speech classification models", 3441-3445.
Ferrer, Luciana / McLaren, Mitchell / Lawson, Aaron / Graciarena, Martin: "Mitigating the effects of non-stationary unseen noises on language recognition performance", 3446-3450.
Ajili, Moez / Bonastre, Jean-François / Rossato, Solange / Kahn, Juliette / Lapidot, Itshak: "An information theory based data-homogeneity measure for voice comparison", 3451-3455.
Dean, David / Kanagasundaram, Ahilan / Ghaemmaghami, Houman / Rahman, Md. Hafizur / Sridharan, Sridha: "The QUT-NOISE-SRE protocol for the evaluation of noisy speaker recognition", 3456-3460.
Aronowitz, Hagai: "Score stabilization for speaker recognition trained on a small development set", 3461-3465.
Misra, Abhinav / Ranjan, Shivesh / Zhang, Chunlei / Hansen, John H. L.: "Anti-spoofing system: an investigation of measures to detect synthetic and human speech", 3466-3470.
Carne, Michael J.: "A likelihood ratio-based forensic voice comparison in microphone vs. mobile mismatched conditions using Japanese /ai/", 3471-3475.
Wester, Mirjam / Valentini-Botinhao, Cassia / Henter, Gustav Eje: "Are we using enough listeners? no! - an empirically-supported critique of Interspeech 2014 TTS evaluations", 3476-3480.
Chevelu, Jonathan / Lolive, Damien / Maguer, Sébastien Le / Guennec, David: "How to compare TTS systems: a new subjective evaluation methodology focused on differences", 3481-3485.
Latacz, Lukas / Verhelst, Werner: "Double-ended prediction of the naturalness ratings of the Blizzard Challenge 2008-2013", 3486-3490.
Nose, Takashi / Arao, Yusuke / Kobayashi, Takao / Sugiura, Komei / Shiga, Yoshinori / Ito, Akinori: "Entropy-based sentence selection for speech synthesis using phonetic and prosodic contexts", 3491-3495.
Koriyama, Tomoki / Kobayashi, Takao: "A comparison of speech synthesis systems based on GPR, HMM, and DNN with a small amount of training data", 3496-3500.
Ullmann, Raphael / Rasipuram, Ramya / Magimai-Doss, Mathew / Bourlard, Hervé: "Objective intelligibility assessment of text-to-speech systems through utterance verification", 3501-3505.
Fohr, Dominique / Illina, Irina: "Continuous word representation using neural networks for proper name retrieval from diachronic documents", 3506-3510.
Chen, X. / Tan, T. / Liu, Xunying / Lanchantin, Pierre / Wan, M. / Gales, Mark J. F. / Woodland, Philip C.: "Recurrent neural network language model adaptation for multi-genre broadcast speech recognition", 3511-3515.
Jin, Wengong / He, Tianxing / Qian, Yanmin / Yu, Kai: "Paragraph vector based topic model for language model adaptation", 3516-3520.
Yeh, Ching-Feng / Liou, Yuan-ming / Lee, Hung-yi / Lee, Lin-shan: "Personalized speech recognizer with keyword-based personalized lexicon and language model using word vector representations", 3521-3525.
Li, Sheng / Akita, Yuya / Kawahara, Tatsuya: "Discriminative data selection for lightly supervised training of acoustic model using closed caption texts", 3526-3530.
Das, Amit / Hasegawa-Johnson, Mark: "Cross-lingual transfer learning during supervised training in low resource scenarios", 3531-3535.
Astudillo, Ramón F. / Watanabe, Shinji / Abdelaziz, Ahmed Hussen / Kolossa, Dorothea: "Robust speech processing using observation uncertainty and uncertainty propagation: session and paper overview" (abstract).
Ribas, Dayana / Vincent, Emmanuel / Calvo, José Ramón: "Uncertainty propagation for noise robust speaker recognition: the case of NIST-SRE", 3536-3540.
Tachioka, Yuuki / Watanabe, Shinji: "Uncertainty training and decoding methods of deep neural networks based on stochastic representation of enhanced features", 3541-3545.
Saeidi, Rahim / Alku, Paavo: "Accounting for uncertainty of i-vectors in speaker recognition using uncertainty propagation and modified imputation", 3546-3550.
Mallidi, Sri Harish / Ogawa, Tetsuji / Veselý, Karel / Nidadavolu, Phani S. / Hermansky, Hynek: "Autoencoder based multi-stream combination for noise robust speech recognition", 3551-3555.
Huemmer, Christian / Maas, Roland / Schwarz, Andreas / Astudillo, Ramón F. / Kellermann, Walter: "Uncertainty decoding for DNN-HMM hybrid systems based on numerical sampling", 3556-3560.
Abdelaziz, Ahmed Hussen / Watanabe, Shinji / Hershey, John R. / Vincent, Emmanuel / Kolossa, Dorothea: "Uncertainty propagation through deep neural networks", 3561-3565.
Kühne, Marco: "Handling derivative filterbank features in bounded-marginalization-based missing data automatic speech recognition", 3566-3570.
Narayanan, Arun / Misra, Ananya / Chin, Kean K.: "Large-scale, sequence-discriminative, joint adaptive training for masking-based robust ASR", 3571-3575.
Astudillo, Ramón F. / Correia, Joana / Trancoso, Isabel: "Integration of DNN based speech enhancement and ASR", 3576-3580.
Zhang, C. / Woodland, Philip C.: "A general artificial neural network extension for HTK", 3581-3585.
Ko, Tom / Peddinti, Vijayaditya / Povey, Daniel / Khudanpur, Sanjeev: "Audio augmentation for speech recognition", 3586-3589.
Zhang, Xiaohui / Povey, Daniel / Khudanpur, Sanjeev: "A diversity-penalizing ensemble training method for deep learning", 3590-3594.
Kurata, Gakuto / Willett, Daniel: "Deep neural network training emphasizing central frames", 3595-3599.
Chen, Kai / Yan, Zhi-Jie / Huo, Qiang: "Training deep bidirectional LSTM acoustic model for LVCSR by a context-sensitive-chunk BPTT approach", 3600-3604.
Swietojanski, Pawel / Bell, Peter / Renals, Steve: "Structured output layer with auxiliary targets for context-dependent acoustic modelling", 3605-3609.
Bell, Peter / Renals, Steve: "Complementary tasks for context-dependent deep neural network acoustic models", 3610-3614.
Li, Jie / Zhang, Heng / Cai, Xinyuan / Xu, Bo: "Towards end-to-end speech recognition for Chinese Mandarin using long short-term memory recurrent neural networks", 3615-3619.
Chen, Mingming / Yang, Zhanlei / Liang, Jizhong / Li, Yanpeng / Liu, Wenju: "Improving deep neural networks based multi-accent Mandarin speech recognition using i-vectors and accent-specific top layer", 3620-3624.
Huang, Zhen / Li, Jinyu / Siniscalchi, Sabato Marco / Chen, I-Fan / Wu, Ji / Lee, Chin-Hui: "Rapid adaptation for deep neural networks through multi-task learning", 3625-3629.
Parthasarathi, Sree Hari Krishnan / Hoffmeister, Bjorn / Matsoukas, Spyros / Mandal, Arindam / Strom, Nikko / Garimella, Sri: "fMLLR based feature-space speaker adaptation of DNN acoustic models", 3630-3634.
Li, Xiangang / Wu, Xihong: "I-vector dependent feature space transformations for adaptive speech recognition", 3635-3639.
Doulaty, Mortaza / Saz, Oscar / Hain, Thomas: "Unsupervised domain discovery using latent Dirichlet allocation for acoustic modelling in speech recognition", 3640-3644.
Asami, Taichi / Masumura, Ryo / Masataki, Hirokazu / Okamoto, Manabu / Sakauchi, Sumitaka: "Training data selection for acoustic modeling via submodular optimization of joint Kullback-Leibler divergence", 3645-3649.
Cho, Eunah / Kilgour, Kevin / Niehues, Jan / Waibel, Alex: "Combination of NN and CRF models for joint detection of punctuation and disfluencies", 3650-3654.
Lau, Tze Siong / Chen, I-Fan / Lee, Chin-Hui: "Tunable keyword-aware language modeling and context dependent fillers for LVCSR-based spoken keyword search", 3655-3659.
Wang, Haipeng / Ragni, Anton / Gales, Mark J. F. / Knill, Kate M. / Woodland, Philip C. / Zhang, C.: "Joint decoding of tandem and hybrid systems for improved keyword spotting on low resource languages", 3660-3664.
Do, Quoc Truong / Takamichi, Shinnosuke / Sakti, Sakriani / Neubig, Graham / Toda, Tomoki / Nakamura, Satoshi: "Preserving word-level emphasis in speech-to-speech translation using linear regression HSMMs", 3665-3669.
Ngo, Hoang Gia / Chen, Nancy F. / Nguyen, Binh Minh / Ma, Bin / Li, Haizhou: "Phonology-augmented statistical transliteration for low-resource languages", 3670-3674.
Oouchi, Kazuki / Konno, Ryota / Akyu, Takahiro / Konno, Kazuma / Kojima, Kazunori / Tanaka, Kazuyo / Lee, Shi-wook / Itoh, Yoshiaki: "Evaluation of re-ranking by prioritizing highly ranked documents in spoken term detection", 3675-3679.
Saxena, Abhijeet / Yegnanarayana, B.: "Distinctive feature based representation of speech for query-by-example spoken term detection", 3680-3684.
Lee, Shi-wook / Tanaka, Kazuyo / Itoh, Yoshiaki: "Combination of diverse subword units in spoken term detection", 3685-3689.
Ram, Dhananjay / Asaei, Afsaneh / Dighe, Pranay / Bourlard, Hervé: "Sparse modeling of posterior exemplars for keyword detection", 3690-3694.
Nwe, Tin Lay / Xu, Qianli / Guan, Cuntai / Ma, Bin: "Stress level detection using double-layer subband filter", 3695-3699.
Trouvain, Jürgen / Truong, Khiet P.: "Prosodic characteristics of read speech before and after treadmill running", 3700-3704.
Truong, Khiet P. / Nieuwenhuys, Arne / Beek, Peter / Evers, Vanessa: "A database for analysis of speech under physical stress: detection of exercise intensity while running and talking", 3705-3709.
Paul, Will / Alm, Cecilia Ovesdotter / Bailey, Reynold / Geigel, Joe / Wang, Linwei: "Stressed out: what speech tells us about stress", 3710-3714.
Tsiartas, Andreas / Kathol, Andreas / Shriberg, Elizabeth / Zambotti, Massimiliano de / Willoughby, Adrian: "Prediction of heart rate changes from speech features during interaction with a misbehaving dialog system", 3715-3719.
Pietrowicz, Mary / Hasegawa-Johnson, Mark / Karahalios, Karrie: "Acoustic correlates for perceived effort levels in expressive speech", 3720-3724.
Daoudi, Khalid / Kumar, Ashwini Jaya: "Pitch-based speech perturbation measures using a novel GCI detection algorithm: application to pathological voice classification", 3725-3728.
Vergyri, Dimitra / Knoth, Bruce / Shriberg, Elizabeth / Mitra, Vikramjit / McLaren, Mitchell / Ferrer, Luciana / Garcia, Pablo / Marmar, Charles: "Speech-based assessment of PTSD in a military population using diverse feature classes", 3729-3733.
Yu, Bea / Quatieri, Thomas F. / Williamson, James R. / Mundt, James C.: "Cognitive impairment prediction in the elderly based on vocal biomarkers", 3734-3738.
Gómez-García, J.-A. / Moro-Velázquez, L. / Godino-Llorente, Juan Ignacio / Castellanos-Domínguez, G.: "Automatic age detection in normal and pathological voice", 3739-3743.