Contents

1 . Editorial

 Dear members,

This is the last ISCApad for 2009.

We had this year again excellent conferences and our activity in speech processing was still at a very high level: good papers, many job openings, new books.

I remind you that ISCA is an association at your service. Your advices are always welcome and we work to meet your recommendations whenever it is possible. This year, ISCApad will be redesigned in collaboration with new partners. Of course, all current services will be maintained but we try to made the access easier for all readers.

I wish to all of you an excellent end of year.

Seasons greetings.

 Prof. em. Chris Wellekens 

Institut Eurecom

Sophia Antipolis
France 

public@isca-speech.org

 

 
 
 

 

Back to Top

2 . ISCA News

 

Back to Top

2-1 . Message to students

Dear students,

The International Speech Communication Association ISCA is now opening an online system to build a database of résumés of researchers/students working in the various fields of speech communication.
The goal of  this service is to build a centralized place where many interested employers/corporations can access and search for potential candidates.

Please be advised that the posting service will be updated at 4 month intervals. Next switch will be mid October 2009.

We encourage all of you to upload an updated version of your résumé to: http://www.isca-speech.org/resumes/ and wish you good luck with a fruitful career.


Professor Helen Meng 

Back to Top

3 . Future ISCA Conferences and Workshops (ITRW)

 

Back to Top

3-1 . (2010-09-26) CfP INTERSPEECH 2010 Chiba Japan

FIRST CALL FOR PAPERS

 

INTERSPEECH is the world's largest and most comprehensive conference on issues surrounding the science and technology of spoken language processing both in humans and in machines. It is our great pleasure to host INTERSPEECH 2010 in Japan, the birthplace of ICSLP, which has held two ICSLPs, in Kobe and Yokohama, in the past. The theme of INTERSPEECH 2010 is "Spoken Language Processing for All Ages, Health Conditions, Native Languages and Environments". INTERSPEECH 2010 emphasizes an interdisciplinary approach covering all aspects of speech science and technology spanning the basic theories to applications. Besides regular oral and poster sessions, plenary talks by internationally renowned experts, tutorials, exhibits, and special sessions are planned. We invite you to submit original papers in any related area, including but not limited to:

HUMAN SPEECH PRODUCTION, PERCEPTION AND COMMUNICATION

* Human speech production * Human speech and sound perception * Linguistics, phonology and phonetics * Discourse and dialogue * Prosody (e.g. production, perception, prosodic structure, modeling) * Paralinguistic and nonlinguistic cues (e.g. emotion and expression) * Physiology and pathology of spoken language * Spoken language acquisition, development and learning * Speech and other modalities (e.g. facial expression, gesture)

SPEECH AND LANGUAGE TECHNOLOGY

* Speech analysis and representation * Speech segmentation * Audio segmentation and classification * Speaker turn detection * Speech enhancement * Speech coding and transmission * Voice conversion * Speech synthesis and spoken language generation * Automatic speech recognition * Spoken language understanding * Language and dialect identification * Cross-lingual and multi-lingual speech processing * Multimodal/multimedia signal processing (including sign languages) * Speaker characterization and recognition * Signal processing for music and song * Spoken language technology for prosthesis, rehabilitation, wellness and welfare

SPOKEN LANGUAGE SYSTEMS AND APPLICATIONS

* Spoken dialogue systems * Systems for information extraction/retrieval * Systems for spoken language translation * Applications for aged and handicapped persons * Applications for learning and education * Other applications

RESOURCES, STANDARDIZATION AND EVALUATION

 * Spoken language resources and annotation * Evaluation and standardization of spoken language systems

 PAPER SUBMISSION

Papers for the INTERSPEECH 2010 proceedings should be up to four pages in length and conform to the format given in the paper preparation guidelines and author kits which will be available on the INTERSPEECH 2010 website along with the Final Call for Papers. Optionally, authors may submit additional files, such as multimedia files, to be included on the Proceedings CD-ROM. Authors shall also declare that their contributions are original and not being submitted for publication elsewhere (e.g. another conference, workshop, or journal). Papers must be submitted via the on-line paper submission system, which will open early in 2010. The deadline for submitting a paper is 30 April 2010. This date will not be extended. Inquiries regarding paper submissions should be directed via email to submission@interspeech2010.org.

LANGUAGE

The working language of the conference is English.

IMPORTANT DATES

Paper submission deadline: 30 April 2010

Notification of acceptance or rejection: 2 July 2010

Camera-ready paper due: 9 July 2010

Early registration deadline: 28 July 2010

Conference dates: 26-30 September 2010

WEBSITE & MAIL

http://www.interspeech2010.org/

mail: office@interspeech2010.org

VENUE

Makuhari Messe International Conference Hall Nakase 2-1, Mihama-ku, Chiba-city Chiba 261-0023 Japan http://www.m-messe.co.jp/en/access/index.html

 

 

 

Back to Top

3-2 . (2010-09-26) Call for Tutorials at Interspeech 2010 Chiba Japan

Call for Tutorials

The Interspeech 2010 Organization Committee invites proposals for Tutorials at Interspeech 2010 - the 11th Annual Conference of the International Speech Communication Association, to be held on September 26-30, 2010, in Makuhari, Chiba, Japan. The theme of INTERSPEECH 2009 is "Spoken Language Processing for All Ages, Health Conditions, Native Languages and Environments".

Proposal Submission

The following information should be included in each Tutorial proposal:

  1. Title of the proposed tutorial
  2. Names and affiliation of the presenters, 
    including a brief biography and contact information
  3. Introduction (1 page), 
    including the importance of the topic, the objectives of the proposed tutorial, and brief outline of the main target audience.
  4. Presentation outline (max. 2 pages ), 
    an outline of the presentation and, if multiple presenters are proposed, how the presentation will be shared.
  5. Requirements, 
    including any special equipment necessary for the presentation.

Proposals will be evaluated by the Organization Committee based on the relevance and significance of the topic and potential interest to the conference attendees.

Submission Procedure

Prospective tutorial presenters should submit proposals by email to Tatsuya Kawahara (kawahara@i.kyoto-u.ac.jp) and Masato Akagi (akagi@jaist.ac.jp) before January 4, 2010. Notification of acceptance of proposals is scheduled for January 31, 2010. 

Back to Top

3-3 . (2011-08-27) INTERSPEECH 2011 Florence Italy

Interspeech 2011

Palazzo dei Congressi,  Italy, August 27-31, 2011.

Organizing committee

Piero Cosi (General Chair),

Renato di Mori (General Co-Chair),

Claudia Manfredi (Local Chair),

Roberto Pieraccini (Technical Program Chair),

Maurizio Omologo (Tutorials),

Giuseppe Riccardi (Plenary Sessions).

More information www.interspeech2011.org

Back to Top

3-4 . (2010-06-28) Odyssey 2010 Brno Czech Republic

Odyssey 2010: The Speaker and Language Recognition Workshop will be hosted by Brno University of Technology in Brno, Czech Republic. Odyssey’10 is an ISCA Tutorial and Research Workshop held in cooperation with the ISCA Speaker and Language Characterization SIG. The need for fast, efficient, accurate, and robust means of recognizing people and languages is of growing importance for commercial, forensic, and government applications. The aim of this workshop is to continue to foster interactions among researchers in speaker and language recognition as the successor of previous successful events held in Martigny (1994), Avignon (1998), Crete (2001), Toledo (2004), San Juan (2006) and Stellenbosch (2008). 

http://www.speakerodyssey.com

 

Topics

Topics of interest include speaker and language recognition (verification, identification, segmentation, and clustering): text-dependent and -independent speaker recognition; multispeaker training and detection; speaker characterization and adaptation; features for speaker recognition; robustness in channels; robust classification and fusion; speaker recognition corpora and evaluation; use of extended training data; speaker recognition with speech recognition; forensics, multimodality, and multimedia speaker recognition; speaker and language confidence estimation; language, dialect, and accent recognition; speaker synthesis and transformation; biometrics; human recognition of speaker and language; and commercial applications.

Schedule

Draft papers due:
15 February 2010
Notification of acceptance:
16 April 2010
Final papers due:
30 April 2010
Preliminary program:
17 May 2010
Workshop:
28 June – 1 July 2010 
Back to Top

4 . Workshops and conferences supported (but not organized) by ISCA

 

Back to Top

4-1 . (2009-12-13) ASRU 2009

IEEE ASRU2009
Automatic Speech Recognition and Understanding Workshop
Merano, Italy December 13-17, 2009
http://www.asru2009.org/
 
We are happy to inform you that IEEE ASRU 2009 registrations are now available at: http://www.asru2009.org/index.php/registration  Please note the   1) At least one author/paper needs to register 2) We have a limited number of slots for registrations and priority will be given to authors of accepted papers until October 7 2009.     AFTER that registrations will accepted on a first-come first-serve basis
The eleventh biannual IEEE workshop on Automatic Speech Recognition
and Understanding (ASRU) will be held on December 13-17, 2009.
The ASRU workshops have a tradition of bringing together
researchers from academia and industry in an intimate and
collegial setting to discuss problems of common interest in
automatic speech recognition and understanding.

Workshop topics

• automatic speech recognition and understanding
• human speech recognition and understanding
• speech to text systems
• spoken dialog systems
• multilingual language processing
• robustness in ASR
• spoken document retrieval
• speech-to-speech translation
• spontaneous speech processing
• speech summarization
• new applications of ASR.

The workshop program will consist of invited lectures, oral
and poster presentations,  and panel discussions. Prospective
 authors are invited to submit full-length, 4-6 page papers,
including figures and references, to the ASRU 2009 website
http://www.asru2009.org/.
All papers will be handled and reviewed electronically.
The website will provide you with further details. Please note
that the submission dates for papers are strict deadlines.

IMPORTANT DATES

Paper submission deadline         July 15, 2009
Paper notification of acceptance     September 3, 2009
Demo session proposal deadline        September 24, 2009
Early registration deadline        October 7, 2009
Workshop                 December 13-17, 2009


Please note that the number of attendees will be limited and
priority will be given to paper presenters. Registration will
be handled via the ASRU 2009 website,
http://www.asru2009.org/, where more information on the workshop
will be available.

General Chairs
    Giuseppe Riccardi, U. Trento, Italy
    Renato De Mori, U. Avignon, France

Technical Chairs
    Jeff Bilmes, U. Washington, USA
    Pascale Fung, HKUST, Hong Kong China
    Shri Narayanan, USC, USA
    Tanja Schultz, U. Karlsruhe, Germany

Panel Chairs
    Alex Acero, Microsoft, USA
    Mazin Gilbert, AT&T, USA

Demo Chairs
    Alan Black, CMU, USA
    Piero Cosi, CNR, Italy

Publicity Chairs
    Dilek Hakkani-Tür, ICSI, USA
    Isabel Trancoso, INESC -ID/IST, Portugal

Publication Chair
    Giuseppe di Fabbrizio, AT&T, USA

Local Chair
    Maurizio Omologo, FBK-irst, Italy 
 
 
 
 
 
 
Back to Top

4-2 . (2009-12-14) 6th International Workshop on Models and Analysis of Vocal Emissions for Biomedical Applications MAVEBA 2009

University degli Studi di Firenze Italy
Department of Electronics and Telecommunications
6th International Workshop
on
Models and Analysis of Vocal Emissions for Biomedical
Applications
MAVEBA 2009
December 14 - 16, 2009
Firenze, Italy
http://maveba.det.unifi.it
Speech is the primary means of communication among humans, and results from
complex interaction among vocal folds vibration at the larynx and voluntary articulators
movements (i.e. mouth tongue, jaw, etc.). However, only recently has research
focussed on biomedical applications. Since 1999, the MAVEBA Workshop is
organised every two years, aiming to stimulate contacts between specialists active in
clinical, research and industrial developments in the area of voice signal and images
analysis for biomedical applications. This sixth Workshop will offer the participants
an interdisciplinary platform for presenting and discussing new knowledge in the field
of models, analysis and classification of voice signals and images, as far as both
adults, singing and children voices are concerned. Modelling the normal and
pathological voice source, analysis of healthy and pathological voices, are among the
main fields of research. The aim is that of extracting the main voice characteristics,
together with their deviation from “healthy conditions”, ranging from fundamental
research to all kinds of biomedical applications and related established and advanced
technologies.
SCIENTIFIC PROGRAM
linear and non-linear models of voice
signals;
physical and mechanical models;
aids for disabled;
measurement devices (signal and image); prostheses;
robust techniques for voice and glottal
analysis in time, frequency, cepstral,
wavelet domain;
neural networks, artificial intelligence and
other advanced methods for pathology
classification;
linguistic and clinical phonetics; new-born infant cry analysis;
neurological dysfunction; multiparametric/multimodal analysis;
imaging techniques (laryngography,
videokymography, fMRI);
voice enhancement;
protocols and database design;
Industrial applications in the biomedical
field;
singing voice;
speech/hearing interactions;
DEADLINES
30 May 2009 Submission of extended abstracts (1-2 pages, 1 column)
/special session proposal
30 July 2009 Notification of paper acceptance
30 September 2009 Final full paper submission (4 pages, 2 columns, pdf
format) and early registration
14-16 December 2009 Conference venue
SPONSORS
ENTE CRF Ente Cassa di Risparmio di Firenze
IEEE EMBS
IEEE Engineering in Medicine and Biology
Society
ELSEVIER Eds.
Biomedical Signal Processing and Control
ISCA
International Speech and Communication
Association
A.I.I.M.B.
Associazione Italiana di Ingegneria Medica e
Biologica
COST Action
2103 Europ. COop. in Science & Tech. Research
FURTHER INFORMATION
Claudia Manfredi – Conference Chair
Department of Electronics and
Telecommunications
Via S. Marta 3, 50139 Firenze, Italy
Phone: +39-055-4796410
Fax: +39-055-494569
E-mail: claudia.manfredi@unifi.it
Piero Bruscaglioni
Department of Physics
Polo Scientifico Sesto Fiorentino, 50019
Firenze,Italy
Phone: +39-055-4572038
Fax: +39-055-4572356
E-mail: piero.bruscaglioni@unifi.it
Back to Top

4-3 . (2010-05-03) Workshop on Spoken Languages Technologies for Under-Resourced Languages (SLTU'10)

Workshop on Spoken Languages Technologies for Under-Resourced Languages 
(SLTU’10)
 
*The second International Workshop on Spoken Languages Technologies for 
Under-resourced languages (SLTU’10) will be held at Universiti Sains Malaysia 
(USM), Penang, Malaysia, May 3 to May 5, 2010.* Workshop 
supported by ISCA, AFCP and CNRS. 
 
The first workshop on Spoken Languages Technologies for Under-Resourced 
Languages was organized in Hanoi, Vietnam, in 2008 by Multimedia, 
Information, Communication and Applications (MICA) research center in 
Vietnam and /Laboratoire d’Informatique de Grenoble/ (LIG) in France. 
This first workshop gathered 40 participants during two days. 
 
For 2010, we intend to attract more participants, especially from the 
local regional zone (Malaysia, Indonesia, Singapore, Thailand, 
Australia, ...). The workshop will take place inside USM in Penang, 
Malaysia. SLTU research workshop will focus on spoken language 
processing for under-resourced languages and aims at gathering 
researchers working on: 
 
   * ASR, synthesis and translation for under-resourced languages 
   * portability issues 
   * multilingual spoken language processing 
   * fast resources acquisition (speech, text, lexicons, parallel corpora) 
   * spoken language processing for languages with rich morphology 
   * spoken language processing for languages without separators 
   * spoken language processing for languages without writing system 
   * NLP for rare or endangered languages 
   * … 
 
*Important dates* 
 
* Paper submission: December 15, 2009 
 
* Notification of Paper Acceptance: February 15, 2010 
 
* Author Registration Deadline: March 1, 2010 
 
*Workshop Web site* 
 
* * 
 
http://www.mica.edu.vn/sltu-2010 
 
 
 
*Workshop Chairs* 
 
Laurent Besacier 
 
Eric Castelli 
 
Dr. Chan Huah Yong 
Back to Top

4-4 . (2010-05-19) CfP LREC 2010 - 7th Conference on Language Resources and Evaluation

LREC 2010 - 7th Conference on Language Resources and Evaluation
 
MEDITERRANEAN CONFERENCE CENTRE, VALLETTA - MALTA
 
***LREC2010 ABSTRACT SUBMISSION IS NOW OPEN!***

IMPORTANT NOTE

 Special Highlight: Contribute to building the LREC2010 Map!

 
 
MAIN CONFERENCE: 19-20-21 MAY 2010
WORKSHOPS and TUTORIALS: 17-18 MAY and 22-23 MAY 2010
 
Conference web site: http://www.lrec-conf.org/lrec2010/
 
 
The seventh international conference on Language Resources and Evaluation (LREC) will be organised in 2010 by ELRA in cooperation with a wide range of international associations and organisations.
 
 
CONFERENCE AIMS
 
In 12 years – the first LREC was held in Granada in 1998 – LREC has become the major event on Language Resources (LRs) and Evaluation for Human Language Technologies (HLT). The aim of LREC is to provide an overview of the state-of-the-art, explore new R&D directions and emerging trends, exchange information regarding LRs and their applications, evaluation methodologies and tools, ongoing and planned activities, industrial uses and needs, requirements coming from the e-society, both with respect to policy issues and to technological and organisational ones. 
 
LREC provides a unique forum for researchers, industrials and funding agencies from across a wide spectrum of areas to discuss problems and opportunities, find new synergies and promote initiatives for international cooperation, in support to investigations in language sciences, progress in language technologies and development of corresponding products, services and applications, and standards.
 
 
Special Highlight: Contribute to building the LREC2010 Map!
 
LREC2010 recognises that time is ripe to launch an important initiative, the LREC2010 Map of Language Resources, Technologies and Evaluation. The Map will be a collective enterprise of the LREC community, as a first step towards the creation of a very broad, community-built, Open Resource Infrastructure. As first in a series, it will become an essential instrument to monitor the field and to identify shifts in the production, use and evaluation of LRs and LTs over the years.
 
When submitting a paper, from the START page you will be asked to fill in a very simple template to provide essential information about resources (in a broad sense that includes technologies, standards, evaluation kits, etc.) that either have been used for the work described in the paper or are a new result of your research. 
 
The Map will be disclosed at LREC, where some event(s) will be organised around this initiative. 
 
 
CONFERENCE TOPICS
 
Issues in the design, construction and use of Language Resources (LRs): text, speech, other associated media and modalities
•    Guidelines, standards, specifications, models and best practices for LRs
•    Methodologies and tools for LRs construction and annotation
•    Methodologies and tools for the extraction and acquisition of knowledge
•    Ontologies and knowledge representation
•    Terminology 
•    Integration between (multilingual) LRs, ontologies and Semantic Web technologies
•    Metadata descriptions of LRs and metadata for semantic/content markup
•    Validation, quality assurance, evaluation of LRs
Exploitation of LRs in different types of systems and applications 
•    For: information extraction, information retrieval, speech dictation, mobile communication, machine translation, summarisation, semantic search, text mining, inferencing, reasoning, etc.
•    In different types of interfaces: (speech-based) dialogue systems, natural language and multimodal/multisensorial interactions, voice activated services, cognitive systems, etc.
•    Communication with neighbouring fields of applications, e.g. e-government, e-culture, e-health, e-participation, mobile applications, etc. 
•    Industrial LRs requirements, user needs
Issues in Human Language Technologies evaluation
•    HLT Evaluation methodologies, protocols and measures
•    Benchmarking of systems and products
•    Usability evaluation of HLT-based user interfaces (speech-based, text-based, multimodal-based, etc.), interactions and dialogue systems
•    Usability and user satisfaction evaluation
General issues regarding LRs & Evaluation
•    National and international activities and projects
•    Priorities, perspectives, strategies in national and international policies for LRs
•    Open architectures 
•    Organisational, economical and legal issues 
 
 
PROGRAMME
 
The Scientific Programme will include invited talks, oral presentations, poster and demo presentations, and panels. 
There is no difference in quality between oral and poster presentations. Only the appropriateness of the type of communication (more or less interactive) to the content of the paper will be considered.
 
 
SUBMISSIONS AND DATES
 
Submitted abstracts of papers for oral and poster or demo presentations should consist of about 1500-2000 words.
•    Submission of proposals for oral and poster/demo papers: 31 October 2009 
 
Proposals for panels, workshops and tutorials will be reviewed by the Programme Committee.
•    Submission of proposals for panels, workshops and tutorials: 31 October 2009
 
 
PROCEEDINGS
 
The Proceedings on CD will include both oral and poster papers, in the same format. They will be added to the ELRA web archives before the conference.
A Book of Abstracts will be printed.
 
 
CONFERENCE PROGRAMME COMMITTEE 
Nicoletta Calzolari, Istituto di Linguistica Computazionale del CNR - Pisa, Italy (Conference chair)
Khalid Choukri - ELRA, Paris, France
Bente Maegaard - CST, University of Copenhagen, Denmark
Joseph Mariani - LIMSI-CNRS and IMMI, Orsay, France
Jan Odijk - UIL-OTS, Utrecht, The Netherlands 
Stelios Piperidis - Institute for Language and Speech Processing (ILSP), Athens, Greece
Mike Rosner – Department of Intelligent Computer Systems, University of Malta, Malta
Daniel Tapias - Sigma Technologies S.L., Madrid, Spain
Back to Top

4-5 . (2010-05-25) CfP JEP 2010

JEP 2010
         XXVIIIèmes Journées d'Étude sur la Parole
 
                    Université de Mons, Belgique
 
                         du 25 au 28 mai 2010
 
                        http://w3.umh.ac.be/jep2010
 
=====================================================================
 
Les Journées d'Études de la Parole (JEP) sont consacrées à l'étude de la communication parlée ainsi qu'à ses applications. Ces journées ont pour but de rassembler l'ensemble des communautés scientifiques francophones travaillant dans le domaine. La conférence se veut aussi un lieu d'échange convivial entre doctorants et chercheurs confirmés.
 
En 2010, les JEP sont organisées par le Laboratoire des Sciences de la Parole de l'Académie Wallonie-Bruxelles, sur le site de l'Université de Mons en Belgique, sous l'égide de l'AFCP 
(Association Francophone de la Communication Parlée) avec le  soutien de l'ISCA (International Speech Communication Association).
Un second appel à communication précisant les thèmes ainsi que les modalités de soumission suivra ce premier appel.
 
 
 
CALENDRIER
===========
Date limite de soumission:          11 janvier 2010
Notification aux auteurs:             15    mars 2010
Conférence:                                 25-28 mai 2010
 
 
 
 
V. Delvaux
Chargée de Recherches FNRS
Laboratoire de Phonétique
Service de Métrologie et Sciences du Langage
Université de Mons-Hainaut
18, Place du Parc
7000 Mons
Belgium
+3265373140
 
http://staff.umh.ac.be/Delvaux.Veronique/index.html
Back to Top

5 . Books,databases and softwares

 

Back to Top

5-1 . Books

 

This section shows recent books whose titles been have communicated by the authors or editors.
 
Also some advertisements for recent books in speech are included.
 
This book presentation is written by the authors and not by this newsletter editor or any  volunteer reviewer.

 

Back to Top

5-1-1 . Sprachverarbeitung -- Grundlagen und Methoden der Sprachsynthese und Spracherkennung

Title: Sprachverarbeitung -- Grundlagen und Methoden 
       der Sprachsynthese und Spracherkennung 
Authors: Beat Pfister, Tobias Kaufmann 
Publisher: Springer 
Year: 2008 
Website: http://www.springer.com/978-3-540-75909-6 
Back to Top

5-1-2 . Digital Speech Transmission

Digital Speech Transmission
Authors: Peter Vary and Rainer Martin
Publisher: Wiley&Sons
Year: 2006
Back to Top

5-1-3 . Distant Speech Recognition,

Distant Speech Recognition, Matthias Wölfel and John McDonough (2009), J. Wiley & Sons.
 
 Please link the title to http://www.distant-speech-recognition.com 
 
In the very recent past, automatic speech recognition (ASR) systems have attained acceptable performance when used with speech captured with a head-mounted or close-talking microphone (CTM). The performance of conventional ASR systems, however, degrades dramatically as soon as the microphone is moved away from the mouth of the speaker. This degradation is due to a broad variety of effects that are not found in CTM speech, including background noise, overlapping speech from other speakers, and reverberation. While conventional ASR systems underperform for speech captured with far-field sensors, there are a number of techniques developed in other areas of signal processing that can mitigate the deleterious effects of noise and reverberation, as well as separating speech from overlapping speakers. Distant Speech Recognition presents a contemporary and comprehensive description of both theoretic abstraction and practical issues inherent in the distant ASR problem.
Back to Top

5-1-4 . Automatic Speech and Speaker Recognition: Large Margin and Kernel Methods

Automatic Speech and Speaker Recognition: Large Margin and Kernel Methods
Joseph Keshet and Samy Bengio, Editors
John Wiley & Sons
March, 2009
Website:  Automatic Speech and Speaker Recognition: Large Margin and Kernel Methods
 
About the book:
This is the first book dedicated to uniting research related to speech and speaker recognition based on the recent advances in large margin and kernel methods. The first part of the book presents theoretical and practical foundations of large margin and kernel methods, from support vector machines to large margin methods for structured learning. The second part of the book is dedicated to acoustic modeling of continuous speech recognizers, where the grounds for practical large margin sequence learning are set. The third part introduces large margin methods for discriminative language modeling. The last part of the book is dedicated to the application of keyword-spotting, speaker
verification and spectral clustering. 
Contributors: Yasemin Altun, Francis Bach, Samy Bengio, Dan Chazan, Koby Crammer, Mark Gales, Yves Grandvalet, David Grangier, Michael I. Jordan, Joseph Keshet, Johnny Mariéthoz, Lawrence Saul, Brian Roark, Fei Sha, Shai Shalev-Shwartz, Yoram Singer, and Nathan Srebo. 
 
 
 
Back to Top

5-1-5 . Some aspects of Speech and the Brain.

Some aspects of Speech and the Brain. 
Susanne Fuchs, Hélène Loevenbruck, Daniel Pape, Pascal Perrier
Editions Peter Lang, janvier 2009
 
What happens in the brain when humans are producing speech or when they are listening to it ? This is the main focus of the book, which includes a collection of 13 articles, written by researchers at some of the foremost European laboratories in the fields of linguistics, phonetics, psychology, cognitive sciences and neurosciences.
 
-- 
Back to Top

5-1-6 . Spoken Language Processing,

Spoken Language Processing, edited by Joseph Mariani (IMMI and
LIMSI-CNRS, France). ISBN: 9781848210318. January 2009. Hardback 504 pp

Publisher ISTE-Wiley

Speech processing addresses various scientific and technological areas. It includes speech analysis and variable rate coding, in order to store or transmit speech. It also covers speech synthesis, especially from text, speech recognition, including speaker and language identification, and spoken language understanding. This book covers the following topics: how to realize speech production and perception systems, how to synthesize and understand speech using state-of-the-art methods in signal processing, pattern recognition, stochastic modeling, computational linguistics and human factor studies. 


More on its content can be found at
http://www.iste.co.uk/index.php?f=a&ACTION=View&id=150

Back to Top

5-1-7 . L'imagerie medicale pour l'etude de la parole

 L'imagerie medical pour l'etude de la parole,

Alain Marchal, Christian Cave

Eds Hermes Lavoisier

99 euros • 304 pages • 16 x 24 • 2009 • ISBN : 978-2-7462-2235-9

Du miroir laryngé à la vidéofibroscopie actuelle, de la prise d'empreintes statiques à la palatographie dynamique, des débuts de la radiographie jusqu'à l'imagerie par résonance magnétique ou la magnétoencéphalographie, cet ouvrage passe en revue les différentes techniques d'imagerie utilisées pour étudier la parole tant du point de vue de la production que de celui de la perception. Les avantages et inconvénients ainsi que les limites de chaque technique sont passés en revue, tout en présentant les principaux résultats acquis avec chacune d'entre elles ainsi que leurs perspectives d'évolution. Écrit par des spécialistes soucieux d'être accessibles à un large public, cet ouvrage s'adresse à tous ceux qui étudient ou abordent la parole dans leurs activités professionnelles comme les phoniatres, ORL, orthophonistes et bien sûr les phonéticiens et les linguistes.

 
 

 

 

 

Back to Top

5-1-8 . Korpusbasierte Sprachverarbeitung

Author: Christoph Draxler
Title: Korpusbasierte Sprachverarbeitung
Publisher: Narr Francke Attempto Verlag Tübingen
Year: 2008
Link: http://www.narr.de/details.php?catp=&p_id=16394

Summary: Spoken language is a major area of linguistic research and speech technology development. This handbook presents an introduction to the technical foundations and shows how speech data is collected, annotated, analysed, and made accessible in the form of speech databases. The book focuses on web-based procedures for the recording and processing of high quality speech data, and it is intended as a desktop reference for practical recording and annotation work. A chapter is devoted to the Ph@ttSessionz database, the first large-scale speech data collection (860+ speakers, 40 locations in Germany) performed via the Internet. The companion web site (http://www.narr-studienbuecher.de/Draxler/index.html) contains audio examples, software tools, solutions to the exercises, important links, and checklists. 

Back to Top

5-2 . Database providers

 

Back to Top

5-2-1 . LDC News

 In this newsletter:

- LDC Incentives: Early Renewal Discounts for Membership Year (MY) 2010 -

 

 

 

 

New Publications:
 
 

 

- NXT Switchboard Annotations -



LDC Incentives:  Early Renewal Discounts for Membership Year (MY) 2010

LDC appreciates the important contribution LDC members make through their continued support of the consortium.  We would like to invite all current and previous members of LDC to renew for Membership Year (MY) 2010.  For MY2010, LDC is pleased to maintain membership fees at last year’s rates – membership fees will not increase.  Additionally, in last month's newsletter, we announced an LDC Incentives Package which will include a host of incentives to help lower the cost of LDC membership and data licensing fees.  As part of this package, LDC will extend discounts to members who keep their membership current and who join early in the year.

The details of our Early Renewal Discounts for MY2010 are as follows:

  • Organizations who joined for MY2009, will receive a 5% discount when renewing. This discount will apply throughout 2010, regardless of time of renewal. MY2009 members renewing before March 1, 2010 will receive an additional 5% discount, for a total 10% discount off the membership fee.
  • New members as well as organizations who did not join for MY2009, but who held membership in any of the previous MY's (1993-2008), will also be eligible for a 5% discount provided that they join/renew before March 1, 2010.

The Membership Fee Table provides exact pricing information.

 

MY2010 Fee

MY2010 Fee
with 5% Discount *

MY2010 Fee
with 10% Discount **

Not-for-Profit

 

 

 

 

Standard

US$2400

US$2280

US$2160

 

Subscription

US$3850

US$3657.50

US$3465

For-Profit

 

 

 

 

Standard

US$24000

US$22800

US$21600

 

Subscription

US$27500

US$26125

US$24750

*   For MY2009 Members renewing for MY2010 and any previous year Member who renews before March 1, 2010

** For MY2009 Members renewing before March 1, 2010

Publications for MY2010 are still being planned but it will be another productive year with a broad selection of publications.  The working titles of data sets we intend to provide include:

Arabic Treebank: Part 2 v 4.0

Fisher Spanish

Chinese Treebank 7.0

LCTL Bengali

Chinese Web N-gram Version 1.0

NPS Chat Corpus

 

In addition to receiving new publications, current year members of the LDC also enjoy the benefit of licensing older data at reduced costs; current year for-profit members may use most data for commercial applications.

This past year, nearly 100 organizations who renewed membership or joined early received a discount on membership fees for MY2009.  Taken together, these members saved over US$50,000!  Be sure to keep an eye out on your mail - all LDC members have been sent an invitation to join letter and renewal invoice for MY2010.  Renew early for MY2010 and save today!


[ top ]

 

LDC at NWAV 38

LDC exhibited at NWAV for the third straight year. We were delighted to interact with so many talented sociolinguistic researchers and to introduce numerous attendees to LDC and our data catalog. LDC distributed free copies of both the SLX Corpus of Classic Sociolinguistic Interviews, as per the terms of the Timebank grant, and the 2008 LDC Spoken Language Sampler, which is available for download here. We also distributed many of our newly minted data sheets, including one featuring the speech annotation tool XTrans. This tool is also freely available from our website in Linux and Windows formats.   

LDC’s Executive Director Chris Cieri and Senior Associate Director Stephanie Strassel presented papers on the following topics:

·         Models of Phonological Variation for Multi-dialectal Communities: the case of L’Aquila

·         Closer Still to a Robust, All Digital, Empirical, Reproducible Sociolinguistic Methodology

Thanks again to everyone who stopped by our display and we look forward to seeing you again next year!

[ top ]

New Publications

(1) 2007 NIST Language Recognition Evaluation Supplemental Training Set consists of 118 hours of conversational telephone speech segments in the following languages and dialects: Arabic (Egyptian colloquial), Bengali, Min Nan Chinese, Wu Chinese, Taiwan Mandarin, Cantonese, Russian, Mexican Spanish, Thai, Urdu and Tamil.

The goal of the NIST (National Institute of Standards and Technology) Language Recognition Evaluation (LRE) is to establish the baseline of current performance capability for language recognition of conversational telephone speech and to lay the groundwork for further research efforts in the field. NIST conducted three previous language recognition evaluations, in 1996, 2003 and 2005. The most significant differences between those evaluations and the 2007 task were the increased number of languages and dialects, the greater emphasis on a basic detection task for evaluation and the variety of evaluation conditions. Thus, in 2007, given a segment of speech and a language of interest to be detected (i.e., a target language), the task was to decide whether that target language was in fact spoken in the given telephone speech segment (yes or no), based on an automated analysis of the data contained in the segment.

The supplemental training material in this release consists of the following:

  • Approximately 53 hours of conversational telephone speech segments in Arabic (Egyptian colloquial), Bengali, Cantonese, Min Nan Chinese,Wu Chinese, Russian, Thai and Urdu. This material is taken from LDC's CALLHOME, CALLFRIEND and Mixer collections.
  • Approximately 65 hours of full telephone conversations in Mandarin Chinese (Taiwan), Spanish (Mexican) and Tamil. This material was collected by Oregon Health and Science University (OHSU), Beaverton, Oregon. The test segments used in the 2005 NIST Language Recognition Evaluation were derived from these full conversations.

In addition to the supplemental material contained in this release, the training data for the 2007 NIST Language Recognition Evaluation consisted of data from previous LRE evaluation test sets, namely, 2003 NIST Language Recognition Evaluation and 2005 NIST Language Recognition Evaluation.

2007 NIST Language Recognition Evaluation Supplemental Training Set is distributed on one DVD-ROM.

2009 Subscription Members will automatically receive two copies of this corpus.  2009 Standard Members may request a copy as part of their 16 free membership corpora. Non-members may license this data for US$1500.

[ top ]

*

(2) French Gigaword Second Edition is a comprehensive archive of newswire text data that has been acquired over several years by LDC. This second edition updates French Gigaword First Edition (LDC2006T7) and adds material collected from August 1, 2006 through December 31, 2008.

The two distinct international sources of French newswire in this edition, and the time spans of collection covered for each, are as follows:

  • Agence France-Presse (afp_fre) May 1994 - Dec 2008
  • Associated Press Worldstream, French (apw_fre) Nov 1994 - Dec 2008

The seven-letter codes in parentheses include the three-character source name abbreviations and the three-character language code ("fre") separated by an underscore ("_") character. The three-letter language code conforms to LDC's internal convention based on the ISO 639-3 standard. These codes are used in the directory names where the data files are found and in the prefix that appears at the beginning of every data file name. They are also used (in all UPPER CASE) as the initial portion of the DOC "id" strings that uniquely identify each news story.

The overall totals for each source are summarized below. The "Totl-MB" numbers show the amount of data obtained when the files are uncompressed (i.e., approximately 15 gigabytes, total); the "Gzip-MB" column shows totals for compressed file sizes as stored on the DVD-ROM; and the "K-wrds" numbers are the number of whitespace-separated tokens (of all types) after all SGML tags are eliminated.

Source

#Files

Gzip-MB

Totl-MB

K-wrds

#DOCs

AFP_FRE

172

2408

4079

560000

2060803

APW_FRE

171

2280

1719

241324

0872573

TOTAL

343

4688

5789

801324

2933376

The data has undergone a consistent extent of quality control to eliminate out-of-band content and other obvious forms of corruption. Since the source data is generated manually on a daily basis, there will be a small percentage of human errors common to all sources: missing whitespace, incorrect or variant spellings, badly formed sentences, and so on, as are normally seen in newspapers. No attempt has been made to address this property of the data.

French Gigaword Second Edition is distributed on one DVD-ROM.

2009 Subscription Members will automatically receive two copies of this corpus.  2009 Standard Members may request a copy as part of their 16 free membership corpora. Non-members may license this data for US$4000.

[ top ]

*

(3) NXT Switchboard Annotations, brings together in NITE XML, a single XML format, the multiple layers of annotation performed on a transcript subset from Switchboard 1- Release 2, LDC97S62. NXT Switchboard Annotations was developed in a collaboration among researchers from Edinburgh University, Stanford University and the University of Washington.

The original Switchboard corpus is a collection of spontaneous telephone conversations between previously unacquainted speakers of American English on a variety of topics chosen from a pre-determined list. A subset of one million words from those conversations was annotated for syntactic structure and disfluencies as part of the Penn Treebank project. Phonetic transcripts were generated by the International Computer Science Institute, University of California Berkeley and later corrected by the Institute for Signal Information Processing, Mississippi State Univeristy. The Penn Treebank transcripts provided the basis for the NXT Switchboard corpus, and the noun phrases from that subset were annotated for animacy. The Treebank transcript was then aligned with the corresponding subset from the corrected Mississippi State (MS-State) transcript in order to provide word timing information. Focus/contrast and prosodic annotations, as well as phone/syllable alignment were next added to the annotations. The previous annotations of dialog acts and prosody were converted to NITE XML. Lastly, hand annotations for markables were added to provide information about their animacy and information structure, including coreferential links.

NXT is an open source toolkit that enables multiple linguistic annotations to be assembled into a unified database. It uses a stand-off XML data format that consists of several XML files that point to each other. The NXT format provides a data model that describes how the various annotations for a corpus relate to one another. For that reason, it does not impose any particular linguistic theory or any particular markup structure. Instead, users define their annotations in a "metadata" file that expresses their contents and how they relate to each other in terms of the graph structure for the corpus annotations overall. The relationships that can be defined in the data model draw annotations together into a set of intersecting trees, but also allow arbitrary links between annotations over the top of this structure, giving a representation that is highly expressive, easier to process than arbitrary graphs and structured in a way that helps data users. NXT's other core component is a query language designed specifically for working with data conforming to this data model. Together, the data model and query language allow annotations to be treated as one coherent set containing both structural and timing information.

NXT Switchboard Annotations is distributed via web download.

2009 Subscription Members will automatically receive two copies of this corpus.  2009 Standard Members may request a copy as part of their 16 free membership corpora.  Non-members may license this data for US$25.  NXT Switchboard Annotations is made available to LDC not-for-profit members and all non-members under the Creative Commons Attribution-Noncommercial Share Alike 3.0 license. NXT Switchboard Annotations is available to LDC's for-profit members under the terms of their For-Profit Membership Agreements. 

 

 

 
Back to Top

5-2-2 . ELRA - Language Resources Catalogue - Update

*****************************************************************
ELRA - Language Resources Catalogue - Update
*****************************************************************

ELRA is happy to announce that 1 new Terminology Database, 3 Speech Desktop/Microphone resources and 1 Speech Telephone resource are now available in its catalogue:
*
**ELRA-T0373 BioLexicon *
BioLexicon is a large-scale English terminological resource which has been developed to address the needs emerging in text mining efforts in the biomedical domain. It contains over 2.2M lexical entries (over 3.3M semantic relations), and information on over 1.8M variants and on over 2M synonymy relations. BioLexicon is available in a relational database format (MySQL dump format) and it adheres to the EAGLES/ISO standards for lexical resources.
For more information, see: http://catalog.elra.info/product_info.php?products_id=1113

*ELRA-S0301 Norwegian EUROM1 (EUROM1_N)*
EUROM1 is the first really multilingual speech database produced in Europe. Over 60 speakers per language pronounced numbers, sentences, isolated words using close talking microphone.
For more information, see: http://catalog.elra.info/product_info.php?products_id=1114

*ELRA-S0302 TC-STAR female baseline voice: Laura*
Laura contains the recordings of one female English (British) speaker recorded in a noise-reduced room through a headset microphone. It consists of the recordings and annotations of read text material of approximately 10 hours of speech for baseline applications (Text-to-Speech systems).
For more information, see: http://catalog.elra.info/product_info.php?products_id=1115

*ELRA-S0303 TC-STAR male baseline voice: Ian*
Ian contains the recordings of one male English (British) speaker recorded in a noise-reduced room through a headset microphone. It consists of the recordings and annotations of read text material of approximately 10 hours of speech for baseline applications (Text-to-Speech systems).
For more information, see: http://catalog.elra.info/product_info.php?products_id=1116

*ELRA-S0304 SpeechDat(M) Italian Mobile Network Speech Database*
This speech database contains the recordings of 342 Italian speakers recorded over the Italian mobile telephone network. Each speaker uttered around 40 read and spontaneous items.
For more information, see: http://catalog.elra.info/product_info.php?products_id=1117


For more information on the catalogue, please contact Valérie Mapelli mailto:mapelli@elda.org

Visit our On-line Catalogue: http://catalog.elra.info
Visit the Universal Catalogue: http://universal.elra.info
Archives of ELRA Language Resources Catalogue Updates: http://www.elra.info/LRs-Announcements.html   

Back to Top

5-2-3 . The ELRA Catalogue of Language Resources at OLAC

 The ELRA Catalogue of Language Resources at OLAC

In the framework of its ongoing collaborative work with the OLAC community and in order to improve its compliance with OLAC standards, ELRA is pleased to announce that the OLAC archive of its Catalogue of Language Resources has been updated. According to OLAC standards, the ELRA Catalogue archive achieves the maximum rate of 5 stars on the OLAC scale.
 
The ELRA Catalogue has been exported into OLAC standards by means of associating the ELRA metadata for describing Language Resources with the OLAC metadata. Daily updates are performed through the automatic harvesting of the ELRA Catalogue which is carried out by the OLAC server.

With this work, ELRA aims at rendering its catalogue more visible to the wider community and it contributes to sharing information on Language Resources through standardized metadata sets.

To visit the ELRA Catalogue: http://catalogue.elra.info

and the corresponding OLAC archive: http://www.language-archives.org/archive/catalogue.elra.info

 

*** About ELRA ***
The European Language Resources Association (ELRA) is a non-profit making organisation founded by the European Commission in 1995, with the mission of providing a clearing house for language resources and promoting Human Language Technologies (HLT).

To find out more about ELRA, please visit our web site:
http://www.elra.info

*** About OLAC ***
OLAC, the Open Language Archives Community, is an international partnership of institutions and individuals who are creating a worldwide virtual library of language resources by: (i) developing consensus on best current practice for the digital archiving of language resources, and (ii) developing a network of interoperating repositories and services for housing and accessing such resources.

 

To find out more about OLAC, please visit the following website: http://www.language-archives.org. From that website, ELRA resources can be searched, alongside resources from 40 other language archives.


Back to Top

5-2-4 . QAST/ELDA

 QAST (Question Answering on Speech Transcripts) is a CLEF track providing a framework in which QA systems can be evaluated when the
answers have to be extracted from speech transcriptions (manual and automatic transcriptions).

Three QAST  evaluation campaigns have already been carried out in 2007, 2008 and 2009 (http://www.lsi.upc.edu/~qast/2009/).

To help us identify a list of potential participants for QAST 2010, we would be very grateful if you could take a few minutes to fill this
(very short) on-line questionnaire:

http://survey.elda.org/index.php?sid=89265&lang=en

You are welcome to answer the questionnaire even if you are not willing to (or cannot) participate next year. We would be happy to have your
opinion on this task in any case.

Feel free to forward this email to whoever might be interested in QAST. We will collect answers until the end of next week.

Contact: Nicolas Moreau (moreau@elda.org)

Back to Top

5-2-5 . ELDA Distribution campaign 2009

 ELDA Distribution Campaign 2009
*****************************************************************

ELDA is launching a special distribution campaign offering very favorable conditions for the language resources acquisition,
including discounts on public prices, from the ELRA Catalogue of Language Resources (see http://catalog.elra.info).

This offer will be open until the end of December 2009.
 
For more information on this offer, please contact Valérie Mapelli (mapelli@elda.org)

Visit our On-line Catalogue: http://catalog.elra.info
Visit the Universal Catalogue: http://universal.elra.info
Archives of ELRA Language Resources Catalogue Updates: http://www.elra.info/LRs-Announcements.html 

 
Back to Top

6 . Jobs openings

We invite all laboratories and industrial companies which have job offers to send them to the ISCApad editor: they will appear in the newsletter and on our website for free. (also have a look at http://www.isca-speech.org/jobs.html as well as http://www.elsnet.org/ Jobs). 

The ads will be automatically removed from ISCApad after  6 months. Informing ISCApad editor when the positions are filled will avoid irrelevant mails between applicants and proposers.


Back to Top

6-1 . (2009-06-19) POSTDOC POSITION in SPEECH RECOGNITION FOR UNDER-RESOURCED LANGUAGES

POSTDOC POSITION in SPEECH RECOGNITION FOR UNDER-RESOURCED LANGUAGES (18 months ; starting January 2010 or later) IN GRENOBLE (France)

=============================================================================

PI (ANR BLANC 2009-2012) is a cooperative project sponsored by the French National Research Agency, between the University of Grenoble (France), the University of Avignon (France), and the International Research Center MICA in Hanoï (Vietnam).


PI addresses spoken language processing (notably speech recognition) for under-resourced languages (or ?-languages). From a scientific point of view, the interest and originality of this project consists in proposing viable innovative methods that go far beyond the simple retraining or adaptation of acoustic and linguistic models. From an operational point of view, this project aims at providing a free open source ASR development kit for ?-languages. We plan to distribute and evaluate such a development kit by deploying ASR systems for new under-resourced languages with very poor resources from Asia (Khmer, Lao) and Africa (Bantu languages).

The POSTDOC position focus on the development of ASR for two low-resourced languages from Asia and Africa. This includes : supervising the resource collection (in relation with the language partners), propose innovative methods to quickly develop ASR systems for these languages, evaluation., etc.

The salary of the POSTDOC position is roughly 2300€ net per month. Applicants should hold a PhD related to spoken language processing. The applicants should be fluent in English. Competence in French is optional, though applicants will be encouraged to acquire this skill during the postdoc.

For further information, please contact Laurent Besacier (Laurent.Besacier at imag.fr).

Back to Top

6-2 . (2009-06-22) PhD studentship in speech and machine learning ESPCI ParisTech

Thesis topic : Vocal Prosthesis Based on Machine Learning
The objective of the thesis is to design and implement a vocal prosthesis to restore the original voice of persons who have lost the ability to speak due to a partial or total laryngectomy or a neurological problem. Using a miniature ultrasound machine and a video camera to drive a speech synthesizer, the device is intended to restore the original voice of these patients with as much fidelity as possible, allowing speech handicapped individuals to interact with those around them in a more natural and familiar way. The thesis work will build upon promising results obtained in the Ouisper project (funded on contract number ANR-06-BLAN-0166, http://www.neurones.espci.fr/ouisper/index.htm, and also supported by the French Defense Department, DGA), which terminates at the end of 2009; however, final success will require addressing the following four key technological issues:
1) New data acquisition protocol: the current acquisition system requires the user’s head to be immobilized during speech. The candidate will need to design and implement an innovative new system to overcome this constraint, which would be unacceptable for a real world application.
2) New dictionaries: results obtained thus far show that a truly open domain vocabulary may not be realistic. The candidate will create new dictionaries of vocabularies which are constrained, yet rich enough to be of genuine utility for verbal communication in the targeted speech handicapped community.
3) New synthesis methods: concatenative synthesis, though conceptually simple, is not sufficiently flexible when the initial recognition step contains errors. The candidate will devise new synthesis methods which better model the spectral qualities of the speaker’s voice, perhaps using Bayesian networks. Innovative techniques of recovering an acceptable prosody for the synthesized speech will also need to be developed.
4) Real time execution: as the amount of calculation necessary to carry out the recognition and synthesis steps is significant, the candidate will need to pay particular attention to optimization of code and real time execution in the algorithms he or she develops.
The thesis will be carried out in partnership with the Laboratoire de Phonétique et de Phonologie of the Université de Paris III, specializing in speech production and pathologies, for which additional funding has been obtained from the Agence Nationale de la Recherche (Emergence-TEC 2009 call, REVOIX project).
Back to Top

6-3 . (2009-06-30)Postdoctoral Fellowships in machine learning/statistics/machine vision at Monash University, Australia


Back to Top

6-4 . (2009-07-01) These: Vocal Prosthesis Based on Machine Learning (France)

  Vocal Prosthesis Based on Machine Learning(2)
These
DeadLine: 01/09/2009
denby@ieee.org
http://
We are looking for an excellent candidate for a PhD studentship in speech and statistical learning at the Laboratoire d'Electronique at ESPCI ParisTech, Paris, France. Interested candidates should contact Prof. B. Denby by mail at denby@ieee.org before 1 Septembre 2009 at the latest (earlier application is strongly encouraged).
Working language: French or English
Thesis topic : Vocal Prosthesis Based on Machine Learning
The objective of the thesis is to design and implement a vocal prosthesis to restore the original voice of persons who have lost the ability to speak due to a partial or total laryngectomy or a neurological problem. Using a miniature ultrasound machine and a video camera to drive a speech synthesizer, the device is intended to restore the original voice of these patients with as much fidelity as possible, allowing speech handicapped individuals to interact with those around them in a more natural and familiar way. The thesis work will build upon promising results obtained in the Ouisper project (funded on contract number ANR-06-BLAN-0166,
http://www.neurones.espci.fr/ouisper/index.htm,
and also supported by the French Defense Department, DGA), which terminates at the end of 2009; however, final success will require addressing the following four key technological issues:
1) New data acquisition protocol: the current acquisition system requires the users head to be immobilized during speech. The candidate will need to design and implement an innovative new system to overcome this constraint, which would be unacceptable for a real world application.
2) New dictionaries: results obtained thus far show that a truly open domain vocabulary may not be realistic. The candidate will create new dictionaries of vocabularies which are constrained, yet rich enough to be of genuine utility for verbal communication in the targeted speech handicapped community.
3) New synthesis methods: concatenative synthesis, though conceptually simple, is not sufficiently flexible when the initial recognition step contains errors. The candidate will devise new synthesis methods which better model the spectral qualities of the speaker's voice, perhaps using Bayesian networks. Innovative techniques of recovering an acceptable prosody for the synthesized speech will also need to be developed.
4) Real time execution: as the amount of calculation necessary to carry out the recognition and synthesis steps is significant, the candidate will need to pay particular attention to optimization of code and real time execution in the algorithms he or she develops.
The thesis will be carried out in partnership with the Laboratoire de Phonetique et de Phonologie of the Universite de Paris III, specializing in speech production and pathologies, for which additional funding has been obtained from the Agence Nationale de la Recherche (Emergence-TEC 2009 call, REVOIX project).
http://gdr-isis.org/rilk/gdr/Kiosque/poste.php?jobid=3369
Back to Top

6-5 . (2009-07-06) PhD in SPEECH RECOGNITION FOR UNDER-RESOURCED LANGUAGES (Grenoble France)


POSTDOC POSITION in SPEECH RECOGNITION FOR UNDER-RESOURCED LANGUAGES (18 months ; starting January 2010 or later) IN GRENOBLE (France)
=============================================================================

PI (ANR BLANC 2009-2012) is a cooperative project sponsored by the French National Research Agency, between the University of Grenoble (France), the University of Avignon (France), and the International Research Center MICA in Hanoï (Vietnam).

PI addresses spoken language processing (notably speech recognition) for under-resourced languages (or ?-languages). From a scientific point of view, the interest and originality of this project consists in proposing viable innovative methods that go far beyond the simple retraining or adaptation of acoustic and linguistic models. From an operational point of view, this project aims at providing a free open source ASR development kit for ?-languages. We plan to distribute and evaluate such a development kit by deploying ASR systems for new under-resourced languages with very poor resources from Asia (Khmer, Lao) and Africa (Bantu languages).


The POSTDOC position focus on the development of ASR for two low-ressourced languages from Asia and Africa. This includes : supervising the ressource collection (in relation with the language partners), propose innovative methods to quickly develop ASR systems for these languages, evaluation., etc.

The salary of the POSTDOC position is roughly 2300? net per month. Applicants should hold a PhD related to spoken language processing. The applicants should be fluent in English. Competence in French is optional, though applicants will be encouraged to acquire this skill during the postdoc.

For further information, please contact Laurent Besacier (Laurent.Besacier at imag.fr).
Back to Top

6-6 . (2009-07-08) Position at Deutsche Telekom R&D

Deutsche Telekom, one of the world’s leading telecommunications and information
technology service provider, is expanding its corporate research and development
activities at Deutsche Telekom Inc., R&D Lab USA, Los Altos, California. Having a close
collaboration with top-notch institutions, the laboratories offer an unprecedented
combination of academic and industrial research with opportunities to have a direct
impact on company’s products and services.
There is a current opening for a highly qualified Senior Research Scientist the
research field New Media for the area of Multimedia Communications and Systems.
We are looking for a self-driven and motivated individual who is passionate about
conducting leading-edge research. Applicants should have recently completed a
doctoral degree in computer science, electrical engineering, or other related disciplines
and have expertise in different facets of multimedia communications such as media
coding, streaming, and compression, with hands on system building experience and
know-how of standards. Experience in industrial R&D will be valued.
Application material should include, in a single pdf file, the following in stated order, (a)
cover letter, (b) one-page statement of research objectives, (c) curriculum vitae, (d) list
of publications, and (e) contact information of at least three individuals who may serve
as references. Short-listed candidates will be invited to give a talk and have interviews
with members of the recruiting committee.
Please submit your application until 22 July 2009.
Deutsche Telekom Inc. is an equal opportunity employer.
Applications should be submitted via email to:
Dr. Jatinder Pal Singh
Deutsche Telekom Inc., R&D Lab USA
Email: laboratories.researchscientist@telekom.de
Back to Top

6-7 . (2009-07-15) PhD at LIMSI Paris

Titre :
Modèles de l'expressivité pour la synthèse de récits courts, lus par un robot humanoïde.
Contenu:
Si les systèmes de synthèses actuels sont généralement suffisants pour lire des phrases de
façon neutre, il sont très vite pénibles à écouter, en particulier pour des textes assez long
(plusieurs paragraphes). Les systèmes de synthèse ne sont guère capables de rendre expressif
une narration. De même, les capacités motrices des robots humanoïdes sont actuellement peu
exploitées et développées pour l’expression par le geste et la posture.
Ce projet de recherche porte sur la synthèse expressive audiovisuelle de récits courts. Le
projet comprend deux aspects principaux. Dans une phase d’analyse, il s’agit de traiter
automatiquement des textes, courts récits de type « contes pour enfant », afin d'en extraire un
contenu pragmatique, sémantique, dialogique, narratif et émotionnel.
Ce contenu servira dans une seconde phase d'une part à la synthèse de prosodie expressive, et
d'autre part à alimenter un modèle comportemental en termes de postures, de gestes et autres
mouvements du robot humanoïde NAO.
Compétences requises:
Ce sujet est situé dans le domaine de l’interaction homme-machine expressive.
Il demande de posséder ou d'acquérir des compétences en informatique linguistique, tant du
point de vue de l'écrit que de l'oral, et si possible également du point de vue audio visuel.
Le projet contient une part significative de programmation pour l’analyse des texte et la
synthèse, mais aussi une part significative d’analyse linguistique (des textes), phonétique (de
la prosodie), et comportementale (posture et gestes).
Des profils de type informatique, science cognitive ou linguistique seront donc considérés.
Contexte et équipe d’accueil:
Cette thèse s’inscrit dans le contrat ANR GV-LEX.
Elle se déroulera au LIMSI-CNRS www.limsi.fr dans les groupes Audio & Acoustique,
Traitement du Langage Parlé, et Architecture et Modèles de l’Interaction
Cette thèse commencera dès septembre, financée par l’ANR pour une durée 3 ans.
Encadrement - contact:
La thèse sera encadrée par Christophe d’Alessandro, directeur de recherche au CNRS. Les
candidatures seront adressées aux quatre chercheurs impliqués dans ce projet :
Christophe d’Alessandro <cda@limsi.fr>
Jean-Claude Martin <martin@limsi.fr>
Sophie Rosset <sophie.rosset@limsi.fr>
Albert Rilliard <rilliard@limsi.fr>
Back to Top

6-8 . (2009-07-17) 2 PhD in Computational linguistics in Radboud University Nijmegen NL

Two PhD students for Second Language Acquisition/Computational Linguistics (1,0 fte)

Faculty of Arts, Radboud University, Nijmegen
Vacancy number: 23.24.09
Closing date: 1 September 2009

Job description
As a PhD student you will take part in the larger research project ‘Corrective feedback and the acquisition of syntax in oral proficiency’. The goal of this research project is to investigate the essential role of corrective feedback in L2 learning of syntax in oral proficiency. It will proceed from a granular level investigating the short-term effects of different types of feedback moves on different types of learners, to a global level by studying whether the granular, short-term effects also generalize to actual learning in the long term. Corrective feedback will be provided through a CALL system that makes use of automatic speech recognition. This will make it possible to assess the learner’s oral production online and to provide corrective feedback immediately under near-optimal conditions.
As a PhD student you will study which feedback moves lead to immediate uptake and acquisition in learners with a high level of education (PhD1) or learners with a low level of education (PhD2).
You are expected to start in November 2009. You will be part of an international and interdisciplinary team and will work in a motivating research environment.
For more information, see: http://lands.let.ru.nl/~strik/research/ASOP.html; http://www.ru.nl/cls/.

Requirements
You must have:
- a Master’s degree in (Applied) Linguistics, Computational Linguistics, Computer Science, Psycholinguistics, Artificial Intelligence, Cognitive Science or Education;
- programming skills (e.g. Matlab, Perl);
- an interest in second language acquisition;
- a working knowledge of Dutch and a good command of the English language.

Organization
The Faculty of Arts consists of eleven departments in the fields of language and culture, history, history of arts, linguistics and business communication, which together cater for about 2,800 students and collaborate closely in teaching and research. The project will be carried out at the Centre for Language Studies as part of the Linguistic Information Processing and Communicative Competences research programmes.
Website: http://www.ru.nl/cls/

Conditions of employment
Employment: 1,0 fte

Additional conditions of employment
The total duration of the contract is 3.5 years. The PhD students will receive an initial contract for 18 months with possible extension by 2 years.
The starting gross salary is €2,042 per month based on full-time employment.

The short-listed applicants will be interviewed in September 2009


Other Information
Please include:
- a copy of your university degree (in English or Dutch)
- a list of all your university marks (in English or Dutch)
- a motivation letter with details of research interests/experience, programming skills and knowledge of Linguistics and Psycholinguistics.

Additional Information
Prof. Roeland van Hout (r.vanhout@let.ru.nl)
Dr. Catia Cucchiarini (c.cucchiarini@let.ru.nl)
Dr. Helmer Strik (h.strik@let.ru.nl)

Application
You can apply for the job (mention the vacancy number 23.24.09) before 1 September 2009 by sending your application -preferably by email- to:

RU Nijmegen, Faculty of Arts, Personnel Department
P.O. Box 9103, 6500 HD, Nijmegen, The Netherlands
E-mail: vacatures@let.ru.nl

 

Back to Top

6-9 . (2009-07-24) Acoustic signal detection engineer Oregon State University

Acoustic Signal Detection Engineer

OSU’s Cooperative Institute for Marine Resources Studies offers one year of support, with the possibility of additional support, for a researcher on a project studying passive acoustic monitoring of large whales under the direction of Dr. David Mellinger. This is a full-time (1.0 FTE), 12-month-per-year, fixed-term Faculty Research Assistant position. Individuals with a Ph.D. may be appointed as a Research Associate (Postdoc). To see full details and to apply, please see http://oregonstate.edu/jobs, then search for posting #0004462 (the leading zeros are required). For questions, please email Jessica.Waddell@oregonstate.edu or David.Mellinger@oregonstate.edu. For full consideration, apply by September 21, 2009. OSU is an Affirmative Action/Office of Equal Opportunity employer. 

Back to Top

6-10 . (2009-08-06) Post graduate Research positions at Marcs, Australia

MARCS Auditory Laboratories currently has 3 Postgraduate Research Awards available, offering a competitive tax free living allowance of $30,427 per annum and a funded place in the doctoral program.

 

Projects Available:

 

Thinking Head - Performance

Supervisor:  Dr Garth Paine (ga.paine@uws.edu.au)

 

Thinking Head and Head-User Evaluation

Supervisor: Assoc Prof Kate Stevens (kj.stevens@uws.edu.au)

 

Sonification of Real-Time Data: Computational and Cognitive Approaches

Supervisor: Professor Roger Dean (roger.dean@uws.edu.au)

 

Thinking Head—Human-Human and Human-Head Interaction

Supervisor: Professor Chris Davis (chris.davis@uws.edu.au)

 

Learning Complex Temporal and Rhythmic Relations

Supervisor: Assoc Prof Kate Stevens (kj.stevens@uws.edu.au)

 

Tuning in to Native Speech and Perceiving Spoken Words

Supervisor: Professor Catherine Best (c.best@uws.edu.au)

 

Applications close 21 August 2009.  For further information visit the scholarship website –  www.uws.edu.au/research/scholarships.

 

MARCS Website - http://marcs.uws.edu.au/            Thinking Head Website - http://thinkinghead.edu.au//

 
 
Back to Top

6-11 . (2009-08-06) PhD position is available at the Queensland University of Technology, Brisbane, Australia.

PHD OPPORTUNITY:

A full-time 3 year PhD position is available at the Queensland University of Technology, Brisbane, Australia.

The position is within the Speech and Audio Research Lab, part of Smart Systems Theme of the Faculty of Built Environment and Engineering, and the Information Securities Institute.  The lab conducts world class research and  postgraduate training in a variety of speech and audio processing areas (speaker recognition, diarisation, speech detection, speech enhancement, multi-microphone speech technology, automatic language identification, keyword spotting)

Project title:
Speaker Diarisation
Starting date:
November/December 2009
Research fields:  Speech and audio processing, pattern recognition, bayesian theory, machine learning, biometrics and security.
Project Description:  
Large volumes of spoken audio are being recorded on a daily basis and audio archives of these recordings around the world are expanding rapidly. It is becoming increasingly important to be able to efficiently and automatically search, index and access information from these audio information sources. Speaker diarisation is an important, fundamental task in this process which aims to annotate the audio stream with speaker identities for each temporal region—determining “
who spoke when.”

Current diarisation systems are susceptible to a number of impediments including wide variability in the acoustic characteristics of recordings from different sources, differences in the number of speakers present in a recording, the dominance of speakers, and the style and structure of the speech.  All can affect the diarisation performance dramatically.

The aim of this research is to develop a framework and methods for better exploiting the sources of
prior information that are generally available in many applications with the view to making portable, robust speaker diarisation systems a reality.  Examples of relevant prior information include identities of participating speakers, models describing the characteristics of speakers in general or of a specific known speaker, models of the effects that recording conditions and domains have on acoustic features and knowledge of the recording domain.

Information for Applicants:
Applicants should hold a strong university degree which would entitle them to embark on a doctorate (Masters/diploma or equivalent) in a relevant discipline (computer science, mathematics, computer systems engineering etc). International students are encouraged to apply. The project is part of a ARC linkage between QUT, a commercial partner, and two  partner speech/audio processing laboratories at European universities.  Opportunities for exchange/internships at these partner institutions exist.  The opportunity also exists for cotutelle PhD with the French partner university.  

Information on Brisbane and Queensland University of Technology can be found at www.qut.edu.au

The salary of the PhD position is provided as a Linkage APAI scholarship ($26,669 in 2009, indexed annually) + top-up scholarship (approx $5,000pa). The salary is tax-exempt.   
Funding is also available for conference/internship travel.

Interested students are encouraged to contact both the project leader Prof. Sridha Sridharan  (s.sridharan@qut.edu.au), and Dr Brendan Baker (bj.baker@qut.edu.au).  

Applicants are asked to provide:
- cover letter describing your interest in the project
- curriculum vitae indicating degrees obtained, disciplines covered (list of courses), publications, and other relevant experience.  
- sample of written work (research papers in English) is also desirable.  
- references along with contact details

As the start date is later this year, potential
applicants are encouraged to contact the project coordinators as soon as possible to register their interest.

Deadlines for applications: 10 September 2009. (international applicants should apply as soon as possible)

Back to Top

6-12 . (2009-08-26) Ph Positions at the University of Bielefeld Germany

PhD Positions

The Applied Informatics Group, Faculty of Technology, Bielefeld University is looking for PhD candidates for grants and project positions in the following areas:

* Dialog modeling for human-robot interaction

* Speech signal modelling and analysis for speech recognition and synthesis

* Modeling and combining bottom-up with top-down attentional-processes

We invite applications from motivated young scientists with a background in computer science, linguistics, psychology, robotics, mathematics, cognitive science or similar areas, that are willing to contribute to the cross-disciplinary research agenda of our research group. Research and development are directed towards understanding the processes and functional constituents of cognitive interaction, and establishing cognitive interfaces and robots that facilitate the use of complex technical systems. Bielefeld University provides a unique environment for research in cognitive and intelligent systems by bringing together researchers from all over the world in a variety of relevant disciplines under the roof of central institutions such as the Excellence Center of Cognitive Interaction Technology (CITEC) or the Research Institute for Cognition and Robotics (CoR-Lab).

Successful candidates should hold an academic degree (MSc/Diploma) in a related discipline and have a strong interest in research and social robotics.

All applications should include: a short cover letter indicating the motivation and research interests of the candidate, a CV including a list of publications, and relevant certificates of academic qualification.

Bielefeld University is an equal opportunity employer. Women are especially encouraged to apply and in the case of comparable competences and qualification, will be given preference. Bielefeld University explicitly encourages disabled people to apply. Bielefeld University offers a family friendly environment and special arrangements for child care and double carrier opportunities.

Please send your application with reference to one of the three offered research areas no later than 15.9.2009 to Ms Susanne Hoeke (shoeke@techfak.uni-bielefeld.de).

Contact:

Susanne Hoeke

AG Applied Informatics

Faculty of Technology

Universitaetsstr. 21-23

33615 Bielefeld

Germany

Email: shoeke@techfak.uni-bielefeld.de

Back to Top

6-13 . (2009-09-03) Post-doc au laboratoire d'informatique de Grenoble France (french)

Le laboratoire LIG propose un sujet de recherche
    pour un post-doctorant
    CDD de 12 mois
    Grenoble, campus
    année 2009-2010

Sujet de recherche
------------------
Apprentissage Parallèle pour l'Indexation Multimédia Sémantique
Mots-clés : Apprentissage, Parallélisme, Indexation Multimédia.

Contexte
--------
Le poste est proposé dans le contexte du projet APIMS (Apprentissage
Parallèle pour l'Indexation Multimédia Sémantique) soutenu par le pôle
MSTIC de l’Université Joseph Fourrier.

La quantité de documents image et vidéo numériques croît de manière
exponentielle depuis de nombreuses années et cette tendance devrait se
poursuivre encore longtemps grâce aux progrès technologiques dans ce
domaine. L’indexation par concepts des documents image et vidéo est une
nécessité pour gérer de manière efficace les masses de données
correspondantes. En effet, les mots-clés nécessaires pour la recherche
par le contenu n’y sont pas explicitement présents comme dans le cas
des documents textuels. La recherche à partir d’exemples ou à partir
de caractéristiques dites « de bas niveau » présente également de
sérieuses limitations : les exemples nécessaires ne sont généralement
pas disponibles et les caractéristiques de bas niveau ne sont pas
aisément manipulables et interprétables par un utilisateur. Par ailleurs,
une similarité au niveau de ces caractéristiques ne correspond pas
forcément à une similarité au niveau sémantique. L’indexation par
concepts est un grand challenge en raison du « fossé sémantique »
séparant le contenu brut de ces documents (pixels, échantillons audio)
et les concepts qui on un sens pour un utilisateur.

Des progrès importants ont été accomplis ces dernières années, notamment
dans le cadre des campagnes d’évaluation TRECVID [1]. Ces campagnes
annuelles organisées par le National Institute of Standards and
Technologies (NIST) américain fournissent des données en quantité
importante, des tâches bien définies, des « vérités terrain », des
métriques et des outils d’évaluation associés. Elles contribuent
largement à fédérer les recherches dans le domaine de l’indexation
et de la recherche par le contenu des documents vidéo.
Les méthodes fonctionnant le mieux actuellement sont des méthodes
statistiques fonctionnant par apprentissage supervisé à partir
d’exemples annotés manuellement. Des caractéristiques dites de bas
niveau sont extraites à partir du signal audio ou image brut (des
histogrammes de couleur ou des transformées de Gabor par exemple) et
sont ensuite envoyées à des classifieurs qui sont entraînés à partir
d’exemples positifs et négatifs des concepts à reconnaître. Pour
obtenir de bons résultats, il est nécessaire de multiplier les
caractéristiques utilisées et de les combiner en utilisant des
techniques de fusion appropriées. Un gain supplémentaire est obtenu
en utilisant les relations entre les concepts comme les relations
statistiques (cooccurrences) ou logiques (générique-spécifique par
exemple).

Les principes généraux étant les mêmes, les différences entre les
approches concernent les choix sur les caractéristiques, sur les
outils de classification et/ou de fusion, et sur la façon de prendre
en compte le contexte. La qualité et la quantité des exemples positifs
et négatifs utilisés fait également une différence importante. L’état
de l’art actuel est l’extraction conjointe de plusieurs centaines de
concepts définis dans l’ontologie LSCOM [2]. Cependant, malgré les
efforts très importants fournis par un grand nombre d’équipes (plus
de 40 équipes ont participé à la tâche d’extraction de concepts dans
les plans vidéo lors de la campagne TRECVID 2008), la précision
moyenne des meilleurs systèmes ne dépasse pas 20%.

L’équipe MRIM du LIG a développé des méthodes et des outils pour
l’extraction automatique de concepts dans les plans vidéo et a
obtenu des résultats un peu supérieurs à la moyenne dans les
campagnes TRECVID 2005 à 2007 [3]. L’objectif de ce projet est
d’améliorer de manière importante ces méthodes et de leur faire
rejoindre voire définir l’état de l’art dans le domaine. Pour
cela, il faut d’une part les optimiser en prenant en compte tous
les facteurs importants et de leur ajouter un certain nombre
d’innovations comme l’utilisation de concepts de niveau
intermédiaire, la combinaison de méthodes génériques et
spécifiques, et l’apprentissage actif pour l’amélioration de
la quantité et qualité de l’annotation servant à l’entraînement
des systèmes.

Un des facteurs limitant est la puissance de calcul nécessaire.
Il faut en effet entraîner et évaluer les systèmes sur plusieurs
centaines de concepts et sur plusieurs dizaines de milliers
d’images ou de plans vidéo. Il faut en outre faire cela en
étudiant de multiples combinaisons de caractéristiques de bas
et moyen niveau, de méthodes de classification et de méthodes
de fusion. Nous envisageons pour cela d’utiliser les ressources
du projet GRID 5000 [4] afin de pouvoir étudier à grande échelle
l’influence combinée de ces différents facteurs. Dans sa version
simple, le problème se parallélise assez facilement (on peut faire
faire l’apprentissage et l’évaluation d’un concept sur un
processeur) mais lorsqu’on veut utiliser le contexte, c'est-à-dire
les relations statistiques ou ontologiques des concepts entre eux,
il y a lieu de faire coopérer les différents processus entre eux
et cela devient un réel problème de programmation parallèle.
L’équipe MESCAL du LIG dispose d’une grande expertise dans ce
domaine et participera à l’étude et à la mise en œuvre des versions
parallèles des méthodes d’extraction de concepts.

L’utilisation de la multi modalité naturellement présente dans
les documents vidéo est également essentielle pour la performance
des systèmes d’indexation par concepts. L’équipe GETALP du LIG
dispose de compétences dans le domaine du traitement du signal
audio et de parole et participera à la définition et à
l’optimisation des caractéristiques de bas et moyen niveau pour
l’indexation des concepts à partir de la piste audio. De même,
l’équipe GPIG de GIPSA-Lab dispose de compétences dans l’analyse
et l’indexation du mouvement dans les documents vidéo et participera
à la définition et à l’optimisation des caractéristiques de bas et
moyen niveau pour l’indexation des concepts à partir du mouvement
dans la piste image.

Références
[1] Smeaton, A. F., Over, P., and Kraaij, W. TRECVID: evaluating
    the effectiveness of information retrieval tasks on digital video.
    In Proceedings of the 12th Annual ACM international Conference on
    Multimedia, New York, NY, USA, October 10-16, 2004.
[2] M. Naphade, J.R. Smith, J. Tesic, S.-F. Chang, W. Hsu,
    L. Kennedy, A. Hauptmann and J. Curtis, Large-Scale Concept
    Ontology for Multimedia, IEEE Multimedia 13(3), pp. 86-91, 2006.
[3] Stéphane Ayache, Georges Quénot and Jérôme Gensel, CLIPS-LSR
    Experiments at TRECVID 2006, TRECVID’2006 Workshop, Gaithersburg,
    MD, USA, November 13-14, 2006.
[4] Bolze, R. et al, Grid'5000: a large scale and highly reconfigurable
    experimental Grid testbed International Journal of High Performance
    Computing Applications, 20(4), pp 481-494, 2006.

Description du poste
--------------------
La première partie du travail consistera à mettre en œuvre des
versions parallèles des méthodes de classification développées dans
l’équipe MRIM et à utiliser ces versions parallèles pour optimiser
conjointement les différents éléments (jeux de caractéristiques,
opérateurs de classification et opérateurs de fusion) intervenant
dans celles-ci. Cette optimisation devra être faire de manière aussi
systématique que possible. Compte tenu de l’aspect hautement
combinatoire et du coût de calcul (même sur une architecture
parallèle) de celle-ci, des méthodes heuristiques appropriées devront
être étudiées et mises en œuvre afin d’obtenir le meilleur résultat
dans un temps donné.

Dans une deuxième partie, il faudra mettre en œuvre des approches
intégrées pour la reconnaissance simultanée de plusieurs centaines
de concepts en prenant en compte dès les premiers niveaux de
l’apprentissage les corrélations existant entre ceux-ci.

Ces travaux seront, dans la mesure du possible, planifiés en fonction
des évaluations TRECVID sur la détection de concepts dans les plans
vidéo. Les expérimentations on lieu en général pendant l’été
(juillet-août) et les campagnes s’étendent de février à novembre de
l’année en cours. L’objectif est de pouvoir évaluer lors des campagnes
2009 et 2010 ce qu’il est prévu de développer dans la première et la
deuxième partie décrites ci-dessus.

Type de poste et localisation
-----------------------------
CDD de 12 mois au laboratoire LIG.

Intégration dans l’équipe MRIM du LIG (recherche en recherche
d’information multimédia et systèmes de recommandation) et
collaboration avec les équipes GETALP et MESCAL du LIG et
l’équipe GPIG du laboratoire GIPSA.

Localisation : Grenoble, campus de Saint Martin d’Hères.
Salaire : 2 000 euros nets / mois environ.

Formation et compétences nécessaires
------------------------------------
Profil demandé
o Expérience importante et compétences reconnues en conception –
  développement de logiciels.
o Connaissances et expérience significative en langage C ou C++.
o Thèse dans l’un des domaines suivants : programmation parallèle,
  systèmes de recherche d’information, apprentissage automatique,
  traitement statistique des données, traitement d’images.

Compétences complémentaires intéressantes pour le poste
o Expérience dans l’optimisation des performances des algorithmes.
o Expérience du travail en équipe.

Date limite de candidature
--------------------------
Les candidatures peuvent être déposée jusqu’au 30 septembre 2009.
Le poste est à pourvoir début novembre ou décembre 2009 au plus tard.
Dès qu'une candidature sera retenue, le poste sera affecté.

Contact
-------
Georges Quénot – Laboratoire LIG – Equipe MRIM – http://mrim.imag.fr/georges.quenot
Adresse : Bâtiment B – 385 avenue de la Bibliothèque – 38400 Saint Martin d’Hères
E-mail : Georges.Quenot@imag.fr – Tél : 04 76 63 58 55

Back to Top

6-14 . (2009-09-23)Position of Professor in Phonology- GIPSA, Grenoble,France

Profil du poste PR0035 "Phonologie - phonétique générale et expérimentale" qui sera mis au concours au printemps 2010.

*Phonologie - phonétique générale et expérimentale*
• *Enseignement :*
• filières de formation concernées :
Parcours LMD de la filière de Sciences du Langage
• objectifs pédagogiques et besoin d’encadrement :
Le professeur recruté devra assurer des enseignements de phonétique et phonologie proposés par l’UFR des Sciences du Langage :
– dans le cursus de Licence : développement de la parole, phonétique articulatoire et acoustique, prosodie, phonologies linéaires et multilinéaires, phonétique expérimentale ;
– dans le Master Sciences du Langage, spécialité « Linguistique, sociolinguistique et acquisition du langage », orientation Recherche, en particulier les cours de phonologie.
Le professeur recruté devra prendre en charge ces enseignements en intégrant les apports des travaux sur les approches phonologiques (géométrie des traits, théorie des éléments, phonologie de laboratoire, phonologie prosodique, tonologie, phonologie cognitive, théorie de l’optimalité, etc.) et sur la diversité des réalisations sonores des langues du monde, en particulier les langues à tradition orale.
Le professeur devra s’impliquer dans la direction et l’encadrement de mémoires de master recherche ainsi que dans la formation des jeunes chercheurs en phonologie, typologie et linguistique de terrain (recueil de données, corpus, etc.), phonétique expérimentale, phonétique générale.
*• Recherche :*
La recherche s’effectuera au sein du département Parole et Cognition (DPC) de l’UMR 5216 GIPSA-lab. Le DPC mène des recherches multidisciplinaires sur la parole et le langage en s’appuyant en particulier sur quatre domaines de compétence: traitement du signal, physique, cognition et sciences du langage. Dans notre approche de l’analyse des langues naturelles, les aspects phonétiques, phonologiques, lexico-sémantiques et prosodiques, incluant les champs de la perspective temporelle (étude diachronique) et spatiale (géolinguistique), sont plus particulièrement l’objet d’étude de l’équipe Systèmes Linguistiques et Dialectologie du DPC.
Les axes de cette équipe concernent l’émergence des phénomènes linguistiques – et particulièrement dans des situations de contacts de langues –, la description et la documentation des systèmes linguistiques à tradition orale, l’étude de la prosodie et des fonctions communicatives. Ces recherches s’inscrivent dans le contrat quadriennal 2011-2014 du laboratoire et font l’objet depuis plusieurs années de collaborations nationales et internationales (notamment avec l’Amérique Latine) soutenues, entre autres, par l’ANR et la Communauté Européenne.
Les thématiques de recherche du Professeur recruté devront s’inscrire dans ces axes de recherche en explorant plus particulièrement la variabilité et les dynamiques temporelles et spatiales des systèmes linguistiques aux niveaux segmental et suprasegmental. Le Professeur s’appuiera sur les acquis et expériences de la linguistique de terrain, de la linguistique de corpus, de la phonétique expérimentale et de la phonologie de laboratoire. Il pourra être amené à proposer des
éléments de modélisation cognitive et computationnelle. Il est attendu que la personne recrutée ait une expérience confirmée dans l’animation d’équipe et s’investisse activement dans la mise en place de projets collectifs nationaux et internationaux. Il devra bien évidemment travailler dans une dynamique de collaboration inter-équipes au sein du GIPSA-lab.
*• Laboratoire d’accueil :*
Laboratoire Grenoble, Image, Parole, Signal, Automatique GIPSA-lab UMR 5216, site : http://www.gipsa-lab.inpg.fr/
*• Contacts :*
– Enseignement :
o Marinette.Matthey@u-grenoble3.fr pour le niveau M
o Francoise.Boch@u-grenoble3.fr pour le niveau L
– Recherche :
o Gerard.Bailly@gipsa-lab.grenoble-inp.fr

Back to Top

6-15 . (2009-09-28) Researcher in expressive speech synthesis DFKI, Kaiserslautern, Germany

Job offer: Researcher in expressive speech synthesis

The German Research Center for Artificial Intelligence (DFKI GmbH), with sites in Kaiserslautern, Saarbrücken, Bremen and Berlin is the leading German research institute in the field of innovative software technology.

DFKI's Language Technology Lab is looking for a Researcher to work in either Saarbrücken or Berlin in the DFG-funded project PAVOQUE ( PArametrisation of prosody and VOice QUality for concatenative speech synthesis in view of Emotion expression). The contract should start on 1 November 2009 or 1 December 2009, and is limited to the project duration of one year.

Main Tasks

  • Develop and extend speech synthesis technologies in the speech synthesis system MARY TTS, in view of the realisation of prosody and voice quality modifications in unit selection;

  • Develop and apply algorithms to annotate prosody and voice quality in expressive speech synthesis corpora;

  • Carry out a listener evaluation study of expressive synthetic speech.

Profile

The ideal candidate holds a PhD, or is close to finishing a PhD, in a relevant topic area such as speech signal processing or computer science. The candidate must have demonstrable experience with programming algorithms of unit selection synthesis, speech signal processing and/or voice conversion, and should have experience with Java programming. Knowledge in the area of planning, carrying out and evaluating perception tests would be a plus. Highly valued personal qualities include creativity, open-mindedness, team spirit, and a willingness to address novel challenges. Fluency in English as a working language is required.

For more information about MARY TTS, see http://mary.dfki.de.

Contact for Questions

Dr. Marc Schröder
Tel. +49-681-302-5303
http://dfki.de/~schroed
marc.schroeder@dfki.de

Closing date: 20 October 2009

Please send your electronic application with all usual documents to: lt-jobs@dfki.de

  

Back to Top

6-16 . (2009-10-01) PhD at the National Center for Biometric Studies- Univ. Canberra Australia

The Faculty of Information Sciences and Engineering of the University of Canberra is offering a top-up stipend of $7,000 per annum for a student undertaking a PhD thesis in the National Centre for Biometric Studies. The project is related to the Thinking Head Project (www.canberra.edu.au/faculties/ise/ncbs/thinkinghead) and will be in one of the following research areas:

*             Automatic Speech Recognition (ASR) or Audio-Video Speech Recognition (AVSR)

*             Speaker Recognition / Verification / Authentication

*             Face Recognition or Facial Feature Tracking

*             Speaker Characterisation or Facial Expression Recognition

*             Affective Computing / Affective Sensing

*             Multimodal Human-Computer Interaction (MM-HCI)

*             Pattern Recognition / Multimodal Fusion Algorithms

The stipend is available for up to 3 years either to an Australian-resident student having gained an APA place or to an international student having gained a scholarship for international students in the Faculty of ISE commencing in 2010 (http://www.canberra.edu.au/research-students/scholarships/). Further information from Prof. Michael Wagner (michael.wagner@canberra.edu.au) or Dr Roland Goecke (roland.goecke@canberra.edu.au).

Back to Top

6-17 . (2009-10-06) ASSISTANT PROFESSOR, Department of Linguistics, University of Washington.

ASSISTANT PROFESSOR, Department of Linguistics, University of Washington. A tenure-track appointment is intended in the area of computational linguistics beginning September 2010 associated with the professional MA program and PhD track in Computational Linguistics.

University of Washington faculty engage in teaching, research and service; the successful applicant will teach graduate and undergraduate courses, supervise student research, and develop a high-impact research program. This position is full-time (100% FTE), with a 9-month service period. Applicants should have a Ph.D. degree in Linguistics, Computer Science, or related field and be highly qualified for undergraduate and graduate teaching and independent research. All qualified candidates are encouraged to apply. 

However, we are particularly interested in scholars active in the areas of machine learning, speech technology, computational semantics and dialogue systems.   The ideal candidate will complement and build on existing strengths within the department, and will be eager to interact with students and faculty from the broader linguistics and language processing community at the University of Washington.

The University of Washington is an affirmative action, equal opportunity employer. The University is building a culturally diverse faculty and staff and strongly encourages applications from women, minorities, individuals with disabilities and covered veterans. The University of Washington, a recipient of the 2006 Alfred P. Sloan award for Faculty Career Flexibility, is committed to supporting the work-life balance of its faculty.

Applications, including a curriculum vitae, statement of research and teaching interests, and three letters of recommendation, should be sent to Prof. Emily M. Bender, Chair, Computational Linguistics Search Committee, Department of Linguistics, University of Washington, Box 354340, Seattle, WA 98195-4340.  Questions regarding the position can be directed to ebender -at- uw.edu. Priority will be given to applications received before November 20, 2009.  Please include your email address.

Back to Top

6-18 . (2009-10-06) Information Technology Support Unit (ITS) Unit at the Directorate General for Translation at the European Parliament is offering a paid 5-month traineeship programme (Schuman Traineeships - general option) in the areas of Language Technology Research and Development and Communication.

The Information Technology Support Unit (ITS) Unit at the Directorate General for Translation at the European Parliament is offering a paid 5-month traineeship programme (Schuman Traineeships - general option) in the areas of Language Technology Research and Development and Communication.
 
Entity and location: Information Technology Support (ITS) Unit of the Directorate General for Translation at the European Parliament in Luxembourg.
 
Requirements:
 
The ITS Unit is looking for candidates for a traineeship in 2 different teams:
 
1) Research and Development Team 
 
a) Computanional Linguist or Research Engineer with an interest in NLP and translation technologies.
b) Translation or Communication Graduate with an interest in translation technologies
 
2) Communication Team
 
a) Graduates with a degree in IT, Web publishing or equivalent
b) Graduates with a degree in Journalism, Communication or equivalent.
 
Detailed profile descriptions (4) are attached to this e-mail.
 
General requirements:
 
1) Be a national of a Member State of the European Union or of an applicant country (derogation possible).
2) Have a thorough knowledge of one of the official languages of the European Union and a good knowledge of a second (English or French)
3) Not have been awarded any other paid traineeship, or have been in paid employment for more than 4 consecutive weeks, with a European Institution or a Member or political group of the European Parliament.
4) Have obtained, before the deadline for applications, a university degree after a course of study of at least three years’ duration;
5) Submit a written reference from a university lecturer or from a professional person who is able to give an objective assessment of the applicant’s aptitudes.
6) have produced a substantial written paper, as part of the requirements for a university degree or for a scientific journal
 
You can find all rules governing traineeships on:   http://www.europarl.europa.eu/pdf/traineeships/general_rules_en.pdf
 
Application procedure:
 
Applications for traineeships starting on 1 March 2010 are accepted until 15 October 2009 (midnight).
 
To find more information, conditions for admission and online application form on this page:
 
 
To indicate in the application form that you would be interested in a traineeship in ITS, please fill in:
 
a) in point 6 "Other" of the application form in subpoint "Aim of traineeship" in "other" that you are interested in a traineeship at the Information Technology Support Unit at Directorate General for Translation
b) in point 6 "Areas of interest" select appropriate areas of interest (e.g. information technology, engineering/technology, multimedia, communications)
c) in point 6 "Department - preference": Directorate General for Translation
 The Information Technology Support Unit (ITS) Unit at the Directorate General for Translation at the European Parliament is offering a paid 5-month traineeship programme (Schuman Traineeships - general option) in the areas of Language Technology Research and Development and Communication.
 
Entity and location: Information Technology Support (ITS) Unit of the Directorate General for Translation at the European Parliament in Luxembourg.
 
Requirements:
 
The ITS Unit is looking for candidates for a traineeship in 2 different teams:
 
1) Research and Development Team 
 
a) Computanional Linguist or Research Engineer with an interest in NLP and translation technologies.
b) Translation or Communication Graduate with an interest in translation technologies
 
2) Communication Team
 
a) Graduates with a degree in IT, Web publishing or equivalent
b) Graduates with a degree in Journalism, Communication or equivalent.
 
Detailed profile descriptions (4) are attached to this e-mail.
 
General requirements:
 
1) Be a national of a Member State of the European Union or of an applicant country (derogation possible).
2) Have a thorough knowledge of one of the official languages of the European Union and a good knowledge of a second (English or French)
3) Not have been awarded any other paid traineeship, or have been in paid employment for more than 4 consecutive weeks, with a European Institution or a Member or political group of the European Parliament.
4) Have obtained, before the deadline for applications, a university degree after a course of study of at least three years’ duration;
5) Submit a written reference from a university lecturer or from a professional person who is able to give an objective assessment of the applicant’s aptitudes.
6) have produced a substantial written paper, as part of the requirements for a university degree or for a scientific journal
 
You can find all rules governing traineeships on:   http://www.europarl.europa.eu/pdf/traineeships/general_rules_en.pdf
 
Application procedure:
 
Applications for traineeships starting on 1 March 2010 are accepted until 15 October 2009 (midnight).
 
To find more information, conditions for admission and online application form on this page:
 
 
To indicate in the application form that you would be interested in a traineeship in ITS, please fill in:
 
a) in point 6 "Other" of the application form in subpoint "Aim of traineeship" in "other" that you are interested in a traineeship at the Information Technology Support Unit at Directorate General for Translation
b) in point 6 "Areas of interest" select appropriate areas of interest (e.g. information technology, engineering/technology, multimedia, communications)
c) in point 6 "Department - preference": Directorate General for Translation
 

Back to Top

6-19 . (2009-10-07) Post-Docs at HLY Center of Excellence, Johns Hopkins University

Johns Hopkins University
Human Language Technology Center of Excellence
Post-Docs, Research Staff, Sabbaticals


The National Human Language Technology Center of Excellence (COE) at Johns Hopkins University is seeking to hire a few outstanding junior and senior researchers in the field of speech and natural language processing. Positions include research staff, sabbaticals and post-docs. The COE, located by Johns Hopkins's main campus in Baltimore, Maryland, conducts long-term research on fundamental challenges that are critical for real-world problems.

Candidates should have a strong background in one of the following areas:

SPEECH PROCESSING:
Robust speech recognition and information extraction (multiple languages, genres, and channels, limited resources)

NATURAL LANGUAGE PROCESSING:
Information extraction, knowledge distillation, machine translation, etc.

MACHINE LEARNING:
Large-scale learning, transfer-learning, semi-supervised, data-mining, etc.

Applicants must have a Ph.D. in CS, ECE, or a related field. Directions for applications can be found at: http://www.hltcoe.org/opportunities.html

Applications for postdoctoral and junior research scientist positions should apply by January 4, 2010 for full consideration.

Note: Although researchers are expected to publish in open peer-reviewed venues, the position requires a security clearance. Security clearances require U.S. citizenship; the COE will seek a clearance for researchers without an existing clearance.

 

Back to Top

6-20 . (2009-10-08) Post-doc position in speech recognition/modeling at TTI-Chicago

### Post-doc position in speech recognition/modeling at TTI-Chicago ###

A post-doc position is available at TTI-Chicago. It includes opportunities for work on articulatory modeling, graphical models, discriminative learning, large-scale data analysis, and multi-modal (e.g. audio-visual) modeling.

The post-doc will be mainly working with Karen Livescu, and will interact with collaborators Jeff Bilmes (U. Washington), Eric Fosler-Lussier (Ohio State U.), and Mark Hasegawa-Johnson (U. Illinois at Urbana-Champaign).

To apply, or for additional information, please contact Karen Livescu at klivescu@uchicago.edu. There is also an opportunity for a shorter-term post-doc project on annotation of speech at the articulatory level. Please contact klivescu@uchicago.edu for more details.

Back to Top

6-21 . (2009-10-19) Opened positions/internships at Microsoft: German Linguists (M/F)

Opened positions/internships at Microsoft: German Linguists (M/F)

MLDC – Microsoft Language Development Center, a branch of the Microsoft Product Group that develops Speech Recognition and Synthesis Technologies, situated in Porto Salvo, Portugal (Opened positions/internships at Microsoft: German Linguists (M/F)), is seeking a part-time or full-time temporary language expert in the German language, for a 2 month contract, renewable, to work in language technology related development projects. The successful candidate should have the following requirements:

·         Be native or near native German speaker

·         Have a university degree in Linguistics (with good computational skills) or Computational Linguistics (Master’s or PhD)

·         Have an advanced level of English (oral and written)

·         Have some experience in working with Speech Technology/Natural Language Processing/Linguistics, either in academia or in industry

·         Have some computational ability – being able to run tools, being comfortable to work with Microsoft Office tools and having some programming fundamentals, though no programming is required

·         Have team work skills

·         Willing to work in Porto Salvo (near Lisbon) for the duration of the contract

·         Willing to work in a multicultural and multinational team across the globe

·         Willing to start immediately

To apply, please submit your resume and a brief statement describing your experience and abilities to Daniela Braga: i-dbraga@microsoft.com

We will only consider electronic submissions.

Deadline for submissions: open until filled.

 

 

Bruno Reis Bechtlufft| MLDC Trainee

Back to Top

6-22 . (2009-10-26) CFP: JHU Summer Workshop on Language Engineering



CFP: JHU Summer Workshop on Language Engineering
 
16th Annual JHU Summer Workshop
CALL FOR TEAM RESEARCH PROPOSALS
Deadline: Wednesday, November 18, 2009.
 
http://www.clsp.jhu.edu/workshops/ws10/CFP
 
The Center for Language and Speech Processing at Johns Hopkins
University invites one-page research proposals for a
Summer Workshop on Language Engineering, to be held
in Baltimore, MD, USA, June 21 to July 30, 2010.
 
Proposals should be suitable for a six-week team exploration, and
should aim to advance the state of the art in any of the various
fields of Human Language Technology (HLT).  This year, proposals in
related areas of Machine Intelligence that share techniques with
HLT, such as Computer Vision (CV), are also strongly solicited.
 
Proposals are welcome on any topic of interest to HLT, CV and
technically related areas.  For example, proposals may address
novel topics or long-standing problems in one of the following
areas.
 
* SPEECH TECHNOLOGY:  Proposals are welcomed that address any
  aspect of information extraction from speech signal (message,
  speaker identity, language,...). Of particular interest are
  proposals for techniques whose performance would be minimally
  degraded by input signal variations, or which require minimal
  amounts of training data.
 
* NATURAL LANGUAGE PROCESSING: Proposals for knowledge discovery
  from text are encouraged, as are proposals in traditional
  fields such as parsing, machine translation, information
  extraction, sentiment analysis, summarization, and question
  answering.  Proposals may aim to improve the accuracy or enrich
  the output of such systems, or extend their reach by improving
  their speed, scalability, and coverage of languages and genres.
 
* VISUAL SCENE INTERPRETATION: New strategies are needed to
  parse visual scenes or generic (novel) objects, analyzing an
  image as a set of spatially related components.  Such strategies
  may integrate global top-down knowledge of scene structure (e.g.,
  generative models) with the kind of rich bottom-up, learned
  image features that have recently become popular for object
  detection.  They will support both learning and efficient search
  for the best analysis.
 
* UNSUPERVISED AND SEMI-SUPERVISED LEARNING: Novel techniques
  that do not require extensive quantities of human annotated data
  to address any of the challenges above could potentially make
  large strides in machine performance as well as lead to greater
  robustness to changes in input conditions.  Semi-supervised and
  unsupervised learning techniques with applications to HLT and CV
  are therefore of considerable interest.
 
Research topics selected for investigation by teams in
past workshops may serve as good examples for your proposal
(http://www.clsp.jhu.edu/workshops).
 
An independent panel of experts will screen all received proposals
for suitability. Results of this screening will be communicated
no later than November 20, 2009. Authors passing this initial
screening will be invited to Baltimore to present their ideas
to a peer-review panel on December 4-6, 2009.  It is expected
that the proposals will be revised at this meeting to address any
outstanding concerns or new ideas. Two or three research topics and
the teams to tackle them will be selected for the 2010 workshop.
 
We attempt to bring the best researchers to the workshop
to collaboratively pursue the selected topics for six weeks.
Authors of successful proposals typically become the team leaders.
Each topic brings together a diverse team of researchers and
students.  The senior participants come from academia, industry
and government.  Graduate student participants familiar with
the field are selected in accordance with their demonstrated
performance. Undergraduate participants, selected through a
national search, are rising seniors: new to the field and showing
outstanding academic promise.
 
If you are interested in participating in the 2010 Summer
Workshop we ask that you submit a one-page research proposal for
consideration, detailing the problem to be addressed.  If your
proposal passes the initial screening, we will invite you to join
us for the December 4-6 meeting in Baltimore (as our guest) for
further discussions aimed at consensus.  If a topic in your area
of interest is chosen as one of the two or three to be pursued
next summer, we expect you to be available for participation
in the six-week workshop. We are not asking for an ironclad
commitment at this juncture, just a good faith understanding
that if a project in your area of interest is chosen, you will
actively pursue it.  We in turn will make a good faith effort to
accommodate any personal/logistical needs to make your six-week
participation possible.
 
Proposals should be submitted via e-mail to clsp@jhu.edu by
4PM EST on Wed, November 18, 2009.
Back to Top

6-23 . (2009-10-08) Computational Linguist or Research engineer with an interest in NLP and translation technologies

Position:

Paid Traineeship - Computational Linguist or Research engineer with an interest in NLP and translation technologies

 

Entity:

ITS (Information Technology Support) - Research and Development Team

Directorate General for Translation,

European Parliament,

Luxembourg

 

Description of working environment

The Information Technology Support Unit (ITS DGTRAD) is the unit that provides technical and logistical support to Parliament’s translation units. ITS provides its users with standard IT support services by manning helpdesks, providing first-level user support, installing and trouble-shooting user configurations, running file, print and web servers, and providing second-level support. It caters specifically for translation needs by providing its users with a palette of tools - commercial (TWB), inter-institutional (IATE, Euramis) and in-house (Fuse, FullDoc etc) - and by integrating these tools into a coherent working environment and providing effective training and support in their use.
ITS promotes the sharing of information and the adoption of best practices amongst its users by providing a Translation Service Portal and publishing a newsletter. It also represents Parliament in a number of inter-institutional bodies concerned with technical questions related to translation.

Mission (tasks):

The Unit for IT support of the Translation DG of the European Parliament invites applications for a 5 months internship in its Research and Development team. Our current projects focus on developing and adapting language technology tools to assist the work of one of the largest translation services in the world.

As a member of our team you will have the possibility to work as a researcher and/or developer on one or more of the following topics:

  • Machine Translation
  • Indexing
  • Text Categorisation
  • Controlled language
  • Automatic Language Recognition
  • Multilingual and Crosslingual Information retrieval.

Depending on her/his field, the selected candidate will have to carry out one or more of the following tasks:

  • Research
  • Needs Analysis
  • Development
  • Documentation

Possibilities of combining your work here with a master/PhD thesis can be discussed.

Education:

The ideal candidate should have a very good Bachelor's degree in Computational Linguistics, Information Science or another related field and a strong interest in Natural Language Processing that can be proven by relevant research papers, university assignments or publications.

 

Technical knowledge and experience:

 

  • A strong background in at least one of the task areas mentioned above
  • Good programming skills in Java, Java for Web-applications and/or Visual Basic
  • SQL/Oracleand XML knowledge would be an asset
  • Statistical NLP

 

Languages:

Knowledge of at least one official EU language. English as a working language is mandatory. Any other EU language would be considered an asset.

 

Skills:

We are looking for highly motivated, communicative candidates who would like to work in a friendly, multinational and multilingual environment in the heart of Europe. Creativity and a strong interest in Language and Translation technologies will definitely be considered as major advantages.

 

 

Back to Top

6-24 . (2009-11-20) Post-doc LIMSI Paris

Le groupe "Traitement du Langage Parlé" du LIMSI/CNRS
(http://www.limsi.fr/tlp) recrute un post-doctorant pour participer au
projet ANR EDyLex.

Le projet EDyLex est un projet financé par l'ANR dans le cadre du
programme CONTINT (Contenus et interactions); il porte sur
l'acquisition dynamique de nouvelles entrées lexicales dans des
chaines d'analyse linguistiques (analyse syntaxique/sémantique ou
systèmes de transcription de la parole) : comment détecter et
qualifier un mot inconnu ou une entité nommée nouvelle dans un texte
ou dans un flux de parole ? Comment lui attribuer une phonétique, une
catégorie, des propriétés syntaxiques, une place dans un réseau
sémantique ?

Le travail concerné par cette annonce porte plus spécifiquement sur la
gestion des mots nouveaux ou inconnus, afin qu'ils puissent être
reconnus par un système de transcription de la parole. Ceci implique
la détection des mots inconnus, l'utilisation d'unités sous-lexicales,
la phonétisation avec variantes de mots nouveaux, l'adaptation des
modèles de langage. Ces méthodes seront validées en interne dans le
projet , et lors des futures campagnes nationales ou internationales
comme STD (Spoken Term Detection) organisée par le NIST
(http://www.itl.nist.gov/iad/mig//tests/std/index.html).

Le consortium de Edylex est composé de l'équipe Alpage
(INRIA-Univ. Paris 7), qui coordonne le projet, du Laboratoire
d'Informatique Fondamentale de Marseille (UMR du CNRS), de Syllabs et
Vecsys Research, deux laboratoires de recherche privés, la première
spécialisée dans le traitement du langage naturel, la deuxième dans le
traitement de la parole, de l'AFP qui fournit les corpus et qui
évaluera l'apport des techniques développées dans son système
d'information, et du LIMSI, dans ses deux composantes, écrit (groupe
ILES) et parole (groupe TLP).

Le projet a commencé le 1er novembre 2009 et durera 3 ans.

Les candidats devront savoir programmer dans un environnement Unix, et
être capable de parler et écrire en anglais et en français.
Ils devront avoir obtenu un doctorat dans l'un des domaines
suivants : traitement de la parole, traitement du langage naturel.

Être impliqué dans un projet de recherche dans le groupe TLP au LIMSI
offre une opportunité exceptionnelle de travailler sur des problèmes
de recherche variés, au sein d'une équipe de recherche de premier
plan, et d'être en contact avec les laboratoires académiques ou
industriels les plus importants dans le domaine.

La durée du contrat est de 1 an renouvelable.

Le travail se déroulera au LIMSI/CNRS qui se situe à Orsay, au sud de Paris.

Les candidatures devront être adressées, accompagnées d'un CV, à
Gilles ADDA (gadda@limsi.fr

Back to Top

6-25 . (2009-11-20) Post doctoral position at Eurecom-Multimedia Sophia Antipolis France

EURECOM
Post-doctoral Position

Title:      Automatic speech recognition and collaborative annotation
            for improving accessibility on video sharing websites by the
            visually and hearing impaired.
Department: Multimedia Communications
URL:        http://www.eurecom.fr/research/

Start Date: January 2010
Duration:   18 months

Description:
Eurecom's Multimedia Communications Department invites applications for a
full-time post-doctoral researcher to work on the ACAV (Annotation
Collaborative pour l'Accessibilité Vidéo) project funded as part of the "Web
Innovant" call.  The project aims to improve the accessibility of video on
popular content sharing websites by the visually and hearing impaired. The
first part of the project involves the development of an automatic speech
recognition system (preferably using freely available, open-source software),
to transcribe web-based video data.  The second part of the work entails the
design and implementation of a system to improve the accessibility of the
content to the visually and hearing impaired, based on semantic web and
collaborative annotation technologies. The position requires significant
experience in speech recognition research. Training on semantic web technologies and
ontology modelling will be provided. The project consortium is
composed of Dailymotion, LIRIS - University of Lyon 1 and EURECOM.

Requirements:
The successful candidate will have been awarded a PhD degree, involving
automatic speech recognition (preferably large vocabulary, continuous speech),
prior to their joining Eurecom. Expertise in enviromental robustness, speaker
adaptation and/or speaker diarization, i.e. speaker atributed speech-to-text,
is a bonus, as is any experience relevant to ontology modelling and semantic
web languages. You will have a strong research track record with significant
publications at leading international conferences and/or in journals.  You will
be highly motivated to undertake challenging, applied research and have good
English language speaking and writing skills. French language skills are a
bonus.

Applications:
Please send to the address below (i) a one page statement of your research
interests and motivation, (ii) your CV and (iii) two letters of reference no
later than 20th of December 2009.

Contact:        Dr. Raphaël Troncy AND Dr. Nicholas Evans
Postal Address: 2229 Route des Crêtes BP 193, F-06904 Sophia Antipolis
                cedex, France
Email address:  raphael.troncy@eurecom.fr AND nick.evans@eurecom.fr
Web address:    http://www.eurecom.fr/main/institute/job.en.htm
Phone:          +33/0 4 93 00 82 42 AND +33/0 4 93 00 81 14
Fax:            +33/0 4 93 00 82 00

Institut Eurécom is located in Sophia Antipolis, a vibrant science park on the
French Riviera. It is in close proximity with a large number of research units
of leading multi-national corporations in the telecommunications, semiconductor
and biotechnology sectors, as well as other outstanding research and teaching
institutions. A freethinking, multinational population and the unique
geographic location provide a quality of life without equal.

Back to Top

6-26 . (2009-11-24) Position at IRCAM Paris

Ircam recruits a Researcher W/M under limited-term contract of 18 months and full-time

From January 4th, 2010

Introduction to IRCAM

IRCAM is a leading non-profit organization associated to Centre Pompidou, dedicated to music production, R&D and education in acoustics and music. It hosts composers, researchers and students from many countries cooperating in contemporary music production, scientific and applied research. The main topics addressed in its R&D department include acoustics, audio signal processing, computer music, interaction technologies, musicology. Ircam is located in the center of Paris near the Centre Pompidou, at 1, Place Igor Stravinsky 75004 Paris.

Introduction to Quaero project

Quaero is a 200 M€ collaborative research and development program focusing on the areas of automatic extraction of information, analysis, classification and usage of digital multimedia content for professionals and consumers. The research work shall concentrate on managing virtually unlimited quantities of multimedia and multilingual information,  including text, speech, music, image and video. Five main application areas have been identified by the partners:

1.       multimedia internet search

2.       enhanced access services to audiovisual content on portals

3.       personalized video selection and distribution

4.       professional audiovisual asset management

5.       digitalization and enrichment of library content, audiovisual cultural heritage and scientific information.

The Quaero consortium was created to meet new multimedia content analysis requirements for consumers and professionals, faced with the explosion of accessible digital information and the proliferation of access means (PC,  TV, handheld devices). More information can be found at www.quaero.org/.

Role of Ircam in Quaero Project

In the Quaero project, Ircam is in charge of the coordination of audio/music indexing research and of development of music-audio indexing technology: music content-description (tempo, rhythm, key, chord, singing-voice, and instrumentation description), automatic indexing (music genre/style, mood), music similarity, music audio summary, chorus detection and audio identification. A specificity of the project is the creation of a large-music-audio corpus in order to train and validate all the algorithms developed during the project.

 

 

Position description

The Researcher would be in charge of the development of the technologies related to audio identification. Researcher will also collaborate with the evaluation team who evaluate algorithms performances and with the developer team.

Required profile

 Very high skills in audio signal processing (spectral analysis, partial tracking, audio coding, audio-feature extraction, parameter estimation)

Very high skills in audio indexing and data mining (statistical modelling, automatic feature selection algorithm, …)

Very high-skills in large-database search algorithms (structure indexing)

Good knowledge of Linux, Windows, MAC-OS environments

High-skill in Matlab programming, skills in C/C++ programming

High productivity, methodical work, excellent programming style.

 

Salary

According to background and experience

Applications

Please send an application letter together with your resume and any suitable information addressing the above issues preferably by email to: peeters_a_t_ircam dot fr with cc to vinet_a_t_ircam dot fr, rod_a_t_ircam dot fr

 _________________________________________________________________________________

 

 L’Ircam recrute un Chargé de recherche et développement H/F – en CDD de 18 mois et à temps plein

Poste disponible à partir du 4 janvier 2010

Présentation de l’Ircam

L'Ircam est une association à but non-lucratif, associée au Centre National d'Art et de Culture Georges Pompidou, dont les missions comprennent des activités de recherche, de création et de pédagogie autour de la musique du XXème siècle et de ses relations avec les sciences et technologies. Au sein de son département R&D, des équipes spécialisées mènent des travaux de recherche et de développement informatique dans les domaines de l'acoustique, du traitement des signaux sonores, des technologies d’interaction, de l’informatique musicale et de la musicologie. L'Ircam est situé au centre de Paris à proximité du Centre Georges Pompidou au 1, Place Stravinsky 75004 Paris.

Introduction au projet Quaero

Quaero est un programme collaborative de recherche et développement de 200 M€. Le programme se concentre sur l’extraction automatique d’information, l’analyse, la classification et l’utilisation des contenus multimédias digitaux pour les besoins professionnels et grand publique. Le travail de recherche se concentre sur la gestion de grandes quantités de documents multimédia et multilingue, incluant le texte, la parole, la musique, l’image et la vidéo.

Les applications principales identifiées et développées dans le projet sont :

Des portails et outils de recherche et de navigation de document multimédias grand public, par exemple la recherche de podcasts, photos ou vidéo sur PC, télévision ou téléphone mobile;

Des outils pour la numérisation, l'enrichissement et la diffusion du patrimoine audiovisuel et des bibliothèques numériques ;

Des solutions professionnelles intégrées de gestion de contenus audiovisuels et de métadonnées (analyse,  fusion, agrégation, indexation, archivage).

 

Le consortium Quaero a été crée afin de répondre au nouveau besoin d’analyse des contenus multimédia tant pour le grand publique que pour les professionnels, face à l’explosion d’information numérique accessible et la prolifération des moyens d’accès à celle-ci (PC, TV, handheld devices). De plus amples informations peuvent être obtenues à l’adresse : www.quaero.org/.

Rôle de l’Ircam dans le projet Quaero

Dans le projet Quaero, l’Ircam est en charge de la coordination des activités de recherche en indexation audio/ musique, ainsi que du développement des technologies d’indexation audio/ musique : extraction de description de contenu musical (tempo, rhythm, key, chord, singing-voice, et instrumentation description), indexation automatique (music genre/style, mood), similarité musicale, résumé audio musicaux, détection du chorus et identification audio. Une spécificité du projet est la création d’un large corpus audio/ musique pour l’entraînement et la validation des algorithmes développés durant le projet.

Missions

Le Chargé de recherche et développement sera en charge du développement des technologies relatives à l’identification audio.

Le Chargé de recherche et développement collaborera également avec l’équipe d’évaluation du projet (en charge de l’évaluation des performances des algorithmes développés), ainsi qu’avec l’équipe de développement.

 

Profil recherché

Grande expérience en recherche en traitement du signal (analyse spectrale, extraction de descripteurs audio, estimation de paramètres) si possible doctorat dans ce domaine

Grande expérience de recherche en indexation audio et analyse des données (modélisation statistique,  sélection de paramètres, …) si possible doctorat dans ce domaine

 Expérience en algorithme de recherche dans de très grandes bases de données

Bonne connaissance des environnements Linux, Windows et Mac OS-X.

Très bonne connaissance du langage Matlab,

 Connaissance des langages C et C++

Haute productivité, travail méthodique, excellent style de programmation, bonne communication rigueur

Salaire

Selon formation et expérience professionnelle

Candidatures

Prière d'envoyer une lettre de motivation et un CV détaillant le niveau d'expérience/expertise dans les domaines mentionnés ci-dessus (ainsi que tout autre information pertinente) à peeters_a_t_ircam dot fr avec copie à

vinet_a_t_ircam dot fr, rod_a_t_ircam dot fr

Back to Top

6-27 . (2009-11-25) Proposition de these CIFRE Univ de Strasbourg France

Proposition de Thèse CIFRE en informatique : Compilation d'unités phono-lexicales du langage parlé.

Cette thèse s'inscrit dans le cadre d'un projet de réalisation d'une plate-forme numérique comportant la conception d'un système logiciel permettant, notamment, l'apprentissage de l'écriture à partir de l'expression orale, la correction automatique d'erreurs orthographiques et grammaticales, et l'indexation sémantique de textes. Ce projet allie des chercheurs en phonétique, en linguistique (syntaxe et sémantique intrinsèque) et en informatique (compilation et optimisation de codes).

L'approche adoptée consiste à considérer que l’écriture implique la primauté de la substance phonique (les sons, les syllabes) et que de ce fait, il faut concevoir un modèle unitaire d’emblée établi sur le couplage entre le système phonologique, caractérisé par la variabilité des sons, et le système grammatical, caractérisé par la stabilité des unités graphiques. On fait donc l’hypothèse méthodologique que ce couplage est représentable dans la partie frontale d’un compilateur via les procédures de gestion de la table des symboles et de gestion des erreurs liées aux niveaux de l’analyse phono-lexicale, de l’analyse syntaxique et de l’analyse sémantique.

Au niveau phonique, la définition des unités lexicales implique de traiter simultanément l'intonation, l'accentuation, les syllabes et les phonèmes, pour segmenter la chaîne en mots phonologiques, représentables par des symboles. Le passage à l'écrit exige ainsi une méthode de construction récursive des unités à différents niveaux d’analyse par la formulation de règles d’inférences qui font correspondre les unités de niveau phonologique et les unités de niveau grammatical : la définition des unités au niveau phonologique est simultanément élaborée et couplée avec celle des unités grammaticales constituant l’énoncé selon une méthode descendante. D’où l’hypothèse de la hiérarchie des unités : une unité grammaticale est une structure relationnelle construite d’attributs atomiques définis par la hiérarchie d’une grammaire formelle.

Chaque unité lexicale peut jouer plusieurs rôles différents, selon le contexte de son apparition, contrairement aux unités lexicales des langages de programmation. Le type qui lui est associé est donc multiple. Par exemple « sort » peu jouer le rôle de verbe ou de nom commun, selon sa place
dans la phrase analysée. Cette place ne peut être identifiée qu’à l’étape d’analyse syntaxique, par confrontation avec les règles de production de la grammaire.

Le processus de compilation doit ensuite servir à extraire l’information de la sémantique du texte analysé, celle-ci devant être exprimée par une syntaxe « élémentaire » d’arbres de syntaxe abstraite étiquetés par des attributs.

Le modèle d'analyse descendante de la phrase, qui permet en effet de contrôler la génération et l'analyse de phrases de complexité croissante, servira de modèle à la constitution d'une base de données relationnelle constituée de tables reliées aux attributs des unités de sens. Les champs des tables représenteront les opérations sur les unités (substitution, déplacement, ajout, réduction).

Ce travail de thèse s'effectuera en collaboration avec l'Université de Strasbourg et l'entreprise Digora à Strasbourg, partenaire Oracle. Côté université, l'étudiant sera encadré par Rudolph Sock (phonéticien), Gérard Reb (linguiste) et Philippe Clauss (informaticien). L'étudiant alternera des périodes de travail en laboratoire (laboratoire LSIIT - Strasbourg) et en entreprise. Le doctorant sera amené à travailler avec un phonéticien post-doctorant. Le travail de thèse pourra débuter dès l'accomplissement de la procédure administrative d'établissement de la convention CIFRE pour une durée de 3 ans. Le salaire minimum est de 23 484 € annuel brut.

Prendre contact avec Philippe Clauss (clauss@unistra.fr)

Liens :
Laboratoire LSIIT : http://www-lsiit.u-strasbg.fr
Digora : http://www.digora.com
Université de Strasbourg : http://www.unistra.fr
Dispositif CIFRE : http://www.anrt.asso.fr/fr/espace_cifre/accueil.jsp 

Back to Top

6-28 . (2009-11-26) Postdocs at LIMSI Paris

The Spoken Language Processing Group at the LIMSI/CNRS (http://www.limsi.fr/tlp) is looking for postdocs, non-permanent research engineers, and doctoral students to participate in a number of research projects funded by national and European programs.

The main research areas are:

* Core technology for speech recognition (acoustic modeling, language modeling, ...)

* Speech-to-speech translation

* Speaker recognition and speaker diarization

 * Language Identification

* Audio indexing in a multilingual context (English, French, German, Arabic, Mandarin, Spanish, Portuguese, Italian, Dutch, Finnish, Danish, Greek, ...)

Preference will be given to candidates with experience in one or more of the following areas: speech processing, computational linguistics, signal processing and computer science.

Applicants should be experienced programmers and be familiar with the Unix environment, and be able to speak and write in English.

Contract duration for postdoc and research engineers: 1 to 3 years Location: LIMSI/CNRS Orsay, France (South of Paris)

Projects at LIMSI offer an exceptional opportunity to address challenging research problems with some of the most prestigious academic and industrial partners. Interested candidates should send a CV to Jean-Luc Gauvain (gauvain@limsi.fr)

Back to Top

6-29 . (2009-11-27) PhD positions for the CMU-Portugal program

PhD.  Program Carnegie Mellon-PORTUGAL in the area of Language and Information Technologies The Language Technologies Institute (LTI) of the School of Computer Science at Carnegie Mellon University offers a dual degree Ph.D. Program in Language and Information Technologies in cooperation with Portuguese Universities.

This Ph.D. program is part of the Portugal-Carnegie Mellon Partnership. The Language Technologies Institute, a world leader in the areas of speech processing, language processing, information retrieval, machine translation, machine learning, and bio-informatics, has been formed 20 years ago. The breadth of language technologies expertise at LTI enables new research in combinations of the core subjects, for example, in speech-to-speech translation, spoken dialog systems, language-based tutoring systems, and question/answering systems. The Portuguese consortium of Universities includes (but is not limited to) the Spoken Language Systems Lab (L2F) of INESC-ID Lisbon/IST, the University of Lisbon (FLUL), the University of Beira Interior (UBI) and the University of Algarve (UALG). These Universities share expertise in the same language technologies as LTI, although with a strong focus on processing the Portuguese language.

 Each Ph.D. student will receive a dual degree from LTI and the selected Portuguese University, being co-supervised by one advisor from each institute, and spending approximately half of the 5-year doctoral program at each institute. The academic part will be done during the first 2 years, including a maximum of 8 courses, with a proper balance of focus areas (Linguistic, Computer Science, Statistical/Learning, Task Orientation). The remaining 3 years of the doctoral program will be dedicated to research.

The thesis topic will be in one of the research areas of the cooperation program, defined by the two advisors. Two multilingual topics have been identified as primary research areas (although other areas of human language technologies may be also contemplated): computer aided language learning (CALL) and speech-to-speech machine translation (S2SMT). The doctoral students will be involved in one of these two projects aimed at building real HLT systems. These projects will involve at least two languages, one of them being Portuguese, the target language for the CALL system to be developed and either the source or target language (or both) for the S2SMT system. These two projects provide a focus for the proposed research; through them the collaboration will explore the main core areas in language technology.

The scholarship will be funded by the Foundation for Science and Technology (FCT), Portugal. How to Apply The application deadline for the LT Ph.D. program in the scope of the CMU-Portugal partnership is December 15, 2009.

Students interested in the dual doctoral program must apply by filling the corresponding form at the LTI webpage. For more information about the joint degree doctoral program in LT, send email to the coordinators of the Portuguese consortium and of the LTI admissions: Isabel.Trancoso at inesc-id dot pt LTI_Portugal_Admissions at cs dot cmu dot edu The applications will be screened by a joint committee formed by representatives of LTI and of the Portuguese Universities. The candidates should indicate their scores in GRE and TOEFL tests. Despite the particular focus on the Portuguese language, applications are not in any way restricted to native or non-native speakers of Portuguese. Post-Doc positions are also available in the scope of the same program. For additional information on these positions, contact Isabel.Trancoso at inesc-id dot pt.

Back to Top

7 . Journals

 

Back to Top

7-1 . IEEE Special Issue on Speech Processing for Natural Interaction with Intelligent Environments

Call for Papers IEEE Signal Processing Society IEEE Journal of Selected Topics in Signal Processing  Special Issue on Speech Processing for Natural Interaction                   with Intelligent Environments  With the advances in microelectronics, communication technologies and smart materials, our environments are transformed to be increasingly intelligent by the presence of robots, bio-implants, mobile devices, advanced in-car systems, smart house appliances and other professional systems. As these environments are integral parts of our daily work and life, there is a great interest in a natural interaction with them. Also, such interaction may further enhance the perception of intelligence. "Interaction between man and machine should be based on the very same concepts as that between humans, i.e. it should be intuitive, multi-modal and based on emotion," as envisioned by Reeves and Nass (1996) in their famous book "The Media Equation". Speech is the most natural means of interaction for human beings and it offers the unique advantage that it does not require carrying a device for using it since we have our "device" with us all the time.  Speech processing techniques are developed for intelligent environments to support either explicit interaction through message communications, or implicit interaction by providing valuable information about the physical ("who speaks when and where") as well as the emotional and social context of an interaction. Challenges presented by intelligent environments include the use of distant microphone(s), resource constraints and large variations in acoustic condition, speaker, content and context. The two central pieces of techniques to cope with them are high-performing "low-level" signal processing algorithms and sophisticated "high-level" pattern recognition methods.  We are soliciting original, previously unpublished manuscripts directly targeting/related to natural interaction with intelligent environments. The scope of this special issue includes, but is not limited to:  * Multi-microphone front-end processing for distant-talking interaction * Speech recognition in adverse acoustic environments and joint          optimization with array processing * Speech recognition for low-resource and/or distributed computing          infrastructure * Speaker recognition and affective computing for interaction with          intelligent environments * Context-awareness of speech systems with regard to their applied          environments * Cross-modal analysis of speech, gesture and facial expressions for          robots and smart spaces * Applications of speech processing in intelligent systems, such as          robots, bio-implants and advanced driver assistance systems.  Submission information is available at http://www.ece.byu.edu/jstsp. Prospective authors are required to follow the Author's Guide for manuscript preparation of the IEEE Transactions on Signal Processing at http://ewh.ieee.org/soc/sps/tsp. Manuscripts will be peer reviewed according to the standard IEEE process.  Manuscript submission due:    		 		  		 		  Jul. 3, 2009 First review completed:       		 		  		 		  Oct. 2, 2009 Revised manuscript due:      		 		  		 		  Nov. 13, 2009 Second review completed:      		 		  		 		  Jan. 29, 2010 Final manuscript due:         		 		  		 		  Mar. 5, 2010  Lead guest editor:         Zheng-Hua Tan, Aalborg University, Denmark             zt@es.aau.dk  Guest editors:         Reinhold Haeb-Umbach, University of Paderborn, Germany             haeb@nt.uni-paderborn.de         Sadaoki Furui, Tokyo Institute of Technology, Japan             furui@cs.titech.ac.jp         James R. Glass, Massachusetts Institute of Technology, USA             glass@mit.edu         Maurizio Omologo, FBK-IRST, Italy             omologo@fbk.eu
Back to Top

7-2 . Special issue "Speech as a Human Biometric: I know who you are from your voice" Int. Jnl Biometrics

International Journal of Biometrics  (IJBM)
 
Call For papers
 
Special Edition on: "Speech as a Human Biometric: I Know Who You Are From Your Voice!"
 
Guest Editors: 
Dr. Waleed H. Abdulla, The University of Auckland, New Zealand
Professor Sadaoki Furui, Tokyo Institute of Technology, Japan
Professor Kuldip K. Paliwal, Griffith University, Australia
 
 
The 2001 MIT Technology Review indicated that biometrics is one of the emerging technologies that will change the world. Human biometrics is the automated recognition of a person using adherent distinctive physiological and/or involuntary behavioural features.
 
Human voice biometrics has gained significant attention in recent years. The ubiquity of cheap microphones, human identity information carried by voice, ease of deployment, natural use, telephony applications diffusion, and non-obtrusiveness have been significant motivations for developing biometrics based on speech signals. The robustness of speech biometrics is sufficiently good. However, there are significant challenges with respect to conditions that cannot be controlled easily. These issues include changes in acoustical environmental conditions, respiratory and vocal pathology, age, channel, etc. The goal of speech biometric research is to solve and/or mitigate these problems.
 
This special issue will bring together leading researchers and investigators in speech research for security applications to present their latest successes in this field. The presented work could be new techniques, review papers, challenges, tutorials or other relevant topics.
 
   Subject Coverage
 
Suggested topics include, but are not limited to:
 
Speech biometrics
Speaker recognition
Speech feature extraction for speech biometrics
Machine learning techniques for speech biometrics
Speech enhancement for speech biometrics
Speech recognition for speech biometrics
Speech changeability over age, health condition, emotional status, fatigue, and related factors
Accent, gender, age and ethnicity information extraction from speech signals
Speech watermarking
Speech database security management
Cancellable speech biometrics
Voice activity detection
Conversational speech biometrics
   Notes for Prospective Authors
 
Submitted papers should not have been previously published nor be currently under consideration for publication elsewhere
 
All papers are refereed through a peer review process. A guide for authors, sample copies and other relevant information for submitting papers are available on the Author Guidelines page
 
   Important Dates
 
Manuscript due: 15 June, 2009
 
Acceptance/rejection notification: 15 September, 2009
 
Final manuscript due: 15 October, 2009
 
For more information please go to Calls for Papers page (http://www.inderscience.com/callPapers.php) OR The IJBM home page (http://www.inderscience.com/ijbm).
 
 
Back to Top

7-3 . Special on Voice transformation IEEE Trans ASLP

CALL FOR PAPERS
IEEE Signal Processing Society
IEEE Transactions on Audio, Speech and Language Processing
Special Issue on Voice Transformation
With the increasing demand for Voice Transformation in areas such as
speech synthesis for creating target or virtual voices, modeling various
effects (e.g., Lombard effect), synthesizing emotions, making more natural
dialog systems which use speech synthesis, as well as in areas like
entertainment, film and music industry, toys, chat rooms and games, dialog
systems, security and speaker individuality for interpreting telephony,
high-end hearing aids, vocal pathology and voice restoration, there is a
growing need for high-quality Voice Transformation algorithms and systems
processing synthetic or natural speech signals.
Voice Transformation aims at the control of non-linguistic information of
speech signals such as voice quality and voice individuality. A great deal
of interest and research in the area has been devoted to the design and
development of mapping functions and modifications for vocal tract
configuration and basic prosodic features.
However, high quality Voice Transformation systems that create effective
mapping functions for vocal tract, excitation signal, and speaking style
and whose modifications take into account the interaction of source and
filter during voice production, are still lacking.
We invite researchers to submit original papers describing new approaches
in all areas related to Voice Transformation including, but not limited to,
the following topics:
* Preprocessing for Voice Transformation
(alignment, speaker selection, etc.)
* Speech models for Voice Transformation
(vocal tract, excitation, speaking style)
* Mapping functions
* Evaluation of Transformed Voices
* Detection of Voice Transformation
* Cross-lingual Voice Transformation
* Real-time issues and embedded Voice Transformation Systems
* Applications
The call for paper is also available at:
http://www.ewh.ieee.org/soc/sps/tap/sp_issue/VoiceTransformationCFP.pdf
Prospective authors are required to follow the Information for Authors for
manuscript preparation of the IEEE Transactions on Audio, Speech, and
Language Processing Signal Processing at
http://www.signalprocessingsociety.org/periodicals/journals/taslp-author-information/
Manuscripts will be peer reviewed according to the standard IEEE process.
Schedule:
Submission deadline: May 10, 2009
Notification of acceptance: September 30, 2009
Final manuscript due: October 30, 2009
Publication date: January 2010
Lead Guest Editor:
Yannis Stylianou, University of Crete, Crete, Greece
yannis@csd.uoc.gr
Guest Editors:
Tomoki Toda, Nara Inst. of Science and Technology, Nara, Japan
tomoki@is.naist.jp
Chung-Hsien Wu, National Cheng Kung University, Tainan, Taiwan
chwu@csie.ncku.edu.tw
Alexander Kain, Oregon Health & Science University, Portland Oregon, USA
kaina@ohsu.edu
Olivier Rosec, Orange-France Telecom R&D, Lannion, France
olivier.rosec@orange-ftgroup.com

Back to Top

7-4 . Special Issue on Statistical Learning Methods for Speech and Language Processing

IEEE Signal Processing Society
IEEE Journal of Selected Topics in Signal Processing
Special Issue on Statistical Learning Methods for Speech and
Language Processing
In the last few years, significant progress has been made in both
research and commercial applications of speech and language
processing. Despite the superior empirical results, however, there
remain important theoretical issues to be addressed. Theoretical
advancement is expected to drive greater system performance
improvement, which in turn generates the new need of in-depth
studies of emerging novel learning and modeling methodologies. The
main goal of this special issue is to fill in the above need, with
the main focus on the fundamental issues of new emerging approaches
and empirical applications in speech and language processing.
Another focus of this special issue is on the unification of
learning approaches to speech and language processing problems. Many
problems in speech processing and in language processing share a
wide range of similarities (despite conspicuous differences), and
techniques in speech and language processing fields can be
successfully cross-fertilized. It is of great interest to study
unifying modeling and learning approaches across these two fields.
The goal of this special issue is to bring together a diverse but
complementary set of contributions on emerging learning methods for
speech processing, language processing, as well as unifying
approaches to problems across the speech and language processing
fields.
We invite original and unpublished research contributions in all
areas relevant to statistical learning, speech processing and
natural language processing. The topics of interest include, but are
not limited to:
• Discriminative learning methods and applications to speech and language processing
• Unsupervised/semi-supervised learning algorithms for Speech and language processing
• Model adaptation to new/diverse conditions
• Multi-engine approaches for speech and language processing
• Unifying approaches to speech processing and/or language processing
• New modeling technologies for sequential pattern recognition
Prospective authors should visit http://www.signalprocessingsociety.org/publications/periodicals/jstsp/
for information on paper submission. Manuscripts should be submitted
using the Manuscript Central system at http://mc.manuscriptcentral.com/jstsp-ieee.
Manuscripts will be peer reviewed according to the standard IEEE process.
Manuscript submission due: Aug. 7, 2009
First review completed: Oct. 30, 2009
Revised manuscript due: Dec. 11, 2009
Second review completed: Feb. 19, 2010
Final manuscript due: Mar. 26, 2010
Lead guest editor:
Xiaodong He, Microsoft Research, Redmond (WA), USA, xiaohe@microsoft.com
Guest editors:
Li Deng, Microsoft Research, Redmond (WA), USA, deng@microsoft.com
Roland Kuhn, National Research Council of Canada, Gatineau (QC), Canada, roland.kuhn@cnrc-nrc.gc.ca
Helen Meng, The Chinese University of Hong Kong, Hong Kong, hmmeng@se.cuhk.edu.hk
Samy Bengio, Google Inc., Mountain View (CA), USA, bengio@google.com 
Back to Top

7-5 . SPECIAL ISSUE OF SPEECH COMMUNICATION: Perceptual and Statistical Audition

Perceptual and Statistical Audition
 
To give authors a bit more time we extended the deadline for the call for papers for the special issue of Speech Communication to the 27th July, 2009
 
See the call for papers below for more details.
 
Aims and Scope
Current trends in audio analysis are strongly founded in statistical principles, or on approaches that are influenced by empirically derived, or perceptually motivated rules of auditory perception. These approaches are perceived as orthogonal, but new ideas that draw upon both perceptual and statistical principles can often result in superior performance. The relationship of these two approaches however, has not been thoroughly explored and is still a developing field of research.
In this special issue we invite researchers to submit papers on original and previously unpublished work on both approaches, and especially on hybrid techniques that combine perceptual and statistical principles, as applied to speech, music and audio analysis.  Recent advances in neurosciences have emphasized the important role of spectro-temporal modulations in human perception. We encourage submission of original and previously unpublished work on techniques that exploit the information in spectro-temporal modulations, particularly within a statistical framework.
Papers describing relevant research and new concepts are solicited on, but not limited to, the following topics:
 
 - Analysis of audio including speech and music
 - Audio classification
 - Speech recognition
 - Signal separation
 - Multi-channel analysis
 - Computational Auditory Scene Analysis  (CASA)
 - Spectro-temporal modulation methods
 - Perceptual aspects of statistical algorithms, such as Independent Component Analysis and Non-negative Matrix Factorization.
 - Hybrid methods that use CASA-like cues in a statistical framework
 
Guest Editors
Martin Heckmann Bhiksha Raj Paris Smaragdis
Honda Research Institute Europe Carnegie Mellon University Adobe Advanced Technology Labs
63073 Offenbach a. M., Germany Pittsburgh, PA 15217 Newton, MA 02446
martin.heckmann@honda-ri.de bhiksha@cs.cmu.edu paris@adobe.com
 
NEW DEADLINE
Papers due 27th July, 2009
 
Submission Guidelines
Authors should consult the "Guide for Authors", available online, at http://www.elsevier.com/locate/specom for information about the preparation of their manuscripts. Authors, please submit your paper via http://ees.elsevier.com/specom, choosing "Perceptual and Statistical Audition" as the Article Type. If you are a first time user of the system, please register yourself as an author. 
 
Back to Top

7-6 . Call for a chapter in Conversational Agents and Natural Language Interaction

Conversational Agents and Natural Language Interaction: Techniques and Effective Practices
A book edited by Dr. Diana Perez-Marin and Dr. Ismael Pascual-Nieto
Universidad Rey Juan Carlos, Universidad Autonoma de Madrid, Spain

We cordially invite you to submit a chapter for the forthcoming Conversational Agents and Natural Language Interaction: Techniques and Effective Practices book to be published by IGI Global (http://www.igi-global.com/requests/details.asp?ID=754)

-------------
Introduction
-------------
Human-Computer Interaction can be understood as two potent information processors (a human and a computer) trying to communicate with each
other using a highly restricted interface. Natural Language (NL) Interaction, that is, to let the users express in natural language could be the solution to
improve the communication between human and computers. Conversational agents exploit NL technologies to engage users in text-based information-seeking and task-oriented dialogs for a broad range of applications such as e-commerce, help desk, Web site navigation,
personalized service, and education.

The benefits of agent expressiveness have been highlighted both for verbal expressiveness and for non-verbal expressiveness. On the other hand, there
are also studies indicating that when using conversational agents mixed results can appear. These studies reveal the need to review the research in a
field with a promising future and a great impact in the area of Human-Computer Interaction.

-----------------------
Objective of the Book
------------------------
The main objective of the book is to identify the most effective practices when using conversational agents for different applications. Some secondary
objectives to fulfill the main goal are:
- To gather a comprehensive number of experiences in which conversational agents have been used for different applications
- To review the current techniques which are being used to design conversational agents
- To encourage authors to publish not only successful results, but also non-successful results and a discussion of the reasons that may have
caused them

------------------
Target Audience
------------------
The proposed book is intended to serve as a reference guide for researchers who want to start their research in the promising field of conversational
agents. It will not be necessary that readers have previous knowledge on the topic.

Recommended topics include, but are not limited to, the following:
1. Fundamental concepts
- Definition and taxonomy of conversational agents
- Motivation, benefits, and issues of their use
- Underlying psychological and social theories
2. Design of conversational agents
- Techniques
- Frameworks
- Methods
3. Practices
- Experiences of use of conversational agents in:
- E-commerce
- Help desk
- Website navigation
- Personalized service
- Training or education
- Results achieved
- Discussion of the reasons of their success of failure
4. Future trends
- Issues that should be solved in the future
- Expectations for the future

-----------------------
Submission Procedure
-----------------------
Researchers and practitioners are invited to submit on or before December 16, 2009, a 2-3 page chapter proposal clearly explaining the mission and
concerns of his or her proposed chapter. Authors of accepted proposals will be notified by January 16, 2010 about the status of their proposals and sent
chapter guidelines. Full chapters (8,000–10,000 words) are expected to be submitted by April 16, 2010. All submitted chapters will be reviewed on a
double-blind review basis. Contributors may also be requested to serve as reviewers for this project.

----------
Publisher
----------
This book is scheduled to be published by IGI Global (formerly Idea Group Inc.), publisher of the “Information Science Reference” (formerly Idea Group Reference), “Medical Information Science Reference,” “Business Science Reference,” and “Engineering Science Reference” imprints. For additional information regarding the publisher, please visit www.igi-global.com. This publication is anticipated to be released in 2011.

-----------------
Important Dates
-----------------
December 16, 2009: Proposal Submission Deadline
January 16, 2010: Notification of Acceptance
April 16, 2010: Full Chapter Submission
June 30, 2010: Review Results Returned
July 30, 2010: Final Chapter Submission
September 30, 2010: Final Deadline

------------------------------------
Editorial Advisory Board Members
--------------------------------------
Dr. Rafael Calvo, University of Sydney, Australia
Dr. Diane Inkpen, University of Ottawa, Canada
Dr. Pamela Jordan, University of Pittsburgh, U.S.A.
Dr. Ramón López Cózar, Universidad de Granada, Spain
Dr. Max Louwerse, University of Memphis, U.S.A.
Dr. José Antonio Macías, Universidad Autónoma de Madrid, Spain
Dr. Mick O’Donnell, Universidad Autónoma de Madrid, Spain
Dr. George Veletsianos, University of Manchester, U.K.
Incompleted list, full list to be announced on November, 16

--------------------------
Inquiries and submissions
---------------------------
Please send all inquiries and submissions (preferably through e-mail) to:

Dr. Diana Perez-Marin, Universidad Rey Juan Carlos, Spain
Email: diana.perez@urjc.es

and

Dr. Ismael Pascual Nieto, Universidad Autonoma de Madrid, Spain
Email: ismael.pascual@uam.es 

Back to Top

8 . Future Speech Science and Technology Events

8-1 . (2009-11-19) GIPSA Seminar Grenoble France

Jeudi 10 décembre 2009, 13h30 – Séminaire externe **
========================================
Jack Loomis
Department of Psychology, University of California, Santa Barbara
Increasing the effectiveness of virtual reality through research on perception, cognition, and action
The effectiveness of virtual reality for a wide range of applications is limited by perceptual error and inadequate locomotion interfaces.
I will discuss research on visual and auditory perception in virtual environments, showing how imperfection in the implementation of virtual reality has resulted in errors in distance perception, errors which reduce the effectiveness of virtual reality as a medium for training new skills.
I will also discuss research showing how the fidelity of the locomotion interface greatly impacts performance on spatial cognition tasks.



Jeudi 17 décembre 2009, 13h30 – Séminaire externe **
========================================
Suzanne Fuchs

ZAS/Phonetik, Berlin, Allemagne

Titre à venir
Résumé à venir 

 

 

 

 

J

Back to Top

8-2 . (2009-12-14)7th International Conference on Natural Language Processing Hyberabad, India

 http://www.icon2009.in   7th International Conference on Natural Language Processing University of Hyderabad, Hyderabad, India December 14-17, 2009 
Back to Top

8-3 . (2009-12-15) All-India Conference of Linguists University of Hyderabad, India

http://www.aicl2009.in All-India Conference of Linguists University of Hyderabad, Hyderabad, India December 15-17, 2009

Back to Top

8-4 . (2010-02-10) International Conference on Socio-Cultural Approaches to Translation, Hyderabad, India

http://www.icscat2010.in/ International Conference on Socio-Cultural Approaches to Translation University of Hyderabad, Hyderabad, India Feb 10-12, 2010 

Back to Top

8-5 . (2010-03-13) CfP SpokenQuery Voice Search Workshop 2010 (SQ 2010)

SpokenQuery Voice Search Workshop 2010 (SQ 2010)

 

13 March 2010, Dallas, TX

http://www.spokenquery.org/

 

Aims

 

Papers are solicited for the SpokenQuery 2010 Workshop on Voice Search (SQ2010), to be held in Dallas, Texas as a satellite to to ICASSP 2010.

 

Small devices with high computing power have become ubiquitous, via cellphone-like devices, car communication systems, etc. This new reality makes it feasible to utilize speech as the preferred input mode. Searching for information from a spoken query brings its own challenges that go beyond the inherent difficulties of speech recognition or information retrieval.

 

The first SQ workshop aims at bringing together researchers working in the area that overlaps speech processing and information retrieval. This workshop is intended to be an open forum that will allow different research communities to become better acquainted with each other and to share ideas.

 

This one-day workshop will include a limited number of oral presentations, chosen for breadth and stimulation, and an informal atmosphere to promote discussion. We hope this workshop will expose participants to a broad perspective of techniques, tools, best practices, and innovation, which will provide the impetus for new research and compelling variants on current approaches.

 

Papers describing relevant research and new concepts are solicited on, but not limited to, the following topics:

 

·        Spoken queries in various languages

·        Retrieval of spoken or text documents via spoken query

·        Handling of new vocabulary words

·        Tools and databases

·        Commercial applications

·        Automotive spoken query applications

·        Spoken query applications via a mobile phone

·        Server-based and client-based approaches to automatic speech recognition

·        Novel adaptation and noise robustness methods

·        Novel demonstrations

 

Manuscripts must be between 4 and 6 pages long, in standard ICASSP double-column format. Accepted papers will be published in the workshop proceedings.

 

Important Dates

 

Paper submission: 30 November 2009

Notification of paper acceptance: 15 January 2010

Workshop: 13 March 2010

 

Organizers

 

Bhiksha Raj, CMU, USA

Evandro Gouvêa, MERL, USA

Tony Ezzat, MERL, USA

 

Technical Committee

 

Michiel Bacchiani, Google, USA

Fabio Crestani, UNISI, Switzerland

Ute Ehrlich, DaimlerAG, Germany

Mazin Gilbert,  ATT, USA

Prasad Venkatesh, Ford, USA

Chao Wang, Vlingo, USA

Geoffrey Zweig, Microsoft, USA

 

Contact

 

To email the organizers, please send email to organizers@spokenquery.org.

Back to Top

8-6 . (2010-03-15) IEEE ICASSP 2010 International Conference on Acoustics, Speech, and Signal Processing March 15 – 19, 2010 Sheraton Dallas Hotel * Dallas, Texas, U.S.A.

IEEE ICASSP 2010   International Conference on Acoustics, Speech, and Signal Processing                            March 15 – 19, 2010                Sheraton Dallas Hotel * Dallas, Texas, U.S.A.                         http://www.icassp2010.com/     The 35th International Conference on Acoustics, Speech, and Signal Processing (ICASSP) will be held at the Sheraton Dallas Hotel, March 15 – 19, 2010. The ICASSP meeting is the world’s largest and most comprehensive technical conference focused on signal processing and its applications. The conference will feature world-class speakers, tutorials, exhibits, and over 120 lecture and poster sessions on the following topics:   * Audio and electroacoustics  * Bio imaging and signal processing  * Design and implementation of signal processing systems  * Image and multidimensional signal processing  * Industry technology tracks  * Information forensics and security  * Machine learning for signal processing  * Multimedia signal processing  * Sensor array and multichannel systems  * Signal processing education  * Signal processing for communications  * Signal processing theory and methods  * Speech processing  * Spoken language processing  Welcome to Texas, Y’All! Dallas is known for living large and thinking big. As the nation’s ninth-largest city, Dallas is exciting, diverse and friendly — factors that contribute to its success as a leading leisure and convention destination. There’s a whole “new” vibrant Dallas to enjoy-new entertainment districts, dining, shopping, hotels, arts and cultural institutions- with more on the way. There’s never been a more exciting time to visit Dallas than now.  Submission of Papers: Prospective authors are invited to submit full-length, four-page papers, including figures and references, to the ICASSP Technical Committee. All ICASSP papers will be handled and reviewed electronically. The ICASSP 2010 website www.icassp2010.com will provide you with further details. Please note that all submission deadlines are strict.  Tutorial and Special Session Proposals: Tutorials will be held on March 14 and 15, 2010. Brief proposals should be submitted by July 31, 2009, through the ICASSP 2010 website and must include title, outline, contact information for the presenter, and a description of the tutorial and material to be distributed to participants. Special sessions proposals should be submitted by July 31, 2009, through the ICASSP 2010 website and must include a topical title, rationale, session outline, contact information, and a list of invited papers. Tutorial and special session authors are referred to the ICASSP website for additional information regarding submissions.  * Important Deadlines *  Submission of Camera-Ready Papers      September 14, 2009  Notification of Paper Acceptance      December 11, 2009  Revised Paper Upload Deadline      January 8, 2010  Author’s Registration Deadline      January 15, 2010  For more detailed information, please visit the ICASSP 2010 official website, http://www.icassp2010.com/.
Back to Top

8-7 . (2010-03-20) CfP CMU Sphinx Users and Developers Workshop 2010 (CMU-SPUD 2010)

CMU Sphinx Users and Developers Workshop 2010 (CMU-SPUD 2010)

 

20 March 2010, Dallas, TX

 

http://www.cs.cmu.edu/~sphinx/Sphinx2010

 

Papers are solicited for the CMU Sphinx Workshop for Users and Developers (CMU-SPUD 2010), to be held in Dallas, Texas as a satellite to to ICASSP 2010.

 

CMU Sphinx is one of the most popular open source speech recognition systems. It is currently used by researchers and developers in many locations world-wide, including universities, research institutions and in industry. CMU Sphinx's liberal license terms has made it a significant member of the open source community and has provided a low-cost way for companies to build businesses around speech recognition.

 

The first SPUD workshop aims at bringing together CMU Sphinx users, to report on applications, developments and experiments conducted using the system. This workshop is intended to be an open forum that will allow different user communities to become better acquainted with each other and to share ideas. It is also an opportunity for the community to help define the future evolution of CMU Sphinx.

 

We are planning a one-day workshop with a limited number of oral presentations, chosen for breadth and stimulation, held in an informal atmosphere that promotes discussion. We hope this workshop will expose participants to different perspectives and that this in turn will help foster new directions in research, suggest interesting variations on current approaches and lead to new applications.

 

Papers describing relevant research and new concepts are solicited on, but not limited to, the following topics. Papers must describe work performed with CMU Sphinx:

 

·        Decoders: PocketSphinx, Sphinx-2, Sphinx-3, Sphinx-4

·        Tools: SphinxTrain, CMU/Cambridge SLM toolkit

·        Innovations / additions / modifications of the system

·        Speech recognition in various languages

·        Innovative uses, not limited to speech recognition

·        Commercial applications

·        Open source projects that incorporate Sphinx

·        Novel demonstrations

 

Manuscripts must be between 4 and 6 pages long, in standard ICASSP double-column format. Accepted papers will be published in the workshop proceedings.

 

IMPORTANT DATES

 

Paper submission: 30 November 2009

Notification of paper acceptance: 15 January 2010

Workshop: 20 March 2010

 

Organizers

 

Bhiksha Raj, Carnegie Mellon University, USA

Evandro Gouvêa, Mitsubishi Electric Research Labs, USA

Richard Stern, Carnegie Mellon University, USA

Alex Rudnicky, Carnegie Mellon University, USA

Rita Singh, Carnegie Mellon University, USA

David Huggins-Daines, Carnegie Mellon University, USA

Nickolay Schmyrev, Nexiwave, Russian Federation

Yannick Estève, Laboratoire d'Informatique de l'Université du Maine, France

 

Contact

 

To email the organizers, please send email to sphinx+workshop@cs.cmu.edu

Back to Top

8-8 . (2010-04-13) CfP Workshop: Positional phenomena in phonology and phonetics Wroclaw-



http://www.ifa.uni.wroc.pl/~glow33/phon.html

 Workshop: Positional phenomena in phonology and phonetics

(Organised by Zentrum für Allgemeine Sprachwissenschaft, Berlin)

*Date:* 13 April 2010
*Organisers:* Marzena Zygis, Stefanie Jannedy, Susanne Fuchs
*Deadline for abstract submission:* 1st November 2009
*Abstracts submitted to:* zygis@zas.gwz-berlin.de
*Invited speakers:*

  * Taehong Cho (Hanyang University, Seoul) confirmed
  * Grzegorz Dogil (University of Stuttgart) confirmed

*Venue:* /Instytut Filologii Angielskiej, ul. Kuz'nicza 22, 50-138 Wroc?aw/

Positional effects found cross-linguistically at the edges of prosodic
constituents (e.g. final lengthening, final lowering, strengthening
effects, or final devoicing) have increasingly received attention in
phonetic-phonological research. Recent empirical investigations of such
positional effects and their variability pose, however, a great number
of questions challenging e.g. the idea of perceptual invariance. It has
been claimed that acoustic variability is a necessary prerequisite for
the perceptual system to parse segmental strings into words, phrases or
larger prosodic units.

This workshop will provide a forum for discussing controversies and
recent developments regarding positional phenomena. We invite abstracts
bearing on positional effects from various perspectives.The following
questions can be addressed, but are not limited to:

 1. What kind of variability is found in the data, and how does such
    variability need to be accounted for? What positional effects are
    common cross-linguistically and how can they be attributed to
    perceptual, articulatory or aerodynamic principles?
 2. How does positional prominence (lexical stress; accent) interact
    with acoustic and articulatory realizations of prosodic
    boundaries? What are the positional (a)symmetries in the
    realizations of boundaries, and what are the mechanisms underlying
    them?
 3. How does left- and right-edge phrasal marking interact with the
    acoustic and articulatory realizations at these prosodic
    boundaries? How are these interpreted in phonetics and in phonology?
 4. What are the necessary prerequisites for the interpretation of
    prosodic constituents? Which auditory cues are essential for the
    perception of boundaries and positional effects? Are such cues
    language-specific?
 5. To what extent do lexical frequency, phonotactic probability, and
    neighbourhood density contribute to the production and recognition
    of prosodic boundaries in (fluent/spontaneous) speech?
 6. How are positional characteristics exploited during the process of
    language acquisition? How are they learned during the process of
    language acquisition? Are positional effects salient enough for L2
    learners?

Abstracts are invited for a 20-min. presentation (excluding discussion).
Abstracts should be sent in two copies: one with a name and one without
as attached files (the name(s) should also be clearly mentioned in the
e-mail) to: zygis@zas.gwz-berlin.de in .pdf format. Only electronic
submissions will be considered. Abstracts may not exceed two pages of
text with at least a one-inch margin on all four sides (measured on A4
paper) and must employ a font not smaller than 12 point. Each page may
include a maximum of 50 lines of text. An additional page with
references may be included.

Deadline for submissions: November 1, 2009.

Contact person: Marzena Zygis

-- 
*************************
Susanne Fuchs, PhD
ZAS/Phonetik
Schützenstrasse 18
10117 Berlin

phone: 030 20192 569
fax:   030 20192 402
webpage: http://susannefuchs.org
*************************


 
Back to Top

8-9 . (2010-05-10) Cfp Workshop on Prosodic Prominence: Perceptual and Automatic Identification

Extended deadline November 2009   Speech Prosody 2010 Satellite Workshop May 10th, 2010, Chicago, Illinois      Description of the workshop: Efficient tools for (semi-)automatic prosodic annotation are becoming more and more important for the speech community, as most systems of prosodic annotation rely on the identification of syllabic prominence in spoken corpora (whether they lead a phonological interpretation or not). The use of automatic and semi-automatic annotation has also facilitated multilingual research; many experiments on prosodic prominence identification have been conducted for European and non-European languages, and protocols have been written in order to build large databases of spoken languages prosodically annotated all around the world. The aim of this workshop is to bring together specialists of automatic prosodic annotation interested in the development of robust algorithms for prominence detection, and linguists who developed methodologies for the identification of prosodic prominence in natural languages on perceptual bases. The conference will include oral and poster sessions, and a final round table.   Scientific topics: 1. Annotation of prominence 2. Perceptual processing of prominences: gestalt theories’ background 3. Acoustic correlates of prominence 4. Prominence and its relations with prosodic structure 5. Prominence and its relations with accent, stress, tone and boundary 6. The use of syntactic/pragmatic information in prominence identification 7. Perception of prominence by naive/expert listeners 8. Statistical methods for prominence’s detection 9. Number of relevant prominence degrees : categorical or continuous scale 10.Prosodic prominence and visual perception   Submission of papers: Anonymous four-page papers (including figures and references) must be written in English, and be uploaded as pdf files here: https://www.easychair.org/login.cgi?conf=prom2010. All papers will be reviewed by at least three members of the scientific committee. Accepted four-page papers will be included in the online proceedings of the workshop published on the workshop website. The publication of extended selected papers after the workshop in a special issue of a journal is being considered.   Organizing Committee: Mathieu Avanzi (Université de Neuchâtel, CH) Anne Lacheret-Dujour (Université de Paris Ouest Nanterre) Anne-Catherine Simon (Université catholique de Louvain-la-Neuve)  Scientific committee: the names of the scientific committee will be announced in the second circular.   Venue: The workshop will take place in The Doubletree Hotel Magnificent Mile, in Chicago. See the Speech prosody 2010 website (http://www.speechprosody2010.illinois.edu/index.html) for further information.   Important deadlines: Submission of four-page papers: November 15, 2009 Notification of acceptation: January 15, 2009 Author's Registration Deadline: March 2, 2010 Workshop: March 10, 2010    Website of the workshop: http://www2.unine.ch/speechprosody-prominence
Back to Top

8-10 . (2010-05-11) Call For Special Session Proposals SPEECH PROSODY 2010

SPEECH PROSODY 2010

===============================================================

Call For Special Session Proposals

===============================================================

 

Speech Prosody 2010, the fifth international conference on speech prosody, invites proposals for special sessions addressing exciting current topics in the science and technology of spoken language prosody.  Special sessions may address any topic among the key topic areas of Speech Prosody 2010 (http://speechprosody2010.org/), or a topic that is too new to be included in the standard topic list.

 

 Proposals for special sessions should include the names and affiliations of the organizers, an abstract describing the topic of the special session, and a list of six to twelve potential authors doing current research in the topic area.  Proposals should be submitted by e-mail to speechprosody2010@yahoogroups.com.  In order to receive full consideration, proposals should be submitted by November 15, 2009.

===================

IMPORTANT DATES

===================

 

November 15, 2009: Manuscript deadline for regular Speech Prosody papers

November 15, 2009: Special Session Proposal deadline for full consideration

November 20, 2009: Acceptance letters mailed to Special Session organizers

December 15, 2009: Manuscript deadline for Special Session papers

January 15, 2010: Acceptance letters mailed to manuscript authors

May 11-14, 2010:  Conference, Speech Prosody 2010

Back to Top

8-11 . (2010-05-11) CfP Speech prosody 2010 Chicago IL USA

SPEECH PROSODY 2010   (New submission deadline)

Deadline Extension: REVISIONS ONLY ==================================

The Speech Prosody 2010 Organizing Committee is happy to announce a REVISIONS ONLY extension of our manuscript deadline. Authors who submit a draft manuscript by November 15 will be allowed to revise their manuscript, as often as necessary, until 8:00 AM Chicago time on November 23. The November 15 draft should include preliminary title, abstract, list of authors, and content adequate for selection of appropriate reviewers. Reviewers will not see the initial draft, however; only the final uploaded draft (8:00 AM Chicago time, November 23) will be sent to reviewers.


===============================================================
Every Language, Every Style: Globalizing the Science of Prosody
===============================================================
Call For Papers
===============================================================

 


Prosody is, as far as we know, a universal characteristic of human speech, founded on the cognitive processes of speech production and perception.  Adequate modeling of prosody has been shown to improve human-computer interface, to aid clinical diagnosis, and to improve the quality of second language instruction, among many other applications.

Speech Prosody 2010, the fifth international conference on speech prosody, invites papers addressing any aspect of the science and technology of prosody.  Speech Prosody is the only recurring international conference focused on prosody as an organizing principle for the social, psychological, linguistic, and technological aspects of spoken language.  Speech Prosody 2010 seeks, in particular, to discuss the universality of prosody.  To what extent can the observed scientific and technological benefits of prosodic modeling be ported to new languages, and to new styles of spoken language?  Toward this end, Speech Prosody 2010 especially welcomes papers that create or adapt models of prosody to languages, dialects, sociolects, and/or communicative situations that are inadequately addressed by the current state of the art.

=======
TOPICS
=======

Speech Prosody 2010 will include keynote presentations, oral sessions, and poster sessions covering topics including:

* Prosody of under-resourced languages and dialects
* Communicative situation and speaking style
* Dynamics of prosody: structures that adapt to new situations
* Phonology and phonetics of prosody
* Rhythm and duration
* Syntax, semantics, and pragmatics
* Meta-linguistic and para-linguistic communication
* Signal processing
* Automatic speech synthesis, recognition and understanding
* Prosody of sign language
* Prosody in face-to-face interaction: audiovisual modeling and analysis
* Prosodic aspects of speech and language pathology
* Prosody in language contact and second language acquisition
* Prosody and psycholinguistics
* Prosody in computational linguistics
* Voice quality, phonation, and vocal dynamics

====================
SUBMISSION OF PAPERS
====================

Prospective authors are invited to submit full-length, four-page papers, including figures and references, at http://speechprosody2010.org. All Speech Prosody papers will be handled and reviewed electronically.

===================
VENUE
===================

The Doubletree Hotel Magnificent Mile is located two blocks from North Michigan Avenue, and three blocks from Navy Pier, at the cultural center of Chicago.  The Windy City has been the center of American innovation since the mid nineteenth century, when a railway link connected Chicago to the west coast, civil engineers reversed the direction of the Chicago river, Chicago financiers invented commodity corn (maize), and the Great Chicago Fire destroyed almost every building in the city. The Magnificent Mile hosts scores of galleries and museums, and hundreds of world-class restaurants and boutiques.

===================
IMPORTANT DATES
===================

Submission of Papers (http://speechprosody2010.org): November 15, 2009
Notification of Acceptance:                                           December 15, 2009
Conference:                                                                    May 11-14, 2010

 

Back to Top

8-12 . (2010-05-24) CfP 4th INTERNATIONAL CONFERENCE ON LANGUAGE AND AUTOMATA THEORY AND APPLICATIONS (LATA 2010)

 4th INTERNATIONAL CONFERENCE ON LANGUAGE AND AUTOMATA THEORY AND APPLICATIONS (LATA 2010)

  

 Trier, Germany, May 24-28, 2010

 http://grammars.grlmc.com/LATA2010/

 *********************************************************************

 AIMS:

 LATA is a yearly conference in theoretical computer science and its applications. As linked to the International PhD School in Formal Languages and Applications that was developed at Rovira i Virgili University (the host of the previous three editions and co-organizer of this one) in the period 2002-2006, LATA 2010 will reserve significant room for young scholars at the beginning of their career. It will aim at attracting contributions from both classical theory fields and application areas (bioinformatics, systems biology, language technology, artificial intelligence, etc.).

 SCOPE:

 Topics of either theoretical or applied interest include, but are not limited to:

 - algebraic language theory

- algorithms on automata and words

- automata and logic

- automata for system analysis and programme verification

- automata, concurrency and Petri nets

- cellular automata

- combinatorics on words

- computability

- computational complexity

- computer linguistics

- data and image compression

- decidability questions on words and languages

- descriptional complexity

- DNA and other models of bio-inspired computing

- document engineering

- foundations of finite state technology

- fuzzy and rough languages

- grammars (Chomsky hierarchy, contextual, multidimensional, unification, categorial, etc.)

- grammars and automata architectures

- grammatical inference and algorithmic learning

- graphs and graph transformation

- language varieties and semigroups

- language-based cryptography

- language-theoretic foundations of artificial intelligence and artificial life

- neural networks

- parallel and regulated rewriting

- parsing

- pattern matching and pattern recognition

- patterns and codes

- power series

- quantum, chemical and optical computing

- semantics

- string and combinatorial issues in computational biology and bioinformatics

- symbolic dynamics

- term rewriting

- text algorithms

- text retrieval

- transducers

- trees, tree languages and tree machines

- weighted machines

 

 

 

STRUCTURE:

 LATA 2010 will consist of:

 - 3 invited talks

- 2 invited tutorials

- refereed contributions

- open sessions for discussion in specific subfields, on open problems, or on professional issues (if requested by the participants)

 INVITED SPEAKERS:

 John Brzozowski (Waterloo), Complexity in Convex Languages

Alexander Clark (London), Three Learnable Models for the Description of Language

Lauri Karttunen (Palo Alto), to be announced (tutorial)

Borivoj Melichar (Prague), Arbology: Trees and Pushdown Automata

Anca Muscholl (Bordeaux), Communicating Automata (tutorial)

 

ORGANIZING COMMITTEE:

 Adrian Horia Dediu (Tarragona)

Henning Fernau (Trier, co-chair)

Maria Gindorf (Trier)

Stefan Gulan (Trier)

Anna Kasprzik (Trier)

Carlos Martín-Vide (Brussels, co-chair)

Norbert Müller (Trier)

Bianca Truthe (Magdeburg)

 SUBMISSIONS:

 Authors are invited to submit papers presenting original and unpublished research. Papers should not exceed 12 single-spaced pages and should be formatted according to the standard format for Springer Verlag's LNCS series (see http://www.springer.com/computer/lncs/lncs+authors?SGWID=0-40209-0-0-0). Submissions have to be uploaded at:

 

http://www.easychair.org/conferences/?conf=lata2010

 PUBLICATIONS:

 A volume of proceedings published by Springer in the LNCS series will be available by the time of the conference.

 A special issue of the Journal of Computer and System Sciences (Elsevier) will be later published containing refereed extended versions of some of the papers contributed to the conference. Submissions to it will be only by invitation.

 A special issue of another major journal containing papers oriented to applications is under consideration.

 REGISTRATION:

 The period for registration will be open since September 1, 2009 until May 24, 2010. The registration form can be found at the website of the conference: http://grammars.grlmc.com/LATA2010/

 Early registration fees: 500 Euro

Early registration fees (PhD students): 400 Euro

Late registration fees: 530 Euro

Late registration fees (PhD students): 430 Euro

On-site registration fees: 550 Euro

On-site registration fees (PhD students): 450 Euro

 At least one author per paper should register. Papers that do not have a registered author by February 15, 2010 will be excluded from the proceedings.

 Fees comprise access to all sessions, one copy of the proceedings volume, coffee breaks, lunches, excursion, and conference dinner.

 PAYMENT:

 Early (resp. late) registration fees must be paid by bank transfer before February 15, 2010 (resp. May 14, 2010) to the conference series account at Open Bank (Plaza Manuel Gomez Moreno 2, 28020 Madrid, Spain): IBAN: ES1300730100510403506598 - Swift code: OPENESMMXXX (account holder: Carlos Martin-Vide & URV – LATA 2010).

 Please write the participant’s name in the subject of the bank form. Transfers should not involve any expense for the conference.

 On-site registration fees can be paid only in cash. A receipt for the payment will be provided on site.

Besides paying the registration fees, it is required to fill in the registration form at the website of the conference.

 

BEST PAPER AWARDS:

 An award will be offered to the authors of the two best papers accepted to the conference. Only papers fully authored by PhD students are eligible. The award intends to cover their travel expenses.

 IMPORTANT DATES:

 Paper submission: December 3, 2009

Notification of paper acceptance or rejection: January 21, 2010

Final version of the paper for the LNCS proceedings: February 3, 2010

Early registration: February 15, 2010

Late registration: May 14, 2010

Starting of the conference: May 24, 2010

Submission to the post-conference special issue(s): August 27, 2010

 FURTHER INFORMATION:

 gindorf-ti@informatik.uni-trier.de

 CONTACT:

 LATA 2010

Universität Trier

Fachbereich IV – Informatik

Campus II, Behringstraße

D-54286 Trier

 Phone: +49-(0)651-201-2836

Fax: +49-(0)651-201-3954

Back to Top

8-13 . (2010-06-01) NAACL-HLT-10: Call for Tutorial Proposals

NAACL-HLT-10: Call for Tutorial Proposals

 

Proposals are invited for the Tutorial Program of the North American Chapter of the Association for Computational Linguistics - Human Language Technologies (NAACL HLT) 2010 Conference. The conference is to be held from June 1 to 6, 2009 in Los Angeles, California. The tutorials will be held on Tuesday,  June 1.

 

We seek proposals for half-day (or exceptionally full-day) tutorials on all topics in computational linguistics, speech processing, information extraction and retrieval, and natural language processing, including their theoretical foundations, algorithms, intersections, and applications. Tutorials will normally move quickly, but they are expected to be accessible, understandable, and of interest to a broad community of researchers.

 

Information on the tutorial instructor payment policy can be found at http://aclweb.org/adminwiki/index.php?title=Policy_on_tutorial_teacher_payment 

 

PLEASE NOTE: Remuneration for Tutorial presenters is fixed according to the above policy and does not cover registration fees for the main conference.

 

SUBMISSION DETAILS

 

Proposals for tutorials should contain:

  1. A title and brief description of the tutorial content and its relevance to the NAACL-HLT community (not more than 2 pages).
  2. A brief outline of the tutorial structure showing that the tutorial's core content can be covered in a three-hour slot (including a coffee break). In exceptional cases six-hour tutorial slots are available as well.
  3. The names, postal addresses, phone numbers, and email addresses of the tutorial instructors, including a one-paragraph statement of their research interests and areas of expertise.
  4. A list of previous venues and approximate audience sizes, if the same or a similar tutorial has been given elsewhere; otherwise an estimate of the audience size.
  5. A description of special requirements for technical equipment (e.g., internet access).

Proposals should be submitted by electronic mail, in plain ASCII text, no later than January 15, 2010 to tutorials.hlt10@gmail.com. The subject line should be: "NAACL HLT 2010: TUTORIAL PROPOSAL".

 

PLEASE NOTE: 

  1. Proposals will not be accepted by regular mail or fax, only by email to: tutorials.hlt10@gmail.com.
  2. You will receive an email confirmation from us within 24 hours that your proposal has been received.

TUTORIAL SPEAKER RESPONSIBILITIES

 

Accepted tutorial speakers will be notified by February 1, 2010, and must then provide abstracts of their tutorials for inclusion in the conference registration material by March 1, 2010. The description should be in two formats: an ASCII version that can be included in email announcements and published on the conference web site, and a PDF version for inclusion in the electronic proceedings (detailed instructions will be given). Tutorial speakers must provide tutorial materials, at least containing copies of the course slides as well as a bibliography for the material covered in the tutorial, by April 15, 2010.

 

IMPORTANT DATES

  • Submission deadline for tutorial proposals: January 15, 2010
  • Notification of acceptance: February 1, 2010
  • Tutorial descriptions due: March 1, 2010
  • Tutorial course material due: April 15, 2010
  • Tutorial date: June 1, 2010

TUTORIALS CO-CHAIRS

  • Jason Baldridge, The University of Texas at Austin
  • Peter Clark, The Boeing Company
  • Gokhan Tur, SRI International

Please send inquiries concerning NAACL-HLT-10 tutorials to tutorials.hlt10@gmail.com

 

Back to Top

8-14 . (2010-06-05) CfP NAACL HLT 2010, ACL 2010 and COLING 2010

 
NAACL HLT 2010, ACL 2010 and COLING 2010
                             JOINT CALL FOR WORKSHOP PROPOSALS
                      
http://people.ict.usc.edu/~traum/naacl10-workshops/cfwp
 
                           * * * Proposal deadline: Oct 30, 2009 * * *
 
The Association for Computational Linguistics (ACL) and the
International Committee on Computational Linguistics (ICCL) invite
proposals for workshops to be held in conjunction with one of the three
2010 flagship conferences in computational linguistics: NAACL HLT 2010,
ACL 2010 and COLING 2010. We solicit proposals on any topic of interest
to the ACL/ICCL community. Workshops will be held at one of the
following conference venues:
 
    * NAACL HLT 2010 is the 11th annual meeting of the North American
chapter of the Association for Computational Linguistics. It will be
held in Los Angeles, June 1-6, 2010. The dates for the NAACL HLT
workshops will be June 5-6. The webpage for NAACL HLT 2010 is:
http://naaclhlt2010.isi.edu/.
    * The 48th annual meeting of the ACL (ACL 2010) will be held in
Uppsala, July 11-16, 2010. The ACL workshops will be held July 15-16.
The webpage for ACL 2010 is http://acl2010.org/.
    * The 23rd International Conference on Computational Linguistics
(COLING 2010) will be held in Beijing, August 23-27, 2010. There will be
pre-conference workshops on August 21-22, and post-conference workshops
on August 28. The webpage for the conference is:
http://www.coling-2010.org/.
 
SUBMISSION INFORMATION
 
As in 2009, we will coordinate the submission and reviewing of workshop
proposals for all three ACL/ICCL 2010 conferences.
 
Proposals for workshops should contain:
 
    * A title and brief (2-page max) description of the workshop topic
and content.
    * The desired workshop length (one or two days), and an estimate of
the audience size.
    * The names, postal addresses, phone numbers, and email addresses of
the organizers, with one-paragraph statements of their research
interests and areas of expertise.
    * A list of potential members of the program committee, with an
indication of which members have already agreed.
    * A description of any shared tasks associated with the workshop.
    * A description of special requirements for technical needs.
    * A venue preference specification.
 
The venue preference specification should list the venues at which the
organizers would be willing to present the workshop (NAACL HLT, ACL, or
COLING). A proposal may specify one, two, or three acceptable workshop
venues; if more than one venue is acceptable, the venues should be
preference-ordered. There will be a single workshop committee,
coordinated by the three sets of workshop chairs. This single committee
will review the quality of the workshop proposals. Once the reviews are
complete, the workshop chairs will work together to assign workshops to
each of the three conferences, taking into account the location
preferences given by the proposers.
 
The ACL has a set of policies on workshops. You can find the ACL's
general policies on workshops at
http://www.cis.udel.edu/~carberry/ACL/Workshops/workshop-support-general-policy.html,
the financial policy for workshops at
http://www.cis.udel.edu/~carberry/ACL/Workshops/workshop-conf-financial-policy.html,
and the financial policy for SIG workshops at
http://www.cis.udel.edu/~carberry/ACL/Workshops/workshops-Sig-financial-policy.html.
 
This year we will be using the START system for submission and reviewing
of workshop proposals. Please submit proposals to
https://www.softconf.com/a/ACLWorkshops2010 no later than 12 Midnight,
Pacific Standard Time, October 30, 2009.
 
Notification of acceptance of workshop proposals will occur no later
than November 20, 2009. Since the three ACL/ICCL conferences will occur
at different times, the timescales for the submission and reviewing of
workshop papers, and the preparation of camera-ready copies, will be
different for the three conferences. Suggested timescales for each of
the conferences are given below.
 
TIMELINES FOR 2010 WORKSHOPS
 
Oct 30, 2009     Workshop proposal deadline
Nov 20, 2009     Notification of acceptance
   
NAACL 2010    
Dec 18, 2009: Proposed workshop CFP
Mar 1, 2010: Proposed paper due date
Mar 30, 2010: Proposed notification of acceptance
Jun 5-6, 2010: Workshops
   
ACL 2010    
Jan 18, 2010: Proposed workshop CFP
Apr 5, 2010: Proposed paper due date
May 6, 2010: Proposed notification of acceptance
Jul 15-16, 2010: Workshops
   
COLING 2010    
Feb 25, 2010: Proposed workshop CFP
May 30, 2010: Proposed paper due date
Jun 30, 2010: Proposed notification of acceptance     
Aug 21-22, 2010: Pre-conference workshops
Aug 28, 2010: Post-conference workshops
 
Workshop Co-Chairs
 
    * Richard Sproat, NAACL, Oregon Health & Science University
    * David Traum, NAACL, University of Southern California
    * Pushpak Bhattacharyya, ACL, Indian Institute of Technology, Bombay
    * David Weir, ACL, University of Sussex
    * Noah Smith, COLING, Carnegie Mellon University
    * Takenobu Tokunaga, COLING, Tokyo Institute of Technology
    * Haifeng Wang, COLING, Toshiba (China) Research and Development Center
 

For inquiries, send email to: cl.workshops.2010@gmail.com

Back to Top

8-15 . (2010-07-11) 48th Annual Meeting of the Association for Computational Linguistics

48th Annual Meeting of the Association for Computational Linguistics

                       July 11-16, 2010

                       Uppsala, Sweden

 

* * * Paper Submission Deadline: February 15, 2010 * * *

 

The Annual Meeting of the Association for Computational Linguistics

(ACL) is the flagship conference for research on language and

computation. The 48th Annual Meeting of the ACL (ACL 2010) seeks

submission of papers on original and unpublished research in all areas

of computational linguistics, broadly conceived to include areas such

as psycholinguistics, speech, information retrieval, multimodal

language processing, and language issues in emerging domains such as

bioinformatics. In addition, we want to stress that both theoretical,

as well as practical and empirical papers, are sought for the

conference.

 

ACL 2010 has the goal of a broad technical program. Thus, ACL 2010

invites papers in the following categories:

   Research papers

     - theoretical computational linguistics

     - empirical/data-driven approaches

     - paradigms/techniques/strategies

     - resources and evaluation

     - applications/systems

     - negative result (report of a sensible experiment or approach

                        that failed to achieve the desired results)

   Survey papers (new emerging area, field relevant to computational

                  linguistics, etc.)

   Position papers (we are particularly soliciting papers co-authored

                    by two individuals with opposing positions, though

                    single-authored papers are welcome)

   Challenge papers (a challenge to the field in terms of setting out

                     a goal for the next 5/10/20 years)

 

The above categories include types of papers that have not typically

been part of the ACL conference program. Since the appropriate

criteria for evaluating papers is not identical for the above

categories (and subcategories), ACL 2010 will use a different review

form for each category of paper, with the review form tailored to the

type of submission. For example, the review criteria for an

applications/systems research paper will include whether a substantive

evaluation or user experiments are reported and whether a demo will be

available at the conference, whereas the review form for a theoretical

computational linguistics research paper will include a different set

of review criteria. The review forms will be available on the

conference web site at least 3 months prior to the submission

deadline. At the time of submission, authors will be asked to

designate the category under which they believe that their paper

should be evaluated. However, the program committee chairs reserve the

right to change the selected category if they feel that the submission

falls into a different category of paper.

 

If you are unsure about whether your submission is appropriate for ACL

2010 please email the program chairs at program@acl2010.org.

 

LONG VERSUS SHORT PAPERS:

The submission deadlines for long and for short papers are identical.

Long papers are appropriate for:

1) reporting substantial, completed, and previously unpublished research;

2) presenting a survey of a subfield that would be of interest to

   computational linguists;

3) a two-author position paper in which the co-authors take

   opposing positions.

Short papers typically constitute more focused contributions. Thus

they are appropriate for:

1) reporting smaller experiments;

2) describing work-in-progress;

3) single-author position papers;

4) challenge papers;

5) descriptions of new language resources or evaluation methodologies

   (although these could be long papers);

6) presenting negative results.

 

Long papers will be allocated 8 pages of content in the

conference proceedings, and short papers will be allocated 4 pages

of content. Both long and short papers may have any number of

pages consisting solely of references. Long papers will generally be

presented as 20-minute talks plus questions (although authors will be

given the option of instead selecting a poster presentation or a

10-minute oral presentation followed by a poster); short papers will

be presented either as a poster or as a 10-minute talk followed by a

poster session. There will be no distinction in the conference

proceedings between papers that are assigned different presentation

modes (such as oral versus poster).

 

IMPORTANT DATES:

Feb 15, 2010 - Paper submissions due (both long and short papers);

July 11-16, 2010 - ACL 2010

 

TOPICS OF INTEREST:

Topics include, but are not limited to:

* Discourse, dialogue, and pragmatics

* Grammar engineering

* Information extraction

* Information retrieval

* Knowledge acquisition

* Large scale language processing

* Language generation

* Language processing in domains such as bioinformatics, legal, medical, etc.

* Language resources, evaluation methods and metrics, science of annotation

* Lexical/ontological/formal semantics

* Machine translation

* Mathematical linguistics, grammatical formalisms

* Mining from textual and spoken language data

* Multilingual language processing

* Multimodal language processing (including speech, gestures, and other

                                  communication media)

* NLP applications and systems

* NLP on noisy unstructured text, such as emails, blogs, sms

* Phonology/morphology, tagging and chunking, word segmentation

* Psycholinguistics

* Question answering

* Semantic role labeling

* Sentiment analysis and opinion mining

* Spoken language processing

* Statistical and machine learning methods

* Summarization

* Syntax, parsing, grammar induction

* Text mining

* Textual entailment and paraphrasing

* Topic and text classification

* Word sense disambiguation

 

SUBMISSIONS:

The deadline for both long and short papers is February 15, 2010.

Submission will be electronic in PDF format through the conference

website.

 

Long papers may consist of up to 8 pages of content (excluding

references), and short papers may consist of up to 4 pages of content

(excluding references). Both long and short papers may include any

additional number of pages consisting solely of references. Both long

and short paper submissions should follow the two-column format of ACL

2010 proceedings. We strongly recommend the use of ACL LaTeX style

files or Microsoft Word Style files tailored for this year's

conference, which will be available on the conference website. All

submissions must conform to the official ACL 2010 style guidelines to

be announced on the conference website.

 

Reviewing of papers will be double-blind. Therefore, the paper must

not include the authors' names and affiliations. Furthermore,

self-references that reveal the author's identity, e.g., "We

previously showed (Smith, 1991) ...", must be avoided. Instead, use

citations such as "Smith (1991) previously showed ...". Papers that do

not conform to these requirements will be rejected without review.

 

MULTIPLE SUBMISSION POLICY:

ACL 2010 will not accept for publication or presentation work that

will be (or has been) published at other meetings or in other

publications. However, papers that have been or will be submitted

elsewhere may be submitted to ACL 2010 provided that this fact is

stated at the time of submission. If the paper is accepted by both

ACL 2010 and another meeting or publication, it must be withdrawn

from one of them; furthermore, its authors must notify the program

chairs, within two days of receiving the ACL acceptance notification,

indicating which meeting or publication they choose for presentation

of their work.

 

MENTORING SERVICE: ACL is providing a mentoring (coaching) service for

authors from regions of the world where English is less emphasized as

a language of scientific exchange. Many authors from these regions,

although able to read the scientific literature in English, have

little or no experience in writing papers in English for conferences

such as the ACL meetings. If you would like to take advantage of the

service, please upload your paper in PDF format by January 1, 2010

using the paper submission software for the mentoring service which

will be available at the conference website.

 

PROGRAM COMMITTEE CO-CHAIRS:

   Sandra Carberry, University of Delaware, USA

   Stephen Clark, University of Cambridge, UK

Back to Top

8-16 . (2010-07-11) CfP ACL 2010 Student Research Workshop

ACL 2010 Student Research Workshop
                  July 11-16, 2010
                  Uppsala, Sweden
             http://acl2010.org/srw.html
 
* * * Paper submission deadline: February 15, 2010 * * *
 
General Invitation for Submissions
 
The Student Research Workshop is an established tradition at ACL
conferences. The workshop provides a venue for student researchers
investigating topics in Computational Linguistics and Natural Language
Processing to present their work and receive feedback. Participants will
have the opportunity to receive feedback from a general audience as well
as from panelists; the panelists are experienced researchers who will
prepare in-depth comments and questions in advance of the presentation.
 
We would like to invite student researchers to submit their work to the
workshop. Since this workshop is an excellent opportunity to ask for
suggestions, to receive useful feedback and to run your ideas by an
international audience of researchers, the emphasis of the workshop will
be on work in progress. The research being presented can come from any
topic area within computational linguistics including, but not limited
to, the following topic areas:
 
    * pragmatics, discourse, semantics, syntax and the lexicon
    * phonetics, phonology and morphology
    * linguistic, mathematical and psychological models of language
    * information retrieval, information extraction, question answering
    * summarization and paraphrasing
    * language generation
    * speech recognition, speech synthesis
    * corpus-based language modeling
    * multi-lingual processing, machine translation, translation aids
    * spoken and written natural language interfaces, dialogue systems
    * multi-modal language processing, multimedia systems
    * message and narrative understanding system
 
Submission Requirements
 
The emphasis of the workshop is on original and unpublished research.
The papers should describe original work in progress. Students who have
settled on their thesis direction but still have significant research
left to do are particularly encouraged to submit their papers.
 
Since the main purpose of presenting at the workshop is to exchange
ideas with other researchers and to receive helpful feedback for further
development of the work, papers should clearly indicate directions for
future research wherever appropriate. All authors of multi-author papers
MUST be students. Papers submitted for this workshop are eligible only
if they have not been presented at any other meeting with publicly
available published proceedings. Students who have already presented at
an ACL/EACL/NAACL Student Research Workshop may not submit to this
workshop. They should submit their papers to the main conference
instead. It must be indicated if a paper has been submitted to another
conference or workshop.
 
Submission Format
 
Paper submissions should follow the two-column format of ACL 2010
proceedings without exceeding six (6) pages, including references. We
strongly recommend the use of ACL LaTeX style files or Microsoft Word
style files tailored for this year's conference. These files will be
available soon from this page. Submissions must conform to the official
ACL 2010 style guidelines and they must be electronic in PDF. Please use
the submission system to submit your paper.
 
Reviewing Procedure
 
Reviewing of papers submitted to the Student Research Workshop will be
managed by the Student Workshop Co-Chairs, with the assistance of a team
of reviewers. Each submission will be matched with a mixed panel of
student and senior researchers for review. The final acceptance decision
will be based on the results of the review. As the reviewing will be
blind, the paper must not include the authors' names and affiliations.
Furthermore, self-references that reveal the author's identity, e.g.,
"We previously showed (Smith, 1991) ...", must be avoided. Instead, use
citations such as "Smith previously showed (Smith, 1991) ...". Papers
that do not conform to these requirements will be rejected without review.
 
Important Dates:
 
  Paper submission deadline: February 15, 2010
  Notification of acceptance: April 12, 2010
  Camera-ready submission deadline: May 10, 2010
  Conference dates: July 11-16, 2010 (The workshop will be held during
the main conference)
 
Funding for Travel
 
There will be funding to assist student participants with travel and
conference expenses. This information will be available soon from the
workshop webpage.
 
Contact Information
 
If you need to contact the Co-Chairs of the Student Workshop, please
use: srw@acl2010.org. An e-mail sent to this address will be forwarded
to all Co-Chairs.
 
Tomek Strzalkowski (Faculty Advisor) University at Albany, SUNY, USA
Seniz Demir (Co-Chair) University of Delaware, USA
Jan Raab (Co-Chair) Charles University in Prague, Czech Republic

Nils Reiter (Co-Chair) Heidelberg University, Germany

Back to Top

8-17 . (2010-09-06) Third International Workshop on Perceptual Quality of Systems (PQS) Berlin GE

 

On behalf of the organizing committee of the Third International Workshop on Perceptual Quality of Systems (PQS), we would like to invite you to attend this exciting event that will happen in Dresden, September 6-8, 2010.

 

Third International Workshop on Perceptual Quality of Systems is a direct successor to the successful workshop series inaugurated in Herne in 2003, and in Berlin in 2006.

 

The quality of systems which address human perception is difficult to describe. An engineering approach to quality includes the consideration of how a system is perceived by its users, and how the needs and expectations of the users develop. Thus, quality assessment and prediction have to take the relevant human perception and judgement factors into account.

 

The workshop is intended to provide an interdisciplinary exchange of ideas between both academic and industrial researchers working on different aspects of perceptual quality of systems. Papers are invited which refer to methodological aspects of quality and usability assessment and evaluation, the underlying perception and judgment processes, as well as to particular technologies, systems or services.

 

Contributions are welcomed in (but not limited to) the following areas (please note that in all of these areas both theoretical and empirical approaches are encouraged):

 

Methodologies and Methods of Quality Assessment and Evaluation

Metrology: Test Design and Scaling

Quality of Speech and Music

Quality of Multimodal Perception

Perceptual Quality vs. Usability

Semio-Acoustics and -Perception

Quality and Usability of

  • Speech Technology Devices
  • Telecommunication Systems and Services
  • Multi-Modal User Interfaces
  • Virtual Reality

Product-Sound Quality

 

The workshop will take place in the “Bischof Bennohaus”, a villa-like conference center located in Bautzen, near Dresden, Germany. All participants will be accommodated in this center. Accomodation and meals are included in the workshop fees. A shuttle bus service will collect participants from downtown Dresden and transport them to the workshop venue.

 

We invite your papers, posters and demonstrations on these topics, and look forward to seeing you in Dresden in September 2010.

 

Ute Jekosch & Ercan Altinsoy

Chair of Communication Acoustics, Dresden University of Technology

 

Sebastian Möller & Alexander Raake

Deutsche Telekom Laboratories, Berlin University of Technology

 

Important Dates

April 16, 2010          : Abstract submission (approx. 400 words)

May 16, 2010          : Notification of acceptance

June 16, 2010                   : Submission of the camera-ready-paper (max. 6 pages)

September 6-8, 2010         : Workshop

 

Contact and more information:

 

Ute Jekosch

 

Chair of Communication Acoustics

Dresden University of Technology

Helmholtzstr. 18

01062 Dresden, Germany

Email: Ute.Jekosch@tu-dresden.de

Phone: +49 351 463 344 63

Back to Top

8-18 . (2010-09-27) Summer School Cognitive and Physical Models of Speech Production, Speech Perception and Production-Perception Interaction” Berlin

Announcement

 

Summer School CPMSP2 - 2010

“Cognitive and Physical Models of Speech Production, Speech Perception and Production-Perception Interaction”

Part III: Planning and Dynamics

 

Berlin – September 27-October 1, 2010

 

After two successful editions in Lubmin (2004) and Autrans (2007), we are pleased to announce the 3rd International CPMSP2 Summer School on “Cognitive and Physical Models of Speech Production, Speech Perception and Production-Perception Interaction”. The summer school will be held in Berlin from the 27th of September to the 1st of October 2010.

 

The focus of this summer school will be the planning of speech sequences and its interactions with dynamical properties of speech production and speech perception. It will be organized around 9 tutorials addressing related issues from the linguistic, neurophysiologic, motor control, and perception perspectives. The following invited speakers have accepted to present these tutorials:

§ Rachine Ridouane – LPP – Paris: Units in speech planning

§ Pierre Hallé – LPP – Paris: Units in speech acquisition

§ Noël Nguyen – LPL – Aix-en-Provence: The dynamical approach to speech perception

§ Linda Wheeldom – University of Birmingham: Phonological monitoring in the production of spoken sequences

§ Jelena Krivokapic – Yale University: Prosodic planning in spoken sequences

§ Paul Cisek – Département de Physiologie – Université de Montréal: Human movement planning and control

§ Marianne Pouplier – IPS – München: Dynamical coupling of intergestural planning

§ Pascal Perrier – Gipsa-lab – Grenoble: Gesture planning integrating dynamical constraints and related issues in speech motor control

§ Peter Dominey – SCBRI - Lyon : Sensorimotor interactions and the construction of speech sequences

 

The summer school is open to all students, postdocs and researchers. Its aim is to provide a platform for interchanges between students, junior and senior researchers by means of poster presentations, discussion forums and working groups, related to the topics addressed in the tutorials. Further information and conditions for participation can be found at http://summerschool2010.danielpape.info/

 

Dates

Applications with one page abstract :     April 6, 2010

Notification of Acceptance:                    May 31, 2010

Conference:                                          September 27 – October 1, 2010

 

We are looking forward to seeing you there!

 

Organisers:

Susanne Fuchs (ZAS Berlin)

Melanie Weirich (ZAS Berlin)

Daniel Pape (IEETA, University of Aveiro, Aveiro)

Pascal Perrier (GIPSA-lab, Grenoble INP, Grenoble)

 

Contact: berlin.dynamics@gmail.com

Back to Top

8-19 . (2010-11-29) International Symposium on Chinese Spoken Language Processing ISCSLP

CALL FOR PAPERS

2010 International Symposium on Chinese Spoken Language Processing (ISCSLP 2010)
November 29 – December 3, 2010  -  Tainan and Sun Moon Lake, Taiwan
 

===========================================================
 
ISCSLP is the flagship conference of ISCA SIG-CSLP (International Speech Communication Association, Special Interest Group on Chinese Spoken Language Processing).
ISCSLP2010 will be held during November 29 – December 3, 2010 in Tainan and Sun Moon Lake, Taiwan, and hosted by National Cheng Kung University.
Tainan, located in south of Taiwan, is the cultural place of origin. There are a lot of historical sites and antique places. Therefore, Tainan has another name called the ancient capital. In addition to these historical sites, Tainan is a modern city with various shopping centers, department stores, and night markets. You can taste all of Taiwanese traditional food, e.g. shrimp roll, tofu pudding, coffin, oyster omelet, rice tube pudding, Taiwanese meatballs in Tainan . Don't miss this chance to experience these toothsome snacks when you visit Tainan .
 
Sun Moon Lake, situated in Nantou County's Yuchih Township, in the center of Taiwan, is the island's largest lake. It is a beautiful alpine lake, divided by the tiny Lalu Island; the eastern part of the lake is round like the sun and the western side is shaped like a crescent moon, hence the name "Sun Moon Lake". Its crystalline, emerald green waters reflect the hills and mountains which rise on all sides. Natural beauty is enhanced by numerous cultural and historical sites. Well-known both at home and abroad, the Sun Moon Lake Scenic Area has exceptional potential for further growth and recognition as a prime tourism destination. When you come to Sun Moon Lake, slow down your hurried travels, stay, and relax for a few days. Here, the beauty of nature will make you want to keep coming back!
We invite your participation in this premier conference, where the language from ancient civilizations embraces modern computing technology. ISCSLP2010 will feature world-renowned plenary speakers, tutorials, exhibits, and a number of lecture and poster sessions on the following topics:
 
Speech Production and Perception
Phonetics and Phonology
Speech Analysis
Speech Coding
Speech Enhancement
Speech Recognition
Speech Synthesis
Language Modeling and Spoken Language Understanding
Spoken Dialog Systems
Spoken Language Translation
Speaker and Language Recognition
Computer-Assisted Language Learning
Indexing, Retrieval and Authoring of Speech Signals
Multi-Modal Interface including Spoken Language Processing
Spoken Language Resources and Technology Evaluation
Applications of Spoken Language Processing Technology 
 
Official Language & Publication
The official language of ISCSLP is English.
All papers accepted will be included in IEEE Xplore and indexed by EI Compendex.
 
Paper Submission
Authors are invited to submit original, unpublished work in English.
Papers should be submitted via
http://conf.ncku.edu.tw/iscslp2010/paper.htm
Each submission will be reviewed by two or more reviewers.
At least one author of each paper is required to register. 
 
Important Dates
Full paper submission by July 15, 2010
Notification of acceptance by Aug. 30, 2010
Camera ready papers by Sep. 13, 2010
Registration to cover an accepted paper by Oct.13, 2
Back to Top