|
Editorial
Dear Members,
It is my pleasure to draw the attention of my young colleagues to the large number of open positions in speech processing offered by
academic labs as well as industry. Speech processing is becoming mature and industry needs skilled persons in speech technology
and
linguistics as well as excellent experts in programming on-board real time applications.
However research remains crucial since to widen the application field we need increased robustness to noise and to intrinsic
variabilities.
ISCA organizes a large number of conferences, ITR Workshops and supports most of the essential conferences on speech processing.
Intense activity is devoted now to the organization of Interspeech 2007 in Antwerp. We hope to meet most of you at this
annual major meeting of speech scientists.
If you authored a recent book, do not forget to inform me: ISCA is pleased to advertise it in ISCApad.
Christian Wellekens
TABLE OF CONTENTS
- ISCA News
- SIG's activities
- Courses, internships
- Books, databases,
softwares
- Job openings
- Journals
- Future Interspeech Conferences
- Future ISCA Tutorial and Research Workshops (ITRW)
- Forthcoming Events supported (but not organized) by ISCA
- Future Speech Science and technology events
ISCA NEWS
top
Prix de these AFCP
Association Francophone de la Communication Parlee
Cloture definitive le 21 juin 2007
L'Association Francophone de la Communication Parlee (AFCP)
decerne chaque annee un prix scientifique pour une excellente these du
domaine. L'AFCP souhaite ainsi promouvoir toutes les facettes de la
recherche en communication parlee : des travaux fondamentaux aux travaux
appliques, du domaine des STIC, SHS ou SDV. L'objectif de ce prix est de
dynamiser les jeunes chercheurs, tout en faisant connaitre leurs travaux.
Le jury est compose des chercheurs elus du CA de l'AFCP.
- R. Ridouane est laureat du prix 2004 pour:
"Suite de consonnes en berbere: phonetique et phonologie",
- M. Dohen en 2005 pour: "Deixis prosodique multisensorielle: production
et perception audiovisuelle de la focalisation contrastive en francais"
La remise officielle du prix se fera lors de la rencontre francophone
"Les Journees d'Etudes sur la Parole" (JEP) 2008 (Avignon).
Chaque recipiendaire se verra remettre la somme de 500 euros, et sera
invite a resumer ses travaux lors d'un expose.
CALENDRIER
Peut candidater tout docteur ayant soutenu sa these entre le 1er octobre
2005 et le 31 decembre 2006. On ne peut candidater qu'a une seule edition.
Candidature avant le 21 juin 2007 (depot de la these & envoi postal).
Resultats : mi juillet 2007.
DOSSIER DE CANDIDATURE
serveur AFCP des theses qui regroupe la plupart des theses
francophones du domaine
2/ Postez un CD a:
Herve' Glotin Prix AFCP UMR CNRS LSIS
Univ. Sud Toulon Var, BP20132
83957 La Garde Cedex 20 - France
contenant un seul fichier (votre nom.pdf) avec dans l'ordre:
* resume de votre these (2 pages),
* liste de vos publications,
* tous les rapports (jury et rapporteus) scannes de votre soutenance de
these,
* une lettre de recommandation scannee de votre dir. de these pour ce prix,
* votre CV (avec coord. completes dont Email).
SIG's activities A list of Speech Interest
Groups can be found on our
web.
top
COURSES, INTERNSHIPS
Ecole Recherche Multimodale d'Information
Spéciale Campagnes Technolangue, Technovision, NIST et TREC.
Website
du 4 sept au 6 sept 2007
Presqu'île de Giens - Var
Clôture des inscriptions: 15 juin 2007
Avec les soutiens du LSIS, Conseil Régional PACA, Univ. Sud Toulon Var
et l'Association Francophone de la Commmunication Parlée (AFCP).
L'Ecole Recherche Multimodale d'Information 2007 porte sur les méthodes
et performances des systèmes de recherche d'information inter-modalités,
en s'appuyant sur les campagnes d'évaluation Technolangue (parole),
Technovision (image), NIST et TREC. Des chercheurs y ayant participé
présentent les bases communes entre ces systèmes, et jetent des ponts
entre les différentes disciplines. Ces spécialistes d'analyses conjointes
textes, images, sons ou vidéos interviennent sur 3 jours avec discussions
et démonstrations ouvertes. L'originalité d'ERMITES est d'insister sur les
analyses conjointes de modalités, montrant l'intérêt de sortir d'un
pré-carré spécifique. ERMITES permet à faible coût de se familiariser
avec plusieurs disciplines sur cette problématique commune. Le site
est le superbe VVF de La Badine - Presqu'île de Giens-Var, (accès TGV
Toulon) en face du parc naturel de Port-Cros. Cette fin de saison
y est très agréable, détente: plages ou catamarans au pied du VVF.
Programme
- Bengio Samy - Research scientist, GOOGLE USA.
"Apprentissage automatique pour le traitement et recherche d'information
dans les séquences multimodales".
- Haton Jean-Paul- Pr. LORIA, Membre Institut Univ. de France.
"Reconnaissance de forme et analyse de scènes".
- Gallinari Patrick - Pr. LIP6, UMR CNRS.
"Méthodes statistiques pour l'apprentissage de structure".
- Bonastre Jean-François - LIA, Membre Institut Univ. de France.
"Reconnaissance du locuteur / indexation de documents audio".
- Moellic Pierre-Alain - CEA LIST Paris.
"Analyses d'images: dernières avancées à l'échelle au travers
d'ImagEval/Technolangues".
- Merialdo Bernard - Pr. EURECOM.
"Indexation Multimédia".
- Gravier Guillaume - CNRS IRISA.
"Reconnaissance Automatique de la Parole et applications".
- Paugam-Moisy Hélène - Pr. LIRIS UMR CNRS.
"Mémoires associatives multimodales et temporelles".
- Glotin Hervé - LSIS UMR CNRS.
"Recherche robuste d'information dans des scènes acoustiques ou visuelles".
- Kermorvant Christopher - PhD R&D,A2IA Paris.
"Analyse d'image et Intelligence Artificielle".
- Mulhem Philippe - CNRS IMAG.
"Indexation et recherche sémantique d'images".
- Farinas Jerôme - IRIT UMR CNRS.
"Identification & classification automatique de langues".
Inscription et site
le forfait inclut les actes, l'hébergement (chambre individuelle
avec SdB) et la restauration vue sur mer au VVF.
Inscriptions sur le site web
Clôture des 16 inscriptions: 15 juin 2007.
** 2 BOURSES de 150 euros sont offertes par l'AFCP **
(faire la demande lors de votre inscription).
E-mail
Organisateurs : H. Glotin avec J. Le Maitre, LSIS UMR CNRS 6168.
ELSNET Summer School Belfast 2007
Advanced Dialogue Systems: Affectivity, Adaptability and Multimodality
16 - 27th July 2007
This year's ELSNET Summer School will be held in Belfast, Northern Ireland. It is being hosted by Queen's University at its pleasant
tree-lined campus near Belfast's buzzing city centre.
ELSNET Summer School Belfast is being organised by Computer Science at Queen's University in association with Computing
and Mathematics at the University of Ulster.
The 2007 Summer School focuses on dialogue systems - covering everything from basic prompt and response systems
to systems that adapt to the user's level of experience and even the user's emotional state. And the school includes guidance
for assessing how well implemented systems are actually working.
Bringing together a teaching team of world experts, the school will cover industry-standard technologies, hosting
environments and markup languages for building robust speech-based and multimodal dialogue systems. Alongside
practical strands that teach attendees how to set about building both simple and more complex dialogue systems the school
includes extensive coverage of the latest trends in dialogue development as viewed by academic and industrial dialogue specialists.
At the leading edge of dialogue system development the school considers approaches to emotion-enablement - from analysis
of real-world emotionally coloured interactions to ways of conveying affect through the use of computer-generated embodied
conversational agents.
In addition to the 2-week schedule of lectures and practicals the summer school will be complemented by a social
programme of events and recommended excursions.
Summer School Web Page.
Master in Human Language Technologies and Interfaces at the University of Trento
Website
organized by: University of Trento and Fondazione Bruno Kessler Irst
Call for applications, Academic Year 2007/08
Goal
Human language technology gives people the possibility of using speech
and/or natural language to access a variety of automated services, such as
airline reservation systems or voicemail, to access and communicate
information across different languages, and to keep under control the
increasing amount of information available by automatically extracting
useful content and summarizing it. This master aims at providing skills in
the basic theories, techniques, and applications of this technology through
courses taught by internationally recognized researchers from the
university, research centers and supporting industry partners. Students
enrolled in the master will gain in depth knowledge from graduate courses
and from substantial practical projects carried out in research and industry
labs.
Courses: Speech Processing, Machine Learning for NLP, Human Language, Text
Processing, Spoken Dialog Systems,Human Computer Interaction, Language
Resources, Multilingual Technology
Requisites
Master degree level ( min 4 years) in the area of computer science,
electrical engineering, computational linguistics and cognitive science and
other related disciplines.
English language (official language)
Student Grants
A limited number of fellowships will be available.
Application Deadline
Non EU Students: June, 15
EU Students: end of July
Info
E-mail
University of Trento-Department of Information and Communication
Technologies Via Sommarive, 14-38100 Povo (Trento), Italy
Summer school: Cognitive and physical models of speech production, perception, and perception-production interaction.
Part II : Brain and Speech
Autrans, France
September 16-21, 2007
After the success of the previous summer school held in Lubmin (Germany) 2004, we are happy to announce the second international summer school on Cognitive
and physical models of speech production, perception, and perception-production interaction. This year we will pay special attention to the brain.
The aim of this summer school is to relate fundamental knowledge on speech production and perception to insights about the organization and function of the brain.
Tutorials will be presented by specialists in these domains.
This summer school is intended mainly for graduate students, postdoctoral fellows, and researchers who work in the fields of speech production, perception,
perception-production interaction, and the brain (neurolinguistics). Potential topics are:
Speech and language acquisition
Speech and language disorders
Neural basis of speech production
Speech production control
Neural basis of speech perception
Audio-visual speech perception
Plasticity of speech perception
It is intended to provide a platform for interchanges between students, junior and senior researchers, and hence, we would like each participant to feel free to contribute
to any of these topics.
Submission
For abstract submission, please include the name(s) of the author(s), affiliations, and a contact e-mail address in the first lines of the body of the message.
Texts should be written in English. Since the number of participants is limited to 40, registration will be restricted
and based on the scientific quality of the submitted abstract.
Authors are invited to present their work in discussion groups or poster sessions at the summer school.
All details can be viewed at the summer school website
Important dates
Deadline for the application is the 2nd of May, 2007!
Notification of acceptance May 21st, 2007
Summer school September 16th-21st,2007
Registration
The number of participants is limited to 40.
There will be no registration fee. Participants will have to pay for lodging and board.
We are currently trying to get further funding for participants.
Invited speakers are:
Monica Baciu (LPNC, UPMF, Grenoble)
Grzegorz Dogil (Stuttgart university)
Hélène Loevenbruck (ICP/Gipsa-lab, CNRS, Grenoble)
Marc Sato (CRLMB, McGill university, Montréal)
Jean-Luc Schwartz (ICP/Gipsa-lab, CNRS, Grenoble)
Christophe Pallier (INSERM U562, Gif sur Yvette)
Georg Meyer (School of psychology, university of Liverpool)
Bernd Kröger (UK Aachen)
Organizers
Susanne Fuchs (ZAS, Berlin)
Hélène Loevenbruck (ICP, GIPSA-lab, Grenoble)
Daniel Pape (ZAS, Berlin)
Pascal Perrier (ICP, GIPSA-lab, Grenoble)
Studentships available for 2006/7 at the Department of Computer
Science The University of Sheffield - UK One-Year MSc in HUMAN
LANGUAGE TECHNOLOGY The Sheffield MSc in Human Language Technology has
been carefully tailored to meet the demand for graduates with the
highly-specialised multi-disciplinary skills that are required in HLT,
both as practitioners in the development of HLT applications and as
researchers into the advanced capabilities required for next-generation
HLT systems. The course provides a balanced programme of instruction
across a range of relevant disciplines including speech technology,
natural language processing and dialogue systems. The programme is
taught in a research-led environment. This means that you will study the
most advanced theories and techniques in the field, and also have the
opportunity to use state- of-the-art software tools. You will also have
opportunities to engage in research-level activity through in-depth
exploration of chosen topics and through your dissertation. Graduates
from this course are highly valued in industry, commerce and academia. The
programme is also an excellent introduction to the substantial research
opportunities for doctoral-level study in HLT. A number of studentships
are available, on a competitive basis, to suitably qualified applicants.
These awards pay a stipend in addition to the course fees. See further details
of the course Information on
how to apply
top
BOOKS, DATABASES, SOFTWARES
Databases
HIWIRE database
We would like to draw your attention to the Interspeech 2007 special
session
"Novel techniques for the NATO non-native Air Traffic Control
and HIWIRE cockpit databases"
http://www.interspeech2007.org/Technical/nato_atc.php
that we are co-organizing. For this special session we make available
(free of charge) the
cockpit database, along with training
and testing HTK scripts. Our goal is to investigate feature extraction,
acoustic modelling and adaptation algorithms for the problem of
(hands-free) speech recognition in the cockpit. A description of the task,
database and ordering information can be found at the
website of the project
We hope that you will be able to participate to this special session.
Alex Potamianos, TUC
Thibaut Ehrette, Thales Research
Dominique Fohr, LORIA
Petros Maragos, NTUA
Marco Matassoni, ITC-IRST
Jose Segura, UGR
- Language Resources Catalogue - Update
ELRA is happy to announce that new Speech Related Resources are now
available in its catalogue.
Moreover, we are pleased to announce that years 2005 and 2006 from the
Text Corpus of "Le Monde" (ELRA-W0015) are now available.
*ELRA-S0235 LC-STAR Hebrew (Israel) phonetic lexicon
*The LC-STAR Hebrew (Israel) phonetic lexicon comprises 109,580 words,
including a set of 62,431 common words, a set of 47,149 proper names
(including person names, family names, cities, streets, companies and
brand names) and a list of 8,677 special application words. The lexicon
is provided in XML format and includes phonetic transcriptions in SAMPA.
More information
*ELRA-S0236 LC-STAR English-Hebrew (Israel) Bilingual Aligned Phrasal
lexicon
*The LC-STAR English-Hebrew (Israel) Bilingual Aligned Phrasal lexicon
comprises 10,520 phrases from the tourist domain. It is based on a list
of short sentences obtained by translation from US-English 10,449
phrasal corpus. The lexicon is provided in XML format.
More information
*ELRA-S0237 LC-STAR US English phonetic lexicon
*The LC-STAR US English phonetic lexicon comprises 102,310 words,
including a set of 51,119 common words, a set of 51,111 proper names
(including person names, family names, cities, streets, companies and
brand names) and a list of 6,807 special application words. The lexicon
is provided in XML format and includes phonetic transcriptions in SAMPA.
More information
*ELRA-W0015 Text corpus of "Le Monde"
*Corpus from "Le Monde" newspaper. Years 1987 to 2002 are available in
an ASCII text format. Years 2003 to 2006 are available in .XML format.
Each month consists of some 10 MB of data (circa 120 MB per year).
More information
*ELRA-S0238 MIST Multi-lingual Interoperability in Speech Technology
database
*The MIST Multi-lingual Interoperability in Speech Technology database
comprises the recordings of 74 native Dutch speakers (52 males, 22
females) who uttered 10 sentences in Dutch, English, French and German,
including 5 sentences per language identical for all speakers and 5
sentences per language per speaker unique. Dutch sentences are
orthographically annotated.
More information and also
More information
*ELRA-S0239 N4 (NATO Native and Non Native) database
*The (NATO Native and Non Native) database comprises speech data
recorded in the naval transmission training centers of four countries
(Germany, The Netherlands, United Kingdom, and Canada) during naval
communication training sessions in 2000-2002. The material consists of
native and non-native speakers using NATO Naval English procedure
between ships, and reading from a text, "The North Wind and the Sun," in
both English and the speaker's native language. The audio material was
recorded on DAT and downsampled to 16kHz-16bit, and all the audio files
have been manually transcribed and annotated with speakers identities
using the tool, Transcriber.
More information and also
More information
*ELRA-W0047 Catalan Corpus of News Articles
*The Catalan Corpus of News Articles comprises articles in Catalan from
1 January 1999 to 31 March 2007. These articles are grouped per
trimester without chronological order inside.
More information and also
More information
*ELRA-L0075 Bulgarian Linguistic Database
*This database contains 81,647 entries in Bulgarian with a linguistic
environment tool (for WINDOWS XP). The data may be used for
morphological analysis and synthesis, syntactic agreement checking,
phonetic stress determining.
More information and also
More information
For more information on the catalogue, please contact
Valérie Mapelli
Our on-line catalogue has moved to
the following address. Please update your bookmarks.
Books
Speech and Language Engineering
Editor: Martin Rajman
Publisher: EPFL Press, distributed by CRC Press
Year: 2007
Human Communication Disorders/ Speech therapy
This interesting series can be listed on Wiley website
Incurses em torno do ritmo da fala
Author: Plinio A. Barbosa
Publisher: Pontes Editores (city: Campinas)
Year: 2006 (released 11/24/2006)
(In Portuguese, abstract attached.)
Website
Speech Quality of VoIP: Assessment and Prediction
Author: Alexander Raake
Publisher: John Wiley & Sons, UK-Chichester, September 2006
Website
Self-Organization in the Evolution of Speech, Studies in the Evolution
of Language Author: Pierre-Yves Oudeyer Publisher:Oxford University Press
Website
Speech Recognition Over Digital Channels Authors: Antonio M.
Peinado and Jose C. Segura Publisher: Wiley, July 2006 Website
Multilingual Speech Processing Editors: Tanja Schultz and
Katrin Kirchhoff , Elsevier Academic Press, April 2006 Website
Reconnaissance automatique de la parole: Du signal a
l'interpretation Authors: Jean-Paul Haton Christophe
Cerisara Dominique Fohr Yves Laprie Kamel Smaili 392 Pages
Publisher: Dunod
top
JOB OPENINGS
We invite all laboratories and industrial companies which have job
offers to send them to the ISCApad
editor: they will appear in the newsletter and on our website for
free. (also have a look at http://www.isca-speech.org/jobs.html
as well as http://www.elsnet.org/
Jobs)
Several openings at Nuance
Nuance is the leading provider of speech and imaging solutions for businesses and consumers around the world. Every day, millions of users and thousands of businesses
experience Nuance by calling directory assistance, requesting account information, dictating patient records, telling a navigation system their destination, or digitally reproducing
documents that can be shared and searched. With more than 2000 employees worldwide, we are committed to make the user experience more enjoyable by transforming
the way people interact with information and how they create, share and use documents. Making each of those experiences
productive and compelling is what Nuance is about.
RESEARCH ENGINEER LVCSR
As part of the team, the candidate will be creating speech technologies for embedded applications varying from
simple command and control up to natural speech dialogue on mobile and automotive platforms.
The candidate will join an experienced international team with expertise in creating, optimizing and testing
portable ASR software for embedded devices. The position can be located at our Aachen office (Germany),
the Nuance International Headquarters in Merelbeke, near Ghent (Belgium), or our Tel-Aviv office (Israel).
Responsibilities
· Design, implementation, evaluation, optimization and testing of new algorithms and tools, with a focus on search,
grammar processing, dictation and/or natural language understanding.
· Occasional involvement in the creation of demonstrators and evaluators of the developed technology.
· Occasionally provide support directly to customers and to our Embedded Professional Services Team
Required Skills
· Background in ASR research is required: preferably with ASR search techniques,
CFG parsing, FSM processing, dictation or natural language understanding technology.
· Knowledge of C, C++, Pearl or Python, Matlab
· Excellent English communication skills, written and spoken.
· Positive "can-do" attitude, well organized
· Customer Service attitude, excellent presentation and communication skills
· Ability to deal with critical situations.
Ideal Skills
· 2-3 years work experience in relevant area or Ph.D. in speech processing
Education
· MS or BS in Computer Science, Computer Engineering or related Technical Degree
Please send your CV in English to : Deanna Roe
SOFTWARE ENGINEER
Key Responsibilities
In this role, you will be responsible for porting Nuance state-of-the-art speech recognition and text to
speech products to different embedded platforms typically found in the automotive, mobile or game station markets.
As development engineer in the Professional Services team, you will be in close contact with the research lab
developing the technologies, and will be contributing to speech enabling mobile phones (dialing by voice, SMS reading),
automotive hands-free car-kits (dialing by voice, acoustical echo canceller), automotive and mobile navigation systems
(route guidance or traffic information through TTS), game stations, or similar kind of devices where speech is bringing
clear added value. As experienced engineer, you will rapidly be working together with our customers’ integration teams
to help them including our technologies into successful products in an efficient way, and managing from A to Z
complete projects and customer interaction. While most of the job will take place in our offices in Merelbeke,
a few short trips inside or outside Europe might be possible.
Your Profile
- Requirements:
o Bachelors or Graduate University degree in Electrical Engineering, Computer Engineering,
Computer Science or equivalent.
o Professional experience in the Embedded market
o Fluent in English, both written and spoken
o Strong customer communication skills
o Strong Ansi C programming skills
o Ability to travel for short trips
o Strong team player, capable to work independently or/and to manage other engineers when needed
o Self learner, with sense of initiative, and perseverance to deliver high quality work
- Nice to have
o Experience with embedded hardware platforms, operating systems, and software development
o Knowledgeable about audio streaming technologies, real-time protocols, codecs, signal processing
o Knowledgeable about WinCe, Linux
o Multi-lingual
Please send your CV in English to : Deanna Roe
JUNIOR MOBILE APPLICATION DEVELOPER
Located in Aachen (Germany)
As a Mobile application developer you will be part of a team creating ground-breaking software for mobile devices
(smartphones or personal navigation devices,). You will design, implement and maintain applications that make uses
of Nuance’s state-of-the-art speech recognition and speech synthesis technologies to deliver new experience to Nuance
customers: hands-free dialing and control of the phone, selection of songs on a media player by voice, voice entry
of destination in a navigation device or dictation of SMS. You will follow up with latest industry developments
for mobile platform and work with new mobile devices before they hit the market. You will work in a multi-cultural
international team located in Belgium, Germany, Israel and the USA.
Qualifications:
· Bachelor or Master Degree in Computer Science, Computer Engineering, Electrical Engineering of equivalent.
· Strong C/C++ programming skills
· Hands-on experience with object-oriented design & object-oriented programming
· Good software/system troubleshooting skills
· Familiarity with one of the following development platforms: Symbian, Windows Mobile, BREW, J2ME.
· Fluent in English, both written in spoken.
· Ability to work in a multinational team
· Self- & quick learner
· Ability to work under time pressure and take initiative
Strong candidates will also have the following;
· Understanding of speech recognition software.
· Knowledge of Java.
· Practical experience with Nokia’s S60 development platform.
· Familiarity with cross-platform software design principles
· Exposure to Agile software development methodologies
· Knowledge of other European or Asian languages
Please send your CV in English to Deanna Roe
If you have any questions, please contact her on +44 7717506622
TEMPORARY FULL-TIME LANGUAGE SPECIALIST
Nuance Communications, Inc., a world-wide leader in speech technology, is seeking full-time temporary language specialist
to develop our newest TTS technology into various languages.
Positions are available for the following languages:
· Finnish
· Dutch ( The Netherlands)
· Australian English
· Mandarin (People’s Republic of China)
· Mandarin (Taiwan)
· Cantonese
· Japanese
· Korean
· Danish
· Norwegian
· Egyptian Arabic
· Greek
· Hindi
· Portuguese (Iberian)
· Indian English
· Russian
· Swedish
· Turkish
· Spanish (Catalan)
Responsibilities:
· Helping in development of the linguistic processing components
· Design of text corpora for recording and testing
· Processing speech data
· Creating and tuning the TTS voice
· Testing and productization of the TTS voice
.
Requirements:
· Native or near native speaker of one of them above-mentioned languagesbr>
· Speak English fluently (working language)
· Have some experience working in speech/NLP/linguistics either in academia or in industry
· Have some computational ability – no programming is required, but you should be comfortable working with MS Windows
· If you know some AWK or Perl, it would be an advantage
· Willing to work in Merelbeke (near Ghent in Belgium) for the duration of the contract
Offer:
-A 3-5 months contract
-Training in all aspects of the job, and you will be working with a talented and committed team
-An exciting job with an innovative and performing international company
-The opportunity to work in an enthusiastic, supportive team
-Competitive remuneration package
-Relocation, travel and accommodation assistance
Please send your CV in English to Deanna Roe
ALGORITHMS DEVELOPER
We are seeking a brilliant algorithms developer for research and implementation of core technology for
speech recognition and synthesis on cellular handsets.
Responsibilities:
-Research and implementation of technology that is required for different applications of mobile devices, such as voice
enabled dialing, application launching etc.
-Develop modeling methods and algorithms for enhanced quality and robustness considering limited computational resources
-Specific models and algorithms for different languages
-Define and implement schemes and tools that are required for turning technologies into products – such as
creating specific models for project and tuning parameters
-Deliver models and algorithms to the product group – guide and support the absorption of new technologies.
-Development of methodology and tools for research and model creation
-M.Sc. in computer-science, software engineering, physics, or mathematics, B.Sc. with distinction will also be considered.
Your profile:
- Completed the following courses with grade above 85: At least one C/C++ course with final project,
Probability theory and statistics, Algorithms
- Excellent research and implementation skills are required with at least 1 year of proven experience in
the following areas: Developing C/C++ code, Development, implementation and analysis of algorithms and mathematical models
- Knowledge of Perl, Python, Windows/Linux scripts, SQL
- Experience in the following areas is highly desirable: Speech– processing, Machine learning, Statistical modeling;
Neural networks, Bioinformatics.
- Knowledge in one or more of the following an advantage:Digital signal processing, Image processing,
Computational linguistics
- Advanced written and spoken English
Locations: Israel, Tel aviv / Germany, Aachen / Belgium, Merelbeke
Please send your CV in English to Deanna Roe
The experience speaks for itself
POSTDOCTORAL POSITIONS WITHIN THE SOUND TECHNOLOGY GROUP, LINKOPING
, SWEDEN
Two positions for postdoctoral associates are available within the Sound
Technology Group Sound
Technology Group , Digital Media
Division at the Department of Science and
Technology (ITN),
Linkoping Universityat Campus Norrkoping, Sweden.
Our research is focused on physical and perceptual models of sound
sources, sound source separation and adapted signal representations.
Candidates must have a strong background in research and a completed
Ph.D.
Programming skills (e.g. Matlab, C/C++ or Java) are very desirable, as
well as expertise in conducting acoustic/auditory experiments.
We are especially interested in candidates with research background in
the following areas:
. Auditory Scene Analysis
Classification of Sound Sources
. Sound Processing
. Spatial Audio and Hearing
. Time-Frequency and Wavelet Representations
. Acoustics (theoretical and experimental)
but those with related research interests are also welcome to apply.
Preferred starting date is September 2007.
Inquiries and CVs must be addressed to
Prof. G. Evangelista
Digital Media Division
Department of Science and Technology (ITN)
Linkoping Institute of Technology (LiTH) at Campus Norrkoping
Room: K5724 (K=E5kenhus, Bredgatan 33)
SE-60174 Norrkoping, Sweden
Phone: +46 11 36 31 01
Fax: +46 11 36 32 70
Cambridge University Engineering Department-
Machine Intelligence Laboratory - Speech Group
Research Associate in Spoken Dialogue Systems
Applications are invited for a Research Associate position in the
Machine
Intelligence Laboratory to join a group lead by Professor Steve Young
working in the area of Spoken Dialogue Systems. The technical focus is
on
the use of reinforcement learning within man-machine interfaces to
enable
automatic learning of dialogue behaviour and on-line adaptation. The
work
will involve statistical modelling, algorithm design, system development
and
user evaluation. The successful candidate will have a good first degree
and
preferably a higher degree in a relevant area. Good programming skills
in
C/C++ are essential.
The appointment will be for two years initially, starting as soon as
possible. Salary is in the range =A324,402 to =A331,840 p.a. Further
details
and an application form can be found at
our website. Informal enquiries should be
addressed by email to Professor Young. Applicants
should email their completed application form together with their CV and
a
covering letter describing their research experiences, interests and
goals
to Rachel Fogg Rachel Fogg no later than 15th July 2007.
The University is committed to equality of opportunity
Opening on Speech recognition at Telefonica, Barcelona (Spain)
The speech Technology Group at Telefonica Investigacion y Desarrollo
(TID) is looking for a highly qualified candidate for an
engineering position on speech recognition and related technologies.
The selected person will become a part of a multidisciplinary team of
young highly motivated people in an objective driven, friendly
atmosphere located in a central area of Barcelona (Spain).
Minimum requirements ar:
Degree in Computer Science /Electrical Engineering/Computational
Linguistics or similar with 2+ years of experience (Ph.D. preferred)
on speech technology.
Good knowledge of speech recognition and speech synthesis.
Proven programming expertise in C++ and Java
Good level of English (required) and some knowledge of Spanish
(preferred)
High motivation and teamwork spirit
Salary depending on the experience and value of the applicant
Starting date as soon as possible
The speech technology group is a well established group within TID
with more than 15 years of experience in research and development of
technology for internal use of Telefonica group as well as outside
organizations. It is also a very active partner in many National and
European projects. TID is the research and development company inside
the Telefonica group, currently one of the biggest Telecom companies.
It is the biggest private research center in Spain in number of
employees and available resources.
Please send your resume and contact information to
Sonia Tejero
Tlf: +34 93 365 3024
Sound to Sense:
18 Fellowships in speech research
Sound to Sense (S2S) is a Marie Curie Research Training Network involving collaborative speech research amongst 13 universities in 10 countries.
18 Training Fellowships are available, of which 12 are predoctoral and 6 postdoctoral (or equivalent experience). Most but not all are planned to start in September or October 2007.
A research training network’s primary aim is to support and train young researchers in professional and inter-disciplinary scientific skills that will equip them for
careers in research. S2S’s scientific focus is on cross-disciplinary methods for modelling speech recognition by humans and machines. Distinctive aspects of our
approach include emphasis on richly-informed phonetic models that emphasize communicative function of utterances, multilingual databases, multiple time domain analyses,
hybrid episodic-abstract computational models, and applications and testing in adverse listening conditions and foreign language learning.
Eleven projects are planned. Each can be flexibly tailored to match the Fellows’ backgrounds, research interests, and professional development needs, and will fall into one of four broad themes.
1: Multilinguistic and comparative research on Fine Phonetic Detail (4 projects)
2: Imperfect knowledge/imperfect signal (2 projects)
3: Beyond short units of speech (2 projects)
4: Exemplars and abstraction (3 projects)
The institutions and senior scientists involved with S2S are as follows:
* University of Cambridge, UK (S. Hawkins (Coordinator), M. Ford, M. Miozzo, D. Norris. B. Post)
* Katholieke Universiteit, Leuven, Belgium (D. Van Compernolle, H. Van Hamme, K. Demuynck)
* Charles University, Prague, Czech Republic (Z. Palková, T. Dub?da, J. Volín)
* University of Provence, Aix-en-Provence, France (N. Nguyen, M. d’Imperio, C. Meunier)
* University Federico II, Naples, Italy (F. Cutugno, A. Corazza)
* Radboud University, Nijmegen, The Netherlands (L. ten Bosch, H. Baayen, M. Ernestus, C. Gussenhoven, H. Strik)
* Norwegian University of Science and Technology (NTNU), Trondheim, Norway (W. van Dommelen, M. Johnsen, J. Koreman, T. Svendsen)
* Technical University of Cluj-Napoca, Romania (M. Giurgiu)
* University of the Basque Country, Vitoria, Spain (M-L. Garcia Lecumberri, J. Cenoz)
* University of Geneva, Switzerland (U. Frauenfelder)
* University of Bristol, UK (S. Mattys, J. Bowers)
* University of Sheffield, UK (M. Cooke, J. Barker, G. Brown, S. Howard, R. Moore, B. Wells)
* University of York, UK. (R. Ogden, G. Gaskell, J. Local)
Successful applicants will normally have a degree in psychology, computer science, engineering, linguistics, phonetics, or related disciplines, and want to acquire expertise in one or more of the others.
Positions are open until filled, although applications before 1 May 2007 are recommended for starting in October 2007.
Further details are available from the web about:
+ the research network (92kB) and how to apply,
+ the research projects(328 kB).
Research scientist- Speech Technology- Princeton, NJ, USA
Company
Profile: Headquartered in Princeton, NJ, ETS (Educational Testing Service)is the world's premier educational measurement institution and a
leader in educational research. As an innovator in developing achievement and occupational tests for clients in business,
education, and government, we are determined to advance educational excellence for the communities we serve.
Job Description: ETS Research & Development has a Research Scientist opening in the Automated Scoring and
Natural Language Processing Group. This group conducts research focusing on the development of new capabilities
in automated scoring and NLP-based analysis and evaluation systems, which are used to improve assessments, learning tools
and test development practices for diverse groups of users that include K-12 students, college students,
English Language Learners and lifelong learners. The Research Scientist position involves applying scientific,
technical and software engineering skills to designing and conducting research studies and developing capabilities
in support of educational products and services. The job is a full-time job.
Required qualifications
· A Ph.D. in Natural Language Processing, Computational Linguistics, Computer Science, or Electrical Engineering with a
focus on speech technology, particularly speech recognition. Knowledge of linguistics is a plus.
· Evidence of at least three years of independent substantive research experience and/or experience in developing
and deploying speech technology capabilities, preferably in educational environments.
· Demonstrable contributions to new and/or modified theories of speech processing and their implementation in
automated systems.
· Practical expertise with speech recognition systems and fluency in at least one major programming language (e.g.,
Java, Perl, C/C++, Python).
· Three years of independent substantive research experience and/or experience in developing and deploying speech
technology capabilities, preferably in educational environments.
How to apply Please send copy of your resume, along with cover letter stating salary requirements and job #2965,
to e-mail
ETS offers competitive salaries, outstanding benefits, a stimulating work environment, and attractive growth potential.
ETS is an Equal Opportunity, Affirmative Action Employer.
Web site
Research Fellow in Speech Synthesis-
Centre for Speech Technology Research/
University of Edinburgh
The Centre for Speech Technology Research at the University of Edinburgh
is seeking a research fellow to work on the speech synthesis project
"Automatically-determined inventories for speech synthesis". This
project uses machine learning techniques to automatically discover, from
speech data, a set of units for speech synthesis - that is, an
alternative to manually-specified phoneme-based units such as diphones.
This research is currently being conducted within a concatenative (i.e.
unit selection) framework, but we now seek to extend this to the other
major synthesis technique: statistical parametric synthesis, based on
Hidden Markov Models (i.e., trajectory HMMs). The successful candidate
will be expected to contribute, plan and execute new research, as well
as extend our existing techniques.
You ideally will have a PhD in speech synthesis and experience of
trajectory Hidden Markov Models. You will have very good programming
skills, preferably in C++, and experience with one or more of:
concatenative speech synthesis techniques; statistical models of speech;
perceptual evaluations; Festival. An automatic speech recognition
background is also appropriate for this position. This post is fixed
term for 15 months.
For more information and application instructions, consult our website and enter vacancy number 3006866.
Software Engineer Position at Be Vocal, Mountain View, CA,USA
We are currently looking for a Software Engineer with previous exposure to Speech, to work in our Speech and Natural Language Technology group.
This group’s mission is to be the center of excellence for speech and natural language technologies within BeVocal. Responsibilities include assisting in the
development of internal tools and processes for building Natural Language based speech applications as well as on ongoing infrastructure/product improvements.
The successful candidate must be able to take direction from senior members of the team and will also be given the opportunity to make original contributions to
new and existing technologies during the application development process. As such, you must be highly motivated and have the ability to work well independently
in addition to working as a team.
Responsibilities
* Develop and maintain speech recognition/NLP tools and supporting infrastructure
* Develop and enhance component speech grammars
* Work on innovative solutions to improve overall Speech/NL performance across BeVocal’s deployments.
Requirements
* BS in Computer Science, Electrical Engineering or Linguistics, an MS is a preferred.
* 2-5 years of software development experience in Perl, Java, C/C++. A willingness and ability to pick up additional software languages as needed is essential.
* Exposure or experience with speech recognition/pattern recognition either from an academic environment or directly related work experience.
* Experience working as part of a world-class speech and language group is highly desirable.
* Experience building natural language applications is preferred.
* Experience building LVCSR speech recognition systems is a plus.
For immediate consideration, please send your resume by email and include
"Software Engineer, Speech" in the subject
line of your email. Principals only please (no 3rd parties or agencies). Contact
for details
BeVocal's policy is to comply with all applicable laws and to provide equal employment opportunity for all applicants and employees without regard to
non-job-related factors such as race, color, religion, sex, national origin, ancestry, age, disability, veteran status, marital status or sexual orientation.
This policy applies to all areas of employment, including recruitment, hiring, training, promotion, compensation, benefits, transfer, and social and recreational
programs.
Position at Saybot in China
Job title: Speech Scientist
Location: China (Beijing or Shanghai)
Saybot develops software technology and curricula for learning spoken english. Since 2005, we have been
building software which features state-of-the-art speech technologies and innovative interactive lessons to
help users practice speaking English. We are currently looking for talented speech scientists to help strengthen
our R&D team and to develop our next-generation products. Successful candidates would have proven excellence
and good work ethics in academic or industry context and demonstrated creativity in building speech systems
with revolutionary designs.
* MS/PhD degree in speech technology (or related).
* Expertise in at least one of the following areas and basic knowledge of the others:
o acoustic model training,
o speaker adaptation,
o natural language understanding,
o prosody analysis,
o embedded recognizers.
* Excellent programming skills in both object-oriented languages (C++, C# or Java) and scripting (Perl or Python).
* Good knowledge and experience in at least one commonly used recognizer (HTK, Sphinx, Nuance...).
* Excellent communication skills in written and oral English.
* Experience in machine translation is a plus.
* Experience in VoIP integration is a plus.
* Experience in language teaching is a plus.
Contact: Sylvain Chevalier
2 Positions in Research and Development in "Audio description and indexing" at IRCAM-Paris
PRESENTATION OF THE SAMPLE ORCHESTRATOR PROJECT:
The goal of the Sample Orchestrator project is to develop and test new applications
for managing and manipulating sound samples based on audio content.
On the one hand the commercial availability of large databases of sound samples
available on various supports (CD, DVD, online), are currently limited in their
applications (synthesizers by sampling). On the other hand, recent scientific and
technological development in audio indexing and database management allow the development
of new musical functions: database management based on audio content, audio processing
driven by audio content, development of orchestration tools.
TASKS:
Two positions are available from April 15th 2007 within the "Equipe Analyse/Synthese" of Ircam
for (each) a 12 months total duration (possibility of extending the contracts).
The main tasks to be done for the research and development positions are:
- Research and development of new audio features and algorithms
for the description of instrumental, percussive and FX sounds.
- Research and development of new audio features and algorithms
for the morphological description of sounds
- Research and development of new audio features and algorithms
for sounds containing "loops"
- Research and development of algorithms for automatic audio indexing
- Research and development of algorithms for fast search by similarity in large databases
- Participation in the definition of the specification
- Participation in user evaluation and feedback
- Integration into the final application
RESEARCH POSITION:
REQUIRED EXPERIENCE AND COMPETENCE:
- High skills in Audio indexing and signal processing
- High skills in Matlab programming
- High productivity, methodical work, excellent programming style.
- Good knowledge of UNIX, Mac and Windows environments
SALARY:
According to background and experience.
DEVELOPMENT POSITION:
REQUIRED EXPERIENCE AND COMPETENCE:
- Skills in Audio indexing and signal processing
- High skills in C/C++ programming
- High productivity, methodical work, excellent programming style.
- Good knowledge of UNIX, Mac and Windows environments
SALARY:
According to background and experience.
EEC WORKING PAPERS:
In order to start immediately, the candidate should preferably have EEC citizenship
or already own valid EEC working papers.
AVAILIBILITY:
The positions are available in the "Analysis/Synthesis" team in the R&D department
from April 15th 2007 for (each) a duration of 12 months (possibility of extending the contracts).
TO APPLY:
Please send your resume with qualifications and informations addressing the above issues,
preferably by email
Xavier Rodet, Analyse/Synthese team manager).
or by fax at:
(33 1) 44 78 15 40, care of Xavier.Rodet
or by surface mail to:
Xavier Rodet, IRCAM, 1 Place Stravinsky, 75004 Paris.
IRCAM:
IRCAM is a leading non-profit organization dedicated to musical production,
R&D and education in acoustics and music, located in the center of Paris (France),
next to the Pompidou Center. It hosts composers, researchers and students from many countries
cooperating in contemporary music production, scientific and applied research.
The main topics addressed in its R&D department are acoustics, psychoacoustics, audio synthesis
and processing, computer aided composition, user interfaces, real time systems.
Detailed activities of IRCAM and its groups are presented on our
WWW server.
RESEARCH AND DEVELOPMENT POSITION IN "AUDIO CONTENT ACCESS" at IRCAM (Paris)
PRESENTATION OF THE MUSICDISCOVER PROJECT :
The goal of the MusicDiscover project is to give access to the contents of
musical audios recordings (as it is the case, for example, for texts),
i.e. to a structured description, as complete as possible, of the recordings:
melody, genre/style, rate/rhythm, instrumentation, musical structure,
harmony, etc. The principal objective is thus to develop and evaluate means
directed towards the contents, which include techniques and tools for
analysis, indexing, representation and search for information. These means
will make it possible to build and use such a structured description.
This project of the ACI "Masses of Data" is carried out in collaboration
between Ircam (Paris), Get-Telecom (Paris) and the LIRIS (Lyon) since
October 2004. The principal lines of research are :
- Rhythmic analysis and detection of ruptures
- Recognition of musical instruments and indexing
- Source Separation
- Structured Description
- Research of music by similarity
- Recognition of musical titles
- Classification of musical titles in genre and emotion.
The available position relates to the construction and the use of the
Structured Description in collaboration with the other lines of research.
DEVELOPMENTS TASKS:
A position is available from December 1st 2006 within the "Equipe
Analyse/Synthese" of Ircam for a 9 months total duration.
The contents of work are as follows:
- Participation in the design of a Structured Description
- Software development for construction and use of Structured Descriptions
- Participation in the definition and development of the graphic interface
- Participation in the evaluations
REQUIRED EXPERIENCE AND COMPETENCE:
- Experience of research in Audio Indexing and signal processing
- Experience in Flash, C and C++ and Matlab programming.
- High productivity, methodical work, excellent programming style.
- Good knowledge of UNIX and Windows environments.
AVAILABILITY :
- The position is available in the "Analysis/Synthesis" team in the R&D
department from November 1st 2006 for a duration of 9 months.
EEC WORKING PAPERS :
- In order to start immediately, the candidate should preferably have EEC
citizenship or already own valid EEC working papers.
SALARY:
- According to background and experience.
TO APPLY:
- Please send your resume with qualifications and informations adressing the
above issues, preferably by email to
Xavier Rodet, Analyse/Synthese team manager.
or by fax at:
(33 1) 44 78 15 40, care of Xavier.Rodet
or by surface mail to:
Xavier Rodet, IRCAM, 1 Place Stravinsky, 75004 Paris.
Introducing IRCAM
IRCAM is a leading non-profit organization dedicated to musical
production, R&D and education in acoustics and music, located in the center
of Paris (France), next to the Pompidou Center. It hosts composers,
researchers and students from many countries cooperating in contemporary
music production, scientific and applied research. The main topics
addressed in its R&D departement are acoustics, psychoacoustics, audio
synthesis and processing, computer aided composition, user interfaces, real
time systems.
Detailed activities of IRCAM and its groups are presented on our
WWW
server
top
JOURNALS
CfP: Speech Communication Journal - Special Issue on “Evaluating new methods and models for advanced speech-based interactive systems”
The aim of this special issue is to explore new evaluation techniques and strategies as applied to advanced dialogue systems, including new models and methods.
Original, previously unpublished submissions addressing some (or all) of the following questions are encouraged:
1. What characteristics of spoken language interaction can and should be incorporated into advanced spoken dialogue systems?
2. What are the best methods for designing such systems? To what extent are automatic design methods appropriate or possible?
3. What criteria can be defined for the evaluation of the performance of advanced spoken dialogue systems?
4. Under what circumstances should these criteria be used?
5. How effective are these criteria in isolating problems with a dialogue strategy and in measuring how correction of the problems improves the dialogue?
6. How can these criteria be used to compare and evaluate alternative dialogue strategies and methods for the design and implementation of dialogue systems?
7. How can these criteria be used to compare the pre-modification version and the post-modification version of a dialogue strategy as developers attempt to
improve the dialogue strategy?
8. How can the evaluation process be streamlined so that it can be frequently and effectively applied to the improvement and comparison of dialogue strategies?
Guest Editors
Michael McTear, University of Ulster, UK
Kristiina Jokinen, University of Helsinki, Finland
James Larson, Oregon Graduate Institute, Oregon, USA
Important Dates
Submission Deadline: 30th June 2007
Notification of Acceptance: 30th November 2007
Final manuscript due: 31 January 2008
Tentative Publication Date: June 2008
Submission Procedure
Prospective authors should follow the regular guidelines of the Speech Communication journal for electronic submission (http://ees.elsevier.com/specom).
During submission authors must select the Article Type: "Special Issue: Spoken Dialogue Technology, not "Regular Paper", and also select
Professor Marc Swerts as the handling Editor-in-Chief.
Full text of CFP
Call for Papers- Special Issue of the
IEEE Transactions on Audio, Speech and Language Processing
on New Approaches to Statistical Speech and Text Processing
Dramatic advances in automatic speech recognition (ASR) technology in
recent years has enabled serious growth in spoken language processing
research, both for human-computer interaction and spoken document
processing. The challenges of working with spoken language, including
ASR errors and disfluencies, were major factors in the adoption of
statistical techniques in the language processing community. Statistical
methods now dominate many areas of text processing as well, enabled by
growing collections of linguistic data resources and developments in
machine learning. While transfer of methods from spoken- to written-
language processing continues, advances in written-language processing
also now have a significant impact on spoken-language processing.
This issue seeks to highlight the cross-fertilization in speech and text
processing by publishing novel statistical modeling and learning methods
that span a variety of language processing applications.
We invite papers describing new approaches to statistical language
processing of both spoken and written language. Submissions must not
have been previously published, with the exception that substantial
extensions of conference papers will be considered. Of particular
interest are methods that transfer recent developments from text
processing to speech processing and vice versa, but new methods in one
domain are also welcome. Papers describing new strategies for
integrating acoustic and linguistic cues in spoken language processing
are also encouraged.
> include:
- Unsupervised and semi-supervised learning
- Discriminative learning
- Transfer or adaptation to new domains
- Active learning
- Reinforcement learning
- Memory-based learning and neighborhood methods
- Novel statistical models
- Statistical methods for feature selection or transformation
Specific applications of interest include information extraction,
question answering, text segmentation and classification, summarization,
translation, language generation and spoken language dialogs. Papers
that address component problems of these larger applications are also
encouraged, including parsing, discourse analysis, and talker
interaction analysis. The issue aims to cover a variety of applications
as well as different statistical methods.
Submission procedure:
Prospective authors should prepare manuscripts according to the
Information for Authors as published in any recent issue of the
Transactions.
Note that all rules will apply with regard to submission lengths,
mandatory overlength page charges, and color charges. Manuscripts should
be submitted electronically through the online
IEEE manuscript
submission system.
When selecting a manuscript type, authors must click on "Special Issue
of TASLP on New Approaches to Statistical Speech and Text Processing".
Authors should follow the instructions for the IEEE Transactions Audio,
Speech and Language Processing and indicate in the Comments to the
Editor-in-Chief that the manuscript is submitted for publication in the
Special Issue on New Approaches to Statistical Speech and Text
Processing. We require a completed copyright form to be signed and faxed
to +1-732-562-8905 at the time of submission. Please indicate the
manuscript number on the top of the page.
Schedule:
Submission deadline: 15 June 2007
Notification of final acceptance: 15 December 2008
Final manuscript due: 1 February 2008
Publication date: May 2008
Guest Editors:
Dr. Bill Byrne Cambridge University, UK
Dr. Mark Johnson Brown University, USA
Dr. Lillian Lee Cornell University, USA
Dr. Steve Renals University of Edinburgh, UK
Papers accepted for FUTURE PUBLICATION in Speech
Communication Full text available on http://www.sciencedirect.com/ for
Speech Communication subscribers and subscribing institutions. Free
access for all to the titles and abstracts of all volumes and even by
clicking on Articles in
press and then Selected papers.
top
FUTURE CONFERENCES
Publication policy: Hereunder, you will find very short announcements
of future events. The full call for participation can be accessed on the
conference websites See also our Web pages (http://www.isca-speech.org/) on
conferences and workshops.
FUTURE INTERSPEECH CONFERENCES
INTERSPEECH 2007-EUROSPEECHAugust 27-31,2007,Antwerp,
Belgium Chair: Dirk van Compernolle, K.U.Leuven and Lou Boves,
K.U.Nijmegen Website
INTERSPEECH 2007 is the eighth conference in the annual series of
INTERSPEECH events and also the tenth biennial EUROSPEECH conference. The
conference is jointly organized by scientists from the Netherlands and
Belgium, and will be held in Antwerp, Belgium, August 27-31, 2007, under the
sponsorship of the International Speech Communication Association (ISCA).
The INTERSPEECH meetings are considered to be the top international
conferences in spoken language processing, with more than 1000 attendees
from universities, industry, and government agencies. The conference offers
the prospect of meeting the future leaders of our field, exchanging ideas,
and exploring opportunities for collaboration, employment, and sales through
keynote talks, tutorials, technical sessions, exhibits, and poster sessions.
In recent years the INTERSPEECH meetings have taken place in a number of
exciting venues including most recently Pittsburgh, Lisbon, Jeju Island
(Korea), Geneva, Denver, Aalborg (Denmark), and Beijing.
AREAS AND TOPICS OF INTEREST:
Interspeech is the world's largest and most comprehensive conference
on Speech Science and Speech Technology and it solicits papers in the
following areas and topics:
A.Human speech production, perception and communication
Phonology and phonetics
Discourse and dialogue
Prosody (production, perception, prosodic structure)
Paralinguistic and nonlinguistic cues (e.g. emotion and expression)
Speech production
Speech perception
Physiology and pathology
Spoken language acquisition, development and learning
B.Speech and Language technology
Speech and audio processing
Speech enhancement
Speech coding and transmission
Spoken language generation and synthesis
Speech recognition
Spoken language understanding
Accent and language identification
Cross-lingual and multi-lingual processing
Multimodal/multimedia signal processing
Speaker characterization and recognition
C.Spoken language systems and applications
Dialogue systems
Systems for information retrieval
Systems for translation
Applications for aged and handicapped persons
Applications for learning and education
Other applications
D.Resources, standardization and evaluation
Spoken language resources and annotation
Evaluation and standardization
PAPER SUBMISSION
Authors will have to declare that their contribution is original
and not being submitted for publication elsewhere (e.g., another
conference, workshop, or journal).
Each corresponding author will be notified by e-mail of the acceptance
or rejection of his paper by May 25, 2007. Minor updates of accepted
papers will be allowed during May 25 - June 3, 2007.
More information is available on
the conference website
INVITED KEYNOTE SPEAKERS
Keynote speaker: ISCA Medalist Prof. Victor Zue (MIT, Cambridge, MA)
Title: On Organic Interfaces
Keynote speaker: Prof. Sophie Scott (UCL, London, UK)
Title: How the Brain Decodes Speech – Some Perspectives from Functional Imaging
Keynote speaker: Prof. Alex Waibel (CMU, Pittsburgh, PA; University of Karlsruhe, Germany)
Title: Computer-Supported Human-Human Multilingual Communication
Keynote speaker: Prof. Luc Steels (Free University Brussels, Belgium; Sony Computer Science Laboratory Paris, France)
Title: Can Robots Invent their Own Language?
IMPORTANT DATES
Full paper submission deadline: March 23, 2007
Notification of paper acceptance/rejection May 25, 2007
Early registration deadline: June 22, 2007
Further information via website or
email.
ORGANIZERS
Professor Dirk Van Compernolle (General Chair)
Professor Lou Boves (General Co-Chair)
c/o
Annitta De Messemaeker
Katholieke Universiteit Leuven
Department of Electrical Engineering
Kasteelpark Arenberg 10
B3001 Heverlee
Belgium
Fax: +32 16 321723
Email
Website
INTERSPEECH 2008-ICSLP September 22-26, 2008, Brisbane,
Queensland, Australia Chairman: Denis Burnham, MARCS, University of West Sydney.
INTERSPEECH 2009-EUROSPEECH Brighton, UK, Chairman:
Prof. Roger Moore,
University of Sheffield.
INTERSPEECH 2010-ICSLP Chiba, Japan
ISCA is pleased to announce that INTERSPEECH 2010 will take place in
Makuhari-Messe, Chiba, Japan, September 26-30, 2010. The event will be chaired by
Keikichi Hirose (Univ. Tokyo), and will have as a theme "Towards Spoken
Language Processing for All - Regardless of Age, Health Conditions,
Native Languages, Environment, etc."
top
FUTURE ISCA TUTORIAL AND RESEARCH WORKSHOP (ITRW)
6th ISCA Speech Synthesis Research Workshop (SSW-6)
University of Bonn (Germany), August 22-24, 2007
A satellite of
INTERSPEECH 2007 (Antwerp)in collaboration with SynSIG and IfK (University of Bonn)
Organized shortly after the 16th International Congress on Phonetic Sciences (Saarbrücken, Germany, August 6-10, 2007).
Like its predecessors in Autrans (France) 1990, New Paltz (NY, USA) 1994, Jenolan (Australia) 1998, Pitlochry (UK) 2001,
and Pittsburgh (PA, USA) 2004, SSW-6 will cover all aspects of speech synthesis
and adjacent fields, such as:
TOPICS (updated list)
* Text processing for speech synthesis
* Prosody Generation for speech synthesis
* Speech modeling for speech synthesis applications
* Signal processing for speech synthesis
* Concatenative speech synthesis (diphones, polyphones, unit selection)
* Articulatory synthesis
* Statistical parametric speech synthesis
* Voice transformation/conversion/adaptation for speech synthesis
* Expressive speech synthesis
* Multilingual and/or multimodal speech synthesis
* Text-to-speech and content-to-speech
* Singing speech synthesis
* Systems and applications involving speech synthesis
* Techniques for assessing synthetic speech quality
* Language resources for speech synthesis
* Aids for the handicapped involving speech synthesis.
Deadlines (updated)
* Full-paper submission (up to 6 pages) - May 14, 2007 (EXTENDED DEADLINE!)
* Notification of acceptance - June 25, 2007
* Deadline for paper modification - July 15, 2007
Please send your papers, preferably as PDF files, as an e-mail
attachment.
Further information can soon be obtained from the
website of the workshop,
Contact Prof. Wolfgang Hess
8th Workshop on Discourse and Dialogue (SIGdial), Antwerp, Belgium
Antwerp, September 2-3, 2007
Held immediately following Interspeech 2007
Continuing with a series of successful workshops in Sydney, Lisbon, Boston, Sapporo, Philadelphia, Aalborg,
and Hong Kong, this workshop spans the ACL
and ISCA SIGdial interest area of discourse and dialogue.
This series provides a regular forum for the presentation of research in this area to both the larger
SIGdial community as well as researchers outside this community. The workshop is organized by SIGdial,
which is sponsored jointly by ACL and ISCA.
Topics of Interest
We welcome formal, corpus-based, implementation or
analytical work on discourse and dialogue including but not restricted
to the following three themes:
1. Discourse Processing and Dialogue Systems
Discourse semantic and pragmatic issues in NLP applications such as text summarization, question answering,
information retrieval including topics like:
· Discourse structure, temporal structure, information structure
· Discourse markers, cues and particles and their use
. (Co-)Reference and anaphora resolution, metonymy and bridging resolution
· Subjectivity, opinions and semantic orientation
Spoken, multi-modal, and text/web based dialogue systems including topics such as:
· Dialogue management models;
· Speech and gesture, text and graphics integration;
· Strategies for preventing, detecting or handling miscommunication (repair and correction types,
clarification and under-specificity, grounding and feedback strategies);
· Utilizing prosodic information for understanding and for disambiguation;
2. Corpora, Tools and Methodology
Corpus-based work on discourse and spoken, text-based and multi-modal dialogue including its support, in particular:
· Annotation tools and coding schemes;
· Data resources for discourse and dialogue studies;
· Corpus-based techniques and analysis (including machine learning);
· Evaluation of systems and components, including methodology, metrics and case studies;
The pragmatics and/or semantics of discourse and dialogue (i.e. beyond a single sentence) including the following issues:
· The semantics/pragmatics of dialogue acts (including those which are less studied in the semantics/pragmatics framework);
· Models of discourse/dialogue structure and their relation to referential and relational structure;
· Prosody in discourse and dialogue;
· Models of presupposition and accommodation; operational models of conversational implicature.
Submissions
The program committee welcomes the submission of long papers for full plenary presentation as well as short papers
and demonstrations. Short papers and demo descriptions will be featured in short plenary presentations, followed by
posters and demonstrations.
· Long papers must be no longer than 8 pages, including title, examples, references, etc. In addition to this, two additional pages are allowed as an appendix which
may include extended example discourses or dialogues, algorithms, graphical representations, etc.
· Short papers and demo descriptions should aim to be 4 pages or less (including title, examples, references, etc.).
Please use the official ACL style files.
Submission/Reviewing will be managed by the START system. Link to follow.
Papers that have been or will be submitted to other meetings or publications must provide this information
(see submission format). SIGdial 07 cannot accept for publication or presentation work that will be (or has been)
published elsewhere.
Authors are encouraged to make illustrative materials available, on the web or otherwise.
For example, excerpts of recorded conversations, recordings of human-computer dialogues, interfaces to working systems,
etc.
Important Dates (subject to change)
Submission May 2, 2007
Notification June 13, 2007
Final submissions July 6, 2007
Workshop September 2-3, 2007
Websites
Workshop website:To be announced
Submission website:To be announced
Sigdial website
Interspeech 2007 website
Email
Program Committee (confirmed)
Harry Bunt, Tilburg University, Netherlands (co-chair) Tim Paek, Microsoft Research, USA (co-chair)
Simon Keizer, Tilburg University, Netherlands (local chair) Wolfgang Minker, University of Ulm, Germany
David Traum, USC/ICT, USA
CfP-SLaTE Workshop on Speech and Language Technology in Education
ISCA Tutorial and Research Workshop
The Summit Inn, Farmington, Pennsylvania USA October 1-3, 2007.
Website
Speech and natural language processing technologies have evolved from being emerging new technologies to being reliable techniques
that can be used in real applications. One worthwhile application is Computer-Assisted Language Learning. This is not only helpful to
the end user, the language learner, but also to the researcher who can learn more about the technology from observing its use in a real setting.
This workshop will include presentations of both research projects and real applications in the domain of speech and language technology
in education.
IMPORTANT DATES
Full paper deadline: May 1, 2007.
Notification of acceptance: July 1, 2007.
Early registration deadline: August 1, 2007.
Preliminary programme available: September 1, 2007.
Workshop will take place: October 1-3, 2007.
LOCATION
The workshop will be held in the beautiful Laurel Highlands. In early October the vegetation in the Highlands puts on a beautiful show of colors
and the weather is still not too chilly. The event will take place at the Summit Inn, situated on one of the Laurel Ridges. It is close to the Laurel Caverns
where amateur spelunkers can visit the underground caverns. The first night event will be a hayride and dinner at a local winery and the banquet will take
place at Frank Lloyd Wright’s wonderful Fallingwater.
TOPICS
The workshop will cover all topics which come under the purlieu of speech and language technology for education.
In accordance with the spirit of the ITRWs, the upcoming workshop will focus on research and results,
give information on tools and welcome prototype demons
trations of potential future applications.
The workshop will focus on research issues, applications, development tools and collaboration. It will be
concerned with all topics which fit under the purview of speech and language technology for education.
Papers will discuss theories, applications, evaluation, limitations, persistent difficulties, general research
tools and techniques. Papers that critically evaluate approaches or processing strategies will be especially
welcome, as will prototype demonstrations of real-world applications.
The scope of acceptable topic interests includes but is not limited to:
- Use of speech recognition for CALL
- Use of natural language processing for CALL
- Use of spoken language dialogue for CALL
- Applications using speech and/or natural language processing for CALL
- CALL tutoring systems
- Assessment of CALL tutors
ORGANIZATION-CONTACT
The workshop is being organized by the new ISCA Special Interest Group, SLaTE.
The general chair is Dr. Maxine Eskenazi from Carnegie Mellon University .
PROGRAMME
As per the spirit of ITRWs, the format of the workshop will consist of a non-overlapping mixture of oral, poster
and demo sessions. Internationally recognized experts from pertinent areas will deliver several keynote lectures
on topics of particular interest.
All poster sessions will be opened by an oral summary by the session chair.
A number of poster sessions will be succeeded by a discussion session focussing on the subject of the session.
The aim of this structure is to ensure a lively and valuable workshop for all involved.
Furthermore, the organizers would like to encourage researchers and industrialists to bring along
their applications, as well as prototype demonstrations and design tools where appropriate.
The official language of the workshop is English. This is to help guarantee the highest degree of
international accessibility to the workshop. At the opening of the workshop hardcopies and CD-ROM of the
abstracts and proceedings will be available.
CALL FOR PAPERS
We seek outstanding technical articles in the vein discussed above. For those who intend to submit papers,
the deadline is May 1, 2007. Following preliminary review by the committee,
notification will be sent regarding acceptance/rejection. Interested authors should send full 4 page camera-ready
papers.
REGISTRATION FEE
The fee for the workshop, including a booklet of Abstracts, the Proceedings on CD-ROM is:
- $325 for ISCA members and
- $225 for ISCA student members with valid identification
Registrations after August 1, 2007 cannot be guaranteed.
ADDITIONAL REGISTRATION INFORMATION
All meals except breakfast for the two and a half days as well as the two special events are included in this price.
Hotel accommodations are $119 per night , and breakfast is about $10. Upon request we will furnish bus transport
from the Greater Pittsburgh Airport and from Pittsburgh to Farmington at a cost of about $30. ISCA membership
is 55 Euros. You must be a member of ISCA to attend this workshop.
ITRW Odyssey 2008
The Speaker and Language Recognition Workshop
21-25 January 2008, Stellenbosch, South Africa
Topics
* Speaker recognition(identification, verification, segmentation, clustering)
* Text dependent and independent speaker recognition
* Multispeaker training and detection
* Speaker characterization and adaptation
* Features for speaker recognition
* Robustness in channels
* Robust classification and fusion
* Speaker recognition corporaand evaluation
* Use of extended training data
* Speaker recognition with speaker recognition
* Forensics, multimodality and multimedia speaker recogntion
* Speaker and language confidence estimation
* Language, dialect and accent recognition
* Speaker synthesis and transformation
* Biometrics
* Human recognition
* Commercial applications
Paper submission
Proaspective authors are invited to submit papers written in English via the Odyssey website. The style guide, templates,and submission
form can be downloaded from the Odyssey website. Two members of the scientific committee will review each paper.
Each accepted paper must have at least one registered author.
The Proceedings will be published on CD
Schedule
Draft paper due July 15, 2007
Notification of acceptance September 15,2007
Final paper due October 30, 2007
Preliminary program November 30, 2007
Workshop January 21-25, 2008
Futher informations: venue, registation...
On the workshop website
Chairs
Niko Brummer, Spescom Data Voice, South Africa
Johan du Preez.Stellenbosch University,South Africa
ISCA TR Workshop on Experimental Linguistics 2008
August 2008, Athens, Greece
Website
Prof. Antonis Botinis
ITRW on Evidence-based Voice and Speech Rehabilitation in Head & Neck Oncology
May 2008, Amsterdam, The Netherlands,
Cancer in the head and neck area and its treatment can have debilitating effects on communication. Currently available treatment options such as radiotherapy,
surgery, chemo-radiation, or a combination of these can often be curative. However, each of these options affects parts of the vocal tract and/or voice to a more
or lesser degree. When the vocal tract or voice no longer functions optimally, this affects communication. For example, radiotherapy can result in poor voice quality,
limiting the speaker’s vocal performance (fatigue from speaking, avoidance of certain communicative situations, etc.). Surgical removal of the larynx necessitates
an alternative voicing source, which generally results in a poor voice quality, but further affects intelligibility and the prosodic structure of speech. Similarly, a
commando procedure (resection involving portions of the mandible / floor of the mouth / mobile tongue) can have a negative effect on speech intelligibility.
This 2 day tutorial and research workshop will focus on evidence-based rehabilitation of voice and speech in head and neck oncology. There will be 4 half day
sessions, 3 of which will deal with issues concerning total laryngectomy. One session will be devoted to research on rehabilitation of other head and neck cancer
sites. The chairpersons of each session will prepare a work document on the specific topic at hand (together with the two keynote lecturers assigned), which will
be discussed in a subsequent round table session. After this there will be a 30’ poster session, allowing 9-10 short presentations. Each presentation consists of
maximally 4 slides, and is meant to highlight the poster’s key points. Posters will be visited in the subsequent poster visit session. The final work document will
refer to all research presently available, discuss its (clinical) relevance, and will attempt to provide directions for future research. The combined work document,
keynote lectures and poster abstracts/papers will be published under the auspices of ISCA.
Organizers prof. dr. Frans JM Hilgers
prof. dr. Louis CW Pols,
dr. Maya van Rossum.
Sponsoring institutions:
Institute of Phonetic Sciences - Amsterdam Center for Language and Communication,
The Netherlands Cancer Institute – Antoni van Leeuwenhoek Hospital
Dates and submission details as well as a website address will be announced in a later issue.
Audio Visual Speech Processing Workshop (AVSP 2008)
Tentative location:Queensland coast near Brisbane (most likely South Stradbroke Island)
Tentative date: 27-29 September 2008 (immediately after Interspeech 2008)
Following in the footsteps of previous AVSP workshops / conferences, AVSP workshop (ISCA Research and Tutorial Workshop) will be hold
concomitantly to Interspeech2008, Brisbane, Australia, 22-26 September 2008. The aim of AVSP2008 is to bring together researchers and practitioners
in areas related to auditory-visual speech processing. These include human and machine AVSP, linguistics, psychology, and computer science.
One of the aims of the AVSP workshops is to foster collaborations across disciplines, as AVSP research is inherently multi-disciplinary.
The workshop will include a number of tutorials / keynote addresses by internationally renowned researchers in the area
of AVSP.
Organizers Roland Goecke,
Simon Lucey,
Patrick Lucey
Australian National University,RSISE, Bldg. 115, Australian National University, Canberra, ACT 0200, Australia
Robust ASR Workshop
Santiago, Chile
October-November 2008
Dr. Nestor Yoma
top
FORTHCOMING EVENTS SUPPORTED (but not organized) by ISCA
AVSP 2007
International Conference on Auditory-Visual Speech
Processing 2007,
August 31 - September 3, 2007
Kasteel Groenendael, Hilvarenbeek, The Netherlands
The next International Conference on Auditory-Visual
Speech Processing (AVSP 2007) will be organised by
different members of Tilburg University (The Netherlands).
It will take place in Kasteel Groenendael in Hilvarenbeek
(The Netherlands) from August 31, 2007 till September 3,
2007, immediately following Interspeech 2007 in Antwerp
(Belgium). Hilvarenbeek is located at close distance from
Antwerp, so that attendance at AVSP 2007 can easily be
combined with participation in Interspeech 2007.
Auditory-visual speech production and perception by human
and machine is an interdisciplinary and cross-linguistic
field which has attracted speech scientists, cognitive
psychologists, phoneticians, computational engineers, and
researchers in language learning studies. Since the
inaugural workshop in Bonas in 1995, Auditory-Visual
Speech Processing workshops have been organised on a
regular basis (see an overview at the avisa website). In
line with previous meetings, this conference will consist
of a mixture of regular presentations (both posters and
oral), and lectures by invited speakers. All presentations
will be plenary.
We are happy to announce that the following experts have
agreed to give a keynote lecture at our conference:
Sotaro Kita (Birmingham)
Asif Ghazanfar (Princeton)
More details about the conference can be found on the
website
Further information
CfP SPECOM 2007
The 12th International Conference on Speech and Computer
October 15-18, 2007
Organized by Moscow State Linguistic University
General Chair:
Prof. Irina Khaleeva
(Moscow State Linguistic University)
Chair:
Prof. Rodmonga Potapova
(Moscow State Linguistic University)
SPECOM'07 is the twelfth conference in the annual series of SPECOM events. It is organized by Moscow State
Linguistic University and will be held in Moscow, Russia, under the sponsorship of Russian Foundation for Basic
Research (RFBR), Ministry of Education and Science of Russian Federation, the International Speech Communication
Association (ISCA) and others.
SPECOM'07 will cover various aspects of speech science and technology. The program of the conference will
include keynote lectures by internationally renowned scientists, parallel oral and poster sessions and an
exhibition. The sci-tech exhibition that will be held during the conference will be open to companies and
research institutions.
The official language of the Conference will be English.
Important Dates (Extended)
Paper submission opening
February 1, 2007
Full paper deadline (EXTENDED)
June 20, 2007
Conference
October 15-18, 2007
Topics
o Speech signal coding and decoding; multi-channel transmitted speech intelligibility; speech information security
o Speech production and perception modeling
o Automatic processing of multilingual, multimodal and multimedia information
o Linguistic, para- and extralinguistic communicative strategies
o Development and testing of automatic voice and speech systems for speaker verification; speaker psychoemotional
state and native language identification
o Automatic speech recognition and understanding systems
o Language and speech information processing systems for robotechnics
o Automated translation systems
o New information technologies for spoken language acquisition, development and learning
o Text-to-speech conversion systems
o Spoken and written natural language corpora linguistics
o Multifunctional expert and information retrieval systems
o Future of multi-purpose and anti-terrorist speech technologies
PAPER SUBMISSION
The deadline for full paper submission (4-6 pages) is April 25, 2007. Papers are to be sent by e-mail to
specom2007@mail.ru. All manuscripts must be in English. Please note that the size of a single letter must not
exceed 10 Megabytes (that is, the total size of all the attached files should not be greater than 7 Megabytes to
leave room for recoding operations performed by the e-mail software). In case the paper files are larger than 7
Megabytes, it is recommended to pack them into a split WinRar or WinZip archive and send part by part in a
series of letter.
All the papers will be reviewed by an international scientific committee. Each author will be notified by
e-mail of the acceptance or rejection of her/his paper by May 30, 2007. Minor updates of accepted papers will be
allowed during May 30 - June 15, 2007.
EVALUATION CRITERIA
Submission of a paper or poster is more likely to be accepted if it is original, innovative, and contributes to
the practice of worldwide scientific communication. Quality of work, clarity and completeness of the submitted
materials will be considered.
REGISTRATION
Registration will be available at the Conference on arrival.
Early registration deadline: July 10, 2007.
The registration fees are planned to be approximately as follows:
Regular
500 EUR
Students/PG Students
200 EUR
NIS (New Independent States), Regular
300 EUR
NIS, Students/PG Students
100 EUR
Russia, Regular
150 EUR
Russia, Students/PG Students (no Proceedings)
Free
Extra Copy of Proceedings (hard copy)
20 EUR
Extra Proceedings CD/DVD
10 EUR
Information regarding accommodation costs will be available later.
All the registration and accommodation payments will be accepted in cash during the registration procedure on
arrival.
PAPER PREPARATION GUIDELINES
In the following you will find guidelines for preparing your full paper to SPECOM'07 electronically.
· To achieve the best viewing experience both for the Proceedings and the CD (or DVD), we strongly
encourage you to use Times Roman font. This is needed in order to give the Proceedings a uniform look. Please
use the attached printable version of this newsletter as a model.
· Authors are requested to submit PDF files of their manuscripts, generated from the original Microsoft
Word sources. PDF files can be generated with commercially available tools or with free software such as
PDFCreator.
· Paper Title - The paper title must be in boldface. All non-function words must be capitalized, and all
other words in the title must be lower case. The paper title is centered.
· Authors' Names - The authors' names (italicized) and affiliations (not italicized) appear centered
below the paper title.
· Abstract - Each paper must contain an abstract that appears at the beginning of the paper.
· Major Headings - Major headings are in boldface.
· Sub Headings - Sub headings appear like major headings, except that they are in italics and not bold
face.
· References - Number and list all references at the end of the paper. The references are numbered in
order of appearance in the document. When referring to them in the text, type the corresponding reference number
in square brackets as shown at the end of this sentence [1].
· Illustrations - Illustrations must appear within the designated margins, and must be positioned within
the paper margins. Caption and number every illustration. All half-tone or color illustrations must be clear
when printed in black and white. Line drawings must be made in black ink on white paper.
· Do NOT include headers and footers. The page numbers, session numbers and conference identification
will be inserted automatically in a post processing step, at the time of printing the Proceedings.
· Apart from the paper in PDF format, authors can upload multimedia files to illustrate their submission.
Multimedia files can be used to include materials such as sound files or movies. The proceedings CD (DVD) will
NOT contain readers or players, so only widely accepted file formats should be used, such as MPEG, Windows WAVE
PCM (.wav) or Windows Media Video (.wmv), using only standard codecs to maximize compatibility. Authors must
ensure that they have sufficient author rights to the material that they submit for publication. Archives (RAR,
ZIP or ARJ format) are allowed. The archives will be unpacked on the CD (DVD), so that authors can refer to the
file name of the multimedia illustration from within their paper. The submitted files will be accessible from
the abstract card on the CD (DVD) and via a bookmark in the manuscript. We advise to use SHORT but meaningful
file names. The total unzipped size of the multimedia files should be reasonable. It is recommended that they do
not exceed 32 Megabytes.
· Although no copyright forms are required, the authors must agree that their contribution, when
accepted, will be archived by the Organizing Committee.
· Authors must proofread their manuscripts before submission and they must proofread the exact files
which they submit.
POSTERS AND PRESENTATIONS
Only electronic presentations are accepted. PowerPoint presentations can be supplied on CD, DVD, FD or USB
Flash drives.
Designated poster space will be wooden or felt boards. The space allotted to one speaker will measure 100 cm
(width) x 122 cm (height). Posters will be attached to the boards using pushpins. Pins will be provided.
Thanks for following all of these instructions carefully!
If you have any questions or comments concerning the submission, please don't hesitate to contact the
conference organizers.
Please address all technical issues or questions regarding paper submission or presentation to our technical
assistant Nikolay Bobrov.
CFP IEEE ASRU 2007
Automatic Speech Recognition and Understanding Workshop
The Westin Miyako Kyoto, Japan
December 9 -13, 2007
Conference website
The tenth biannual IEEE workshop on Automatic Speech Recognition and
Understanding (ASRU) cooperated by ISCA will be held during December
9-13, 2007. The ASRU workshops have a tradition of bringing together
researchers from academia and industry in an intimate and collegial
setting to discuss problems of common interest in automatic speech
recognition and understanding.
WORKSHOP TOPICS
Papers in all areas of human language technology are encouraged to be
submitted, with emphasis placed on:
- automatic speech recognition and understanding technology
- speech to text systems
- spoken dialog systems
- multilingual language processing
- robustness in ASR
- spoken document retrieval
- speech-to-speech translation
- spontaneous speech processing
- speech summarization,
- new applications of ASR.
SUBMISSIONS FOR THE TECHNICAL PROGRAM
The workshop program will consist of invited lectures, oral and poster
presentations, and panel discussions. Prospective authors are invited
to submit full-length, 4-6 page papers, including figures and
references, to the ASRU 2007 website. All papers will be handled and
reviewed electronically. The website will provide you with further
details.
There is also a demonstration session, which has become another
highlight of the ASRU workshop. Demonstration proposals will be
handled separately.
Please note that the submission dates for papers are strict deadlines.
IMPORTANT DATES
Paper submission deadline July 16, 2007
Paper acceptance/rejection notification September 3, 2007
Demonstration proposal deadline September 24, 2007
Workshop advance registration deadline October 15, 2007
Workshop December 9-13, 2007
REGISTRATION AND INFORMATION
Registration will be handled via the ASRU 2007 website .
ORGANIZING COMMITTEE
General Chairs:
Sadaoki Furui (Tokyo Inst. Tech.)
Tatsuya Kawahara (Kyoto Univ.)
Technical Chairs:
Jean-Claude Junqua (Panasonic)
Helen Meng (Chinese Univ. Hong Kong)
Satoshi Nakamura (ATR)
Publication Chair:
Timothy Hazen, MIT, USA
Publicity Chair:
Tomoko Matsui, ISM, Japan
Demonstration Chair:
Kazuya Takeda, Nagoya U, Japan
CfP-3rd International Conference on Large-scale Knowledge Resources (LKR 2008)
3-5 March, 2008, Tokyo Institute of Technology, Tokyo Japan
Website
Sponsored by: 21st Century Center of Excellence (COE) Program
"Framework for Systematization and Application of Large-scale
Knowledge Resources",Tokyo Institute of Technology
In the 21st century, we are now on the way to the knowledge-intensive
society in which knowledge plays ever more important roles. Research
interest should inevitably shift from information to knowledge, namely
how to build, organize, maintain and utilize knowledge are the central
issues in a wide variety of fields. The 21st Century COE program,
"Framework for Systematization and Application of Large-scale
Knowledge Resources (COE-LKR)" conducted by Tokyo Institute of
Technology is one of the attempts to challenge these important
issues. Inspired by this project, LKR2008 aims at bringing together
diverse contribution in cognitive science, computer science, education
and linguistics to explore design, construction, extension,
maintenance, validation, and application of knowledge.
Topics of Interest to the conference includes:
Infrastructure for Large-scale Knowledge
Grid computing
Network computing
Software tools and development environments
Database and archiving systems
Mobile and ubiquitous computing
Systematization for Large-scale Knowledge
Language resources
Multi-modal resources
Classification, Clustering
Formal systems
Knowledge representation and ontology
Semantic Web
Cognitive systems
Collaborative knowledge
Applications and Evaluation of Large-scale Knowledge
Archives for science and art
Educational media
Information access
Document analysis
Multi-modal human interface
Web applications
Organizing committee
General conference chair: Furui, Sadaoki (Tokyo Institute of Technology)
Program co-chairs: Ortega, Antonio (University of Southern California)
Tokunaga, Takenobu (Tokyo Institute of Technology)
Publication chair: Yonezaki, Naoki (Tokyo Institute of Technology)
Publicity chair: Yokota, Haruo (Tokyo Institute of Technology)
Local organizing chair: Shinoda, Koichi (Tokyo Institute of Technology)
Submission
Since we are aiming at an interdisciplinary conference covering wide
range of topics concerning large-scale knowledge resources, authors
are requested to add general introductory description in the
beginning of the paper so that readers of other research area can
understand the importance of the work. Note that one of the
reviewers of each paper is assigned from other topic area to see if
this requirement is fulfilled.
There are two categories of paper presentation: oral and poster. The
category of the paper should be stated at submission. Authors are
invited to submit original unpublished research papers, in English,
up to 12 pages for oral presentation and 4 pages for poster
presentation, strictly following the LNCS/LNAI format guidelines
available at the Springer LNCS Web page.
.
Details of the submission procedure will be announced later on.
Reviewing
The reviewing of the papers will be blind and managed by an
international Conference Program Committee consisting of Area Chairs
and associated Program Committee Members. Final decisions on the
technical program will be made by a meeting of the Program Co-Chairs
and Area Chairs. Each submission will be reviewed by at least three
program committee members, and one of the reviewers is assigned from
a different topic area.
Publication
The conference proceedings will be published by Springer-Verlag in
their Lecture Notes in Artificial Intelligence (LNAI), which will be
available at the conference.
Important dates
Paper submission deadline: 30 August, 2007
Notification of acceptance: 10 October, 2007
Camera ready papers due: 10 November, 2007
e-mail correspondence
Call for Papers (Preliminary version)
Speech Prosody 2008 Campinas, Brazil,
May 6-9, 2008
Speech Prosody 2008 will be the fourth conference of a series of international events of the Special Interest Groups on Speech Prosody (ISCA), starting by the one held in Aix-en Provence, France, in 2002. The conferences in Nara, Japan (2004), and in Dresden, Germany (2006) followed the proposal of biennial meetings, and now is the time of changing place and hemisphere by trying the challenge of offering a non-stereotypical view of Brazil. It is a great pleasure for our labs to host the fourth International Conference on Speech Prosody in Campinas, Brazil, the second major city of the State of São Paulo.
It is worth highlighting that prosody covers a multidisciplinary area of research involving scientists from very different backgrounds and traditions, including linguistics and phonetics, conversation analysis, semantics and pragmatics, sociolinguistics, acoustics, speech synthesis and recognition, cognitive psychology, neuroscience, speech therapy, language teaching, and related fields. Information: sp2008_info@iel.unicamp.br. Web site: http://sp2008.org.
We invite all participants to contribute with papers presenting original research from all areas of speech prosody,
especially, but nor limited to the following.
Scientific Topics
Prosody and the Brain
Long-Term Voice Quality
Intonation and Rhythm Analysis and Modelling
Syntax, Semantics, Pragmatics and Prosody
Cross-linguistic Studies of Prosody
Prosodic variability
Prosody in Discourse
Dialogues and Spontaneous Speech
Prosody of Expressive Speech
Perception of Prosody
Prosody in Speech Synthesis
Prosody in Speech Recognition and Understanding
Prosody in Language Learning and Acquisition
Pathology of Prosody and Aids for the Impaired
Prosody Annotation in Speech Corpora
Others (please, specify)
Organising institutions
Speech Prosody Studies Group, IEL/Unicamp | Lab. de Fonética, FALE/UFMG | LIACC, LAEL, PUC-SP
Important Dates
Call for Papers: May 15, 2007
Full Paper Submission: Sept. 30, 2007
Notif. of Acceptance: Nov. 30, 2007
Early Registration: Dec. 20, 2007
Conference: May 6-9, 2008
top
FUTURE SPEECH SCIENCE AND TECHNOLOGY EVENTS
Séminaire AFCP : Traitement automatique du langage parlé et langues peu dotées
jeudi 21 juin 2007 ; 10h-16h
IMAG / Maison Jean Kuntzmann
Domaine Universitaire de Saint-Martin d'Hères (Grenoble)
accès
Programme préliminaire (liste des interventions) :
-V. Berment (C&S / LIG) "Langues peu dotées : définition et problématiques pour le TALN et le TALP"
-Nimaan Abdillahi, P. Nocera (LIA) "Récents développements du LIA en reconnaissance automatique du Somali"
-Nathalie Vallée (GIPSA / Département Langage et Cognition) : "Organisation syllabique des unités lexicales des langues"
-Thomas Pellegrini, L. Lamel ""Determination d'unites lexicales dans les langues peu-dotees pour la reconnaissance automatique de la parole"
-L. Besacier, V-B Le "Méthodologie du CLIPS pour la reconnaissance automatique de langues peu dotées : application aux
langues khmères, vietnamiennes et à l'arabe dialectal"
-Pierette Bouillon (Univ. Geneve, à confirmer) "MedSLT : a multilingual spoken language translation system tailored for medical
domains and its deployement for less-resourced languages"
-Dr. Pushpak Bhattacharyya (Department of Computer Science and Engineering Indian Institute of Technology Mumbai, India) :
"Spoken Language Technologies for Indian Languages"
Inscriptions : séminaire gratuit mais inscription nécessaire
Ce séminaire est organisé par l'AFCP
Adhérer à l'AFCP
Last CFP-Fifth International Workshop on
Content-Based Multimedia Indexing, CBMI-2007
June 25-27, 2007, Bordeaux, France
The Workshop is supported by IEEE, EURASIP, European research networks
COST292 and Muscle, INRIA, CNRS, Region d'Aquitaine, University Bordeaux
1, IBM
Topics
Multimedia indexing and retrieval (image, audio, video, text)
Multimedia content extraction
Matching and similarity search
Construction of high level indices
Multi-modal and cross-modal indexing
Content-based search techniques
Multimedia data mining
Presentation tools
Meta-data compression and transformation
Handling of very large scale multimedia database
Organisation, summarisation and browsing of multimedia documents
Applications
Evaluation and metrics
Paper submission
Perspective contributors are invited to submit papers via conference
web-site
Submission of full paper (to be received by):
January 25, 2007
Notification of acceptance:
March 10, 2007
Submission of camera-ready papers:
April 10, 2007
Submission of extended versions in Special issue of JSPIC March 1, 2007
Oerganizers
Chair of Organising committee : Jenny Benois-Pineau, LABRI, University
Bordeaux 1, France
Technical Program Chair : Eric Pauwels, CWI, The Netherlands
CfP-14th International Conference on Systems, Signals and Image Processing,
IWSSIP 2007
and
6th EURASIP Conference Focused on Speech and Image Processing, Multimedia Communications and Services
EC-SIPMCS 2007
June 27 – 30, 2007, Maribor, Slovenia
UPDATE: WHAT IS NEW?
* Tutorials are added. Please see the conference official web
site for updated list of all tutorials.
* Take a look at the conference official web site for paper
submission procedure and author registration information. Note that
paper submission deadline is March 18, 2007. Online paper submission
system and automatic notification system are prepared and working.
Please, register and login to the system, to upload the paper(s).
* Take care to make hotel reservations in advance. There are
other events in Maribor at the same time, so hotels may have limited
number of rooms to offer. Look at the conference official web site for
accommodation possibilities.
* Four keynote speakers will have their speech at the conference,
all of them are internationally recognized scientists in their research
areas. See conference official web site for their bio and abstracts.
CALL FOR PAPERS
Download Call for Papers
IWSSIP is an International Conference on Systems, Signals and Image Processing which brings together researchers
and developers from both academia and industry to report on the latest scientific and theoretical advances,
to discuss and debate major issues and to demonstrate state of-the-art systems.
The EURASIP conference is initiated by the European Association for Speech, Signal and Image Processing (EURASIP)
that is focused on Speech and Image Processing, Multimedia Communications and Services (EC-SIPMCS).
The goal of EC-SIPMCS is to promote the interface researchers involved in the development and applications of
methods and techniques within the framework of speech/image processing, multimedia communications and services.
Topics of Interest
The program includes keynote and invited lectures by eminent international experts, peer reviewed contributed
papers, posters, invited sessions on the same or related topics, industrial presentations and exhibitions around
but not limited to the following topics for IWSSIP and EC-SIPMCS conferences:
• Signal Processing and Systems
• Artificial Intelligence Technologies
• ICT in E-learning/Consulting
• Standards and Related Issues
• Image Scanning, Display and Printing
• Video Streaming and Videoconferencing
• Digital Video Broadcasting (DVB)
• Watermarking and Encryption
• Implementation Technologies
• Applications Areas
• Speech and Audio Processing
• Image and Video Processing and Coding
• Audio, Image and Video Indexing and Retrieval
• Multimedia Signal Processing
• Multimedia Databases
• Multimedia and DTV Technologies
• Multimedia Communications, Networking, Services and Applications
• Multimedia Human-Machine Interface and Interaction
• Multimedia Content Processing and Content Description
• Multimedia Data Compression
• Multimedia Systems
Keynote speakers:
Prof. Dr. Kamisety R. Rao, IEEE Fellow, University of Texas Arlington, USA
Prof. Dr. Markus Rupp, Vienna University of Technology, Austria
Prof. Dr. Levent Onural, Bilkent University, Ankara, Turkey
Submission of Regular Papers
Papers must be submitted electronically by March 18, 2007. Each paper will be evaluated by at least two independent
reviewers, and will be accepted based on its originality, significance and clarity.
Publications
All accepted papers will be published in CD Proceedings that will be available at the Conference.
Abstracts of accepted papers will be printed and included in the INSPEC database. Selected papers
will be considered for possible publication in scholarly journals.
Tutorial and Special Sessions
Those willing to prepare a tutorial course and those willing to organize special session during EC-SIPMCS 2007 and IWSSIP 2007 Conference should contact dr. Peter Planinši? at ec2007uni-mb.si.
Important Dates
Paper and Poster Submissions: March 18, 2007
Notification of acceptance: April 20, 2007
Early registration deadline: April 26, 2007
Camera ready copy due: May 6, 2007
Author Registration: May 6, 2007
Contact Information:
Fax: +386 2 220 7272
E-mail
Website
Žarko ?u?ej, General Chair
University of Maribor, Slovenia
Peter Planinši?, Program Chair
University of Maribor, Slovenia
International Workshop
Is a neural theory of language possible?
Development of unified representations in natural and artificial systems
organized by
CRIL
Centro di Ricerca Interdisciplinare sul Linguaggio
and
The CONTACT Project “Learning and development of Contextual Action”
University of Salento
Hotel President,
Lecce, June 28, 29, 30, 2007
Workshop website
Scientific issues and aims of discussion
The CONTACT project and CRIL are collaborating closely to build an integrated system for characterizing the motor processes of speech production by the simultaneous acquisition of data during articulation in many modalities: ultrasound, articulography, laryngography, and audiovisual recording. We seek to identify motoric and neural invariants that share a common structure during the development of perception and production for both speech and manipulation. The effectiveness of hypothesized invariants will be tested on an artificial learning system fed with the collected motoric data, which should autonomously develop its perceptual capabilities in speech recognition.
This collaboration prompted us to organize an interdisciplinary workshop, creating an opportunity for dialogue between first class scientists from the
disciplines of cognitive neuroscience, robotics and linguistics. The hope is that in the future, research in the field of cognitive neuroscience will mature
and converge to an integrated epistemological perspective, leading to the elaboration of a unified neural theory of language and motor control.
4th Joint Workshop on Machine Learning
and Multimodal Interaction (MLMI'07)
28-30 June 2007
Brno, Czech Republic
Hotel Continental ,
a modern hotel located in a quiet part of
the city within walking |