Two Ph.D. fellowships available at  

Department of Electronics and Telecommunications

The Norwegian University of Science and Technology ( NTNU)

Trondheim, Norway


Closing date: January 27, 2008


The Ph.D. scholarships are part of the projects SIRKUS (Spoken Information Retrieval by Knowledge Utilization in Statistical Speech Processing) and SMUDI (Voice Control in Multi-Modal Dialogue).

PhD Fellowship in the SIRKUS project:
The vision of SIRKUS is to develop new architectures for speech recognition that can achieve human-like performance. There is evidence indicating that system performace based on the current paradigm is reaching an upper bound. We believe that in order to achieve our vision, knowledge about speech production and perception as well as a better understanding of speech per se needs to be incorporated into a statistical framework. This implies that new approaches to speech analysis need to be investigated and developed, and that a new statistical framework for modelling speech based on a set of analysis results needs to be defined.

The PhD project
An on-going PhD project will investigate and develop speech analysis methods that will supplement and augment the current methods that are based on spectral analysis. In particular, development of detectors of phonologically significant events, i.e. speech attributes, will be central. A set of speech analyses, including speech attribute detectors, will produce speech event sequences, which can constitute temporally asynchronous observation streams and may be correlated both in time and across observation streams.
This  PhD project will in cooperation with the on-going project develop novel statistical modelling approaches suitable for modelling speech based on event sequences. Integration of the observation streams from the speech analyses in a statistical description of various linguistic units such as phonemes, syllables, words and sentences will be central.
The project will incorporate collaboration with several foreign partners. The scholarships will also include an international visiting research scholarship.

Qualifications:
We seek highly motivated individuals holding a masters degree in electronics engineering, signal processing, statistics, or other relevant disciplines. Experience in speech technology is desirable, but not an absolute requirement.

PhD Fellowship in the SMUDI project:
Large groups of disabled persons have great difficulties accessing information that is available on the internet. Many government and municipal agencies are in the process of changing their preferred interaction with the public, moving to internet based systems for submission of applications and information requests. In many cases it will be necessary for the user to provide information by filling in forms. Some examples are online shopping for goods and services, e.g. air travel and use of public services such as filling in tax return forms.

The PhD project
The goal of the PhD project is to develop a speech based system for filling in internet forms. The work will include methods for interpretation of the internet forms, speech recognition to transform the user speech to text, and integration of speech tehnology with other modalities for information presentation and user input.

In order to achieve a best possible performance for this task, the speech recognition ought to be at the user’s computer. This will have the advantage that the speech recognizer can be adapted to the user’s voice and pronunciation, and in addition provide a situation where the speech signal is not band limited or noise corrupted by the transmission channel. The challenge is that the recognizer cannot be tailored for filling in one particular form, and thus it will be necessary to develop a general large vocabulary speech recognizer (a “dictation engine”) for Norwegian. The system should be able to define vocabulary and syntax dynamically by interpreting the content of the internet form.

Qualifications:
We seek highly motivated individuals holding a masters degree in electronics engineering, signal processing, statistics, natural language processing or other relevant disciplines. Experience in speech technology is desirable, but not an absolute requirement.


Information for both fellowships:

The PhD fellows will be associated with the Signal Processing Group at  NTNU and will work in a strong and active scientific environment.

Award holders at NTNU are normally appointed for up to 4 years with 25% of the time spent on specified work. This work is primarily linked to teaching and is usually divided so that a relatively large part is done in the first half of the period of the appointment.
The appointments is at code 1017, salary level 43-58 in the national salary scheme, gross NOK. 325.600 – 423.800 per annum (1 NOK ~ 0.125 EUR), and are normally remunerated at wage level 43 of which 2% is deducted for the State Pension scheme. The salary might be adjusted after negotiation with the employer to reflect the applicant’s experience.

Further information:
For more information on the SIRKUS position and the application requirements, please see http://www.iet.ntnu.no/projects/SIRKUS/Positions.html
For more information on the SMUDI position and the application requirements, please see http://www.iet.ntnu.no/projects/SMUDI/Positions.html

Interested candidates are encouraged to contact Professor Torbjørn Svendsen (phone: +47-735-92674, email: torbjorn@iet.ntnu.no) for further information.