Thu-Ses1-O2:
Physiology and Pathology of Spoken Language

This is the final program for this session. For oral sessions, the timing on the left is the current presentation order, but this may still change, so please check at the conference itself.
Time:Thursday 10:00 Place:201A Type:Oral
Chair:Francis Grenez
10:00Reliable tracking based on speech sample salience of vocal cycle length perturbations
Christophe Mertens (Université Libre de Bruxelles)
Francis Grenez (Université Libre de Bruxelles)
Lise Crevier-Buchman (Hôpital Européen Georges Pompidou, Paris, France)
Jean Schoentgen (National Fund for Scientific Research, Belgium)
The presentation concerns a method for tracking cycle lengths in voiced speech. The speech cycles are detected via the saliences of the speech signal samples, defined by the length of the temporal interval over which a sample is a maximum. The tracking of the cycle lengths is based on a dynamic programming algorithm which does not request that the signal is locally periodic and the average period length known a priori. The method has been validated on a corpus of normophonic speakers. The results report the tremor frequency and modulation depth of the vocal frequency of 72 ALS and 8 normophonic speakers.
10:20Longitudinal Changes of Selected Voice Source Parameters
Hideki Kasuya (Utsunomiya University)
Hajime Yoshida (Yoshida Clinic)
Satoshi Ebihara (Kyoundo Hospital)
Hiroki Mori (Utsunomiya University)
Longitudinal changes were investigated for selected voice source parameters: fundamental frequency (F0), jitter (period perturbation quotient, PPQ), shimmer (amplitude perturbation quotient, APQ) and glottal noise (normalized noise energy, NNE). Acoustic analyses were made on the sustained phonation of the Japanese vowel /a/ of 20 males and 38 females with no laryngeal disease, which were recorded over periods ranging from 10 to 18 years. The longitudinal change of the parameters was evaluated by the statistical t-test, revealing that: 1) strong individuality existed for significant longitudinal changes in the acoustic parameters, 2) F0 falling is a more typical tendency of vocal aging of females than males, while F0 rising, which has been pointed out in the previous reports for males, was not found, 3) shimmer is a more observable indication of vocal aging than jitter, and 4) glottal noise in a high-frequency region tends to increase with aging.
10:40Automatic perceptual categorization of disordered connected speech
Ali Alpan (Laboratory of Images, Signals & Telecommunication Devices, Université Libre de Bruxelles)
Jean Schoentgen (Laboratory of Images, Signals & Telecommunication Devices, Université Libre de Bruxelles)
Youri Maryn (Department of Otorhinolaryngology and Head & Neck Surgery, Department of Speech-Language Pathology and Audiology, Sint-Jan General Hospital)
Francis Grenez (Laboratory of Images, Signals & Telecommunication Devices, Université Libre de Bruxelles)
The objective of the presentation is to report experiments involving the automatic classification of disordered connected speech into binary (normal, pathological) or multiple (modal, moderately hoarse, severely hoarse) categories. The multi-category classification according to the perceived degree of hoarseness is considered to be clinically meaningful and desirable given that the reliable perceptual classification by humans of disordered voice stimuli is known to be difficult and time-consuming. The acoustic cues are temporal signal-to-dysperiodicity ratios as well as mel-frequency cepstral coefficients. The classifiers are support vector machines which have been trained and tested on two connected speech corpora. The binary classification accuracy has been high (98%) for both sets of acoustic cues. The multi-category classification accuracy has been 70% when based on signal-to-dysperiodicity ratios and 59% when based on mel-frequency cepstral coefficients.
11:00Kinematic Analysis of Tongue Movement Control in Spastic Dysarthria
Heejin Kim (Beckman Institute, University of Illinois, Urbana, USA)
Panying Rong (Department of Speech and Hearing, University of Illinois, Urbana, USA)
Torrey Loucks (Department of Speech and Hearing, University of Illinois, Urbana, USA)
Mark Hasegawa-Johnson (Department of Electrical and Computer Engineering, University of Illinois, Urbana, USA)
This study provided a quantitative analysis of the kinematic deviances in dysarthria associated with spastic cerebral palsy. Of particular interest were tongue tip movements during alveolar consonant release. Our analysis based on kinematic measures indicated that speakers with spastic dysarthria had a restricted range of articulation and disturbances in articulatory-voicing coordination. The degree of kinematic deviances was greater for lower intelligibility speakers, supporting an association between articulatory dysfunctions and intelligibility in spastic dysarthria.
11:20Pre- and short-term posttreatment vocal functioning in patients with advanced head and neck cancer treated with concomitant chemoradiotherapy
Irene Jacobi (Department of Head and Neck Oncology and Surgery, The Netherlands Cancer Institute, Amsterdam, The Netherlands)
Lisette van der Molen (Department of Head and Neck Oncology and Surgery, The Netherlands Cancer Institute, Amsterdam, The Netherlands)
Maya van Rossum (Department of Otorhinolaryngology, Leiden University Medical Centre, Leiden, The Netherlands)
Frans Hilgers (Department of Head and Neck Oncology and Surgery, The Netherlands Cancer Institute, Amsterdam, The Netherlands)
Forty-seven patients with advanced larynx/hypopharynx, nasopharynx or oropharynx/oral cavity cancer were recorded before, and 10 weeks after concomitant chemoradiotherapy (CCRT), to investigate the effect of the tumor versus the effects of treatment. To evaluate voice functioning before and after treatment, voice quality and glottal behavior of sustained /a/ vowels were analyzed acoustically and compared with patient-based data on cigarette and alcohol usage. Acoustic measures of effort, nasality and regularity, such as periodicity or harmonics-to-noise ratio, differed significantly and progressed differently in dependence of the 3 distinct cancer/radiation sites. Baseline measures of voice stability correlated significantly with alcohol/smoking behavior.
11:40Acoustic Analysis of Intonation in Parkinson’s Disease
Joan Ma (Queen Margaret University)
Ruediger Hoffmann (Dresden University of Technology)
The aim of this study was to explore the prosodic characteristics of speakers with Parkinson’s disease (PD) in the marking of intonations. Twenty-four German PD speakers with either mild or moderate degree of dysarthria were compared with twelve non-dysarthric control speakers on the production of imperatives, questions and statements. Acoustic analyses of fundamental frequency (average F0, F0 range and F0 envelop), intensity (average intensity, intensity range and intensity envelop) and speech rate (number of syllable per second) were conducted to investigate the effect of PD on intonation marking. The results showed that the dysarthric and non-dysarthric speakers differed significantly in all F0 measures, with higher average F0 and reduced F0 variability noted for the PD speakers. Although the PD speakers were more monotonous in prosody, they showed similar intonation contrasts between intonations as in non-dysarthric speakers.

top