| Time: | Thursday 10:00 | Place: | 201A | Type: | Oral |
| Chair: | Francis Grenez | ||||
| 10:00 | Reliable tracking based on speech sample salience of vocal cycle length perturbations |
| (Université Libre de Bruxelles) (Université Libre de Bruxelles) (Hôpital Européen Georges Pompidou, Paris, France) (National Fund for Scientific Research, Belgium) | |
| The presentation concerns a method for tracking cycle lengths in voiced speech. The speech cycles are detected via the saliences of the speech signal samples, defined by the length of the temporal interval over which a sample is a maximum. The tracking of the cycle lengths is based on a dynamic programming algorithm which does not request that the signal is locally periodic and the average period length known a priori. The method has been validated on a corpus of normophonic speakers. The results report the tremor frequency and modulation depth of the vocal frequency of 72 ALS and 8 normophonic speakers. | |
| 10:20 | Longitudinal Changes of Selected Voice Source Parameters |
| (Utsunomiya University) (Yoshida Clinic) (Kyoundo Hospital) (Utsunomiya University) | |
| Longitudinal changes were investigated for selected voice source parameters: fundamental frequency (F0), jitter (period perturbation quotient, PPQ), shimmer (amplitude perturbation quotient, APQ) and glottal noise (normalized noise energy, NNE). Acoustic analyses were made on the sustained phonation of the Japanese vowel /a/ of 20 males and 38 females with no laryngeal disease, which were recorded over periods ranging from 10 to 18 years. The longitudinal change of the parameters was evaluated by the statistical t-test, revealing that: 1) strong individuality existed for significant longitudinal changes in the acoustic parameters, 2) F0 falling is a more typical tendency of vocal aging of females than males, while F0 rising, which has been pointed out in the previous reports for males, was not found, 3) shimmer is a more observable indication of vocal aging than jitter, and 4) glottal noise in a high-frequency region tends to increase with aging. | |
| 10:40 | Automatic perceptual categorization of disordered connected speech |
| (Laboratory of Images, Signals & Telecommunication Devices, Université Libre de Bruxelles) (Laboratory of Images, Signals & Telecommunication Devices, Université Libre de Bruxelles) (Department of Otorhinolaryngology and Head & Neck Surgery, Department of Speech-Language Pathology and Audiology, Sint-Jan General Hospital) (Laboratory of Images, Signals & Telecommunication Devices, Université Libre de Bruxelles) | |
| The objective of the presentation is to report experiments involving the automatic classification of disordered connected speech into binary (normal, pathological) or multiple (modal, moderately hoarse, severely hoarse) categories. The multi-category classification according to the perceived degree of hoarseness is considered to be clinically meaningful and desirable given that the reliable perceptual classification by humans of disordered voice stimuli is known to be difficult and time-consuming. The acoustic cues are temporal signal-to-dysperiodicity ratios as well as mel-frequency cepstral coefficients. The classifiers are support vector machines which have been trained and tested on two connected speech corpora. The binary classification accuracy has been high (98%) for both sets of acoustic cues. The multi-category classification accuracy has been 70% when based on signal-to-dysperiodicity ratios and 59% when based on mel-frequency cepstral coefficients. | |
| 11:00 | Kinematic Analysis of Tongue Movement Control in Spastic Dysarthria |
| (Beckman Institute, University of Illinois, Urbana, USA) (Department of Speech and Hearing, University of Illinois, Urbana, USA) (Department of Speech and Hearing, University of Illinois, Urbana, USA) (Department of Electrical and Computer Engineering, University of Illinois, Urbana, USA) | |
| This study provided a quantitative analysis of the kinematic deviances in dysarthria associated with spastic cerebral palsy. Of particular interest were tongue tip movements during alveolar consonant release. Our analysis based on kinematic measures indicated that speakers with spastic dysarthria had a restricted range of articulation and disturbances in articulatory-voicing coordination. The degree of kinematic deviances was greater for lower intelligibility speakers, supporting an association between articulatory dysfunctions and intelligibility in spastic dysarthria. | |
| 11:20 | Pre- and short-term posttreatment vocal functioning in patients with advanced head and neck cancer treated with concomitant chemoradiotherapy |
| (Department of Head and Neck Oncology and Surgery, The Netherlands Cancer Institute, Amsterdam, The Netherlands) (Department of Head and Neck Oncology and Surgery, The Netherlands Cancer Institute, Amsterdam, The Netherlands) (Department of Otorhinolaryngology, Leiden University Medical Centre, Leiden, The Netherlands) (Department of Head and Neck Oncology and Surgery, The Netherlands Cancer Institute, Amsterdam, The Netherlands) | |
| Forty-seven patients with advanced larynx/hypopharynx, nasopharynx or oropharynx/oral cavity cancer were recorded before, and 10 weeks after concomitant chemoradiotherapy (CCRT), to investigate the effect of the tumor versus the effects of treatment. To evaluate voice functioning before and after treatment, voice quality and glottal behavior of sustained /a/ vowels were analyzed acoustically and compared with patient-based data on cigarette and alcohol usage. Acoustic measures of effort, nasality and regularity, such as periodicity or harmonics-to-noise ratio, differed significantly and progressed differently in dependence of the 3 distinct cancer/radiation sites. Baseline measures of voice stability correlated significantly with alcohol/smoking behavior. | |
| 11:40 | Acoustic Analysis of Intonation in Parkinson’s Disease |
| (Queen Margaret University) (Dresden University of Technology) | |
| The aim of this study was to explore the prosodic characteristics of speakers with Parkinson’s disease (PD) in the marking of intonations. Twenty-four German PD speakers with either mild or moderate degree of dysarthria were compared with twelve non-dysarthric control speakers on the production of imperatives, questions and statements. Acoustic analyses of fundamental frequency (average F0, F0 range and F0 envelop), intensity (average intensity, intensity range and intensity envelop) and speech rate (number of syllable per second) were conducted to investigate the effect of PD on intonation marking. The results showed that the dysarthric and non-dysarthric speakers differed significantly in all F0 measures, with higher average F0 and reduced F0 variability noted for the PD speakers. Although the PD speakers were more monotonous in prosody, they showed similar intonation contrasts between intonations as in non-dysarthric speakers. |