Language Generalization Using Active Learning in the Context of Parkinson’s Disease Classification

Authors: Santiago Andrés Moreno Acevedo, C. D. Ríos, Juan Camilo Vasquez Correa, J. Rusz, E. Nöth, J. R. Orozco

Date: 04.09.2023


Abstract

Speech traits have enabled the evaluation and monitoring of the neurological state in different disorders, including Parkinson’s Disease (PD), using classical and deep learning approaches. Because speech contains paralinguistic information, the native language of the speaker influences the performance of the trained models when classifying the presence of the disease. Although researchers have performed several studies using corpora from different acoustic and language conditions, there is no baseline for the accuracy of a system that classifies PD in cross-language scenarios. This study evaluates the generalization capability of different classical and deep methods to discriminate between PD patients and healthy speakers. The experiments are performed in cross-language scenarios. In particular, an Active Learning (AL) strategy is considered to evaluate how the selection of training data influences the model’s performance under cross-language settings. The results indicate that models based on Wav2Vec 2.0 yielded the best results in detecting the presence of the disease in such non-controlled cross-language scenarios. In addition, AL-based selection outperformed random selection of training samples. The considered AL-based approach achieves high accuracy through careful, adaptive selection of training data. This is particularly important when dealing with non-annotated and limited data, as is the case in pathological speech modeling.
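
The abstract contrasts AL-based selection of training samples with random selection, but it does not state which query strategy was used. As an illustration only, the sketch below assumes least-confident (uncertainty) sampling, a common AL strategy: the recordings whose predicted PD probability is closest to 0.5 are picked for annotation. The function names and the probability values are hypothetical and are not taken from the paper.

```python
import random


def select_most_uncertain(probs, k):
    """Return indices of the k pool samples whose predicted probability
    of the positive class (PD) is closest to 0.5."""
    ranked = sorted(range(len(probs)), key=lambda i: abs(probs[i] - 0.5))
    return ranked[:k]


def select_random(n, k, seed=0):
    """Baseline: pick k of the n pool indices uniformly at random."""
    return random.Random(seed).sample(range(n), k)


# Hypothetical model outputs P(PD) for six unlabeled target-language recordings.
pool_probs = [0.97, 0.52, 0.10, 0.45, 0.80, 0.61]

queried = select_most_uncertain(pool_probs, k=2)   # AL-style selection
baseline = select_random(len(pool_probs), k=2)     # random selection
```

In a full AL loop, the queried recordings would be labeled, added to the training set, and the model retrained before the next selection round; the random baseline follows the same loop with `select_random` in place of the uncertainty-based query.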

BibTeX

@Article {
title = {Language Generalization Using Active Learning in the Context of Parkinson’s Disease Classification},
pages = {349--359},
keywds = {Active Learning; Cross Language; Deep Learning; Machine Learning; Parkinson’s Disease; Speech Processing},
abstract = {Speech traits have enabled the evaluation and monitoring of the neurological state in different disorders, including Parkinson’s Disease (PD), using classical and deep learning approaches. Because speech contains paralinguistic information, the native language of the speaker influences the performance of the trained models when classifying the presence of the disease. Although researchers have performed several studies using corpora from different acoustic and language conditions, there is no baseline for the accuracy of a system that classifies PD in cross-language scenarios. This study evaluates the generalization capability of different classical and deep methods to discriminate between PD patients and healthy speakers. The experiments are performed in cross-language scenarios. In particular, an Active Learning (AL) strategy is considered to evaluate how the selection of training data influences the model’s performance under cross-language settings. The results indicate that models based on Wav2Vec 2.0 yielded the best results in detecting the presence of the disease in such non-controlled cross-language scenarios. In addition, AL-based selection outperformed random selection of training samples. The considered AL-based approach achieves high accuracy through careful, adaptive selection of training data. This is particularly important when dealing with non-annotated and limited data, as is the case in pathological speech modeling.},
isbn = {978-303140497-9},
date = {2023-09-04},
}