Cross-Lingual Transferability of Voice Analysis Models: a Parkinson’s Disease Case Study

Authors

  • Claudio Ferrante Dipartimento di Elettronica Informazione e Bioingegneria, Politecnico di Milano, Italy
  • Vincenzo Scotti Dipartimento di Elettronica Informazione e Bioingegneria, Politecnico di Milano, Italy https://orcid.org/0000-0002-8765-604X

DOI:

https://doi.org/10.17469/O2111AISV000007

Keywords:

Natural Language Processing, Deep Learning, Speech Analysis, Parkinson's Disease, Domain Adaptation, Multilingual

Abstract

Traditionally, speech analysis has always relied on a set of very informative features like (Mel) spectrogram, Mel Frequency Cepstral Coefficients (MFCC), pitch or intensity to build speech powered applications. Recently, deep learning-based models for the extraction of acoustic features have allowed significantly improving the state of the art in many speech-related applications. With this work, we focus the analysis on the cross-lingual transferability of speech analysis features. The idea is to understand whether and how well a classification model trained on speech features in a source language works on an unseen target language. We evaluate these properties analysing models for Parkinson's disease detection from speech, adapting the models from English to Telugu. Results show that multi-lingual pre-trained deep learning-based features do not require explicit adaptation and work well out-of-the-box. Differently, models not adapting out-of-the-box respond well even to unsupervised adaptation on a small data set.

Downloads

Published

29-12-2023

Similar Articles

<< < 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 > >> 

You may also start an advanced similarity search for this article.