Gender bias in voice recognition
An i- and x-vector-based gender-specific automatic speaker recognition study
DOI:
https://doi.org/10.17469/O2108AISV000006
Keywords:
speaker recognition, i-vectors, x-vectors, gender-difference, speaker-embeddings
Abstract
Physiological differences between adult males and females lead to acoustic differences in speech production. This acoustic variability between the genders affects automatic speech processing applications, especially automatic speaker recognition systems. This paper studies how state-of-the-art automatic speaker recognition algorithms, such as i- and x-vector systems, perform for each gender: the algorithms are trained on a gender-balanced multilingual dataset and tested on gender-separated data from two languages (English and Mandarin). Furthermore, the distributions of the resulting high-dimensional i- and x-vector speaker embeddings are analysed using the t-SNE technique. The area over which the speaker embeddings are distributed helps to interpret the speaker recognition performance of both algorithms.
License
Copyright (c) 2021 AISV - Associazione Italiana di Scienze della Voce

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.