Gender Neutralisation for Unbiased Speech Synthesising

Rizhinashvili, Davit; Sham, Abdallah; Anbarjafari, Gholamreza

doi:10.3390/electronics11101594

Rizhinashvili D., Sham A. H., Anbarjafari G.

ELECTRONICS, vol.11, no.10, 2022 (SCI-Expanded)

Publication Type: Article / Full Article
Volume: 11 Issue: 10
Publication Date: 2022
DOI Number: 10.3390/electronics11101594
Journal Name: ELECTRONICS
Journal Indexes: Science Citation Index Expanded (SCI-EXPANDED), Scopus, Aerospace Database, Communication Abstracts, INSPEC, Metadex, Directory of Open Access Journals, Civil Engineering Abstracts
Keywords: responsible AI, speech analysis, emotion recognition, gender bias
Affiliated with Yıldız Teknik Üniversitesi: No

Abstract

Machine learning can encode and amplify negative biases or stereotypes already present in humans, resulting in high-profile cases of bias. The bias in these algorithms can come from several sources, such as errors in human labelling, inaccurate representation of different population groups in training datasets, and the chosen model structures and optimisation methods. This paper proposes a novel approach to speech processing that addresses the gender bias problem by eliminating the gender parameter. To this end, we devised a system that transforms the input sound (a person's speech) into a neutralised voice, to the point where the speaker's gender becomes indistinguishable to both humans and AI. A Wav2Vec-based network was used to perform speech gender recognition in order to validate the main claim of this work, namely the neutralisation of gender in speech. Such a system can be used as a batch pre-processing layer for training models, making the associated gender bias irrelevant. It can also be applied where gender bias on the part of human listeners is prominent, since the listener will not be able to judge the speaker's gender from the speech.