Identifying and typifying demographic unfairness in phoneme-level embeddings of self-supervised speech recognition models

Source: arxiv.org

Publisher summary (verbatim)

arXiv:2604.22631v1. Announce Type: new.

Abstract: Modern automatic speech recognition (ASR) systems have been observed to function better for certain speaker groups (SGs) than others, despite recent gains in overall performance. One potential impediment to progress towards fairer ASR is a more nuanced
