论文标题

种族分布在培训数据中对面部识别偏见的影响:仔细观察

The Impact of Racial Distribution in Training Data on Face Recognition Bias: A Closer Look

论文作者

Kolla, Manideep, Savadamuthu, Aravinth

论文摘要

面部识别算法在现实世界中使用时可能非常有用,但是当偏向某些人口统计学时,它们也可能是危险的。因此,必须了解如何训练这些算法以及哪些因素会影响其准确性和公平性以建立更好的因素。在这项研究中,我们阐明了种族分布在训练数据中对面部识别模型性能的影响。我们进行了16种不同的实验,培训数据中面部的种族分布不同。我们使用准确度指标,聚类指标,UMAP预测,面部质量和决策阈值分析了这些训练有素的模型。我们表明,仅训练数据集中种族的统一分布并不能保证无偏见的面部识别算法以及诸如面部图像质量之类的因素如何发挥至关重要的作用。我们还研究了聚类指标与偏见之间的相关性,以了解聚类是否是偏见的良好指标。最后,我们介绍了一个名为“种族等级”的指标,以研究面部特征的间和种族内相关性,以及它们如何影响面部识别模型的学习能力。通过这项研究,我们试图将更多了解面部识别训练的基本要素(数据)带入数据。更好地理解培训数据对面部识别算法偏见的影响将有助于创建更好的数据集,进而有助于更好的面部识别系统。

Face recognition algorithms, when used in the real world, can be very useful, but they can also be dangerous when biased toward certain demographics. So, it is essential to understand how these algorithms are trained and what factors affect their accuracy and fairness to build better ones. In this study, we shed some light on the effect of racial distribution in the training data on the performance of face recognition models. We conduct 16 different experiments with varying racial distributions of faces in the training data. We analyze these trained models using accuracy metrics, clustering metrics, UMAP projections, face quality, and decision thresholds. We show that a uniform distribution of races in the training datasets alone does not guarantee bias-free face recognition algorithms and how factors like face image quality play a crucial role. We also study the correlation between the clustering metrics and bias to understand whether clustering is a good indicator of bias. Finally, we introduce a metric called racial gradation to study the inter and intra race correlation in facial features and how they affect the learning ability of the face recognition models. With this study, we try to bring more understanding to an essential element of face recognition training, the data. A better understanding of the impact of training data on the bias of face recognition algorithms will aid in creating better datasets and, in turn, better face recognition systems.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源