论文标题

量化种族和性别特征确定商业面部识别算法的身份的程度

Quantifying the Extent to Which Race and Gender Features Determine Identity in Commercial Face Recognition Algorithms

论文作者

Howard, John J., Sirotin, Yevgeniy B., Tipton, Jerry L., Vemury, Arun R.

论文摘要

人脸特征可用于确定个人身份以及性别和种族等人口统计信息。但是,尽管政府和行业的部署越来越多,但黑盒商业面部识别算法(CFRAS)使用性别和种族特征在多大程度上使用性别和种族特征来确定身份。在这项研究中,我们量化了性别和种族特征影响不同人(即未分数得分)的面部识别相似性得分的程度。我们使用五个不同的CFRA和333个不同的测试受试者进行了这项研究。作为对照,我们比较了这些非拟合分布的行为与商业虹膜识别算法(CIRA)。确认了先前的工作,所有CFRA都为同一性别和种族的人提供了更高的相似性得分,这种效果称为“广泛同质性”。 CIRA未观察到这种影响。接下来,我们将主组件分析(PCA)应用于相似性评分矩阵。我们表明,通过性别和种族,CFRAS集群人的一些主要组成部分(PC),但大多数人没有。 PC中的人口聚类仅占CFRA总分差异的10%。 CIRA未观察到聚类。这表明,尽管CFRAS使用某些性别和种族特征来建立身份,但当前CFRA使用的大多数功能与性别和种族无关,类似于CIRA使用的虹膜纹理模式。最后,仅使用没有显示人口聚类的PC对相似性评分矩阵进行重建,从而降低了广泛的同质性效应,但也降低了配合分数和非序列分数之间的分离。这表明,CFRAS有可能在与性别和种族无关的特征上运作,尽管其识别精度较低,但这不是当前的商业实践。

Human face features can be used to determine individual identity as well as demographic information like gender and race. However, the extent to which black-box commercial face recognition algorithms (CFRAs) use gender and race features to determine identity is poorly understood despite increasing deployments by government and industry. In this study, we quantified the degree to which gender and race features influenced face recognition similarity scores between different people, i.e. non-mated scores. We ran this study using five different CFRAs and a sample of 333 diverse test subjects. As a control, we compared the behavior of these non-mated distributions to a commercial iris recognition algorithm (CIRA). Confirming prior work, all CFRAs produced higher similarity scores for people of the same gender and race, an effect known as "broad homogeneity". No such effect was observed for the CIRA. Next, we applied principal components analysis (PCA) to similarity score matrices. We show that some principal components (PCs) of CFRAs cluster people by gender and race, but the majority do not. Demographic clustering in the PCs accounted for only 10 % of the total CFRA score variance. No clustering was observed for the CIRA. This demonstrates that, although CFRAs use some gender and race features to establish identity, most features utilized by current CFRAs are unrelated to gender and race, similar to the iris texture patterns utilized by the CIRA. Finally, reconstruction of similarity score matrices using only PCs that showed no demographic clustering reduced broad homogeneity effects, but also decreased the separation between mated and non-mated scores. This suggests it's possible for CFRAs to operate on features unrelated to gender and race, albeit with somewhat lower recognition accuracy, but that this is not the current commercial practice.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源