具有身份信息数据增强方法的文本分类任务的公平性

论文标题

具有身份信息数据增强方法的文本分类任务的公平性

Fairness for Text Classification Tasks with Identity Information Data Augmentation Methods

论文作者

Wadhwa, Mohit, Bhambhani, Mohan, Jindal, Ashvini, Sawant, Uma, Madhavan, Ramanujam

论文摘要

反事实公平方法解决了以下问题：如果文本实例中引用的敏感身份属性不同，预测将如何变化？这些方法完全基于为给定培训和测试集实例生成反事实。反事实实例通常是通过替换敏感的身份项（即，在实例中存在的身份术语替换为属于相同敏感类别的其他身份项）来制备的。因此，这些方法的功效在很大程度上取决于身份对的质量和全面性。在本文中，我们提供了一个两步的数据增强过程，（1）以前的阶段由一种新的方法组成，用于准备与单词嵌入的身份对列表的全面列表，（2）后者包括利用准备好的身份对列表，以增强培训实例，以通过应用三个简单的身份替代品，身份置于身份，身份验证，身份置于身份，身份配对，以及配对既有既有身份，又配对既有既有既有。我们从经验上表明，两阶段的增强过程会导致不同的身份对和增强的训练集，并在两个众所周知的文本分类任务上改善了基于代币的公平度量评分。

Counterfactual fairness methods address the question: How would the prediction change if the sensitive identity attributes referenced in the text instance were different? These methods are entirely based on generating counterfactuals for the given training and test set instances. Counterfactual instances are commonly prepared by replacing sensitive identity terms, i.e., the identity terms present in the instance are replaced with other identity terms that fall under the same sensitive category. Therefore, the efficacy of these methods depends heavily on the quality and comprehensiveness of identity pairs. In this paper, we offer a two-step data augmentation process where (1) the former stage consists of a novel method for preparing a comprehensive list of identity pairs with word embeddings, and (2) the latter consists of leveraging prepared identity pairs list to enhance the training instances by applying three simple operations (namely identity pair replacement, identity term blindness, and identity pair swap). We empirically show that the two-stage augmentation process leads to diverse identity pairs and an enhanced training set, with an improved counterfactual token-based fairness metric score on two well-known text classification tasks.

下载PDF全文

下载文献需遵守相关版权规定

论文标题