首页> 外文会议>Conference on empirical methods in natural language processing >Men Also Like Shopping: Reducing Gender Bias Amplification using Corpus-level Constraints
【24h】

Men Also Like Shopping: Reducing Gender Bias Amplification using Corpus-level Constraints

机译:男性也喜欢购物:使用语料库级约束来减少性别偏见的扩大

获取原文

摘要

Language is increasingly being used to define nch visual recognition problems with supporting image collections sourced from the web. Structured prediction models are used in these tasks to take advantage of correlations between co-occurring labels and visual input but risk inadvertently encoding social biases found in web corpora. In this work, we study data and models associated with multilabel object classification and visual semantic role labeling We find that (a) datasets for these tasks contain significant gender bias and (b) models trained on these datasets further amplify existing bias. For example, the activity cooking is over 33% more likely to involve females than males in a training set, and a trained model further amplifies the disparity to 68% at test time. We propose to inject corpus-level constraints for calibrating existing structured prediction models and design an algorithm based on Lagrangian relaxation for collective inference. Our method results in almost no performance loss for the underlying recognition task but decreases the magnitude of bias amplification by 47.5% and 40.5% for multilabel classification and visual semantic role labeling, respectively.
机译:语言越来越多地用于定义ch视觉识别问题,并支持来自网络的图像集。在这些任务中使用结构化的预测模型,以利用共同出现的标签和视觉输入之间的相关性,但可能会无意中编码在Web语料库中发现的社会偏见。在这项工作中,我们研究了与多标签对象分类和视觉语义角色标签相关的数据和模型。我们发现(a)这些任务的数据集包含明显的性别偏见,并且(b)在这些数据集上训练的模型进一步放大了现有偏见。例如,在训练集中进行活动烹饪的女性比男性参与运动的可能性高出男性33%以上,并且经过训练的模型在测试时将差异进一步扩大到68%。我们建议注入语料库级约束以校准现有的结构化预测模型,并设计基于拉格朗日松弛的算法进行集体推理。我们的方法几乎不会导致基础识别任务的性能损失,但是对于多标签分类和视觉语义角色标签,偏差放大的幅度分别降低了47.5%和40.5%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号