Auditing black-box models for indirect influence

Adler Philip; Falk Casey; Friedler Sorelle A.; Nix Tionney; Rybeck Gabriel; Scheidegger Carlos; Smith Brandon; Venkatasubramanian Suresh


Abstract

Data-trained predictive models see widespread use, but for the most part they are used as black boxes which output a prediction or score. It is therefore hard to acquire a deeper understanding of model behavior and in particular how different features influence the model prediction. This is important when interpreting the behavior of complex models or asserting that certain problematic attributes (such as race or gender) are not unduly influencing decisions. In this paper, we present a technique for auditing black-box models, which lets us study the extent to which existing models take advantage of particular features in the data set, without knowing how the models work. Our work focuses on the problem of indirect influence: how some features might indirectly influence outcomes via other, related features. As a result, we can find attribute influences even in cases where, upon further direct examination of the model, the attribute is not referred to by the model at all. Our approach does not require the black-box model to be retrained. This is important if, for example, the model is only accessible via an API, and contrasts our work with other methods that investigate feature influence such as feature selection. We present experimental evidence for the effectiveness of our procedure using a variety of publicly available data sets and models. We also validate our procedure using techniques from interpretable learning and feature selection, as well as against other black-box auditing procedures. To further demonstrate the effectiveness of this technique, we use it to audit a black-box recidivism prediction algorithm.
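The indirect-influence idea in the abstract can be illustrated with a minimal sketch. This is not the authors' procedure, which the abstract does not spell out. It assumes a hypothetical stand-in model `black_box` that reads only a proxy feature, synthetic data, and a simple within-group quantile-matching step (`obscure`) as one possible way to remove information about the audited attribute from the remaining features. All names and data here are illustrative assumptions.

```python
import numpy as np

def black_box(X):
    """Hypothetical opaque model: it reads only column 1 (the proxy)."""
    return (X[:, 1] > 0.0).astype(int)

def accuracy(model, X, y):
    """Fraction of rows where the model's prediction matches y."""
    return float(np.mean(model(X) == y))

def obscure(X, col):
    """Assumed obscuring step: make every other column carry no information
    about X[:, col] by mapping each value to the pooled quantile at its
    within-group rank, then blank out the audited column itself."""
    X_obs = X.copy()
    groups = X[:, col]
    for j in range(X.shape[1]):
        if j == col:
            continue
        pooled = X[:, j]
        for g in np.unique(groups):
            mask = groups == g
            ranks = X[mask, j].argsort().argsort()   # 0 .. m-1 within group
            q = (ranks + 0.5) / mask.sum()           # quantile in (0, 1)
            X_obs[mask, j] = np.quantile(pooled, q)  # same distribution per group
    X_obs[:, col] = 0.0
    return X_obs

# Synthetic data: a binary attribute and a noisy proxy that tracks it.
rng = np.random.default_rng(0)
n = 2000
attribute = rng.integers(0, 2, size=n)
proxy = (2 * attribute - 1) + 0.5 * rng.normal(size=n)
X = np.column_stack([attribute, proxy]).astype(float)
y = attribute  # the outcome depends on the attribute

baseline = accuracy(black_box, X, y)

# Direct check: blank out only the attribute column. The model never reads it.
X_direct = X.copy()
X_direct[:, 0] = 0.0
direct_drop = baseline - accuracy(black_box, X_direct, y)

# Indirect check: also strip the attribute's signal from the proxy column.
indirect_drop = baseline - accuracy(black_box, obscure(X, 0), y)

print("baseline accuracy:", baseline)
print("accuracy drop, direct:", direct_drop)
print("accuracy drop, indirect:", indirect_drop)
```

The model is only ever called, never retrained or inspected, which matches the API-only setting the abstract describes. In this toy setup the direct check reports no influence because the model ignores the attribute column, while the indirect check shows a large accuracy drop because the proxy carried the attribute's signal.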

Bibliographic details

Source
Knowledge and information systems | 2018, Issue 1 | 28 pages
Authors
Adler Philip; Falk Casey; Friedler Sorelle A.; Nix Tionney; Rybeck Gabriel; Scheidegger Carlos; Smith Brandon; Venkatasubramanian Suresh;
Author affiliations

Haverford Coll Dept Comp Sci Haverford PA 19041 USA;

Haverford Coll Dept Comp Sci Haverford PA 19041 USA;

Haverford Coll Dept Comp Sci Haverford PA 19041 USA;

Haverford Coll Dept Comp Sci Haverford PA 19041 USA;

Haverford Coll Dept Comp Sci Haverford PA 19041 USA;

Univ Arizona Dept Comp Sci Tucson AZ 85721 USA;

Haverford Coll Dept Comp Sci Haverford PA 19041 USA;

Univ Utah Dept Comp Sci Salt Lake City UT 84112 USA;

Indexing information
Original format: PDF
Language: English (eng)
Chinese Library Classification: automatic information theory;
Keywords
Black-box auditing; ANOVA; Algorithmic accountability; Deep learning; Discrimination-aware data mining; Feature influence; Interpretable machine learning;


