首页> 外文会议>Machine learning >Exploiting the Omission of Irrelevant Data
【24h】

Exploiting the Omission of Irrelevant Data

机译:利用无关数据的遗漏

获取原文
获取原文并翻译 | 示例

摘要

Most learning algorithms work most effectively when their training data contain completely specified labeled samples. In many diagnostic tasks, however, the data will include the values of only some of the attributes; we model this as a blocking process that hides the values of those attributes from the learner. While blockers that remove the values of critical attributes can handicap a learner, this paper instead focuses on blockers that remove only irrelevant attribut ,al-ues, i.e., values that are not needed to classify an instance, given the values of the other unblocked attributes. We first motivate and formalize this model of 'superfluous-value blocking", and then demonstrate that these omissions can be usefub by proving that certain classes that seem hard to learn in the general PAC model - viz., decision trees and DNF formulae - are trivial to learn in this setting. We also show that this model can be extended to deal with (1) theory revision (i.e., modifying an existing formula); (2) blockers that occasionally include superfluous values or exclude required values; and (3) other corruptions of the training data.
机译:当大多数学习算法的训练数据包含完全指定的带标签样本时,它们将最有效地工作。但是,在许多诊断任务中,数据将仅包含某些属性的值;我们将此建模为一个阻塞过程,该过程对学习者隐藏了这些属性的值。尽管删除关键属性值的阻止程序可能会妨碍学习者,但本文着重于仅删除无关属性的阻止程序,即给定其他未阻塞属性的值时,无需对实例进行分类的值。我们首先激励并正式化“多余值阻止”模型,然后通过证明在通用PAC模型中似乎难以学习的某些类(即决策树和DNF公式)可以证明这些遗漏是有用的我们还显示该模型可以扩展为处理(1)理论修订(即,修改现有公式);(2)偶尔包含多余值或排除所需值的阻止器;以及(3) )训练数据的其他损坏。

著录项

  • 来源
    《Machine learning》|1996年|216-224|共9页
  • 会议地点 Bari(IT);Bari(IT)
  • 作者单位

    Siemens Corporate Research 755 College Road East Princeton, NJ 08540-6632;

    NEC Research Institute 4 Independence Way Princeton, NJ 08540;

    Rutgers University Faculty of Management Newark, NJ 07102/RUTCOR New Brunswick, NJ 08903;

  • 会议组织
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 计算机的应用;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号