Meta-Learner for Unknown Attribute Values Processing: Dealing with Inconsistency of Meta-Databases

IVAN BRUHA

首页> 外文期刊>Journal of Intelligent Information Systems >Meta-Learner for Unknown Attribute Values Processing: Dealing with Inconsistency of Meta-Databases

【24h】

Meta-Learner for Unknown Attribute Values Processing: Dealing with Inconsistency of Meta-Databases

机译：用于未知属性值处理的元学习器：处理元数据库的不一致

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Efficient robust data mining algorithms should comprise some routines for processing unknown (missing) attribute values when acquiring knowledge from real-world databases because these data usually contain a certain percentage of missing values. The paper Bruha and Franek (1996) figures out that each dataset has more or less its own 'favourite' routine for processing unknown attribute values. It evidently depends on the magnitude of noise and source of unknownness in each dataset. One possibility how to choose an efficient routine for processing unknown attribute values for a given database is exhibited in this paper. The covering machine learning algorithm CN4, a large extension of the well-known CN2 algorithm, is used here as an inductive vehicle. Each of the six routines for unknown attribute value processing (which are available in CN4) is used independently in order to process a given database. Afterwards, a meta-learner is used to derive a meta-classifier that makes up the overall (final) decision about the class of input unseen objects. The entire system is called a meta-combiner. The meta-database that is formed for the meta-learner could be inconsistent which could decrease the performance of the entire meta-classifier. Therefore, the existing meta-system (Meta-CN4) has been enhanced by a 'purification' procedure that appropriately solves up the conflict of inconsistent meta-data. The paper first surveys the CN4 algorithms including its six routines for unknown attribute value processing. Afterwards, it introduces the methodology of the meta-learner including its enhancement that solves inconsistent meta-databases. Finally, the results of experiments with various percentages of unknown attribute values on real-world data are presented and performances of the meta-classifier and the six base classifiers are then compared. The paper also explains the difference between the meta-combiner (meta-learner) described here and the cross-validation procedure used for obtaining the classification accuracy.

机译：有效的鲁棒数据挖掘算法应包括一些例程，用于在从实际数据库中获取知识时处理未知（缺失）属性值，因为这些数据通常包含一定百分比的缺失值。论文Bruha和Franek（1996）指出，每个数据集或多或少都有自己的“最喜欢的”例程来处理未知的属性值。显然，这取决于每个数据集中的噪声大小和未知源。本文展示了一种如何为给定数据库选择一种有效的例程来处理未知属性值的可能性。覆盖式机器学习算法CN4是众所周知的CN2算法的较大扩展，在这里用作感应车辆。用于未知属性值处理的六个例程（可在CN4中使用）分别独立使用，以便处理给定的数据库。之后，使用元学习器来得出元分类器，该元分类器构成了有关输入未见对象类别的整体（最终）决策。整个系统称为元合并器。为元学习者形成的元数据库可能不一致，这可能会降低整个元分类器的性能。因此，现有的元系统（Meta-CN4）已通过“纯化”过程得到了增强，该过程可以适当解决不一致的元数据的冲突。本文首先考察了CN4算法，其中包括用于未知属性值处理的六个例程。随后，它介绍了元学习器的方法，包括解决不一致的元数据库的增强功能。最后，给出了在真实数据上使用各种百分比的未知属性值的实验结果，然后比较了元分类器和六个基本分类器的性能。本文还解释了此处描述的元组合器（元学习器）与用于获得分类准确性的交叉验证过程之间的区别。

著录项

来源
《Journal of Intelligent Information Systems》 |2004年第1期|p.71-87|共17页
作者
IVAN BRUHA;
展开▼
作者单位

Department of Computing & Software, McMaster University, Hamilton, ON, Canada, L8S4K1;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类计算技术、计算机技术;
关键词
unknown attribute value processing; meta-learning; meta-combiner; meta-classifier; base classifiers;

机译：未知属性值处理;元学习;元合并器;元分类器;基础分类器;

相似文献

外文文献
中文文献
专利

1. Multiple attribute decision-making method for dealing with heterogeneous relationship among attributes and unknown attribute weight information under q-rung orthopair fuzzy environment [J] . Liu Zhengmin, Liu Peide, Liang Xia International journal of entelligent systems . 2018,第9期

机译：q-阶邻态对模糊环境下处理属性与未知属性权重信息异类关系的多属性决策方法
2. Consistency and an algorithm recognising inconsistency of realisations of a system of random discrete equations with two-valued unknowns [J] . A. V. Shapovalov Discrete mathematics and applications . 2008,第4期

机译：具有二值未知数的随机离散方程组的实现的一致性和识别不一致的算法
3. A data preprocessing mechanism based on processing attribute values and selecting attributes [J] . Mao Komori, Hidenao Abe, Yoshiaki Tachibana, 電子情報通信学会技術研究報告. 人工知能と知識処理. Artificial Intelligence and Knowledge Based Processing . 2000,第709期

机译：基于处理属性值和选择属性的数据预处理机制
4. Unknown Attribute Values Processing by Meta-learner [C] . Ivan Bruha International Symposium on Methodologies for Intelligent Systems . 2002

机译：Meta-Learner的未知属性值处理
5. Efficient storage and query processing of set-valued attributes. [D] . Ramasamy, Karthikeyan. 2001

机译：集值属性的有效存储和查询处理。
6. SOME INCONSISTENCIES IN DEALING WITH TUBERCULOSIS [O] . Francis George Curtis 1915

机译：处理肺结核的某些不确定性
7. Warming and salting in the western Mediterranean during the second half of the 20th century: inconsistencies, unknowns and the effect of data processing [O] . Manuel Vargas-Yáñez, Francina Moya, Elena Tel, 2009

机译：20世纪下半叶西地中海的变暖和腌制：不一致，未知和数据处理的影响

Meta-Learner for Unknown Attribute Values Processing: Dealing with Inconsistency of Meta-Databases

摘要

著录项

相似文献

相关主题

期刊订阅