Data scale reduction via instances summarization using the Rough Set Theory

Abstract

Currently, the major obstacle encountered when applying Data Mining algorithms to real-life data is the inability of these algorithms to handle very large data sets, such as those stored in industrial databases. Developing new algorithms that require less memory and processing time would certainly help to solve this problem. Here, however, we follow another route to a solution: reducing the size of the input data. In this article we present our new system, CFSumm, which is dedicated to data summarization, considered as a pre-processing step before the use of a Data Mining tool. The basic idea of this method is to summarize several sufficiently similar instances by a weighted pseudo-instance that can replace them in further processing. We explain how the α-Rough Set Theory framework allows great flexibility in the summarization process. We also present experimental results obtained on data of real-life size, which demonstrate the quality of the summary obtained and the high scalability of our method.
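
The idea of replacing several sufficiently similar instances by one weighted pseudo-instance can be illustrated with a minimal sketch. This is not the CFSumm algorithm, which the abstract does not specify. It assumes categorical attributes, a similarity defined as the fraction of matching attribute values, a threshold named `alpha` (borrowed loosely from the α in α-Rough Set Theory), and a greedy single-pass grouping. The names `similarity` and `summarize` are hypothetical.

```python
from typing import List, Sequence, Tuple

Instance = Tuple[str, ...]


def similarity(a: Sequence[str], b: Sequence[str]) -> float:
    """Fraction of attribute positions on which two instances agree.

    Assumes both instances have the same, non-zero number of attributes.
    """
    matches = sum(1 for x, y in zip(a, b) if x == y)
    return matches / len(a)


def summarize(instances: List[Instance], alpha: float) -> List[Tuple[Instance, int]]:
    """Greedy single pass (an illustrative assumption, not the paper's method).

    Each instance is merged into the first pseudo-instance whose
    representative is at least alpha-similar to it; otherwise it starts
    a new pseudo-instance with weight 1.
    """
    summary: List[list] = []  # each entry is [representative, weight]
    for inst in instances:
        for entry in summary:
            if similarity(entry[0], inst) >= alpha:
                entry[1] += 1  # absorb the instance by increasing the weight
                break
        else:
            summary.append([inst, 1])
    return [(rep, weight) for rep, weight in summary]


data = [
    ("red", "small", "round"),
    ("red", "small", "oval"),
    ("blue", "large", "square"),
    ("red", "small", "round"),
]
pseudo_instances = summarize(data, alpha=0.6)
```

In this sketch the weights always sum to the number of original instances, so a downstream tool that accepts instance weights could, in principle, work on the shorter summary. Raising `alpha` toward 1.0 merges only near-identical instances and gives a larger, more faithful summary; lowering it gives a smaller, coarser one.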


Bibliographic details

Source: Second International Conference on Data Mining, 2000, pp. 279-288 (10 pages)
Conference location: Cambridge (GB)
Authors: G. Gaumer; M. Quafafou
Author affiliation: Institut de Recherche en Informatique de Nantes, University of Nantes, France
Original format: PDF
Language of text: English
Chinese Library Classification: Social science research methods

