...
首页> 外文期刊>The Knowledge Engineering Review >Using automated planning for improving data mining processes
【24h】

Using automated planning for improving data mining processes

机译:使用自动化计划来改善数据挖掘流程

获取原文
获取原文并翻译 | 示例
           

摘要

This paper presents a distributed architecture for automating data mining (DM) processes using standard languages. DM is a difficult task that relies on an exploratory and analytic process of processing large quantities of data in order to discover meaningful patterns. The increasing heterogeneity and complexity of available data requires some expert knowledge on how to combine the multiple and alternative DM tasks to process the data. Here, we describe DM tasks in terms of Automated Planning, which allows us to automate the DM knowledge flow construction. The work is based on the use of standards that have been defined in both DM and automated-planning communities. Thus, we use PMML (Predictive Model Markup Language) to describe DM tasks. From the PMML, a problem description in PDDL (Planning Domain Definition Language) can be generated, so any current planning system can be used to generate a plan. This plan is, again, translated to a DM workflow description, Knowledge Flow for Machine Learning format (Knowledge Flow file for the WEKA (Waikato Environment for Knowledge Analysis) tool), so the plan or DM workflow can be executed in WEKA.
机译:本文提出了一种使用标准语言来自动化数据挖掘(DM)流程的分布式体系结构。 DM是一项艰巨的任务,它依赖于探索和分析过程来处理大量数据以发现有意义的模式。可用数据的异质性和复杂性日益增加,需要一些专家知识,以了解如何结合多个和替代的DM任务来处理数据。在这里,我们根据自动化计划来描述DM任务,这使我们能够自动化DM知识流的构建。这项工作基于在DM和自动规划社区中定义的标准的使用。因此,我们使用PMML(预测模型标记语言)来描述DM任务。根据PMML,可以生成PDDL(规划域定义语言)中的问题描述,因此可以使用任何当前的规划系统来生成计划。再次将该计划翻译为DM工作流描述,即机器学习的知识流格式(WEKA(知识分析的Waikato环境)工具的知识流文件),因此可以在WEKA中执行该计划或DM工作流。

著录项

  • 来源
    《The Knowledge Engineering Review》 |2013年第2期|157-173|共17页
  • 作者单位

    Departamento de InformAtica, Universidad Carlos Ⅲ de Madrid, Avd. de la Universidad 30, 28911 Leganes (Madrid), Spain;

    Departamento de InformAtica, Universidad Carlos Ⅲ de Madrid, Avd. De la Universidad 30, 28911 Leganes (Madrid), Spain;

    Departamento de InformAtica, Universidad Carlos Ⅲ de Madrid, Avd. De la Universidad 30, 28911 Leganes (Madrid), Spain;

    Departamento de InformAtica, Universidad Carlos Ⅲ de Madrid, Avd. De la Universidad 30, 28911 Leganes (Madrid), Spain;

    Departamento de InformAtica, Universidad Carlos Ⅲ de Madrid, Avd. De la Universidad 30, 28911 Leganes (Madrid), Spain;

    Departamento de InformAtica, Universidad Carlos Ⅲ de Madrid, Avd. De la Universidad 30, 28911 Leganes (Madrid), Spain;

    Ericsson Espana, S.A.U, Madrid R&D Center, Technology & Innovation, C/ Via de los Poblados 13, 28013 Madrid, Spain;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号