首页> 外文会议>International symposium on intelligent data analysis >Automatically Wrangling Spreadsheets into Machine Learning Data Formats
【24h】

Automatically Wrangling Spreadsheets into Machine Learning Data Formats

机译:自动将电子表格转换为机器学习数据格式

获取原文

摘要

To help automate the important pre-processing step in machine learning and data mining, we introduce SYNTH-A-SIZER, a tool for semi-automatically wrangling spreadsheets into attribute-value format, so that they can be used by popular machine learning tools, only requiring the user to mark cells belonging to one single example, synth-a-sizer is based on inductive programming principles. We introduce synth-a-sizer's transformations, search algorithm as well as a heuristic and distance measure for identifying types. We also report on a first experimental evaluation.
机译:为了帮助实现机器学习和数据挖掘中重要的预处理步骤的自动化,我们引入了SYNTH-A-SIZER,该工具可将电子表格半自动地整理为属性值格式,以便流行的机器学习工具可以使用它们, synth-a-sizer只需要用户标记属于一个示例的单元,它基于归纳编程原理。我们介绍了synth-a-sizer的转换,搜索算法以及用于识别类型的启发式和距离度量。我们还报告了首次实验评估。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号