...
首页> 外文期刊>International Journal of Computer Trends and Technology >Pattern Discovery in Mixed Datausing Large Database
【24h】

Pattern Discovery in Mixed Datausing Large Database

机译:使用大型数据库的混合数据中的模式发现

获取原文
           

摘要

With the great progress of microelectronics and other relating information technologies, together with the still broadening applications of computers in a vast range of businesses and industries, large databases containing mixed-mode data are becoming quite commonplace. Today, large databases contain various modes of collected data related to different components of a complex real world system. Their use is not necessarily confined to classifications. Many of them may not have clearly-defined class labelsor even any explicit class information at all. Indeed, there are many different reasons todetermine or discover all patterns, to achieve any comprehensive analysis and understanding of the information within the data spaces. In the past, data mining or pattern discovery has by and large been developed fundamentally for categorical databases. All of the classification rules have been found from pre-labeled data samples. When mixed-mode data are processed, engineers naturally work on the class-dependence relationship to discretize the real data. Where class information is lacking, there is no suitable way to discover patterns within these mixed-type databases. Consequently, most important pattern analysis jobs -such as pattern clustering, or even pattern summarization -being developed for categorical data will not be easily applied to a mixed-mode database. To break this impasse is theobjective of this thesis. We have attempted to develop some pattern discovery methods for mixed-mode databases where classes or features are unavailable. Analyzing these mixed-modes of databases and providing researchers with helpful knowledge is a challenging task. Developing new ways to turn the raw data into useful knowledge is now a long-term challenge in the data mining community
机译:随着微电子学和其他相关信息技术的飞速发展,以及计算机在广泛的商业和行业中仍在不断扩展的应用,包含混合模式数据的大型数据库已变得相当普遍。如今,大型数据库包含与复杂的现实世界系统的不同组件相关的各种模式的收集数据。它们的使用不一定仅限于分类。他们中的许多人可能根本没有明确定义的类别标签,甚至根本没有任何明确的类别信息。确实,有许多不同的原因来确定或发现所有模式,以实现对数据空间内信息的全面分析和理解。过去,从根本上为分类数据库开发了数据挖掘或模式发现。所有分类规则均已从预先标记的数据样本中找到。处理混合模式数据时,工程师自然会处理类依赖关系以离散化实际数据。在缺少类信息的地方,没有合适的方法来发现这些混合类型数据库中的模式。因此,为分类数据开发的最重要的模式分析工作(例如模式聚类,甚至模式摘要)将不容易应用于混合模式数据库。打破这一僵局是本论文的目的。我们已经尝试为类别或功能不可用的混合模式数据库开发一些模式发现方法。分析数据库的这些混合模式并为研究人员提供有用的知识是一项艰巨的任务。开发将原始数据转化为有用知识的新方法现在是数据挖掘社区中的长期挑战

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号