Top-Rank-k Frequent Pattern Mining Algorithm Based on a Traditional Chinese Medicine Prescription Database

QIN Qibing; TAN Long

Abstract

To reduce the dependence on empirical parameters when mining frequent patterns from Traditional Chinese Medicine (TCM) prescriptions, and to improve the accuracy of the mining results, an efficient Top-Rank-k frequent pattern mining algorithm based on a Weighted Undirected Graph (WUG) is proposed, tailored to the characteristics of TCM prescription data. The algorithm mines frequent k-itemsets (k ≥ 3) directly, without first generating 1-itemsets and 2-itemsets, and then quickly backtracks to the prescriptions that correspond to the frequent itemsets of core drug combinations. In addition, a Dynamic Bit Vector (DBV) compression mechanism is used to store the edge weights of the undirected graph, which improves the algorithm's space efficiency. Experiments were conducted on TCM prescription datasets, real datasets (Chess, Pumsb and Retail) and synthetic datasets (T10I4D100K and Test2K50KD1). The results show that, compared with iNTK (improved Node-list Top-Rank-K) and BTK (B-list Top-Rank-K), the proposed algorithm is more efficient in both time and space, and that it can also be applied to other types of datasets.
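
The ideas in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: this record gives no pseudocode, so the function names (`to_dbv`, `dbv_and`, `build_graph`, `top_rank_k_triples`), the single-letter drug labels, and the brute-force enumeration of 3-itemsets are all assumptions made for illustration. The sketch assumes that each edge of the weighted undirected graph records which prescriptions contain both drugs, stored as a dynamic bit vector in the form usually described for DBV (the index of the first non-zero byte plus the bytes left after trimming zero bytes at both ends), and that Top-Rank-k means keeping every itemset whose support is among the k largest distinct support values.

```python
from collections import defaultdict
from itertools import combinations


def to_dbv(tids):
    """Encode prescription IDs as (pos, data): a bit vector with zero bytes trimmed."""
    if not tids:
        return (0, b"")
    buf = bytearray(max(tids) // 8 + 1)
    for t in tids:
        buf[t // 8] |= 1 << (t % 8)
    first = next(i for i, b in enumerate(buf) if b)
    # The last byte holds max(tids), so only leading zero bytes need trimming here.
    return (first, bytes(buf[first:]))


def dbv_and(a, b):
    """Intersect two DBVs, touching only the byte range where they overlap."""
    (pa, da), (pb, db) = a, b
    start = max(pa, pb)
    end = min(pa + len(da), pb + len(db))
    if start >= end:
        return (0, b"")
    out = bytearray(da[i - pa] & db[i - pb] for i in range(start, end))
    lo, hi = 0, len(out)
    while lo < hi and out[lo] == 0:
        lo += 1
    while hi > lo and out[hi - 1] == 0:
        hi -= 1
    return (start + lo, bytes(out[lo:hi])) if lo < hi else (0, b"")


def dbv_count(v):
    """Number of set bits, i.e. the support (edge weight) the vector encodes."""
    return sum(bin(byte).count("1") for byte in v[1])


def dbv_tids(v):
    """Decode a DBV back to prescription IDs (the 'backtracking' step)."""
    pos, data = v
    return [(pos + i) * 8 + bit
            for i, byte in enumerate(data)
            for bit in range(8) if byte >> bit & 1]


def build_graph(prescriptions):
    """Map each drug pair (edge) to the DBV of prescriptions containing both drugs."""
    pair_tids = defaultdict(set)
    for tid, drugs in enumerate(prescriptions):
        for a, b in combinations(sorted(drugs), 2):
            pair_tids[(a, b)].add(tid)
    return {edge: to_dbv(tids) for edge, tids in pair_tids.items()}


def top_rank_k_triples(prescriptions, rank_k):
    """Return 3-itemsets whose support is among the rank_k highest distinct supports."""
    graph = build_graph(prescriptions)
    drugs = sorted({d for p in prescriptions for d in p})
    supports = {}
    for a, b, c in combinations(drugs, 3):
        if (a, b) in graph and (b, c) in graph:
            # tids(a,b) AND tids(b,c) are exactly the prescriptions with a, b and c.
            sup = dbv_count(dbv_and(graph[(a, b)], graph[(b, c)]))
            if sup:
                supports[(a, b, c)] = sup
    kept = sorted(set(supports.values()), reverse=True)[:rank_k]
    return {items: s for items, s in supports.items() if s in kept}


# Hypothetical toy data: five prescriptions over four placeholder drugs.
PRESCRIPTIONS = [
    {"A", "B", "C"},
    {"A", "B", "C", "D"},
    {"A", "B", "D"},
    {"B", "C", "D"},
    {"A", "B", "C"},
]

if __name__ == "__main__":
    print(top_rank_k_triples(PRESCRIPTIONS, 2))
```

In this toy setting, `top_rank_k_triples(PRESCRIPTIONS, 2)` keeps the 3-itemsets at the two highest distinct support values, so itemsets tied at the same rank are all returned and no minimum-support threshold has to be chosen. The 3-itemset supports are computed from edge vectors alone, which mirrors the abstract's point about skipping 1-itemsets and 2-itemsets, and `dbv_tids` recovers the matching prescription IDs from the intersected vector.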

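Bibliographic details

Source
Journal of Computer Applications (《计算机应用》) | 2017, Issue 2 | pp. 329-334 | 6 pages
Authors
QIN Qibing; TAN Long
Affiliations

School of Computer Science and Technology, Heilongjiang University, Harbin 150080;

Key Laboratory of Database and Parallel Computing of Heilongjiang Province (Heilongjiang University), Harbin 150080

Original format PDF
Language Chinese
CLC classification TP311.13
Keywords
TCM prescription; Top-Rank-k; frequent pattern; weighted undirected graph; dynamic bit vector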

