首页> 中文期刊> 《情报学报》 >基于基序及其时序关系的耦合流数据分类算法

基于基序及其时序关系的耦合流数据分类算法

         

摘要

Currently, coupled stream data classification is a very popular topic in data mining and information science, which has been attracted more and more domestic and abroad scholars. However, most of the existing research results are based on the feature extraction and classification from the single stream of data, and the dependency relations among the features within and across the streams are not taken into account. Due to this situation, referring to searching motif methods of bioinformatics, a classifying method applying long - run frequency and inverse document frequency is presented in this research. This method converts every input stream of the coupled stream data into a signal variation to extract the motif effectively. By calculating the frequency of the motif, the long - run frequency and the weight of inverse document frequency, the temporal relationships among the motifs of the input stream data can be approached, then the results can be used to classify the coupled stream data. The simulation results prove the effectiveness of the method.%耦合流数据分类问题是当前数据挖掘与信息领域的热点和难点,引起国内外越来越多学者的关注,但现有研究成果大多依赖于从单个流数据中提取特征并进行分类,没有考虑到流数据内以及流数据间特征的相互依赖关系.基于此,借鉴生物信息学中基序查找的方法,本文提出了长期频率和逆文档频率的分类方法,该方法主要是将耦合流数据中每个输入流都转化为信号变化特征,以便有效地提取基序,通过计算基序的频率、长期频率与逆文档频率的权重,用以衡量不同输入耦合流数据的基序之间的时序关系,并利用基序与时序的关系实现对耦合流数据的分类,仿真实验的结果也证明了该方法的有效性.

著录项

相似文献

  • 中文文献
  • 外文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号