首页> 外文会议>Database and Expert Systems Applications; Lecture Notes in Computer Science; 4080 >Multivariate Stream Data Classification Using Simple Text Classifiers
【24h】

Multivariate Stream Data Classification Using Simple Text Classifiers

机译:使用简单文本分类器的多变量流数据分类

获取原文
获取原文并翻译 | 示例

摘要

We introduce a classification framework for continuous multivariate stream data. The proposed approach works in two steps. In the preprocessing step, it takes as input a sliding window of multivariate stream data and discre-tizes the data in the window into a string of symbols that characterize the signal changes. In the classification step, it uses a simple text classification algorithm to classify the discretized data in the window. We evaluated both supervised and unsupervised classification algorithms. For supervised, we tested Naieve Bayes Model and SVM, and for unsupervised, we tested Jaccard, TFIDF, Jaro and JaroWinkler. In our experiments, SVM and TFIDF outperformed the other classification methods. In particular, we observed that classification accuracy is improved when the correlation of attributes is also considered along with the n-gram tokens of symbols.
机译:我们介绍了用于连续多元流数据的分类框架。拟议的方法分两个步骤进行。在预处理步骤中,它将多元流数据的滑动窗口作为输入,并将窗口中的数据离散化为表征信号变化的一串符号。在分类步骤中,它使用简单的文本分类算法对窗口中的离散化数据进行分类。我们评估了监督分类算法和非监督分类算法。在有监督的情况下,我们测试了Naieve Bayes模型和SVM;在无监督的情况下,我们测试了Jaccard,TFIDF,Jaro和JaroWinkler。在我们的实验中,SVM和TFIDF优于其他分类方法。尤其是,我们观察到,当属性的相关性与符号的n-gram标记一起考虑时,分类精度也会提高。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号