首页> 外文期刊>Knowledge-Based Systems >Outlier detection on uncertain data based on local information
【24h】

Outlier detection on uncertain data based on local information

机译:基于本地信息的不确定数据的异常值检测

获取原文
获取原文并翻译 | 示例
           

摘要

Based on local information: local density and local uncertainty level, a new outlier detection algorithm is designed in this paper to calculate uncertain local outlier factor (ULOF) for each point in an uncertain dataset. In this algorithm, all concepts, definitions and formulations for conventional local outlier detection approach (LOF) are generalized to include uncertainty information. The least squares algorithm on multi-times curve fitting is used to generate an approximate probability density function of distance between two points. An iteration algorithm is proposed to evaluate K-μ-distance and a pruning strategy is adopted to reduce the size of candidate set of nearest-neighbors. The comparison between ULOF algorithm and the state-of-the-art approaches has been made. Results of several experiments on synthetic and real data sets demonstrate the effectiveness of the proposed approach.
机译:基于局部信息:局部密度和局部不确定性水平,本文设计了一种新的离群值检测算法,用于计算不确定数据集中每个点的不确定局部离群值因子(ULOF)。在该算法中,对常规局部离群值检测方法(LOF)的所有概念,定义和公式进行了概括,以包括不确定性信息。使用多次曲线拟合的最小二乘算法生成两点之间距离的近似概率密度函数。提出了一种迭代算法来评估K-μ距离,并采用了一种修剪策略来减小最近邻候选集的大小。已对ULOF算法和最新方法进行了比较。在综合和真实数据集上进行的几次实验结果证明了该方法的有效性。

著录项

  • 来源
    《Knowledge-Based Systems》 |2013年第10期|60-71|共12页
  • 作者

    Jing Liu; HuiFang Deng;

  • 作者单位

    Department of Computer Science and Engineering, South China University of Technology, Guangzhou 510006, PR China;

    Department of Computer Science and Engineering, South China University of Technology, Guangzhou 510006, PR China;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Outlier detection; Uncertain data; Local information;

    机译:离群值检测;不确定的数据;当地信息;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号