DATA FUSION IN SEVERAL ALGORITHMS

STAN LIPOVETSKY


Abstract

Data fusion is the process of integrating several datasets that share some common variables, while other variables are available only in some of the datasets. The main problem of data fusion can be described as follows. One source provides the datasets X_0 and Y_0 (N_0 observations on n x-variables and m y-variables, respectively), and another source provides the dataset X_1 (N_1 observations on the same n x-variables). We need to estimate the missing Y_1 data (of size N_1 by m) in order to combine all the data into one set. Several algorithms are considered in this work. One estimates weights proportional to the distances from each ith observation in the X_1 "recipients" dataset to all observations in the X_0 "donors" dataset. Alternatively, a sample balancing technique with the maximum effective base can be used: ridge-regression is applied to the Gifi system of binaries obtained from the x-variables, giving the best fit of the "donors" X_0 data to the margins defined by each respondent in the "recipients" X_1 dataset. Then the weighted regressions of each y in the Y_0 dataset on all variables in X_0 are constructed. For each ith observation in the dataset X_0, these regressions are used for predicting the y-variables in the Y_1 "recipients" dataset. If X and Y are the same n variables from different sources, the dual partial least squares technique and a special regression model with dummies defining each of the three available sets are used for prediction of the Y_1 data.
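
The distance-weighted approach outlined in the abstract can be illustrated with a short sketch. This is not the paper's implementation: the function name `fuse_by_weighted_regression`, the Gaussian kernel that turns distances into weights, and the `bandwidth` parameter are all assumptions made for illustration. The abstract states only that the weights are derived from the distances between each recipient and all donors, and that weighted regressions of Y_0 on X_0 are then used to predict Y_1.

```python
import numpy as np

def fuse_by_weighted_regression(X0, Y0, X1, bandwidth=1.0):
    """Predict the missing Y1 block for recipients X1 from donors (X0, Y0).

    Hypothetical helper: one weighted least-squares fit per recipient row.
    """
    N0 = X0.shape[0]
    A0 = np.column_stack([np.ones(N0), X0])        # donor design matrix with intercept
    Y1 = np.empty((X1.shape[0], Y0.shape[1]))
    for i, x in enumerate(X1):
        d = np.linalg.norm(X0 - x, axis=1)         # distances from recipient i to all donors
        w = np.exp(-(d / bandwidth) ** 2)          # assumed kernel: closer donors weigh more
        sw = np.sqrt(w)[:, None]
        # Weighted least squares via row scaling by sqrt(w)
        B, *_ = np.linalg.lstsq(A0 * sw, Y0 * sw, rcond=None)
        Y1[i] = np.concatenate(([1.0], x)) @ B     # predict all m y-variables for recipient i
    return Y1
```

Fitting a separate regression for each recipient keeps the sketch close to the abstract's wording, at the cost of N_1 least-squares solves. Other choices of weights, such as those produced by the sample balancing step, could be substituted for `w` without changing the rest of the loop.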


Bibliographic details

Source: Advances in Adaptive Data Analysis, 2013, No. 3, 12 pages
Author: STAN LIPOVETSKY
Original format: PDF
Language: English
Chinese Library Classification: Computing technology, computer technology
Keywords: Data fusion; Distances; Weighting; Dual canonical correlations and PLS; Ridge-regression; Regression with dummies

