首页> 外文期刊>Data mining and knowledge discovery >Unsupervised group matching with application to cross-lingual topic matching without alignment information
【24h】

Unsupervised group matching with application to cross-lingual topic matching without alignment information

机译:无监督的组与应用程序匹配,在没有对齐信息的情况下匹配的跨语言主题

获取原文
获取原文并翻译 | 示例
       

摘要

We propose a method for unsupervised group matching, which is the task of finding correspondence between groups across different domains without cross-domain similarity measurements or paired data. For example, the proposed method can find matching of topic categories in different languages without alignment information. The proposed method interprets a group as a probability distribution, which enables us to handle uncertainty in a limited amount of data, and to incorporate the high order information on groups. Groups are matched by maximizing the dependence between distributions, in which we use the Hilbert Schmidt independence criterion for measuring the dependence. By using kernel embedding which maps distributions into a reproducing kernel Hilbert space, we can calculate the dependence between distributions without density estimation. In the experiments, we demonstrate the effectiveness of the proposed method using synthetic and real data sets including an application to cross-lingual topic matching.
机译:我们提出了一种对无监督组匹配的方法,这是在没有跨域相似度测量或配对数据的情况下在不同域中发现组之间的对应关系的任务。例如,所提出的方法可以在不对齐信息的情况下找到不同语言中的主题类别的匹配。该提出的方法将一个组解释为概率分布,这使我们能够在有限量的数据中处理不确定性,并纳入组的高阶信息。通过最大化分布之间的依赖性来匹配组,其中我们使用Hilbert Schmidt独立标准来测量依赖性。通过使用映射分布到再现内核希尔伯特空间的内核嵌入,我们可以计算没有密度估计的分布之间的依赖性。在实验中,我们展示了使用合成和实际数据集的所提出方法的有效性,包括应用于跨语言主题匹配的应用程序。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号