Journal: International Journal of Data Mining, Modelling and Management

Parallel, massive processing in SuperMatrix: a general tool for distributional semantic analysis of corpora

Abstract

This article presents an extended version of the SuperMatrix system, a general tool supporting automatic acquisition of lexical semantic relations from corpora. The extensions focus mainly on parallel processing of massive amounts of data. The construction of the system is discussed, and its three distributed parts are presented: distributed construction of co-incidence matrices from corpora, computation of the similarity matrix, and parallel solving of synonymy tests. An evaluation of the proposed approach to parallel processing is reported. Parallelisation of the similarity matrix computation shows almost linear speedup. The smallest improvements were achieved for the construction of matrices, as this process is mostly bound by reading huge amounts of data. Areas of application of the system are described.
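
The first two distributed parts named in the abstract, matrix construction and similarity computation, can be illustrated with a small sketch. This is not the SuperMatrix implementation: the function names, the symmetric context window, the cosine measure, and the thread pool are all assumptions chosen to keep the example self-contained. Each corpus chunk is counted independently and the partial counts are merged; each row of the similarity matrix is then computed as an independent task, which is the property that makes near-linear speedup plausible for that stage.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor
from math import sqrt


def count_chunk(sentences, window=2):
    """Count (word, context word) pairs in one corpus chunk.

    The symmetric window is an assumption made for this sketch.
    """
    counts = Counter()
    for tokens in sentences:
        for i, word in enumerate(tokens):
            lo, hi = max(0, i - window), min(len(tokens), i + window + 1)
            for j in range(lo, hi):
                if j != i:
                    counts[(word, tokens[j])] += 1
    return counts


def build_matrix(chunks, workers=4):
    """Count every chunk independently, then merge partial counts into sparse rows."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        partials = list(pool.map(count_chunk, chunks))
    rows = {}
    for partial in partials:
        for (word, ctx), n in partial.items():
            row = rows.setdefault(word, {})
            row[ctx] = row.get(ctx, 0) + n
    return rows


def cosine(a, b):
    """Cosine similarity of two sparse rows stored as dicts."""
    dot = sum(v * b.get(k, 0) for k, v in a.items())
    norm = sqrt(sum(v * v for v in a.values())) * sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def similarity_matrix(rows, workers=4):
    """Compute one similarity row per task; rows do not depend on each other."""
    words = sorted(rows)

    def one_row(word):
        return {other: cosine(rows[word], rows[other]) for other in words}

    with ThreadPoolExecutor(max_workers=workers) as pool:
        return dict(zip(words, pool.map(one_row, words)))


# Toy corpus split into two chunks (hypothetical data, for illustration only).
chunks = [
    [["cat", "eats", "fish"], ["dog", "eats", "meat"]],
    [["cat", "drinks", "milk"], ["dog", "drinks", "water"]],
]
rows = build_matrix(chunks)
sim = similarity_matrix(rows)
print(sim["cat"]["dog"])  # 0.5: two shared contexts out of four each
```

Threads are used here only so the sketch runs anywhere with the standard library. In CPython they give no real speedup for CPU-bound work, so a practical setup would distribute the same tasks across processes or machines. The chunk-counting stage still has to read the whole corpus, which is consistent with the abstract's remark that matrix construction is bound mostly by I/O.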
