Parallel, massive processing in SuperMatrix: a general tool for distributional semantic analysis of corpora

Bartosz Broda; Maciej Piasecki

Abstract

This article presents an extended version of the SuperMatrix system, a general tool supporting automatic acquisition of lexical semantic relations from corpora. The extensions focus mainly on parallel processing of massive amounts of data. The construction of the system is discussed. Three distributed parts of the system are presented: distributed construction of co-incidence matrices from corpora, computation of the similarity matrix, and parallel solving of synonymy tests. An evaluation of the proposed approach to parallel processing is shown. Parallelisation of the similarity matrix computation demonstrates almost linear speedup. The smallest improvements were achieved for the construction of matrices, as this process is mostly bound by reading huge amounts of data. Areas of application of the system are described.
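
The similarity-matrix stage lends itself to a small illustration. The sketch below is not SuperMatrix code: it assumes a toy tokenised corpus, a symmetric context window, and cosine similarity, and all function names (`cooccurrence_rows`, `similarity_block`, `similarity_matrix`) are hypothetical. It shows only the general idea that rows of a similarity matrix can be computed independently of each other, so the work splits into blocks that separate workers can handle. A thread pool keeps the example short and runnable; in CPython, a process pool or a cluster would be needed for real speedup on CPU-bound work.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor
from math import sqrt


def cooccurrence_rows(sentences, window=2):
    """Build sparse rows: word -> Counter of words seen within `window` tokens."""
    rows = {}
    for tokens in sentences:
        for i, word in enumerate(tokens):
            lo, hi = max(0, i - window), min(len(tokens), i + window + 1)
            context = rows.setdefault(word, Counter())
            for j in range(lo, hi):
                if j != i:
                    context[tokens[j]] += 1
    return rows


def cosine(u, v):
    """Cosine similarity of two sparse count vectors (assumed measure)."""
    dot = sum(count * v[feat] for feat, count in u.items() if feat in v)
    norm = sqrt(sum(c * c for c in u.values())) * sqrt(sum(c * c for c in v.values()))
    return dot / norm if norm else 0.0


def similarity_block(rows, words, block):
    """Compute the similarity-matrix rows for the words in `block` only."""
    return {w: {other: cosine(rows[w], rows[other]) for other in words} for w in block}


def similarity_matrix(rows, workers=2):
    """Split the row set into one block per worker and merge the partial results."""
    words = sorted(rows)
    size = max(1, -(-len(words) // workers))  # ceiling division
    blocks = [words[i:i + size] for i in range(0, len(words), size)]
    result = {}
    with ThreadPoolExecutor(max_workers=workers) as pool:
        for part in pool.map(lambda b: similarity_block(rows, words, b), blocks):
            result.update(part)
    return result


# Toy usage: "cat" and "dog" share the contexts "the" and "drinks".
corpus = [["the", "cat", "drinks", "milk"], ["the", "dog", "drinks", "water"]]
sim = similarity_matrix(cooccurrence_rows(corpus))
print(sim["cat"]["dog"])
```

Because each block reads the shared rows and writes only its own part of the result, the workers need no coordination beyond the final merge. Matrix construction, by contrast, has to read the whole corpus first, which the abstract identifies as the limiting factor for that stage.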

Bibliographic details

Source
International Journal of Data Mining, Modelling and Management | 2013, Issue 1 | pp. 1-19 | 19 pages
Authors
Bartosz Broda; Maciej Piasecki
Affiliation

Institute of Informatics, Wroclaw University of Technology, 50-370 Wroclaw, Poland (both authors)

Original format: PDF
Language: English
Keywords
supermatrix; distributional semantics; parallel processing; semantic analysis

Similar literature

1. Word sense induction in Bengali using parallel corpora and distributional semantics [J]. Sengupta Saptarshi, Pandit Rajat, Mitra Parag. Journal of Intelligent & Fuzzy Systems: Applications in Engineering and Technology. 2019, Issue 5
2. Deductive query processing with an object-oriented semantic network in a massively parallel environment [J]. S.H. Oh, W.S. Lee. International Journal of Computers & Applications. 2004, Issue 2
3. An implementation of the Cohen's class time-frequency distributions on a massively parallel processor [J]. Krzysztof Konopko. Przeglad Elektrotechniczny. 2012, Issue 9B
4. Parallel, massive processing in SuperMatrix: a general tool for distributional semantic analysis of corpus [C]. Proceedings of the International Multiconference on Computer Science and Information Technology. 2010
5. Leveraging Semantic Similarity in Parallel Corpora for Natural Language Processing [D]. Wu, Shumin. 2015
6. MethPat: a tool for the analysis and visualisation of complex methylation patterns obtained by massively parallel sequencing [O]. Nicholas C. Wong, Bernard J. Pope, Ida L. Candiloro. 2016
7. A Generic Approach to Processing Parallel Corpora of the Europarl for Distributional Discourse Patterns [O]. Jolanta Mizera-Pietraszko. 2011
