The Effectiveness of a Graph-Based Algorithm for Stemming

机译：基于图的词干算法的有效性

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

In Information Retrieval (IR), stemming enables a matching of query and document terms which are related to a same meaning but which can appear in different morphological variants. In this paper we will propose and evaluate a statistical graph-based algorithm for stemming. Considering that a word is formed by a stem (prefix) and a derivation (suffix), the key idea is that strongly interlinked prefixes and suffixes form a community of sub-strings. Discovering these communities means searching for the best word splits which give the best word stems. We conducted some experiments on CLEF 2001 test sub-collections for Italian language. The results show that stemming improve the IR effectiveness. They also show that effectiveness level of our algorithm is comparable to that of an algorithm based on a-priori linguistic knowledge. This is an encouraging result, particularly in a multi-lingual context.

机译：在信息检索（IR）中，词干使查询和文档术语匹配，它们具有相同的含义，但可以以不同的形态表示。在本文中，我们将提出并评估基于统计图的词干算法。考虑到单词是由词干（前缀）和派生词（后缀）组成的，关键思想是强互连的前缀和后缀形成了子字符串社区。发现这些社区意味着寻找提供最佳词干的最佳单词分割。我们对CLEF 2001意大利语测试子集进行了一些实验。结果表明，茎梗改善了IR的有效性。他们还表明，我们算法的有效性水平可与基于先验语言知识的算法相媲美。这是一个令人鼓舞的结果，尤其是在使用多种语言的情况下。

著录项

来源
《5th International Conference on Asian Digital Libraries, ICADL 2002, Dec 11-14, 2002, Singapore》|2002年|p.117-128|共12页
会议地点 Singapore(SG);Singapore(SG)
作者
Michela Bacchin; Nicola Ferro; Massimo Melucci;
展开▼
作者单位

Department of Information Engineering University of Padua, Via Gradenigo, 6/a ― 35031 Padova ― Italy;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类信息与知识传播;
关键词

相似文献

外文文献
中文文献
专利

1. Graph-based algorithms and data-driven documents for formulation and visualization of large MDO systems [J] . Benedikt Aigner, Imco van Gent, Gianfranco La Rocca, CEAS Aeronautical Journal . 2018,第4期

机译：基于图形的算法和数据驱动的文档，用于大型MDO系统的公式化和可视化
2. A Graph-Based Taxonomy of Recommendation Algorithms and Systems in LBSNs [J] . Kefalas Pavlos, Symeonidis Panagiotis, Manolopoulos Yannis Knowledge and Data Engineering, IEEE Transactions on . 2016,第3期

机译：LBSN中基于图的推荐算法和系统分类法
3. Graph-based low complexity detection algorithms in multiple-input-multiple-out systems: an edge selection approach [J] . Lv, T., Long, Communications, IET . 2013,第12期

机译：多输入多输出系统中基于图的低复杂度检测算法：一种边缘选择方法
4. The Effectiveness of a Graph-Based Algorithm for Stemming [C] . Michela Bacchin, Nicola Ferro, Massimo Melucci, International conference on Asian digital libraries . 2002

机译：基于图形的茎秆算法的有效性
5. Fast Graph-Based Algorithms for Analyzing Protein-Protein Interaction Networks [D] . Shen, Yue. 2019

机译：基于快速的图形算法分析蛋白质 - 蛋白质相互作用网络
6. Algorithms for effective querying of compound graph-based pathway databases [O] . Ugur Dogrusoz, Ahmet Cetintas, Emek Demir, 2009

机译：基于复合图的路径数据库的有效查询算法
7. Graph-based Sequence Clustering through Multiobjective Evolutionary Algorithms for Web Recommender Systems [O] . Gul Nildem Demir, A. Sima Uyar, Sule Oguducu 2009

机译：基于图的序列聚类 - 基于多目标进化算法的Web推荐系统

The Effectiveness of a Graph-Based Algorithm for Stemming

摘要

著录项

相似文献

相关主题

期刊订阅