On the Complexity of Rocchio's Similarity-Based Relevance Feedback Algorithm

Zhixiang Chen; Bin Fu


Abstract

Rocchio's similarity-based relevance feedback algorithm, one of the most important query reformation methods in information retrieval, is essentially an adaptive learning algorithm from examples in searching for documents represented by a linear classifier. Despite its popularity in various applications, there is little rigorous analysis of its learning complexity in the literature. In this article, the authors prove for the first time that the learning complexity of Rocchio's algorithm is $O(d + d^2(\log d + \log n))$ over the discretized vector space $\{0, \ldots, n-1\}^d$ when the inner product similarity measure is used. The upper bound on the learning complexity for searching for documents represented by a monotone linear classifier $(q, 0)$ over $\{0, \ldots, n-1\}^d$ can be improved to at most $1 + 2k(n-1)(\log d + \log(n-1))$, where $k$ is the number of nonzero components in $q$. Several lower bounds on the learning complexity are also obtained for Rocchio's algorithm. For example, the authors prove that Rocchio's algorithm has a lower bound of $\Omega(\binom{d}{2} \log n)$ on its learning complexity over the Boolean vector space $\{0,1\}^d$.
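To make the setting in the abstract concrete, the following is a minimal sketch of a mistake-driven feedback loop with inner-product similarity over $\{0, \ldots, n-1\}^d$. It assumes a Perceptron-style update with a unit updating factor, a zero initial query, and a fixed threshold of 0; the record does not spell out the paper's exact procedure, so this should be read as an illustration, not the authors' formulation. The names `inner`, `predict`, `rocchio_style_feedback`, and `monotone_upper_bound` are hypothetical, and the base-2 logarithm in the bound is also an assumption.

```python
import math
from itertools import product


def inner(q, x):
    """Inner-product similarity between a query vector q and a document x."""
    return sum(qi * xi for qi, xi in zip(q, x))


def predict(q, x):
    """Judge x relevant when its similarity to q exceeds the threshold 0."""
    return inner(q, x) > 0


def rocchio_style_feedback(docs, is_relevant, d, max_passes=1000):
    """Refine a query from relevance judgments until it agrees with all of them.

    Assumptions (not taken from the paper): zero initial query and a unit
    updating factor. Returns the learned query and the number of updates.
    """
    q = [0] * d
    mistakes = 0
    for _ in range(max_passes):
        changed = False
        for x in docs:
            relevant = is_relevant(x)
            if predict(q, x) == relevant:
                continue
            # Add a misjudged relevant document, subtract a misjudged irrelevant one.
            sign = 1 if relevant else -1
            q = [qi + sign * xi for qi, xi in zip(q, x)]
            mistakes += 1
            changed = True
        if not changed:
            return q, mistakes
    raise RuntimeError("no consistent query found within max_passes")


def monotone_upper_bound(d, n, k):
    """Evaluate 1 + 2k(n-1)(log d + log(n-1)); base-2 logs are an assumption."""
    return 1 + 2 * k * (n - 1) * (math.log2(d) + math.log2(n - 1))


if __name__ == "__main__":
    d, n = 3, 3
    docs = list(product(range(n), repeat=d))
    target = (1, 0, 2)  # hypothetical monotone target with k = 2 nonzero components
    q, mistakes = rocchio_style_feedback(docs, lambda x: inner(target, x) > 0, d)
    print(q, mistakes, monotone_upper_bound(d, n, k=2))
```

The update count returned here belongs to this simplified loop only; printing it next to the evaluated formula is for orientation and does not verify the paper's bound.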


Bibliographic record

Source
Journal of the American Society for Information Science and Technology, 2007, No. 10, pp. 1392-1400 (9 pages)
Authors
Zhixiang Chen; Bin Fu
Author affiliation

Department of Computer Science, University of Texas-Pan American, 1201 W. University Drive, Edinburg, TX 78541-2999

Original format: PDF
Language: English
Chinese Library Classification: Science, scientific research

Similar literature

1. A quadratic lower bound for Rocchio's similarity-based relevance feedback algorithm with a fixed query updating factor [J]. Chen ZX, Fu B, Abraham J. Journal of Combinatorial Optimization. 2010, No. 2.
2. Rocchio's Model Based on Vector Space Basis Change for Pseudo Relevance Feedback [J]. Rabeb Mbarek, Mohamed Tmar, Hawete Hattab. OASIcs: OpenAccess Series in Informatics. 2014, No. 4.
3. Application of Relevance Feedback Based on Rocchio Theory for Medical Image Retrieval [J]. Weihua Song, XianWei Wu. Advanced Science Letters. 2012.
4. On the Complexity of Rocchio's Similarity-Based Relevance Feedback Algorithm [C]. Zhixiang Chen, Bin Fu. International Symposium on Algorithms and Computation (ISAAC 2005), December 19-21, 2005, Sanya (CN).
5. Active relevance feedback algorithms [D]. Xu, Zuobing. 2008.
6. Evaluation of Term Ranking Algorithms for Pseudo-Relevance Feedback in MEDLINE Retrieval [O]. Sooyoung Yoo, Jinwook Choi. 2011.
7. On the complexity of Rocchio's similarity-based relevance feedback algorithm [O]. Zhixiang Chen, Bin Fu. 2005.
