Construction of query concepts based on feature clustering of documents

Youjin Chang; Minkoo Kim; Vijay V. Raghavan

首页> 外文期刊>Information retrieval >Construction of query concepts based on feature clustering of documents

【24h】

Construction of query concepts based on feature clustering of documents

机译：基于文档特征聚类的查询概念构建

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

In Information Retrieval, since it is hard to identify users' information needs, many approaches have been tried to solve this problem by expanding initial queries and reweighting the terms in the expanded queries using users' relevance judgments. Although relevance feedback is most effective when relevance information about retrieved documents is provided by users, it is not always available. Another solution is to use correlated terms for query expansion. The main problem with this approach is how to construct the term-term correlations that can be used effectively to improve retrieval performance. In this study, we try to construct query concepts that denote users' information needs from a document space, rather than to reformulate initial queries using the term correlations and/or users' relevance feedback. To form query concepts, we extract features from each document, and then cluster the features into primitive concepts that are then used to form query concepts. Experiments are performed on the Associated Press (AP) dataset taken from the TREC collection. The experimental evaluation shows that our proposed framework called QCM (Query Concept Method) outperforms baseline probabilistic retrieval model on TREC retrieval.

机译：在信息检索中，由于难以识别用户的信息需求，因此尝试了许多方法来解决此问题，方法是扩展初始查询并使用用户的相关性判断对扩展查询中的术语进行加权。尽管当用户提供有关检索到的文档的相关性信息时，相关性反馈是最有效的，但并非总是可用。另一种解决方案是使用关联词进行查询扩展。这种方法的主要问题是如何构建可有效用于提高检索性能的项-项相关性。在这项研究中，我们尝试构建表示文档空间中用户信息需求的查询概念，而不是使用术语相关性和/或用户相关性反馈来重新构造初始查询。为了形成查询概念，我们从每个文档中提取特征，然后将这些特征聚集到原始概念中，然后将这些原始概念用于形成查询概念。实验是从TREC馆藏的美联社（AP）数据集中进行的。实验评估表明，我们提出的称为QCM（查询概念方法）的框架在TREC检索方面优于基线概率检索模型。

著录项

来源
《Information retrieval》 |2006年第3期|p.231-248|共18页
作者
Youjin Chang; Minkoo Kim; Vijay V. Raghavan;
展开▼
作者单位

Graduate School of Information and Communication, Ajou University, Suwon, Korea;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类图书馆学、图书馆事业;
关键词
concept-based information retrieval; query reformulation; query concepts;

机译：基于概念的信息检索;查询重构;查询概念;

相似文献

外文文献
中文文献
专利

1. Query reformulation using automatically generated query concepts from a document space [J] . Youjin Chang, Iadh Ounis, Minkoo Kim Information Processing & Management . 2006,第2期

机译：使用来自文档空间的自动生成的查询概念进行查询重构
2. Ontology-Based Mapping for Automated Document Management: A Concept-Based Technique for Word Mismatch and Ambiguity Problems in Document Clustering [J] . YEN-HSIEN LEE, PAUL JEN-HWA HU, CHING-YI TU ACM Transactions on Management Information Systems . 2015,第1期

机译：基于本体的自动文档管理映射：一种基于概念的文档聚类中词不匹配和歧义问题的技术
3. Incremental models for query clustering and query-context aware document clustering [J] . Poonam Goyal, N. Mehala, Navneet Goyal International journal of knowledge and web intelligence . 2015,第2期

机译：用于查询聚类和查询上下文感知的文档聚类的增量模型
4. Construction of Query Concepts in a Document Space Based on Data Mining Techniques [C] . Youjin Chang, Minkoo Kim, Iadh Ounis Flexible Query Answering Systems . 2004

机译：基于数据挖掘技术的文档空间查询概念的构建
5. Visualization of search engine query result using region-based document model on XML documents. [D] . Parikh, Sunish Umesh. 2000

机译：在XML文档上使用基于区域的文档模型来可视化搜索引擎查询结果。
6. Synonym Topic Model and Predicate-Based Query Expansion for Retrieving Clinical Documents [O] . Qing T. Zeng, Doug Redd, Thomas Rindflesch, 2012

机译：用于检索临床文档的同义词主题模型和基于谓词的查询扩展
7. Analysis of Query Dependent Multi-Document Summarization using Feature based and Cluster based Methods [O] . 2016

机译：基于特征和基于群集的方法的查询依赖多文件概述分析

Construction of query concepts based on feature clustering of documents

摘要

著录项

相似文献

相关主题

期刊订阅