Preferences in Wikipedia abstracts: Empirical findings and implications for automatic entity summarization

Danyun Xu; Gong Cheng; Yuzhong Qu

首页> 外文期刊>Information Processing & Management >Preferences in Wikipedia abstracts: Empirical findings and implications for automatic entity summarization

【24h】

Preferences in Wikipedia abstracts: Empirical findings and implications for automatic entity summarization

机译：Wikipedia摘要中的首选项：自动实体摘要的经验发现和启示

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

The volume of entity-centric structured data grows rapidly on the Web. The description of an entity, composed of property-value pairs (a.k.a. features), has become very large in many applications. To avoid information overload, efforts have been made to automatically select a limited number of features to be shown to the user based on certain criteria, which is called automatic entity summarization. However, to the best of our knowledge, there is a lack of extensive studies on how humans rank and select features in practice, which can provide empirical support and inspire future research. In this article, we present a large-scale statistical analysis of the descriptions of entities provided by DBpedia and the abstracts of their corresponding Wikipedia articles, to empirically study, along several different dimensions, which kinds of features are preferable when humans summarize. Implications for automatic entity summarization are drawn from the findings.

机译：以实体为中心的结构化数据在Web上的增长迅速。由属性-值对（也称为特征）组成的实体描述在许多应用中变得非常庞大。为了避免信息过载，已经做出努力来基于某些标准自动选择要显示给用户的有限数量的功能，这称为自动实体摘要。然而，据我们所知，目前尚缺乏有关人类如何在实践中对等级进行排序和选择特征的广泛研究，这些研究可以提供实证支持并激发未来的研究。在本文中，我们对DBpedia提供的实体描述及其相应的Wikipedia文章的摘要进行了大规模的统计分析，以便从几个不同维度进行实证研究，当人类进行总结时，哪种功能更可取。从结果中得出自动实体摘要的含义。

著录项

来源
《Information Processing & Management》 |2014年第2期|284-296|共13页
作者
Danyun Xu; Gong Cheng; Yuzhong Qu;
展开▼
作者单位

State Key Laboratory for Novel Software Technology, Nanjing University, Nanjing 210023, PR China;

State Key Laboratory for Novel Software Technology, Nanjing University, Nanjing 210023, PR China;

State Key Laboratory for Novel Software Technology, Nanjing University, Nanjing 210023, PR China;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
DBpedia; Entity summarization; Feature selection; Property ranking; Wikipedia;

机译：DBpedia;实体汇总;功能选择;物业排名;维基百科;

相似文献

外文文献
中文文献
专利

1. Automatic compilation of language resources for named entity recognition in Turkish by utilizing Wikipedia article titles [J] . Dilek Kuecuek Computer standards & interfaces . 2015,第sepa期

机译：利用维基百科文章标题自动编译语言资源，以土耳其语命名实体识别
2. Automatically building large-scale named entity recognition corpora from Chinese Wikipedia [J] . Jie?Zhou, Bi-cheng?Li, Gang?Chen Frontiers of Information Technology & Electronic Engineering . 2015,第11期

机译：从中文维基百科自动建立大规模的命名实体识别语料库
3. Automatically building large-scale named entity recognition corpora from Chinese Wikipedia [J] . Jie ZHOU, Bi-cheng LI, Gang CHEN 浙江大学学报（英文版）（C辑：计算机与电子） . 2015,第011期

机译：从中文维基百科自动建立大规模的命名实体识别语料库
4. Related Entity Finding Using Semantic Clustering Based on Wikipedia Categories [C] . Georgios Stratogiannis, Georgios Siolas, Andreas Stafylopatis Language processing and intelligent information systems . 2013

机译：基于维基百科类别的使用语义聚类的相关实体查找
5. Heterogeneity in motorists' preferences for travel time and time reliability: Empirical finding from multiple survey data sets and its policy implications. [D] . Yan, Jia. 2002

机译：驾车者偏好出行时间和时间可靠性的异质性：来自多个调查数据集的经验发现及其政策含义。
6. Automatic Summarization of Mouse Gene Information by Clustering and Sentence Extraction from MEDLINE Abstracts [O] . Jianji Yang, Aaron M. Cohen, William Hersh 2007

机译：通过MEDLINE摘要的聚类和句子提取自动总结小鼠基因信息
7. A new graph based text segmentation using Wikipedia for automatic text summarization [O] . Pourvali Mohsen 2012

机译：一种新的基于图的文本分割，使用维基百科进行自动文本汇总
8. Delft University at the TREC 2009 Entity Track: Ranking Wikipedia Entities [R] . Serdyukov, P., De Vries, A. 2009

机译：代尔夫特大学在TREC 2009实体轨道：排名维基百科实体

Preferences in Wikipedia abstracts: Empirical findings and implications for automatic entity summarization

摘要

著录项

相似文献

相关主题

期刊订阅