Representing topics labels for exploring digital libraries

Abstract

Topic models have been shown to be a useful way of representing the content of large document collections, for example via visualisation interfaces (topic browsers). These systems enable users to explore collections by way of latent topics. A standard way to represent a topic is using a set of keywords, i.e. the top-n words with highest marginal probability within the topic. However, alternative topic representations have been proposed, including textual and image labels. In this paper, we compare different topic representations, i.e. sets of topic words, textual phrases and images, in a document retrieval task. We asked participants to retrieve relevant documents based on pre-defined queries within a fixed time limit, presenting topics in one of the following modalities: (1) sets of keywords, (2) textual labels, and (3) image labels. Our results show that textual labels are easier for users to interpret than keywords and image labels. Moreover, the precision of retrieved documents for textual and image labels is comparable to the precision achieved by representing topics using sets of keywords, demonstrating that labelling methods are an effective alternative topic representation.
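The keyword representation and the precision measure mentioned in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names `top_n_words` and `precision`, the toy word probabilities, and the document identifiers are all hypothetical, and the per-topic word probabilities are assumed to be available as a plain dictionary.

```python
from typing import Dict, List, Set


def top_n_words(topic_word_probs: Dict[str, float], n: int) -> List[str]:
    """Return the n words with the highest probability within one topic."""
    # Sort by descending probability; break ties alphabetically for determinism.
    ranked = sorted(topic_word_probs.items(), key=lambda kv: (-kv[1], kv[0]))
    return [word for word, _ in ranked[:n]]


def precision(retrieved: List[str], relevant: Set[str]) -> float:
    """Fraction of retrieved documents that are relevant (0.0 if none retrieved)."""
    if not retrieved:
        return 0.0
    hits = sum(1 for doc_id in retrieved if doc_id in relevant)
    return hits / len(retrieved)


# Toy topic with made-up probabilities (illustrative only, not from the paper).
toy_topic = {
    "library": 0.12,
    "digital": 0.10,
    "archive": 0.07,
    "metadata": 0.05,
    "the": 0.01,
}
print(top_n_words(toy_topic, 3))  # ['library', 'digital', 'archive']

# Hypothetical retrieval result: 2 of the 4 retrieved documents are relevant.
print(precision(["d1", "d2", "d3", "d4"], {"d1", "d3", "d9"}))  # 0.5
```

In a real topic browser the probabilities would come from a fitted topic model rather than a hand-written dictionary; the ranking and precision logic would stay the same.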

Bibliographic details

Source
2014 IEEE/ACM Joint Conference on Digital Libraries | 2014 | pp. 239-248 | 10 pages
Conference location: London (GB)
Authors
Aletras N.; Baldwin T.; Jey Han Lau; Stevenson M.;
Author affiliation

Comput. Sci., Univ. of Sheffield, Sheffield, UK;

Original format: PDF
Language: eng
Keywords
digital libraries; image retrieval; text analysis; content representation; document collections; document retrieval task; image labels; keyword sets; latent topics; marginal probability; query processing; textual labels; textual phrases; top-n words; topic label representation; topic models; topic words; Electronic publishing; Encyclopedias; Feature extraction; Internet; Labeling; Visualization; evaluation; information retrieval; topic model

Similar literature

1. Using MPEG-21 DIDL to Represent Complex Digital Objects in the Los Alamos National Laboratory Digital Library [J] . D-lib magazine . 2003, No. 9
2. LOD for Library Science: Benefits of Applying Linked Open Data in the Digital Library Setting: Retrospects and Research Topics [J] . Atif Latif, Ansgar Scherp, Klaus Tochtermann . Kunstliche Intelligenz . 2016, No. 2
3. Perceived value of digital components in library programmes: The case of Auckland Libraries' Dare to Explore summer reading programme [J] . Misilei Jolene, Liew Chern Li . Library & Information Science Research . 2018, No. 3a4
4. Representing topics labels for exploring digital libraries [C] . Aletras N., Baldwin T., Jey Han Lau . IEEE/ACM Joint Conference on Digital Libraries . 2014
5. Creating collaboration: Exploring the development of a Baptist digital library and archive, a case study. [D] . Hall, Taffey. 2013
6. Topics in Library Technology: Labeling Techniques [O] . Stanley D. Truelson Jr. 1966
7. Representing Topics Labels for Exploring Digital Libraries [O] . Nikolaos Aletras, Timothy Baldwin, Jey Han Lau . 2015
