首页> 外文会议>2014 IEEE/ACM Joint Conference on Digital Libraries >Representing topics labels for exploring digital libraries
【24h】

Representing topics labels for exploring digital libraries

机译:表示用于探索数字图书馆的主题标签

获取原文
获取原文并翻译 | 示例

摘要

Topic models have been shown to be a useful way of representing the content of large document collections, for example via visualisation interfaces (topic browsers). These systems enable users to explore collections by way of latent topics. A standard way to represent a topic is using a set of keywords, i.e. the top-n words with highest marginal probability within the topic. However, alternative topic representations have been proposed, including textual and image labels. In this paper, we compare different topic representations, i.e. sets of topic words, textual phrases and images, in a document retrieval task. We asked participants to retrieve relevant documents based on pre-defined queries within a fixed time limit, presenting topics in one of the following modalities: (1) sets of keywords, (2) textual labels, and (3) image labels. Our results show that textual labels are easier for users to interpret than keywords and image labels. Moreover, the precision of retrieved documents for textual and image labels is comparable to the precision achieved by representing topics using sets of keywords, demonstrating that labelling methods are an effective alternative topic representation.
机译:主题模型已被证明是表示大型文档集合内容的一种有用方法,例如通过可视化界面(主题浏览器)。这些系统使用户可以通过潜在主题来探索馆藏。代表主题的标准方法是使用一组关键字,即主题内边际概率最高的前n个单词。但是,已经提出了替代主题表示,包括文本和图像标签。在本文中,我们在文档检索任务中比较了不同的主题表示形式,即主题词,文本短语和图像的集合。我们要求参与者在固定的时限内根据预定义的查询来检索相关文档,以以下方式之一呈现主题:(1)关键字集,(2)文本标签和(3)图像标签。我们的结果表明,文本标签比关键字和图像标签更易于用户解释。此外,检索到的用于文本和图像标签的文档的精度可与使用关键字集表示主题所实现的精度相媲美,这表明标记方法是一种有效的替代主题表示。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号