【24h】

Entity-balanced Gaussian pLSA for Automated Comparison

机译:实体平衡高斯PLSA用于自动比较

获取原文

摘要

Community created content (e.g., product descriptions, reviews) typically discusses one entity at a time and it can be hard as well as time consuming for a user to compare two or more entities. In response, we define a novel task of automatically generating entity comparisons from text. Our output is a table that semantically clusters descriptive phrases about entities. Our clustering algorithm is a Gaussian extension of probabilistic latent semantic analysis (pLSA), in which each phrase is represented in word vector embedding space. In addition, our algorithm attempts to balance information about entities in each cluster to generate meaningful comparison tables, where possible. We test our system's effectiveness on two domains, travel articles and movie reviews, and find that entity-balanced clusters are strongly preferred by users.
机译:社区创建的内容(例如,产品描述,评论)通常一次讨论一个实体,并且对于用户比较两个或更多个实体,它可能很难和耗时。作为响应,我们定义了从文本自动生成实体比较的新颖任务。我们的输出是一个表,语义群集关于实体的描述性短语。我们的聚类算法是概率潜在语义分析(PLSA)的高斯扩展,其中每个短语在Word矢量嵌入空间中表示。此外,我们的算法尝试在可能的情况下尝试使用每个群集中的实体的信息来生成有意义的比较表。我们在两个域名,旅游文章和电影评论中测试系统的效果,并发现用户强烈首选实体平衡群集。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号