【24h】

Entity-balanced Gaussian pLSA for Automated Comparison

机译:实体平衡高斯pLSA用于自动比较

获取原文

摘要

Community created content (e.g., product descriptions, reviews) typically discusses one entity at a time and it can be hard as well as time consuming for a user to compare two or more entities. In response, we define a novel task of automatically generating entity comparisons from text. Our output is a table that semantically clusters descriptive phrases about entities. Our clustering algorithm is a Gaussian extension of probabilistic latent semantic analysis (pLSA), in which each phrase is represented in word vector embedding space. In addition, our algorithm attempts to balance information about entities in each cluster to generate meaningful comparison tables, where possible. We test our system's effectiveness on two domains, travel articles and movie reviews, and find that entity-balanced clusters are strongly preferred by users.
机译:社区创建的内容(例如,产品说明,评论)通常一次讨论一个实体,并且用户比较两个或多个实体可能既困难又耗时。作为回应,我们定义了一个新颖的任务,即根据文本自动生成实体比较。我们的输出是一个表,该表在语义上聚集了有关实体的描述性短语。我们的聚类算法是概率潜在语义分析(pLSA)的高斯扩展,其中每个词组都在词向量嵌入空间中表示。此外,我们的算法会尝试在每个群集中平衡有关实体的信息,以在可能的情况下生成有意义的比较表。我们在两个领域(旅行文章和电影评论)上测试了系统的有效性,发现用户强烈偏爱实体平衡的集群。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号