首页> 外国专利> INTERACTIVELY BUILDING A TOPIC MODEL EMPLOYING SEMANTIC SIMILARITY IN A SPOKEN DIALOG SYSTEM

INTERACTIVELY BUILDING A TOPIC MODEL EMPLOYING SEMANTIC SIMILARITY IN A SPOKEN DIALOG SYSTEM

机译:在口语对话系统中交互构建具有语义相似性的主题模型

摘要

A computer-implemented method is presented for building a topic model to discover topics in a collection of documents generated by a plurality of users. The method includes extracting conversations from the collection of documents, dividing the extracted conversations into a plurality of segments, generating a topic distribution for each of the plurality of segments based on the extracted conversations and a first pre-defined prior probability distribution, and generating continuous value constructs for each of the topic distributions based on an external corpus and a second pre-defined prior probability distribution, wherein similarity is defined between the continuous value constructs.
机译:提出了一种计算机实现的方法,该方法用于构建主题模型以发现由多个用户生成的文档集合中的主题。该方法包括:从文档集合中提取会话;将提取的会话划分为多个段;基于提取的会话和第一预定义的先验概率分布,为多个段中的每个段生成主题分布;以及生成连续的基于外部语料库和第二预定义的先验概率分布的每个主题分布的值构造,其中在连续值构造之间定义相似性。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号