首页> 外文会议>International conference on applications of natural language to information systems >Detecting Non-covered Questions in Frequently Asked Questions Collections
【24h】

Detecting Non-covered Questions in Frequently Asked Questions Collections

机译:检测常见问题集中的未发现问题

获取原文

摘要

Frequently asked questions (FAQ) collections are a popular and effective way of representing information, and FAQ retrieval systems provide a natural-language interface to such collections. An important aspect of efficient and trustworthy FAQ retrieval is to maintain a low fall-out rate by detecting non-covered questions. In this paper we address the task of detecting non-covered questions. We experiment with threshold-based methods as well as unsupervised one-class and supervised binary classifiers, considering tf-idf and word embeddings text representations. Experiments, carried out on a domain-specific FAQ collection, indicate that a cluster-based model with query paraphrases outperforms threshold-based, one-class, and binary classifiers.
机译:常见问题解答(FAQ)集合是表示信息的一种流行且有效的方式,而FAQ检索系统为此类集合提供了自然语言的界面。高效且值得信赖的FAQ检索的重要方面是通过检测未覆盖的问题来保持较低的失败率。在本文中,我们解决了检测未发现问题的任务。考虑到tf-idf和词嵌入文本表示形式,我们尝试了基于阈值的方法以及无监督的一类和有监督的二进制分类器。对特定于域的FAQ集合进行的实验表明,具有查询短语的基于群集的模型优于基于阈值的,一类和二进制分类器。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号