...
首页> 外文期刊>IEEE Transactions on Knowledge and Data Engineering >Understand Short Texts by Harvesting and Analyzing Semantic Knowledge
【24h】

Understand Short Texts by Harvesting and Analyzing Semantic Knowledge

机译:通过收集和分析语义知识来理解短文本

获取原文
获取原文并翻译 | 示例
           

摘要

Understanding short texts is crucial to many applications, but challenges abound. First, short texts do not always observe the syntax of a written language. As a result, traditional natural language processing tools, ranging from part-of-speech tagging to dependency parsing, cannot be easily applied. Second, short texts usually do not contain sufficient statistical signals to support many state-of-the-art approaches for text mining such as topic modeling. Third, short texts are more ambiguous and noisy, and are generated in an enormous volume, which further increases the difficulty to handle them. We argue that semantic knowledge is required in order to better understand short texts. In this work, we build a prototype system for short text understanding which exploits semantic knowledge provided by a well-known knowledgebase and automatically harvested from a web corpus. Our knowledge-intensive approaches disrupt traditional methods for tasks such as text segmentation, part-of-speech tagging, and concept labeling, in the sense that we focus on semantics in all these tasks. We conduct a comprehensive performance evaluation on real-life data. The results show that semantic knowledge is indispensable for short text understanding, and our knowledge-intensive approaches are both effective and efficient in discovering semantics of short texts.
机译:理解短文本对于许多应用程序至关重要,但是挑战很多。首先,短文本并不总是遵循书面语言的语法。结果,传统的自然语言处理工具(从词性标记到依赖项解析)不容易应用。其次,短文本通常没有足够的统计信号来支持许多用于文本挖掘的最新方法,例如主题建模。第三,短文本更加含糊和嘈杂,并且产生了大量的文本,这进一步增加了处理它们的难度。我们认为,为了更好地理解短文本,需要语义知识。在这项工作中,我们建立了一个用于短文本理解的原型系统,该系统利用了著名知识库提供的语义知识,并从网络语料库中自动获取了语义知识。在我们关注所有这些任务中的语义的意义上,我们的知识密集型方法破坏了诸如文本分段,词性标记和概念标签之类的任务的传统方法。我们对真实数据进行全面的性能评估。结果表明,语义知识对于理解短文本是必不可少的,而我们的知识密集型方法对于发现短文本的语义既有效又有效。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号