首页> 中文期刊> 《计算机应用》 >面向阅读理解的句子组合模型

面向阅读理解的句子组合模型

         

摘要

The reading comprehension of document in Natural Language Processing (NLP) requires the technologies such as representation,understanding and reasoning on the document.Aiming at the choice questions of literature reading comprehension in college entrance examination,a sentence composition model based on the hierarchical composition model was proposed,which could achieve the semantic consistency measure at the sentence level.Firstly,a neural network model was trained by the triple consisted of single word and phrase vector.Then,the sentence vectors were combined by the trained neural network model (two composition methods:the recursion method and the recurrent method) to obtain the distributed vector of sentence.The similarity between sentences was presented by the cosine similarity between the two sentence vectors.In order to verify the proposed method,the 769 simulation materials and 13 Beijing college entrance examination materials (including the source text and the choice question) were collected as the test set.The experimental results show that,compared with the traditional optimal method based on HowNet semantics,the precision of the proposed recurrent method is improved by 7.8 percentage points in college entrance examination materials and 2.7 percentage points in simulation materials respectively.%阅读理解任务需要综合运用文本的表示、理解、推理等自然语言处理技术.针对高考语文中文学作品阅读理解的选项题问题,提出了基于分层组合模式的句子组合模型,用来实现句子级的语义一致性计算.首先,通过单个词和短语向量组成的三元组来训练一个神经网络模型;然后,通过训练好的神经网络模型来组合句子向量(两种组合方法:一种为递归方法;另一种为循环方法),得到句子的分布式向量表示.句子间的一致性利用两个句子向量之间的余弦相似度来表示.为了验证所提方法,收集了769篇模拟材料+13篇北京高考语文试卷材料(包括原文与选择题)作为测试集.实验结果表明,与传统最优的基于知网语义方法相比,循环方法准确率在高考材料中提高了7.8个百分点在模拟材料中提高了2.7个百分点.

著录项

相似文献

  • 中文文献
  • 外文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号