首页> 中文期刊> 《情报学报》 >基于多层术语度的一体化术语抽取研究

基于多层术语度的一体化术语抽取研究

         

摘要

以往的术语抽取研究大多将语言学方法和统计方法分别进行单独的处理,并且只考虑候选术语本身的术语度,而没有考虑候选术语所在句子的术语度对术语抽取性能的影响.本文将语言学方法与统计方法进行并行融合,综合考虑候选术语及其所在语句的术语度,进行基于多层术语度的一体化术语抽取.该研究有两个特色:首先,采用条件随机场模型,能有效融合语言学方法和统计方法,实验结果表明了基于一体化策略的术语抽取方法的有效性;其次,通过语料库比较方法,提出基于多层术语度的术语抽取方法,该方法能抽取多字术语,实验结果表明了利用多层术语度进行术语抽取的有效性.%In most previous studies on terminology extraction, linguistics methods and statistical methods were used through independent process respectively.At the same time, without considering termhood of sentence which includes the terminology candidate, only the termhood of the terminology candidate was considered.In this paper, a method based on multi-level termhood is proposed.The method uses integration strategy which ensembles the linguistics methods and statistical methods.The proposed method uses the termhood of the terminology candidate and the sentence.In this paper, the validity of the terminology extraction method based on the integrated strategy is verified by conditional random fields.Multi-level termhood is computed by the comparison of the corpus, and experiment results show that the multi-level termhood can get better performnance than normal method.

著录项

相似文献

  • 中文文献
  • 外文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号