Conference: 22nd International Conference on Computational Linguistics

Chinese Term Extraction Using Minimal Resources



Abstract

This paper presents a new approach to term extraction that uses minimal resources. A term candidate extraction algorithm is proposed that identifies features of term delimiters, which are relatively stable and domain independent, rather than features of the terms themselves. For term verification, a link-analysis-based method is proposed to calculate the relevance between term candidates and the sentences in the domain-specific corpus from which the candidates are extracted. The proposed approach requires no prior domain knowledge, no general corpora, no full segmentation, and only minimal adaptation for new domains. Consequently, the method can be applied to any domain corpus and is especially useful for resource-limited domains. Evaluations of Chinese term extraction on two different domains show significant improvements over existing techniques and also verify the efficiency and relatively domain-independent nature of the approach. Experiments on new term extraction also indicate that the approach is effective at identifying new terms in a domain, making it useful for updating domain knowledge.
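
The two stages described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. The delimiter list is a small hand-picked assumption (the paper identifies delimiter features from data, which is not reproduced here), and the verification step uses a HITS-style mutual reinforcement between candidates and sentences as a stand-in for the unspecified link analysis method. The names `DELIMITERS`, `extract_candidates`, and `score_candidates` are hypothetical.

```python
import re

# Hypothetical delimiter list (assumption): common function words and punctuation.
DELIMITERS = ["的", "是", "和", "在", "了", "，", "。"]


def extract_candidates(sentence, delimiters=DELIMITERS, min_len=2):
    """Split a sentence at delimiters and keep the segments between them.

    No word segmentation is performed; a candidate is simply the text
    found between two delimiters.
    """
    pattern = "|".join(re.escape(d) for d in delimiters)
    return [seg for seg in re.split(pattern, sentence) if len(seg) >= min_len]


def score_candidates(sentences, delimiters=DELIMITERS, iterations=20):
    """Rank candidates with a HITS-style iteration (assumed stand-in).

    Candidates and sentences form a bipartite graph: a candidate is linked
    to every sentence it was extracted from. Scores reinforce each other
    and are L2-normalized after every update.
    """
    links = [set(extract_candidates(s, delimiters)) for s in sentences]
    cand_score = {c: 1.0 for cands in links for c in cands}
    sent_score = [1.0] * len(sentences)

    for _ in range(iterations):
        # A candidate's score is the sum of the scores of its sentences.
        updated = {c: 0.0 for c in cand_score}
        for i, cands in enumerate(links):
            for c in cands:
                updated[c] += sent_score[i]
        norm = sum(v * v for v in updated.values()) ** 0.5 or 1.0
        cand_score = {c: v / norm for c, v in updated.items()}

        # A sentence's score is the sum of the scores of its candidates.
        raw = [sum(cand_score[c] for c in cands) for cands in links]
        norm = sum(v * v for v in raw) ** 0.5 or 1.0
        sent_score = [v / norm for v in raw]

    return sorted(cand_score.items(), key=lambda kv: kv[1], reverse=True)


if __name__ == "__main__":
    corpus = [
        "机器学习是人工智能的分支",
        "机器学习和深度学习",
        "深度学习在图像识别",
    ]
    for candidate, score in score_candidates(corpus):
        print(candidate, round(score, 3))
```

In this toy corpus, candidates that occur in several well-connected sentences rank higher than candidates that occur once. A real system would need a delimiter set derived from the corpus and a threshold on the final scores, neither of which is specified in the abstract.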

