Journal: 《计算机工程》 (Computer Engineering)

Chinese Named Entity Recognition Based on Part-of-Speech Features with Word Boundaries

         

Abstract

This paper studies how Part-of-Speech (PoS) features can be applied to Chinese named entity recognition at the character level with a Conditional Random Fields (CRF) model, taking into account the different ways PoS can be expressed as a feature in the task. It proposes a method that merges PoS and word-boundary information into a single feature item. Under the same experimental environment, several Chinese named entity recognition experiments are run as sequence labeling on a public corpus, one for each way of applying PoS features. A comparison of the results shows that the feature combining second-level PoS with word boundaries gives the best results in both system execution performance and recognition quality.
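The idea of merging PoS and word boundary into one feature item can be illustrated with a minimal sketch. The abstract does not state the boundary scheme, the way the two values are joined, or the PoS tag set, so everything below is an assumption made for illustration: a B/M/E/S boundary scheme, a hyphen as the joiner, the hypothetical helper names `boundary_tags` and `char_features`, and the sample tags `nr`, `p`, `ns`, `v`. Input is assumed to be already segmented and PoS-tagged.

```python
from typing import List, Tuple


def boundary_tags(word_len: int) -> List[str]:
    """One boundary tag per character: S for a single-character word,
    otherwise B (begin), M (middle, zero or more), E (end).
    The B/M/E/S scheme is an assumption, not taken from the paper."""
    if word_len == 1:
        return ["S"]
    return ["B"] + ["M"] * (word_len - 2) + ["E"]


def char_features(segmented: List[Tuple[str, str]]) -> List[Tuple[str, str]]:
    """Expand (word, pos) pairs into (character, combined_feature) pairs,
    where the combined feature joins the boundary tag and the PoS tag
    into a single item, e.g. "B-nr"."""
    rows = []
    for word, pos in segmented:
        for ch, edge in zip(word, boundary_tags(len(word))):
            rows.append((ch, f"{edge}-{pos}"))
    return rows


# Hypothetical segmented and PoS-tagged sentence (tags are illustrative).
sentence = [("张三", "nr"), ("在", "p"), ("北京", "ns"), ("工作", "v")]
rows = char_features(sentence)
for ch, feat in rows:
    print(f"{ch}\t{feat}")
```

Each output row is one character with its single combined feature, which is the kind of per-token column a character-level sequence labeler could consume. Keeping boundary and PoS in one item, rather than two separate columns, means the model sees them as one joint value; whether that matches the paper's exact encoding is not confirmed by the abstract.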
