首页> 外文会议> >Automatic recognition of Chinese place names: a statistical and rule-based combined approach

【24h】

Automatic recognition of Chinese place names: a statistical and rule-based combined approach

机译：自动识别中文地名：一种基于统计和基于规则的组合方法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The automatic recognition of Chinese place names, a special case of the recognition of Chinese special nouns, is an important task in Chinese information processing. In this paper, we propose an approach combining statistical and rule-based techniques. The proposed approach discovers candidates from Chinese texts based upon the probability of a character being part of a Chinese place name; and confirms or eliminates the candidates by applying rules obtained by human summarization and transformation-based machine learning. In this approach, we employ a statistical measure: weight of likelihood (WOL), to estimate the likelihood of a character being part of a Chinese place name in real corpora. To the authors' knowledge, it is the first time WOL has been used to capture the capability of a character forming Chinese places names in real corpora. We evaluate the performance of our approach on a real data set and the recall and precision are 97% and 90.92% respectively.

机译：中文地名的自动识别是中文特殊名词识别的一种特殊情况，是中文信息处理中的重要任务。在本文中，我们提出了一种结合统计和基于规则的技术的方法。所提出的方法基于字符是中文地名一部分的概率从中文文本中发现候选者;并通过应用基于人类摘要和基于变换的机器学习获得的规则来确认或消除候选人。在这种方法中，我们采用统计量度：似然权重（WOL），以估计某个字符成为真实语料库中中文地名一部分的可能性。据作者所知，这是第一次使用WOL捕获在真实语料库中形成中文地名的字符的功能。我们在真实数据集上评估我们的方法的性能，召回率和准确性分别为97％和90.92％。

著录项

来源
《》|2001年|P.2204-2209|共6页
会议地点
作者
Jia-heng Zheng; Hong-ye Tan; Kai-ying Liu;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类无线电电子学、电信技术;
关键词

相似文献

外文文献
中文文献
专利

1. Combining rule-based and statistical mechanisms for low-resource named entity recognition [J] . Ryan Gabbard, Jay DeYoung, Constantine Lignos, Machine translation . 2018,第1a2期

机译：结合基于规则和统计的机制，以实现低资源命名实体的识别
2. Chunk Segmentation of Chinese Sentences Using a Combined Statistical and Rule-based Approach (CSRA) [J] . Rongbo Wang, Xiaohua Wang, Zhiqun Chen, International Journal of Computer Processing of Oriental Languages . 2007,第2a3期

机译：统计和规则相结合的方法（CSRA）对汉语句子进行大块分割
3. Automatic normalization of short texts by combining statistical and rule-based techniques [J] . Marta R. Costa-jussa, Rafael E. Banchs Language Resources and Evaluation . 2013,第1期

机译：通过结合统计和基于规则的技术来自动规范短文本
4. Automatic recognition of Chinese place names: a statistical and rule-based combined approach [C] . Jia-heng Zheng, Hong-ye Tan, Kai-ying Liu, IEEE Interantional Conference on Systems, Man, and Cybernetics . 2001

机译：自动识别中国地名：基于统计和规则的组合方法
5. A multimodal fusion approach for automatic postal address recognition system using Optical Character Recognition (OCR) and Automatic Speech Recognition (ASR) techniques. [D] . Singh, Amriteshwar. 2011

机译：一种使用光学字符识别（OCR）和自动语音识别（ASR）技术的自动邮政地址识别系统的多模式融合方法。
6. Improving the Named Entity Recognition of Chinese Electronic Medical Records by Combining Domain Dictionary and Rules [O] . Xianglong Chen, Chunping Ouyang, Yongbin Liu, 2020

机译：结合领域词典和规则改进中文电子病历的命名实体识别
7. Chinese named entity recognition combining a statistical model with human knowledge [O] . Youzheng Wu, Jun Zhao, Bo Xu 2003

机译：中文命名实体识别将统计模型与人类知识相结合

Automatic recognition of Chinese place names: a statistical and rule-based combined approach

摘要

著录项

相似文献

相关主题

期刊订阅