Studying the impact of language-independent and language-specific features on hybrid Arabic Person name recognition

Oudah Mai; Shaalan Khaled

首页> 外文期刊>Language Resources and Evaluation >Studying the impact of language-independent and language-specific features on hybrid Arabic Person name recognition

【24h】

Studying the impact of language-independent and language-specific features on hybrid Arabic Person name recognition

机译：研究独立于语言和特定于语言的功能对混合阿拉伯语人名识别的影响

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper, extensive experiments are conducted to study the impact of features of different categories, in isolation and gradually in an incremental manner, on Arabic Person name recognition. We present an integrated system that employs the rule-based approach with the machine learning (ML)-based approach in order to develop a consolidated hybrid system. Our feature space is comprised of language-independent and language-specific features. The explored features are naturally grouped under six categories: Person named entity tags predicted by the rule-based component, word-level features, POS features, morphological features, gazetteer features, and other contextual features. As decision tree algorithm has proved comparatively higher efficiency as a classifier in current state-of-the-art hybrid Named Entity Recognition for Arabic, it is adopted in this study as the ML technique utilized by the hybrid system. Therefore, the experiments are focused on two dimensions: the standard dataset used and the set of selected features. A number of standard datasets are used for the training and testing of the hybrid system, including ACE (2003-2004) and ANERcorp. The experimental analysis indicates that both language-independent and language-specific features play an important role in overcoming the challenges posed by Arabic language and have demonstrated critical impact on optimizing the performance of the hybrid system.

机译：在本文中，进行了广泛的实验，以孤立的方式逐步地研究了不同类别的特征对阿拉伯语人名识别的影响。我们提出了一种集成系统，该系统采用基于规则的方法和基于机器学习（ML）的方法，以开发整合的混合系统。我们的特征空间由与语言无关和特定于语言的特征组成。探索的功能自然分为以下六类：由基于规则的组件预测的人员命名实体标签，单词级功能，POS功能，形态功能，地名词典功能和其他上下文功能。由于决策树算法已被证明在当前最先进的阿拉伯混合命名实体识别中作为分类器具有较高的效率，因此在本研究中将其用作混合系统使用的ML技术。因此，实验着重于两个方面：使用的标准数据集和所选要素的集合。许多标准数据集都用于混合系统的训练和测试，包括ACE（2003-2004）和ANERcorp。实验分析表明，独立于语言的特征和特定于语言的特征在克服阿拉伯语言所带来的挑战中都发挥着重要作用，并已显示出对优化混合系统性能的关键影响。

著录项

来源
《Language Resources and Evaluation》 |2017年第2期|351-378|共28页
作者
Oudah Mai; Shaalan Khaled;
展开▼
作者单位

Masdar Inst Sci & Technol, Abu Dhabi, U Arab Emirates;

British Univ Dubai, Dubai Int Acad City, U Arab Emirates;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Named entity recognition; Information extraction; Rule-based approach; Machine learning; Hybrid approach; Natural language processing;

机译：命名实体识别;信息提取;基于规则的方法;机器学习;混合方法;自然语言处理;

相似文献

外文文献
中文文献
专利

1. Hybrid Feature Model for Emotion Recognition in Arabic Text [J] . Alswaidan Nourah, Menai Mohamed El Bachir Quality Control, Transactions . 2020,第期

机译：阿拉伯文中情感识别的混合特征模型
2. Arabic isolated word recognition system using hybrid feature extraction techniques and neural network [J] . Lotfi Boussaid, Mohamed Hassine International journal of speech technology . 2018,第1期

机译：混合特征提取技术与神经网络的阿拉伯语孤立词识别系统
3. Hybrid Feature Vector for the Recognition of Arabic Handwritten Characters Using Feed-Forward Neural Network [J] . Lamghari N., Charaf M. E. H., Raghay S. Arabian Journal for Science and Engineering . 2018,第12期

机译：混合特征向量用于前馈神经网络识别阿拉伯手写字符
4. Studying the impact of various features on the performance of Conditional Random Field-based Arabic Named Entity Recognition [C] . Morsi Alia, Rafea Ahmed 2013 ACS International Conference on Computer Systems and Applications . 2013

机译：研究各种功能对基于条件随机场的阿拉伯命名实体识别性能的影响
5. The Role of Diacritics in Word Recognition and their Impact on Arabic L2 Learners' Reading Speed, Accuracy, and Comprehension at Different Stages of Arabic L2 Acquisition [D] . Midhwah, Ali Ahmed Al 2018

机译：作用于字体识别的作用及其对阿拉伯语L2采集不同阶段的阿拉伯语L2学习者阅读速度，准确性和理解的影响
6. The Development of Language-Specific and Language-Independent Talker Processing [O] . Susannah V. Levi, Richard G. Schwartz -1

机译：特定于语言和独立于语言的谈话者处理的发展
7. Impact of features and classifiers combinations on the performances of Arabic recognition systems [O] . Afef Kacem Echi, Abdel Belaid 2017

机译：特征和分类机组的影响与阿拉伯识别系统的性能

Studying the impact of language-independent and language-specific features on hybrid Arabic Person name recognition

摘要

著录项

相似文献

相关主题

期刊订阅