Deep recurrent neural networks with word embeddings for Urdu named entity recognition

Wahab Khan; Ali Daud; Fahd Alotaibi; Naif Aljohani; Sachi Arafat

Abstract

Named entity recognition (NER) remains an important task in natural language processing because it is a subtask of information extraction and machine translation. It is particularly difficult in Urdu language processing. This paper proposes several deep recurrent neural network (DRNN) learning models with word embeddings. Experimental results demonstrate that they improve upon current state-of-the-art NER approaches for Urdu. The DRNN models evaluated include forward and bidirectional extensions of the long short-term memory and backpropagation through time approaches. The proposed models consider both language-dependent features, such as part-of-speech tags, and language-independent features, such as the "context windows" of words. The effectiveness of the DRNN models with word embeddings for Urdu NER is demonstrated on three benchmark datasets, where the proposed approach significantly outperforms previous conditional random field and artificial neural network approaches. The best F-measure values achieved on the three datasets are 81.1%, 79.94%, and 63.21%, respectively.
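The abstract names the "context windows" of words as a language-independent feature. A minimal sketch of what such a window could look like is given here, assuming a symmetric window of odd size and a padding token at the sentence boundaries. The function name `context_windows`, the `PAD` token, and the default size of 3 are hypothetical choices for illustration and are not taken from the paper.

```python
from typing import List

PAD = "<PAD>"  # hypothetical padding token, not specified in the abstract


def context_windows(tokens: List[str], size: int = 3) -> List[List[str]]:
    """Return one window of `size` tokens centered on each input token.

    Positions that fall outside the sentence are filled with PAD.
    The window size is an assumption; the abstract does not state one.
    """
    if size < 1 or size % 2 == 0:
        raise ValueError("size must be a positive odd number")
    half = size // 2
    # Pad both ends so that every token has a full-width window.
    padded = [PAD] * half + list(tokens) + [PAD] * half
    return [padded[i:i + size] for i in range(len(tokens))]


if __name__ == "__main__":
    # Placeholder tokens stand in for the words of a sentence.
    for window in context_windows(["w1", "w2", "w3", "w4"], size=3):
        print(window)
```

In a tagger of the kind the abstract describes, each window would typically be mapped to word embeddings before being passed to the recurrent network; that step is omitted here because the abstract gives no details about it.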

Bibliographic details

Source
ETRI Journal, 2020, No. 1, 11 pages
Authors
Wahab Khan; Ali Daud; Fahd Alotaibi; Naif Aljohani; Sachi Arafat
Original format
PDF
Keywords
conditional random fields; deep recurrent neural network; machine learning; named entity recognition; Urdu

Similar literature

1. Urdu Named Entity Recognition and Classification System Using Artificial Neural Network [J]. Muhammad Kamran Malik. ACM Transactions on Asian Language Information Processing, 2018, No. 1.
2. A Holistic Approach to Urdu Language Word Recognition using Deep Neural Networks [J]. H.R. Khan, M.A. Hasan, M. Kazmi. Engineering Technology and Applied Science Research, 2021, No. 3.
3. Deep learning with word embeddings improves biomedical named entity recognition [J]. Bioinformatics, 2017, No. 14.
4. Biomedical Named-Entity Recognition by Hierarchically Fusing BioBERT Representations and Deep Contextual-Level Word-Embedding [C]. Usman Naseem, Katarzyna Musial, Peter Eklund. International Joint Conference on Neural Networks, 2020.
5. Improving Search via Named Entity Recognition in Morphologically Rich Languages: A Case Study in Urdu [D]. Riaz, Kashif H. 2018.
6. Deep learning with word embeddings improves biomedical named entity recognition [O]. Maryam Habibi, Leon Weber, Mariana Neves.
7. Recurrent neural networks with specialized word embeddings for health-domain named-entity recognition [O]. Unanue, Inigo Jauregi; Borzeshi, Ehsan Zare; Piccardi, Massimo. 2017.
