Word-Level vs Sentence-Level Language Identification: Application to Algerian and Arabic Dialects

Mohamed Lichouri; Mourad Abbas; Abed Alhakim Freihat; Dhiya El Hak Megtouf

Abstract

In this paper, we investigate a set of methods for textual Arabic dialect identification, considering both word-level and sentence-level approaches. We used three classifiers: Linear Support Vector Machine (L-SVM), Bernoulli Naive Bayes (BNB), and Multinomial Naive Bayes (MNB), and then combined them using a voting procedure. We carried out experiments on two sets of dialects: the first, PADIC, consists of parallel sentences in Maghrebi and Middle Eastern dialects; the second is a set of Algerian dialects only, which we built manually. For the Arabic dialects, we obtained an average accuracy of 92%. For the Algerian dialects, our approach yielded an average accuracy of about 76%.
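
The abstract does not say which voting scheme was used to combine the three classifiers. As a minimal sketch, assuming plain hard (majority) voting with ties going to the first classifier in the list, the combination step might look like the following. The function names, the dialect labels, and the per-classifier outputs are all hypothetical and are not taken from the paper.

```python
from collections import Counter


def majority_vote(predictions):
    """Return the label predicted by the most classifiers.

    `predictions` holds one label per classifier, in a fixed classifier
    order. Ties go to the label that appears first in that order; this
    tie-break is an assumption, not something the abstract specifies.
    """
    counts = Counter(predictions)
    best = max(counts.values())
    for label in predictions:
        if counts[label] == best:
            return label


def combine(per_classifier_labels):
    """Combine the classifiers' outputs, one vote per sentence."""
    return [majority_vote(list(votes)) for votes in zip(*per_classifier_labels)]


# Hypothetical outputs of L-SVM, BNB and MNB on four sentences.
svm = ["ALG", "TUN", "SYR", "PAL"]
bnb = ["ALG", "ALG", "SYR", "SYR"]
mnb = ["TUN", "ALG", "SYR", "MSA"]

combined = combine([svm, bnb, mnb])
print(combined)
```

On the fourth sentence all three classifiers disagree, so the assumed tie-break returns the first classifier's label. A weighted or probability-based (soft) vote would be an equally plausible reading of "voting procedure".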

Bibliographic details

Source: Procedia Computer Science, 2018, No. 22, 8 pages
Authors: Mohamed Lichouri; Mourad Abbas; Abed Alhakim Freihat; Dhiya El Hak Megtouf
Original format: PDF
CLC classification: Computing technology, computer technology
Keywords: Sentence-Level Dialect Identification; Word-Level Dialect Identification; Naive Bayes Classifier; SVM; Algerian Dialects; Arabic Dialects
