Information Processing & Management

A joint learning approach with knowledge injection for zero-shot cross-lingual hate speech detection



Abstract

Hate speech is an increasingly important societal issue in the era of digital communication. Hateful expressions often make use of figurative language and, although they represent, in some sense, the dark side of language, they are also often prime examples of creative language use. While hate speech is a global phenomenon, current studies on automatic hate speech detection are typically framed in a monolingual setting. In this work, we explore hate speech detection in low-resource languages by transferring knowledge from a resource-rich language, English, in a zero-shot learning fashion. We experiment with traditional and recent neural architectures, and propose two joint-learning models that use different multilingual language representations to transfer knowledge between pairs of languages. We also evaluate the impact of additional knowledge in our experiments by incorporating information from a multilingual lexicon of abusive words. The results show that our joint-learning models achieve the best performance on most languages. However, a simple approach that uses machine translation and a pre-trained English language model achieves robust performance. In contrast, multilingual BERT fails to achieve good performance in cross-lingual hate speech detection. We also found experimentally that external knowledge from a multilingual abusive lexicon is able to improve the models' performance, specifically in detecting the positive class. The results of our experimental evaluation highlight a number of challenges and issues in this particular task. One of the main challenges relates to current benchmarks for hate speech detection, in particular how bias stemming from the topical focus of the datasets influences classification performance. The insufficient ability of current multilingual language models to transfer knowledge between languages in the specific task of hate speech detection also remains an open problem. However, our experimental evaluation and our qualitative analysis show how the explicit integration of linguistic knowledge from a structured abusive language lexicon helps to alleviate this issue.
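The knowledge-injection idea described in the abstract (augmenting a classifier with signals from a multilingual lexicon of abusive words) can be sketched as follows. This is an illustrative sketch only: the lexicon entries, feature choices, and score-blending scheme below are hypothetical stand-ins, not the paper's actual resources or method.

```python
# Illustrative sketch of lexicon-based knowledge injection for
# cross-lingual hate speech detection. Lexicon entries and the
# blending scheme are hypothetical, not the paper's setup.

# Toy multilingual abusive lexicon: token -> abusiveness weight.
ABUSIVE_LEXICON = {
    "idiot": 1.0,    # English
    "idiota": 1.0,   # Italian / Spanish
    "cretino": 0.8,  # Italian (hypothetical weight)
}

def lexicon_features(tokens):
    """Return (hit_count, hit_ratio, max_weight) for a token list."""
    hits = [ABUSIVE_LEXICON[t] for t in tokens if t in ABUSIVE_LEXICON]
    if not tokens:
        return (0, 0.0, 0.0)
    return (len(hits), len(hits) / len(tokens), max(hits, default=0.0))

def inject_knowledge(model_score, tokens, alpha=0.7):
    """Blend a cross-lingual model's hate score with lexicon evidence.

    model_score: probability of the positive (hateful) class from any
    multilingual model; alpha controls how much the model is trusted.
    """
    _, ratio, max_w = lexicon_features(tokens)
    lexicon_score = min(1.0, ratio + 0.5 * max_w)
    return alpha * model_score + (1 - alpha) * lexicon_score
```

In a joint-learning setting the lexicon features would instead be fed into the model as extra input dimensions during training; the post-hoc blend above simply makes the intuition concrete: lexicon hits raise the positive-class score, which matches the abstract's observation that the external knowledge chiefly helps in detecting the positive class.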
