Be Careful about Poisoned Word Embeddings: Exploring the Vulnerability of the Embedding Layers in NLP Models

机译：注意中毒词嵌入式：探索NLP模型中嵌入层的脆弱性

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Recent studies have revealed a security threat to natural language processing (NLP) models, called the Backdoor Attack. Victim models can maintain competitive performance on clean samples while behaving abnormally on samples with a specific trigger word inserted. Previous backdoor attacking methods usually assume that attackers have a certain degree of data knowledge, either the dataset which users would use or proxy datasets for a similar task, for implementing the data poisoning procedure. However, in this paper, we find that it is possible to hack the model in a data-free way by modifying one single word embedding vector, with almost no accuracy sacrificed on clean samples. Experimental results on sentiment analysis and sentence-pair classification tasks show that our method is more efficient and stealthier. We hope this work can raise the awareness of such a critical security risk hidden in the embedding layers of NLP models.

机译：最近的研究揭示了对自然语言处理（NLP）模型的安全威胁，称为后门攻击。受害者模型可以在清洁样本上保持竞争性能，同时在具有插入特定触发单词的样本上表现异常。以前的后门攻击方法通常假设攻击者具有一定程度的数据知识，该数据集可以使用或代理数据集进行类似的任务，用于实现数据中毒过程。然而，在本文中，我们发现通过修改一个单词嵌入向量，可以以无数据的方式破解模型，几乎没有在清洁样品上牺牲的准确度。关于情感分析和句子分类任务的实验结果表明，我们的方法更有效，悄悄。我们希望这项工作能够提高隐藏在NLP模型的嵌入层中这种关键安全风险的意识。

著录项

来源
《Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies》|2021年|2048-2058|共11页
会议地点
作者
Wenkai Yang; Lei Li; Zhiyuan Zhang; Xuancheng Ren; Xu Sun; Bin He;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. AraVec: A set of Arabic Word Embedding Models for use in Arabic NLP [J] . Abu Bakr Soliman, Kareem Eissa, Samhaa R. El-Beltagy Procedia Computer Science . 2017,第11期

机译：AraVec：一组用于阿拉伯语NLP的阿拉伯语单词嵌入模型
2. Word Embedding Models for Finding Semantic Relationship between Words in Tamil Language [J] . S. G. Ajay, M. Srikanth, M. Anand Kumar, Indian Journal of Science and Technology . 2016,第45期

机译：查找泰米尔语单词之间语义关系的单词嵌入模型
3. Toward meaningful notions of similarity in NLP embedding models [J] . Abel Elekes, Adrian Englhardt, Martin Schaeler, International journal on digital libraries . 2020,第2期

机译：对NLP嵌入模型中的有意义的相似概念
4. Integrating extra knowledge into word embedding models for biomedical NLP tasks [C] . Yuan Ling, Yuan An, Mengwen Liu, International Joint Conference on Neural Networks . 2017

机译：将额外的知识整合到生物医学NLP任务的单词嵌入模型中
5. Multilingual model using cross-lingual word embeddings based on subword alignment and cross-task projection利用統計を見る [D] . Sakuma Jin 2019

机译：使用基于子词对齐和跨任务投影的跨语言词嵌入的多语言模型
6. A Word on Words in Words: How Do Embedded Words Affect Reading? [O] . Joshua Snell, Jonathan Grainger, Mathieu Declerck 2018

机译：单词中的单词：嵌入式单词如何影响阅读？
7. BioReddit: Word Embeddings for User-Generated Biomedical NLP [O] . Marco Basaldella, Nigel Collier 2019

机译：BioredDit：用于用户生成的生物医学NLP的Word Embeddings
8. Exploring New RF Circuit Structures with Embedded Patterned Substrate Layers. [R] . Melde, K. L. 2013

机译：利用嵌入式图案化基板层探索新的射频电路结构。

Be Careful about Poisoned Word Embeddings: Exploring the Vulnerability of the Embedding Layers in NLP Models

摘要

著录项

相似文献

相关主题

期刊订阅