Annual Meeting of the Association for Computational Linguistics

Towards Robust and Privacy-preserving Text Representations



Abstract

Written text often provides sufficient clues to identify the author, their gender, age, and other important attributes. Consequently, the authorship of training and evaluation corpora can have unforeseen impacts, including differing model performance for different user groups, as well as privacy implications. In this paper, we propose an approach to explicitly obscure important author characteristics at training time, such that representations learned are invariant to these attributes. Evaluating on two tasks, we show that this leads to increased privacy in the learned representations, as well as more robust models to varying evaluation conditions, including out-of-domain corpora.

