Siamese networks for large-scale author identification

Chakaveh Saedi; Mark Dras

首页> 外文期刊>Computer speech and language >Siamese networks for large-scale author identification

【24h】

Siamese networks for large-scale author identification

机译：暹罗网络大型作者识别

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Authorship attribution is the process of identifying the author of a text. Approaches to tackling it have been conventionally divided into classification-based ones, which work well for small numbers of candidate authors, and similarity-based methods, which are applicable for larger numbers of authors or for authors beyond the training set; these existing similarity-based methods have only embodied static notions of similarity. Deep learning methods, which blur the boundaries between classification-based and similarity-based approaches, are promising in terms of ability to learn a notion of similarity, but have previously only been used in a conventional small-closed-class classification setup.Siamese networks have been used to develop learned notions of similarity in one-shot image tasks, and also for tasks of mostly semantic relatedness in NLP. We examine their application to the stylistic task of authorship attribution on datasets with large numbers of authors, looking at multiple energy functions and neural network architectures, and show that they can substantially outperform previous approaches.

机译：作者归属是识别文本作者的过程。解决它的方法通常分为基于分类的，这对于少量候选作者来说，以及基于相似性的方法，适用于较大数量的作者或除了培训集之外的作者;这些现有的基于相似性的方法仅具有相似性的静态概念。模糊基于分类和相似性的方法之间的边界的深度学习方法在学习相似性概念的能力方面具有很大的承诺，但是先前只用于传统的小型级别分类设置.SIAMESE网络已被用来在单次图像任务中开发知识的相似概念，以及NLP中主要是语义相关性的任务。我们将其应用于具有大量作者的数据集的作者归因的体型任务，观察多个能量功能和神经网络架构，并表明它们可以大大倾向于以前的方法。

著录项

来源
《Computer speech and language》 |2021年第11期|101241.1-101241.15|共15页
作者
Chakaveh Saedi; Mark Dras;
展开▼
作者单位

Department of Computing Macquarie University Sydney Australia;

Department of Computing Macquarie University Sydney Australia;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词
Author identification; Text categorisation; Siamese neural network;

机译：作者识别;文本分类;暹罗神经网络;

相似文献

外文文献
中文文献
专利

1. Incorporating large-scale atmospheric variables in long-term seasonal rainfall forecasting using artificial neural networks: an application to the Ping Basin in Thailand [J] . M.S.Babel, T.A.J.G.Sirisena, N.Singhrattna Nordic hydrology . 2017,第3a4期

机译：使用人工神经网络将大型大气变量纳入长期季节性降雨预测中：在泰国坪盆地的应用
2. Distortive Effects of Initial-Based Name Disambiguation on Measurements of Large-Scale Coauthorship Networks [J] . Jinseok Kim, Jana Diesner Journal of the American Society for Information Science and Technology . 2016,第6期

机译：初始名称歧义化对大规模共同作者网络度量的扭曲效应
3. Exploiting citation networks for large-scale author name disambiguation [J] . Christian Schulz, Amin Mazloumian, Alexander M Petersen, EPJ Data Science . 2014,第1期

机译：利用引文网络大规模消除作者姓名歧义
4. MVB: A Large-Scale Dataset for Baggage Re-Identification and Merged Siamese Networks [C] . Zhulin Zhang, Dong Li, Jinhua Wu, Chinese conference on pattern recognition and computer vision . 2019

机译：MVB：用于行李重新识别和合并的暹罗网络的大规模数据集
5. Structure identification and optimal design of large-scale networks of dynamical systems. [D] . Lin, Fu. 2012

机译：大型动力系统网络的结构辨识与优化设计。
6. Author Correction: Large-scale intact glycopeptide identification by Mascot database search [O] . Ravi Chand Bollineni, Christian Jeffrey Koehler, Randi Elin Gislefoss, -1

机译：作者更正：通过Mascot数据库搜索进行大规模完整糖肽鉴定
7. Siamese networks for large-scale author identification [O] . Chakaveh Saedi, Mark Dras 2021

机译：SIDESE网络用于大型作者识别

Siamese networks for large-scale author identification

摘要

著录项

相似文献

相关主题

期刊订阅