...
首页> 外文期刊>Neurocomputing >Stylistics analysis and authorship attribution algorithms based on self-organizing maps
【24h】

Stylistics analysis and authorship attribution algorithms based on self-organizing maps

机译:基于自组织图的文体分析和作者归因算法

获取原文
获取原文并翻译 | 示例
           

摘要

The style followed by authors can be thought of as a collection of attributes that defines the stylistics space. Texts from the same author tend to be similar in that space. However, the identification of stylistics spaces has proven to be challenging. Associated with the stylistics space is the authorship attribution task. On it, a text of unknown authorship is presented to a system, and the system is expected to identify the author of the text. Two modules define an authorship attribution algorithm: the stylistics space and a classifier. We present a methodology that includes both, a module that allows the identification of novel stylistics spaces, and a classifier to confront the authorship attribution task from the features that define space. The methodology imbricates feature selection, anomaly detection, classification, and visualization algorithms. We applied the capabilities of self-organizing maps not only for visualization but also for anomaly detection, which defines the basis of the classifier. We compared our authorship attribution algorithm with two existing ones. Our methodology achieved similar or better results under bag-of-words-related stylistics spaces, and it presented the lowest error under a novel stylistics space based on the rate of introduction of new words.
机译:作者所遵循的样式可以认为是定义样式空间的属性的集合。同一作者的文字在该领域往往相似。但是,事实证明,文体空间的识别具有挑战性。与文体空间相关联的是作者身份归属任务。在其上,未知作者的文本被呈现给系统,并且期望该系统识别文本的作者。有两个模块定义了作者身份归因算法:文体空间和分类器。我们提供了一种方法论,该方法论既包括允许识别新颖的文体空间的模块,又包括通过定义空间的特征来面对作者归属任务的分类器。该方法结合了特征选择,异常检测,分类和可视化算法。我们将自组织地图的功能不仅用于可视化,还用于异常检测,这定义了分类器的基础。我们将作者身份归因算法与现有的两个算法进行了比较。我们的方法在与词袋相关的风格空间下取得了相似或更好的结果,并且根据新单词的引入率,在新颖的风格空间下其误差最小。

著录项

  • 来源
    《Neurocomputing》 |2015年第5期|147-159|共13页
  • 作者单位

    Complex Systems Group, Universidad Autonoma de la Ciudad de Mexico, San Lorenzo 290, Mexico, D.F., Mexico,Institute for Molecular Medicine Finland, Tukholmankatu 5, 00270 Helsinki, Finland;

    Faculty of Telematics, Universidad de Colima, Mexico;

    CINVESTAV IDS, Mexico D.F., Mexico;

    Postgraduate Program in Complex Systems, Universidad Autonoma de la Ciudad de Mexico, Mexico;

    Faculty of Literary Creation, Universidad Autonoma de la Ciudad de Mexico, Mexico;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Computational stylistics; Authorship attribution; Self-organizing maps; Anomaly detection; Feature selection;

    机译:计算文体学;著作权归属;自组织地图;异常检测;功能选择;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号