首页> 外文学位 >Who's Blogging Now?: Linguistic Features and Authorship Analysis in Sports Blogs.
【24h】

Who's Blogging Now?: Linguistic Features and Authorship Analysis in Sports Blogs.

机译:现在谁在写博客?:体育博客中的语言功能和作者分析。

获取原文
获取原文并翻译 | 示例

摘要

The field of authorship determination, previously largely falling under the umbrella of literary analysis but recently becoming a large subfield of forensic linguistics, has grown substantially over the last two decades. As its body of research and its record of successful forensic application continue to grow, this growth is paralleled by the demand for its application. However, methods which have undergone rigorous testing to show their reliability and replicability, allowing them to meet the strict Daubert criteria put forth by the US court system, have not truly been established.;In this study, I set out to investigate how a list of parameters, many commonly used in the methodologies of previous researchers, would perform when used to test documents of bloggers from a sports blog, Winging It in Motown. Three prolific bloggers were chosen from the site, and a corpus of posts was created for each blogger which was then examined for each of the chosen parameters. One test document for each of the three bloggers which was not included in that blogger's corpus was then chosen from the blog page, and these documents were examined for each of the parameters via the same methodologies as were used to examine the corpora. Once data for the corpora and all three test documents was obtained, the results were compared for similarity, and an author determination was made for each test document along each parameter.;The findings indicated that overall the parameters were quite unsuccessful in determining authorship for these test documents based on the author corpora developed for the study. Only two parameters successfully identified the authors of the test documents at a rate higher than chance, and the possibility exists that other factors may be driving these successful identifications, demanding further research to confirm their validity as parameters for the purpose of authorship work.
机译:在过去的二十年中,作者身份确定的领域以前在很大程度上属于文学分析的范畴,但最近已成为法证语言学的一个重要子领域。随着其研究机构和成功的法证应用记录不断增长,这种增长与应用的需求并驾齐驱。但是,还没有真正建立经过严格测试以显示其可靠性和可复制性,使其符合美国法院系统提出的严格的道伯特准则的方法。在本研究中,我着手研究如何列出参数,通常用于以前的研究人员的方法中,当用于测试来自体育博客Motown的Winging It的博客文档时,其性能会得到改善。从该站点选择了三个多产的博客作者,并为每个博客作者创建了一个文章语料库,然后针对每个所选参数对其进行检查。然后从博客页面中为三个博客作者的一个测试文档选择了一个测试文档,该文档未包含在该博客作者的语料库中,并且通过与用于检查语料库的方法相同的方法对这些文档的每个参数进行了检查。一旦获得了语料库和所有三个测试文档的数据,就比较了结果的相似性,并根据每个参数对每个测试文档进行了作者确定。研究结果表明,总体而言,这些参数在确定这些文档的作者身份方面相当不成功基于为该研究开发的作者语料库的测试文档。只有两个参数成功地以比偶然率高的速度确定了测试文档的作者,并且存在其他因素可能推动这些成功的标识的可能性,因此需要进行进一步的研究以确认其作为参数的有效性,以用于作者工作。

著录项

  • 作者

    Cox, Taylor.;

  • 作者单位

    Arizona State University.;

  • 授予单位 Arizona State University.;
  • 学科 Linguistics.
  • 学位 Ph.D.
  • 年度 2017
  • 页码 172 p.
  • 总页数 172
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号