首页> 外文会议>IEEE International Conference on Big Data >A Streaming Machine Learning Framework for Online Aggression Detection on Twitter
【24h】

A Streaming Machine Learning Framework for Online Aggression Detection on Twitter

机译:在Twitter上的在线侵袭检测的流媒体机器学习框架

获取原文

摘要

The rise of online aggression on social media is evolving into a major point of concern. Several machine and deep learning approaches have been proposed recently for detecting various types of aggressive behavior. However, social media are fast paced, generating an increasing amount of content, while aggressive behavior evolves over time. In this work, we introduce the first, practical, real-time framework for detecting aggression on Twitter via embracing the streaming machine learning paradigm. Our method adapts its ML classifiers in an incremental fashion as it receives new annotated examples and is able to achieve the same (or even higher) performance as batch-based ML models, with over 90% accuracy, precision, and recall. At the same time, our experimental analysis on real Twitter data reveals how our framework can easily scale to accommodate the entire Twitter Firehose (of 778 million tweets per day) with only 3 commodity machines. Finally, we show that our framework is general enough to detect other related behaviors such as sarcasm, racism, and sexism in real time.
机译:在社交媒体上的在线侵略的兴起正在发展成为一个重要的关注点。最近提出了几种机器和深度学习方法,用于检测各种类型的侵略性行为。然而,社交媒体快节奏,产生越来越多的内容,而侵略性行为随着时间的推移而发展。在这项工作中,我们介绍了通过拥抱流动机器学习范式来检测Twitter上的第一个,实际的实时框架,用于检测Twitter上的攻击。我们的方法以增量方式适应其ML分类器,因为它收到新的注释示例,并且能够实现与基于批处理的ML型号相同的(或甚至更高的)性能,具有超过90%的精度,精度和召回。与此同时,我们对真实推特数据的实验分析显示,我们的框架如何轻松扩展,以容纳整个Twitter Firehose(每天77800万推文),只有3种商品。最后,我们表明我们的框架足够一般,无法实时检测讽刺,种族主义和性别歧视等其他相关行为。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号