WeChat Toxic Article Detection: A Data-Driven Machine Learning Approach

机译：微信有毒物品检测：一种数据驱动的机器学习方法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Recently, toxic information detection has attracted tremendous amounts of research interest because of the popularity of social networks and the widespread of toxic information which may have dire consequences to the public. Existing work extensively studies toxic article detection in open social networks from information diffusion perspective. However, in closed social networks as exemplified by WeChat Moments (WM), the diffusion process is uneasily visible. To tackle the toxic article detection problem in closed social networks, in this paper we empirically study the articles spread in WM which is based on the largest Chinese social platform WeChat. In particular, we systematically analyze users' behavior and text information of normal and toxic articles and identify a striking difference between them. Furthermore, we design a new model named MAT-LSTM which can well capture the impact of different kinds of text information. To improve the performance of automatic toxic article detection, we propose XMATL framework which is enhanced from MAT-LSTM and can utilize text information and users' behavior characteristics in a holistic manner. We conduct extensive experiments using two real-world datasets and demonstrate that our proposed model can effectively detect toxic articles in WM and achieve outstanding performance gain over the classic methods.

机译：近来，由于社交网络的普及以及有毒信息的广泛传播，有毒信息的检测已经引起了巨大的研究兴趣，这可能对公众造成可怕的后果。现有工作从信息传播的角度广泛研究了开放式社交网络中有毒物品的检测。但是，在以微信矩（WM）为代表的封闭式社交网络中，扩散过程不可见。为了解决封闭式社交网络中的有毒物品检测问题，本文以中国最大的社交平台微信为基础，对WM中传播的文章进行了实证研究。特别是，我们系统地分析用户的正常行为和有毒物品的行为和文字信息，并找出它们之间的显着差异。此外，我们设计了一个名为MAT-LSTM的新模型，该模型可以很好地捕获各种文本信息的影响。为了提高有毒物品自动检测的性能，我们提出了XMATL框架，该框架是从MAT-LSTM增强而来的，可以全面利用文本信息和用户的行为特征。我们使用两个现实世界的数据集进行了广泛的实验，并证明了我们提出的模型可以有效地检测WM中的有毒物品，并且比经典方法具有出色的性能提升。

著录项

来源
《Asia-Pacific Signal and Information Processing Association Annual Summit and Conference》|2018年|916-921|共6页
会议地点
作者
Yunpeng Weng; Muhong Wu; Xu Chen; Qiong Wu; Lingnan He; Liang Chen;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Feature extraction; Analytical models; Twitter; Machine learning; Distribution functions; Privacy;

机译：特征提取;分析模型; Twitter;机器学习;分布函数;隐私;

相似文献

外文文献
中文文献
专利

1. Machine learning based fog computing assisted data-driven approach for early lameness detection in dairy cattle [J] . Computers and Electronics in Agriculture . 2020,第期

机译：基于机器学习的雾气计算辅助数据驱动方法在奶牛早期跛行检测
2. Solar farm voltage anomaly detection using high-resolution μPMU data-driven unsupervised machine learning [J] . Dey Maitreyee, Rana Soumya Prakash, V. Simmons Clarke, Applied Energy . 2021,第Deca1期

机译：太阳能电压电压异常检测使用高分辨率μpmu数据驱动的无监督机器学习
3. Data-driven symbol detection via model-based machine learning [J] . NARIMAN FARSAD, NIR SHLEZINGER, ANDREA J. GOLDSMITH, Communications in information and systems . 2020,第3期

机译：通过基于模型的机器学习的数据驱动符号检测
4. WeChat Toxic Article Detection: A Data-Driven Machine Learning Approach [C] . Yunpeng Weng, Muhong Wu, Xu Chen, Asia-Pacific Signal and Information Processing Association Annual Summit and Conference . 2018

机译：微信有毒物品检测：数据驱动的机器学习方法
5. Position Falsification Detection in VANET with Consecutive BSM Approach Using Machine Learning Algorithm [D] . Sharma, Aekta. 2021

机译：使用机器学习算法的连续BSM方法在VANET中定位伪造检测
6. Predicting outcomes in older ED patients with influenza in real time using a big data-driven and machine learning approach to the hospital information system [O] . Tian-Hoe Tan, Chien-Chin Hsu, Chia-Jung Chen, 2021

机译：使用大数据驱动和机器学习方法在医院信息系统中实时预测较旧的ED患者患者的结果
7. Machine learning based fog computing assisted data-driven approach for early lameness detection in dairy cattle [O] . Mohit Taneja, John Byabazaire, Nikita Jalodia, 2020

机译：基于机器学习的雾气计算辅助数据驱动方法在奶牛早期跛行检测

WeChat Toxic Article Detection: A Data-Driven Machine Learning Approach

摘要

著录项

相似文献

相关主题

期刊订阅