Prediction of User Retweets Based on Social Neighborhood Information and Topic Modelling



Abstract

Twitter and other social networks have become a fundamental source of information and a powerful tool for spreading ideas and opinions. A crucial step in understanding the mechanisms that drive information diffusion on Twitter is to study how a user's social neighborhood shapes her retweeting preferences. In particular, we ask to what extent the preferences of a user can be predicted from the preferences of her neighborhood. We build our own sample graph of Twitter users and study the problem of predicting retweets from a given user based on the retweeting behavior occurring in her second-degree social neighborhood (followed and followed-by-followed). We train and evaluate user-centered binary classification models that predict retweets with an average F1 score of 87.6%, based purely on social information, that is, without analyzing the content of the tweets. For users who get low scores with such models (on a tuning dataset), we improve the results by adding features extracted from the content of tweets. To do so, we apply a Natural Language Processing (NLP) pipeline that includes a Twitter-specific adaptation of the Latent Dirichlet Allocation (LDA) probabilistic topic model.
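The social-only setup described in the abstract can be illustrated with a minimal sketch. It assumes, since the abstract does not say so, that each tweet is encoded as one binary feature per neighbor (1 if that neighbor retweeted it). The function names, the toy follow graph, and the toy retweet data are all hypothetical, and the classifier itself is left out (the keywords mention SVM). The sketch only shows how a second-degree neighborhood, a feature vector, and an F1 score might be computed.

```python
from typing import Dict, List, Set


def second_degree_neighborhood(follows: Dict[str, Set[str]], user: str) -> Set[str]:
    """Users followed by `user`, plus users followed by those (excluding `user`)."""
    first = follows.get(user, set())
    second: Set[str] = set()
    for followed in first:
        second |= follows.get(followed, set())
    return (first | second) - {user}


def feature_vector(tweet_id: str, neighbors: List[str],
                   retweets: Dict[str, Set[str]]) -> List[int]:
    """Assumed encoding: one binary feature per neighbor (1 = neighbor retweeted)."""
    return [1 if tweet_id in retweets.get(n, set()) else 0 for n in neighbors]


def f1_score(y_true: List[int], y_pred: List[int]) -> float:
    """F1 = harmonic mean of precision and recall for the positive class."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Hypothetical toy data: who follows whom, and which tweets each user retweeted.
follows = {"ana": {"bob", "cat"}, "bob": {"dan"}, "cat": {"ana", "eve"}}
retweets = {"bob": {"t1", "t2"}, "dan": {"t1"}, "eve": {"t3"}}

# Fix a stable neighbor order so feature positions are consistent across tweets.
neighbors = sorted(second_degree_neighborhood(follows, "ana"))
features_t1 = feature_vector("t1", neighbors, retweets)
```

In a user-centered setup, one such feature matrix would be built per user, with a label per tweet indicating whether that user retweeted it. Any binary classifier could then be trained on it and scored with `f1_score`.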


Bibliographic details

Source
Mexican International Conference on Artificial Intelligence | 2017 | pp. 146-157 | 12 pages
Authors
Pablo Gabriel Celayes; Martin Ariel Dominguez
Original format: PDF
Keywords
Retweet prediction; Social model; Social network analysis; Machine learning; LDA; SVM


Similar literature


1. C-RBFNN: A user retweet behavior prediction method for hotspot topics based on improved RBF neural network [J]. Liu Yanbing, Zhao Jinzhe, Xiao Yunpeng. Neurocomputing. 2018, issue JANa31
2. A user-based aggregation topic model for understanding user's preference and intention in social network [J]. Shi Lei, Song Guangjia, Cheng Gang. Neurocomputing. 2020, issue Nova6
3. Multilevel learning based modeling for link prediction and users' consumption preference in Online Social Networks [J]. Sharma Pradip Kumar, Rathore Shailendra, Park Jong Hyuk. Future Generation Computer Systems. 2019, issue APRa
4. Prediction of User Retweets Based on Social Neighborhood Information and Topic Modelling [C]. Pablo Gabriel Celayes, Martin Ariel Dominguez. Mexican International Conference on Artificial Intelligence. 2018
5. Topic Modeling Location-Based Social Media Applications [D]. Osailan, Sarah. 2020
6. Machine Learning-Based Nicotine Addiction Prediction Models for Youth E-Cigarette and Waterpipe (Hookah) Users [O]. Jeeyae Choi, Hee-Tae Jung, Anastasiya Ferrell. 2021
7. Retweet Prediction Based on User Behavior [O]. Syeda Nadia Firdaus. 2021
