Clustering based online learning in recommender systems: A bandit approach

Abstract

A major challenge in designing and implementing large-scale online services is determining which items to recommend to users. For instance, Netflix recommends movies, Amazon recommends products, and Yahoo! recommends webpages. In these systems, items are recommended based on the characteristics and circumstances of the users, which are provided to the recommender as contexts (e.g., search history, time, and location). Building an efficient recommender system is challenging because both the item space and the context space are very large. Existing works focus on a large item space without contexts, focus on a large context space with a small number of items, or consider the item space and the context space jointly to solve the online recommendation problem. In contrast, we develop an algorithm that performs exploration and exploitation in the context space and the item space separately, and that combines clustering of the items with information aggregation in the context space. Given a user's context, our algorithm aggregates its past history over a ball centered on that context, whose radius decreases at a rate that allows sufficiently accurate estimates of the payoffs, so that the recommended payoffs converge to the true (unknown) payoffs. Theoretical results show that our algorithm can achieve a learning regret that is sublinear in time, where the regret is the payoff difference between the oracle optimal benchmark, in which the preferences of users for certain items in certain contexts are known, and our algorithm, in which the information is incomplete. Numerical results show that our algorithm significantly outperforms the existing algorithms in terms of regret (by over 48%).
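The ball-based aggregation described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' algorithm: item clustering is omitted, and the radius schedule `t ** (-alpha)`, the `min_samples` exploration threshold, and the synthetic payoff model in `simulate` are all assumptions chosen only for illustration. The sketch shows per-item payoff estimates averaged over past contexts that fall inside a shrinking ball around the current context, followed by an explore-or-exploit decision, with cumulative regret measured against an oracle that knows the expected payoffs.

```python
import math
import random


def ball_estimates(history, context, radius, n_items):
    """Average past payoffs per item over contexts within `radius` of `context`."""
    sums = [0.0] * n_items
    counts = [0] * n_items
    for past_context, item, payoff in history:
        if math.dist(past_context, context) <= radius:
            sums[item] += payoff
            counts[item] += 1
    means = [s / c if c else 0.0 for s, c in zip(sums, counts)]
    return means, counts


def recommend(history, context, t, n_items, alpha=0.25, min_samples=3):
    """Choose an item in round t (1-indexed) from ball-aggregated estimates."""
    radius = t ** (-alpha)  # assumed shrinking schedule, not taken from the paper
    means, counts = ball_estimates(history, context, radius, n_items)
    under_sampled = [i for i in range(n_items) if counts[i] < min_samples]
    if under_sampled:
        # Explore: try the item with the fewest samples inside the ball.
        return min(under_sampled, key=lambda i: counts[i])
    # Exploit: pick the item with the highest estimated payoff.
    return max(range(n_items), key=lambda i: means[i])


def simulate(n_rounds=500, n_items=3, seed=0):
    """Run the sketch on a hypothetical payoff model and return cumulative regret."""
    rng = random.Random(seed)

    def expected_payoff(context, item):
        # Hypothetical model: item i pays best near context (i + 0.5) / n_items.
        center = (item + 0.5) / n_items
        return 1.0 - abs(context[0] - center)

    history = []
    regret = 0.0
    for t in range(1, n_rounds + 1):
        context = (rng.random(),)  # one-dimensional context in [0, 1)
        item = recommend(history, context, t, n_items)
        payoff = 1.0 if rng.random() < expected_payoff(context, item) else 0.0
        history.append((context, item, payoff))
        # Regret: the oracle's expected payoff minus that of the chosen item.
        best = max(expected_payoff(context, i) for i in range(n_items))
        regret += best - expected_payoff(context, item)
    return regret
```

Calling `simulate()` returns the cumulative regret of one run. The linear scan over the full history keeps the sketch short; a practical implementation would index past contexts spatially.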


Bibliographic details

Source
IEEE International Conference on Acoustics, Speech and Signal Processing | 2014 | pp. 4528-4532 | 5 pages
Authors
Song Linqi; Tekin Cem; van der Schaar Mihaela
Original format
PDF
Keywords
Recommender systems; clustering algorithms; multi-armed bandit; online learning

Similar literature

1. A Novel Approach in Clustering Supervised Learning (CSL) for Recommender Systems [J]. K. A. Balasubramaniam, M. Chidambaram. International Journal of Simulation: Systems, Science and Technology, 2018.
2. Detecting Group Shilling Attacks in Online Recommender Systems Based on Bisecting K-Means Clustering [J]. Zhang Fuzhi, Wang Shilei. IEEE Transactions on Computational Social Systems, 2020, No. 5.
3. Web usage mining based recommender systems using implicit heterogeneous data: A Particle Swarm Optimization based clustering approach [J]. Shafiq Alam, Gillian Dobbie, Yun Sing Koh. Web Intelligence and Agent Systems, 2014, No. 4.
4. Clustering Based Online Learning in Recommender Systems: A Bandit Approach [C]. Linqi Song, Cem Tekin, Mihaela van der Schaar. IEEE International Conference on Acoustics, Speech and Signal Processing, 2014.
5. Expanding learning and social interaction through intelligent systems design: Implementing a reputation and recommender system for the Claremont Conversation Online [D]. Thoms, Brian. 2009.
6. An approach on the implementation of full batch, online and mini-batch learning on a Mamdani based neuro-fuzzy system with center-of-sets defuzzification: Analysis and evaluation about its functionality, performance, and behavior [O]. Sukey Nakasima-López, Juan R. Castro, Mauricio A. Sanchez. 2012.
7. Combination of Clustering, Classification Association Rule based Approach for Course Recommender System in E-learning [O]. Sunita B. Aher, Lobo L.M.R.J. 2012.
