A Contextual Bandit Approach To Personalized Online Recommendation Via Sparse Interactions

Posted on:2020-02-08

Degree:Master

Type:Thesis

Country:China

Candidate:C Y Zhang

Full Text:PDF

GTID:2428330575455102

Subject:Computer technology

Abstract/Summary:

PDF Full Text Request

Nowadays,Online recommendation has become a significant feature in many ap-plications.For better user experience and more efficient information searching,online recommendation has been a significant module in many service platforms,such as e-commerce,streaming services,news,music and so on.Various online recommendation algorithms have been proposed,also more and more researches have focus on gener-ating a better recommender model using all historical data.In practice,the interaction between the users and the recommender system might be sparse,which means that users are not always interacting with the recommender system while using the related appli-cation or webpage and this may affect the qualification of data used in the model.In some cases,some users prefer to sweep around the recommendation instead of clicking into the details.Also,some users may just leave the webpage open while doing other businesses,they don't focus on the content of the page and the recommendations are totally ignored.Therefore,a response of 0 may not necessarily be a negative response(i.e.not attractive),but a non-response(i.e.not observed).It comes worse to distin-guish these two situations when only one item is recommended to the user each time and few further information is reachable.Most existing recommendation strategies ignore the difference between non-responses and negative responses.They treat all historical records as valid ones and use them to update the user model.In fact,the historical data contains a large part of noisy data due to sparse interactions.Introducing this kind of noisy data into recommender model may have a bad effect on the model's accuracy.This paper tries to solve this problem.Therefore,this paper's main work contains:(1)We first provide evidence of sparsity of the interactions by analyzing a real dataset,Yahoo!R6A,and analyze the stability of user's short-term interests based on the similarity between user's recent clicked items.(2)Based on our findings in the data analysis,we propose a novel approach to make online recommendations via sparse interactions,named SAOR(Sparsity-Aware Online Recommender),for online recommendation.S AOR makes probabilistic estima-tions on whether the user is interacting or not based on the relations among the user's recent clicked items,by reasonably assuming that similar items are similarly attractive.The algorithm only regards the kind of data generated while the user is interacting with the system as valid feedbacks.SAOR uses positive and negative responses to build the user preference model,ignoring all non-responses.An upper confidence bound is used to direct the exploration-exploitation tradeoff.(3)Theoretical analyses on the regret bound of SAOR is also provided in this paper.By discussing the regret of one single step in different cases,the algorithm's upper regret bound is proved.Experiments on both real and synthetic datasets show that SAOR outperforms competing methods.SAOR can get larger cumulative rewards in all sub-periods and higher overall click ratio.Besides,the algorithm's update frequency is lower as it just updates the model when valid feedback is generated.The speed of learning and the sensitivity of parameters are also tested and the results show that SAOR learns faster and is more stable.

Keywords/Search Tags:

online recommendation, sparse interaction, contextual bandit

PDF Full Text Request

Related items

1	Research Of Recommendation Algorithm Based On Contextual Restless Multi-armed Bandit Model
2	A Study On Dynamic Recommendation Algorithms In Recommendation Systems
3	Research On The Algorithm Of Personalized Learning System Based On Multi-armed Bandit
4	Study On Relay Selection Algorithm Based On Multi-Armed Bandit In Underwater Acoustic Cooperative Communication Networks
5	Adaptive Linear Contextual Bandits For Online Recommendation
6	Research Of Online Multiclass Learning With "Bandit" Feedback Under A Confidence-weighted Approach
7	Research On Multi-armed Bandit Aided Online Learning Approach For Wireless Caching Strategy
8	Classification Of PolSAR Images Based On Sparse Representations And Contextual Information
9	Research On Point Of Interest Recommendation Algorithm Based On Contextual Information In LBSN
10	Image Super Resolution Based On Contextual Sparse Representation