Machine Learning Algorithms For Recommender System - A Comparative Analysis
Machine Learning Algorithms For Recommender System - A Comparative Analysis
Abstract: Recommendation system is one of the most popular applications of Artificial Intelligence which attracts many researchers all
over the globe. The advent of the Internet era has brought wide implementation of recommendation system in our everyday lives. There
are many machine learning techniques which can be used to realize the recommendation system. Among all these techniques we are
dealing with Content Based Filtering, Collaborative Based Filtering, Hybrid Content-Collaborative Based Filtering, k-mean clustering
and Naive Bayes classifier. We have exploited these algorithms to their extreme in order to achieve the best possible precision and have
presented a comprehensive comparative analysis. The strength of all these algorithms can be clearly realized by the significant
enhancement in the accuracy, depicted by the experimental analysis taking cold start problem into consideration.
Keywords: Recommender System, Classifier, Content Based, Collaborative Based, Cluster, Correlation.
1. INTRODUCTION In this paper, we have used the MovieLens dataset[8]. All the
Recommendation system[10] is an application which is used above algorithms deal with this dataset in order to recommend
for prediction in various domains throughout the internet. A the movies and calculate the precision along with tackling the
large amount of data flows through the internet and it gives cold-start problem[3]. Cold-start problem is one of the most
away a lot of information regarding the user searching activity. commonly encountered challenges of the recommendation
The information extracted from the pattern of previously system. It is also known as the new user problem as it creates
searched data can be molded into the prediction of relevant data problem of generating recommendations for the new user. We
for the user[1]. The implementation of the system can be have divided this analysis into various sections. Section II
performed by various techniques. In this paper, we have describes the different state-of-the-art techniques for the
discussed Content Based Filtering, Collaborative Filtering[10], recommendation system. Section III gives the experimental
Hybrid Content-Collaborative Based Filtering, k-mean results for all these techniques. Section IV concludes the study.
clustering Based and Naive-Bayes Classifier based techniques. Section V describes the future work that we propose.
www.ijcat.com 97
International Journal of Computer Applications Technology and Research
Volume 6Issue 2, 97-100, 2017, ISSN:-23198656
www.ijcat.com 98
International Journal of Computer Applications Technology and Research
Volume 6Issue 2, 97-100, 2017, ISSN:-23198656
9. Find similarity (simi) w.r.t s, where i = 1 to p. recommendations (T), the ones which are also present in the
converted movies are the correct recommendations(tc).
10. Select highest simi user.
Precision = (tc / T ) * 100
11. select m' s of highest simi and s' of ith user.
For all the experiments, we are taking value of = 5 and value
12. Calculate weight W (me') where e m' of k = 10.
13. Return top weight recommendations.
15. end
Content Based 18.45 18.66 19.10
8. end for
Fig. 1. Precision Comparison
4 CONCLUSION
3 EXPERIMENTAL RESULT
All the algorithms described in this paper are compared with
We now illustrate the analysis of the experiments performed
respect to their precision rates. This comprehensive analysis
and provide a comparison of all the state-of-the-art methods
depicts the strength and the weakness of each one of them in
described above. To compare their accuracy we have used the
different versions of the MovieLens dataset. The experiments
MovieLens dataset of 10K, 50K and 100K. The dataset varies
performed are the witness of the sparsity handling by these
in sparsity. For example, the 100K MovieLens dataset has
algorithms. Our experiments have shown promising results and
100K ratings, 943 users and 1682 movies of 19 different
this paper conforms that out of all these approaches Naive
genres. The analysis of these algorithms is demonstrated based
Bayes gives the best precision.
on precision measure. For each test user, we convert 30% of the
users seen movies into unseen movies and apply the
algorithms described above. Out of the total number of
www.ijcat.com 99
International Journal of Computer Applications Technology and Research
Volume 6Issue 2, 97-100, 2017, ISSN:-23198656
www.ijcat.com 100