Movie Recommendation System Using Cosine Similarity and KNN: II. Related Work
Movie Recommendation System Using Cosine Similarity and KNN: II. Related Work
Abstract—Over the past years, the internet has broadened the Recommendation systems are primarily using three
horizon of various domains to interact and share meaningful approaches [6]. In content-based filtering, we do profiling
information. As it is said that everything has its pros and cons based on what type of content any user is interested in and
therefore, along with the expansion of domain comes
using the collected information, it recommends items.
information overload and difficulty in extraction of data. To
Another one is collaborative filtering, where we make
overcome this problem the recommendation system plays a vital
role. It is used to enhance the user experience by giving fast and clusters of similar users and use that information to make
coherent suggestions. This paper describes an approach which recommendations. Hybrid systems are the one which takes
offers generalized recommendations to every user, based on into account both above stated approaches to deal with
movie popularity and/or genre. Content-Based Recommender operational data more concisely [7]. Our goal is to provide
System is implemented using various deep learning approaches. accurate recommendations with less computational
This paper also gives an insight into problems which are faced in complexity.
content-based recommendation system and we have made an
effort to rectify them.
II. RELATED WORK
Keywords: Recommendation System, Content-Based
Recommender System, Deep learning Some of the common approaches of recommender
system are:
I. INTRODUCTION
1. Content-based filtering
Advancement in technology is reaching new heights every 2. Collaborative filtering
day and due to which we can see enormous growth in 3. Hybrid filtering
information. To deal with such large data we use machine A. Content Based Filtering
learning that automates analytical model building [1]. The
early classification of machine learning is divided into three This approach filters the items based on the likings of the
broad categories: Supervised learning, Unsupervised user. It gives result based on what the user has rated earlier.
learning and Reinforcement learning [2]. We use computers The method to model this approach is the Vector Space
to make predictions to help us achieve better results using Model (VSM). It derives the similarity of the item from its
various computational statistics. Tasks can be performed description and introduces the concept of TF-IDF (Term
without being explicitly programmed to do so [3]. It Frequency-Inverse Document Frequency) [28].
becomes a tedious task to extract the relevant information.
Search engines solve the problem to some extent but it does
not solve the personalization problem. Recommendation Tf(t)=
System framework plays a vital role in today’s internet
surfing, be it buying a product from an e-commerce site or
watching a movie on some video-on-demand service [4]. In If(t)= 10
our everyday life, we depend on recommendations given by
other people either by word of mouth or reviews of general
surveys. People often use recommender systems over the The similarity between item vectors can be computed by
web to make decisions for the items related to their choice. three methods:
Recommendation systems are software tools and techniques 1. Cosine similarity
whose goal is to make useful and sensible recommendations 2. Euclidian distance
to a collection of users for items or products that might 3. Pearson’s correlation
interest them [5]. In other words, the recommender system COSINE SIMILARITY
or recommendation systems belongs to a class of
information filtering system that aims at predicting the Cosine similarity among two objects measures the angle of
‘preference’ or ‘rating’ given to an item. cosine between the two objects. It compares two documents
on a normalized scale. It can be done by finding the dot
Revised Manuscript Received on May 29.2020. product between the two identities.
* Correspondence Author
Gaurav Srivastav*, Inderprastha Engineering College, AKTU
Ramni Harbir Singh, Inderprastha Engineering College, AKTU
Sargam Maurya, Inderprastha Engineering College, AKTU Tanisha
Tripathi, Inderprastha Engineering College, AKTU Tushar Narula,
Inderprastha Engineering College, AKTU
Published By:
Retrieval Number: E9666069520/2020©BEIESP
Blue Eyes Intelligence Engineering
DOI: 10.35940/ijeat.E9666.069520
& Sciences Publication
Journal Website: www.ijeat.org 556 © Copyright: All rights reserved.
Movie Recommendation System using Cosine Similarity and KNN
IV. CONCLUSION
We have illustrated the modelling of a movie
recommendation system by making the use of content-based
filtering in the movie recommendation system. The KNN
algorithm is implemented in this model along with the
principle of cosine similarity as it gives more accuracy than
the other distance metrics and the complexity is
comparatively low too
Recommendations systems have become the most essential
fount of a relevant and reliable source of information in the
world of internet. Simple ones consider one or a few
Fig: To implement Cosine similarity we take an example parameters while the more complex ones make use of more
of 2 movies of different genre adventure and comedy parameters to filter the results and make it more user
friendly. With the inclusion of advanced deep learning and
other filtering techniques like collaborative filtering and
hybrid filtering a strong movie recommendation system can
be built. This can be a major step towards the further
development of this model as it will not only become more
efficient to use but also increase the business value even
further.
REFERENCES
1. Gediminas Adomavicius and Alexander Tuzhilin. Toward the next
generation of recommender systems: A survey of the state-of-the-art
Fig: Cosine similarity and possible extensions. Knowledge and Data Engineering, IEEE
The angle theta between the two movies will determine the Transactions on, 17(6):734–749, 2005.
similarity between the two movies. The theta ranges from 0- 2. Ricardo Baeza-Yates, Berthier Ribeiro-Neto, et al. Modern
information retrieval, volume 463. ACM Press New York, 1999
1. If the value of the theta is near 1 then it is most similar 3. ShumeetBaluja, Rohan Seth, D Sivakumar, Yushi Jing, Jay Yagnik,
and if it's near to 0 then it is least similar. The movie will be Shankar Kumar, Deepak Ravichandran, and Mohamed Aly. Video
recommended if it is close to 1 otherwise there would be no suggestion and discovery for youtube: taking random walks through
the view graph. In Proceedings of the 17th international conference on
similarity between them. It will recommend the best movies World Wide Web, pages 895–904. ACM, 2008.
to the user according to the Cosine similarity. After the 4. Xu Hailing, Wu Xiao, Li Xiaodong, and Yan Baoping.
cosine similarity, we have used a normalised popular score Comparison study of internet recommendation system. Journal of
Software, 20(2):350–362, 2009.
through which we get our function of computing distance. 5. T. E. D. Mining, “Enhancing teaching and learning through
Then by using the KNN functionality, we have found the educational data mining and learning analytics: An issue brief,” in
nearest neighbour which will be recommended to the user. Proceedings of a conference on advanced technology for
education, 2012.
6. Nakagawa and T. Ito, “An implementation of a knowledge
recommendation system based on similarity among users’
profiles,” in Sice 2002. roceedings of the Sice Conference, 2002,
pp. 326–327 vol.1.
7. T. K. Quan, I. Fuyuki, and H. Shinichi, “Improving the accuracy
of recommender system by clustering items based on the stability
of user similarity,” in International Conference on Computational
Intelligence for Modelling Control and Automation, 2006, p. 61.
Published By:
Retrieval Number: E9666069520/2020©BEIESP
Blue Eyes Intelligence Engineering
DOI: 10.35940/ijeat.E9666.069520
& Sciences Publication
Journal Website: www.ijeat.org 558 © Copyright: All rights reserved.
Movie Recommendation System using Cosine Similarity and KNN