Cold Start Problem
Cold Start Problem
Information Systems
journal homepage: www.elsevier.com/locate/infosys
a r t i c l e i n f o
abstract
Article history:
Received 15 September 2014
Received in revised form
6 October 2014
Accepted 7 October 2014
Recommended by D. Shasha
Available online 24 December 2014
The Recommender System (RS) is an efficient tool for decision makers that assists in the
selection of appropriate items according to their preferences and interests. This system has
been applied to various domains to personalize applications by recommending items such as
books, movies, songs, restaurants, news articles and jokes, among others. An important issue
for the RS that has greatly captured the attention of researchers is the new user cold-start
problem, which occurs when there is a new user that has been registered to the system and no
prior rating of this user is found in the rating table. In this paper, we first present a classification
that divides the relevant studies addressing the new user cold-start problem into three major
groups and summarize their advantages and disadvantages in a tabular format. Next, some
typical algorithms of these groups, such as MIPFGWC-CS, NHSM, FARAMS and HUFCF, are
described. Finally, these algorithms are implemented and validated on some benchmark RS
datasets under various settings of the new user cold start. The experimental results indicate
that NHSM achieves better accuracy and computational time than the relevant methods.
& 2014 Elsevier Ltd. All rights reserved.
Keywords:
Collaborative filtering
NHSM
New user cold start
Recommender systems
Contents
1.
2.
3.
4.
5.
Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 88
Literature review . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 88
The analysis of existing methods . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 91
3.1.
MIPFGWC-CS . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 91
3.2.
NHSM . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 92
3.3.
FARAMS . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 93
3.4.
HUFCF. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 94
Experiments. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 95
4.1.
Environment setup . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 95
4.2.
Results and discussion. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 95
Conclusions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 102
http://dx.doi.org/10.1016/j.is.2014.10.001
0306-4379/& 2014 Elsevier Ltd. All rights reserved.
88
Acknowledgments . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 103
Appendix A Supporting information. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 103
References . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 103
1. Introduction
The growing development of content-based systems
that provide a large amount of data, such as videos,
images, blogs, multimedia, and wikis, brings great challenges for analysts attempting to extract useful knowledge
and capture meaningful events from the massive data.
Machine learning tools should indeed be oriented to what
users intend to do and how they want the results to be
returned in a given format. An efficient tool that assists
decision makers to choose appropriate items according to
their preferences and interests and that is currently widely
used is the Recommender System (RS). Ricci et al. [31]
defined the RS as a special type of information system that
(i) helps to make choices without sufficient personal
experience of the alternatives, (ii) suggests products to
customers, and (iii) provides consumers with information
to help them decide which products to purchase. The RS is
based on a number of technologies, such as information
filtering, classification learning, user modeling and adaptive hypermedia, and it is applied to various domains to
personalize applications by recommending items such as
books, movies, songs, restaurants, news articles and
jokes, among others. It has been applied to e-commerce
to learn from a customer and recommend products
that he or she will find most valuable from among the
available products, thus helping the customer find suitable
products to purchase. Some e-commerce RSs are named as
follows [37,22]. For example, Amazon.com is the most
famous e-commerce RS, structured with an information
page for each book while providing details of the text and
purchase information. Two recommendations are found
herein, including books frequently purchased by customers who purchased the selected book and authors whose
books are frequently purchased. eBay.com is another
example that provides the Feedback Profile feature, which
allows both buyers and sellers to contribute to the feedback profiles of other customers with whom they have
done business. The feedback consists of a satisfaction
rating and a specific comment about the other customer.
On Moviefinder.com, customers can locate movies with a
similar mood, theme, genre or cast through Match Maker
or by their previously indicated interests through WePredict. We clearly recognize that RSs are becoming important and with increasing influence on various practical
applications.
An important issue for RSs that has greatly captured the
attention of researchers is the cold-start problem. This problem has two variants: the new user cold-start problem and
the new item cold-start problem. The new item cold-start
problem occurs when there is a new item that has been
transferred to the system. Because it is a new product, it has
no user ratings (or the number of ratings is less than a
threshold as defined in some equivalent papers) and is
therefore ranked at the bottom of the recommended items
list. Moreover, this problem can be partially handled by staff
2. Literature review
The beginning of this section starts with an example
that clearly demonstrates the new user cold-start problem.
Example 1. We have a RS that includes three tables: the
users' demographic data (Table 1), the movies' information
(Table 2) and the rating (Table 3). This type of system is able
to predict the user rating of a movie, which is expressed in
Table 3. Nonetheless, the new user cold-start problem
occurs with a new user, e.g., Kim (User ID: 6) in Table 1,
who has no prior rating such that it is difficult to provide a
prediction for the first movie, e.g., Titanic (ID: 1).
Name
Age
Gender
Occupation
1
2
3
4
5
6
John
David
Jenny
Marry
Tom
Kim
23
30
29
20
30
25
Male
Male
Male
Female
Male
Female
Student
Doctor
Student
Engineer
Engineer
Doctor
Table 2
Movies' information.
ID
Name
Genre
Date
Sales
1
2
3
Titanic
Hulk
Scallet
Romantic
Horror
Romantic
9/2004
10/2005
6/2009
150
300
200
Table 3
Rating data.
User ID
Movie ID
Rating
1
1
2
2
2
3
3
4
5
5
6
2
3
1
2
3
2
1
2
2
3
1
4
2
4
3
1
2
1
3
3
2
?
89
Almazro et al. [3] introduced a hybrid demographicbased and collaborative filtering approach on the movie
domain using demographic data to enhance the recommendation suggestion process. Their method classified
the genres of movies based on demographic attributes,
e.g., user age (child, teenager or adult), student (yes or
no), have children (yes or no) and gender (female or
male). Preisach et al. [28] argued that many user
profiles contain untagged resources that could provide valuable information, especially for the cold-start
problem, and proposed a purely graph-based semisupervised relational approach that uses untagged
posts. Said et al. [34,36] modified the user similarity
calculation method to employ the hybridization of
demographic and collaborative approaches. A modification to the k-nearest neighborhood that calculates
the similarity scores between the target user and other
users was introduced. Wang et al. [50] introduced
Credible and co-clustering filterBot for cold-stArt recommendations (COBA), which uses the rating confidence
level to reduce the dimensionality of the itemuser
matrix. The items and users were co-clustered, and the
ratings within every user cluster were smoothed to
overcome data sparsity. The recommendations were
fused from item and user clusters to predict user
preference. Zhang et al. [52] proposed the Cold-start
Recommendations Using Collaborative Filtering (CRUC)
scheme, which involves formulation, filtering and prediction steps. They assumed that users are tracked by
sensors such that each user has their own location,
which is currently regarded as the item. The itemuser
matrix was normalized and clustered to identify users
who have a significant influence on the recommendation. The prediction steps were performed by taking the
hybrid between the item-based and user-based filtering
methods. Chen et al. [8] employed additional information, such as the social sub-community and an ontology
decision model, to assist the recommendation in the
cold-start problem. The social sub-community was
divided according to the exiting users' history data
and the mining relationship between each other. An
ontology decision model was then constructed on the
basis of sub-community and users' static information,
which makes recommendations for the new user based
on his static ontology information. Guo [11] proposed
three different approaches from the perspective of
preference modeling. First, the ratings of trusted neighbors were merged to form a new rating profile for the
active users based on which better recommendations
can be generated. Second, a novel Bayesian similarity
measure was introduced by taking both the direction
and length of rating vectors into account. Third, a new
information source called prior ratings, based on virtual
product experience in virtual reality environments, was
proposed to inherently resolve the concerned problems. Chen et al. [7] proposed a cold start recommendation method for the new user that integrates a user
model with trust and distrust networks to identify trustworthy users, which are then aggregated to provide
useful recommendations for new users. Demographic
90
91
Table 4
Groups of methods for the new user cold-start problem.
Group
Ideas
Typical
algorithms
Advantages
Disadvantages
Makes use of
additional data
sources
# MIPFGWCCS [46]
# Determination of
analogous users is
more accurate
# NHSM
[20]
# The similarity
degrees between
users are enhanced
# Additional data are
not required
Enhances the
prediction using
hybrid methods
# FARAMS
[17]
# HUFCF
[42]
# Specification of values of
parameters is hard
# Irrelevant users are still included in
the computation of similarities
92
the new user has no prior rating, the demographic data are
the only medium to calculate the similarities between
users. After finding users similar to the new one,
MIPFGWC-CS checks whether they rated the considered
item or not. If ratings are found, then they are considered
to be the representative ratings of users. Otherwise, a
similar item to the considered one is found by the Pearson
coefficient, and the rating on the similar item is assumed
to be the representative rating. Finally, the rating of the
new user to the considered item is approximated by the
weighted average operator of the representative ratings.
Fig. 1 and Table 5 illustrate the idea in detail.
As described above, the MIPFGWC-CS algorithm contains several disadvantages, as follows:
a) Determining the optimal number of clusters for
MIPFGWC is required before running the clustering
algorithm. Although other parameters of MIPFGWC
were suggested by Son et al. [39], how to determine
the optimal number of clusters is still an on-going topic
of research. The exact number of clusters would lead to
more accurate results for finding the similar users to a
new user and thus enhance the prediction accuracy.
b) In Fig. 1, finding a similar item to the considered one by
the Pearson coefficient could somehow not achieve
good results because the Pearson metric has some
limitations where there is a poor signal-to-noise ratio
and negative spikes. In other words, if the relationship
between two variables is non-linear, the Pearson coefficient cannot accurately measure the correlation.
c) The MIPFGWC-CS relies solely on the demographic
data (Fig. 1). If this type of data is not available, the
algorithm cannot be performed.
3.2. NHSM
Liu et al. [20] introduced a new similarity metric called
NHSM to replace the traditional Pearson coefficient or the
cosine similarity measure. This heuristic similarity measure is composed of three factors of similarity, which are
Proximity, Significance and Singularity. Proximity considers the distance between two ratings. Significance shows
that the ratings are more significant if the two ratings are
more distant from the median rating. Singularity represents how the two ratings are different from other ratings.
Furthermore, NHSM integrates the modified Jaccard and
the user rating preference in the design. The definition of
NHSM is stated below.
simu; vNHSM simu; vJPSS simu; vURP ;
1
!
";
simu; vURP 1 #
1 exp # ju #v jj u # v j
simu; vJaccard
simu; vPSS
jI u \ I v j
;
jI u j ' jI v j
!
"
Proximity r u;p ; r v;p
pAI
10
11
!
"
!
"
'Signif icance r u;p ; r v;p Singularity r u;p ; r v;p ;
12
!
"
Proximity r u;p ; r v;p 1 #
!
"
Signif icance r u;p ; r v;p
1
!
";
1 exp #jr u;p #r v;p j
13
1
!
";
1 exp # jr u;p # r med jjr v;p #r med j
14
93
Table 5
The pseudo-code of the MIPFGWC procedure.
Input
Geo-demographic data X. The number of elements (clusters) NC. The dimension of dataset r. Threshold and other parameters m; ; , ai
i 1; 3, j j 1; C . Geographic parameters ; ; ; a; b; c; d.
Output Final membership values u0 and centers V t 1
k
MIPFGWC
1:
Set the number of clusters C, threshold 4 0 and other parameters such as m; ; 41, ai 4 0 i 1; 3, j j 1; C as in [39]
2:
Initialize centers of clusters V j , j 1; C at t 0
3:
4:
hkj
5:
6:
7:
8:
! 2
& #1
C %
P
X k # V j
i1
t kj
X k # V i
k 1; N ;
& 1 ;
2 #1
a2 X k # V j
j
j 1; C ;
X k # V i
3
k 1; N ;
j 1; C :
a1 um
kj a2 t kj a3 hkj ' X k
k1
Vj
& ; j 1; C :
N %
P
a1 um
a2 t kj a3 hkj
kj
7
k1
If the difference jjV t 1 # V t jj r , then stop the algorithm. Otherwise, assign V t V t 1 and return to Step 4.
!
"
Singularity r u;p ; r v;p 1 #
1
#":
! #r r
1 exp # # u;p 2 v;p #p #
15
3.3. FARAMS
Leung et al. [17] integrated fuzzy sets theory into
association rule mining techniques and applied the proposed work to the collaborative filtering of recommender
systems. First, the rating data are converted to the transactional database of association rule mining, fuzzified by
FShA [ B;X [ Y i
;
FShA;X i
CovhA;X i;hB;Y i
CORRhA;X i;hB;Y i q;
Var hA;X i nVar hB;Y i
17
18
94
19
20
FShA;X i2
ti A T
o2
aj A A t i aj )
jT j
21
)
*
In Eq. (16), A; X represents an Itemset, FuzzySet. t i aj ) is
the value of aj in the ith record of the transactional
database T. t i aj ) is the membership value of t i aj ).
Eqs. (1618) provide the formulas of Fuzzy Supports, Fuzzy
Confidence and Correlation, respectively. The limitations of
the FARAMS algorithm are as follows:
3.4. HUFCF
The basic concept of the HUFCF method [42] is to
integrate the fuzzy similarity degrees between users based
on the demographic data, with the hard user-based degrees
calculated from the rating histories integrated into the final
95
"
Fig. 4. The HUFCF algorithm.
"
"
NHSM
FARAMS
HUFCF
0.701
0.641
0.821a
0.636a
0.899
0.697
0.895
0.818a
1.040a
0.878
1.091
0.903
1.098
5.36a
12.56a
31.88
44.94
436.58
362.47
0.866
time (s)
1335.49
"
MIPFGWC-CS
s
"2
1 X!
p #r u;i :
N u;i u;i
23
4. Experiments
4.1. Environment setup
96
Fig. 5. The MAE and RMSE values of algorithms on the MovieLens dataset.
Fig. 6. The MAE and RMSE values of algorithms on the Jester dataset.
97
MAE
MIPFGWC-CS
2
3
4
5
6
7
8
9
10
Fold
2
3
4
5
6
7
8
9
10
Fold
2
3
4
5
6
7
8
9
10
a
NHSM
0.804
0.758a
0.782
0.679a
0.798
0.659a
0.705
0.687
0.692
0.684
0.708
0.665
0.723
0.632a
0.693
0.615
0.672
0.591a
RMSE
MIPFGWC-CS
NHSM
1.203
1.072a
1.045
0.945a
0.998
0.909a
0.990
0.858a
0.972
0.963
0.806a
0.982
0.882
0.822a
0.856
0.885
0.824
0.801a
Computational time (s)
MIPFGWC-CS
NHSM
963.26
4.3a
1123.2
5.6a
1254.2
6.3a
1321.2
6.2a
1345.4
6.8a
1235.2
7.7a
1543.6
8.0a
1537.7
10.3a
1843.4
10.9a
FARAMS
HUFCF
0.790
0.744
0.765
0.658a
0.707
0.641a
0.648
0.605a
0.603
0.806
0.793
0.731
0.698
0.672a
0.652
0.641
0.625
0.608
FARAMS
1.138
1.025
0.986
0.972
0.840a
0.880
0.941
0.859
0.845
HUFCF
1.163
1.032
0.969
0.912
0.893
0.886
0.842
0.843a
0.840
FARAMS
48.3
49.6
51.6
54.3
58.5
60.8
65.2
76.8
80.3
HUFCF
345.32
356.70
489.63
478.34
498.43
552.18
603.22
643.43
668.98
Table 8
The results of k-fold cross validation on the Jester dataset.
Fold
MAE
MIPFGWC-CS
2
3
4
5
6
7
8
9
10
Fold
2
3
4
5
6
7
8
9
10
Fold
2
3
4
5
6
7
8
9
10
a
NHSM
a
0.898
0.844a
0.825a
0.887
0.814a
0.795
0.747a
0.745
0.703a
RMSE
MIPFGWC-CS
NHSM
1.203a
1.102a
1.003a
0.993a
0.987a
1.002
0.982
0.972
0.897a
Computational time (s)
MIPFGWC-CS
NHSM
12.4a
13.4a
15.2a
15.6a
17.2a
18.4a
18.9a
18.5a
18.4a
FARAMS
HUFCF
0.903
0.878
0.853
0.835a
0.819
0.784a
0.793
0.742a
0.723
1.123
1.102
1.097
1.002
0.992
0.947
0.909
0.832
0.808
FARAMS
1.304
1.134
1.145
1.091
0.989
0.963a
0.942a
0.923a
0.912
HUFCF
1.534
1.432
1.269
1.101
0.994
0.982
0.985
0.934
0.915
FARAMS
75.2
76.0
79.4
85.6
88.2
91.8
93.8
96.9
100.3
HUFCF
304.5
334.2
365.6
398.0
415.6
405.9
425.3
420.6
489.5
"
98
Fig. 7. The MAE values of algorithms by the number of folds on the MovieLens dataset.
Fig. 8. The RMSE values of algorithms by the number of folds on the MovieLens dataset.
99
Fig. 9. The average MAE values of algorithms by the number of folds on MovieLens.
Fig. 10. The average RMSE values of algorithms by the number of folds on MovieLens.
reserved for the testing set. All users in the testing set are
the new user cold-starts, and all their rated items in the
testing set are cleared. Applying the experimental algorithms to the dataset, we calculate the MAE, RMSE and the
computational time of the algorithms. We continue to
randomly divide the original dataset into 5 parts and
perform similar tasks until exceeding 5 repetitions of
division. The final MAE, RMSE and the computational time
of algorithms are calculated to find the average results of
the 5 isolated division times. Through this cross validation
method, we can assume that the generated new user cold-
100
Fig. 11. The MAE values of algorithms by the number of folds on the Jester dataset.
Fig. 12. The RMSE values of algorithms by the number of folds on the Jester dataset.
101
Fig. 13. The average MAE values of algorithms by the number of folds on Jester.
Fig. 14. The average RMSE values of algorithms by the number of folds on Jester.
102
Fig. 15. The average computational time of algorithms on both datasets (s).
5. Conclusions
In this paper, we concentrated on the new user coldstart problem that negatively affects the recommender
performance due to the inability of the recommender
systems to produce meaningful recommendations. A comparative review of the relevant studies addressing the new
user cold-start problem was performed according to three
groups, namely, (i) makes use of additional data sources,
(ii) chooses the most prominent groups of analogous users,
and (iii) enhances the prediction using hybrid methods. We
also discussed the advantages and disadvantages of these
groups and noted the typical algorithm(s) of each group.
Details of the typical algorithms along with their theoretical
analyses, such as MIPFGWC-CS [46], NHSM [20], FARAMS
[17] and HUFCF [42], were examined. An experimental
validation on the benchmark recommender systems datasets, namely, MovieLens and Jester, under various settings
of the new user cold-start was performed. The experimental results, which were presented in tables and bar chart
figures, revealed the efficiencies of the algorithms in terms
of accuracy and the computational time.
Through the above results and discussion, our findings
in this article can be summarized as follows. First, NHSM is
the most effective algorithm among all of the investigated
methods in terms of both accuracy and computational
time. The MAE and RMSE values of the NHSM algorithm
are approximately 0.663 70.143 and 0.915 70.1, respectively. The computational time of NHSM is approximately
11 s. Second, the descending order of the algorithms in
terms of accuracy is NHSM, FARAMS, HUFCF and
MIPFGWC-CS. Third, all algorithms, especially NHSM, are
stable according to various cross-validation methods, such
Acknowledgments
The authors are greatly indebted to the anonymous
reviewers for their comments and their valuable suggestions, which improved the quality and clarity of this paper.
Other thanks are sent to Ms. Hoang Thi Thu Huong, FPT
and Mr. Donald B. Samuel, WHO for the language editing.
This work is sponsored by the NAFOSTED under Contract
no. 102.05-2014.01.
Appendix A. Supporting information
Supplementary data associated with this article can be
found in the online version at http://dx.doi.org/10.1016/j.is.
2014.10.001.
[9]
[10]
[11]
[12]
[13]
[14]
[15]
[16]
[17]
[18]
[19]
[20]
[21]
[22]
References
[1] M. Aharon, et al., OFF-set: one-pass factorization of feature sets for
online recommendation in persistent cold start settings, in: Proceedings of the 7th ACM Conference on Recommender systems,
2013, pp. 375378.
[2] H.J. Ahn, A new similarity measure for collaborative filtering to
alleviate the new user cold-starting problem, Inf. Sci. 178 (1) (2008)
3751.
[3] D. Almazro, G. Shahatah, L. Albdulkarim, M. Kherees, R. Martinez,
W. Nzoukou, A Survey Paper on Recommender Systems, 2010,
arXiv:1006.5278.
[4] J. Basiri, A. Shakery, B. Moshiri, M.Z. Hayat, Alleviating the cold-start
problem of recommender systems using a new hybrid approach, in:
Proceedings of the 5th IEEE International Symposium on Telecommunications (IST 2010), 2010, pp. 962967.
[5] J. Bobadilla, F. Ortega, A. Hernando, J. Bernal, A collaborative filtering
approach to mitigate the new user cold start problem, Knowl.-Based
Syst. 26 (2012) 225238.
[6] W. Carrer-Neto, M.L. Hernndez-Alcaraz, R. Valencia-Garca,
F. Garca-Snchez, Social knowledge-based recommender system.
Application to the movies domain, Expert Syst. Appl. 39 (12) (2012)
1099011000.
[7] C.C. Chen, Y.H. Wan, M.C. Chung, Y.C. Sun, An effective recommendation method for cold start new users using trust and distrust
networks, Inf. Sci. 224 (2013) 1936.
[8] M. Chen, C. Yang, J. Chen, P. Yi, A method to solve cold-start problem in
recommendation system based on social network sub-community and
[23]
[24]
[25]
[26]
[27]
[28]
[29]
[30]
103
104