Predicting Buying Behavior Using CPT+: A Case Study of An E-Commerce Company
All content following this page was uploaded by Thon-Da Nguyen on 27 July 2021.
RESEARCH ARTICLE
a Faculty of Information Systems, University of Economics and Law, VNU-HCM, Ho Chi Minh, Vietnam; b Faculty of Information Technology, Posts and Telecommunications Institute of Technology, Hanoi, Vietnam; c Faculty of Information Systems, University of Economics and Law, VNU-HCM, Ho Chi Minh, Vietnam
1. INTRODUCTION
Predicting customer behaviour is key to the business success of an enterprise. Companies therefore need analytic tools to forecast customer behaviour, especially the products that customers are likely to buy next. Such forecasts can help enterprises sell more products, increase income, and contribute to the sustainable development of the company. One helpful way to solve this problem is data mining.
Data mining is the discovery of structures and patterns in large and complex data sets [1]. There are seven common strategies that companies are interested in: (1) Market segmentation, which helps to identify the common characteristics of customers who buy the same products from a company; (2) Customer churn, which helps to predict which customers are likely to leave the company and go to a competitor; (3) Fraud detection, which helps to identify which transactions are most likely to be fraudulent; (4) Direct marketing, which helps to determine which prospects should be included in a mailing list to obtain the highest response rate; (5) Interactive marketing, which helps to predict what each individual accessing a Web site is most likely interested in; (6) Market basket analysis, which helps to understand what products or services are commonly purchased together, e.g., beer and diapers; (7) Trend analysis, which helps to reveal the difference between a typical customer this month and last.
Among these seven strategies, trend analysis has been a hot topic in recent years. This article introduces a way to predict buying behaviour (e.g., predicting the next products bought by customers from the shopping cart of an e-commerce company).
For instance, if a customer has chosen some products Px, Py, Pz, in that order, one may want to predict the next product that the user will select, to help merchants improve the performance of their business. Various models have been proposed for making predictions. In our experiments, however, CPT+ (Compact Prediction Tree Plus) performed better than the other methods compared. This model uses sequential data mining to predict the next items in a sequence database and is an improved version of CPT (Compact Prediction Tree) [2].
The rest of this paper is organised as follows. Sections 2, 3, 4, and 5 present, respectively, the problem definition and related work, the CPT+ algorithm, the experimental results, and the conclusion.
2. RELATED WORK
Many models have been proposed to predict the next item, such as Association Rules [3], Markov models, and so on.
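
To make the next-item prediction task concrete, the following is a minimal sketch of a first-order Markov baseline, one of the model families mentioned above. It is not CPT+ and not the authors' implementation; the function names and the toy sequences are assumptions introduced only for illustration.

```python
from collections import Counter, defaultdict


def train_first_order(sequences):
    """Count how often each item is immediately followed by each other item."""
    transitions = defaultdict(Counter)
    for seq in sequences:
        for current, nxt in zip(seq, seq[1:]):
            transitions[current][nxt] += 1
    return transitions


def predict_next(transitions, prefix, k=1):
    """Return up to k most frequent successors of the last item in prefix."""
    if not prefix:
        return []
    followers = transitions.get(prefix[-1], Counter())
    return [item for item, _ in followers.most_common(k)]


# Hypothetical purchase sequences (illustrative data, not from the paper).
training = [
    ["Px", "Py", "Pz", "Pw"],
    ["Py", "Pz", "Pw"],
    ["Px", "Pz", "Pv"],
]
model = train_first_order(training)
prediction = predict_next(model, ["Px", "Py", "Pz"])
```

Because this baseline looks only at the last item of the prefix, it ignores the earlier items Px and Py entirely, which is the kind of information loss that motivates tree-based models such as CPT and CPT+.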
*Address correspondence to this author at the Faculty of Information
Systems, University of Economics and Law, VNU-HCM, Ho Chi Minh,
Vietnam; E-mail: [email protected]
The research in [4] proposed a product prediction model using a decision tree classifier and a main classification method [5].
A typical paper using Association Rules is [6], whose authors utilised the association rule model, in particular the Apriori algorithm, for product prediction. Clustering with the k-means algorithm was also used. The data collected for that study consist of 42 products with two main attributes: time in the week and required quantity. The study proposes predictions of the range of products in a certain period.
However, there are a few drawbacks to using the candidate-and-test approach. Because of this approach, many valid candidates could be ignored. Besides, a large number of the candidates may not be of interest.
According to the authors of [7], models using Markov chains have a significant limitation: although they are widely used for sequence prediction, they assume that sequences are Markovian.
... has chosen some products Px, Py, Pz, in that order, one may want to predict the next product that will be chosen by that user, to help merchants improve the performance of their business. The data analysed in this paper come from the buying history of customers on an e-commerce website. In particular, data from a shopping cart are converted into a sequence database to make predictions.
Table 1. Comparison among models for sequence prediction.
Prediction Models, Loss Information, Lossless Information
Markov X
Clustering X
Rules X
CPT X X
CPT+ X X
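
The conversion of shopping-cart data into a sequence database mentioned above can be sketched roughly as follows. The paper's actual procedure is not shown in this excerpt, so this is only one plausible reading: the row layout (CusID, ProdID, and an ordering value) and the function name are assumptions, with CusID and ProdID borrowed from the paper's abbreviation list.

```python
from collections import defaultdict


def carts_to_sequences(rows):
    """Group cart rows by customer and order each group's products by time."""
    by_customer = defaultdict(list)
    for cus_id, prod_id, added_at in rows:
        by_customer[cus_id].append((added_at, prod_id))
    # Sorting the (added_at, prod_id) pairs restores the purchase order.
    return {
        cus_id: [prod_id for _, prod_id in sorted(items)]
        for cus_id, items in by_customer.items()
    }


# Hypothetical cart rows: (CusID, ProdID, position in which the item was added).
cart_rows = [
    (1, 30, 2),
    (2, 10, 1),
    (1, 10, 1),
    (1, 20, 3),
    (2, 40, 2),
]
sequence_db = carts_to_sequences(cart_rows)
```

Each resulting entry is one ordered product sequence per customer, which is the input shape that sequence prediction models expect.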
<3110, 12, 827, 652, 355> 124 981, 947, 1023, 979, 398
<9247, 50, 776, 652, 590> 211 947, 827, 1017, 919, 884
<1058, 941, 77, 112, 625, 430> 223 981, 1011, 626, 627, 628
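
As a rough illustration of how rows like these can be summarised, the sketch below tallies how often each product appears among the first few predictions of each row. It uses only the three rows shown here, so its counts are not the paper's 22-case totals; the middle column of each row is ignored because its meaning is not stated in this excerpt, and the function name and the `top_n` parameter are assumptions.

```python
from collections import Counter

# The three (purchased sequence, predicted products) rows shown above.
rows = [
    ([3110, 12, 827, 652, 355], [981, 947, 1023, 979, 398]),
    ([9247, 50, 776, 652, 590], [947, 827, 1017, 919, 884]),
    ([1058, 941, 77, 112, 625, 430], [981, 1011, 626, 627, 628]),
]


def tally_predictions(rows, top_n=3):
    """Count how often each product occurs in the top_n predictions of a row."""
    counts = Counter()
    for _, predicted in rows:
        counts.update(predicted[:top_n])
    return counts


counts = tally_predictions(rows)
most_frequent = counts.most_common(2)
```

On these three rows, products 981 and 947 each appear twice among the top three predictions, and every other product appears once.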
The sequence <3110, 12, 827, 652, 355> shows that five products (3110, 12, 827, 652, 355) were bought by a specific customer. Using CPT+, the next five products are predicted (981, 947, 1023, 979, 398), where product 981 is the best-predicted product, followed by 947. The third and fourth predicted products are 1023 and 979, and the last is 398.
Similarly, for the sequence <1058, 941, 77, 112, 625, 430>, the predicted website templates are 981, 1011, 626, 627, 628. This means that a customer who buys website templates 1058, 941, 77, 112, 625, and 430 could next buy website templates such as 981, 1011, and so on.
As shown in Table 3, the products that appear among the top 3 predicted products are 947 (6 times in 22 prediction cases), 981 (5 times), 904 (3 times), 1011 (3 times), 211 (3 times), and 827 (3 times). Thus, customers are likely to choose products such as 947, 981, 904, 1011, 211, and 827 after choosing a sequence of other products. Among these, two products are favourites: 947 (6 of the 22 considered cases) and 981 (5 of the 22 considered cases). In other words, the company should concentrate on investing in products like 947 and 981.
5. DISCUSSION
This work contributed an algorithm to convert a relational database (here, a shopping cart) into a sequence database in order to predict the next products. After a few purchasing activities by users, trends in their behaviour emerge. This is essential for businesses that build recommender systems to benefit from product purchases.
However, the data collected in this work came from our partner company. For this reason, the results may only apply to enterprises whose shopping cart model is similar to that of the Smart Work company.
CONCLUSION
This article proposes an approach for predicting products in online business. In particular, we presented a procedure to convert shopping cart data into a sequence database (ShopSDB) and applied CPT+ to next-product prediction at an e-commerce company. We also show that CPT+ is useful for making predictions on this new dataset. Experimental results on the same dataset show that CPT+, in this case, performs much better than the other sequence prediction methods (CPT, All-K-Order Markov, DG, LZ78, PPM, and TDAG).
CURRENT & FUTURE DEVELOPMENTS
Currently, we are working on improving the execution time of the proposed method. In the future, we plan another project to further improve the accuracy of CPT+ for product purchase prediction by combining various novel approaches. We also plan to explore product prediction on large-scale systems such as Spark [22], Hadoop, or cloud-based systems.
LIST OF ABBREVIATIONS
AKOM = All-k-order Markov
CFS = Compressing Frequent Substrings
CPT = Compact Prediction Tree
CPT+ = Compact Prediction Tree Plus
CSB = Compressing Simple Branches
CTW = Context Tree Weighting
CusID = CustomerID
DG = Dependency Graph
PPM = Prediction by Partial Matching
PST = Probabilistic Suffix Tree
ProdID = ProductID
PRN = Prediction with improved Noise Reduction
LZ78 = J. Ziv and A. Lempel (1978)
TDAG = Transition Directed Acyclic Graph
VNU-HCM = Vietnam National University Ho Chi Minh City
CONSENT FOR PUBLICATION
Not applicable.
AVAILABILITY OF DATA AND MATERIALS
Not applicable.
FUNDING
None.
CONFLICT OF INTEREST
The authors declare no conflict of interest, financial or otherwise.
ACKNOWLEDGEMENTS
Declared none.
Predicting Buying Behavior using CPT+. Recent Advances in Computer Science and Communications, XXXX, Vol. XX, No. XX
REFERENCES
[1] P-N. Tan, M. Steinbach, and V. Kumar, Introduction to data mining. Pearson Education: India, 2016.
[2] T. Gueniche, P. Fournier-Viger, and V.S. Tseng, "Compact Prediction Tree: A Lossless Model for Accurate Sequence Prediction", ADMA, no. 2, pp. 177-188, 2013.
http://dx.doi.org/10.1007/978-3-642-53917-6_16
[3] R. Agrawal, and R. Srikant, "Fast algorithms for mining association rules", Proc. 20th int. conf. very large databases, VLDB, vol. 1215, 1994, pp. 487-499.
[4] C. Cumby, A. Fano, R. Ghani, and M. Krema, "Predicting customer shopping lists from point-of-sale purchase data",
Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining, 2004, pp. 402-409.
http://dx.doi.org/10.1145/1014052.1014098
[5] A. Carlson, C. Cumby, J. Rosen, and D. Roth, "The SNoW learning architecture", Technical report, UIUCDCS, 1999.
[6] R. Ismail, Z. Othman, and A.A. Bakar, "Associative prediction model and clustering for product forecast data", 2010 10th International Conference on Intelligent Systems Design and Applications, 2010, pp. 1459-1464.
http://dx.doi.org/10.1109/ISDA.2010.5687116
[7] T. Gueniche, P. Fournier-Viger, R. Raman, and V.S. Tseng, "CPT+: Decreasing the time/space complexity of the Compact Prediction Tree", Pacific-Asia Conference on Knowledge Discovery and Data Mining, 2015, pp. 625-636.
http://dx.doi.org/10.1007/978-3-319-18032-8_49
[8] P. Fournier-Viger, R. Nkambou, and V.S-M. Tseng, "RuleGrowth: mining sequential rules common to several sequences by pattern-growth", Proceedings of the 2011 ACM symposium on applied computing, 2011, pp. 956-961.
http://dx.doi.org/10.1145/1982185.1982394
[9] P. Fournier-Viger, U. Faghihi, R. Nkambou, and E.M. Nguifo, "CMRules: Mining sequential rules common to several sequences", Knowl. Base. Syst., vol. 25, no. 1, pp. 63-76, 2012.
http://dx.doi.org/10.1016/j.knosys.2011.07.005
[10] P. Fournier-Viger, T. Gueniche, S. Zida, and V.S. Tseng, "ERMiner: sequential rule mining using equivalence classes", International Symposium on Intelligent Data Analysis, 2014, pp. 108-119.
http://dx.doi.org/10.1007/978-3-319-12571-8_10
[11] W. Tian, B. Choi, and V.V. Phoha, "An adaptive web cache access predictor using neural network", International Conference on Industrial, Engineering and Other Applications of Applied Intelligent Systems, 2002, pp. 450-459.
http://dx.doi.org/10.1007/3-540-48035-8_44
[12] J. Cleary, and I. Witten, "Data compression using adaptive coding and partial string matching", IEEE Trans. Commun., vol. 32, no. 4, pp. 396-402, 1984.
http://dx.doi.org/10.1109/TCOM.1984.1096090
[13] V. Padmanabhan, and J. Mogul, "Using Prefetching to Improve World Wide Web Latency", Comput. Commun., vol. 16, pp. 358-368, 1998.
[14] J. Pitkow, and P. Pirolli, "Mining longest repeating subsequences to predict world wide web surfing", Proc. Usenix Symp. on Internet Technologies and Systems, 1999, p. 1.
[15] P. Laird, and R. Saul, "Discrete sequence prediction and its applications", Mach. Learn., vol. 15, no. 1, pp. 43-68, 1994.
http://dx.doi.org/10.1007/BF01000408
[16] J. Ziv, and A. Lempel, "Compression of individual sequences via variable-rate coding", IEEE Trans. Inf. Theory, vol. 24, no. 5, pp. 530-536, 1978.
http://dx.doi.org/10.1109/TIT.1978.1055934
[17] R. Begleiter, R. El-Yaniv, and G. Yona, "On prediction using variable-order Markov models", J. Artif. Intell. Res., vol. 22, pp. 385-421, 2004.
http://dx.doi.org/10.1613/jair.1491
[18] F.M. Willems, Y.M. Shtarkov, and T.J. Tjalkens, "The context-tree weighting method: basic properties", IEEE Trans. Inf. Theory, vol. 41, no. 3, pp. 653-664, 1995.
http://dx.doi.org/10.1109/18.382012
[19] K. Gopalratnam, and D.J. Cook, "Online sequential prediction via incremental parsing: The active lezi algorithm", IEEE Intell. Syst., vol. 22, no. 1, pp. 52-58, 2007.
http://dx.doi.org/10.1109/MIS.2007.15
[20] P. Fournier-Viger, "The SPMF open-source data mining library version 2", Joint European conference on machine learning and knowledge discovery in databases, 2016, pp. 36-40.
http://dx.doi.org/10.1007/978-3-319-46131-1_8
[21] T. Gueniche, P. Fournier-Viger, and V.S. Tseng, "Compact prediction tree: A lossless model for accurate sequence prediction", International Conference on Advanced Data Mining and Applications, 2013, pp. 177-188.
http://dx.doi.org/10.1007/978-3-642-53917-6_16
[22] M. Zaharia, M. Chowdhury, M. J. Franklin, S. Shenker, and I. Stoica, "Spark: Cluster computing with working sets", HotCloud, vol. 10, no. 10-10, p. 95, 2010.
DISCLAIMER: The above article has been published in Epub (ahead of print) on the basis of the materials provided by the author. The Editorial Department
reserves the right to make minor modifications for further improvement of the manuscript.