Big Data Analytics For Customer Lifetime Value Prediction: Aslekar Avinash, Piyali Sahu, Arunima Pahari
Big Data Analytics For Customer Lifetime Value Prediction: Aslekar Avinash, Piyali Sahu, Arunima Pahari
Abstract
Predicting all the values that a business can derive from its long-standing relationship with its customers is Customer Lifetime
Value. As it helps to create a sustainable relationship with selected customers, the importance of CLV is growing at a brisk pace,
generating higher revenue that in turn enhances business growth. With increasing competition, retaining existing customers is
more profitable than acquiring new customers. To manage and allocate resources efficiently for each and every customer, big
data analytics comes into play. Taking this into account, a large amount of data should be taken into consideration, such as
client's attrition, objectives, diverse products and services that they use, client's characteristics viz demographic, psychographic,
geographic etc. First step to this is data cleaning, pre-processing and data manipulation to achieve a meaningful outcome or
information from the raw data, followed by data analysis and visualization. Techniques that can be recommended for the data
analysis and visualization of CLV model can be Stepwise regression, Classification and regression trees (CART), Generalized
linear models (GLM). To determine the dynamic view of customer behavior, future marketing strategies and to foster brand loyalty,
prediction of a proper CLV model is much needed.
Keywords: CLV, Predictive Models, Pareto/NBD Model, Gamma-Gamma Model, Retail Industry CLV, Purchase Count, Lifetime
Value, Monetary Value
4. Use Case for Retail Industry ∑∑ Prediction of CLV Using Probabilistic Models:
These models (Pareto/NBD model) work on a meth-
A non-contractual relationship exists between businesses od which predicts a customer’s expected purchases
and customers in e-Commerce or retail business. In the in the next period going by their transactional histo-
non-contractual world, churning of customers happens ry. Data should be predicted by the model and tested
silently. So, it is more complex to model customer and cross-validated by machine learning practices
lifetime value for non-contractual businesses and we need to achieve higher model accuracy for predicting
prediction for this. CLV. According to that trained model, the histori-
∑∑ Data to Analyze cal probability of customer being alive is predicted,
o Customer’s last transactions. given a customer’s transaction history. For predict-
ing monetary value, Gamma-Gamma model is ideal,
o Is the customer “dead”, or is the customer alive
since this model considers the economic component
but dormant?
of each transaction and then estimates customer’s
o What is the frequency of repeat purchases made probability to remain “alive” indicating proper cus-
by the customer? tomer’s CLV prediction in retail businesses.
o The age of the customer or the duration between
a customer’s first purchase and the end of the pe- 5. Value Addition of Pareto and
riod (under consideration). Gamma-Gamma Model for
o Recency or the age of the customer when they Predictive CLV Calculation
made their most recent purchases (Thus if only 1 Historical customer lifetime value is the simplest
purchase is made, the Recency is 0.). approach for calculating CLV. The historical methods
∑∑ RFM (Recency, Frequency, Monetary) Analysis: do not account for time. It is solely based on the past
transactions but doesn’t help in predicting the future
RFM analysis is used to quantitatively determine the
activities of the customer. This approach is valid if the
recency of purchase done by the customer, purchas-
customers have same behavioral pattern and over a same
ing frequency of a customer and the expense done time span. Among customers, there can be a good amount
by the customer (monetary) from this, probability of heterogeneity. Historical approach of CLV estimation
of customers who are surely alive, can be predicted. will apply recency of the last purchase, thereby filtering
Customers having very high frequency and very high out the criteria of segregation between the non-active and
recency are likely to be the best customers in future. the active users.
Big Data Analytics for Customer Lifetime Value Prediction 49
The aim of a predictive model is identifying the References
purchasing behavior of customers in order to predict the
Custora, U. (n.d.). Pareto/NBD model. Retrieved from https:
future activities of a customer. In the non-contractual //university.custora.com/for-marketers/clv/advanced/
business context, Pareto model is a frequently applied pareto-nbd
probabilistic model. It focuses on three latent behaviors,
Fader, P. S., & Hardie, B. G. S. (2013). The gamma-gam-
the time period for which the customer is in relationship
ma model of monetary value. Retrieved from http://
with the company or defining customer lifetime behavior www.brucehardie.com/notes/025/gamma_gamma.pdf
and the number of purchases made by the customer over a
Kudyba, S. (2002). Pentium processors improved the pro-
given period or the purchase rate. The lifetime distribution
cessing speed of the computers. Retrieved from https:
follows an exponential distribution with slope μ and the
//www.modernanalytics.com/wp-content/uploads
purchase count follows a Poisson distribution with rate λ.
/2014/07/Chapter-1.pdf
The Pareto model is trained to find out the parameters over
MyCustomer. (n.d.). Customer lifetime value: How on-
a particular training period within a minimum time span
line retailers can measure and use CLV. Retrieved
that corresponds to thrice of the typical inter-purchase
from https://www.mycustomer.com/community/blogs/
time of the customers. Pareto model takes the historic or
silviya-dineva/customer-lifetime-value-how-online-
transaction data as primary input to predict for the future, retailers-can-measure-and-use-clv
thereby helping in business contexts.
Petrison, B., & Wang, P. (1993). During the past cen-
tury, database marketing techniques have become in-
6. Conclusion creasingly important. Retrieved from https://www.
Pareto/NBD model mainly focuses only on modelling scholars.northwestern.edu/en/publications/database
purchase count and lifetime but Gamma Gamma model -marketing-past-present-and-future
provides monetary aspect of the future purchases also. Schmittlein, D. C., Morrison, D. G., & Colombo, R.
Hence, Gamma Gamma model is basically the monetary (1987). Counting your customers: Who are they
value extension to the Pareto/NBD model. The expenditure and what will they do next? Management Science,
of a client is assumed to be independent of the transaction 33(1), 1-24. Retrieved from https://www.jstor.org/
behavior in case of Gamma-Gamma model. Models stable/2631608?seq=1#metadata_info_tab_contents
like Pareto/NBD only predict the transaction behavior, Towards Data Science Pages. (n.d.). Retrieved from https:
independent of the monetary value and this limitation is //towardsdatascience.com/whats-a-customer-worth
being nullified by Gamma-Gamma model. -8daf183f8a4f
Copyright of Telecom Business Review is the property of Publishing India Group and its
content may not be copied or emailed to multiple sites or posted to a listserv without the
copyright holder's express written permission. However, users may print, download, or email
articles for individual use.