0% found this document useful (0 votes)

52 views19 pages

Walkthrough For The Clvtools Package: 1 Prerequisites: Setup The R Environment

This document provides a walkthrough for using the CLVTools package in R to analyze customer lifetime value (CLV) data. It outlines the general workflow, which involves initializing a CLV data object from a transactional dataset, fitting a probabilistic model to the data to estimate parameters, and using the results to predict future customer behavior. Sample apparel retail data is loaded from the package and used to demonstrate initializing the data object, checking summaries, and estimating parameters for the Pareto/NBD model. The document provides guidance on preparing CLV data and estimating the first model to analyze customer purchase patterns.

Uploaded by

gal3fc

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

52 views19 pages

Walkthrough For The Clvtools Package: 1 Prerequisites: Setup The R Environment

Uploaded by

gal3fc

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 19

Walkthrough for the CLVTools Package

1 Prerequisites: Setup the R environment

Install the stable version from CRAN:
install.packages("CLVTools")

Install the development version from GitHub (using the devtools package (Wickham, Hester, and Chang
2019)):
install.packages("devtools")
devtools::install_github("bachmannpatrick/CLVTools", ref = "master")

Load the package

library("CLVTools")

#Apply the CLVTools Package

1.1 General workflow

Independent of the probabilistic model applied in CLVTools, the general workflow consists of three main
steps:
1. Create a clv.data object containing the dataset and required meta-information such as date formats
and column names in the dataset. After initializing the object, there is the option to add additional
information on covariates in a separate step.
2. Fit the model on the data provided.
3. Use the estimated model parameters to predict future customer purchase behavior.
Reporting and plotting results is facilitated by the implementation of well-known generic methods such as
plot(), print() and summary(). These commands adapt their output according to the model state and
may be used at any point of the workflow.

1.2 Load sample data provided in the package

As Input data CLVTools requires customers’ transaction history. Every transaction record consists of a
purchase date and customer ID. Optionally, the price of the transaction may be included to allow for
prediction of future customer spending using an additional Gamma/Gamma model(Fader, Hardie, and Lee
2005b; Colombo and Jiang 1999). Using the full history of transaction data allows for comprehensive plots
and summary statistics, which allow the identification of possible issues prior to model estimation. Data
may be provided as data.frame or data.table (Dowle and Srinivasan 2019).
It is common practice to split time series data into two parts, an estimation and a holdout period. The
model is estimated based on the data from the estimation period while the data from the holdout period
allows to rigorously assess model performance. Once model performance is checked on known data one can
proceed to predict data without a holdout period. The length of the estimation period is heavily dependent
on the characteristics of the analyzed dataset. We recommend to choose an estimation period that contains

1
in minimum the length of the average inter-purchase time. Note that all customers in the dataset need to
purchase at least once during the estimation period, i.e. these models do not account for prospects who have
not yet a purchase record.
Some models included in CLVTools allow to model the impact of covariates. These covariates may explain
heterogeneity among the customers and therefore increase the predictive accuracy of the model. At the same
time we may also identify and quantify the effects of these covariates on customer purchase and customer
attrition. CLVTools distinguishes between time-invariant and time-varying covariates. Time-invariant co-
variates include customer characteristics such as demographics that do not change over time. Time-varying
covariates are allowed to change over time. They include for example direct marketing information or
seasonal patterns.
For the following example, we use simulated data comparable to data from a retailer in the apparel industry.
The dataset contains transactional detail records for every customer consisting of customer id, date of
purchase and the total monetary value of the transaction.The apparel dataset is available in the CLVTools
package. Use the data(apparelTrans) to load it:
data("apparelTrans")
apparelTrans
#> Id Date Price
#> 1: 1 2005-01-03 230.30
#> 2: 10 2005-01-03 84.39
#> 3: 10 2005-02-25 131.07
#> 4: 10 2005-04-05 86.43
#> 5: 100 2005-01-03 11.49
#> ---
#> 2349: 1221 2006-01-23 26.57
#> 2350: 1221 2006-03-09 129.82
#> 2351: 1221 2006-05-14 14.37
#> 2352: 1222 2005-01-03 44.77
#> 2353: 1222 2005-03-03 99.21

1.3 Initialize the CLV-Object

Before we estimate a model, we are required to initialize a data object using the clvdata() command. The
data object contains the prepared transactional data and is later used as input for model fitting. Make sure
to store the generated object in a variable, e.g. in our example clv.apparel.
Be aware that probabilistic models such as the ones implemented in CLVTools are usually applied to specific
customer cohorts. That means, you analyze customer that have joined your company at the same time
(usually same day, week, month, or quarter). For more information on cohort analysis, see also here. Con-
sequently, the data apparelTrans in this example is not the full transaction records of a fashion retailer, but
rather only the customer cohort of 250 customers purchasing for the first time at this business on the day of
2005-01-03. This has to be done before initializing a data object using the clvdata() command.
Through the argument data.transactions a data.frame or data.table which contains the transaction
records, is specified. In our example this is data.transactions=apparelTrans. The argument date.format
is used to indicate the format of the date variable in the data used. The date format in the apparel dataset
is given as “year-month-day” (i.e., “2005-01-03”), therefore we set date.format="ymd". Other combinations
such as date.format="dmy" are possible. See the documentation of lubridate (Grolemund and Wickham
2011) for all details. time.unit is the scale used to measure time between two dates. For this dataset
and in most other cases The argument time.unit="week" is the preferred choice. Abbreviations may be
used (i.e. “w”). estimation.split indicates the length of the estimation period. Either the length of the
estimation period (in previous specified time units) or the date at which the estimation period ends can be
specified. If no value is provided, the whole dataset is used as estimation period (i.e. no holdout period).

2
In this example, we use an estimation period of 40 weeks. Finally, the three name arguments indicate
the column names for customer ID, date and price in the supplied dataset. Note that the price column is
optional.
clv.apparel <- clvdata(apparelTrans,
date.format="ymd",
time.unit = "week",
estimation.split = 40,
name.id = "Id",
name.date = "Date",
name.price = "Price")

1.4 Check the clvdata Object

To get details on the clvdata object, print it to the console.

clv.apparel
#> CLV Transaction Data
#>
#> Call:
#> clvdata(data.transactions = apparelTrans, date.format = "ymd",
#> time.unit = "week", estimation.split = 40, name.id = "Id",
#> name.date = "Date", name.price = "Price")
#>
#> Total # customers 250
#> Total # transactions 2257
#> Spending information TRUE
#>
#>
#> Time unit Weeks
#>
#> Estimation start 2005-01-03
#> Estimation end 2005-10-10
#> Estimation length 40.0000 Weeks
#>
#> Holdout start 2005-10-11
#> Holdout end 2006-07-16
#> Holdout length 39.71429 Weeks

Alternatively the summary() command provides full detailed summary statistics for the provided transac-
tional detail. summary() is available at any step in the process of estimating a probabilistic customer attrition
model with CLVTools. The result output is updated accordingly and additional information is added to the
summary statistics.nobs() extracts the number of observations. For the this particular dataset we observe
a total of 250 customers who made in total 2257 repeat purchases. Approximately 26% of the customers are
zero repeaters, which means that the only a minority of the customers do not return to the store after their
first purchase.
summary(clv.apparel)
#> CLV Transaction Data
#>
#> Time unit Weeks
#> Estimation length 40.0000 Weeks
#> Holdout length 39.71429 Weeks
#>
#> Transaction Data Summary

3
#> Estimation Holdout Total
#> Number of customers - - 250
#> First Transaction in period 2005-01-03 2005-10-11 2005-01-03
#> Last Transaction in period 2005-10-10 2006-07-16 2006-07-16
#> Total # Transactions 1311 946 2257
#> Mean # Transactions per cust 5.244 8.226 9.028
#> (SD) 6.082 8.934 12.603
#> Mean Spending per Transaction 39.436 38.519 39.051
#> (SD) 42.649 59.723 50.503
#> Total Spending 51700.490 36438.640 88139.130
#> Total # zero repeaters 77 - -
#> Percentage of zero repeaters 30.800 - -
#> Mean Interpurchase time 7.361 5.756 9.462
#> (SD) 6.791 6.394 12.266

1.5 Estimate Model Parameters

After initializing the object, we are able to estimate the first probabilistic latent attrition model. We start
with the standard Pareto/NBD model (Schmittlein, Morrison, and Colombo 1987) and therefore use the
command pnbd() to fit the model and estimate model parameters. clv.data specifies the initialized object
prepared in the last step. Optionally, starting values for the model parameters and control settings for
the optimization algorithm may be provided: The argument start.params.model allows to assign a vector
(e.g. c(alpha=1, beta=2, s=1, beta=2) in the case of the Pareto/NBD model) of starting values for the
optimization. This is useful if prior knowledge on the parameters of the distributions are available. By
default starting values are set to 1 for all parameters. The argument optimx.args provides an option to
control settings for the optimization routine. It passes a list of arguments to the optimizer. All options
known from the package optimx (Nash and Varadhan 2011; Nash 2014) may be used. This option enables
users to specify specific optimization algorithms, set upper and/or lower limits or enable tracing information
on the progress of the optimization. In the case of the standard Pareto/NBD model, CLVTools uses by
default the optimization method L-BFGS-G (Byrd et al. 1995). If the result of the optimization is in-feasible,
the optimization automatically switches to the more robust but often slower Nelder-Mead method (Nelder
and Mead 1965). verbose shows additional output.
est.pnbd <- pnbd(clv.data = clv.apparel)
#> Starting estimation...
#> Estimation finished!
est.pnbd
#> Pareto NBD Standard Model
#>
#> Call:
#> pnbd(clv.data = clv.apparel)
#>
#> Coefficients:
#> r alpha s beta
#> 0.7866 5.3349 0.3570 11.6152
#> KKT1: TRUE
#> KKT2: TRUE
#>
#> Used Options:
#> Correlation: FALSE

If we assign starting parameters and additional arguments for the optimizer we use:

4
est.pnbd <- pnbd(clv.data = clv.apparel,
start.params.model = c(r=1, alpha = 2, s = 1, beta = 2),
optimx.args = list(control=list(trace=5),
method="Nelder-Mead"
))

Parameter estimates may be reported by either printing the estimated object (i.e. est.pnbd) directly in
the console or by calling summary(est.pnbd) to get a more detailed report including the likelihood value
as well as AIC and BIC. Alternatively parameters may be directly extracted using coef(est.pnbd). Also
loglik(), confint() and vcov() are available to directly access the Loglikelihood value, confidence intervals
for the parameters and to calculate the Variance-Covariance Matrix for the fitted model. For the standard
Pareto/NBD model, we get 4 parameters 𝑟, 𝛼, 𝑠 and 𝛽. where 𝑟, 𝛼 represent the shape and scale parameter
of the gamma distribution that determines the purchase rate and 𝑠, 𝛽 of the attrition rate across individual
customers. 𝑟/𝛼 can be interpreted as the mean purchase and 𝑠/𝛽 as the mean attrition rate. A significance
level is provided for each parameter estimates. In the case of the apparelTrans dataset we observe a an
average purchase rate of 𝑟/𝛼 = 0.147 transactions and an average attrition rate of 𝑠/𝛽 = 0.031 per customer
per week. KKT 1 and 2 indicate the Karush-Kuhn-Tucker optimality conditions of the first and second order
(Kuhn and Tucker 1951). If those criteria are not met, the optimizer has probably not arrived at an optimal
solution. If this is the case it is usually a good idea to rerun the estimation using alternative starting values.
#Full detailed summary of the parameter estimates
summary(est.pnbd)
#> Pareto NBD Standard Model
#>
#> Call:
#> pnbd(clv.data = clv.apparel)
#>
#> Fitting period:
#> Estimation start 2005-01-03
#> Estimation end 2005-10-10
#> Estimation length 40.0000 Weeks
#>
#> Coefficients:
#> Estimate Std. Error z-val Pr(>|z|)
#> r 0.7866 0.1324 5.942 2.81e-09 ***
#> alpha 5.3349 0.9027 5.910 3.42e-09 ***
#> s 0.3570 0.1838 1.943 0.0521 .
#> beta 11.6152 10.6598 1.090 0.2759
#> ---
#> Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
#>
#> Optimization info:
#> LL -2879.4699
#> AIC 5766.9399
#> BIC 5781.0257
#> KKT 1 TRUE
#> KKT 2 TRUE
#> fevals 23.0000
#> Method L-BFGS-B
#>
#> Used Options:
#> Correlation FALSE

#Extract the coefficients only

5
coef(est.pnbd)
#> r alpha s beta
#> 0.7865982 5.3349363 0.3570472 11.6151630
#Alternative: oefficients(est.pnbd.obj)

To extract only the coefficients, we can use coef(). To access the confidence intervals for all parameters
confint() is available.
#Extract the coefficients only
coef(est.pnbd)
#> r alpha s beta
#> 0.7865982 5.3349363 0.3570472 11.6151630
#Alternative: oefficients(est.pnbd.obj)

#Extract the confidence intervals

confint(est.pnbd)
#> 2.5 % 97.5 %
#> r 0.527143413 1.0460530
#> alpha 3.565676736 7.1041959
#> s -0.003172205 0.7172665
#> beta -9.277581357 32.5079074

In order to get the Likelihood value and the corresponding Variance-Covariance Matrix we use the following
commands:
# LogLikelihood at maximum
logLik(est.pnbd)
#> 'log Lik.' -2879.47 (df=4)

# Variance-Covariance Matrix at maximum

vcov(est.pnbd)
#> r alpha s beta
#> r 0.01752376 0.10255385 -0.00561020 -0.5538978
#> alpha 0.10255385 0.81486739 -0.02386251 -2.4623808
#> s -0.00561020 -0.02386251 0.03377831 1.8512345
#> beta -0.55389777 -2.46238080 1.85123446 113.6304689

As an alternative to the Pareto/NBD model CLVTools features the BG/NBD model (Fader, Hardie, and Lee
2005a) and the GGomp/NBD (Bemmaor and Glady 2012). To use the alternative models replace pnbd() by
the corresponding model-command. Note that he naming and number of model parameters is dependent on
the model. Consult the manual for more details on the individual models. Beside probabilistic latent attrition
models, CLVTools also features the Gamma/Gamma model (Colombo and Jiang 1999; Fader, Hardie, and
Lee 2005a) which is used to predict customer spending. See section Customer Spending for details on the
spending model.

Command Model Covariates Type

pnbd() Pareto/NBD time-invariant & time-varying latent attrition model
bgnbd() BG/NBD time-invariant latent attrition model
ggomnbd() GGom/NBD time-invariant latent attrition model
gg() Gamma/Gamma - spending model

To estimate the GGom/NBD model we apply the ggomnbd()to the clv.apparel object. The GGom/NBD
model is more flexible than the Pareto/NBD model, however it sometimes is challenging to optimize. Note
that in this particular case providing start parameters is essential to arrive at an optimal solution (i.e. kkt1:

6
TRUE and kkt2: TRUE).
est.ggomnbd <- ggomnbd(clv.data = clv.apparel,
start.params.model = c(r=0.7, alpha=5, b=0.005, s=0.02, beta=0.001),
optimx.args = list(control=list(trace=5),
method="Nelder-Mead"))

1.6 Predict Customer Behavior

Once the model parameters are estimated, we are able to predict future customer behavior on an individual
level. To do so, we use predict() on the object with the estimated parameters (i.e. est.pnbd). The
prediction period may be varied by specifying prediction.end. It is possible to provide either an end-date
or a duration using the same time unit as specified when initializing the object (i.e prediction.end =
"2006-05-08" or prediction.end = 30). By default, the prediction is made until the end of the dataset
specified in the clvdata() command. The argument continuous.discount.factor allows to adjust the
discount rate used to estimated the discounted expected transactions (DERT). The default value is 0.1
(=10%). Probabilistic customer attrition model predict in general three expected characteristics for every
customer:
• “conditional expected transactions” (CET), which is the number of transactions to expect form a
customer during the prediction period,
• “probability of a customer being alive” (PAlive) at the end of the estimation period and
• “discounted expected residual transactions” (DERT) for every customer, which is the total number of
transactions for the residual lifetime of a customer discounted to the end of the estimation period.
If spending information was provided when initializing the clvdata-object, CLVTools provides prediction
for
• predicted mean spending estimated by a Gamma/Gamma model (Colombo and Jiang 1999; Fader,
Hardie, and Lee 2005a) and
• the customer lifetime value (CLV). CLV is calculated as the product of DERT and predicted spending.
If a holdout period is available additionally the true numbers of transactions (“actual.x”) and true spending
(“actual.total.spending”) during the holdout period are reported.
To use the parameter estimates on new data (e.g., an other customer cohort), the argument newdata option-
ally allows to provide a new clvdata object.
results <- predict(est.pnbd)
#> Predicting from 2005-10-11 until (incl.) 2006-07-16 (39.86 Weeks).
#> Estimating gg model to predict spending...
#> Starting estimation...
#> Estimation finished!
print(results)
#> Id period.first period.last period.length actual.x actual.total.spending
#> 1: 1 2005-10-11 2006-07-16 39.85714 0 0.00
#> 2: 10 2005-10-11 2006-07-16 39.85714 0 0.00
#> 3: 100 2005-10-11 2006-07-16 39.85714 23 737.53
#> 4: 1000 2005-10-11 2006-07-16 39.85714 23 1069.91
#> 5: 1001 2005-10-11 2006-07-16 39.85714 11 364.00
#> ---
#> 246: 1219 2005-10-11 2006-07-16 39.85714 14 413.76
#> 247: 122 2005-10-11 2006-07-16 39.85714 0 0.00
#> 248: 1220 2005-10-11 2006-07-16 39.85714 0 0.00
#> 249: 1221 2005-10-11 2006-07-16 39.85714 9 302.65
#> 250: 1222 2005-10-11 2006-07-16 39.85714 0 0.00

7
#> PAlive CET DERT predicted.mean.spending predicted.CLV
#> 1: 0.3571791 0.2212506 0.05848859 39.95483 2.336902
#> 2: 0.4225636 0.9272819 0.24513121 55.23031 13.538672
#> 3: 0.9155479 13.5448630 3.58064629 43.57390 156.022721
#> 4: 0.9967780 13.1766970 3.48331993 41.60921 144.938180
#> 5: 0.5098134 3.5275846 0.93253307 45.58153 42.506281
#> ---
#> 246: 0.9579241 3.6108002 0.95453149 33.58728 32.060115
#> 247: 0.3571791 0.2212506 0.05848859 39.95483 2.336902
#> 248: 0.3571791 0.2212506 0.05848859 39.95483 2.336902
#> 249: 0.9434302 4.2991096 1.13648922 34.28958 38.969738
#> 250: 0.4136156 0.5819279 0.15383529 47.35500 7.284870

To change the duration of the prediction time, we use the predicton.end argument. We can either provide
a time period (30 weeks in this example):
predict(est.pnbd, prediction.end = 30)

or provide a date indication the end of the prediction period:

predict(est.pnbd, prediction.end = "2006-05-08")

1.7 Plotting

CLVTools, offers a variety of different plots. All clvdata objects may be plotted using the plot() command.
Similar to summary(), the output of plot() and the corresponding options are dependent on the current
modeling step. When applied to a data object created the clvdata() command, the following plots can be
selected using the which option of plotting:
• Tracking plot (which="tracking"): plots the the aggregated repeat transactions per period over a
given time period. The period can be specified using the prediction.end option. It is also possible
to generate cumulative tracking plots (cumulative = FALSE). The tracking plot is the default option.
• Frequency plot (which="frequency"): plots the distribution of transactions or repeat transactions per
customer, after aggregating transactions of the same customer on a single time point. The bins may be
adjusted using the option trans.bins. (Note that if trans.bins is changed, the option for labeling
(label.remaining) usually needs to be adapted as well.)
• Spending plot (which="spending"): plots the empirical density of either customer’s average spending
per transaction. Note that this includes all transactions and not only repeat-transactions. You can
switch to plotting the value of every transaction for a customer (instead of the a customers mean
spending) using mean.spending=FALSE.
• Interpurchase time plot (which="interpurchasetime"): plots the empirical density of customer’s
mean time (in number of periods) between transactions, after aggregating transactions of the same
customer on a single time point. Note that customers without repeat-transactions are note part of this
plot.
In the following, we have a basic tracking-plot for the aggregated repeat transactions
plot(clv.apparel)
#> Plotting from 2005-01-03 until 2006-07-16.

8
Weekly tracking plot
Estimation end: 2005−10−10

50
Number of Repeat Transactions

2005−01 2005−07 2006−01 2006−07

Date
To plot customers mean interpurchase time, we use:
plot(clv.apparel, which="interpurchasetime")

Density of Customer's Mean Time between Transactions

0.100

0.075
Density

0.050

0.025

0.000

0 10 20 30
Mean Interpurchase Time (Weeks)
When the plot() command is applied to an object with the an estimated model (i.e. est.pnbd), the
following plots can be selected using the which option of:
• Tracking plot (which="tracking"): plots the actual repeat transactions and overlays it with the repeat
transaction as predicted by the fitted model. Currently, following previous literature, the in-sample
unconditional expectation is plotted in the holdout period. The period can be specified using the
prediction.end option. It is also possible to generate cumulative tracking plots (cumulative =
FALSE). The tracking plot is th the default option. The argument transactions disable for plot-
ting actual transactions (transactions=FALSE). For further plotting options see the documentation.
Note that only whole periods can be plotted and that the prediction end might not exactly match
prediction.end. See the ?plot.clv.data for more details.
• Probability mass function (pmf) plot (which="pmf"): plots the actual and expected number of cus-
tomers which made a given number of repeat transaction in the estimation period. The expected

9
number is based on the PMF of the fitted model, the probability to make exactly a given number
of repeat transactions in the estimation period. For each bin, the expected number is the sum of
all customers’ individual PMF value. The bins for the transactions can be adjusted using the option
trans.bins. (Note that if trans.bins is changed, label.remaining usually needs to be adapted as
well.
For a standard tracking plot including the model, we use:
plot(est.pnbd)
#> Plotting from 2005-01-03 until 2006-07-16.

Weekly tracking plot

Estimation end: 2005−10−10

50
Number of Repeat Transactions

0
2005−01 2005−07 2006−01 2006−07
Date

Legend Actual Pareto NBD Standard

To plot the cumulative expected transactions 30 time units (30 weeks in this example) ahead of the end of
the estimation plot, we use:
plot(est.pnbd, prediction.end = 30, cumulative = TRUE)

Alternatively, it is possible to specify a date for the prediction.endargument. Note that dates are rounded
to the next full time unit (i.e. week):
plot(est.pnbd, prediction.end = "2006-05-08", cumulative = TRUE)

For a plot of the probability mass function (pmf), with 7 bins, we use:
plot(est.pnbd, which="pmf", trans.bins=0:5, label.remaining="6+")

1.8 Covariates

CLVTools provides the option to include covariates into probabilistic customer attrition models. Covariates
may affect the purchase or the attrition process, or both. It is also possible to include different covariates
for the two processes. However, support for covariates is dependent on the model. Not all implemented
models provide the option for covariates. In general, CLVTools distinguishes between two types of covariates:
time-invariant and time-varying. The former include factors that do not change over time such as customer
demographics or customer acquisition information. The latter may change over time and include marketing
activities or seasonal patterns.
Data for time-invariant covariates must contain a unique customer ID and a single value for each covariate.
It should be supplied as a data.frame or data.table. In the example of the apparel retailer we use
demographic information “gender” as time-invariant and information on the acquisition channel as covariate

10
for both, the purchase and the attrition process. Use the data("apparelStaticCov") command to load the
time-invariant covariates. In this example gender is coded as a dummy variable with male=0 and female=1
and channel with online=0 and offline=1.
data("apparelStaticCov")
apparelStaticCov
#> Id Gender Channel
#> 1: 1 0 0
#> 2: 10 0 0
#> 3: 100 1 0
#> 4: 1000 1 1
#> 5: 1001 1 0
#> ---
#> 246: 1219 0 1
#> 247: 122 0 0
#> 248: 1220 0 0
#> 249: 1221 1 1
#> 250: 1222 1 0

Data for time-varying covariates requires a time-series of covariate values for every customer. I.e. if the time-
varying covariates are allowed to change every week, a value for every customer for every week is required.
Note that all contextual factors are required to use the same time intervals for the time-series. In the
example of the apparel retailer we use information on direct marketing (Marekting) as time-varying covariate.
Additionally, we add gender as time-invariant contextual factors. Note that the data structure of invariant
covariates needs to be aligned with the structure of time-varying covariate. Use data("apparelDynCov")
command to load
data("apparelDynCov")
apparelDynCov
#> Id Cov.Date Marketing Gender Channel
#> 1: 1 2004-12-26 1 0 0
#> 2: 1 2005-01-02 1 0 0
#> 3: 1 2005-01-09 0 0 0
#> 4: 1 2005-01-16 1 0 0
#> 5: 1 2005-01-23 2 0 0
#> ---
#> 20496: 1222 2006-06-18 0 1 0
#> 20497: 1222 2006-06-25 0 1 0
#> 20498: 1222 2006-07-02 0 1 0
#> 20499: 1222 2006-07-09 0 1 0
#> 20500: 1222 2006-07-16 0 1 0

To add the covariates to an initialized clvdata object the commands SetStaticCovariates() and
SetDynamicCovariates() are available. The two commands are mutually exclusive. The argument
clv.data specifies the initialized object and the argument data.cov.life respectively data.cov.trans
specifies the data source for the covariates for the attrition and the purchase process. Covariates are added
separately for the purchase and the attrition process. Therefore if a covariate should affect both processes it
has to be added in both arguments: data.cov.life and data.cov.trans. The arguments names.cov.life
and names.cov.trans specify the column names of the covariates for the two processes. In our example, we
use the same covariates for both processes. Accordingly, we specify the time-invariant covariates “Gender”
and “Channel” as follows:
clv.static<- SetStaticCovariates(clv.data = clv.apparel,
data.cov.life = apparelStaticCov,
data.cov.trans = apparelStaticCov,
names.cov.life = c("Gender", "Channel"),

11
names.cov.trans =c("Gender", "Channel"),
name.id = "Id")

To specify the time-varying contextual factors for seasonal patterns and direct marketing, we use the follow-
ing:
clv.dyn <- SetDynamicCovariates(clv.data = clv.apparel,
data.cov.life = apparelDynCov,
data.cov.trans = apparelDynCov,
names.cov.life = c("Marketing", "Gender", "Channel"),
names.cov.trans = c("Marketing", "Gender", "Channel"),
name.id = "Id",
name.date = "Cov.Date")

In order to include time-invariant covariates in a time-varying model, they may be recoded as a time-varying
covariate with a constant value in every time period.
Once the covariates are added to the model the estimation process is almost identical to the standard model
without covariates. The only difference is that the provided object now data for contains either time-invariant
or time-varying covariates and the option to define start parameters for the covariates of both processes using
the arguments start.params.life and start.params.trans. If not set, the staring values are set to 1.
To define starting parameters for the covariates, the name of the corresponding factor has to be used. For
example in the case of time-invariant covariates:
est.pnbd.static <- pnbd(clv.static,
start.params.model = c(r=1, alpha = 2, s = 1, beta = 2),
start.params.life = c(Gender=0.6, Channel=0.4),
start.params.trans = c(Gender=0.6, Channel=0.4))
#> Starting estimation...
#> Estimation finished!

Analogously, we can estimate the model containing time-varying covariates.

est.pnbd.dyn <- pnbd(clv.dyn,
start.params.model = c(r=1, alpha = 2, s = 1, beta = 2),
start.params.life = c(Marketing=0.5, Gender=0.6, Channel=0.4),
start.params.trans = c(Marketing=0.5, Gender=0.6, Channel=0.4))

To inspect the estimated model we use summary(), however all other commands such as print(), coef(),
loglike(), confint() and vcov() are also available. Now, output contains also parameters for the covari-
ates for both processes. Since covariates are added separately for the purchase and the attrition process,
there are also separate model parameters for the two processes. These parameters are directly interpretable
as rate elasticity of the corresponding factors: A 1% change in a contextual factor XP or XL changes the
purchase or the attrition rate by 𝛾𝑝𝑢𝑟𝑐ℎ XP or 𝛾𝑙𝑖𝑓𝑒 XL percent, respectively (Gupta 1991). In the example of
the apparel retailer, we observe that female customer purchase significantly more (trans.Gender=1.42576).
Note, that female customers are coded as 1, male customers as 0. Also customers acquired offline (coded
as Channel=1), purchase more (trans.Channel=0.40304) and stay longer (life.Channel=0.9343). Make
sure to check the Karush-Kuhn-Tucker optimality conditions of the first and second order (Kuhn and Tucker
1951) (KKT1 and KKT1) before interpreting the parameters. If those criteria are not met, the optimizer
has probably not arrived at an optimal solution. If this is the case it is usually a good idea to rerun the
estimation using alternative starting values.
summary(est.pnbd.static)
#> Pareto NBD with Static Covariates Model
#>
#> Call:
#> pnbd(clv.data = clv.static, start.params.model = c(r = 1, alpha = 2,

12
#> s = 1, beta = 2), start.params.life = c(Gender = 0.6, Channel = 0.4),
#> start.params.trans = c(Gender = 0.6, Channel = 0.4))
#>
#> Fitting period:
#> Estimation start 2005-01-03
#> Estimation end 2005-10-10
#> Estimation length 40.0000 Weeks
#>
#> Coefficients:
#> Estimate Std. Error z-val Pr(>|z|)
#> r 1.41800 0.27733 5.113 3.17e-07 ***
#> alpha 35.62069 8.58072 4.151 3.31e-05 ***
#> s 0.27258 0.09512 2.866 0.00416 **
#> beta 8.63265 11.26299 0.766 0.44340
#> life.Gender 1.53314 1.09655 1.398 0.16207
#> life.Channel -1.70528 0.66153 -2.578 0.00994 **
#> trans.Gender 1.42366 0.19764 7.203 5.88e-13 ***
#> trans.Channel 0.40225 0.15123 2.660 0.00782 **
#> ---
#> Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
#>
#> Optimization info:
#> LL -2846.1677
#> AIC 5708.3355
#> BIC 5736.5071
#> KKT 1 TRUE
#> KKT 2 TRUE
#> fevals 52.0000
#> Method L-BFGS-B
#>
#> Used Options:
#> Correlation FALSE
#> Regularization FALSE
#> Constraint covs FALSE

To predict future customer behavior we use predict(). Note that dependent on the model, the predicted
metrics may differ. For example, in the case of the Pareto/NBD model with time-varying covariates, in-
stead of DERT, DECT is predicted. DECT only covers a finite time horizon in contrast to DERT. Time-
varying covariates must be provided for the entire prediction period. If the data initially provided in the
SetDynamicCovariates() command does not cover the complete prediction period, the argument new.data
offers the ability to supply new data for the time-varying covariates in the from of a clvdata object.

1.9 Add Correlation to the model

To relax the assumption of independence between the purchase and the attrition process, CLVTools pro-
vides the option to specify the argument use.cor in the command t fit the model (i.e. pnbd). In case of
use.cor=TRUE, a Sarmanov approach is used to correlate the two processes. start.param.cor allows to
optionally specify a starting value for the correlation parameter.
est.pnbd.cor <- pnbd(clv.apparel,
use.cor= TRUE)
summary(est.pnbd.cor)

The parameter Cor(life,trans) is added to the parameter estimates that may be directly interpreted as

13
a correlation. In the example of the apparel retailer the correlation parameter is not significant and the
correlation is very close to zero, indicating that the purchase and the attrition process are independent.

1.10 Advanced Options for Contextual Factors

CLVTools provides two additional estimation options for models containing covariates (time-invariant or
time-varying): regularization and constraints for the parameters of the covariates. Both options are included
in the command to fit the model (i.e., pnbd(). Support for this option is dependent on the model. They
may be used simultaneously.
• The argument reg.lambdas provides the possibility to specify separate \lambda_{reg} for the two
processes (i.e. reg.lambdas = c(trans=100, life=100). The larger the \lambda_{reg} the stronger
the effects of the regularization. Regularization only affects the parameters of the covariates.
• The argument names.cov.constr implements equality constraints for contextual factors with regards
to the two processes. For example the variable “gender” is forced to have the same effect on the
purchase as well as on the attrition process. To do so, the option names.cov.constr is available
(i.e. names.cov.constr=c("Gender")). To provide starting parameters for the constrained variable
use start.params.constr.
To enable regularization for the covariates, we use the following command:
est.pnbd.reg <- pnbd(clv.static,
start.params.model = c(r=1, alpha = 2, s = 1, beta = 2),
reg.lambdas = c(trans=100, life=100))
#> Starting estimation...
#> Estimation finished!
summary(est.pnbd.reg)
#> Pareto NBD with Static Covariates Model
#>
#> Call:
#> pnbd(clv.data = clv.static, start.params.model = c(r = 1, alpha = 2,
#> s = 1, beta = 2), reg.lambdas = c(trans = 100, life = 100))
#>
#> Fitting period:
#> Estimation start 2005-01-03
#> Estimation end 2005-10-10
#> Estimation length 40.0000 Weeks
#>
#> Coefficients:
#> Estimate Std. Error z-val Pr(>|z|)
#> r 7.871e-01 2.093e+00 0.376 0.707
#> alpha 5.343e+00 1.430e+01 0.374 0.709
#> s 3.468e-01 2.734e+00 0.127 0.899
#> beta 1.111e+01 1.576e+02 0.071 0.944
#> life.Gender -2.514e-05 1.576e-02 -0.002 0.999
#> life.Channel -1.162e-04 1.576e-02 -0.007 0.994
#> trans.Gender 5.362e-04 1.576e-02 0.034 0.973
#> trans.Channel 4.422e-04 1.576e-02 0.028 0.978
#>
#> Optimization info:
#> LL -11.5178
#> AIC 39.0357
#> BIC 67.2074
#> KKT 1 TRUE

14
#> KKT 2 TRUE
#> fevals 158.0000
#> Method L-BFGS-B
#>
#> Used Options:
#> Correlation FALSE
#> Regularization TRUE
#> lambda.life 100.0000
#> lambda.trans 100.0000
#> Constraint covs FALSE

To constrain “Gender” to have the same effect on both processes we use the following command. Note,
that the output now only contains one parameter for “Gender” as it is constrained to be the same for both
processes.
est.pnbd.constr <- pnbd(clv.static,
start.params.model = c(r=1, alpha = 2, s = 1, beta = 2),
start.params.constr = c(Gender=0.6),
names.cov.constr=c("Gender"))
#> Starting estimation...
#> Estimation finished!
summary(est.pnbd.constr)
#> Pareto NBD with Static Covariates Model
#>
#> Call:
#> pnbd(clv.data = clv.static, start.params.model = c(r = 1, alpha = 2,
#> s = 1, beta = 2), names.cov.constr = c("Gender"), start.params.constr = c(Gender = 0.6))
#>
#> Fitting period:
#> Estimation start 2005-01-03
#> Estimation end 2005-10-10
#> Estimation length 40.0000 Weeks
#>
#> Coefficients:
#> Estimate Std. Error z-val Pr(>|z|)
#> r 1.42491 0.27552 5.172 2.32e-07 ***
#> alpha 35.43796 8.37521 4.231 2.32e-05 ***
#> s 0.27249 0.09451 2.883 0.00393 **
#> beta 7.72441 7.53594 1.025 0.30536
#> life.Channel -1.69866 0.64451 -2.636 0.00840 **
#> trans.Channel 0.40107 0.15098 2.656 0.00790 **
#> constr.Gender 1.41494 0.18404 7.688 1.49e-14 ***
#> ---
#> Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
#>
#> Optimization info:
#> LL -2846.1731
#> AIC 5706.3462
#> BIC 5730.9964
#> KKT 1 TRUE
#> KKT 2 TRUE
#> fevals 28.0000
#> Method L-BFGS-B
#>

15
#> Used Options:
#> Correlation FALSE
#> Regularization FALSE
#> Constraint covs TRUE
#> Constraint params Gender

2 Customer Spending
Customer lifetime value (CLV) is composed of three components of every customer: the future level of
transactions, expected attrition behaviour (i.e. probability of being alive) and the monetary value. While
probabilistic latent attrition models provide metrics for the first two components, they do not predict cus-
tomer spending. To predict customer spending an additional model is required. The CLVToolspackage
features the Gamma/Gamma (G/G) (Fader, Hardie, and Lee 2005b; Colombo and Jiang 1999) model for
predicting customer spending. For convenience, the predict() command allows to automatically predict
customer spending for all latent attrition models using the option predict.spending=TRUE (see section Cus-
tomer Spending). However, to provide more options and more granular insights the Gamma/Gamma model
can be estimated independently. In the following, we discuss how to estimate a Gamma/Gamma model
using CLVTools.
The general workflow remains identical. It consists of the three main steps: (1) creating a clv.data object
containing the dataset and required meta-information, (2) fitting the model on the provided data and (3)
predicting future customer purchase behavior based on the fitted model.
Reporting and plotting results is facilitated by the implementation of well-known generic methods such as
plot(), print() and summary().

2.1 Load sample data provided in the package

For estimating customer spending CLVTools requires customers’ transaction history including price. Every
transaction record consists of a purchase date,customer ID and the price of the transaction. Data may be
provided as data.frame or data.table (Dowle and Srinivasan 2019). Currently, the Gamma/Gamma model
does not allow for covariates.
We use again simulated data comparable to data from a retailer in the apparel industry. The apparel dataset
is available in the CLVTools package. We use the data(apparelTrans) to load it and initialize a data object
using the clvdata() command. For details see section Initialize the CLV-Object.
data("apparelTrans")
apparelTrans
#> Id Date Price
#> 1: 1 2005-01-03 230.30
#> 2: 10 2005-01-03 84.39
#> 3: 10 2005-02-25 131.07
#> 4: 10 2005-04-05 86.43
#> 5: 100 2005-01-03 11.49
#> ---
#> 2349: 1221 2006-01-23 26.57
#> 2350: 1221 2006-03-09 129.82
#> 2351: 1221 2006-05-14 14.37
#> 2352: 1222 2005-01-03 44.77
#> 2353: 1222 2005-03-03 99.21

clv.apparel <- clvdata(apparelTrans,

16
date.format="ymd",
time.unit = "week",
estimation.split = 40,
name.id = "Id",
name.date = "Date",
name.price = "Price")

2.2 Estimate Model Parameters

To estimate the Gamma/Gamma spending model, we use the command gg() on the initialized clvdata
object. clv.data specifies the initialized object prepared in the last step. Optionally, starting values for
the model parameters and control settings for the optimization algorithm may be provided: The argument
start.params.model allows to assign a vector of starting values for the optimization (i.e c(p=1, q=2,
gamma=1) for the the Gamma/Gamma model). This is useful if prior knowledge on the parameters of
the distributions are available. By default starting values are set to 1 for all parameters. The argument
optimx.args provides an option to control settings for the optimization routine (see section Estimate Model
Parameters).
est.gg<- gg(clv.data = clv.apparel)
#> Starting estimation...
#> Estimation finished!
est.gg
#> Gamma-Gamma Model
#>
#> Call:
#> gg(clv.data = clv.apparel)
#>
#> Coefficients:
#> p q gamma
#> 2.305 17.148 279.974
#> KKT1: TRUE
#> KKT2: TRUE

In line with literature, CLVTools does not use by default the monetary value of the first transaction to fit
the model since it might be atypical of future purchases. If the first transaction should be considered the
argument remove.first.transaction can be set to FALSE.
est.gg<- gg(clv.data = clv.apparel, remove.first.transaction=FALSE)
#> Starting estimation...
#> Estimation finished!
est.gg
#> Gamma-Gamma Model
#>
#> Call:
#> gg(clv.data = clv.apparel, remove.first.transaction = FALSE)
#>
#> Coefficients:
#> p q gamma
#> 2.47 15.45 227.09
#> KKT1: TRUE
#> KKT2: TRUE

17
2.3 Predict Customer Spending

Once the model parameters are estimated, we are able to predict future customer mean spending on an
individual level. To do so, we use predict() on the object with the estimated parameters (i.e. est.gg).
Note that there is no need to specify a prediction period as we predict mean spending.
In general, probabilistic spending models predict the following expected characteristic for every customer:
• predicted mean spending (“predicted.mean.spending”)
If a holdout period is available additionally the true mean spending (“actual.mean.spending”) during the
holdout period is reported.
To use the parameter estimates on new data (e.g., an other customer cohort), the argument newdata option-
ally allows to provide a new clvdata object.
results.spending <- predict(est.gg)
print(results.spending)
#> Id actual.mean.spending predicted.mean.spending
#> 1: 1 0.00000 66.78361
#> 2: 10 0.00000 59.77959
#> 3: 100 32.06652 42.18584
#> 4: 1000 46.51783 40.32674
#> 5: 1001 33.09091 44.89572
#> ---
#> 246: 1219 29.55429 30.40038
#> 247: 122 0.00000 35.19998
#> 248: 1220 0.00000 37.77123
#> 249: 1221 33.62778 31.27807
#> 250: 1222 0.00000 47.27699

2.4 Plot Spendings

an estimated spending model object (i.e. est.gg) may be plotted using the plot() command. The
plot provides a comparison of the estimated and actual density of customer spending. The argument
plot.interpolation.points allows to adjust the number of interpolation points in density graph.
plot(est.gg)

18
Density of Average Transaction Value

0.02
Density

0.01

0.00

0 50 100 150 200

Average Value per Transaction

Legend Actual Gamma−Gamma

Literature
Bemmaor, Albert. C., and Nicolas Glady. 2012. “Modeling Purchasing Behavior with Sudden ”Death”: A
Flexible Customer Lifetime Model.” Management Science 58 (5): 1012–21.
Byrd, Richard H, Peihuang Lu, Jorge Nocedal, and Ciyou Zhu. 1995. “A Limited Memory Algorithm for
Bound Constrained Optimization.” SIAM Journal on Scientific Computing 16 (5): 1190–1208.
Colombo, Richard, and Weina Jiang. 1999. “A stochastic RFM model.” Journal of Interactive Marketing
13 (3): 2–12.
Dowle, Matt, and Arun Srinivasan. 2019. Data.table: Extension of ’Data.frame’. https://CRAN.R-project.
org/package=data.table.
Fader, Peter S., Bruce G. S. Hardie, and KL Lee. 2005a. “’Counting Your Customers’ the Easy Way: An
Alternative to the Pareto/NBD Model.” Marketing Science 24 (2): 275–84.
———. 2005b. “RFM and CLV: Using Iso-Value Curves for Customer Base Analysis.” Journal of Marketing
Research 42 (4): 415–30.
Grolemund, Garrett, and Hadley Wickham. 2011. “Dates and Times Made Easy with lubridate.” Journal
of Statistical Software 40 (3): 1–25. https://www.jstatsoft.org/article/view/v040i03.
Gupta, Sunil. 1991. “Stochastic Models of Interpurchase Time with Time-Dependent Covariates.” Journal
of Marketing Research 28 (1): 1–15.
Kuhn, H. W., and A. W. Tucker. 1951. “Nonlinear Programming.” In Second Berkeley Symposium on
Mathematical Statistics and Probability, edited by J. Neyman, 481–92.
Nash, John C. 2014. “On Best Practice Optimization Methods in R.” Journal of Statistical Software 60 (2):
1–14. https://www.jstatsoft.org/article/view/v060i02.
Nash, John C., and Ravi Varadhan. 2011. “Unifying Optimization Algorithms to Aid Software System
Users: optimx for R.” Journal of Statistical Software 43 (9): 1–14. https://www.jstatsoft.org/article/
view/v043i09.
Nelder, John A, and Roger Mead. 1965. “A Simplex Method for Function Minimization.” The Computer
Journal 7 (4): 308–13.
Schmittlein, David C., Donald G. Morrison, and Richard Colombo. 1987. “Counting Your Customers:
Who-Are They and What Will They Do Next?” Management Science 33 (1): 1–24.
Wickham, Hadley, Jim Hester, and Winston Chang. 2019. Devtools: Tools to Make Developing r Packages
Easier. https://CRAN.R-project.org/package=devtools.

Store-Sales-Ts-Forecasting-A-Comprehensive-Guide - Jupyter Notebook
No ratings yet
Store-Sales-Ts-Forecasting-A-Comprehensive-Guide - Jupyter Notebook
26 pages
BTYD Walkthrough
No ratings yet
BTYD Walkthrough
39 pages
Customer Lifetime Value Customer Analytics With R
No ratings yet
Customer Lifetime Value Customer Analytics With R
7 pages
Experiences With Using R in Credit Risk
No ratings yet
Experiences With Using R in Credit Risk
40 pages
BTYD Walkthrough
No ratings yet
BTYD Walkthrough
39 pages
Master Thesis Full Report v3 livroChurneELifetimevalue LTV
No ratings yet
Master Thesis Full Report v3 livroChurneELifetimevalue LTV
51 pages
Best Fc300 English
No ratings yet
Best Fc300 English
69 pages
Journal TKDD 2021
No ratings yet
Journal TKDD 2021
38 pages
A Novel Approach To Predicting Customer Lifetime Value in B2B Saas Companies
No ratings yet
A Novel Approach To Predicting Customer Lifetime Value in B2B Saas Companies
15 pages
Bernat
No ratings yet
Bernat
87 pages
Predictive Modelling
100% (1)
Predictive Modelling
58 pages
SSMDA Project
No ratings yet
SSMDA Project
27 pages
CustomerChurn Assignment
100% (3)
CustomerChurn Assignment
15 pages
An Boosting Business Intelligent To Customer Lifetime Value With Robust M-Estimation
No ratings yet
An Boosting Business Intelligent To Customer Lifetime Value With Robust M-Estimation
8 pages
Amazon ML Case 1689698392
No ratings yet
Amazon ML Case 1689698392
7 pages
DAB303 Project 4 Presentation
No ratings yet
DAB303 Project 4 Presentation
8 pages
Estimating CLV Using Aggregated Data - The Tuscan Lifestyles Case Revisited
No ratings yet
Estimating CLV Using Aggregated Data - The Tuscan Lifestyles Case Revisited
17 pages
Predicting Customer Lifetime Values - Ecommerce U
No ratings yet
Predicting Customer Lifetime Values - Ecommerce U
7 pages
Assingment 1
No ratings yet
Assingment 1
6 pages
Workshop 7
No ratings yet
Workshop 7
4 pages
Customer Lifetime Value (CLV) : Ravi Agarwal
No ratings yet
Customer Lifetime Value (CLV) : Ravi Agarwal
22 pages
Marketing Analytics - Group 8
No ratings yet
Marketing Analytics - Group 8
19 pages
ANZ Virtual Internship Module Model Answer For Task 1
No ratings yet
ANZ Virtual Internship Module Model Answer For Task 1
7 pages
Rashmi Jeswani Capstone
No ratings yet
Rashmi Jeswani Capstone
84 pages
Telecom Customer Churn Project Report
50% (2)
Telecom Customer Churn Project Report
25 pages
Conference Paper
No ratings yet
Conference Paper
11 pages
Predict Future Sales Group1 Presentation
No ratings yet
Predict Future Sales Group1 Presentation
11 pages
Quadexp IDS Project
No ratings yet
Quadexp IDS Project
22 pages
Determing Customer Lifetime Value
No ratings yet
Determing Customer Lifetime Value
16 pages
Predictive Modeling For Real-Time Customer Lifetime Value
No ratings yet
Predictive Modeling For Real-Time Customer Lifetime Value
6 pages
Rossmann nr1 Doc
No ratings yet
Rossmann nr1 Doc
7 pages
Estimating Customer Lifetime Value Using Machine L
No ratings yet
Estimating Customer Lifetime Value Using Machine L
19 pages
Predicting Customer Class Using Customer Lifetime Value With Random Forest Algorithm
No ratings yet
Predicting Customer Class Using Customer Lifetime Value With Random Forest Algorithm
6 pages
DM Group Assignment
No ratings yet
DM Group Assignment
23 pages
B.A Assignment
No ratings yet
B.A Assignment
7 pages
Customer Lifetime Value - Customer Analytics With R
No ratings yet
Customer Lifetime Value - Customer Analytics With R
7 pages
Estimating Customer Lifetime Value Using Machine Learning Techniques
No ratings yet
Estimating Customer Lifetime Value Using Machine Learning Techniques
18 pages
Unit 3 - Functions and Strings-1
No ratings yet
Unit 3 - Functions and Strings-1
94 pages
AS400 Command DSP
No ratings yet
AS400 Command DSP
158 pages
BS Mini Project 2
No ratings yet
BS Mini Project 2
5 pages
Report
No ratings yet
Report
9 pages
Applied Datascience - Phase3
No ratings yet
Applied Datascience - Phase3
8 pages
Machine Learning - It3190E: Hanoi University of Science and Technology School of Information and Communication Technology
No ratings yet
Machine Learning - It3190E: Hanoi University of Science and Technology School of Information and Communication Technology
14 pages
Class Handout CI125544 Analyze and Devise in Subassembly Composer Kati Mercier P.E. 1 PDF
100% (1)
Class Handout CI125544 Analyze and Devise in Subassembly Composer Kati Mercier P.E. 1 PDF
52 pages
437 33 Powerpoint-Slides Chapter-4-Functions Ch4
No ratings yet
437 33 Powerpoint-Slides Chapter-4-Functions Ch4
23 pages
How To Optimize An Expert Advisor Using MetaTrader 4 Strategy Tester
100% (2)
How To Optimize An Expert Advisor Using MetaTrader 4 Strategy Tester
10 pages
Object Oriented Programming With CPP Study Material GTU 2017 21022017 023107AM
100% (1)
Object Oriented Programming With CPP Study Material GTU 2017 21022017 023107AM
67 pages
Usfos UM 06
No ratings yet
Usfos UM 06
220 pages
Praktikum Modul 3
No ratings yet
Praktikum Modul 3
5 pages
Sample - Customer Churn Prediction Python Documentation
No ratings yet
Sample - Customer Churn Prediction Python Documentation
33 pages
3 - A Modified Pareto
100% (1)
3 - A Modified Pareto
26 pages
Cs301 Supreme File For Mid by HaCkeRzZz
No ratings yet
Cs301 Supreme File For Mid by HaCkeRzZz
55 pages
ABB
100% (2)
ABB
84 pages
s7300 fm352 Operating Instructions en en-US PDF
No ratings yet
s7300 fm352 Operating Instructions en en-US PDF
226 pages
Exam 19 March Questions
No ratings yet
Exam 19 March Questions
13 pages
Operating Manual: HGM6200K Series Automatic Generator Module
No ratings yet
Operating Manual: HGM6200K Series Automatic Generator Module
35 pages
Paper La Perla
No ratings yet
Paper La Perla
23 pages
TemperatureControllerAI208v78 191003
No ratings yet
TemperatureControllerAI208v78 191003
2 pages
PitchBook Analyst Note Leveraging Technology To Revamp Health Insurance
No ratings yet
PitchBook Analyst Note Leveraging Technology To Revamp Health Insurance
12 pages
RSD ICBS v3
No ratings yet
RSD ICBS v3
35 pages
ASP Net Notes1
No ratings yet
ASP Net Notes1
172 pages
On The Cover: September 2001, Volume 7, Number 9
No ratings yet
On The Cover: September 2001, Volume 7, Number 9
49 pages
Wizards
No ratings yet
Wizards
230 pages
MELSEC iQ-R Ethernet, CC-Link IE, and MELSECNET/H Function Block Reference
No ratings yet
MELSEC iQ-R Ethernet, CC-Link IE, and MELSECNET/H Function Block Reference
202 pages
Python Lab Manual
No ratings yet
Python Lab Manual
107 pages
SAP HR - Ad Hoc Query
100% (1)
SAP HR - Ad Hoc Query
5 pages
BC 400
No ratings yet
BC 400
5 pages
ImageAnalysis CASA PDF
No ratings yet
ImageAnalysis CASA PDF
30 pages
Macro Accounting and General Equilibrium Nshi
No ratings yet
Macro Accounting and General Equilibrium Nshi
67 pages
2017 Investor Day 03.23.2017
No ratings yet
2017 Investor Day 03.23.2017
66 pages
Common Errors To Avoid: Review Questions and Exercises
No ratings yet
Common Errors To Avoid: Review Questions and Exercises
5 pages
Air Hockey For CS Viva
No ratings yet
Air Hockey For CS Viva
17 pages
SSRN Id2725485
No ratings yet
SSRN Id2725485
10 pages
MKC Material PythonProgramming
No ratings yet
MKC Material PythonProgramming
11 pages
Plan 1010 Final Paper
No ratings yet
Plan 1010 Final Paper
15 pages
Haddad, Yvonne Yazbeck, and Ellison Banks Findly, Eds. Women, Religion, and Social Change. SUNY Press, 1985. PG 19
No ratings yet
Haddad, Yvonne Yazbeck, and Ellison Banks Findly, Eds. Women, Religion, and Social Change. SUNY Press, 1985. PG 19
5 pages
435 IEEE TR Proposal 04
No ratings yet
435 IEEE TR Proposal 04
14 pages
The Ultimate Slack Cheat Sheet
No ratings yet
The Ultimate Slack Cheat Sheet
2 pages
Design of A Basic Block
No ratings yet
Design of A Basic Block
5 pages
Object Oriented Concepts-II
No ratings yet
Object Oriented Concepts-II
8 pages
Auditing Cloud Computing: A Security and Privacy Guide
From Everand
Auditing Cloud Computing: A Security and Privacy Guide
Ben Halpert
3/5 (2)
Data Mining Models: Techniques and Applications
From Everand
Data Mining Models: Techniques and Applications
Ravi Deshpande
No ratings yet
Backtrader Essentials: Building Successful Strategies with Python
From Everand
Backtrader Essentials: Building Successful Strategies with Python
Ali AZARY
No ratings yet
Management Accounting: Business Strategy & Performance: Decision-Making by Numbers
From Everand
Management Accounting: Business Strategy & Performance: Decision-Making by Numbers
Commerce Central
No ratings yet
SAP Variant Configuration: Your Successful Guide to Modeling
From Everand
SAP Variant Configuration: Your Successful Guide to Modeling
Mike Piehl
5/5 (3)
Data Science Programming In Python
From Everand
Data Science Programming In Python
Anita Raichand
No ratings yet
"C Programming for Beginners: A Step-by-Step Guide"
From Everand
"C Programming for Beginners: A Step-by-Step Guide"
Lov kush
No ratings yet
Cloud Brokering
From Everand
Cloud Brokering
Felipe Díaz-Sánchez
No ratings yet
Manufacturing: Engineering, Management and Marketing
From Everand
Manufacturing: Engineering, Management and Marketing
S.O.T Ogaji
No ratings yet
Cost Management: A Case for Business Process Re-engineering
From Everand
Cost Management: A Case for Business Process Re-engineering
Ivor Ogidefa
No ratings yet
Google Cloud Data Engineer 100+ Practice Exam Questions With Well Explained Answers
From Everand
Google Cloud Data Engineer 100+ Practice Exam Questions With Well Explained Answers
vivian njoroge
No ratings yet
Process Capability: Six Sigma Thinking, #4
From Everand
Process Capability: Six Sigma Thinking, #4
Sumeet Savant
5/5 (2)
Google Cloud Platform for Data Engineering: From Beginner to Data Engineer using Google Cloud Platform
From Everand
Google Cloud Platform for Data Engineering: From Beginner to Data Engineer using Google Cloud Platform
alasdair gilchrist
5/5 (1)
AZ-104 Azure Administrator Practice Paper 2: AZ-104 Azure Administrator, #2
From Everand
AZ-104 Azure Administrator Practice Paper 2: AZ-104 Azure Administrator, #2
Tech Interviews
No ratings yet
Exam AZ 900: Azure Fundamental Study Guide-1: Explore Azure Fundamental guide and Get certified AZ 900 exam
From Everand
Exam AZ 900: Azure Fundamental Study Guide-1: Explore Azure Fundamental guide and Get certified AZ 900 exam
Mamta Devi
No ratings yet
(ISC)2 CCSP Certified Cloud Security Professional Official Study Guide
From Everand
(ISC)2 CCSP Certified Cloud Security Professional Official Study Guide
Mike Chapple
5/5 (1)
Salesforce Certified Platform Developer I CRT-450 Exam Preparation
From Everand
Salesforce Certified Platform Developer I CRT-450 Exam Preparation
Georgio Daccache
No ratings yet

Walkthrough For The Clvtools Package: 1 Prerequisites: Setup The R Environment

Uploaded by

Walkthrough For The Clvtools Package: 1 Prerequisites: Setup The R Environment

Uploaded by

Walkthrough for the CLVTools Package

1 Prerequisites: Setup the R environment

Load the package

#Apply the CLVTools Package

1.1 General workflow

1.2 Load sample data provided in the package

1.3 Initialize the CLV-Object

1.4 Check the clvdata Object

To get details on the clvdata object, print it to the console.

1.5 Estimate Model Parameters

#Extract the coefficients only

#Extract the confidence intervals

# Variance-Covariance Matrix at maximum

Command Model Covariates Type

1.6 Predict Customer Behavior

or provide a date indication the end of the prediction period:

2005−01 2005−07 2006−01 2006−07

Density of Customer's Mean Time between Transactions

Weekly tracking plot

Legend Actual Pareto NBD Standard

Analogously, we can estimate the model containing time-varying covariates.

1.9 Add Correlation to the model

1.10 Advanced Options for Contextual Factors

2.1 Load sample data provided in the package

clv.apparel <- clvdata(apparelTrans,

2.2 Estimate Model Parameters

2.4 Plot Spendings

0 50 100 150 200

Legend Actual Gamma−Gamma

You might also like