Sequential Clustering and Classication Approach To Analyze Sales Performance of Retail Stores Based On Point of Sale Data
Sequential Clustering and Classication Approach To Analyze Sales Performance of Retail Stores Based On Point of Sale Data
Point-of-Sale (POS) data analysis is usually used to explore sales performance in business
commence. This manuscript aims to combine unsupervised clustering and supervised classi¯ca-
tion methods in an integrated data analysis framework to analyze the real-world POS data.
Clustering method, which is performed on sales dataset, is used to cluster the stores into several
groups. The clustering results, data labels, are then combined with other information in store
features dataset as the inputs of the classi¯cation model which classi¯es the clustering labels by
using store features dataset. Non-dominated sorting generic algorithm-II (NSGA-II) is applied in
the framework to employ the multi-objective of clustering and classi¯cation. The experimental
case study shows clustering results can reveal the hidden structure of sales performance of retail
stores while classi¯cation can reveal the major factors that e®ect to the sales performance under
di®erent group of retail stores. The correlations between sales clusters and the store information
can be obtained sequentially under a series of data analysis with the proposed framework.
1. Introduction
Point-of-sale (POS) data are commonly used for sales performance analysis, for
example, analyzing sales e®ects,1 improving customer service,2 and sale prediction.3,4
The transactional data contain statistical information that identi¯es a particular
customer who bought a set of products at certain prices, at a store or branch, and
‡ Corresponding author.
885
886 C.-L. Yang & T. P. Q. Nguyen
time.5,6 Fisher and Raman also mentioned that the proper analysis of sales data can
be extremely useful to a retailer in improving decision-making on optimizing store
assortments and dynamic pricing of items.7 Understanding the POS data not only
reveals the customers' purchase behaviors but also helps in enhancing the supply
chain management with more e®ective inventory control. For instance, POS data
analysis can discover how well an item was sold. Then, using this information, a
retailer can adjust the ordering level correspondingly as well as make pricing deci-
sions that adapt the trend and seasonality of the item at di®erent periods.8
Duchessi et al. grouped the existing techniques used for the retail POS data
analysis into three approaches9: (1) query-based method, (2) statistical techniques,
and (3) data-mining or machine learning method. Recently, Choi et al. also sum-
marized that statistics, machine learning, data mining, and optimization are the
main big data analytics techniques that have been utilized in operation management
areas.10 Essentially, the query-based methods by database query language (SQL) are
evaluated as easy-to-use techniques on the datasets with the prede¯ned columns/
features. However, these techniques were unable to detect hidden information
and knowledge. The statistical techniques such as multiple analysis of variance
(MANOVA), cluster analysis, regression analysis, and factor analysis can perform
multivariate patterns' analysis of association discovery. Although statistical tech-
niques provide scienti¯c interference, it is hard to interpret the results under multi-
dimensional datasets in a big data environment as well as to construct a statistical
approach when the assumption of data distribution might not be held. The algo-
rithms of data mining or machine learning techniques such as tree induction, neural
network induction, and association rule induction can provide promising analytics on
a relatively higher level of data complexity in large-scale data than the other
approaches. Conversely, the \algorithm-based" approach requires professional model
building and the coding of data into classes to handle the big data.
In general, a framework that can conduct data mining functions simply and
straightforwardly to analyze POS data is necessary for daily or weekly retailing
analysis. For example, a manager who is in charge of supervising multiple stores or
branches would like to investigate the sales performance of each store and also to
search out which factors in°uence the sales performance. To conduct the analysis,
usually, the performance of each store can be represented by the revenue received
within a speci¯ed period, such as a season, month, or week. To di®erentiate the store
performance, a grouping technique or clustering method is required to cluster the
stores by their multivariate time series revenue data. Based on the clustering result, a
manager would like to ¯nd out if there is a special sales pattern existing in the
clustering results. Besides, the manager might want to know further why those stores
have similar sales performances. Is this similarity based on store geometry, location,
customer demographics, or local competition? In other words, revealing the factors
related to the store similarity in terms of sale performance among stores can help in
managing the assortment of the item in similar stores (\clusters") rather than trying
to deal with listing items each store or each item individually. Factors regarding the
Sequential Clustering and Classi¯cation Approach 887
stores such as location, human resources, population around the store, available
transportation nearby the store, and so on are critical in in°uencing the sales per-
formance. Thus, utilizing classi¯cation analysis to ¯nd the most correlated factors
which a®ect the sales signi¯cantly is important.
Based on the analytics scenarios addressed beforehand, a systematic data analysis
framework is needed to handle the daily or weekly analysis tasks as a routine. In this
research, a data mining framework combining data clustering and classi¯cation tasks
using fast and elitist non-dominated sorting genetic algorithm (NSGAII-SCC)11 is
employed to analyze POS data. Clustering stores or sale items by the sale records
help in revealing the hidden information behind the selling pattern. Classi¯cation
analysis is applied to identify the factors which are likely correlated with the found
sales pattern in each cluster. By combining clustering and classi¯cation on data
exploration, an interesting research question was raised. When utilizing the clus-
tering method on the performance measures and then further using the classi¯cation
method to reveal the correlated factors, how to maintain the clustering and classi-
¯cation e®ectiveness (or qualities) to show the con¯dence of the POS data analysis?
The complexity of this combined analysis is depended upon the number of ways to
cluster retail stores and the multiple possibilities of the associated factors related to
the found store clusters (in terms of sale performance), for each clustering case. As
studied in the literature, the clustering problem is an NP-hard optimization prob-
lem,12 and searching the in°uential factor so-called the feature selection for classi-
¯cation is also a complicated technique, especially for a large dataset.13 When
searching space of ¯nding the correlation among the in°uential factors and retail
performance by POS data is large, how to conduct the data analysis under the need
of investigating retailing performance is an interesting research question.
To attack this research question, the NSGA-II-SCC method simultaneously opti-
mizes the solution quality of clustering and classi¯cation. The objective functions are
formed as (1) to minimize the clustering compactness when performing store clustering
based on the revenue performance and (2) to maximize the classi¯cation accuracy
when investigating factors that likely contribute to classifying the revenue perfor-
mance of each store cluster. For this combined data analysis, the number of clusters
(k), which is a user-de¯ned parameter, seems crucial and needs further investigation.
Too many clusters may lead to classi¯cation di±culty in recognizing the correlated
factors and loss of generality. On the other hand, too few clusters may make the
partitions too vague that the classi¯cation result may be too general and lose the
investigation capability. Thus, this study uses Technique for Order of Preference by
Similarity to Ideal Solution (TOPSIS)14 method to analyze the Pareto-optimal solu-
tions to determine the best number of clusters with its corresponding solution for the
framework. Besides, stepwise regression technique15 is employed to identify the sig-
ni¯cant features for classi¯cation which are highly correlated with the store patterns.
This paper is structured as follows. Section 2 reviews POS data analysis and
clustering and classi¯cation techniques. Section 3 describes the methodology of the
proposed framework. Section 4 shows the experimental result of applying the
888 C.-L. Yang & T. P. Q. Nguyen
proposed framework on the POS data of a bakery retail chain as a case study.
Section 5 presents the conclusion and future research direction.
2. Literature Review
2.1. POS data analysis
The POS system contains information on sold products such as the item number,
price, selling time, location, and customer information. Sales data are convenient to
be collected and managed using a POS system and have been applied widely and
e®ectively on di®erent commercial activities. The POS data will be stored and an-
alyzed so that the inventory and supply information can be provided rapidly to
decrease labor costs and improve productivity. For example, Croson and Donohue
studied the impact of sharing POS data on reducing the bullwhip e®ect in supply
chain management.16 Sharing POS information increased supply chain performance
by decreasing the magnitude of order oscillations for manufacturers, distributors,
wholesalers, and retailers. Karen Stein analyzed POS data for food service and built a
system to expedite customer service and support the managers to control their
business such as who are dining, what is selling as well as the inventory.17 Li et al.
analyzed ¯nancial data to de¯ne the groups of customers' behaviors and the po-
tential risks. In their study, a new cluster validation index and a novel penalty-
function-based solver were proposed to automatically detect customer groups as well
as re¯ne the clusters' centroids and hyperellipsoidal scopes.18
In the literature, using POS data to predict product sales is one of the popular
research works in POS data analysis. POS data can be processed in di®erent ways for
sales forecasting, which will bring great e®ects on marketing and sales strategies.3
POS data are formed in chronological order, as a result, time series models are
commonly used. Sundararaman et al. adopted the POS data and considered the
databases of trends, seasonality, and seasonal index using centered moving averages
and normalization.4 Holt-Winters model used the original trend and seasonal factor
to calculate the average value of POS data. Employing deleting and ¯ltering the
seasonal factor, fewer forecasting errors and better forecasting results can be pre-
sented. Williams et al. also aggregated order forecasting and past order information,
executing processes and other uncertain factors from POS data.19 Their model ¯l-
tered the in°uence and errors of the bullwhip e®ect and proposed a long-term balance
of inventory strategy.
Some researchers studied the e®ect of customers' behaviors based on POS data.
Duchessi et al. used POS data to group retail grocery chain customers based on their
similar purchasing behavior by geodemography.9 Kashima et al. developed a rec-
ommendation method that integrated a POS system and an automatic order func-
tion for a restaurant.20 The developed system recommended a menu to customers
based on the customers' pro¯le analyzed based on POS data. Ogawa et al. used POS
data to analyze customers' purchasing behaviors on vegetables and fruits.2 The sales
data revealed that point-of-purchase health information could be e®ective to foster
Sequential Clustering and Classi¯cation Approach 889
customers' healthy dietary habits. Aloysius et al. also investigated the service process
perceptions to shipping outcomes in the retail environment by using \mobile" POS
data carried by the smartphone.21 They found that the technology enablers on
mobile POS systems in scanning and payment scenarios give retailers competitive
advantage in the era of big data. All of the mentioned research utilized certain data
analysis methods or analysis framework to conduct the study under POS data.
Although POS data have been used in a variety of research domains, a few
research, based on our knowledge, focuses on developing the particular data mining
technique for analyzing POS data to deal with the analysis of retailing performance
and the corresponding factors. Therefore, this study aims to develop a new data
analysis framework that combines data clustering and data classi¯cation to analyze
POS data. The proposed technique is expected to apply to general POS datasets. In
this work, a case study of a bakery retail store chain was demonstrated as an example
of the utilization of the developed framework.
3. Methodology
In this study, sequential clustering and classi¯cation are utilized to analyze the
performance of a retail chain based on the store sales patterns. The framework of
sequential clustering and classi¯cation using a fast and elitism non-dominated
Sequential Clustering and Classi¯cation Approach 891
Fig. 1. Framework of NSGAII-SCC on retail POS data with the stepwise regression analysis for feature
selection and TOPSIS for solution selection.
Fig. 2. Illustration of identifying data features for clustering and classi¯cation of retail POS data.
time durations such as daily or weekly without losing the generality. For example, in
our case study, the Q dataset was aggregated as the total sales transactions per
month of 54 retail bakery stores which sold more than 500 products in each store.
The second dataset X can include store characteristics such as store geometry size,
location of the store, and surrounding facilities nearby which the manager might be
interested in ¯nding the association against the sales performance of the store. These
two datasets Q and X will be used for clustering and classi¯cation, respectively, for
investigating store features that correlate to the sales performance under certain
groups of stores in terms of their sales performance. The process to identify data
features for each dataset is illustrated in Fig. 2.
Fig. 3. Illustration of sequential clustering and classi¯cation data analysis framework with stepwise
regression feature selection and determination of number of cluster by TOPSIS.
for each chromosome. Then, genetic operations such as selection, crossover, and
mutation on feature chromosomes are then applied to search for the solutions for
clustering and classi¯cation. Similar to the original NSGAII method which was
proposed by Deb et al.,38 non-dominated sorting and crowding distance are com-
puted to select populations for the next generation. This process of the genetic
algorithm (GA) will repeat until the stopping criterion is met. The output solutions
on the Pareto front represent the solution candidates. In addition, the chromosomes
of each generation are recorded for stepwise regression analysis to study factors on
classi¯cation accuracy. The TOPSIS was then applied to select the solution for
concluding the analysis. The details of NSGAII-SCC with stepwise regression feature
selection and TOPSIS will be described in the following subsections.
to the sequential clustering and classi¯cation algorithm to obtain the ¯tness values.
Then, the procedure follows the original algorithm of Deb et al.38 Parent chromo-
somes are combined with o®spring chromosomes to conduct non-dominated sorting
and calculate crowding distance to select the next generation. Figure 4 illustrates the
non-dominated sorting procedure of the NSGAII-SCC framework.11
The process of GA will repeat until the stopping criterion is met. The output
solutions on the Pareto front represent the solution candidates.
FR-class is to ¯nd out the solutions which have a relatively smaller number of features
used based on the principle of Occam's razor in data science (simple is better).41 After
assigning weights for MSE, 1/acc, FR-clus, and FR-class, TOPSIS method is used to
determine the relative closeness of each solution. Then, the number of cluster associ-
ated with the solution which has the largest relative closeness will be determined as the
cluster of X dataset. This determination is crucial for further analyzing the relationship
between store performance and store features on POS data.
4. Case Study
4.1. Data description and parameter setting
In this case study, POS data are collected from a bakery franchisee in Hangzhou,
China. The data are divided into two sets, one for clustering and the other one for
classi¯cation. The ¯rst set is the sales dataset Q that contains aggregated monthly
revenue by store. There are 54 bakery stores and each store sells more than 500
products in this bakery franchisee. The second portion is the store features dataset X
that comprises characteristics information related to each store. The factors included
in the store features dataset are as follows:
rate ¼ 0:1, weighting component ¼ 1:2 and chromosome selection ¼ 0:1. The num-
ber of generations is used as the stopping criteria for GA. The number of clusters (k)
ranges from 2 to 10. To evaluate the performance of the NSGAII-SCC framework on
the POS data, the results will be compared with the conventional sequential clus-
tering and classi¯cation (SCC) without NSGA. As mentioned in the previous section,
two validation indices (MSE and 1/acc) are used to measure the results. The clus-
tering result is validated by the MSE index and the classi¯cation performance is
evaluated by 1/acc.
agglomerative clustering are compared to select the best one. Similarly, ANN, KNN,
SVM, and DT results are compared to choose the method that can give the best
results on the given POS dataset. The results are shown in Table 2 and plotted in
Figs. 6 and 7.
Figure 6 shows that k-means performs better than hierarchical clustering in all
values of k: For two to six clusters, there is a signi¯cant di®erence in MSE between
k-means and the hierarchical method. However, the di®erence decreases when the
number of clusters exceeds six. Based on this result, k-means is selected to perform
the clustering task on the given POS dataset.
Similarly, Fig. 7 illustrates the comparison of four classi¯cation methods using the
POS data. DT performs better than other methods in terms of 1/acc (smaller is
better). In most cases, the 1/acc index rises steadily when the number of clusters
increases. This indicates that the classi¯cation technique becomes less accurate when
there are more class labels (k).
Table 3. Solution of ¯rst Pareto front using NSGAII-SCC with their corresponding features.
k MSE ð108 Þ 1/acc Chromosome for clustering Chromosome for classi¯cation Number of selected features
Clustering Classi¯cation
Based on the results of this prior experiment, this study selects k-means and DT
to perform clustering and classi¯cation tasks, respectively, for the POS data using
the NSGAII-SCC framework. The results of NSGAII-SCC are compared with SCC
to evaluate the performance of the proposed framework.
k NSGAII-SCC SCC
Fig. 8. Solutions of ¯rst Pareto front using NSGAII-SCC and SCC methods based on bakery POS data.
the statistical result con¯rmed the superior of NSGAII-SCC against SCC in terms of
smaller MSE and 1/acc.
Figure 8 further illustrates the solutions of NSGAII-SCC and SCC methods based
on the bakery POS data. In Fig. 8, the red line with the square indicator shows the
solutions of the ¯rst Pareto front using NSGAII-SCC, while the blue line with the
cycle indicator shows the solutions of the ¯rst Pareto front using SCC. The red line
with the square indicator is closer to the original coordinate (left-bottom corner)
which means NSGAII-SCC can obtain the smaller MSE and 1/acc simultaneously.
Once again, this result shows that the proposed framework NSGAII-SCC outper-
forms SCC in terms of better clustering and classi¯cation performances. It also
means NSGAII-SCC can search the better solutions of clustering and classi¯cation
when conducting POS analysis for revenue performance investigation.
revenue performance of stores, ¯ve groups of stores can be established for better
analysis due to the relevant clustering and classi¯cation results combined in Pareto
front solutions.
each cluster to show the sales patterns among stores. Similar sales patterns in one
cluster can be recognized. For example, most of the stores are grouped into Cluster 4
which had diminished revenue in the 2nd, 6th, and 8th months. Meanwhile, Cluster 5
has only one store that has outstanding revenue each month. Cluster 3 contains
stores with a signi¯cant decrease of revenue at the 7th month which shows the unique
patterns than other groups. Cluster 1 and Cluster 2 seem not much distinct from
each other but Cluster 2 has relatively larger variation. Based on this clustering
analysis, the groups of stores with di®erent sale patterns can be revealed for further
investigation.
The chromosomes which represent the selected features help us to identify im-
portant features for both clustering and classi¯cation. Following the selected solution
at k ¼ 5, the clustering chromosome is a string of f010010110011g. This indicates
that six features, among a total of 12 features in the revenue dataset, are chosen as
the representative features to perform clustering. These six selected features are the
revenue of the 2nd, 5th, 7th, 8th, 11th, and 12th months. These six months are
signi¯cant in di®erentiating retail stores shown in Fig. 9 because they represent the
critical sales performance when conducting a comparison of all stores. For instance,
the 7th feature contributes to the signi¯cant pattern characteristic in Cluster 3. The
2nd feature is signi¯cant in most of the clusters. Therefore, using the selected features
can predict clustering results that lead closer to the result while using all data. It also
means that the proposed framework provides not only the mechanism of determining
cluster number considered clustering and classi¯cation result but also selecting the
features which are the most in°uential for store clustering.
Once the clustering method obtains the store labels, the store features are clas-
si¯ed using the clustering labels as classi¯cation targets. Similar to the process of
exploring selected features for data clustering, the chromosome for classi¯cation
corresponding to the optimal value of k determined by TOPSIS is analyzed. Besides,
the stepwise regression method is applied on the classi¯cation chromosomes to
eliminate the redundant and non-signi¯cant features to the acc. The signi¯cance
level is set as 0.1. Finally, the ¯ve most correlated factors, i.e., store size, department
stores, residential area, location, and population density, can be used to classify the
store cluster. The correlations between sales revenue and the associated store in-
formation for ¯ve store clusters are described in Table 6. As can been seen, store size,
department stores, residential area, location, and population density of each store
cluster have di®erent magnitude levels. For example, Cluster 5 was identi¯ed as the
high sales revenue store which happens to have a larger store size in a high popu-
lation density region. However, the stores with bigger °oor plans do not guarantee
tremulous sales. There is no negligible °uctuation in sales in Cluster 4; even the
stores in that cluster have middle store size. Based on the analysis result, the de-
partment stores nearby turn out to be a signi¯cant factor to identify the sale per-
formance in Cluster 4. Moreover, Cluster 3 can be recognized as a special cluster
which is located in the lowest population density region #8 and #10 and relatively
low sales in month 7th. This analysis points out the stores which were located near
Table 6. Correlations between sales revenue and the associated store information for 5 store clusters derived by NSGAII-SCC.
Sales revenue Medium sales Medium sales and Sales decrease sharply Low sales to medium sales High sales
more °uctuating in month 7th
Store size (m2 ) 70 to 78.5 44 to 48 and 79 to 130 44 48 to 70 120
Location (District No. in Table 1) #5 #2, #7 #8, #10 #1, #3, #4, #9 #6
Department stores nearby (unit) 0 2 0, 1 0, 1, 3 1
Residential area (0/1) 0 1 0 0 1
Population density (persons/km2 ) 3000 to 3200 800 to 1000 650 900 to 1000 10000
9900 to 10000 1300 to 1400 1600 to 1700
Sequential Clustering and Classi¯cation Approach
905
906 C.-L. Yang & T. P. Q. Nguyen
elementary schools in suburban areas. It can be easily related to the low sales during
summer because students do not need to go to school during the break. Please note
that although the analysis seems intuitive, the features (factors) mentioned above
were found out by the proposed algorithm without any human intervention. The
analysis afterward actually shows that NSGAII-SCC can cluster the stores by sales
revenue and classify relevant features which correlate signi¯cantly to the clustering
result (monthly revenue). Last but not the least, the result shown in Table 6
demonstrates the useful information that came out by the framework for the retailing
store manager to appraisal the performances of the store clusters based on the
provided POS data.
4.6. Discussion
In this research, the NSGAII-SCC was compared with the conventional SCC method
which has been widely used for data mining. Without considering the complexity of
the combination of clustering and classi¯cation, SCC has straightforward analytics
which considers clustering and classi¯cation as the separated tasks. For the analysis
scenarios of retail revenue performance, the determination of the number of store
clusters and the feature selection for both clustering and classi¯cation are con-
founded together. It means that for any number of clusters determined ¯rst, the
signi¯cant features for clustering and classi¯cation might be di®erent to guarantee
the analysis's e®ectiveness. Also, even we can select features ¯rst for clustering and
classi¯cation, the performance under the di®erent number of clusters might be di-
verse. By using the multi-objective framework with GA searching operations, the
solutions on the Pareto front provide a relatively smaller number of solution can-
didates for further investigation. The solutions also have non-dominated char-
acteristics to ensure the e®ectiveness of clustering and classi¯cation combined.
Without a doubt, NSGAII-SCC needs more computational e®ort because of
the iterative processes under the GA framework. Generally speaking, NSGAII-
SCC needs more than 20–100 times of computational time than SCC depending
on how the stop criteria of GA is set. However, this computation framework can
be executed automatically under the database platform as the stored procedure,
or the analytics platform with data mining subroutines. The analytics framework
of NSGAII-SCC is paid o® without exhaustively searching all combinations with
a number of ðp 1Þ ð2n 1Þ ð2m 1Þ, where k is the largest number of clusters;
n is the number of features for clustering; m is the number of features for classi¯-
cation. For example, in the cast of POS study shown in Table 3, the NSGAII-SCC
can generate Pareto front without going through ð10 1Þ ð212 1Þ ð213 1Þ ¼
301; 879; 305 combinations of clustering and classi¯cation with k=2–10. This meta-
heuristic method proposed by NSGAII-SCC can still be considered computational
e±ciency.
The setting of weights in the TOPSIS method also provides the capability of
managerial adjustment. If the manager of a retail store has more concerned about the
Sequential Clustering and Classi¯cation Approach 907
accuracy of classi¯cation results, the weight of acc can be adjusted higher. Then, the
ranking of choosing the number of store clusters will be tended to more favorable to
the one which has better classi¯cation accuracy. Similarly, if the manager would like
to ensure the clustering has better compactness, the weight of MSE can be set higher
comparing with other decision variables.
In addition, the stepwise regression method is employed to eliminate the redun-
dant and non-signi¯cant features that are correlated to the store patterns. The
removed features may depend on the signi¯cance level. If the p-value of the t-test in
the stepwise model is greater than the signi¯cance level for removal, the feature will
be eliminated from the stepwise model. Thus, setting the higher value for signi¯cance
level may lead to reduce the probability of removing a feature from the stepwise
model. In contrast, the signi¯cant level can be set as a small value to increase the
probability of features' removal.
that measures stores' performance. Then, the relevant store information for each
store is de¯ned as the store features' dataset X for the classi¯cation process. The
proposed data analysis framework is substantial to investigate the store sales per-
formance and reveal the correlated factors. This analysis can be performed by
NSGAII-SCC automatically and the result has signi¯cant usage for improving sales
and developing business strategies. Additionally, the proposed framework can ex-
plicit the best number of clusters account for the better clustering and classi¯cation
performance using POSSIS. Finding out factors that correlate to the sales perfor-
mance based on the stepwise method can further provide managerial meaning when
studying the POS data.
For future research, there are multiple directions to improve the analysis and the
algorithm framework. First, this result can be compared with other multivariate
analysis methods. Second, the algorithm framework can be integrated with the
standard database or ERP system to verify its performance. Third, a data pre-
processing technique can be developed to screen out unimportant features either in
the sales dataset or the store features' dataset to improve the computation e±ciency.
Acknowledgment
We appreciate the ¯nancial support from the Ministry of Science and Technology
of Taiwan, R.O.C. (Contract No. 106-2221-E-011-106-MY3) and the \Center for
Cyber-Physical System Innovation" from The Featured Areas Research Center
Program within the framework of the Higher Education Sprout Project by the
Ministry of Education (MOE) in Taiwan. We also thank Wang Jhan Yang Chari-
table Trust Fund (Contract No. WJY 2020-HR-01) and Vingroup Joint Stock
Company (Vingroup JSC) by Vingroup Innovation Foundation (VINIF) under
project code VINIF.2020.DA19 for their ¯nancial support.
References
1. A. G. Woodside and G. L. Waddle, Sales e®ects of in-store advertising, Journal of Ad-
vertising Research 15(3) (1975) 29–33.
2. Y. Ogawa, N. Tanabe, A. Honda, T. Azuma, N. Seki, T. Suzuki and H. Suzuki, Point-of-
purchase health information encourages customers to purchase vegetables: Objective
analysis by using a point-of-sales system, Environmental Health and Preventive Medicine
16(4) (2011) 239–246.
3. J. S. Zhu, POS data and your demand forecast, First Int. Conf. Information Technology
and Quantitative Management, 16–18 May 2013, Suzhou, China, pp. 8–13.
4. K. Sundararaman, J. Parthasarathi, G. S. V. Rao and S. N. Kumar, Baseline prediction of
point of sales data for trade promotion optimization, Communications and Information
Technology (ICCIT), 2012 Int. Conf., 2012, 26–28 June 2012, Hammamet, Tunisia,
pp. 17–20.
5. A. Banerjee and B. Banerjee, E®ective retail promotion management: Use of point of sales
information resources, Vikalpa 25(4) (2000) 51–60.
Sequential Clustering and Classi¯cation Approach 909
6. G. Kou, Y. Xu, Y. Peng, F. Shen, Y. Chen, K. Chang and S. Kou, Bankruptcy prediction
for SMEs using transactional data and two-stage multiobjective feature selection, Deci-
sion Support Systems 140 (2021) 113429.
7. M. Fisher and A. Raman, Using data and Big Data in retailing, Production and
Operations Management 27(9) (2018) 1665–1669.
8. M. Margaret Weber and S. Prasad Kantamneni, POS and EDI in retailing: An exami-
nation of underlying bene¯ts and barriers, Supply Chain Management: An International
Journal 7(5) (2002) 311–317.
9. P. Duchessi, C. M. Schaninger and T. Nowak, Creating cluster-speci¯c purchase pro¯les
from point-of-sale scanner data and geodemographic clusters: Improving category man-
agement at a major US grocery chain, Journal of Consumer Behaviour 4(2) (2004)
97–117.
10. T.-M. Choi, S. W. Wallace and Y. Wang, Big Data analytics in operations management,
Production and Operations Management 27(10) (2018) 1868–1883.
11. C.-L. Yang and N. T. P. Quyen, Data analysis framework of sequential clustering and
classi¯cation using non-dominated sorting genetic algorithm, Applied Soft Computing 69
(2018) 704–718.
12. M. Mahajan, P. Nimbhorkar and K. Varadarajan, The planar k-means problem is NP-
hard, Theoretical Computer Science 442 (2012) 13–21.
13. J. Cai, J. Luo, S. Wang and S. Yang, Feature selection in machine learning: A new
perspective, Neurocomputing 300 (2018) 70–79.
14. B. Uzun, M. Taiwo, A. Syidanova and D. Uzun Ozsahin, The technique for order of
preference by similarity to ideal solution (TOPSIS), in Application of Multi-Criteria
Decision Analysis in Environmental and Civil Engineering, eds. D. Uzun Ozsahin et al.
(Springer International Publishing, Cham, 2021), pp. 25–30.
15. B. Żogała-Siudem and S. Jaroszewicz, Fast stepwise regression based on multidimensional
indexes, Information Sciences 549 (2021) 288–309.
16. R. Croson and K. Donohue, Impact of pos data sharing on supply chain management: An
experimental study, Production and Operations Management 12(1) (2003) 1–11.
17. K. Stein, Point-of-sale systems for foodservice, Journal of the American Dietetic Asso-
ciation 105(12) (2005) 1861.
18. T. Li, G. Kou, Y. Peng and P. S. Yu, An integrated cluster detection, optimization,
and interpretation approach for ¯nancial data, IEEE Transactions on Cybernetics (2021)
1–14.
19. B. D. Williams, M. A. Waller, S. Ahire and G. D. Ferrier, Predicting retailer orders with
POS and order data: The inventory balance e®ect, European Journal of Operational
Research 232(3) (2014) 593–600.
20. T. Kashima, S. Matsumoto and H. Ishii, Recommendation method with rough sets
in restaurant point of sales system, in Proc. Int. MultiConf. Engineers and Computer Sci-
entists, Hong Kong, 2010, pp. 2018–2023, https://ieeexplore.ieee.org/document/9546664.
21. J. A. Aloysius, H. Hoehle, S. Goodarzi and V. Venkatesh, Big data initiatives in retail
environments: Linking service process perceptions to shopping outcomes, Annals of
Operations Research 270(1) (2018) 25–51.
22. P.-N. Tan, M. Steinbach, A. Karpatne and V. Kumar, Introduction to Data mining, 2nd
edn., What's New in Computer Science, (Pearson, NY, 2018).
23. N. Soni and A. Ganatra, Categorization of several clustering algorithms from di®erent
perspective: A review, International Journal of Advanced Research in Computer Science
and Software Engineering 2(8) (2012) 63–68.
24. G. Kou, Y. Peng and G. Wang, Evaluation of clustering algorithms for ¯nancial risk
analysis using MCDM methods, Information Sciences 275 (2014) 1–12.
910 C.-L. Yang & T. P. Q. Nguyen