SAP HANA PAL - K-Means Algorithm or How To Do Cust... - SAP Community-O

The document discusses the K-Means clustering algorithm. It describes how K-Means works by assigning data points to clusters based on distance to centroids, and recalculating centroids iteratively until clusters stabilize. It notes that specifying the number of clusters K is difficult and provides steps to apply K-Means in SAP HANA to cluster customer mobile phone usage data into segments.

Uploaded by

jefferyleclerc


3/14/24, 7:16 AM SAP HANA PAL – K-Means Algorithm or How to do Cust... - SAP Community

Each cluster is associated with a centroid and each point is assigned to the cluster with the closest centroid. The centroid is
the mean of the points in the cluster. The closeness can be measured using:

Manhattan Distance
Euclidean Distance (most commonly used)
Minkowski Distance
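To make the three measures concrete, here is a minimal Python sketch. The function names are my own and not part of PAL. Manhattan and Euclidean distance are the special cases of Minkowski distance with order p = 1 and p = 2.

```python
def minkowski(a, b, p):
    """Minkowski distance of order p between two equal-length points."""
    return sum(abs(x - y) ** p for x, y in zip(a, b)) ** (1.0 / p)


def manhattan(a, b):
    """Manhattan distance: Minkowski distance with p = 1."""
    return minkowski(a, b, 1)


def euclidean(a, b):
    """Euclidean distance: Minkowski distance with p = 2."""
    return minkowski(a, b, 2)


a = (0.0, 0.0)
b = (3.0, 4.0)
print(manhattan(a, b))  # 7.0
print(euclidean(a, b))  # 5.0
```

The same pair of points is 7 units apart under Manhattan distance and 5 under Euclidean distance, so the choice of measure can change which centroid counts as the closest.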

Every time a point is assigned to a cluster, the centroid is recalculated. This is repeated over multiple iterations until the centroids
no longer change (meaning all points have settled in their corresponding cluster) or until relatively few points
change clusters. Usually, most of the centroid movement happens in the first iterations.
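The loop described above can be sketched in a few lines of Python. This is an illustration under my own assumptions, not how PAL implements it: it uses the common batch variant that recalculates the centroids once per pass over the data (rather than after every single assignment), squared Euclidean distance, and the first k points as the initial centroids.

```python
def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length points."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans(points, k, max_iter=100):
    """Batch K-Means sketch: returns (centroids, assignment)."""
    # Assumption: seed the centroids with the first k points.
    centroids = [tuple(p) for p in points[:k]]
    assignment = [None] * len(points)
    for _ in range(max_iter):
        # Assign each point to the cluster with the closest centroid.
        new_assignment = [
            min(range(k), key=lambda c: sq_dist(p, centroids[c]))
            for p in points
        ]
        # Stop once no point changes cluster anymore.
        if new_assignment == assignment:
            break
        assignment = new_assignment
        # Recalculate each centroid as the mean of its points.
        for c in range(k):
            members = [p for p, a in zip(points, assignment) if a == c]
            if members:  # keep the old centroid if a cluster is empty
                centroids[c] = tuple(
                    sum(col) / len(members) for col in zip(*members)
                )
    return centroids, assignment


points = [(1.0, 1.0), (1.0, 2.0), (9.0, 9.0), (10.0, 9.0)]
centroids, assignment = kmeans(points, k=2)
print(assignment)  # [0, 0, 1, 1]
print(centroids)   # [(1.0, 1.5), (9.5, 9.0)]
```

On this toy data the assignment stabilizes after the second pass, which matches the observation that most of the movement happens in the first iterations.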

One of the main drawbacks of the K-Means algorithm is that you need to specify the number of clusters (K) upfront as
an input parameter. Choosing this value is usually very hard, which is why it is important to run quality measurement
functions to check the quality of your clustering. We will talk about this later in this post.
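As a rough illustration of what such a quality measure can look like, the Python sketch below computes the within-cluster sum of squares (WCSS) for two hand-made clusterings of the same toy data. WCSS is my choice for this example and is not necessarily the measure used later in this post; the data, centroids and assignments are made up.

```python
def wcss(points, centroids, assignment):
    """Sum of squared distances from each point to its cluster centroid."""
    total = 0.0
    for p, a in zip(points, assignment):
        total += sum((x - y) ** 2 for x, y in zip(p, centroids[a]))
    return total


points = [(1.0, 1.0), (1.0, 2.0), (9.0, 9.0), (10.0, 9.0)]

# K = 1: one cluster whose centroid is the overall mean.
one_cluster = wcss(points, [(5.25, 5.25)], [0, 0, 0, 0])

# K = 2: two tight groups, each with its own centroid.
two_clusters = wcss(points, [(1.0, 1.5), (9.5, 9.0)], [0, 0, 1, 1])

print(one_cluster)   # 129.5
print(two_clusters)  # 1.0
```

A lower value means tighter clusters, but WCSS always shrinks as K grows, so it is usually compared across several values of K to find where the improvement levels off.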

I came across a very interesting paper that talks about segmentation in the telecommunication industry, so I thought it
would be a very nice use case to demo the K-Means algorithm in HANA (if you are interested in this topic, I very much
recommend reading this paper). These are the steps I followed:

https://community.sap.com/t5/technology-blogs-by-members/sap-hana-pal-k-means-algorithm-or-how -to-do-customer-segmentation-for-the/ba-p/12976696/page/2 3/39

Prepare the Data

The first step is to create a table that will contain information on customers' mobile phone usage habits, with the following
structure:

CREATE COLUMN TABLE "TELCO" (
    "ID" INTEGER NOT NULL,              -- Customer ID
    "AVG_CALL_DURATION" DOUBLE,         -- Average Call Duration
    "AVG_NUMBER_CALLS_RCV_DAY" DOUBLE,  -- Average Calls Received per Day
    "AVG_NUMBER_CALLS_ORI_DAY" DOUBLE,  -- Average Calls Originated per Day
    "DAY_TIME_CALLS" DOUBLE,            -- Percentage of Calls made during day time hours (9 a.m. - 6 p.m.)
    "WEEK_DAY_CALLS" DOUBLE,            -- Percentage of Calls made during week days (Monday thru Friday)
    "CALLS_TO_MOBILE" DOUBLE,           -- Percentage of Calls made to mobile phones
    "SMS_RCV_DAY" DOUBLE,               -- Number of SMSs received per day
    "SMS_ORI_DAY" DOUBLE,               -- Number of SMSs sent per day
    PRIMARY KEY ("ID")
);

/* Table Type that will be used as the input parameter
   that will contain the data that I would like to cluster */
DROP TYPE PAL_KMEANS_DATA_TELCO;

CREATE TYPE PAL_KMEANS_DATA_TELCO AS TABLE(
    "ID" INT,
    "AVG_CALL_DURATION" DOUBLE,
    "AVG_NUMBER_CALLS_RCV_DAY" DOUBLE,
    "AVG_NUMBER_CALLS_ORI_DAY" DOUBLE,
    "DAY_TIME_CALLS" DOUBLE,
    "WEEK_DAY_CALLS" DOUBLE,
    "CALLS_TO_MOBILE" DOUBLE,
    "SMS_RCV_DAY" DOUBLE,
    "SMS_ORI_DAY" DOUBLE,
