A Basic Approach To K-Means Clustering Applied To Stock Data
A Basic Approach To K-Means Clustering Applied To Stock Data
K-means clustering is one of the basic So far we have covered what K-means is all
clustering algorithms in the machine about, now we will look at how to apply this
learning domain. The inference of this concept to real world data. Stock data is
algorithm is based on the value of ‘K’ which universe where each of them is related in
is the number of clusters that can be found some way or the other, to bring out some
in n-dimensional dataset. In K-Means meaningful information out of this universe
algorithm, since it is consider there are ‘k’ is of much importance. In this exercise we
number of clusters; we consider there are will take universe of 50 stocks which
’k’ number of cluster means (center points), constitute NIFTY, we will take return series
where the cluster mean is average of all the of this constitutes in order to make uniform
data points falling under each cluster. The series and to create clusters.
end objective algorithm is that each data
In order to carry out this exercise we need
point in the data set is grouped into ‘k’
to have two parameters for each stocks, so
cluster and ‘k’ cluster means. If the dataset
we will take mean and standard deviation
is tightly surrounding the cluster means,
of each stock for last two years of data and
then it consider as a good cluster.
plot them to see how they looks like.
After finding out distance we need to figure This paper highlight about clustering further
out minimum distance and among which discusses about linear form of unsupervised
cluster particular data point belongs clustering method K-means clustering. The
Class Minimum Distance idea further implemented to universe of
Cluster 1 0.00039 stock data with their return and standard
Cluster 2 0.00202 deviation properties and tries to classify
Cluster 1 0.00104 optimum cluster for each stocks.
Cluster 2 0.00172
Cluster 1 0.00091
Name 2 Years Reference
ACC Cluster 1
Ambuja Cements Cluster 2 http://en.wikipedia.org/wiki/Cluster_an
Asian Paints Cluster 1
alysis
Axis Bank Cluster 2
Bajaj Auto Cluster 1 Aravind H, C Rajgopal, K P Soman. “A
Bank of Baroda Cluster 2 simple approach to Clustering in excel”
Bharti Airtel Cluster 2 (international journal of computer
BHEL Cluster 3
applications (0975-8887)
BPCL Cluster 2
Cairn India Cluster 1 https://sites.google.com/site/dataclust
Cipla Cluster 1 eringalgorithms/
Coal India Cluster 1 http://www.microarrays.ca/services/km
DLF Cluster 3 eans_clustering.pdf
Dr Reddys Labs Cluster 1
GAIL Cluster 1
http://home.deib.polimi.it/matteucc/Cl
Grasim Cluster 1 ustering/tutorial_html/kmeans.html
HCL Tech Cluster 1
HDFC Cluster 1
HDFC Bank Cluster 1
Hero Motocorp Cluster 1
Hindalco Cluster 3
HUL Cluster 1
ICICI Bank Cluster 2
IDFC Cluster 3
IndusInd Bank Cluster 2
Infosys Cluster 2
ITC Cluster 1
Jaiprakash Asso Cluster 3
Jindal Steel Cluster 3
Kotak Mahindra Cluster 1
Larsen Cluster 2
Lupin Cluster 1
Mah and Mah Cluster 1
Maruti Suzuki Cluster 2
NMDC Cluster 2
NTPC Cluster 1
ONGC Cluster 1
PNB Cluster 2
Power Grid Corp Cluster 1
Ranbaxy Labs Cluster 3
Reliance Cluster 1
Reliance Infra Cluster 3
SBI Cluster 2
Sesa Goa Cluster 3
Sun Pharma Cluster 1
Tata Motors Cluster 3
Tata Power Cluster 2
Tata Steel Cluster 2
TCS Cluster 1
UltraTechCement Cluster 1