Demographics Segmentation Using Machine Learning
Demographics Segmentation Using Machine Learning
Submitted by-
Alok Kumar Singh(1803310034)
Submitted to-
Harshit Kumar (1803310090) Prof. Akash Luthra Sir
Aditya Singh (1803310020) (Department of CSE)
RKGIT,GHAZIABAD.
INTRODUCTION
• The main aim of this project is to create a machine learning model which will categorize
data set of population into groups for better understanding of the traits shown by them.
As we know that India is one of the largest growing economies in the world ,and with high
population. India is known for its rich diversity with More than 400 cities have populations
of 100,000 or more with different religions and ethnicity etc.
Suppose a marketing firm wants to launch a promotional campaign across different regions
of country so in order to do that the firm wants to understand the diversity in population
demography so that it can plan the promotional campaign accordingly.
The objective is to segregate the regions into different groups so that marketing team can
plan their resources accordingly
SOLUTION
• Thus allowing marketing analysts and demographics experts to paint a picture of what
the average consumer of that particular segment is like.
LITERATURE SURVEY
• It is well said that data is the new oil ,so with the presence of dataset , we
can better analyse and build model for more efficiency and better growth.
• Breaking markets down into smaller groups is more efficient for both the
brand and the consumer because brands can know whom to ignore and
consumers are more likely to receive only messaging about products they
are likely
PROPOSED MODULES
• CLUSTERING
Clustering is the task of dividing the population or data points into a number of groups
such that data points in the same groups are more similar to other data points in the same
group than those in other groups.
• K-MEANS
K-means algorithm identifies k number of centroids, and then allocates every data point
to the nearest cluster, while keeping the centroids as small as possible.
• EUCLIDEAN DISTANCE
Euclidean Distance represents the shortest distance between two points.
SOFTWARES USED
• Jupyter Notebook
for data preprocessing ,applying model and visualising the results.
• VsCode IDE
to create frontend.
Thank You