Data Viz Case Study
* Data science is a multidisciplinary field where computer science and statistics meet and
blend with business acumen, with the aim of extracting hidden insights from data and
turning them into impactful decisions.
* The origin of data science can be traced back to the early days of statistics, when
scientists and engineers began to use statistical methods to analyse data. However, it was
not until the 1990s that data science really began to emerge as a field of its own. The
overwhelming growth of data, and its potential uses, created pressure to widen the
capabilities of data analysis techniques. The field then converged with AI, which caused
the role of the data scientist to branch out in several directions, and many new roles
emerged, such as:
-> Data Scientist / Data Engineer (ETL) / Machine Learning Engineer / Big Data
Engineer / Computer Vision Engineer / Finance Data Analyst / Business Intelligence
Analyst / NLP Engineer
* Even though the original data scientist role has split into the several roles listed
above, we want to know whether demand still outstrips supply, or whether entry-level
(fresher) job opportunities are already saturated. Since we have no direct measure of
this, we will observe how salaries have evolved over recent years and what that
reflects.
* This will help the thousands of data science aspirants who are looking to transition
into data science, as well as those who already intend to build a career in data science
or an AI-related field.
Objectives:
* To understand the current market value of data science skills and to identify the factors
that influence the salaries of data science related jobs.
(Many factors play a part in determining the compensation for any data science related
position. Our primary concern is to pinpoint some of the key factors and analyse their
individual traits, as well as their impact on wages, using a variety of markers. We will
rely heavily on visual representations (plots, charts, graphs, etc.) to obtain those
markers.)
* To track the trends in the salaries of data science related jobs over time.
* To compare salaries across different industries and locations, and to see how
salary varies with the size of the organisation.
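The trend and comparison objectives above can be sketched as a grouped median. This is
only a minimal, dependency-free illustration, not the analysis itself: the field names
(work_year, company_size, salary_usd) and the sample values are hypothetical and are not
taken from the actual dataset.

```python
from collections import defaultdict
from statistics import median

# Hypothetical records; field names and values are illustrative assumptions.
records = [
    {"work_year": 2021, "company_size": "S", "salary_usd": 60000},
    {"work_year": 2021, "company_size": "L", "salary_usd": 90000},
    {"work_year": 2022, "company_size": "S", "salary_usd": 70000},
    {"work_year": 2022, "company_size": "L", "salary_usd": 110000},
    {"work_year": 2022, "company_size": "L", "salary_usd": 130000},
]


def median_salary_by(rows, key):
    """Group rows by `key` and return the median salary of each group."""
    groups = defaultdict(list)
    for row in rows:
        groups[row[key]].append(row["salary_usd"])
    return {group: median(values) for group, values in sorted(groups.items())}


# Trend over time, and comparison across organisation sizes.
by_year = median_salary_by(records, "work_year")
by_size = median_salary_by(records, "company_size")
print(by_year)
print(by_size)
```

The median is used here rather than the mean because salary data tends to contain a few
very large values; the same helper could be pointed at an industry or location field.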
Data:
Proposed Analysis Plan
• First, we will carry out the preliminary step of any data analysis, which is data
cleaning. This consists of:
(Removing outliers, imputing missing values, scaling the necessary numerical
variables, encoding categorical features, and reducing the cardinality of categorical
features. We will use suitable plots both before and after each technique is applied,
to make sure that the method used did not distort the original structure of the
data.)
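The cleaning steps listed above could look roughly like the sketch below. It is written
with the Python standard library only and rests on several assumptions that the plan does
not fix: outliers are detected with the 1.5 x IQR rule, missing values are filled with the
median, scaling is min-max, encoding is one-hot, and rare categories are merged into
"Other". The field names (salary_usd, job_title) and the sample rows are hypothetical.

```python
from collections import Counter
from statistics import median, quantiles


def iqr_bounds(values, k=1.5):
    """Return (low, high) fences placed k * IQR beyond the quartiles."""
    q1, _, q3 = quantiles(values, n=4)
    spread = q3 - q1
    return q1 - k * spread, q3 + k * spread


def clean(rows, num_key, cat_key, min_count=2):
    observed = [r[num_key] for r in rows if r[num_key] is not None]

    # 1. Impute missing numeric values with the median of the observed ones.
    fill = median(observed)
    rows = [{**r, num_key: fill if r[num_key] is None else r[num_key]} for r in rows]

    # 2. Remove outliers that fall outside the IQR fences.
    low, high = iqr_bounds(observed)
    rows = [r for r in rows if low <= r[num_key] <= high]

    # 3. Reduce cardinality: categories seen fewer than min_count times become "Other".
    counts = Counter(r[cat_key] for r in rows)
    rows = [
        {**r, cat_key: r[cat_key] if counts[r[cat_key]] >= min_count else "Other"}
        for r in rows
    ]

    # 4. Min-max scale the numeric variable to the range [0, 1].
    values = [r[num_key] for r in rows]
    lo, hi = min(values), max(values)
    span = (hi - lo) or 1  # avoid division by zero when all values are equal

    # 5. One-hot encode the categorical feature.
    categories = sorted({r[cat_key] for r in rows})
    cleaned = []
    for r in rows:
        out = {f"{num_key}_scaled": (r[num_key] - lo) / span}
        for c in categories:
            out[f"{cat_key}={c}"] = int(r[cat_key] == c)
        cleaned.append(out)
    return cleaned


# Hypothetical sample: one missing salary, one extreme salary, one rare title.
raw = [
    {"job_title": "Data Scientist", "salary_usd": 50000},
    {"job_title": "Data Scientist", "salary_usd": 55000},
    {"job_title": "Data Engineer", "salary_usd": 60000},
    {"job_title": "Data Engineer", "salary_usd": None},
    {"job_title": "Data Scientist", "salary_usd": 65000},
    {"job_title": "Data Engineer", "salary_usd": 70000},
    {"job_title": "NLP Engineer", "salary_usd": 75000},
    {"job_title": "Data Scientist", "salary_usd": 80000},
    {"job_title": "Data Scientist", "salary_usd": 1000000},
]
cleaned = clean(raw, "salary_usd", "job_title")
for row in cleaned:
    print(row)
```

The order of the steps differs slightly from the list: the outlier fences are computed
from the observed values before imputation, so that filled-in values do not influence
them. The values before and after each step are the ones that would be plotted to check
that the shape of the data has not been distorted.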
References: Kaggle