Assignment 2 Data Analysis Framework

Data Analysis Framework,Tableau Project

Assignment 2

Hemalatha Ramakrishna

Trine University

BAN--6093-1P1-SU-2023 - Business Analytics Capstone

Shelby Davis

June 22, 2024


Data Analysis Framework for Data Science Job Postings in the USA

To make informed decisions when analyzing data science job opportunities in the US, you need

to strategically plan your data analysis approach. The data analysis process is a set of steps

necessary to make sense of the available data. Each step is equally important to ensure that data

is properly analyzed and provides valuable, actionable insights. Here are the five most important

steps:

Fig 1- Data Analysis Process

Step 1: The need for Data Analysis

Data analytics is integral for businesses that want to understand organizational challenges and examine data in meaningful ways. Data on its own is just facts and figures. Data analysis organizes, interprets, structures, and distills that data into useful information that gives those facts meaning. Decision-makers can use these insights to take actions that enhance productivity and business value.

Step 2: Data Collection

This data set covers data science job postings in the USA. It is available at https://data.world/jobspikr/10000-data-scientist-job-postings-from-the-usa. The US data science jobs dataset helps analyze the demand for data science professionals, identify emerging trends in job requirements, and better understand regional and sector differences in data science roles across the US job market. The dataset can also be used for research, workforce planning, skills development, and strategic decision-making in data science and related industries. It contains detailed information about job titles, descriptions, required skills, education levels, salaries, and geographic locations. In this project, we will examine job postings for data scientists across salaries, locations, companies, departments, industries, and skills to determine whether these factors are related and whether notable differences exist among them. One of the main factors driving the demand for data scientists is the growth of data. The shift by companies to data-intensive business strategies and the growing acceptance of advanced data-enabled careers are creating job opportunities for data scientists.

Step 3: Data Cleaning

Data cleaning involves identifying and correcting issues in a data set. The goal is to fix data that is inaccurate, incomplete, erroneous, duplicated, or irrelevant to the purpose of the data set, typically by replacing, modifying, or removing it. Combining multiple data sources can introduce duplicate or mislabeled data. If the data is incorrect, the results and algorithms built on it will be unreliable, even if they appear correct. Because decisions are usually based on a dataset, poor-quality data leads to inaccurate results, so data cleaning is essential for producing high-quality data that supports better decisions. Not all data in a dataset is usable; some of it is junk. The data set used for this analysis contained null values, and some records had empty or missing fields. These records need to be removed or filtered out so that the analysis can focus on the required data. Unnecessary columns are removed, and the columns relevant to the analysis are isolated. The data is cleaned manually in an Excel spreadsheet.
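
The same cleaning rules (drop unneeded columns, skip records with missing values, remove duplicates) could also be scripted instead of applied by hand in Excel. The following is only a minimal sketch of that idea in Python, not the procedure used in this project. The sample rows, the column names (job_title, company, state, salary, crawl_id), and the helper clean_rows are hypothetical and are not taken from the actual dataset.

```python
import csv
import io

# Hypothetical sample standing in for the job-postings export.
# The real dataset's column names and values may differ.
RAW_CSV = """job_title,company,state,salary,crawl_id
Data Scientist,Acme,CA,120000,a1
Data Scientist,Acme,CA,120000,a1
Data Analyst,,NY,85000,b2
ML Engineer,Globex,TX,,c3
Data Engineer,Initech,WA,110000,d4
"""

# Columns assumed to be needed for the analysis; everything else is dropped.
REQUIRED = ("job_title", "company", "state", "salary")


def clean_rows(raw_text):
    """Drop unneeded columns, rows with missing values, and exact duplicates."""
    seen = set()
    cleaned = []
    for row in csv.DictReader(io.StringIO(raw_text)):
        # Keep only the required columns, trimming stray whitespace.
        record = tuple((row.get(col) or "").strip() for col in REQUIRED)
        if any(value == "" for value in record):
            continue  # skip records with empty or missing values
        if record in seen:
            continue  # skip duplicate records
        seen.add(record)
        cleaned.append(dict(zip(REQUIRED, record)))
    return cleaned


rows = clean_rows(RAW_CSV)
print(len(rows), "rows kept after cleaning")
```

Scripting the rules makes the cleaning repeatable when the dataset is refreshed, while manual cleaning in a spreadsheet is often quicker for a one-time pass over a small file.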

Step 4: Analyzing the data

In this step, the cleaned data is imported into a data visualization tool such as Tableau or Power BI to build visualizations. Visualization techniques help uncover hidden patterns and relationships in the data. The analysis draws on techniques such as descriptive and predictive analytics. Descriptive analytics describes, displays, or summarizes data points so that patterns in the existing data can emerge. Predictive analytics looks for patterns in historical data and uses them to estimate future outcomes, so that the user can understand which conditions drive those patterns.
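
To make the distinction between the two techniques concrete, the sketch below summarizes salaries by state (descriptive) and then fits a simple least-squares line to estimate salary from experience (predictive). It is an illustration under stated assumptions, not the analysis performed in Tableau for this project. The records, the years-of-experience field, and the helper names summarize_by_state and fit_line are all hypothetical.

```python
from collections import defaultdict
from statistics import mean, median

# Hypothetical cleaned records: (state, years_experience, salary_usd).
# These values are invented for illustration only.
POSTINGS = [
    ("CA", 1, 90000),
    ("CA", 3, 110000),
    ("CA", 5, 130000),
    ("NY", 2, 95000),
    ("NY", 4, 115000),
]


def summarize_by_state(postings):
    """Descriptive analytics: count, mean, and median salary per state."""
    groups = defaultdict(list)
    for state, _years, salary in postings:
        groups[state].append(salary)
    return {
        state: {"count": len(vals), "mean": mean(vals), "median": median(vals)}
        for state, vals in groups.items()
    }


def fit_line(xs, ys):
    """Predictive analytics: ordinary least-squares fit of y = slope * x + intercept."""
    x_bar, y_bar = mean(xs), mean(ys)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    slope = sxy / sxx
    return slope, y_bar - slope * x_bar


summary = summarize_by_state(POSTINGS)
for state, stats in summary.items():
    print(state, stats)

years = [y for _state, y, _salary in POSTINGS]
salaries = [s for _state, _y, s in POSTINGS]
slope, intercept = fit_line(years, salaries)

# Estimate the salary for a hypothetical posting asking for 6 years of experience.
predicted = slope * 6 + intercept
print("Estimated salary at 6 years:", round(predicted))
```

The descriptive summary only restates what is already in the data, whereas the fitted line extrapolates beyond it, so its estimate is only as reliable as the assumed linear relationship.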


Step 5: Interpret the results and apply them

Interpreting the results is the final step in the data analysis process. It is essential because it turns the findings from the previous step into insights that can be applied to real-world business problems. Data visualization tells a story through the data; understanding that story and applying it to a business problem is the purpose of this step.

References:

Hillier, W. (2023). A step-by-step guide to the data analysis process [2024]. CareerFoundry. https://careerfoundry.com/en/blog/data-analytics/the-data-analysis-process-step-by-step/

Luna, J. C. (2022). How to analyze data for your business in 5 steps. DataCamp. https://www.datacamp.com/blog/how-to-analyze-data-for-business
