0% found this document useful (0 votes)

309 views14 pages

Data Analysis & Exploratory Data Analysis (EDA)

This document discusses data analysis and exploratory data analysis (EDA). It defines data analysis as using statistics and probability to identify trends in data sets and distinguish real trends from noise. The document outlines some common techniques used in data analysis, including general linear models, generalized linear models, and structural equation modeling. It emphasizes that the correct technique must be used to avoid faulty conclusions. The document also discusses exploratory data analysis and its purpose of gaining initial insights into data.

Uploaded by

John Luis Masangkay Bantolino

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

309 views14 pages

Data Analysis & Exploratory Data Analysis (EDA)

Uploaded by

John Luis Masangkay Bantolino

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 14

Data Analysis & Exploratory Data Analysis (EDA)

Share on






Contents (Click to skip to that section):

1. Data Analysis
 Definition
 Techniques
 The Two Tools
 Variation
 The Three Rules
 Issues with Data Analysis
2. Exploratory Data Analysis
 Definition
 Purpose
 Types

How is wealth distributed in the United States? Which drugs work to cure cancer? Which stocks should I invest in?
All of these questions can be answered with data analysis.

Data Analysis Definition

Data Analysis is basically where you use statistics and probability to figure out trends in data set. It helps you to sort
out the “real” trends from the statistical noise. What is “noise”? A large amount of data that doesn’t seem to mean
anything at all (sometimes it can be impossible to see the trees because of the forest!). If you’ve ever tried to make
sense of the figures and numbers in a copy of the Wall Street Journal, you’ll know what “noise” means.

Data analysis is about picking out trends from sets of data.

Back to Top
Techniques.
The type of data analysis you use depends on what kind of study you’re doing. For example, you would use a
different technique for data gathered from interviews than you would for an analysis of stock market trends. Some
techniques you might use are:

 General linear model: Useful for assessing how several variables affect continuous variables.
Example: ANOVA tests.
 Generalized linear model: Used for discrete variables. Example: Linear Regression (What is Linear
Regression?).
 Structural equation modelling: Used for abstract variables like “Soap preference,” “Intelligence,” or
“Future goals.” SEM helps you to figure out if you have a valid model for your data.
 Item response theory: A way to analyze results from tests, exams, and questionnaires.
It’s vital you use the right technique; Using the wrong one can lead to faulty claims about your data. There are
dozens of examples of faulty claims about data on the internet. Perhaps two of the most famous are the Cold
Fusion debacle and the now infamous data on women’s poor prospects of getting married over age 30.
Back to Top

The Two Tools of Data analysis.

The two main tools that make up data analysis are lines and tables. For example, you might create a line graph with
a linear regression equation.

A high-leverage outlier. The point has moved the graph more because it is outside the range.

Or you could make a frequency distribution table to display data.

A frequency chart.

Variation
If life were simple, we could make a chart or a graph for every situation. But in real life, things are never as simple
as they appear. Take a two-pound bag of sugar. Does it really weight two pounds? Measure a hundred bags of sugar
and you’ll likely find a hundred different weights, from 5.0 pounds to 5.1 pounds and everything in between. That’s
what we call variance, and variance is one of the reasons we have to use probability distributions to evaluate data.
Back to Top

The Three Rules of Data analysis.

Using three basic rules of thumb can help you avoid incorrectly making claims about your data:

1. Look at your data and think about what it is you want to know. Do you want to prove that the
Earth is round? Or do you want to prove that the Earth has a circumference? Framing this question is
what we call stating the hypothesis.
2. Estimate a Central Tendency for your Data. Examples of measures of central tendency are
the mean and median. Which one you use will depend on your hypothesis in Step 1. For example, if
you wanted to prove the Earth was round, you might choose to look at the average volume, or the
average circumference.
3. Consider the exceptions to the central tendency. If you’ve measured the average, look at the figures
that are not average. If you’ve measured a median, look at the figures that don’t meet that expectation.
Exceptions can help you spot problems with your conclusion. A simple example: your child’s average
score in school is 70. Not bad, right? But if you look at the exceptions, you might find they are getting
100 in three classes (great!) and 40 in three other classes (uh oh). In this case, the average is
completely misleading.
Back to Top

Issues with Data Analysis.

Why do so many cases of data analysis end with faulty claims? One of the main reasons is that analyzing data is a
complicated and tedious process. It’s never as easy as plugging numbers into a computer. Some issues that can lead
to faulty data analysis include:

1. Not having the right analysis skills.

2. Using the wrong tools to analyze data. For example, using a z score when your data doesn’t have
a normal distribution.
3. Letting bias influence your results.
4. Not figuring out statistical significance.
5. Incorrectly stating the null hypothesis and alternate hypothesis.
6. Using misleading graphs and charts.
Unintentional reporting of bad results is usually the result of a lack of proper training. More than one study
(including this one) found that physicians were very poorly trained in the proper management of clinical trials.
Physicians were also very poorly trained in reading statistics from good data obtained from valid setups! (See: Even
Physicians Don’t Understand Statistics). Why would highly educated people have so much trouble interpreting data
analysis? Take a very simple example: A Word Count.
Example problem: You’re given an e-book of Shakespeare’s Romeo and Juliet. Your task is to find out how many
times the Word “Love” appears in it. Easy, right? You run it through a word count in a word processor and you
report that it’s found 126 times.
Oops. The word “love” is only found 94 times. Why is the word count so wrong? You failed to take into account all
of the other words that contain the letters “love”:

 Loves (2).
 Loved (3).
 Loving (6).
 Love’s (12).
 Lover (4).
 Lover’s (3).
 Lovest (2).
Now imagine if you were analyzing a text on the results from blood analysis to see if a particular cancer drug
worked or not. Perhaps you were looking for a specific chemical to see if it showed up more frequently than another.
Typing in just part of the chemical name could lead you to a (possibly harmful) conclusion.
Back to Top

Introduction to Statistical Data Analysis

 Neelam Tyagi
 Oct 29, 2020
 Statistics
“The number of people who think they understand statistics dangerously
dwarfs those who actually do, and maths can cause fundamental problems
when badly used.”― Rory Sutherland

In the information era, data is no protracted scarce, on the other hand, it is
irresistible. From delving into the overpowering quantity of data to precisely
interpret its complexity in order to provide insights for intense progress to
organizations and businesses, all sorts of data and information is exploited at
their entirety and this is where statistical data analysis has a significant part.

“Statistics is the specific branch of science from where the professionalists
bring distinct conclusion/interference under the same data”

Moving discussion a step further, we shall discuss the comprehensive notion
concerning statistical data analysis and its types. Further, four basic steps
required for completion of statistical data analysis will be explained.

What is Statistical Data Analysis?

Being a branch of science, Statistics incorporates data acquisition, data
interpretation, and data validation, and statistical data analysis is the
approach of conducting various statistical operations, i.e. thorough
quantitative research that attempts to quantify data and employs some sorts
of statistical analysis. Here, quantitative data typically includes descriptive
data like survey data and observational data.

In the context of business applications, it is a very crucial technique for
business intelligence organizations that need to operate with large data
volumes. The basic goal of statistical data analysis is to identify trends, for
example, in the retailing business, this method can be approached to uncover
patterns in unstructured and semi-structured consumer data that can be used
for making more powerful decisions for enhancing customer experience and
progressing sales.

Apart from that, statistical data analysis has various applications in the field
of statistical analysis of market research, business intelligence(BI), data
analytics in big data, machine learning and deep learning, and financial and
economical analysis. (Recommend blog: Top Business Intelligence Tools and
Techniques in 2020)

In addition to that, the significance of data under statistical data analysis,

1. Data comprises variables which are univariate or multivariate, and
extremely relying on the number of variables, the experts execute several
statistical techniques. If the data has a singular variable then univariate
statistical data analysis can be conducted including t-test for
significance, z test, f test, ANOVA one way, etc. And if the data has many
variables then different multivariate techniques can be performed such as
statistical data analysis, or discriminant statistical data analysis, etc.
(Related blog: An Introduction to Probability Distribution)
2. Data is of two types, continuous data and discrete data. The continuous
data cannot be counted and changes over time, e.g the intensity of light,
the temperature of a room, etc. The discrete data can be counted and has
a certain number of values, e.g. the number of bulbs, the number of
people in a group, etc.
3. Under statistical data analysis, the continuous data is distributed under
continuous distribution function, also known as the probability density
function. And the discrete data is distributed under a discrete distribution
function, also termed as the probability mass function.
4. Data can either be quantitative or qualitative. Qualitative data are labels
or names that are implemented to find a characteristic of each element,
whereas quantitative data are always in the form of numbers that intimate
either how much or how many. (More to read: Steps for qualitative data
analysis)
5. Under statistical data analysis, cross-sectional and time-series data are
important. For a definition, cross-sectional data are the data accumulated
at the same time or relatively the same point in time, whereas, time-
series data are the data gathered across certain time periods.

Statistical data analysis can be adopted in;

 Existing essential findings/conclusions unveiled through a dataset.
 Abstract and compile information.
 Compute measures of cohesiveness, relevance, or diversity in data.
 Originate forthcoming prophecies on the basis of earlier reported data.
 Test experimental forecasts.

Statistical Data Analysis Tools

Generally, under statistical data analysis, some form of statistical analysis
tools are practised that a layman can’t do without having statistical
knowledge. Various software programs are available to perform statistical
data analysis, these software include Statistical Analysis System(SAS),
Statistical Package for Social Science (SPSS), Stat soft and many more.

“Machine learning, in the simplest terms, is the analysis of statistics to help
computers make decisions based on repeatable characteristics found in the
data.”― Vardhan Kishore Agrawal

These tools allow extensive data-handling capabilities and several statistical
analysis methods that could examine a small chunk to very comprehensive
data statistics. Though computers serve as an important factor in statistical
data analysis that can assist in the summarization of data, statistical data
analysis concentrates on the interpretation of the result in order to drive
inferences and prophecies.

What are the Types of Statistical Data Analysis?

There are two important components of a statistical study, that are:

 Population - an assemblage of all elements of interest in a study, and
 Sample - a subset of the population.

And, there are two categories of widely used statistical methods under
statistical data analysis techniques;

1. Descriptive Statistics
It is a form of data analysis that is basically used to describe, show or
summarize data from a sample in a meaningful way. For example, mean,
median, standard deviation and variance. In other words, descriptive
statistics attempts to illustrate the relationship between variables in a
sample or population and gives a summary in the form of mean, median
and mode.

2. Inferential Statistics
This method is used for making conclusions from the data sample by
using the null and alternative hypotheses that are subjected to random
variation. Also, probability distribution, correlation testing and regression
analysis fall into this category. In simple words, inferential statistics
employs a random sample of data, taken from a population, to make and
explain inferences about the whole population. (Most related: What is p-
value in statistics?)

The table below shows the factual differences between descriptive statistics
and inferential statistics;

S.N
Descriptive Statistics Inferential Statistics
o

Make inferences from the

Related with specifying the target sample and make them
1
population. generalize also according to
the population.

Arrange, analyze and reflect the Correlate, test and anticipate

2
data in a meaningful mode. future outcomes.

Concluding outcomes are

Final outcomes are the
3 represented in the form of charts,
probability scores.
tables and graphs.

Attempts in making
Explains the earlier acknowledged conclusions regarding the
4
data. population which is beyond
the data available.
Deployed tools-Measure of central
Deployed tools- Hypothesis
tendency (mean, median, mode),
5 testing, Analysis of variance,
Spread of data (Range, standard
etc.
deviation, etc.)

Difference between Descriptive Statistics and Inferential Statistics

4 Basics Steps for Statistical Data Analysis

In order to analyze any problem with the use of statistical data analysis
comprises four basic steps;

1. Defining the problem

The precise and actuarial definition of the problem is imperative for achieving
accurate data concerning it. It becomes extremely difficult to collect data
without knowing the exact definition/address of the problem.

2. Accumulating the data

After addressing the specific problem, designing multiple ways in order to
accumulate data is an important task under statistical data analysis. Data can
be collected from the actual sources or can be obtained by observation and
experimental research studies, conducted to get new data.

 In an experimental study, the important variable is identified according
to the defined problem, then one or more elements in the study are
controlled for getting data regarding how these elements affect other
variables.
 In an observational study, no trial is executed for controlling or
impacting the important variable. For example, a conducted surrey is the
examples or a common type of observational study.

3. Analyzing the data

Under statistical data analysis, the analyzing methods are divided into two
categories;

 Exploratory methods, this method is deployed for determining what the
data is revealing by using simple arithmetic and easy-drawing
graphs/description in order to summarize data.
 Confirmatory methods, this method adopts concept and ideas from
probability theory for trying to answer particular problems.

Probability is extremely imperative in decision-making as it gives a procedure
for estimating, representing, and explaining the possibilities associated with
forthcoming events.

4. Reporting the outcomes

By inferences, an estimate or test that claims to be the characteristics of a
population can be derived from a sample, these results could be reported in
the form of a table, a graph or a set of percentages. Since only a small portion
of data has been investigated, therefore the reported result can depict some
uncertainties by implementing probability statements and intervals of values.

With the help of statistical data analysis, experts could forecast and anticipate
future aspects from data. By understanding the information available and
utilizing it effectively may lead to adequate decision-making. (Source)

Conclusion

The statistical data analysis furnishes sense to the meaningless numbers and
thereby giving life to lifeless data. Therefore, it is imperative for a researcher
to have adequate knowledge about statistics and statistical methods to
perform any research study. This will assist in conducting an appropriate and
well-designed study preeminently to accurate and reliable results. Also, results
and inferences are explicit only and only if proper statistical tests are
practised.

“Regression analysis is the hydrogen bomb of the statistics
arsenal.”― Charles Wheelan

While concluding the blog, we can say that statistical data analysis is nothing
but the compilation and interpretation of data in order to reveal hidden
patterns and trends. It can be adopted in dealing with situations like
accumulating research analyses, statistical modelling or sketching surveys
and studies

Introduction to Statistics: An Intuitive Guide for Analyzing Data and Unlocking Discoveries
From Everand
Introduction to Statistics: An Intuitive Guide for Analyzing Data and Unlocking Discoveries
Jim Frost
5/5 (1)
Statistics 101: From Data Analysis and Predictive Modeling to Measuring Distribution and Determining Probability, Your Essential Guide to Statistics
From Everand
Statistics 101: From Data Analysis and Predictive Modeling to Measuring Distribution and Determining Probability, Your Essential Guide to Statistics
David Borman
4.5/5 (13)
Painless Statistics
From Everand
Painless Statistics
Barron's Educational Series
No ratings yet
Medical Statistics Made Easy, fourth edition
From Everand
Medical Statistics Made Easy, fourth edition
Michael Harris
4.5/5 (2)
Hypothesis Testing: An Intuitive Guide for Making Data Driven Decisions
From Everand
Hypothesis Testing: An Intuitive Guide for Making Data Driven Decisions
Jim Frost
No ratings yet
Evaluation Tools
100% (1)
Evaluation Tools
11 pages
Thinking Analytically: A Guide for Making Data-Driven Decisions
From Everand
Thinking Analytically: A Guide for Making Data-Driven Decisions
Jim Frost
No ratings yet
Data Analytics
From Everand
Data Analytics
Jeffery Short
1/5 (1)
Presentation On Data Analysis: Submitted by
No ratings yet
Presentation On Data Analysis: Submitted by
38 pages
Surviving Statistics: A Professor's Guide to Getting Through
From Everand
Surviving Statistics: A Professor's Guide to Getting Through
Luther Maddy
No ratings yet
Tips and Tricks For Analyzing Non-Normal Data
No ratings yet
Tips and Tricks For Analyzing Non-Normal Data
3 pages
Quantitative Data Analysis Guide
No ratings yet
Quantitative Data Analysis Guide
6 pages
"Data Analysis" Basic Concepts and Applications
From Everand
"Data Analysis" Basic Concepts and Applications
Sukanta Bhattacharya
No ratings yet
Module Data Analysis
No ratings yet
Module Data Analysis
6 pages
Unit 5 Exploratory Data Analysis (EDA)
100% (1)
Unit 5 Exploratory Data Analysis (EDA)
41 pages
Lecture 1
No ratings yet
Lecture 1
39 pages
Microsoft Excel Statistical and Advanced Functions for Decision Making
From Everand
Microsoft Excel Statistical and Advanced Functions for Decision Making
Palani Murugappan
5/5 (2)
Quantitative Data Analysis Guide
100% (1)
Quantitative Data Analysis Guide
6 pages
Lecture 1 - Introduction To Data Analysis
No ratings yet
Lecture 1 - Introduction To Data Analysis
37 pages
Associations and Correlations for Medical Research
From Everand
Associations and Correlations for Medical Research
Lee Baker
No ratings yet
Data Types: Getting Started With Statistics
From Everand
Data Types: Getting Started With Statistics
Lee Baker
No ratings yet
Exploratory Data Analysis
No ratings yet
Exploratory Data Analysis
173 pages
208 RM Lab File1 PDF
No ratings yet
208 RM Lab File1 PDF
31 pages
Data Analysis
No ratings yet
Data Analysis
17 pages
Typical Statistical Testing Procedures
No ratings yet
Typical Statistical Testing Procedures
29 pages
Essays On Data Analysis
100% (1)
Essays On Data Analysis
136 pages
E Data Analysis
No ratings yet
E Data Analysis
2 pages
What Is Data Visualization and Why Is It Important
No ratings yet
What Is Data Visualization and Why Is It Important
18 pages
Data Analysis
No ratings yet
Data Analysis
19 pages
Statistical Analysis in Excel by Golden MCpherson
No ratings yet
Statistical Analysis in Excel by Golden MCpherson
315 pages
Data science-Unit-3-Complete
No ratings yet
Data science-Unit-3-Complete
33 pages
Chapter 10 Data Analysis-Quantitative
No ratings yet
Chapter 10 Data Analysis-Quantitative
93 pages
Estadístic A Descriptiv A: Dr. Lázaro Bustio Martínez Otoño 2023
No ratings yet
Estadístic A Descriptiv A: Dr. Lázaro Bustio Martínez Otoño 2023
42 pages
Eda Reviewer
No ratings yet
Eda Reviewer
2 pages
Data Science Presentation
100% (3)
Data Science Presentation
113 pages
Data Analysis
No ratings yet
Data Analysis
8 pages
The Art of Data Analysis: January 2015
No ratings yet
The Art of Data Analysis: January 2015
8 pages
935-Module 03 PPT
No ratings yet
935-Module 03 PPT
12 pages
Data Analysis and Interpretation: Major Points For Discussions
No ratings yet
Data Analysis and Interpretation: Major Points For Discussions
39 pages
Jomeri 030
No ratings yet
Jomeri 030
8 pages
Practical Research Week 1
No ratings yet
Practical Research Week 1
1 page
Lesson 3 Notes
No ratings yet
Lesson 3 Notes
53 pages
Machine Learning - A Complete Exploration of Highly Advanced Machine Learning Concepts, Best Practices and Techniques: 4
From Everand
Machine Learning - A Complete Exploration of Highly Advanced Machine Learning Concepts, Best Practices and Techniques: 4
Peter Bradley
No ratings yet
Lesson One Introduction To Inferential Statistics
No ratings yet
Lesson One Introduction To Inferential Statistics
20 pages
PR2 Modular M
100% (1)
PR2 Modular M
5 pages
Chapter 7
No ratings yet
Chapter 7
39 pages
Module 5 Research Methodology
No ratings yet
Module 5 Research Methodology
9 pages
Islamabad Semester Terminal Exam Autumn 2020 Name Zeenat Bibi Roll Number By479775 Program Bs English Course Name Introduction To Statistics
100% (1)
Islamabad Semester Terminal Exam Autumn 2020 Name Zeenat Bibi Roll Number By479775 Program Bs English Course Name Introduction To Statistics
23 pages
Unit V Statistical Data Analysis
No ratings yet
Unit V Statistical Data Analysis
72 pages
Lectura The Art of Data Science
No ratings yet
Lectura The Art of Data Science
22 pages
Unit .......
No ratings yet
Unit .......
45 pages
CLC - Data Cleansing and Data Summary
No ratings yet
CLC - Data Cleansing and Data Summary
17 pages
Unit 3
No ratings yet
Unit 3
42 pages
(Excerpts From) Investigating Performance: Design and Outcomes With Xapi
From Everand
(Excerpts From) Investigating Performance: Design and Outcomes With Xapi
Janet Laane Effron
No ratings yet
Data Analytics
100% (1)
Data Analytics
98 pages
Statistics Important Points: Properties of Normal Distribution
No ratings yet
Statistics Important Points: Properties of Normal Distribution
2 pages
Statistics in Research Analysis
No ratings yet
Statistics in Research Analysis
12 pages
Key Statistical Ideas For Research Students v2
No ratings yet
Key Statistical Ideas For Research Students v2
4 pages
Additional Mathematics Project Work Form 5 2014
No ratings yet
Additional Mathematics Project Work Form 5 2014
33 pages
Chapter Five:: Analyses and Interpretation of Data
No ratings yet
Chapter Five:: Analyses and Interpretation of Data
64 pages
AL - I (Unit - I)
No ratings yet
AL - I (Unit - I)
19 pages
Oral Presentation Rubrics
No ratings yet
Oral Presentation Rubrics
1 page
Letter of Invitation PE
No ratings yet
Letter of Invitation PE
1 page
Quantitative Methods
No ratings yet
Quantitative Methods
5 pages
Judge 2023
No ratings yet
Judge 2023
5 pages
Mathematics of Investment 4
No ratings yet
Mathematics of Investment 4
14 pages
Mathematics of Investment 3
100% (1)
Mathematics of Investment 3
25 pages
Mathematics of Investment 1
No ratings yet
Mathematics of Investment 1
49 pages
Mathematics of Investment 2
No ratings yet
Mathematics of Investment 2
39 pages
Mathematics-of-Investment-midterm Lessons
No ratings yet
Mathematics-of-Investment-midterm Lessons
50 pages
Sample Invitation
No ratings yet
Sample Invitation
2 pages
FIVB VB Scoresheet 2013 Updated2
No ratings yet
FIVB VB Scoresheet 2013 Updated2
4 pages
Dancesport 1
No ratings yet
Dancesport 1
2 pages
Purpose Purpose Scoring Scoring Portfolio Type Portfolio Type What To Include? What To Include?
No ratings yet
Purpose Purpose Scoring Scoring Portfolio Type Portfolio Type What To Include? What To Include?
1 page
FYUGP Sem1-24
No ratings yet
FYUGP Sem1-24
2 pages
3 - The Linguistics Turn in Cultural Studies
No ratings yet
3 - The Linguistics Turn in Cultural Studies
24 pages
Historical Foundation
No ratings yet
Historical Foundation
13 pages
Placement Metrics Ece 2026
No ratings yet
Placement Metrics Ece 2026
5 pages
Presentation On Gene Analysis Using Cloud Computing
No ratings yet
Presentation On Gene Analysis Using Cloud Computing
9 pages
Relatedness Need Satisfaction, Intrinsic Motivation, and Engagement in Secondary School Physical Education
No ratings yet
Relatedness Need Satisfaction, Intrinsic Motivation, and Engagement in Secondary School Physical Education
14 pages
Verbal and Non Verbal Communication at Work
No ratings yet
Verbal and Non Verbal Communication at Work
30 pages
Mech-Nd-2022-Ge 8071-Disaster Management-90328639-Nd22sh
No ratings yet
Mech-Nd-2022-Ge 8071-Disaster Management-90328639-Nd22sh
3 pages
Henry Widdowson
No ratings yet
Henry Widdowson
3 pages
50 Years Later: A Conversation About The Biological Study of Language With Noam Chomsky
No ratings yet
50 Years Later: A Conversation About The Biological Study of Language With Noam Chomsky
13 pages
LESSON 5 Flexibility in Learning
No ratings yet
LESSON 5 Flexibility in Learning
15 pages
Senior High School Department: Dr. V. Locsin Street, City of Dumaguete 6200
No ratings yet
Senior High School Department: Dr. V. Locsin Street, City of Dumaguete 6200
21 pages
Abaarso Tech University Thesis Presentation
No ratings yet
Abaarso Tech University Thesis Presentation
24 pages
Case Study - The Goldfish (GROUP 7)
No ratings yet
Case Study - The Goldfish (GROUP 7)
7 pages
Unit 4
No ratings yet
Unit 4
17 pages
Form 09 Gender Assessment and Action Plan Template - 0
No ratings yet
Form 09 Gender Assessment and Action Plan Template - 0
6 pages
Anxiety 2312.15272
No ratings yet
Anxiety 2312.15272
8 pages
Notes 1
No ratings yet
Notes 1
537 pages
How To Prepare and Make Submission An Article in XYZ Journal
No ratings yet
How To Prepare and Make Submission An Article in XYZ Journal
13 pages
Memories On The Move: Migration, Diasporas and Citizenship
No ratings yet
Memories On The Move: Migration, Diasporas and Citizenship
301 pages
Sociological Understanding of Crime Manish Kumar
No ratings yet
Sociological Understanding of Crime Manish Kumar
10 pages
MSc-Process-Engineering ETH Zurich
No ratings yet
MSc-Process-Engineering ETH Zurich
9 pages
Vygotskyarticle
No ratings yet
Vygotskyarticle
7 pages
University Admission Prediction
No ratings yet
University Admission Prediction
18 pages
DLL Cot Cesc
100% (2)
DLL Cot Cesc
4 pages
Personal Mastery and Mental Model
No ratings yet
Personal Mastery and Mental Model
14 pages
GBS 550 Module Session 1 2019
No ratings yet
GBS 550 Module Session 1 2019
8 pages
January February March April May: Tasks
No ratings yet
January February March April May: Tasks
4 pages
Blue Prent Test For Tenth Grade
No ratings yet
Blue Prent Test For Tenth Grade
2 pages

Data Analysis & Exploratory Data Analysis (EDA)

Uploaded by

Data Analysis & Exploratory Data Analysis (EDA)

Uploaded by

Data Analysis & Exploratory Data Analysis (EDA)

Contents (Click to skip to that section):

Data Analysis Definition

Data analysis is about picking out trends from sets of data.

The Two Tools of Data analysis.

Or you could make a frequency distribution table to display data.

The Three Rules of Data analysis.

Issues with Data Analysis.

1. Not having the right analysis skills.

Introduction to Statistical Data Analysis

Make inferences from the

Arrange, analyze and reflect the Correlate, test and anticipate

Concluding outcomes are

Difference between Descriptive Statistics and Inferential Statistics

You might also like