

Data Analysis


 Data analysis is the process of systematically applying statistical and/or logical techniques
to describe, illustrate, and evaluate data.

 A simple example of data analysis can be seen whenever we make a decision in our daily lives by
evaluating what has happened in the past or what will
happen if we make that decision. Basically, this is the
process of analyzing the past or future and making a
decision based on that analysis.
Types of Data Analysis

1. Descriptive
2. Inferential
Descriptive Data Analysis

 It involves summarizing and describing the main features of a dataset. It focuses on organizing and
presenting the data in a meaningful way, often using
measures such as mean, median, mode, and standard deviation.
 It provides an overview of the data and helps identify
patterns or trends.
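
As a minimal sketch of the measures named above, the following Python snippet uses only the standard library. The scores are made-up values chosen for illustration, not data from this presentation.

```python
import statistics

# Hypothetical sample: exam scores for nine students (made-up values).
scores = [70, 85, 85, 90, 60, 75, 85, 95, 75]

mean = statistics.mean(scores)      # sum of the values divided by their count
median = statistics.median(scores)  # middle value of the sorted data
mode = statistics.mode(scores)      # most frequent value
std_dev = statistics.stdev(scores)  # sample standard deviation (n - 1 denominator)

print(f"mean={mean}, median={median}, mode={mode}, std_dev={std_dev:.2f}")
```

The mean, median, and mode summarize where the data are centered, while the standard deviation summarizes how spread out they are.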
Inferential Data Analysis

 It aims to make inferences or predictions about a larger population based on sample data. It involves
applying statistical techniques such as hypothesis
testing, confidence intervals, and regression analysis.
 It helps generalize findings from a sample to a larger population.
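
One way to picture generalizing from a sample is a confidence interval for the population mean. The sketch below is illustrative only: the measurements are made up, and it assumes a normal approximation, which is a simplification for a sample this small.

```python
import math
import statistics

# Hypothetical sample of 10 measurements (made-up values).
sample = [5.1, 4.9, 5.3, 5.0, 5.2, 4.8, 5.1, 5.0, 4.9, 5.2]

n = len(sample)
sample_mean = statistics.mean(sample)
std_error = statistics.stdev(sample) / math.sqrt(n)

# Assumption: normal approximation. For small samples a t critical value
# would be more appropriate than the z value used here.
z = statistics.NormalDist().inv_cdf(0.975)  # about 1.96 for a 95% interval

ci_low = sample_mean - z * std_error
ci_high = sample_mean + z * std_error
print(f"95% CI for the population mean: ({ci_low:.3f}, {ci_high:.3f})")
```

The interval gives a range of plausible values for the population mean, based only on the sample.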
Data Analysis Methods

 Qualitative Data Analysis

 Quantitative Data Analysis
Qualitative Data Analysis

The most common qualitative methods include:

1. Content Analysis, for analyzing behavioral and verbal data.
2. Narrative Analysis, for working with data culled from interviews,
diaries, and surveys.
Quantitative Data Analysis

 Hypothesis Testing, for assessing the truth of a given hypothesis or theory for a data set or demographic.
 Mean, or average, for determining a subject's overall trend by dividing
the sum of a list of numbers by the number of items on the list.
Importance of Data Analysis in Research
1. Plays a key role in distilling information into a more accurate
and relevant form, making it easier for researchers to do their work.
2. Provides researchers with a vast selection of different tools, such
as descriptive statistics, inferential analysis, and quantitative analysis.
3. Offers researchers better data and better ways to analyze and
study that data.
Data Analysis Tools

 Microsoft Excel
 SPSS (Statistical Package for the Social Sciences)
 Python
 RapidMiner
Statistical Analysis of Differences

 Statistical analysis of differences involves assessing whether there are significant variations or disparities
between groups, conditions, or treatments.
 Depending on the nature of your data and research
questions, there are various statistical methods to
analyze differences:

 T Test
 ANOVA (Analysis of Variance)
 Chi-square Test
 Parametric Tests
 Non-parametric Tests

ANOVA

 ANOVA (analysis of variance) is a statistical test used to analyze the difference between the
means of more than two groups.
 A one-way ANOVA uses one independent variable, while a two-way
ANOVA uses two independent variables.
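
A rough sketch of how a one-way ANOVA compares group means is shown below. The three groups and their values are hypothetical, and the code computes only the F statistic by hand; it does not compute a p-value.

```python
import statistics

# Hypothetical scores for three independent groups (made-up values).
groups = [
    [4, 5, 6],
    [6, 7, 8],
    [8, 9, 10],
]

all_values = [x for g in groups for x in g]
grand_mean = statistics.mean(all_values)
k = len(groups)            # number of groups
n_total = len(all_values)  # total number of observations

# Between-group sum of squares: how far each group mean is from the grand mean.
ss_between = sum(len(g) * (statistics.mean(g) - grand_mean) ** 2 for g in groups)
# Within-group sum of squares: spread of values around their own group mean.
ss_within = sum((x - statistics.mean(g)) ** 2 for g in groups for x in g)

ms_between = ss_between / (k - 1)
ms_within = ss_within / (n_total - k)
f_statistic = ms_between / ms_within
print(f"F({k - 1}, {n_total - k}) = {f_statistic:.2f}")
```

A large F value means the group means differ by more than the variation within groups would suggest. In practice a statistics package (for example `scipy.stats.f_oneway`) would also report the p-value.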
Chi-square Test

 The chi-square test is used to compare observed results with expected results. The purpose
of this test is to determine if a difference between observed data
and expected data is due to chance, or if it is due to a
relationship between the variables you are studying.
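
The comparison of observed and expected results can be sketched in a few lines of Python. The die-roll counts below are invented for illustration, and the snippet computes only the test statistic.

```python
# Hypothetical example: a die rolled 60 times (made-up counts per face).
observed = [8, 9, 12, 11, 10, 10]
# Under the "fair die" hypothesis, each face is expected 60 / 6 = 10 times.
expected = [10, 10, 10, 10, 10, 10]

# Chi-square statistic: sum of (observed - expected)^2 / expected.
chi_square = sum((o - e) ** 2 / e for o, e in zip(observed, expected))
degrees_of_freedom = len(observed) - 1
print(f"chi-square = {chi_square:.2f}, df = {degrees_of_freedom}")
```

The statistic is then compared against a critical value from a chi-square table (about 11.07 for 5 degrees of freedom at the 0.05 level). A statistic below that value is consistent with the differences being due to chance.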
Parametric Test versus Non-parametric Test
 If your data are normally distributed, parametric tests can usually
be used; if they are not normally distributed, non-parametric
tests are usually used.
 Parametric tests include the t-test, ANOVA, and Pearson correlation.
 Non-parametric tests include, for example, the Mann-Whitney U test.
T Test

 A t-test is a statistical hypothesis test used in data analysis to determine whether there is a significant
difference between the means of two groups.
 It is particularly useful when you want to compare the
means of two independent groups or when you want
to compare the mean of a sample to a known
population mean.
Types of T Test

 Independent Sample T Test

 Paired / Non independent T Test
Paired / Non independent T Test

 The paired samples t-test is used when you have two sets of
related or paired observations. It's commonly used to assess
whether there is a significant difference between the means of
two related groups.
 For example, you might use a paired samples t-test to compare
the blood pressure of patients before and after a treatment.
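
The blood pressure example can be sketched as follows. The readings are made-up values, and the code computes only the t statistic and degrees of freedom by hand.

```python
import math
import statistics

# Hypothetical systolic blood pressure for five patients (made-up values).
before = [150, 142, 160, 155, 148]
after = [144, 140, 152, 150, 145]

# The paired test works on the per-patient differences.
differences = [b - a for b, a in zip(before, after)]
n = len(differences)
mean_diff = statistics.mean(differences)
std_error = statistics.stdev(differences) / math.sqrt(n)

t_statistic = mean_diff / std_error
degrees_of_freedom = n - 1
print(f"t = {t_statistic:.2f}, df = {degrees_of_freedom}")
```

Because each patient serves as their own control, only the differences enter the calculation. A statistics package (for example `scipy.stats.ttest_rel`) would also report the p-value.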
Independent Sample T Test

 This type of t-test is used when you want to compare the means
of two independent groups.
 For example, you might use an independent samples t-test to
determine whether there is a significant difference in the average
test scores between two groups of students, one group receiving
a new teaching method and the other group receiving the
traditional teaching method.
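
A minimal sketch of the teaching-method example is given below. The scores are hypothetical, and the code assumes both groups share a common variance (the pooled, or Student's, form of the test).

```python
import math
import statistics

# Hypothetical test scores (made-up values).
new_method = [78, 85, 90, 82, 88]
traditional = [72, 80, 75, 78, 70]

n1, n2 = len(new_method), len(traditional)
mean1, mean2 = statistics.mean(new_method), statistics.mean(traditional)
var1, var2 = statistics.variance(new_method), statistics.variance(traditional)

# Assumption: both groups share a common variance (Student's pooled t-test).
pooled_var = ((n1 - 1) * var1 + (n2 - 1) * var2) / (n1 + n2 - 2)
std_error = math.sqrt(pooled_var * (1 / n1 + 1 / n2))

t_statistic = (mean1 - mean2) / std_error
degrees_of_freedom = n1 + n2 - 2
print(f"t = {t_statistic:.2f}, df = {degrees_of_freedom}")
```

Unlike the paired test, the two groups here contain different students, so the group means and variances are computed separately. A statistics package (for example `scipy.stats.ttest_ind`) would also report the p-value.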
Statistical Analysis of Relationship

 Statistical analysis of relationships focuses on examining and quantifying the associations or correlations between variables in
a dataset.
 Understanding the relationships between variables is crucial for
making informed decisions and drawing meaningful insights from data.

What is the association of cigarette smoking with lung cancer?

Does the risk of lung cancer increase with the number of cigarettes smoked?

 Correlation Analysis
 Regression Analysis
 Correlation and regression are statistical measurements that are
used to quantify the linear relationship between
two variables. Correlation measures the strength and direction of the
linear relationship, while regression describes how a dependent
variable changes with an independent variable.
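
The difference between the two measures can be illustrated with a short sketch. The paired observations are made-up values, and the formulas are the standard Pearson correlation and least-squares line, written out by hand.

```python
import math

# Hypothetical paired observations (made-up values).
x = [1, 2, 3, 4, 5]
y = [2, 4, 5, 4, 5]

n = len(x)
mean_x = sum(x) / n
mean_y = sum(y) / n

# Sums of squares and cross-products around the means.
s_xy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
s_xx = sum((xi - mean_x) ** 2 for xi in x)
s_yy = sum((yi - mean_y) ** 2 for yi in y)

# Pearson correlation coefficient: strength and direction of the linear relationship.
r = s_xy / math.sqrt(s_xx * s_yy)

# Least-squares regression line: y = intercept + slope * x.
slope = s_xy / s_xx
intercept = mean_y - slope * mean_x

print(f"r = {r:.3f}, y = {intercept:.2f} + {slope:.2f} * x")
```

The correlation coefficient is a single number between -1 and 1, while the regression line can be used to estimate y for a given x.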
