0% found this document useful (0 votes)

34 views40 pages

Review of Basic Stat

The document discusses different types of measurement scales and data analysis methods. It covers nominal, ordinal, interval, and ratio scales. It also discusses dependence methods that test relationships between dependent and independent variables, and interdependence methods that examine how variables are related among themselves without designated dependent/independent variables. Finally, it provides an overview of descriptive and inferential statistics, including frequency distributions, measures of central tendency and dispersion, the standard normal distribution, and t-tests.

Uploaded by

RB Niña

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

34 views40 pages

Review of Basic Stat

Uploaded by

RB Niña

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 40

REVIEW OF

BASIC
STATISTICS
MEASUREMENT SCALES/LEVELS

The Nominal Scale

•simply represents qualitative difference in the
variable measured
•can only tell us that difference exists without the
possibility of telling the direction or magnitude of
the difference
•e.g. Program in college, race, gender, occupation,
religion, etc.
MEASUREMENT SCALES/LEVELS

The Ordinal Scale

•the categories that make up an ordinal scale
form an ordered sequence
•can tell us the direction of the difference but
not the magnitude
•e.g. coffee cup sizes, socioeconomic class, T-
shirt sizes, food preferences
MEASUREMENT SCALES/LEVELS

The Interval Scale

•categories on an interval scale are organized sequentially, and all categories
are numerically measured
•we can determine the direction and the magnitude of a difference
•may have an arbitrary zero (convenient point of reference) but has no true zero
point
e.g. temperature in Fahrenheit, time in seconds
MEASUREMENT SCALES/LEVELS

The Ratio Scale

•consists of equal, ordered categories anchored by a zero point that is not
arbitrary but meaningful (representing absence of a variable
•allows us to determine the direction, the magnitude, and the ratio of the
difference
•e.g. reaction time, number of errors on a test, scores in a test, speed of
cars, weight loss, etc
CLASSIFICATION OF DATA ANALYTIC METHODS

Dependence Method
The dependence methods test for the presence of or absence of
relationship between two sets of variables – the dependent and
independent variables. Common dependence methods are t-test,
ANOVA, ANCOVA, regression analysis, chi-square test, MANOVA,
discriminant analysis and, logistic regression.
CLASSIFICATION OF DATA ANALYTIC METHODS

Interdependence methods
When data sets do exist for which it is impossible to conceptually
designate one set of variables as dependent and another set of variables
as independent. For these types of data sets the objectives are to
identify how and why the variables are related among themselves.
Common examples are correlation analysis, principal component
analysis, and factor analysis.
RELATIONSHIPS OF VARIABLES
Dependency

Independent
Variables
•Age
Hypertension
•Lifestyle
•BMI
•Family History
RELATIONSHIPS OF VARIABLES

Interdependency

•Age
Systolic Pressure
•Weight
Blood Sugar Level
•Cholesterol level
INTERPRETING STATISTICAL RESULT
Important Terms

The test statistic is a value computed from the sample data, and it
is used in making the decision about the rejection of the null
hypothesis.
The critical region (or rejection region) is the set of all values of
the test statistic that cause us to reject the null hypothesis. It is
decided by Critical Value.
The significance level (denoted by ) is the probability that the
test statistic will fall in the critical region when the null hypothesis
is actually true. Common choices for  are 0.05, 0.01, and 0.10.
INTERPRETING STATISTICAL
RESULT
The statement of the problem/hypothesis is the basis for
interpreting results.
The null hypothesis is either rejected or not to be rejected
Significant result is met when the null hypothesis is
rejected. Not significant when the null hypothesis is not
rejected.
INTERPRETING STATISTICAL
RESULT
Significance can mean any of the following:
• There is a relationship.
• There is an association between or among variables.
• There is an effect.
• The treatment is effective.
• A variable is dependent on the other variable/s.
• There is a difference/different effect.
INTERPRETING STATISTICAL RESULT

Question:
• When and how do you reject or fail to reject the
null hypothesis?
• When do we say that the result is Significant?
TRADITIONAL METHOD

 Reject H0 if the test statistic falls within the critical region.

 Fail to reject H0 if the test statistic does not fall within the critical
region.

Critical Critical
Value Value
P-VALUE METHOD

Reject H0 if P-value   (where  is the significance level, such

as 0.05).
Fail to reject H0 if P-value > .
DESCRIPTIVE STATISTICS

THE FREQUENCY DISTRIBUTION TABLE

(FDT)
An FDT is a statistical table showing the frequency or
number of observations contained in each of the defined
classes or categories.
TYPES OF FDT
Qualitative or Categorical FDT – an FDT where the data are group
according to some qualitative characteristics; data are grouped into
non numerical categories.
Quantitative FDT – an FDT where data are grouped according to
some numerical or quantitative characteristics.
DESCRIPTIVE STATISTICS
EXAMPLE QUALITATIVE FDT

Table 1. Distribution Respondents by Educational Level

Educational Level Frequency Percentage (%)
Highschool 30 20
College 75 50
MA/Ms 45 30
Total 150 100

Interpretation: Most of the respondents are college graduate which constitute

50% (75 out of 150) of the total respondent.
DESCRIPTIVE STATISTICS
EXAMPLE QUANTITATIVE FDT

Table 1. Frequency Distribution of the Age of the Respondents

Age Frequency Percentage (%)
16-19 134 66
20-25 64 32
26 above 4 2
Total 202 100

Interpretation: Majority of the respondents, about 134 out of 202 (66%), are 16-
19 years of age.
DESCRIPTIVE STATISTICS

Charts and Graphs

• Pie Chart
• Bar Chart
• Line Chart
• Histogram
• Scatter Diagram
DESCRIPTIVE STATISTICS

Distribution of Respondent s by Type of Behavior Distribution of Respondent s by Type of Behavior

120

100

23%
33% 80

100
23%
40
20% 70 70
60

0
Envious Optimistic Pessimistic Trusting Envious Optimistic Pessimistic Trusting
DESCRIPTIVE STATISTICS

SALES IN MILLION
25

20 19.7 20.1
18.2
17.5

15 14.5
13.8
12.8 12.5
11.3
10.2
10

0
2001 2002 2003 2004 2005 2006 2007 2008 2009 2010
DESCRIPTIVE STATISTICS

5.5

4.5

3.5

2.5

2
2 2.5 3 3.5 4 4.5 5
DESCRIPTIV
E STATISTICS
1. MEASURES OF
CENTRAL
TENDENCY
Measures of absolute dispersion are expressed in the units of
DESCRIPTIV the original observations.

E STATISTICS There are three main measures of absolute dispersion:

• The range
• Variance

1. MEASURES OF • Standard deviation

DISPERSION
Measures of relative dispersion are unit-less and are used
when one wishes to compare the scatter of one distribution with
another distribution.

Some measures of absolute dispersion:

• Coefficient of Variation
• Standard Score
THE STANDARD NORMAL DISTRIBUTION
The distribution of a normal random variable with mean zero and standard
deviation equal to 1 is called a standard normal distribution.

If X follows a normal distribution, then X can be transformed into a standard

normal random variable through the following transformation.
INFERENTIAL
STATISTICS (BASIC
TOOLS)
T- TEST

• T-test is a parametric test that is commonly used

to test difference between 2 group means. Means
may be from independent or dependent groups
• A dependence method, usually a univariate tests
and is most effective to use when the independent
variable is non-metric.
Example: testing the relationship between level of
job satisfaction and gender.
ONE-SAMPLE T-TEST

Used to test single population mean

Usually compare the mean to existing population mean or to the standard
norm
Example is comparing the performance in the medical board exam of a
certain school to the national result
T-TEST FOR INDEPENDENT SAMPLES

•
Also called the two sample t-test for independent samples
Assumptions maybe equal or unequal variances
It intends to test whether there is a significant difference
between the means of two unrelated groups
It is use to test the null hypothesis:
T-TEST FOR DEPENDENT SAMPLES

•
Also called the paired t-test
It intends to test whether there is a significant
difference between the means from the same
group.
Mostly used in comparing pre-test and post-
test results
It is use to test the null hypothesis:
ANOVA – ANALYSIS OF VARIANCE

It is an appropriate technique for estimating the parameters of a linear model, Y

= α + βx + ε, when the independent variables are nominal or categorical.
In practice, it is used to test significant differences among group means (more
than 2 groups)
Mostly use in experimental research, esp. when design of experiment is applied.
Example: Consider the case where a medical researcher is interested about the
effect of occupation on cholesterol level. The independent variable, occupation,
is nominal.
CORRELATION ANALYSIS

Correlation is a measure of the direction and strength of linear

relationship between two variables.
 Direction means positive or negative.
 Strength can be perfect, strong or high, moderate, low or zero or no
correlation.
Correlation between two variables does not prove X causes Y or Y
causes X.
SCATTER DIAGRAM
PEARSON CORRELATION COEFFICIENT R

Pearson Correlation coefficient is a numerical value that measures strength and

direction of linear relationship
Symbol: r
r can range from -1.0 to +1.0
Sign (+/-) indicates “direction”
Value indicates “strength”
Measures a “linear” relationship only
Significance of the Pearson r can be tested using t-test
PEARSON CORRELATION COEFFICIENT R

Illustration:
-1 0 1
Perfect Negative No/Zero Perfect
Correlation Correlation Positive
Correlation

 Closer to 0 = weaker
 Closer to 1.0 = stronger
 r close to 1.0 perfect
 r  0 could mean many things:
No correlation at all between X & Y
Non-linear relationship between X & Y
Restricted range on X and/or Y
Outlier may be causing problems
ACTIVITY: INTERPRET THE FOLLOWING R
COEFFICIENT
1) r = 0.85
2) r = -0.69
3) r = -0.37
4) r = -0.11
5) r = 0.09
6) r = 0.32
7) r = -0.92
8) r = 0.75
ACTIVITY: INTERPRET THE FOLLOWING R
COEFFICIENT
1) r = 0.85 Ans.: Very Strong Positive
2) r = -0.69 Ans.: Moderate/Strong Negative
3) r = -0.37 Ans.: Weak Negative
4) r = -0.11 Ans.: No/Very weak
5) r = 0.09 Ans.: No/Very weak
6) r = 0.29 Ans.: Weak Positive
7) r = -0.92 Ans.: Very Strong Negative
8) r = 0.75 Ans.: Strong Positive
INTERPRETING R (Evans, 1996)
r Verbal Interpretation
-1 Perfect Negative Correlation
-0.8 to -0.99 Very Strong Negative Correlation

-0.6 to -0.79 Strong Negative Correlation

-0.4 to -0.59 Moderate Negative Correlation
-0.2 to -0.39 Weak Negative Correlation
-0.01 to -0.19 Very Weak Negative Correlation
0 No Correlation
0.01 to 0.19 Very Weak Positive Correlation
0.2 to 0.39 Weak Positive Correlation
0.4 to 0.59 Moderate Positive Correlation
0.6 to 0.79 Strong Positive Correlation
0.8 to 0.99 Very Strong Positive Correlation
1 Perfect Positive Correlation
CHI-SQUARE TEST

• The Chi-Square test is known as the test of goodness of fit and

Chi-Square test of Independence. In the Chi-Square test of
Independence, the frequency of one nominal variable is
compared with different values of the second nominal variable.
• The Chi-square test of Independence is used when we want to
test associations between two categorical variables.
CHI-SQUARE TEST

Assumptions
Independent random sampling
Nominal/Ordinal level data
No more than 20% of the cells have an expected frequency less than 5
No empty cells

Statistical Treatment
No ratings yet
Statistical Treatment
22 pages
Descriptive and Inferential Statistical Analysis
No ratings yet
Descriptive and Inferential Statistical Analysis
25 pages
Introduction To Deep Learning
No ratings yet
Introduction To Deep Learning
34 pages
Research Methods Chapter 5
No ratings yet
Research Methods Chapter 5
59 pages
Chapter-1 Introduction To Research Methodology.
89% (9)
Chapter-1 Introduction To Research Methodology.
42 pages
Inferential Statistics
No ratings yet
Inferential Statistics
171 pages
Unit IV - Analytics Tasks (Students)
No ratings yet
Unit IV - Analytics Tasks (Students)
127 pages
Rohit Seminar
No ratings yet
Rohit Seminar
22 pages
Quantitative Analysis
No ratings yet
Quantitative Analysis
30 pages
Gender Factor and Entrepreneurial Intention Among Final Year Students of Polytechnic
No ratings yet
Gender Factor and Entrepreneurial Intention Among Final Year Students of Polytechnic
19 pages
2 T-Test
No ratings yet
2 T-Test
26 pages
BRM Answer Key Q Bank by Alam.
No ratings yet
BRM Answer Key Q Bank by Alam.
90 pages
Research Methods Chapter 5
No ratings yet
Research Methods Chapter 5
59 pages
Nemo Analyze 5.10 User Manual
No ratings yet
Nemo Analyze 5.10 User Manual
337 pages
Group 12 - PR2
No ratings yet
Group 12 - PR2
47 pages
Statistical Analysis of Data With Report Writing
100% (2)
Statistical Analysis of Data With Report Writing
16 pages
ECO601 PPT 1 45
No ratings yet
ECO601 PPT 1 45
388 pages
Stats Reviewer (4th Quarter) - 1
No ratings yet
Stats Reviewer (4th Quarter) - 1
5 pages
Central Tendency Dispersion Visualization
No ratings yet
Central Tendency Dispersion Visualization
34 pages
Main Title: Planning Data Analysis Using Statistical Data
100% (1)
Main Title: Planning Data Analysis Using Statistical Data
40 pages
Lecture Notes in MAED Stat Part 1
100% (1)
Lecture Notes in MAED Stat Part 1
15 pages
Determinants of Capital Structure in Tanzania
100% (2)
Determinants of Capital Structure in Tanzania
34 pages
Bharathidasan University-Statistics-QP-Nov-2010
No ratings yet
Bharathidasan University-Statistics-QP-Nov-2010
3 pages
UNIT-2 by Ramanathan
No ratings yet
UNIT-2 by Ramanathan
67 pages
DS Unit 3
No ratings yet
DS Unit 3
14 pages
Firefight V 1.3.1
No ratings yet
Firefight V 1.3.1
121 pages
MATH 101-Week 7-8 - Lesson 4.1 Correlation & Regression Analysis
No ratings yet
MATH 101-Week 7-8 - Lesson 4.1 Correlation & Regression Analysis
53 pages
ba4e3ddd028c72570ef868df39c9fe65
No ratings yet
ba4e3ddd028c72570ef868df39c9fe65
370 pages
Aicp Review Stats
No ratings yet
Aicp Review Stats
62 pages
Reviewer For Psych Stats
No ratings yet
Reviewer For Psych Stats
36 pages
STAT22209 - Chapter 01-Correlation Analyisis - 2022
No ratings yet
STAT22209 - Chapter 01-Correlation Analyisis - 2022
53 pages
3 Polkinghorne
No ratings yet
3 Polkinghorne
9 pages
BRM Unit V
No ratings yet
BRM Unit V
99 pages
Lesson 18 Basic Statistical Tool
100% (1)
Lesson 18 Basic Statistical Tool
36 pages
Statistics
No ratings yet
Statistics
33 pages
Math 1011 Final Exam
No ratings yet
Math 1011 Final Exam
14 pages
Educational Statistics Reviewer
No ratings yet
Educational Statistics Reviewer
5 pages
CG8 Data-Analysis
No ratings yet
CG8 Data-Analysis
63 pages
Chapter 4: Seasonal Series: Forecasting and Decomposition
No ratings yet
Chapter 4: Seasonal Series: Forecasting and Decomposition
29 pages
DATA PROCESSING, ANALYSING AND INTERPRETATION Ipmi
100% (1)
DATA PROCESSING, ANALYSING AND INTERPRETATION Ipmi
120 pages
CH 5
No ratings yet
CH 5
26 pages
Statistical Tests of Difference: Vedasto R. Santiago High School OCTOBER 25, 2017
No ratings yet
Statistical Tests of Difference: Vedasto R. Santiago High School OCTOBER 25, 2017
41 pages
K Values For Pearson Type Iii Distribution
No ratings yet
K Values For Pearson Type Iii Distribution
4 pages
Group 1 Thesis
No ratings yet
Group 1 Thesis
10 pages
Transformation and Dummy Variables Econometrics
No ratings yet
Transformation and Dummy Variables Econometrics
34 pages
Educ 301 Angel Mae A. Llobrera
No ratings yet
Educ 301 Angel Mae A. Llobrera
14 pages
5.basic Statistics
No ratings yet
5.basic Statistics
43 pages
Psychometrics: Psicometria Psicometría
No ratings yet
Psychometrics: Psicometria Psicometría
8 pages
Chapter12345 Mylene
No ratings yet
Chapter12345 Mylene
32 pages
Research
No ratings yet
Research
21 pages
Untitled
No ratings yet
Untitled
60 pages
Inferenatial Assign, of Iqra Sajid
No ratings yet
Inferenatial Assign, of Iqra Sajid
8 pages
Unit-IV of Data Science
No ratings yet
Unit-IV of Data Science
38 pages
Research Methodology
No ratings yet
Research Methodology
18 pages
Chapter 5 Data Analysis Ab
No ratings yet
Chapter 5 Data Analysis Ab
56 pages
Homework 4
No ratings yet
Homework 4
2 pages
Factors Contributing To Rework and Their Impact On Construction Projects Performance
No ratings yet
Factors Contributing To Rework and Their Impact On Construction Projects Performance
22 pages
Dummy Variables EAB
No ratings yet
Dummy Variables EAB
12 pages
EDU 411 Topic 5 Data Analysis
No ratings yet
EDU 411 Topic 5 Data Analysis
9 pages
Chapter 5
No ratings yet
Chapter 5
45 pages
Inquiries Chapter 4
No ratings yet
Inquiries Chapter 4
6 pages
Correlation Analysis
No ratings yet
Correlation Analysis
102 pages
Conditional Distributions and Stochastic Independence
No ratings yet
Conditional Distributions and Stochastic Independence
2 pages
Comparison of Means: Hypothesis Testing
No ratings yet
Comparison of Means: Hypothesis Testing
52 pages
Data Analysis: Parametric vs. Non-Parametric Tests
No ratings yet
Data Analysis: Parametric vs. Non-Parametric Tests
19 pages
Statistics
No ratings yet
Statistics
13 pages
BRM Data Analysis Techniques
No ratings yet
BRM Data Analysis Techniques
53 pages
Statistics
No ratings yet
Statistics
8 pages
Price Book Value & Tobin's Q: Which One Is Better For Measure Corporate Governance?
No ratings yet
Price Book Value & Tobin's Q: Which One Is Better For Measure Corporate Governance?
6 pages
Night Market As A Tourist Attraction: Territory Personal Space
No ratings yet
Night Market As A Tourist Attraction: Territory Personal Space
1 page
Final Exam
No ratings yet
Final Exam
5 pages
BRM Presentation Group 5 - Univariate & Bivariate Analysis
No ratings yet
BRM Presentation Group 5 - Univariate & Bivariate Analysis
26 pages
Cabrera. R. Designs, Sa-Rj 3
No ratings yet
Cabrera. R. Designs, Sa-Rj 3
15 pages
Med 4TH Chapter
No ratings yet
Med 4TH Chapter
14 pages
Statistical Techniques - Bda
No ratings yet
Statistical Techniques - Bda
33 pages
T6-Hang Li - Machine Learning Methods-Springer (2023) - 230-252
No ratings yet
T6-Hang Li - Machine Learning Methods-Springer (2023) - 230-252
23 pages
Data Analysis: Florenda F. Cabatit RN MA Facilitator
No ratings yet
Data Analysis: Florenda F. Cabatit RN MA Facilitator
44 pages
Program Flow
No ratings yet
Program Flow
1 page
3 Matm111
No ratings yet
3 Matm111
3 pages
"Abohan"/ Kitchen
No ratings yet
"Abohan"/ Kitchen
1 page
Existing Ground Floor Plan: Scale 1:100 MTS
No ratings yet
Existing Ground Floor Plan: Scale 1:100 MTS
1 page
Model Econometric Time Series - MATLAB
No ratings yet
Model Econometric Time Series - MATLAB
2 pages
Theory Question For 504 A
No ratings yet
Theory Question For 504 A
2 pages
HW 1
No ratings yet
HW 1
2 pages
Week 10: Basic Concept
No ratings yet
Week 10: Basic Concept
11 pages
Quantitative Data Analysis: Harshad Bajpai
No ratings yet
Quantitative Data Analysis: Harshad Bajpai
26 pages
Statistics: An Introduction and Overview
No ratings yet
Statistics: An Introduction and Overview
51 pages
Stats 4TH Quarter Reviewer
No ratings yet
Stats 4TH Quarter Reviewer
2 pages
What Is Statistics
No ratings yet
What Is Statistics
5 pages

Review of Basic Stat

Uploaded by

Review of Basic Stat

Uploaded by

REVIEW OF

The Nominal Scale

The Ordinal Scale

The Interval Scale

The Ratio Scale

 Reject H0 if the test statistic falls within the critical region.

Reject H0 if P-value   (where  is the significance level, such

THE FREQUENCY DISTRIBUTION TABLE

Table 1. Distribution Respondents by Educational Level

Interpretation: Most of the respondents are college graduate which constitute

Table 1. Frequency Distribution of the Age of the Respondents

Charts and Graphs

Distribution of Respondent s by Type of Behavior Distribution of Respondent s by Type of Behavior

E STATISTICS There are three main measures of absolute dispersion:

1. MEASURES OF • Standard deviation

Some measures of absolute dispersion:

If X follows a normal distribution, then X can be transformed into a standard

• T-test is a parametric test that is commonly used

Used to test single population mean

It is an appropriate technique for estimating the parameters of a linear model, Y

Correlation is a measure of the direction and strength of linear

Pearson Correlation coefficient is a numerical value that measures strength and

-0.6 to -0.79 Strong Negative Correlation

• The Chi-Square test is known as the test of goodness of fit and

You might also like