0% found this document useful (0 votes)

40 views57 pages

1 Data Collection Procedure Research Instrument and Interpretation of Data

Uploaded by

johnbenedictrago

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

40 views57 pages

1 Data Collection Procedure Research Instrument and Interpretation of Data

Uploaded by

johnbenedictrago

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 57

A Research instrument is a tool used to collect, measure, and

analyze data related to your research interests.

These tools are mostly used in health sciences, social

sciences, and education to assess patients, clients, students,
teachers, staff, etc.

A research instrument can include interviews, tests, surveys,

or checklists.

The Research instrument is usually determined by the

researcher and is tied to the study methodology.
Collecting data is one major component of any type of research.

Undermining its importance would result in the production of

inaccurate data sufficient to render your research study invalid.

Hence, in collecting quantitative data, stress is given to the

accuracy or appropriateness of your data-gathering technique as
well as of the right instrument to collect the data.
Name the
following
pictures
presented
• Bar graphs should be
used for categoric,
ordered, and discrete
variables. If the number
of units in a discrete
variable is large it may
be displayed as a
continuous variable.
• Line graphs should be
used for continuous
variables.
• Pie graphs (sometimes called pie
or circle charts) are used to show
the parts that make up a whole.
They can be useful for comparing
the size of relative parts. Because
it is difficult to compare
different circle graphs, and often
hard to compare the angles of
different sectors of the pie, it is
sometimes better to choose other
sorts of graphs.
• Tables are generally used to
present large amounts of exact
values of qualitative or
quantitative data, rather than
quantitative information such as
trends or patterns. Tables can be
used to summarize information
from the Methods or Results.
When preparing tables, keep in
mind that they must be able to
stand alone.
Types of Variable
The types of variables you have usually determine what type of statistical test
you can use.
Quantitative variables represent amounts of things (e.g. the number of trees in a
forest). Types of quantitative variables include:
• Continuous (aka ratio variables): represent measures and can usually be
divided into units smaller than one (e.g. 0.75 grams).
• Discrete (aka integer variables): represent counts and usually can’t be
divided into units smaller than one (e.g. 1 tree).

Categorical variables represent groupings of things (e.g. the different tree

species in a forest). Types of categorical variables include:
• Ordinal: represent data with an order (e.g. rankings).
• Nominal: represent group names (e.g. brands or species names).
• Binary: represent data with a yes/no or 1/0 outcome (e.g. win or lose).
• summarize the characteristics of a data set
• allows you to describe a data set

Using descriptive statistics, you can report characteristics

of your data:
• The distribution concerns the frequency of each value.
• The central tendency concerns the averages of the
values.
• The variability concerns how spread out the values are.
• You collect data on the NAT scores of all 11th
graders in a school for three years.

• You can use descriptive statistics to get a quick

overview of the school’s scores in those years.
You can then directly compare the mean NAT
score with the mean scores of other schools.
Measures of central tendency help you find the middle, or
the average, of a dataset. The 3 most common measures
of central tendency are the mode, median, and mean.

• Mode: the most frequent value.

• Median: the middle number in an ordered dataset.
• Mean: the sum of all values divided by the total number
of values.
A dataset is a distribution of n number of scores or
values.
In a normal distribution,
data is symmetrically
distributed with no skew.
Most values cluster
around a central region,
with values tapering off as
they go further away from
the center. The mean,
mode and median are
exactly the same in a
normal distribution.
In skewed distributions, more values fall on one side of the
center than the other, and the mean, median and mode all
differ from each other. One side has a more spread out and
longer tail with fewer scores at one end than the other. The
direction of this tail tells you the side of the skew

In a positively skewed distribution, there’s a cluster of lower

scores and a spread out tail on the right. In a negatively skewed
distribution, there’s a cluster of higher scores and a spread out
tail on the left.
In this histogram, your distribution is skewed to In this histogram, your distribution is skewed to the left,
the right, and the central tendency of your and the central tendency of your dataset is towards the
dataset is on the lower end of possible scores. higher end of possible scores.
In a positively skewed distribution, In a negatively skewed distribution,
mode < median < mean. mean < median < mode.
The mode is the most frequently occurring value in the dataset. It’s possible to
have no mode, one mode, or more than one mode.

To find the mode, sort your dataset numerically or categorically and select the
response that occurs most frequently.
The median of a dataset is the value that’s exactly in the middle when it is ordered
from low to high
For an odd-numbered dataset, find the value that lies at
the position, where n is the number of values in the dataset.
The arithmetic mean of a dataset (which is different from the geometric mean) is
the sum of all values divided by the total number of values. It’s the most commonly
used measure of central tendency because all values are used in the calculation.
The 3 main measures of central tendency are best used in combination
with each other because they have complementary strengths and
limitations. But sometimes only 1 or 2 of them are applicable to your
dataset, depending on the level of measurement of the variable.

• The mode can be used for any level of measurement,

but it’s most meaningful for nominal and ordinal
levels.
• The median can only be used on data that can be
ordered – that is, from ordinal, interval and ratio
levels of measurement.
• The mean can only be used on interval and ratio
levels of measurement because it requires equal
spacing between adjacent values or scores in the
scale.
help you come up with conclusions and make predictions
based on your data

Inferential statistics have two main uses:

• making estimates about populations (for example, the
mean NAT score of all 11th graders in the US).
• testing hypothesis to draw conclusions about populations
(for example, the relationship between NAT scores and
family income).
The characteristics of samples and populations
are described by numbers called statistics and
parameters:

• A statistics is a measure that describes the

sample (e.g., sample mean).
• A parameter is a measure that describes the
whole population (e.g., population mean).
Statistical tests come in three forms:
tests of (1) comparison, (2) correlation or
(3) regression.
Comparison tests assess whether there are differences in
means, medians or rankings of scores of two or more
groups.

To decide which test suits your aim, consider whether

your data meets the conditions necessary for parametric
tests, the number of samples, and the levels of
measurement of your variables.

Means can only be found for interval or ratio data, while

medians and rankings are more appropriate measures for
ordinal data.
T Test
• A t test is a statistical test that is used to compare the means of
two groups. It is often used in hypothesis testing to determine
whether a process or treatment actually has an effect on the
population of interest, or whether two groups are different from
one another.
• When choosing a t test, you will need to consider two things:
whether the groups being compared come from a single population
or two different populations, and whether you want to test the
difference in a specific direction.
• If the groups come from a single population (e.g., measuring
before and after an experimental treatment), perform a paired t
test. This is a within-subjects design.
• If the groups come from two different populations (e.g., two
different species, or people from two separate cities), perform a
two-sample t test (a.k.a. independent t test). This is a between-
subjects design.
• If there is one group being compared against a standard value
(e.g., comparing the acidity of a liquid to a neutral pH of 7),
perform a one-sample t test.
One-tailed or two-tailed t test
• If you only care whether the two populations are
different from one another, perform a two-tailed t
test.
• If you want to know whether one population mean is
greater than or less than the other, perform a one-
tailed t test.
ANOVA
• ANOVA, which stands for Analysis of Variance, is a statistical
test used to analyze the difference between the means of more
than two groups.
• A one-way ANOVA uses one independent variable, while a two-way
ANOVA uses two independent variables.
ANOVA
• Use a one-way ANOVA when you have collected data about one
categorical independent variable and one quantitative dependent
variable. The independent variable should have at least three
levels (i.e. at least three different groups or categories).
• ANOVA tells you if the dependent variable changes according to
the level of the independent variable. For example:
• Your independent variable is social media use, and you assign groups to
low, medium, and high levels of social media use to find out if there is a
difference in hours of sleep per night.
• Your independent variable is brand of soda, and you collect data on
Coke, Pepsi, and Sprite to find out if there is a difference in the price
per 100ml.
• Your independent variable is type of fertilizer, and you treat crop
fields with mixtures 1, 2 and 3 to find out if there is a difference in
crop yield.
Correlation tests determine the extent to which two variables
are associated.

Although Pearson’s r is the most statistically powerful test,

Spearman’s rho is appropriate for interval and ratio variables
when the data doesn’t follow a normal distribution.

The chi square test of independence is the only test that can be
used with nominal variables.
Pearson Correlation Coefficient (r)
• The Pearson correlation coefficient (r) is the most common way
of measuring a linear correlation. It is a number between –1 and 1
that measures the strength and direction of the relationship
between two variables.
Pearson Correlation Coefficient (r)
Pearson Correlation Coefficient (r)
Spearman’s Rho
• Spearman’s Rho is used to understand the strength of the
relationship between two variables. Your variables of interest can
be continuous or ordinal and should have a monotonic relationship.
• Every statistical method has assumptions. Assumptions mean
that your data must satisfy certain properties in order for
statistical method results to be accurate (1) continuous or
ordinal, (2) monotonicity.
Chi-square Test
• A Pearson’s chi-square test is a statistical test for
categorical data. It is used to determine whether your
data are significantly different from what you expected.
Chi-square Test of Independence
• You can use a chi-square test of independence when you have two
categorical variables. It allows you to test whether the two
variables are related to each other. If two variables are
independent (unrelated), the probability of belonging to a certain
group of one variable isn’t affected by the other variable.
Chi-square goodness of fit test
• You can use a chi-square goodness of fit test when you have one
categorical variable. It allows you to test whether the frequency
distribution of the categorical variable is significantly different
from your expectations. Often, but not always, the expectation is
that the categories will have equal proportions.
Chi-square goodness of fit test
Regression tests demonstrate whether changes in predictor
variables cause changes in an outcome variable. You can decide which
regression test to use based on the number and types of variables
you have as predictors and outcomes.

Most of the commonly used regression tests are parametric. If your

data is not normally distributed, you can perform data
transformations.

Data transformations help you make your data normally distributed

using mathematical operations, like taking the square root of each
value.
Simple Linear Regression
• Simple linear regression is used to estimate the
relationship between two quantitative variables. You can
use simple linear regression when you want to know:
• How strong the relationship is between two variables
(e.g., the relationship between rainfall and soil erosion).
• The value of the dependent variable at a certain value
of the independent variable (e.g., the amount of soil
erosion at a certain level of rainfall).
Multilinear Linear Regression
• Multiple linear regression is used to estimate the relationship
between two or more independent variables and one dependent
variable. You can use multiple linear regression when you want to
know:
• How strong the relationship is between two or more
independent variables and one dependent variable (e.g. how
rainfall, temperature, and amount of fertilizer added affect
crop growth).
• The value of the dependent variable at a certain value of the
independent variables (e.g. the expected yield of a crop at
certain levels of rainfall, temperature, and fertilizer
addition).
Multilinear Linear Regression
• Suppose we fit a multiple linear regression model using the
predictor variables hours studied and prep exams taken and a
response variable exam score.
Multilinear Linear Regression
• Suppose we fit a multiple linear regression model using the
predictor variables hours studied and prep exams taken and a
response variable exam score.

Basics of Statistics: Definition: Science of Collection, Presentation, Analysis, and Reasonable
100% (1)
Basics of Statistics: Definition: Science of Collection, Presentation, Analysis, and Reasonable
33 pages
Statistical Methods
No ratings yet
Statistical Methods
43 pages
MMW Data Management
No ratings yet
MMW Data Management
35 pages
Reviewer For Psych Stats
No ratings yet
Reviewer For Psych Stats
36 pages
Statistical Techniques - Bda
No ratings yet
Statistical Techniques - Bda
33 pages
Statistics: An Introduction and Overview
No ratings yet
Statistics: An Introduction and Overview
51 pages
Statistics
83% (6)
Statistics
33 pages
Lesson 5 (Descriptive Statistics Part 1) - Oct 2024
No ratings yet
Lesson 5 (Descriptive Statistics Part 1) - Oct 2024
72 pages
Article Review 1 Eng
No ratings yet
Article Review 1 Eng
30 pages
Data Analysis and Statistical Treatment
No ratings yet
Data Analysis and Statistical Treatment
99 pages
Handout-A-Preliminaries (Advance Statistics)
No ratings yet
Handout-A-Preliminaries (Advance Statistics)
29 pages
Wa0014
No ratings yet
Wa0014
63 pages
Statistics
No ratings yet
Statistics
63 pages
Lecture Notes: (Introduction To Medical Laboratory Science Research)
No ratings yet
Lecture Notes: (Introduction To Medical Laboratory Science Research)
13 pages
Descriptive Analytics Notes
No ratings yet
Descriptive Analytics Notes
6 pages
Basic Statistics (3685) PPT - Lecture On 20-01-2019
100% (1)
Basic Statistics (3685) PPT - Lecture On 20-01-2019
64 pages
Practical Research 2 Q2 Lesson
No ratings yet
Practical Research 2 Q2 Lesson
7 pages
Letter For Exemption
No ratings yet
Letter For Exemption
9 pages
Emerging Trends & Analysis 1. What Does The Following Statistical Tools Indicates in Research
No ratings yet
Emerging Trends & Analysis 1. What Does The Following Statistical Tools Indicates in Research
7 pages
RESEARCH
No ratings yet
RESEARCH
5 pages
Lesson-6 - Data Analysis
No ratings yet
Lesson-6 - Data Analysis
24 pages
Statistics SS2020
No ratings yet
Statistics SS2020
12 pages
Unit II TYCS DS
No ratings yet
Unit II TYCS DS
176 pages
CG8 Data-Analysis
No ratings yet
CG8 Data-Analysis
63 pages
Lecture 1
No ratings yet
Lecture 1
72 pages
Gned 3 Finals Reviewer
No ratings yet
Gned 3 Finals Reviewer
5 pages
Interpreting Test Score: Online Workshop 8602 Aiou
100% (1)
Interpreting Test Score: Online Workshop 8602 Aiou
39 pages
Statistical Instruments and References Writing in Research
No ratings yet
Statistical Instruments and References Writing in Research
36 pages
Statistics
No ratings yet
Statistics
68 pages
Organization of Data
No ratings yet
Organization of Data
6 pages
Physics
No ratings yet
Physics
6 pages
Introduction and Descriptive Statistics
No ratings yet
Introduction and Descriptive Statistics
50 pages
Chmsu Compre Notes
No ratings yet
Chmsu Compre Notes
7 pages
Statistics A Review
No ratings yet
Statistics A Review
47 pages
STATS
No ratings yet
STATS
22 pages
PSM 2020N
No ratings yet
PSM 2020N
399 pages
Statistical Foundations - Intro 64zlf
100% (2)
Statistical Foundations - Intro 64zlf
86 pages
Main Title: Planning Data Analysis Using Statistical Data
100% (1)
Main Title: Planning Data Analysis Using Statistical Data
40 pages
Levels of Data
100% (1)
Levels of Data
26 pages
Inquiries Chapter 4
No ratings yet
Inquiries Chapter 4
6 pages
Midterms Statistics Reviewer
No ratings yet
Midterms Statistics Reviewer
10 pages
Descriptive Statistics, Tables and Graphs 20
No ratings yet
Descriptive Statistics, Tables and Graphs 20
34 pages
Data Processing and Anlysis
No ratings yet
Data Processing and Anlysis
41 pages
Module-for-Blended-Thesis Writing Lesson 15
No ratings yet
Module-for-Blended-Thesis Writing Lesson 15
8 pages
Understandingstatisticsinresearch 151026064600 Lva1 App6892
No ratings yet
Understandingstatisticsinresearch 151026064600 Lva1 App6892
37 pages
SS 104 - Lecture Notes Part 1 EDITED
No ratings yet
SS 104 - Lecture Notes Part 1 EDITED
8 pages
CH11 PPT
No ratings yet
CH11 PPT
33 pages
Ummiee
No ratings yet
Ummiee
5 pages
Week 5A - Statistics Handout
No ratings yet
Week 5A - Statistics Handout
9 pages
Mathworld Reviewer Stats
No ratings yet
Mathworld Reviewer Stats
4 pages
AEB801 20222023-Lecture 03-1
No ratings yet
AEB801 20222023-Lecture 03-1
38 pages
AL - I (Unit - I)
No ratings yet
AL - I (Unit - I)
19 pages
Inferential Statistics
No ratings yet
Inferential Statistics
92 pages
Advance Statistics For Data Science and Data Analysis
No ratings yet
Advance Statistics For Data Science and Data Analysis
47 pages
Analysis of Data-Statistic: Unit IV
No ratings yet
Analysis of Data-Statistic: Unit IV
30 pages
WK 1b Biostat
No ratings yet
WK 1b Biostat
38 pages
3rd QTR Stats Reviewer
No ratings yet
3rd QTR Stats Reviewer
24 pages
Central Tendency Dispersion Visualization
No ratings yet
Central Tendency Dispersion Visualization
34 pages
Introduction To Non Parametric Methods Through R Software
From Everand
Introduction To Non Parametric Methods Through R Software
Editor IJSMI
No ratings yet
Glossary of Research Methodology
From Everand
Glossary of Research Methodology
Dr. Awadhesh Kishore
No ratings yet
M Tech Cse Syllabus 2021 Ds 1875a43a25
No ratings yet
M Tech Cse Syllabus 2021 Ds 1875a43a25
96 pages
Research Proposal: 1.0 Statement of The Problem
No ratings yet
Research Proposal: 1.0 Statement of The Problem
8 pages
Mekie Proposal
No ratings yet
Mekie Proposal
27 pages
T-Test For Two Independent Samples
No ratings yet
T-Test For Two Independent Samples
44 pages
Nist Technical Note 1297 S
100% (1)
Nist Technical Note 1297 S
25 pages
Non Probability Sampling
No ratings yet
Non Probability Sampling
6 pages
Preacher Kelley 2011
No ratings yet
Preacher Kelley 2011
23 pages
Cluster Analysis in R TML
No ratings yet
Cluster Analysis in R TML
5 pages
PPT
0% (1)
PPT
15 pages
Unit 7 Single Sampling Plans: Structure
No ratings yet
Unit 7 Single Sampling Plans: Structure
30 pages
Dissertation Using Multiple Regression
100% (3)
Dissertation Using Multiple Regression
8 pages
Curriculum Map 2019 - 2021
No ratings yet
Curriculum Map 2019 - 2021
29 pages
Stroke Prediction
No ratings yet
Stroke Prediction
10 pages
Chap11 PPT
100% (1)
Chap11 PPT
46 pages
MKT 537/536 Oct 2007
No ratings yet
MKT 537/536 Oct 2007
8 pages
533653
100% (1)
533653
20 pages
MA2216/ST2131 Probability Notes 5 Distribution of A Function of A Random Variable and Miscellaneous Remarks
No ratings yet
MA2216/ST2131 Probability Notes 5 Distribution of A Function of A Random Variable and Miscellaneous Remarks
13 pages
MAE 301: Applied Experimental Statistics
No ratings yet
MAE 301: Applied Experimental Statistics
10 pages
Detailed Lesson Plan (DLP) Format: Code
No ratings yet
Detailed Lesson Plan (DLP) Format: Code
6 pages
Matematic
No ratings yet
Matematic
244 pages
Icma Centre University of Reading: Quantitative Methods For Finance
No ratings yet
Icma Centre University of Reading: Quantitative Methods For Finance
3 pages
MCOM - All Chapter - THEORY 2023-24
No ratings yet
MCOM - All Chapter - THEORY 2023-24
41 pages
Selecting The Best Curve Fit
No ratings yet
Selecting The Best Curve Fit
4 pages
May MG 1
No ratings yet
May MG 1
19 pages
Project Report: University Business School, Chandigarh Panjab University
No ratings yet
Project Report: University Business School, Chandigarh Panjab University
12 pages
Reliance Jio
No ratings yet
Reliance Jio
11 pages
Aurora Turmelle - Updated Identifying Misleading Graphs and Stats Lesson Plan With Reflection - Edu 361-2
No ratings yet
Aurora Turmelle - Updated Identifying Misleading Graphs and Stats Lesson Plan With Reflection - Edu 361-2
10 pages
AP Stats Chapter 9B Test
No ratings yet
AP Stats Chapter 9B Test
7 pages
Quantitative Methods For Management: Session 8
No ratings yet
Quantitative Methods For Management: Session 8
60 pages
Task 1 - Example Answer
No ratings yet
Task 1 - Example Answer
9 pages

1 Data Collection Procedure Research Instrument and Interpretation of Data

Uploaded by

1 Data Collection Procedure Research Instrument and Interpretation of Data

Uploaded by

A Research instrument is a tool used to collect, measure, and

analyze data related to your research interests.

These tools are mostly used in health sciences, social

A research instrument can include interviews, tests, surveys,

The Research instrument is usually determined by the

Undermining its importance would result in the production of

Hence, in collecting quantitative data, stress is given to the

Categorical variables represent groupings of things (e.g. the different tree

Using descriptive statistics, you can report characteristics

• You can use descriptive statistics to get a quick

• Mode: the most frequent value.

In a positively skewed distribution, there’s a cluster of lower

• The mode can be used for any level of measurement,

Inferential statistics have two main uses:

• A statistics is a measure that describes the

To decide which test suits your aim, consider whether

Means can only be found for interval or ratio data, while

Although Pearson’s r is the most statistically powerful test,

Most of the commonly used regression tests are parametric. If your

Data transformations help you make your data normally distributed

You might also like