RM Practical File
RM Practical File
UNIVERSITY
RESEARCH METHODOLOGY
PRACTICAL FILE
SUBJECT CODE: BBA 208
Exercise 9
How to apply Paired-Sample T Test.
Exercise 10 How to apply one way ANOVA test.
Exercise 11
How to apply one way ANOVA test.
Exercise 12 How to apply Chi-Square Test.
What is SPSS?
SPSS is a Windows based program that can be used to perform data entry and
analysis and to create tables and graphs. SPSS is capable of handling large amounts
of data and can perform all of the analyses covered in the text and much more. SPSS
is commonly used in the Social Sciences and in the business world, so familiarity
with this program should serve you well in the future. SPSS is updated often. This
document was written around an earlier version, but the differences should not cause
any problems.
Opening SPSS
Depending on how the computer you are working on is structured, you can
open SPSS in one of two ways.
1. If there is an SPSS shortcut like this SPSS on the desktop, simply put the cursor on it and
double click the left mouse button.
2. Click the left mouse button on the START button on your screen, then put your cursor on
Programs or All Programs and left click the mouse. Select SPSS 20.0for Windows by clicking the left
mouse button. (For a while that started calling the program PASW Statistics 20, but they seem to have
given that up as a dumb idea when everyone else calls it SPSS. The version number may change by the
time you read this.)Either approach will launch the program.
You will see a screen that looks like the image on the next page. The dialog box
that appears offers choices of running the tutorial, typing in data, running queries, or
opening an existing data source. The window behind this is the Data Editor window
which is used to display the data from whatever file you are using. You could select
anyone of the options on the start-up dialog box and click OK, or you could simply hit
Cancel. If you hit Cancel, you can either enter new data in the blank Data Editor or
you could open an existing file using the File menu bar as explained later.
aClick Cancel, and we’ll get acquainted with the layout of SPSS.
Layout of SPSS
The Data Editor window has two views that can be selected from the lower
left hand side of the screen. Data View is where you see the data you are using.
Variable View is where you can specify the format of your data when you are
creating a file or where you can check the format of a pre-existing file. The data in
the Data Editor is saved in a file with the extension .sav.
Menubar
Icons
Start-updialogbox
The other most commonly used SPSS window is the SPSS Viewer window
which displays the output from any analyses that have been run and any error
messages.
Information from the Output Viewer is saved in a file with the extension . so. Let’s
open an output file and look at it.
aOn the File menu, click Open and select Output. Select appendix output. Spo from
the files that can be found at
http://www.uvm.edu/~dhowell/fundamentals7/SPSSManual/SPSSLongerManual/
DataForSPSS/.(At the moment this set of web pages is the most recent version which
ever of my books you are using.) Click Ok. The following will appear. The left hand
side is an outline of all of the output in the file.The right side is the actual output.To
shrink or enlarge either side put your cursor on the line that divides them. When the
double headed arrow appears, hold the left mouse button and move the line in either
direction. Release the button and the size will be adjusted.
Finally, there is the Syntax window which displays the command language used to run
various operations. Typically, you will simply use the dialog boxes to set up
commands , and would not see the Syntax window. The Syntax window would be
activated if you pasted the commands from the dialog box to it, or if you wrote you
own syntax—something we will not focus on here. Syntax files end in the
extension .spss.
SPSS Menus and Icons
Now, let’s review the menus and icons.
aReviewtheoptionslistedundereachmenuontheMenuBarbyclickingthemoneatatim
e. Follow along with the below descriptions.
View allows you to select which toolbars you want to show, select font size,
add or remove the gridlines that separate each piece of data, and to select whether
or not to display your raw data or the data labels.
Data allows you to select several options ranging from displaying data
that is sorted by a specific variable to selecting certain cases for subsequent
analyses.
Utilities allows you to list file information which is a list of all variables,
the relabels, values, locations in the data file, and type.
Add-ons are programs that can be added to the base SPSS package. You
probably do not have access to any of those.
Window can be used to select which window you want to view (i.e., Data
Editor, Output Viewer, or Syntax). Since we have a data file and an output file
open, let’s try this.
Help has many useful options including a link to the SPSS homepage, a
statistics coach, and a syntax guide. Using topics, you can use the index option to
type in any keyword and get a list of options, or you can view the categories and
subcategories available under contents. This is an excellent too land can be used to
troubleshoot most problems.
The Icons directly under the Menu bar provide shortcuts to many common
commands that are available in specific menus. Take a moment to review these as
well.
aPlace your cursor over the Icons for a few seconds, and a description of the
under lying command will appear. For example, this icon is the shortcut for
Save.
QUESTIONNAIRE
A questionnaire is a research instrument consisting of a series of questions for the purpose of gathering
information from respondents. Questionnaires can be thought of as a kind of written interview. They can be
carried out face to face, by telephone, computer or post.
FREQUENCY PROCEDURE
The Frequencies procedure can produce summary measures for categorical variables in the form of
frequency tables, bar charts, or pie charts.
A percentage frequency distribution is a display of data that specifies the percentage of observations that
exist for each data point or grouping of data points. It is a particularly useful method of expressing the
relative frequency of survey responses and other data. Many times, percentage frequency distributions are
displayed as tables or as bar graphs or pie charts.
The process of creating a percentage frequency distribution involves first identifying the total number of
observations to be represented; then counting the total number of observations within each data point or
grouping of data points; and then dividing the number of observations within each data point or grouping of
data points by the total number of observations. The sum of all the percentages corresponding to each data.
Frequencies Statistics –
(continue) - - (ok).
Frequencies chart -
Bar chart
Histogram –show normal curve
(continue)---(ok).
MEANS
Mean implies average and it is the sum of a set of data divided by the number of data. Mean can prove to be
an effective tool when comparing different sets of data. The mean is the most frequently used measure of
central tendency because it uses all values in the data set to give you an average. For data from skewed
distributions, the median is better than the mean because it isn't influenced by extremely large values.
Steps :Analyze–Compare Means- Means
Variables – Height (ScaleVariable ) ,Gender (Nominal Variable ) male -1,female-2.
Means –Dependent list (height ) , independent list –(gender ).
Click ----(ok).
HISTOGRAM
A histogram is a method that uses bars to display count or frequency data. The independent variable consists
of interval- or ratio-level data and is usually displayed on the abscissa (x-axis), and the frequency data on the
ordinate (y-axis), with the height of the bar proportional to the count.
It is used to summarize discrete or continuous data that are measured on an interval scale. It is often used to
illustrate the major features of the distribution of the data in a convenient form. It is also useful when dealing
with large data sets ,
Variables – height (Scale Variable ), weight (Scale Variable).
Steps; Graphs – Chart Builder
Gallery-(Histogram )- Elemen Properties
Set Patameters (In Custom )Achor Value-0,Interval width -1 and
Number of interval -3 and 400 .
(countinue)-----(apply)-----(ok)
Reliability Analysis
Reliability analysis allows you to study the properties of measurement scales and the items that compose the
scales. The Reliability Analysis procedure calculates a number of commonly used measures of scale
reliability and also provides information about the relationships between individual items in the scale
Testing – to check the correlation in mean of inner -items that how they are reliable using scale.
Steps : Analysis-Scale-Reliability analysis
- Descriptive for Item, Scale, item if deleted
- Inner item -correlation
-Summaries – Means and Correlation
NORMALITY TEST
Normality tests are used to determine if a data set is well-modeledby a normal distribution and to compute
how likely it is for a random variable underlying the data set to be normally distributed.
More precisely, the tests are a form of model selection, and can be interpreted several ways, depending on
one's interpretations of probability.
(continue)------(ok)
One-Sample t-test
The one-sample t-test is a statistical hypothesis test used to determine whether an
unknown population mean is different from a specific value.
Variable :-Height (Scale Variable )
Steps :-
Analyze - Compare Means - One-Sample T Test.
Drag and drop the variable you want to test against the population mean
into the Test Variable(s) box.
Specify your population mean in the Test Value box.
Click OK.
Independent Samples t-test
Testing – Whether the population mean for one category is significantly different from the
population mean for another category, where in the two samples drawn are independent of
each other.
Two independent samples to be measured at a single point in time and since, the groups are
independent of each other; we can appropriately apply independent samples t-test.
Variables: Gender and Height w.r.t. the reference data set
Categorical (Nominal Scale) Variable – Gender (Males – 1 & Females – 2)
Height (Scale Variable)
HO: “Is there a significant difference in the height of males and females”?
Steps: Analyze ---- Compare Means -----Independent samples t-test
Height – Test Variable Box; Gender - Grouping Variable
Define Groups: Group 1 – 1; Group 2 – 2 --------Continue -------OK
Output Window: Group Statistics table – Descriptive statistics table
Independent Samples test table – Inferential statistics table
We need to focus on the middle part of the inferential statistics’ table including the
following:
a) T calculated value
b) P-value (sig.)
c) Degrees of freedom (df)
d) Equal variances assumed row for Height variable
Three ways to determine the statistical significance of the difference between the heights
across the categories (males and females):
a) T calculated Value > T (critical value); Reject HO
b) P-Value < 0.05; Reject HO
c) If the confidence interval doesn’t include 0; Reject HO
Paired Samples t-test
OR
Within-Subjects design/Repeated Measures Design/Paired Samples
Design
A single sample to be measured at different points in time (just like a before and after
design) and since, the groups are dependent of each other; we can appropriately apply paired
samples t-test.
Paired Samples t-test ---- Comparing two related means about population
Variables: Gender and Height & Weight w.r.t. the reference data set
Categorical (Nominal Scale) Variable – Gender (Males – 1 & Females – 2)
Height (Scale Variable) & Weight (Scale Variable)
Steps: Analyze ----- Compare Means ----- Paired samples t-test
Paired Variables Box: Height – Variable 1 & Weight – Variable 2 ------- OK
Paired Samples Statistics and Paired Samples Correlation Tables – Descriptive Statistics
Table
Paired Samples t-test – Inferential statistics table
Three ways to determine the statistical significance of the difference between the heights
across the categories (males and females):
a) T calculated Value > T (critical value); Reject HO
b) P-Value < 0.05; Reject HO
c) If the confidence interval doesn’t include 0; Reject HO
Following a series of complaints about wicked witches, the Wizard conducts a study to
determine certain regions of city have more problems with wicked witches than other
regions. He randomly surveys five munchkins from each of four regions and records the
number of complaints he received about wicked witchiness from each one.
HO: Is there a difference in witch wickedness between regions?
Steps: Analyze ----- Compare means ----- One Way ANOVA
Right click on any variable and enable ‘Display Variable Name’
Complaints – Dependent List and Region – Factor List
Click on Post hoc ---- Click on Tukey (Each region has same no. of participants), Click -----
Games Howell ----- Continue
Click ----- Options, Select (Descriptive, Homogeneity of variance test, Welch test, means
plot) ----- Continue ----- OK
Interpretation:
Homogeneity of variance test is tested through Levene’s test. In our case Levene’s value
(standard value – 0.05) is not even close to significant 0.918. Therefore, the groups are not
statistically significantly different ------ Group variances are homogeneous.
This indicates that we do not need Welch ANOVA test and Games Howell post hoc test.
Welch’s ANOVA
Only report Welch ANOVA if Levene’s test is significant so the assumption of equality of
variances has been violated.
There was a significant difference among the four regions on wickedness complaints
Post hoc Results:
There is a significant difference between pairs of regions with North and South having fewer
complaints than East and West regions.
These findings indicate that there are more wicked witches’ complaints in the eastern and
western part of the city.
Chi- SQUARE TEST
(A test for the statistical significance of the strength of an association)
The chi-square test for independence, also called Pearson's chi-square test or the
For applying Chi-square test, data can be ordinal or categorical. The objective is to
compare the distribution of responses, or the proportions of participants in each
response category, to a known distribution.
The observed frequencies in each response category are compared to the frequencies
that would be expected if the null hypothesis was true.
Χ2 = Σ(f0 – fe)2/fe
Fo = Observed frequencies Fe =
expected frequencies
Steps:
Analyze ---- Descriptive Statistics----------Cross tabs
Two nominal/categorical variables within the row and column box (either way)
Cells (Click on expected and observed)---------Continue
Statistics ---- Chi-square-----------Continue
Line graphs are used to track changes over short and long periods of time. When smaller
changes exist, line graphs are better to use than bar graphs. Line graphs can also be used to
compare changes over the same period of time for more than one group.
Correlation
A correlation is simply defined as a relationship between two variables. The whole purpose of using
correlations in research is to figure out which variables are connected.
PEARSON’S CORRELATION COIEFFICIENT (r) is applicable to determine the strength of correlation
between the two quantitative variables involved at a time.
Edward Spearman’s correlation (p) is applicable to determine the strength of correlation between two of the
qualitative nature of variables.
Correlation coefficient ranges within ±1 wherein a value closer to ±1 represents the stronger relationship
between the variables involved.
Regression
Regression analysis is used when you want to predict a continuous dependent variable from a number of independent
variables. If the dependent variable is dichotomous, then logistic regression should be used.
Regression is applied to determine the cause and effect relationship between the variables involved. Regression
conveys us the causation between the variables.
Questionnaires’ link applicable at variable description
http://spss.allenandunwin.com.s3-website-ap-southeast-
2.amazonaws.com/data-files.html#.Xhj5L39Ki1u
SPSS data files and exercises
http://spss.allenandunwin.com.s3-website-ap-southeast-
2.amazonaws.com/data-files.html#.YnizD7fhUzY