0% found this document useful (0 votes)
88 views

RM Practical File

This document provides instructions for using SPSS (Statistical Package for the Social Sciences) software. It describes opening SPSS, the main layout including the data editor window and output viewer window. It reviews the various menus across the top including File, Edit, View, Data, Transform, Analyze, Graphs, Utilities and Help. It also describes the icons and provides instructions to view different windows. The goal is to familiarize the user with navigating the SPSS interface and main components.

Uploaded by

garvit sharma
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
88 views

RM Practical File

This document provides instructions for using SPSS (Statistical Package for the Social Sciences) software. It describes opening SPSS, the main layout including the data editor window and output viewer window. It reviews the various menus across the top including File, Edit, View, Data, Transform, Analyze, Graphs, Utilities and Help. It also describes the icons and provides instructions to view different windows. The goal is to familiarize the user with navigating the SPSS interface and main components.

Uploaded by

garvit sharma
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 59

GURU GOBIND SINGH INDRAPRASTHA

UNIVERSITY

INSTITUTE OF INNOVATION IN TECHNOLOGY &


MANAGEMENT

RESEARCH METHODOLOGY
PRACTICAL FILE
SUBJECT CODE: BBA 208

Submitted To: Submitted By:


Mr. Akshay Chauhan Name: GARVIT SHARMA
Asst. Professor Class: BBA 2nd Year M-1
Enrollment No.:04690301720
INDEX
Exercise Topic Signature/Remarks
No
Exercise 1 Creation of the Google form for data collection.

Exercise 2 Importing of data file in Excel from google form and in


SPSS from excel.
Exercise 3 Data Manipulation: Sorting, Filtering in Excel and
Sorting and Splitting of files in SPSS.

Exercise 4 Renaming of variables, changing the variable type and


labelling variables in SPSS –
A researcher wanted to conduct a research and wish to
study the opinion about online education with the
following categories identified
1. To study the opinion on the basis of gender (Gender)
Gender: 1,2,3 (1=Male, 2=Female,
3=Transgender)
2. To study the opinion on the basis of their age (Age)
Age: 1,2,3,4(less than 18, 18-25, 25-40, 40 &
above)
3. To study the opinion on the basis of their occupation
(Occupation)
Occupation: 1,2,3,4 (1=Student, 2=
Salaried(pvt), 3= Salaried (Govt),4=Business)
4. What is your opinion about online education:
Satisfactory or Dissatisfactory (Satisfaction level)
2 categories: 1,2 (1= Satisfaction, 2 =
Dissatisfaction)
Create the variables in SPSS and a database with 10
respondents.

Exercise 5 From the information given in the Demog data file,


create the Frequency tables of the categorical variable:
Gender, Age and Occupation in SPSS & Excel (count
if).

How to apply One-Sample T Test. Also create Bar graph


for weight of the respondents and Histogram for height
Exercise 6 of the respondents.
Data to be Used –
Exercise 7 How to apply One-Sample T Test.

Exercise 8 How to apply Independent-Sample T Test.

Exercise 9
How to apply Paired-Sample T Test.
Exercise 10 How to apply one way ANOVA test.

Exercise 11
How to apply one way ANOVA test.
Exercise 12 How to apply Chi-Square Test.

Exercise 13 How to apply Chi-Square Test.


Exercise 14 How to apply correlation.
(Use Advert Data)
Exercise 15 How to apply regression.
(Use Advert Data)
Also draw Scatter chart and Line chart.
Exercise 16 How to draw Simple Box Plot.
(Use Demo Data)
SPSS

What is SPSS?
SPSS is a Windows based program that can be used to perform data entry and
analysis and to create tables and graphs. SPSS is capable of handling large amounts
of data and can perform all of the analyses covered in the text and much more. SPSS
is commonly used in the Social Sciences and in the business world, so familiarity
with this program should serve you well in the future. SPSS is updated often. This
document was written around an earlier version, but the differences should not cause
any problems.
Opening SPSS
Depending on how the computer you are working on is structured, you can
open SPSS in one of two ways.
1. If there is an SPSS shortcut like this SPSS on the desktop, simply put the cursor on it and
double click the left mouse button.
2. Click the left mouse button on the START button on your screen, then put your cursor on
Programs or All Programs and left click the mouse. Select SPSS 20.0for Windows by clicking the left
mouse button. (For a while that started calling the program PASW Statistics 20, but they seem to have
given that up as a dumb idea when everyone else calls it SPSS. The version number may change by the
time you read this.)Either approach will launch the program.

aUse one of these approaches to open SPSS yourself.

You will see a screen that looks like the image on the next page. The dialog box
that appears offers choices of running the tutorial, typing in data, running queries, or
opening an existing data source. The window behind this is the Data Editor window
which is used to display the data from whatever file you are using. You could select
anyone of the options on the start-up dialog box and click OK, or you could simply hit
Cancel. If you hit Cancel, you can either enter new data in the blank Data Editor or
you could open an existing file using the File menu bar as explained later.

aClick Cancel, and we’ll get acquainted with the layout of SPSS.
Layout of SPSS
The Data Editor window has two views that can be selected from the lower
left hand side of the screen. Data View is where you see the data you are using.
Variable View is where you can specify the format of your data when you are
creating a file or where you can check the format of a pre-existing file. The data in
the Data Editor is saved in a file with the extension .sav.
Menubar

Icons

Start-updialogbox

The other most commonly used SPSS window is the SPSS Viewer window
which displays the output from any analyses that have been run and any error
messages.
Information from the Output Viewer is saved in a file with the extension . so. Let’s
open an output file and look at it.

aOn the File menu, click Open and select Output. Select appendix output. Spo from
the files that can be found at
http://www.uvm.edu/~dhowell/fundamentals7/SPSSManual/SPSSLongerManual/
DataForSPSS/.(At the moment this set of web pages is the most recent version which
ever of my books you are using.) Click Ok. The following will appear. The left hand
side is an outline of all of the output in the file.The right side is the actual output.To
shrink or enlarge either side put your cursor on the line that divides them. When the
double headed arrow appears, hold the left mouse button and move the line in either
direction. Release the button and the size will be adjusted.

Finally, there is the Syntax window which displays the command language used to run
various operations. Typically, you will simply use the dialog boxes to set up
commands , and would not see the Syntax window. The Syntax window would be
activated if you pasted the commands from the dialog box to it, or if you wrote you
own syntax—something we will not focus on here. Syntax files end in the
extension .spss.
SPSS Menus and Icons
Now, let’s review the menus and icons.

aReviewtheoptionslistedundereachmenuontheMenuBarbyclickingthemoneatatim
e. Follow along with the below descriptions.

File includes all of the options you


typically use in other programs, such as
open, save, exit. Notice, that you can open or
create new files of multiple types as
illustrated to the right.

Edit includes the typical cut, copy,


and paste commands, and allows you to
specify various options for displaying data
and output.

aClick on Options, and you will seethe


dialog box to the left. You can use this to
format the data, output, charts, etc. These
choices are rather overwhelming, and
you can simply take the default options
for now. The author of your text (me)was
too dumb to even know these options
could easily be set.

View allows you to select which toolbars you want to show, select font size,
add or remove the gridlines that separate each piece of data, and to select whether
or not to display your raw data or the data labels.

Data allows you to select several options ranging from displaying data
that is sorted by a specific variable to selecting certain cases for subsequent
analyses.

Transform includes several options to change current variables. For


example, you can change continuous variables to categorical variables, change
scores into rank scores, add a constant to variables, etc.
Analyze includes all of the commands to carry out statistical analyses and to
calculate descriptive statistics. Much of this book will focus on using commands
located in this menu.

Graphs includes the commands to create various types of


Graphs including box plots, histograms, line graphs, and bar charts.

Utilities allows you to list file information which is a list of all variables,
the relabels, values, locations in the data file, and type.

Add-ons are programs that can be added to the base SPSS package. You
probably do not have access to any of those.

Window can be used to select which window you want to view (i.e., Data
Editor, Output Viewer, or Syntax). Since we have a data file and an output file
open, let’s try this.

aSelect Window/Data Editor. Then select Window/SPSS Viewer.

Help has many useful options including a link to the SPSS homepage, a
statistics coach, and a syntax guide. Using topics, you can use the index option to
type in any keyword and get a list of options, or you can view the categories and
subcategories available under contents. This is an excellent too land can be used to
troubleshoot most problems.

The Icons directly under the Menu bar provide shortcuts to many common
commands that are available in specific menus. Take a moment to review these as
well.

aPlace your cursor over the Icons for a few seconds, and a description of the
under lying command will appear. For example, this icon is the shortcut for
Save.
QUESTIONNAIRE
A questionnaire is a research instrument consisting of a series of questions for the purpose of gathering
information from respondents. Questionnaires can be thought of as a kind of written interview. They can be
carried out face to face, by telephone, computer or post.
FREQUENCY PROCEDURE
 The Frequencies procedure can produce summary measures for categorical variables in the form of
frequency tables, bar charts, or pie charts.
A percentage frequency distribution is a display of data that specifies the percentage of observations that
exist for each data point or grouping of data points. It is a particularly useful method of expressing the
relative frequency of survey responses and other data. Many times, percentage frequency distributions are
displayed as tables or as bar graphs or pie charts.

The process of creating a percentage frequency distribution involves first identifying the total number of
observations to be represented; then counting the total number of observations within each data point or
grouping of data points; and then dividing the number of observations within each data point or grouping of
data points by the total number of observations. The sum of all the percentages corresponding to each data.

Testing- the difference in frequencies of variables in statistics and chart format.

Variables - Unique identification number (Nominal Variable ),Gender(Nominal Variable )

(1-Male ,2- female )

Height (ScaleVariable ),Weight (Scale Variable )

Steps; Analyze- Descriptive- Frequencies.

Frequencies Statistics –

 Percentile value –Quartiles


 Central Tendency- Mean and Sum
 Dispersion- Standard deviation, Varience, Range ,Minimum ,Maximum.
 Distribution – Skewness, Kurtisos

(continue) - - (ok).

Frequencies chart -

 Bar chart
 Histogram –show normal curve

(continue)---(ok).
MEANS
Mean implies average and it is the sum of a set of data divided by the number of data. Mean can prove to be
an effective tool when comparing different sets of data. The mean is the most frequently used measure of
central tendency because it uses all values in the data set to give you an average. For data from skewed
distributions, the median is better than the mean because it isn't influenced by extremely large values.
Steps :Analyze–Compare Means- Means
Variables – Height (ScaleVariable ) ,Gender (Nominal Variable ) male -1,female-2.
Means –Dependent list (height ) , independent list –(gender ).
Click ----(ok).
HISTOGRAM
A histogram is a method that uses bars to display count or frequency data. The independent variable consists
of interval- or ratio-level data and is usually displayed on the abscissa (x-axis), and the frequency data on the
ordinate (y-axis), with the height of the bar proportional to the count.
It is used to summarize discrete or continuous data that are measured on an interval scale. It is often used to
illustrate the major features of the distribution of the data in a convenient form. It is also useful when dealing
with large data sets ,
Variables – height (Scale Variable ), weight (Scale Variable).
Steps; Graphs – Chart Builder
 Gallery-(Histogram )- Elemen Properties
 Set Patameters (In Custom )Achor Value-0,Interval width -1 and
 Number of interval -3 and 400 .

(countinue)-----(apply)-----(ok)
Reliability Analysis

Reliability analysis allows you to study the properties of measurement scales and the items that compose the
scales. The Reliability Analysis procedure calculates a number of commonly used measures of scale
reliability and also provides information about the relationships between individual items in the scale
Testing – to check the correlation in mean of inner -items that how they are reliable using scale.
Steps : Analysis-Scale-Reliability analysis
- Descriptive for Item, Scale, item if deleted
- Inner item -correlation
-Summaries – Means and Correlation
NORMALITY TEST
Normality tests are used to determine if a data set is well-modeledby a normal distribution and to compute
how likely it is for a random variable underlying the data set to be normally distributed.
More precisely, the tests are a form of model selection, and can be interpreted several ways, depending on
one's interpretations of probability.

Variables—Height(Scale Variable ),Gender(Nominal Variable ).

To run the Normality Test ,Analyze -Descriptive Statistics -Explore.

 Dependent list –( height) , Independent list – (Gender)


 Explore plot --- box plot-Factor level together , Descriptive(Histogram),
 Normal plot with test

(continue)------(ok)
One-Sample t-test
The one-sample t-test is a statistical hypothesis test used to determine whether an
unknown population mean is different from a specific value.
Variable :-Height (Scale Variable )
Steps :-
 Analyze - Compare Means - One-Sample T Test.
 Drag and drop the variable you want to test against the population mean
into the Test Variable(s) box.
 Specify your population mean in the Test Value box.
 Click OK.
Independent Samples t-test
Testing – Whether the population mean for one category is significantly different from the
population mean for another category, where in the two samples drawn are independent of
each other.
Two independent samples to be measured at a single point in time and since, the groups are
independent of each other; we can appropriately apply independent samples t-test.
Variables: Gender and Height w.r.t. the reference data set
Categorical (Nominal Scale) Variable – Gender (Males – 1 & Females – 2)
Height (Scale Variable)
HO: “Is there a significant difference in the height of males and females”?
Steps: Analyze ---- Compare Means -----Independent samples t-test
Height – Test Variable Box; Gender - Grouping Variable
Define Groups: Group 1 – 1; Group 2 – 2 --------Continue -------OK
Output Window: Group Statistics table – Descriptive statistics table
Independent Samples test table – Inferential statistics table
We need to focus on the middle part of the inferential statistics’ table including the
following:
a) T calculated value
b) P-value (sig.)
c) Degrees of freedom (df)
d) Equal variances assumed row for Height variable

Three ways to determine the statistical significance of the difference between the heights
across the categories (males and females):
a) T calculated Value > T (critical value); Reject HO
b) P-Value < 0.05; Reject HO
c) If the confidence interval doesn’t include 0; Reject HO
Paired Samples t-test
OR
Within-Subjects design/Repeated Measures Design/Paired Samples
Design

A single sample to be measured at different points in time (just like a before and after
design) and since, the groups are dependent of each other; we can appropriately apply paired
samples t-test.
Paired Samples t-test ---- Comparing two related means about population
Variables: Gender and Height & Weight w.r.t. the reference data set
Categorical (Nominal Scale) Variable – Gender (Males – 1 & Females – 2)
Height (Scale Variable) & Weight (Scale Variable)
Steps: Analyze ----- Compare Means ----- Paired samples t-test
Paired Variables Box: Height – Variable 1 & Weight – Variable 2 ------- OK
Paired Samples Statistics and Paired Samples Correlation Tables – Descriptive Statistics
Table
Paired Samples t-test – Inferential statistics table
Three ways to determine the statistical significance of the difference between the heights
across the categories (males and females):
a) T calculated Value > T (critical value); Reject HO
b) P-Value < 0.05; Reject HO
c) If the confidence interval doesn’t include 0; Reject HO

What exactly is a confidence interval?


A confidence interval is a range in which the mean difference is likely to fall 95% (α = 5%)
of the times. Alternatively, if a similar test is repeated, say, 100 times, 95% of the time the
mean difference would fall within the desired range.
The confidence interval is much more certain, but less precise too. However, with good
measurements and low variability, we can get both the mean and the confidence interval as
accurate as possible.
Positive and Negative t values (t calculated) are interpreted in a similar manner.
One-Way ANOVA Test
Case

Following a series of complaints about wicked witches, the Wizard conducts a study to
determine certain regions of city have more problems with wicked witches than other
regions. He randomly surveys five munchkins from each of four regions and records the
number of complaints he received about wicked witchiness from each one.
HO: Is there a difference in witch wickedness between regions?
Steps: Analyze ----- Compare means ----- One Way ANOVA
Right click on any variable and enable ‘Display Variable Name’
Complaints – Dependent List and Region – Factor List
Click on Post hoc ---- Click on Tukey (Each region has same no. of participants), Click -----
Games Howell ----- Continue
Click ----- Options, Select (Descriptive, Homogeneity of variance test, Welch test, means
plot) ----- Continue ----- OK

Interpretation:
Homogeneity of variance test is tested through Levene’s test. In our case Levene’s value
(standard value – 0.05) is not even close to significant 0.918. Therefore, the groups are not
statistically significantly different ------ Group variances are homogeneous.
This indicates that we do not need Welch ANOVA test and Games Howell post hoc test.

Welch’s ANOVA
Only report Welch ANOVA if Levene’s test is significant so the assumption of equality of
variances has been violated.

There was a significant difference among the four regions on wickedness complaints
Post hoc Results:
There is a significant difference between pairs of regions with North and South having fewer
complaints than East and West regions.
These findings indicate that there are more wicked witches’ complaints in the eastern and
western part of the city.
Chi- SQUARE TEST
(A test for the statistical significance of the strength of an association)

Non-Parametric Test for One Sample

The chi-square test for independence, also called Pearson's chi-square test or the

chi-square test of association, is used to discover if there is a relationship

between two categorical variables

Rationale and utility

 For applying Chi-square test, data can be ordinal or categorical. The objective is to
compare the distribution of responses, or the proportions of participants in each
response category, to a known distribution.
 The observed frequencies in each response category are compared to the frequencies
that would be expected if the null hypothesis was true.
 Χ2 = Σ(f0 – fe)2/fe
Fo = Observed frequencies Fe =
expected frequencies

Steps:
Analyze ---- Descriptive Statistics----------Cross tabs
Two nominal/categorical variables within the row and column box (either way)
Cells (Click on expected and observed)---------Continue
Statistics ---- Chi-square-----------Continue

 Expected count for any frequency must not be less than 5


 Pearson chi square value is the calculated chi-square statistic value in SPSS
 Asymptotic significance is p-value (which must be greater than the standard 5% for
any statistical insignificance for a possible rejection of Ho), Henceforth, rejecting
Ha.
 P-value > 0.05, we do not have sufficient statistical evidence to reject Ho.
Line graphs

Line graphs are used to track changes over short and long periods of time. When smaller
changes exist, line graphs are better to use than bar graphs. Line graphs can also be used to
compare changes over the same period of time for more than one group.
Correlation
A correlation is simply defined as a relationship between two variables. The whole purpose of using
correlations in research is to figure out which variables are connected.
PEARSON’S CORRELATION COIEFFICIENT (r) is applicable to determine the strength of correlation
between the two quantitative variables involved at a time.
Edward Spearman’s correlation (p) is applicable to determine the strength of correlation between two of the
qualitative nature of variables.
Correlation coefficient ranges within ±1 wherein a value closer to ±1 represents the stronger relationship
between the variables involved.
Regression
Regression analysis is used when you want to predict a continuous dependent variable from a number of independent
variables. If the dependent variable is dichotomous, then logistic regression should be used.
Regression is applied to determine the cause and effect relationship between the variables involved. Regression
conveys us the causation between the variables.
Questionnaires’ link applicable at variable description
 http://spss.allenandunwin.com.s3-website-ap-southeast-
2.amazonaws.com/data-files.html#.Xhj5L39Ki1u
SPSS data files and exercises
 http://spss.allenandunwin.com.s3-website-ap-southeast-
2.amazonaws.com/data-files.html#.YnizD7fhUzY

You might also like