RM Project Report

Download as pdf or txt
Download as pdf or txt
You are on page 1of 48

PRACTICAL FILE

ON
"RESEARCH METHODOLOGY"

Submitted in partial fulfilment of the requirement for the award


of the degree of BACHELOR OF BUSINESS ADMINISTRATION
To
GURU GOBIND SINGH INDRAPRASTHAUNIVERSITY
SUPERVISED BY: DR..VINOD JANGID SUBMITTED BY: HARSH SETHI
ENROLLMENT ROLL NO: - 06315501720

NEW DELHI INSTITUTE OF MANAGEMENT


61, TUGHLAKABAD, NEW DELHI-62

1
CERTIFICATE

This is to certify that the practical titled “Research methodology -Lab” submitted by
HARSH SETHI to New Delhi Institute of Management, Guru Gobind Singh Indraprastha
University in partial fulfilment of requirement for the award of the Bachelor of Business
Administration degree is an original piece of work carried out under my guidance and may
be submitted for evaluation.

The assistance rendered during the study has been duly acknowledged.

No part of this work has been submitted for any other degree.

Place: New Delhi Institute of


Management

Date:12/04/2022 Faculty Guide: Dr. VINOD JANGID

2
TABLE OF CONTENTS
SNo. CONTENTS Page No.
Chapter 1 Introduction to SPSS
1.1 Introduction to SPSS
1.2 How to install SPSS
Chapter 2 Layout of SPSS
2.1 Layout of SPSS
2.2 Components of SPSS
2.3 Graphs
Chapter 3 SPSS Lab Exercise
3.1 Exercise 1- Descriptive Statistics
3.2 Exercise 2 – Histogram
3.3 Exercise 3 – Crosstabs
3.4 Exercise 4 – Chi Square
3.5 Exercise 5 – T Test
3.6 Exercise 6 – One Way ANOVA
3.7 Exercise 7 – Independent Sample test
3.8 Exercise 8 – Pie Chart
3.9 Exercise 9 – 3D Graph
3.10 Exercise 10 – Bar graph
Annexure Questionnaire

3
ACKNOWLEDGEMENT

Any accomplishment requires the effort of many people and this work is
not different. Regardless of the source, I wish to express my gratitude to
those who may have contributed to this work, even though anonymously.

I would like to pay my sincere thanks to my Research Methodology


faculty Dr.VINOD JANGID under whose guidance I was able to complete
my Practical successfully. I have been fortunate enough to get all the
support, encouragement and guidance from him needed to explore,
think new and initiate.

My final thank goes out to my parents, family members, teachers and


friends who encouraged me countless times to persevere through this
entire process.

HARSH SETHI

4
CHAPTER
1
INTRODUCTION
TO SPSS

SPSS - What Is It?


SPSSmeans “Statistical Package for the Social Sciences" and
was first launched in 1968. SinceSPSS was acquired by IBM
in 2009, it's officially known as IBM SPSS Statistics but most
usersstill just refer to it as "SPSS”.
SPSS is software for editing and analysing all sorts of data.
These data may come frombasically any source: scientific
research, a customer database, Google Analytics or even the

5
server log files of a website. SPSS can open all file formats
that are commonly used forstructured data such as
spreadsheets from MS Excel orOpen Office;
plain text files (.txt or .csv); • relational (SQL) databases;
Stata and SAS.

SPSS Data View


Afteropening data, SPSS displays them in a spreadsheet-like
fashion as shown in thescreenshot belowfromfreelancers. Sav.
This sheet calleddata view- always displays our data values.
For instance, our first recordseems to contain a male
respondent from 1979 and so on. A more detailed explanation
on theexact meaning of our variables and data values is found
in a second sheet shown below.

SPSS Variable View


An SPSS data file always has a second sheet called variable
view. It shows the metadataassociated with the data. Metadata
is information about the meaning of variables and datavalues.
This is generally known as the "codebook" but in SPSS it's
called the dictionaryFor non SPSS users, the look and feel of
SPSS' Data Editor window probably come closestto an Excel
workbook containing two different but strongly related sheets.

6
Data Analysis
Right, so SPSS can open all sorts of data and display them -
and their metadata in twosheets in its Data Editor window. So
how to analyse your data in SPSS? Well, one option isusing
SPSS' elaborate menu options.For instance, if our data contain
a variable holding respondents' incomes over 2010, wecan
compute the average income by navigating to Descriptive
Statistics as shown below.Doing so opens a dialog box in
which we select one or many variables and one orseveral
statistics we'd like to inspect.
SPSS Output Window
After clicking Ok, a new window opens up: SPSS' output
viewer window. It holds a nice tablewith all statistics on all
variables we chose. The screenshot below shows what it looks
like.As we see, the Output Viewer window has a different
layout and structure than the Data Editorwindow we saw
earlier. Creating output in SPSS does not change our data in
any way, unlike Excel,SPSS uses different windows for data
and research outcomes based on those data.For non- SPSS
users, the look and feel of SPSS' Output Viewer window
probably comesclosest to a Powerpoint slide holding items
such as blocks of text, tables and charts.
SPSS Reporting
SPSS Output items, typically tables and charts, are easily
copy-pasted into other programs.For instance, many SPSS
users use a word processor such as MS Word, OpenOffice
orGoogleDocs for reporting. Tables are usually copied in rich
text format, which means they'llretain their styling such as
fonts and borders. The screenshot below illustrates the result.
7
SPSS Syntax Editor Window
The output table we showed was created by running
Descriptive Statistics from SPSS' menu.Now, SPSS has a
second option for running this (or any other) command: we
can open athird window, known as the syntax editor window.
Here we can type and run SPSS codeknown as SPSS syntax.
For instance, running descriptive income_2010. has theexact
same result as running this command from SPSS' menu like
we did earlierBesides typing commands into the Syntax Editor
window, most of them can also be pastedinto it by clicking
through SPSS' menu options. Like so, SPSS users unfamiliar
with syntax canstill use it. But why use syntax if SPSS has
such a nice menu?The basic point is that syntax can be saved,
corrected, rerun and shared between projects orusers. Your
syntax makes your SPSS work replicable. If anybody raises
any doubts regardingyour outcomes, you can show exactly
what you did and -if needed-correct and rerun it inseconds.For
non SPSS users, the look and feel of SPSS' Syntax Editor
window probably come closestto Notepad: a single window
basically just containing plain text.
SPSS - Overview Main Features
Now that we have a basic idea of how SPSS works, let's take a
look at what it can do.Following a typical project workflow,
SPSS is great forOpening data_files, either in SPSS' own file
format or many others;Editing data such as computing sums
and means over columns or rows of data. SPSShasoutstanding
options for more complex operations as well.Creating tables
and charts containing frequency counts or summary statistics
over (groupsof) cases and variables.running inferential
8
statistics such as ANOVA, regression and factor
analysis.saving_data_and output in a wide variety of file
formats.
We'll now take a closer look at each one of these features.
Opening Data Files
SPSS has its own data file format. Other file formats it easily
deals with include MS Excel,plain text files, SQL, Stata and
SAS.
Web analytics data -often downloaded as Excel files- can
easily be opened andfurther analysed in SPSS.
Editing Data
In real world research, raw data usually need some editing
before they can be properlyanalysed. Typical examples are
creating means or sums as new variables, restructuring dataor
detecting and removing unlikely observations. SPSS performs
such tasks -and morecomplex ones with amazing
efficiency.For getting things done fast, SPSS contains many
numeric functions, string functions, datefunctions and other
handy routines.
Tables and Charts
All basic tables and charts can be created easily and fast in
SPSS. Typical examples aredemonstrated under Data
Analysis. A real weakness of SPSS is that its charts tend to
beugly and often have a clumsy layout. A great way
toovercome this problem is developingand applying SPSS
chart templates. Doing so, however, requires a fair amount of
effort andexpertise

9
SPSS clustered bar chart with chart template applied.
Inferential Statistics
SPSS contains all basic statistical tests and multivariate
analyses such as
. t-tests;
. chi-square tests
. ANOVA
. correlations and other association measures;
. regression
. nonparametric tests
. factor analysis
. cluster analysis

Some analyses are available only after purchasing additional


SPSS options on top of themain program. An overview of all
commands and the options to which they belong ispresented
in Overview All SPSS Commands
SPSS One Sample T-Test Output Example
Saving Data and Output
SPSS data can be saved as a variety of file formats, including
. MS Excel
. plain text (.txt or .csv);
. Stata
. OSAS
10
The options for output are even more elaborate: charts are
often copy-pasted as images inpng format. For tables, rich text
format is often used because it retains the tables' layout,fonts
and bordersBesides copy-pasting individual output items, all
output items can be exported in one gopdf, HTML, MS Word
and many other file formats. A terrific strategy for writing a
reportis creating an SPSS output file with nicely styled tables
and chart. Then export the entiredocument to Word and insert
explanatory text and titles between the output items.Right, I
hope that gives at least a basic idea of what SPSS is and what
it does. Let's nowexplore SPSS in some more detail, starting
off with the Data Editor window. We'llpresent many more
examples in the next couple of tutorials as well.

APPLICATIONS OF SPSS
Statistical Package for the social sciences (SPSS) is awindow-
based program first launched in 1968. In 2009, SPSS is
acquired by IBM. Hence, it is officially known as IBM SPSS
statistics. SPSS is widely used in the social andbehavioural
sciences. It is also used by health researchers, market
researchers, survey companies, education researchers,
government, etc. Various windows can be opened when using
SPSS such as data editor, output navigator, pivot table editor,
chart editor, text output editor, and syntax editor. The data
editor is a spreadsheet in which variables can be defined and
entered into the data. Each row corresponds to a case while
each column represents a variable. This window opens
11
automatically when SPSS is started. The output navigator
window displays the statistical results, tables, and charts from
the analysis. Output displayed in pivot tables can be modified
in many ways with the pivot table editor. It is possible to
modify and save high-resolution charts and plots by invoking
the chart editor for a certain chart in an output navigator
window. Text output not displayed in pivot tables can be
modified with the text output editor.SPSS contains all basic
statistical tests and multivariate analyses such as t-tests, chi-
square tests, ANOVA, correlations and regressions, non-
parametric tests, cluster analysis, etc.IBM SPASS statistics 26
continues to increase accessibility to advanced analytics
through improved tools, integration, output, and ease-of-use
features. This release mainly focuses on increasing the
analytic capabilities of the software through quantile
regression, ROC analysis, Bayesian statistics, one
samplebinomial and Poisson enhancements, reliability
analysis, and command enhancements.SPSS software is used
for editing and analysing all sorts of data available from
scientific research, clinical studies, customer database, Google
Analytics, etc. SPSS can open all file formats that are
commonly used for structured data such as spreadsheets from
MS Excel, plain text files, relational database, stata, SAS, etc.
SPSS Statistics can read and write data from ASCII text files,
other statistics packages, spreadsheets, and databases.
Statistical output is a proprietary file format and the
proprietary output can be exported to text or Microsoft word,
pdf, excel, and other formats. The typical workflow of SPSS
software is as follows:
• Opening data files in SPSS file format or others.

12
• Editing data such as computing sums and means over
columns or rows of data.
• Creating tables and charts containing frequency counts or
summary statistics over cases and variables.
• Running inferential statistics such as one-way ANOVA,
two-way ANOVA, regression, correlation, factor analysis, etc.
• Saving data and output in different file formats.

13
CHAPTER
2
LAYOUT OF
SPSS

14
Opening SPSS Data
When SPSS is launched, a pop-up window (Error! Reference source not found.) with a few
options will appear. Assume the goal is to analyze a data set, one can select New Dataset or
open a file recently used or another file under Recent Files and then click OK. The other
windows shows What’s New, Modules and Programmability and Tutorials, which help one to
navigate SPSS. IBM SPSS Welcome Screen Sometimes you have already entered the SPSS
session as described above, worked on a data set for a while, and then want to open and work
on another data set. You do not have to quit the current SPSS session to perform this. Simply
click on the File menu, follow Open then Data… and find your file .

OPEN SPSS

SPSS looks like this when you start it

SPSS Windows

The SPSS program has three main types of windows: the data editor, output window and
syntax window. The data editor window is open by default, and contains the data set. It
consists of two views, the Data View and the Variable View. This window is described in

15
more detail in the section on Working With Data and Variables. Data files are saved with a
file type of .sav.

The output window holds the results of analyses. This window will open automatically once
an analysis is requested. The tables of the Output Viewer are saved (click File, Save or Save
As) with a file type of .spv, which can only be opened with SPSS software.

The syntax window contains written commands corresponding to each menu command and
options. Syntax can be created by hitting {Paste} instead of {oK} on main windows for each
procedure. Using {Paste} will not cause the procedure to be performed. To run procedures
from the syntax window, click on .

The syntax window will only open if a syntax file is opened by the user, or if the paste option
is used when executing a command. Output and syntax files can be saved and opened using
the File menu. Multiple output and syntax files can be open at the same time. Syntax files are
saved as plain text and almost any text editor can open them, but with a file extension of .sps.

Dialogue Boxes

Although each dialog box is unique, they have many common features. A fairly typical
example is the dialog box for producing frequency tables (tables with counts and percents).
To bring up this dialog box from the menus in the data window, click on Analyze
Descriptive Statistics  Frequencies.

Working with Data and Variables

Viewing data and variables

16
Data in SPSS can be viewed in two different ways: data view and variable view. The data
view allows the user to look at the entire data set, with each row showing a different
observation, and each column representing a different variable. Another way to view the data
is to use the variable view. This shows the variable names and general properties for each
variable. The user can alternate between these views using the tabs at the bottom left hand
side of the SPSS data editor window, Figure 8 below shows the data view.

Define variable properties

To define or change the attributes of variables, change to “Variable View” to see a list of all
the variables with their properties from the current data file. Click or double click the variable
you would like to specify or change. Descriptions of each attribute are shown below

17
Name is the name of the variable. Rules for establishing variable names can be found on
IBM SPSS help  Command Syntax Reference  Universals  Variables  Variable Names.

Type is the type of a variable. Common options are Numeric for numbers, Date for dates,
and String for character strings. The string option allows the user to type in any set of
characters including punctuation marks and blank spaces. It is ideal for inputting open- ended
questions which are not coded.

Widthis the maximal number of characters or digits allowed for a variable. Generally a
width large enough to accommodate all the possible values of the variable should be chosen;
otherwise any values with length greater than the specified value will be truncated.

Decimals are valid for numeric variables only. It specifies the number of decimals to be
kept for a variable. All the extra decimals will be rounded up and the rounded numbers will
be used in all the analysis, so be careful to specify the number of decimals to fit the required
precision.

Labelis the descriptive label for a variable. One can assign descriptive variable labels up to
256 characters long, and variable labels can contain spaces and reserved characters not
allowed in variable names.

Valuesis the descriptive value labels for each value of a variable. This is particularly useful
if the data file uses numeric codes to represent non-numeric categories (for example, codes of
1 and 2 for male and female).

18
Missing specifies some data values as user-missing values. Refer to the Missing Values
section for more detail.

Columns is the column width for a variable. Column formats affect only the display of
values in the Data Editor. Changing the column width does not change the defined width of a
variable. If the defined and actual width of a value are wider than the column, asterisks (*)
are displayed in the Data 10 view. Column widths can also be changed in the Data view by
clicking and dragging the column borders.

Aligncontrols the display of data values and/or value labels in the Data view. The default
alignment is right for numeric variables and left for string variables. This setting affects only
the display in the Data view.

Measure is the level of measurement as scale (numeric data on an interval or ratio scale),
ordinal, or nominal. Nominal and ordinal data can be either string (alphanumeric) or numeric.
Nominal and ordinal are both treated as categorical. The variable, origin (Country of Origin)
is measured on a nominal scale as the cars are distinguished on the basis of a name or label,
i.e. American, European, and Japanese; whilst the variable gallon (miles per gallon) is
measured on a scale, specifically, ratio measurement scale because the difference between
measurements and ratios are meaningful, and that they have a true zero value.

To download the Cars data file as an SPSS file (i.e. with the .sav extension and all variable
attributes edited as in the example) already click here.

Missing Values

Missing values are a topic that deserves special attention. This section explains why they
arise and how to define them. In SPSS there are two types of missing values: user defined
missing values and system missing values. By default in SPSS, both types of missing values
will be disregarded in all statistical procedures, except for analyses devoted specifically to
missing values, for example, replacing missing values. In frequency tables, missing values
will be shown, but they will be marked as such and will not be used in computation.

User Defined Missing Values

User defined missing values indicate data values that are either missing, due to reasons like
nonresponse, or are not desired to be used in most analyses (e.g. “Not Applicable”.) By
default SPSS uses “.” to represent missing values. In some cases, there might be the need to
distinguish between data missing because a respondent refused to answer and data missing

19
because the question did not apply to that respondent, and thus would like more than one
expression for missing values. One can achieve this by setting up the “Missing” property of
the corresponding variable to specify some data values as missing values. These options
allows one to enter up to three discrete missing values, a range of missing values, or a range
plus one discrete missing value. All string values, including null or blank values, are
considered valid values unless they are explicitly defined as missing. To define null or blank
values as missing for a string variable, enter a single space in one of the fields for discrete
missing values. You will notice missing values denoted by “.”, for the variable mpg
observations 11-15. The example in below shows how to specify user defined missing values
for variable mpg by setting up its Missing” property

System Missing Values

System missing values occur when no value can be obtained for a variable during data
transformations. For example, if there are two variables, one indicating a person’s gender and
the other whether she or he is married and you create a new variable that tells whether (a) a
person is male and married, (b) female and married, (c) male and not married, all females that
are not married will have a system missing value (“.”) instead of a real value.

Modifying and creating new variables

Insert

20
The easiest way to manually input a new variable is to scroll through the data view
spreadsheet horizontally until the first empty column is encountered, and entering in the data.
The new variable can be named appropriately in the variable view spreadsheet. Alternatively,
selecting the “Insert Variable” option under the “Data” menu allows insertion of a new
variable at other locations in the table. By default, this inserts the new variable in the first
column of the spreadsheet, but this can be changed by highlighting the column to the right of
the desired location.

Recode

The recode function can be used to collapse ranges of data into categorical variables, and
reassigning existing values to other values. To create a new variable as a function of another
(log, sin, etc), use “Compute” (described in the next section.)

1. Select Recode into Different Variables under the Transform menu. Recoding into Same
Variables is not recommended, since it will change existing variables and you will lose the
original values.

2. Select each variable to be transformed, and move it into the section on the right hand side
using the button. Note that the same transformation will be applied to all of these variables. If
different types of transformations are required, each transformation needs to be done
separately.

3. If new variables are being created, define name for the output variable on the right hand
side. If desired, a label can be entered as well, though it is not required. Once the desired
name and label are entered, you must click the Change button.

4. Select the Old and New Values button and the window below will appear. In the Old Value
side of the window, select the appropriate original values to be recoded.

21
a) By selecting Value one can specify a value to replace (e.g. “male” or “1”). It is case
sensitive, so “A” and “a” are considered to be different values.

b) System-Missing and System- or user-missing allows missing values to be replaced by


actual values. It is not recommended to recode missing values using this method, unless there
is a strong reason to do so. Missing values should be handled with care, using techniques
such as multiple imputation.

c) The three range options partition numeric variables into categories. The above figure
demonstrates how a range of continuous variables can be condensed into a category. Rather
than running any procedures to find out the range of variables, the range options with
LOWEST through value: and value through HIGHEST: can be used to catch every point in
the data set.

d) All other values can be used to pick up values not specifically referenced elsewhere.

5. On the New Value side, type in the new value. Then click the Add button to add it to the
Old->New list. When recoding into different variables, one has the option of changing
numbers to strings, or converting numbers saved as strings to numbers. Unless otherwise
specified, the new variable will be saved in the same format as the original variable. Click
Continue to close the window. On the main screen, click OK or Paste to finish

22
Compute

Suppose you want to create a new variable, measuring the ratio of the vehicle’s weight to its
horsepower, you define the new variable as weihorse for the weight per unit horsepower. To
create a new variable as a function of one or more existing variables, select Compute from
the Transform menu. Enter the name of the new variable, weihorse, in the Target Variable
box. In the Numeric Expression box, use the keypad, function list, and the variable list to
write out the equation used to compute the new variable, (in this example: weight/horse).
Click OK or Paste to close the window..

23
CREATING GRAPHS

Graphs in SPSS may be generated using one of two options. The first option is the Legacy
Dialogs, which allows one to create basic charts and graphs. The second option is to use the
Chart Builder which allows one to generate charts either from a predefined gallery or by
specifying individual parts (for example, axes and bars). The steps to create a few common
graphs are shown below. However, SPSS has the ability to produce many other graphs such
as population pyramid, error bar, and 3-D bar chart. The Chart Builder allows more flexibility
in creating graphs. For any graph generated in SPSS, one can double click on the graph to
invoke a Chart Editor window, inside which one can double click any part of the graph to edit
it.

Scatterplot

Suppose, we seek to investigate the linear relationship between miles per gallon and the
vehicle weight, we first plot a scatterplot to see the direction in which they are related. We
will introduce the Simple Scatterplot. In the “Graphs” menu, choose Legacy Dialogs 
Scatter/dot. Select Simple Scatter, click on the Define button to get the window shown below.
Select a variable for the Y-axis and a variable for the X-axis. These variables must be numeric
and not in date format. One can also select a categorical variable to define rows of panels and
another categorical variable to define columns of panels. Using the “Title” button one may
specify the title, subtitles and the footnotes for the plot. In the following example we are
plotting mpg against vehicle weight, using model year to define rows of panels

24
Histogram

A histogram shows the distribution of a single numeric variable. By selecting Legacy


Dialogs Histogram in the Graphs menu, one can generate a histogram. One can check the
Display normal curve option to require an estimated normal curve displayed over the
histogram. Suppose, you want to draw histograms of the miles per gallon based on the origin
of the vehicle, in the Panel by dialogue box, either in the rows or columns, you can put the
variable, origin, as shown below

25
Q-Q Plot

The Q-Q Plot (quantile-quantile plot) procedure plots the quantiles of a variable's
distribution against the quantiles of a variable from a test distribution. Q-Q plots are generally
used to investigate whether the distribution of a variable is consistent with a proposed
distribution. Specifically, Q-Q plots can be used to investigate whether a variable (e.g.
residuals in a regression model) follows a Normal distribution. If the distribution of the
variable and the proposed distribution are the same, points in the Q-Q plot follow a straight
line. If the distributions are not similar, points in the Q-Q plot deviate from the straight line.
Suppose you want to generate a Q-Q plot with a Normal distribution as the test distribution.
Select Descriptive Statistics  Q-Q Plots in the Analyze menu. Enter the variables you want
to plot into the Variables box, and select Normal by clicking Test Distribution. Click OK to
generate the plot.

26
Syntax File

Here is an example of the syntax for the Q-Q plot in Figure 18. After selecting Descriptive
Statistics  QQ Plots in the Analyze menu. You enter the variables, mpg (Miles per Gallon),
you want to plot into the Variables box, and select Normal by clicking Test Distribution. Then
you click Paste to generate the syntax, below

27
You can save the syntax as a .sps file for later access in running the analysis.

CHAPTER
3
28
SPSS LAB
EXRECISE

It will look like this after you Apply (IF Condition) in excel
sheet.

DATA VIEW of SPSS


29
VARIABLE VIEW of SPSS

30
EXERCISE 1

 DESCRIPTIVE STATISTICS
 In the Analyze menu, the option Descriptive Statistics produces a
submenu with the choices Frequencies, Descriptives, Explore, Crosstabs,
and Ratio. Of these, Crosstabs and Descriptives have some particularly
useful features which this manual will cover. For more information on the
other three, more information can be found in the SPSS help menu, which
is discussed on section “Help in SPSS” of this manual.
 Descriptives
 The descriptives procedure calculates univariate statistics for selected
variables. In addition, it provides the option of creating a standardized
variable for the selected variables. Simply check the box at the bottom of
the window to save the standardized variable. The options menu provides
a list of univariate statistics available. For more statistics or computing
statistics by group, see the Means procedure under Compare Means


 Frequencies

Statistics
which How often Amount
brand Are you you drink spend on Why do
Energy willing to your your you drink
drink you change Energy energy energy
AGE prefer the brand drink? drink drink?
N Valid 11 11 11 11 11 11
Missi 0 0 0 0 0 0
ng
Mean 1.18 1.27 2.09 2.09 1.45 2.36
Median 1.00 1.00 2.00 2.00 1.00 2.00
Mode 1 1 2 2 1 1a
a. Multiple modes exist. The smallest value is shown

31
Frequency Table

AGE
Frequen Valid Cumulative
cy Percent Percent Percent
Valid 16-24 9 81.8 81.8 81.8
24-32 2 18.2 18.2 100.0
Total 11 100.0 100.0

which brand Energy drink you prefer


Frequen Valid Cumulative
cy Percent Percent Percent
Valid Red 8 72.7 72.7 72.7
bull
Monste 3 27.3 27.3 100.0
r
Total 11 100.0 100.0

Are you willing to change the brand


Frequen Valid Cumulative
cy Percent Percent Percent
Valid Yes 2 18.2 18.2 18.2
No 6 54.5 54.5 72.7
Mayb 3 27.3 27.3 100.0
e
Total 11 100.0 100.0

How often you drink your Energy drink?


Frequen Valid Cumulative
cy Percent Percent Percent
Valid Daily 1 9.1 9.1 9.1
Once in a 9 81.8 81.8 90.9
week
Other 1 9.1 9.1 100.0

32
Total 11 100.0 100.0

Amount spend on your energy drink


Frequen Valid Cumulative
cy Percent Percent Percent
Valid 1000- 8 72.7 72.7 72.7
1500
1500- 2 18.2 18.2 90.9
2000
2500- 1 9.1 9.1 100.0
3000
Total 11 100.0 100.0

Why do you drink energy drink?


Frequen Valid Cumulative
cy Percent Percent Percent
Valid It give energy 3 27.3 27.3 27.3
it make you more 3 27.3 27.3 54.5
active
it give you 3 27.3 27.3 81.8
enthusiasm
Other 2 18.2 18.2 100.0
Total 11 100.0 100.0

EXERCISE 2

Histogram

The histogram is a popular graphing tool. It is used to summarize discrete or


continuous data that are measured on an interval scale. It is often used to
illustrate the major features of the distribution of the data in a convenient form.
It is also useful when dealing with large data sets (greater than
100 observations). It can help detect any unusual observations (outliers) or any
gaps in the data.
A histogram divides up the range of possible values in a data set into classes or
groups. For each group, a rectangle is constructed with a base length equal to
the range of values in that specific group and a length equal to the number of

33
observations falling into that group. A histogram has an appearance similar to a
vertical bar chart, but there are no gaps between the bars.

34
35
36
EXERCISE 3

Crosstabs

The Crosstabs procedure forms two-way and multi-way tables and


provides a variety of tests and measures of association for two-way
tables. Multi-way tables are formed using the ‘Layer’ button. Note
that tests are not made across layers. When layers are used,
comparisons are made for the row and column variables at each value
of the layer variable. The Statistics button at the bottom allows
various statistics to be computed, including correlations and Chi-
square tests. To help uncover patterns in the data that contribute to a
significant chi-square test, the Cells button provides options for
displaying expected frequencies and three types of residuals
(deviates) that measure the difference between observed and expected
frequencies. Each cell of the table can contain any combination of
counts, percentages, and residuals selected
37
Case Processing Summary
Cases
Valid Missing Total
N Percent N Percent N Percent
AGE * which brand 11 100.0% 0 0.0% 11 100.0%
Energy drink you
prefer
AGE * Are you willing 11 100.0% 0 0.0% 11 100.0%
to change the brand
AGE * How often you 11 100.0% 0 0.0% 11 100.0%
drink your Energy
drink?
AGE * Why do you 11 100.0% 0 0.0% 11 100.0%
drink energy drink?

AGE * which brand Energy drink you prefer

Crosstab
which brand Energy drink
you prefer
Red bull Monster Total
AGE 16-24 Count 6 3 9
% within AGE 66.7% 33.3% 100.0%
% within which brand 75.0% 100.0% 81.8%
Energy drink you
prefer
24-32 Count 2 0 2
% within AGE 100.0% 0.0% 100.0%
% within which brand 25.0% 0.0% 18.2%
Energy drink you
prefer
Total Count 8 3 11
% within AGE 72.7% 27.3% 100.0%

38
% within which brand 100.0% 100.0% 100.0%
Energy drink you
prefer

Chi-Square Tests
Asymptotic
Significance Exact Sig. Exact Sig.
Value df (2-sided) (2-sided) (1-sided)
Pearson Chi-Square .917a 1 .338
Continuity Correctionb .006 1 .936
Likelihood Ratio 1.434 1 .231
Fisher's Exact Test 1.000 .509
Linear-by-Linear .833 1 .361
Association
N of Valid Cases 11
a. 3 cells (75.0%) have expected count less than 5. The minimum expected count is
.55.
b. Computed only for a 2x2 table

AGE * Are you willing to change the brand

Crosstab
Are you willing to change
the brand
Yes No Maybe Total
AGE 16-24 Count 0 6 3 9
% within AGE 0.0% 66.7% 33.3% 100.0%
% within Are you 0.0% 100.0% 100.0% 81.8%
willing to change the
brand
24-32 Count 2 0 0 2
% within AGE 100.0% 0.0% 0.0% 100.0%
% within Are you 100.0% 0.0% 0.0% 18.2%
willing to change the
brand
Total Count 2 6 3 11
% within AGE 18.2% 54.5% 27.3% 100.0%

39
% within Are you 100.0% 100.0% 100.0% 100.0%
willing to change the
brand

Chi-Square Tests
Asymptotic
Significance
Value df (2-sided)
Pearson Chi-Square 11.000a 2 .004
Likelihood Ratio 10.431 2 .005
Linear-by-Linear 5.926 1 .015
Association
N of Valid Cases 11
a. 6 cells (100.0%) have expected count less than 5.
The minimum expected count is .36.

AGE * Why do you drink energy drink?

Crosstab
Why do you drink energy drink?
it make it give you
It give you more enthusias
energy active m other Total
AGE 16- Count 3 3 2 1 9
24 % within AGE 33.3% 33.3% 22.2% 11.1% 100.0
%
% within Why do 100.0% 100.0% 66.7% 50.0% 81.8%
you drink energy
drink?
24- Count 0 0 1 1 2
32 % within AGE 0.0% 0.0% 50.0% 50.0% 100.0
%
% within Why do 0.0% 0.0% 33.3% 50.0% 18.2%
you drink energy
drink?

40
Total Count 3 3 3 2 11
% within AGE 27.3% 27.3% 27.3% 18.2% 100.0
%
% within Why do 100.0% 100.0% 100.0% 100.0 100.0
you drink energy % %
drink?

Oneway ANOVA

ANOVA
Amount spend on your energy drink
Sum of Mean
Squares df Square F Sig.
Between 2.672 1 2.672 3.971 .077
Groups
Within 6.056 9 .673
Groups
Total 8.727 10

ANOVA
Amount spend on your energy drink
Sum of Mean
Squares df Square F Sig.
Between 2.672 1 2.672 3.971 .077
Groups
Within 6.056 9 .673
Groups
Total 8.727 10

EXERCISE 4

41
The Chi-Square Test of Independence determines whether there is an
association between categorical variables (i.e., whether the variables are
independent or related). It is a nonparametric test.

This test is also known as:

 Chi-Square Test of Association.

This test utilizes a contingency table to analyze the data. A contingency table
(also known as a cross-tabulation, crosstab, or two-way table) is an arrangement
in which data is classified according to two categorical variables. The categories
for one variable appear in the rows, and the categories for the other variable
appear in columns.

The Chi-Square Test of Independence is commonly used to test the following:

 Statistical independence or association between two or more categorical


variables.

The Chi-Square Test of Independence can only compare categorical variables. It


cannot make comparisons between continuous variables or between categorical
and continuous variables. Additionally, the Chi-Square Test of Independence
only assesses associations between categorical variables, and cannot provide
any inferences about causation.

Chi-Square Tests
Asymptotic
Significance
Value df (2-sided)
Pearson Chi-Square .543a 2 .762
Likelihood Ratio .896 2 .639
Linear-by-Linear .041 1 .839
Association
N of Valid Cases 11
a. 5 cells (83.3%) have expected count less than 5. The
minimum expected count is .18.

42
EXERCISE 5

T-Test
A t-test is a type of inferential statistic used to determine if there is a significant
difference between the means of two groups, which may be related in certain
features. It is mostly used when the data sets, like the data set recorded as the
outcome from flipping a coin 100 times, would follow a normal distribution
and may have unknown variances. A t-test is used as a hypothesis testing tool,
which allows testing of an assumption applicable to a population.

A t-test looks at the t-statistic, the t-distribution values, and the degrees of
freedom to determine the statistical significance. To conduct a test with three or
more means, one must use an analysis of variance.

Group Statistics
Std. Std. Error
AGE N Mean Deviation Mean
Amount spend on 16-24 9 1.22 .441 .147
your energy drink 24-32 2 2.50 2.121 1.500

EXERCISE 6
One Way ANOVA
The one-way analysis of variance (ANOVA) is used to determine whether there
are any statistically significant differences between the means of three or more
independent (unrelated) groups. This guide will provide a brief introduction to
the one-way ANOVA, including the assumptions of the test and when you
should use this test.

Oneway ANOVA

ANOVA
Amount spend on your energy drink
Sum of Mean
Squares df Square F Sig.

43
Between 2.672 1 2.672 3.971 .077
Groups
Within 6.056 9 .673
Groups
Total 8.727 10

ANOVA
Amount spend on your energy drink
Sum of Mean
Squares df Square F Sig.
Between 2.672 1 2.672 3.971 .077
Groups
Within 6.056 9 .673
Groups
Total 8.727 10

EXERCISE 7

The Independent Samples t Test compares the means of two independent groups
in order to determine whether there is statistical evidence that the associated
population means are significantly different. The Independent Samples t Test is
a parametric test.

This test is also known as:

 Independent t Test
 Independent Measures t Test
 Independent Two-sample t Test
 Student t Test
 Two-Sample t Test
 Uncorrelated Scores t Test
 Unpaired t Test
 Unrelated t Test

The variables used in this test are known as:

 Dependent variable, or test variable


Independent variable, or grouping variable

44
Independent Samples Test
Levene's
Test for
Equality of
Variances t-test for Equality of Means
95%
Confidence
Interval of
Significan the
ce Difference
One Two
- - Mea Std.
Sid Sid n Error
ed ed Differ Differ Low Uppe
F Sig. t df p p ence ence er r
Amount Equal 40.8 <.00 - 9 .03 .07 - .641 - .173
spend on variances 73 1 1.9 9 7 1.27 2.72
your assumed 93 8 8
energy Equal - 1.0 .27 .55 - 1.50 - 17.0
drink variances .84 19 5 0 1.27 7 19.5 37
not 8 8 93
assumed

Independent Samples Effect Sizes


95% Confidence
Standardiz Point Interval
era Estimate Lower Upper
Amount spend on Cohen's d .820 -1.558 -3.211 .165
your energy drink Hedges' .898 -1.424 -2.935 .151
correction
Glass's delta 2.121 -.602 -2.202 1.180

45
a. The denominator used in estimating the effect sizes.
Cohen's d uses the pooled standard deviation.
Hedges' correction uses the pooled standard deviation, plus a correction factor.
Glass's delta uses the sample standard deviation of the control group.

EXERCISE 8

PIE CHART
A Pie Chart is a type of graph that displays data in a circular graph. The pieces of the graph are
proportional to the fraction of the whole in each category. In other words, each slice of the pie is
relative to the size of that category in the group as a whole. The entire “pie” represents 100
percent of a whole, while the pie “slices” represent portions of the whole.
Pie charts give you a snapshot of how a group is broken down into smaller pieces. The following
chart shows what New Yorkers throw in their trash cans. You could read that New Yorkers (perhaps
surprisingly) throw a lot of recyclables into their trash, but a pie graph gives a clear picture of the
large percentage of recyclables that find their way into the trash.

BM SPSS statistics is software specifically designed for stats, especially in the social sciences. The
software is capable of creating a large number of graph types with a huge variety of options.
Unlike simpler programs like Excel, SPSS gives you a lot of options for creating pie charts.

EXERCISE 9 46

3D GRAPH
Three-dimensional graphs are rarely used in practice except for didactic purposes.
47
48

You might also like