Table Function

This document discusses how to work with two-way tables in R: creating a table from raw data with the table command, creating one directly from a matrix, and viewing tables graphically with the barplot, plot, and mosaicplot commands. It also covers marginal distributions, proportions, and a chi-squared test of independence between the two variables.

Uploaded by

Real Ecuador

12. Two Way Tables

Contents

Creating a Table from Data

Creating a Table Directly
Tools For Working With Tables
Graphical Views of Tables

Here we look at some examples of how to work with two way tables. We assume that you can enter data
and understand the different data types.

12.1. Creating a Table from Data

We first look at how to create a table from raw data. Here we use a fictitious data set, smoker.csv. This
data set was created only to be used as an example, and the numbers were created to match an example
from a text book, p. 629 of the 4th edition of Moore and McCabe's Introduction to the Practice of
Statistics. You should look at the data set in a spreadsheet to see how it is entered. The information is
ordered in a way to make it easier to figure out what information is in the data.

The idea is that 356 people have been polled on their smoking status (Smoke) and their socioeconomic
status (SES). For each person it was determined whether or not they are current smokers, former
smokers, or have never smoked. Also, for each person their socioeconomic status was determined (low,
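middle, or high). The data file contains only two columns, and when read with stringsAsFactors=TRUE R interprets them both as factors
(since R 4.0.0, read.csv no longer converts character columns to factors by default):

> smokerData <- read.csv(file='smoker.csv', sep=',', header=T, stringsAsFactors=TRUE)
> summary(smokerData)
     Smoke         SES
 current:116   High  :211
 former :141   Low   : 93
 never  : 99   Middle: 52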

You can create a two way table of occurrences using the table command and the two columns in the data
frame:

> smoke <- table(smokerData$Smoke, smokerData$SES)
> smoke

          High Low Middle
  current   51  43     22
  former    92  28     21
  never     68  22      9
In this example, there are 51 people who are current smokers and are in the high SES. Note that it is
assumed that the two lists given in the table command are both factors. (More information on this is
available in the chapter on data types.)
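
If smoker.csv is not at hand, the same table can be reproduced from the counts shown above. The sketch below is only an illustration: it assumes the raw data has one row per person, and it rebuilds a stand-in data frame by repeating each Smoke/SES combination as many times as it occurs. The names cells and n are made up for this example, and the dnn argument is used only to label the two dimensions.

```r
# Hypothetical stand-in for smoker.csv: one row per person,
# rebuilt from the cell counts shown in the table above.
cells <- expand.grid(SES   = c("High", "Low", "Middle"),
                     Smoke = c("current", "former", "never"))
n <- c(51, 43, 22,   # current: High, Low, Middle
       92, 28, 21,   # former
       68, 22,  9)   # never

# Repeat each combination n times; expand.grid already returns factors.
smokerData <- cells[rep(seq_len(nrow(cells)), times = n), c("Smoke", "SES")]

# dnn labels the rows and columns of the resulting two way table.
smoke <- table(smokerData$Smoke, smokerData$SES, dnn = c("Smoke", "SES"))
smoke
```

The printed table should agree with the one above, with the extra Smoke and SES labels on the two dimensions.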

12.2. Creating a Table Directly

Sometimes you are given data that is already summarized as a table of counts and would like to create the
corresponding table in R. Here we examine how to create the table directly. Unfortunately, this is not as
direct a method as might be desired. Here we create an array of numbers, specify the row and column names,
and then convert it to a table.

In the example below we will create a table identical to the one given above. In that example we have 3
columns, and the numbers are specified by going across each row from top to bottom. We need to specify
the data and the number of columns:

> smoke <- matrix(c(51,43,22,92,28,21,68,22,9), ncol=3, byrow=TRUE)
> colnames(smoke) <- c("High","Low","Middle")
> rownames(smoke) <- c("current","former","never")
> smoke <- as.table(smoke)
> smoke
        High Low Middle
current   51  43     22
former    92  28     21
never     68  22      9
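
As a possible shortcut, the row and column names can be passed to matrix through its dimnames argument, so the table is built in one step. This is a sketch of an alternative, not the method used above; the dimension labels Smoke and SES are added here for illustration.

```r
# Build the matrix and attach row/column names in a single call.
smoke <- as.table(matrix(c(51, 43, 22,
                           92, 28, 21,
                           68, 22,  9),
                         ncol = 3, byrow = TRUE,
                         dimnames = list(Smoke = c("current", "former", "never"),
                                         SES   = c("High", "Low", "Middle"))))
smoke
```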

12.3. Tools For Working With Tables

Here we look at some of the commands available to help look at the information in a table in different
ways. We assume that the data has been entered using one of the methods above, and that the table is called
smoke. First, there are a couple of ways to get graphical views of the data:

> barplot(smoke, legend=T, beside=T, main='Smoking Status by SES')
> plot(smoke, main="Smoking Status By Socioeconomic Status")

There are a number of ways to get the marginal distributions using the margin.table command. If you just
give the command the table it calculates the total number of observations. You can also calculate the
marginal distributions across the rows or columns based on the one optional argument:

> margin.table(smoke)
[1] 356
> margin.table(smoke,1)

current  former   never
    116     141      99
> margin.table(smoke,2)

  High    Low Middle
   211     93     52
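
If you would rather see the counts and both sets of totals in a single display, the addmargins command from base R appends a Sum row and a Sum column to the table. The sketch below rebuilds the smoke table so that it runs on its own; the variable name withTotals is chosen only for this example.

```r
# Rebuild the table of counts used throughout this section.
smoke <- as.table(matrix(c(51, 43, 22, 92, 28, 21, 68, 22, 9),
                         ncol = 3, byrow = TRUE,
                         dimnames = list(Smoke = c("current", "former", "never"),
                                         SES   = c("High", "Low", "Middle"))))

# addmargins appends the row totals, column totals, and grand total.
withTotals <- addmargins(smoke)
withTotals
```

The Sum column should match margin.table(smoke,1), the Sum row should match margin.table(smoke,2), and the bottom right entry is the total number of observations.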

Combining these commands you can get the proportions:

> smoke/margin.table(smoke)

              High        Low     Middle
current 0.14325843 0.12078652 0.06179775
former  0.25842697 0.07865169 0.05898876
never   0.19101124 0.06179775 0.02528090
> margin.table(smoke,1)/margin.table(smoke)

  current    former     never
0.3258427 0.3960674 0.2780899
> margin.table(smoke,2)/margin.table(smoke)

     High       Low    Middle
0.5926966 0.2612360 0.1460674

That is a little cumbersome, so fortunately there is a better way to get the proportions using the prop.table
command. You can specify the proportions with respect to the different marginal distributions using the
optional argument:

> prop.table(smoke)

              High        Low     Middle
current 0.14325843 0.12078652 0.06179775
former  0.25842697 0.07865169 0.05898876
never   0.19101124 0.06179775 0.02528090
> prop.table(smoke,1)

             High       Low    Middle
current 0.4396552 0.3706897 0.1896552
former  0.6524823 0.1985816 0.1489362
never   0.6868687 0.2222222 0.0909091
> prop.table(smoke,2)

             High       Low    Middle
current 0.2417062 0.4623656 0.4230769
former  0.4360190 0.3010753 0.4038462
never   0.3222749 0.2365591 0.1730769
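
A quick way to confirm which margin was used is to check which direction sums to one. The following sketch rebuilds the smoke table so that it runs on its own, verifies that the row proportions sum to one across each row, and rounds them to percentages for easier reading. The name rowProps is introduced only for this example.

```r
# Rebuild the table of counts used throughout this section.
smoke <- as.table(matrix(c(51, 43, 22, 92, 28, 21, 68, 22, 9),
                         ncol = 3, byrow = TRUE,
                         dimnames = list(c("current", "former", "never"),
                                         c("High", "Low", "Middle"))))

# Proportions within each smoking status (margin 1 = rows).
rowProps <- prop.table(smoke, 1)

# Each row should sum to 1 when the row margin is used.
rowSums(rowProps)

# The same proportions as percentages, rounded to one decimal place.
round(100 * rowProps, 1)
```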

If you want to do a chi-squared test to determine if the proportions are different, there is an easy way to
do this. If we want to test at the 95% confidence level we need only look at a summary of the table:

> summary(smoke)
Number of cases in table: 356
Number of factors: 2
Test for independence of all factors:
        Chisq = 18.51, df = 4, p-value = 0.0009808

Since the p-value is less than 5% we can reject the null hypothesis at the 95% confidence level and can say
that the proportions vary.

Of course, there is a hard way to do this. This is not for the faint of heart and involves some linear algebra
which we will not describe. If you wish to calculate the table of expected values then you need to multiply
the vectors of the margins and divide by the total number of observations:

> expected <- as.array(margin.table(smoke,1)) %*% t(as.array(margin.table(smoke,2))) / margin.table(smoke)
> expected

            High      Low   Middle
current 68.75281 30.30337 16.94382
former  83.57022 36.83427 20.59551
never   58.67697 25.86236 14.46067

(The t function takes the transpose of the array.)
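
The same expected counts can also be obtained without explicit matrix multiplication. The sketch below is one alternative, not the approach used above: it uses outer to form every product of a row total and a column total, then divides by the grand total. The table is rebuilt so that the example runs on its own.

```r
# Rebuild the table of counts used throughout this section.
smoke <- as.table(matrix(c(51, 43, 22, 92, 28, 21, 68, 22, 9),
                         ncol = 3, byrow = TRUE,
                         dimnames = list(c("current", "former", "never"),
                                         c("High", "Low", "Middle"))))

# Expected count for a cell = (row total * column total) / grand total.
expected <- outer(rowSums(smoke), colSums(smoke)) / sum(smoke)
expected
```

Because the row and column totals carry their names, the result keeps the row and column labels of the original table.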

The result is in this array and can be directly compared to the existing table. We need the square of the
difference between the two tables divided by the expected values. The sum of all these values is the
Chi-squared statistic:

> chi <- sum((expected - as.array(smoke))^2/expected)
> chi
[1] 18.50974

We can then get the p-value for this statistic:

> 1-pchisq(chi,df=4)
[1] 0.0009808236
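
The hand calculation can be cross-checked with the chisq.test command, which is part of base R and reports the statistic, the degrees of freedom, the p-value, and the expected counts in one object. This is offered as a sketch for checking the numbers above; the table is rebuilt so that the example runs on its own, and the name result is chosen only for illustration.

```r
# Rebuild the table of counts used throughout this section.
smoke <- as.table(matrix(c(51, 43, 22, 92, 28, 21, 68, 22, 9),
                         ncol = 3, byrow = TRUE,
                         dimnames = list(c("current", "former", "never"),
                                         c("High", "Low", "Middle"))))

# Pearson chi-squared test of independence for the two way table.
result <- chisq.test(smoke)

result$statistic   # chi-squared statistic
result$parameter   # degrees of freedom
result$p.value     # p-value
result$expected    # table of expected counts
```

The statistic, degrees of freedom, and p-value should agree with the values computed step by step above.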

12.4. Graphical Views of Tables

The plot command will automatically produce a mosaic plot if its primary argument is a table.
Alternatively, you can call the mosaicplot command directly.

> smokerData <- read.csv(file='smoker.csv', sep=',', header=T)
> smoke <- table(smokerData$Smoke, smokerData$SES)
> mosaicplot(smoke)
> help(mosaicplot)

The mosaicplot command takes many of the same arguments for annotating a plot:

> mosaicplot(smoke, main="Smokers", xlab="Status", ylab="Economic Class")

If you wish to switch which side (horizontal versus vertical) to determine the primary proportion then you
can use the sort option. This can be used to switch whether the width or height is used for the first
proportional length:

> mosaicplot(smoke, main="Smokers", xlab="Status", ylab="Economic Class")
> mosaicplot(smoke, sort=c(2,1))

Finally, if you wish to switch which side is used for the vertical and horizontal axis you can use the dir option:

> mosaicplot(smoke, main="Smokers", xlab="Status", ylab="Economic Class")
> mosaicplot(smoke, dir=c("v","h"))
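
As a further illustration that goes slightly beyond the options above, mosaicplot also accepts a shade argument that colors each tile by how far the observed count is from what independence would predict. The sketch below rebuilds the smoke table so that it runs on its own, draws the shaded plot, and prints the Pearson residuals from chisq.test for comparison. The name pearson is chosen only for this example.

```r
# Rebuild the table of counts used throughout this chapter.
smoke <- as.table(matrix(c(51, 43, 22, 92, 28, 21, 68, 22, 9),
                         ncol = 3, byrow = TRUE,
                         dimnames = list(c("current", "former", "never"),
                                         c("High", "Low", "Middle"))))

# Shade tiles by residuals from the independence model.
mosaicplot(smoke, main = "Smokers", xlab = "Status",
           ylab = "Economic Class", shade = TRUE)

# Pearson residuals: (observed - expected) / sqrt(expected).
pearson <- chisq.test(smoke)$residuals
pearson
```

Cells with large negative residuals have fewer observations than expected under independence, and cells with large positive residuals have more.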