Lab 5 Solutions
Lab 5 Solutions
Error Detection: Cleaning Data and Searching for Information on ABS web site.
5.1. An updated version of the Attitudes to the Library questionnaire with 8 questions is
attached to this lab sheet, and a data file representing the responses of 30 survey
participants can be found on UTS Online. We are going to start by inserting a variable at
the start of the data giving the questionnaire id number.
Set up column 1 in the data as id. Go to Variable View, and right click the 1 to the left of
the first row. Select “Insert Variable” from the menu. Label the new variable id. Click on
Transform > Compute, select id as the Target Variable and as the Numeric
Expression type $casenum.
There are 12 deliberate data entry errors in the data file. Find these errors. For each error
explain the questionnaire number and the variable involved, what the error is, and what
method you used to find the error. This is an exercise in data cleaning. Use frequency
tables, cross tabulations and any other logic tests to find the errors. (Just scanning the
data file is not a good method particularly if the data file you have is very large.)
Errors 1-4 can be found by constructing frequency tables. Errors 5-7 can be found by
summing ranks. Errors 8-12 can be found by constructing crosstabs.
Female
Q.2 Please tick the box that best represents your view on the statement below:
Q.3 How many times have you visited the library during the last week?
Q.4 On your most recent visit to the library during the last week, which of the following
features of the library did you use?
(You may tick more than one box.)
Closed reserve
Study areas
Search facilities
Journal collections
Photocopiers
Q.5 From the following list of services provided by the Library choose two which you
feel are the most important. (Indicate each of your two choices with a tick.)
Closed reserve
Study areas
Search facilities
Journal collections
Photocopiers
Item Rank
Q7 Have you ever used the online service to reserve a book at the Library?
Yes
No
Q8 If you answered “No” to question 7, please indicate your reason for not using the
service.
No need to reserve
Other
Australia
State/Territory (S-T)
Statistical Division (SD)
Statistical Subdivision (SSD)
Statistical Local Area (SLA)
Census Collection District (CD)
5.3. The ABS makes a huge amount of data freely available. Go to the Census page. You
can then select a geographic area to obtain a summary of that geographic area using the
QuickStats Search. Enter “Rockdale” in the search box, and select “Rockdale LGA”
(towards the bottom of the list) and select “GO”.
You will now see some basic information followed three sections: “People”, “Families”
and “Dwellings”. What is different about the tables in these three sections?
The tables corresponding to “People” count individual people. The tables corresponding
to “Families” counts a family as a single observation. The tables in the “Dwellings”
section count whole dwellings as a single observation in the table.
Now, find how many (total) dwellings in the Rockdale LGA at the 2011 Census had 3 or
more motor vehicles per dwelling.
There were 3482 dwellings with three or more motor vehicles. This is a lower proportion
than both NSW as a whole and Australia…