0% found this document useful (0 votes)
2 views4 pages

Module 2 Assignment

The document outlines a series of exercises related to data transformation and analysis using SPSS, including calculating probabilities for student demographics, evaluating a screening test for Down syndrome, and constructing various data visualizations. It also includes instructions for recoding data into different formats and categories. Each exercise is assigned a point value and requires the application of statistical methods and interpretation of results.

Uploaded by

herdsnerds
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
2 views4 pages

Module 2 Assignment

The document outlines a series of exercises related to data transformation and analysis using SPSS, including calculating probabilities for student demographics, evaluating a screening test for Down syndrome, and constructing various data visualizations. It also includes instructions for recoding data into different formats and categories. Each exercise is assigned a point value and requires the application of statistical methods and interpretation of results.

Uploaded by

herdsnerds
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 4

‭Assignment 2‬

‭SPSS-TRANSFORMING DATA‬

‭ xercise 1:‬‭(2 points)‬


E
‭The number of students in the MPH program by county is‬
‭ PH program‬
M
‭given below. If a student is selected at random from this‬
‭dataset, find the probability that:‬ ‭Number of‬
‭a.‬ ‭The student is from Escambia county‬‭0.212‬ ‭ ounty‬
C ‭students‬ ‭Percentage‬
‭b.‬ ‭The student is from Santa Rosa county‬‭0.30‬ ‭Escambia‬ ‭106‬ ‭21.2‬
‭c.‬ ‭The student is from either Escambia or Santa Rosa‬ ‭Walton‬ ‭151‬ ‭30.2‬
‭county‬‭0.512‬ ‭Santa Rosa‬ ‭150‬ ‭30‬
‭d.‬ ‭The student is not from Santa Rosa county‬‭0.70‬ ‭Okaloosa‬ ‭93‬ ‭18.6‬
‭Total‬ ‭500‬ ‭100‬

‭ xercise 2:‬‭(2 points)‬


E
‭A crosstabulation of the number of students in the MPH program by county and whether the students owned‬
‭a computer when they started the program is given below. If the student is selected at random from this data‬
‭set, determine the probability of selecting a student who is from Santa Rosa County‬‭or‬‭a student who had‬‭a‬
‭computer at the start of the program. (Show your work)‬‭0.774‬
‭ xercise 3:‬‭(2 points)‬
E
‭Investigators‬‭conducted‬‭a‬‭study‬‭to‬‭evaluate‬‭the‬‭use‬‭of‬‭a‬‭screening‬‭test‬‭of‬‭specific‬‭hormones‬‭in‬‭the‬‭blood‬‭to‬
‭assess‬ ‭whether‬ ‭or‬ ‭not‬ ‭the‬ ‭fetus‬ ‭of‬ ‭a‬ ‭pregnant‬ ‭women‬ ‭is‬ ‭likely‬ ‭to‬ ‭have‬ ‭Down‬ ‭syndrome. ‬ ‭10000‬‭pregnant‬
‭women‬ ‭underwent‬ ‭the‬ ‭screening‬ ‭test‬ ‭and‬ ‭scored‬ ‭either‬ ‭positive‬ ‭or‬ ‭negative‬ ‭depending‬ ‭on‬ ‭the‬ ‭levels‬ ‭of‬
‭hormones‬‭in‬‭the‬‭blood.‬‭There‬‭were‬‭1255‬‭women‬‭with‬‭positive‬‭test;‬‭from‬‭them,‬‭285‬‭women‬‭had‬‭an‬‭affected‬
‭fetus.‬ ‭There‬ ‭were‬ ‭9700‬ ‭unaffected‬ ‭fetus;‬ ‭8730‬‭of‬‭these‬‭had‬‭negative‬‭test‬‭results.‬‭What‬‭was‬‭the‬‭sensitivity,‬
‭specificity,‬‭positive‬‭predictive‬‭value‬‭and‬‭negative‬‭predictive‬‭value‬‭of‬‭the‬‭test?‬ ‭Interpret‬‭the‬‭results‬‭(Show‬
‭your work including developing a 2 X 2 table).‬

‭ ffected‬
a f‭ etus NOT‬ ‭total‬
‭fetus‬ ‭affected‬

‭ ositive‬
p ‭285‬ ‭970‬ ‭1,255‬
‭screening‬

‭ egative‬
n ‭15‬ ‭8,730‬ ‭8,745‬
‭screening‬

‭total‬ ‭300‬ ‭9,700‬ ‭10,000‬

‭Exercise 4:‬‭(2 points)‬

‭ ‬ I‭ QR = Q3-Q1 = 39.00-2.25 =‬‭36.75‬



‭●‬ ‭Fence‬‭Lower‬‭=‬ ‭-52.875‬
‭Q1 - (1.5)(IQR) = 2.25 - (1.5)(36,75) = 2.25 - 55.125 = -52.875‬
‭●‬ ‭are there outside values below the lower fence?‬‭yes‬
‭●‬ ‭Fence‬‭Upper‬‭=‬ ‭94.125‬
‭Q3 + (1.5)(IQR) = 39.00 + (1.5)(36.75) = 39.00 + 55.125 =‬
‭94.125‬
‭●‬ ‭are there outside values above the upper fence?‬‭yes‬
‭lower fence is the lowest number in the dataset‬
‭upper fence is the highest number in the dataset‬
‭.‬

‭Exercise 5:‬‭(2 points)‬


‭●‬ C
‭ onstruct a steam-and-leaf plot of the data‬
‭Seizures following bacterial meningitis.‬
‭●‬ ‭Construct a boxplot of the data‬‭Seizures following bacterial meningitis.‬

‭‬ A
● ‭ re there any outside values in this data set?‬‭yes‬
‭●‬ ‭Does the boxplot show evidence of asymmetry?‬‭no, the data set is skewed‬

‭ xercise 6 (optional):‬
E
‭(1 point extra for each output – 1, 2, 3)‬
‭This week we are going to learn how to recode data into the same variable and how to record data into a‬
‭different variables. This is a very helpful tool in SPSS.‬

‭1.‬ R
‭ ecoding data into single values‬
‭The first data set represents runs scored by 5 baseball players in a national tournament. We want to‬
‭recode this data so that the players are rank ordered by their number of runs, with the player with the‬
‭highest runs given a code of “1” and the player with the lowest score given a 5.‬
‭a.‬ ‭Enter the following data in SPSS: # of runs by players‬
‭b.‬ ‭Recode the data so that the players are rank ordered by their number of runs, with the player‬
‭with the highest runs given a code of "1" and the batsman with the lowest runs given a "5".‬
‭c.‬ ‭Run frequencies of the new created variable.‬
‭2.‬ R
‭ ecoding data into a given range of values‬
‭The second data set represents the scores of 10 MPH students in their final biostatistics exam. We‬
‭want to recode the data giving a code of 1 to scores between 75 - 100, code 2 to scores between 61 -‬
‭75, code 3 to scores between 41 - 60 and code 4 to scores between 0 - 40.‬
‭a.‬ ‭Enter the following data in SPSS‬
‭b.‬ ‭Recode the data giving code "1" to scores between 75 - 100, code 2 to scores between 61 -‬
‭75, code 3 to scores between 41 - 60 and code 4 to scores between 0 – 40‬
‭c.‬ ‭Run frequencies of the new created variable.‬

‭3.‬ R
‭ ecoding data into two categories‬
‭The‬ ‭third‬ ‭dataset‬ ‭represents‬ ‭12‬ ‭patient‬ ‭satisfaction‬ ‭scores‬ ‭for‬ ‭a‬ ‭dental‬ ‭provider.‬ ‭The‬ ‭satisfaction‬
‭scores‬ ‭goes‬ ‭from‬ ‭1‬ ‭to‬ ‭10‬ ‭with‬ ‭10‬ ‭being‬ ‭extremely‬ ‭satisfied‬ ‭and‬ ‭1‬ ‭extremely‬ ‭dissatisfied.‬ ‭The‬
‭provider‬‭wants‬‭to‬‭code‬‭all‬‭those‬‭who‬‭responded‬‭by‬‭giving‬‭ratings‬‭above‬‭5‬‭a‬‭"Satisfactory"‬‭code‬‭and‬
‭those below 5 a "Dissatisfactory" code.‬
‭a.‬ ‭Enter the data in SPSS‬
‭b.‬ ‭Recode‬ ‭the‬ ‭data‬ ‭so‬ ‭that‬ ‭all‬ ‭those‬ ‭who‬ ‭responded‬ ‭by‬ ‭giving‬ ‭ratings‬ ‭above‬ ‭5‬ ‭have‬ ‭a‬
‭"Satisfactory" code and those below 5 have a "Dissatisfactory" code‬
‭c.‬ ‭Run frequencies of the new created variable‬

You might also like