Statistics Workshop With Excel
Statistics Workshop With Excel
Remote (individual): Outside of class hours. Completion of class work. With the
results of the Excel document, an objective test will be carried out remotely on the
virtual campus.
The date of the online test will be Friday, November 11, 2022 from 9 a.m. to 11:59
p.m.
Instructions & Rubric:
Face-to-face Talller: Students who do not attend the workshop for unjustified
reasons, will fail the activity, regardless of the grade of the objective test remotely
and will fail in the subject in the ordinary call. They must carry out another activity
in the extraordinary call individually.
Development of the activity: The teacher will explain the software and ask
questions to all the couples, positively valuing the attention and participation.
The performance of activities that do not correspond to the practice will be
negatively assessed, each call to attention to a student will subtract 10% of the
grade of the activity from the couple who carries out the activity.
40% of the weight of the activity.
A person is considered to have clinical hypothyroidism when their TSH values are above 10
μU/ml.
The table below shows the values of the variables studied in the 40 patients: TSH (μU/ml),
age, and sex (Female=M; Male = H).
TSH
CASE AGE TSH (μU/mL) SEX CASE AGE (μU/mL) SEX
1 33 7,4 M 21 61 14,8 M
2 41 0,6 H 22 29 8,3 M
3 56 5,1 M 23 28 10,3 H
4 72 0,9 H 24 77 11,3 M
5 67 4,2 H 25 52 5,9 H
6 58 1,7 H 26 55 19,2 M
7 33 2,3 H 27 33 14,2 H
8 78 0,9 M 28 66 11,3 H
9 55 2,3 H 29 50 5,9 M
10 60 3,1 H 30 31 12,6 M
11 45 4,2 M 31 31 14,2 M
12 46 10,3 M 32 20 10,3 M
13 59 12,6 H 33 38 5,1 H
14 51 5,9 M 34 33 17,1 M
15 61 5,1 H 35 40 0,6 H
16 65 0,9 H 36 34 2,3 M
17 45 5,9 M 37 45 11,3 H
18 29 12,6 M 38 80 5,1 M
19 44 11,9 H 39 59 15,1 H
20 25 6,4 M 40 47 10,3 M
Type the names of the variables in cells A1, B1, C1, and D1.
2. Calculate, for the four variables, all the descriptive measures you know.
You can accumulate the results in the rows below the data, for example, from box A43
onwards.
When calculating these functions, you need to specify the data range, for example, from
B2:B41 to indicate the age data on which you want to calculate the statistics.
KURTOSIS to calculate the kurtosis of a quantitative variable. For example, for age, in a blank
cell type = and in the white box above the data array, type KURTOSIS (B2:B41) and when you
run it, you get kurtosis in that cell.
COEF.DE. CORREL Calculates the correlation coefficient between two variables (age and
TSH). In the boxes where Matrix is specified, we must indicate the range of values we want to
compare: B2:B41 in Matrix 1 and with the TSH values in Matrix 2.
A faster way to analyze your data is by using the Data Analytics resource. Verify that the
Data Analytics tab shows you. Otherwise, load the module by doing the following:
How to load tools for analysis in Excel: File-Options-Add-ins. In the Manage box, select Excel
Add-ins and click Go. Tools for analysis-OK.
Go to Data Analysis and generate the Descriptive Statistics of the variables, on a new sheet.
It also generates the histogram of the continuous variables. Select the last three squares in
the Histogram window.
DEGREE IN DENTISTRY – Epidemiology, Public Health and Applied Biostatistics.
FIRST ASSESSABLE ACTIVITY
Box Chart: Select Data – Insert – Insert Stat Chart – Boxes & Whiskers. Under Serial Options, you
can specify that it includes the endpoint data.
Make several box plots: one for each quantitative variable. In addition, there are two in
which the values of age and TSH differentiated by sex appear. Discuss the results.
Scatter Plots: Scatter. Relate Age to TSH in a scatter plot, with Age on the X-axis.
Its values are going to be "1 if TSH is less than 10 and 2 otherwise". Make a table of
frequencies for this variable.
Then we'll go to Formulas – Logics – IF. In Logic Proof we will say C1<10. In Value if true we will
say 1 and in the other 2. When you hit OK, 1 or 2 is loaded into cell D1. Drag from the bottom
right corner of the cell to D41 and load 1 and 2 accordingly.
To perform the frequencies, you can use the option Formulas – Autosum – count numbers.
But you must specify the range of values you want to count. Being dichotomous, ordering
from lowest to highest is enough. The counting numbers option will be used later to construct
the Contingency Tables.
It dichotomizes age, into two groups of the same size defined by the median.
Use the same resource as in the previous section. The new variable can be called Median
Age, and its values will be in column F.