0% found this document useful (0 votes)
54 views38 pages

Lab 04-Data (For Submission)

1) Attendance at performances tended to cluster between 111-119 people, with the median attendance being 118 people. 2) The smallest attendance was 88 people and the largest attendance was 156 people. 3) Half of the performances had 118 people or less in attendance.

Uploaded by

Khaled Rafei
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as XLSX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
54 views38 pages

Lab 04-Data (For Submission)

1) Attendance at performances tended to cluster between 111-119 people, with the median attendance being 118 people. 2) The smallest attendance was 88 people and the largest attendance was 156 people. 3) Half of the performances had 118 people or less in attendance.

Uploaded by

Khaled Rafei
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as XLSX, PDF, TXT or read online on Scribd
You are on page 1/ 38

Descriptive statistics

Tionesta
count 24

DotPlot10/20/2020 10:07.13 (1)

20 22 24 26 28 30 32 34 36 38 40
Tionesta

Descriptive statistics

Shefield
count 24

DotPlot
10/20/2020 10:07.55 (1)

20 25 30 35 40 45 50
Shefield

Descriptive statistics

# Attending
count 45

Stem and Leaf plot for # Attending


stem unit = 10
leaf unit = 1

Frequency Stem Leaf


2 8 89
7 9 3445667
6 10 334678
9 11 122337789
8 12 00455577
7 13 2 4 5 6 8 9 9
3 14 2 3 8
3 15 5 5 6
45

Descriptive statistics

Returns
count 21

Stem and Leaf plot for Returns


stem unit = 1
leaf unit = 0.1

Frequency Stem Leaf


1 7 7
7 8 0013488
7 9 1256689
4 10 1 2 4 8
2 11 2 6
21

Descriptive statistics

Commissions
count 15

1st quartile 1,739.50


median 2,038.00
3rd quartile 2,151.00
interquartile range 411.50
mode #N/A

low extremes 0
low outliers 0
high outliers 0
high extremes BoxPlot
0

1000 1200 1400 1600 1800 2000 2200 2400 2600


10/20/2020 14:41.28 (1)
Commissions
Descriptive statistics

Daily Charges
count 28
mean 253.00
minimum 116
maximum 353
range 237

1st quartile 224.00


median 253.00
3rd quartile 298.75
interquartile range 74.75
mode 209.00

low extremes 0
low outliers 0
high outliers BoxPlot
0
high extremes 0

100 150 200 250 300 350 400


10/20/2020 19:06.26 (1)
Daily Charges

Profit earned vs age


3,500

3,000

2,500

2,000 f(x) = 9.62444787398223 x + 1516.75642594859


R² = 0.010548043761224
Profit

1,500

1,000

500

0
40 45 50 55 60 65 70 75
Age
Tionesta Shefield
# of vehicles # of vehicles
serviced serviced
23 31
DotPlot
33 35
27 44
28 36
39 34 20 22 24 26 28 30 32 34 36 3
26 37
Tionesta
30 30
32 37
28 43
33 31
DotPlot
35 40
32 31
29 32
25 44 20 25 30 35
36 36 Shefield
31 34
32 43
27 36
35 26 Report Summary statistics
32 38
35 37 In general, Sheffield are serving more customers than Tionesta (44 customers
37 30 Tionesta serviced the fewest cars in any day (23 customers)
36 42 Tionesta serviced 32 cars on 4 different days
30 33 The number of cars serviced cluster is 32 for Tionesta and 36, 37 for sheffield
751 860

Mean (Tionesta) 31.29 Tionesta serviced a mean of 31.29 vehicles per day
Mean (Sheffield) 35.83 Sheffield serviced a mean of 35.83 vehicles per day during the same period (2
So Sheffield serviced 4.54 more vehicles per day
2 34 36 38 40

DotPlot

35 40 45 50
Shefield

n Tionesta (44 customers @ Sheff. While 39 customers @ Tionesta)

a and 36, 37 for sheffield

uring the same period (24days)


# of people
attending each
performances
96
93
88
117
127
95
113
96
108 Descriptive statistics
94
148 # Attending
156 count 45
139
142
94 Stem and Le
# Attending 1) Around what values does attendance ten
107 stem unit =10 2) What is the smallest attendance?
125 leaf unit = 1 3) The largest attendance?
155
155 Frequency Stem Leaf It is like a Histogram but in Horizontal orie
103 2 8 89 Location of the median=
112 7 9 3445667 That means that the median is 118
127 6 10 334678 50% of the performances were attended b
117 9 11 122337789 1) The Clustering is at 111 to 119 (Highest c
120 8 12 00455577 Also I can tell where the Quartiles are
112 7 13 2456899 2) The smallest attendance is 88
135 3 14 2 3 8 3) The largest attendance is 156
132 3 15 5 5 6
111 45 There were 2 performances with less tha
125 and 3 performances with 150 people & m
104
106
139
134
119
97
89
118
136
125
143
120
103
113
124
138
hat values does attendance tend to cluster?
he smallest attendance?
st attendance?

Histogram but in Horizontal orientation


of the median= (n+1)/2 23
ns that the median is 118
e performances were attended by less than 118 people
ering is at 111 to 119 (Highest cluster)
tell where the Quartiles are
est attendance is 88
st attendance is 156

re 2 performances with less than 90 people attending


formances with 150 people & more
Rate of Returns
8.3
9.6
9.5
9.1
8.8
11.2
7.7 Stem and Le Returns
10.1 stem unit =1
9.9 leaf unit = 0.1
10.8
10.2 Frequency Stem Leaf
8 1 7 7
8.4 7 8 0013488
8.1 7 9 1256689
11.6 4 10 1 2 4 8 a) 8
9.6 2 11 2 6 b) 10.1
8.8 21 10.2
8 10.4
10.4 10.8
9.8 c) Manually: First we need to
9.2 (n+1)/2
11
Then we count from stem &
1,2,3,……..11
we will get 5
So the median is:
OR BY USING THE EXCEL COMMAND F
Median= 9.5

d) Manually

OR BY USING THE EXCEL COMMAND F


min= 7.7
max= 11.6
nually: First we need to calculate the location of data
where n=21

en we count from stem & leaf table

the median is: 9.5


THE EXCEL COMMAND FOR MEDIAN

min 7.7
max 11.6

THE EXCEL COMMAND FOR MIN & MAX


Commissions
1460 Locate the median, the first quartile, and the third quartile, and the first and last
1471 deciles for the commissions earned using MEGASTAT. Comment on the difference
1637 between the EXCEL functions percentile.exc & percentile.inc. .
1721
1758
1787
1940
2038
Using descriptive Using
2047 statistics (Megastat) Percentile.EXC
2054 1st quartile 1,739.50 1721
2097 median 2,038.00 2038
2205 3rd quartile 2,151.00 2205
2287
Using
2311 Percentile.EXC
2406 First Decile 1466.6
Last Deciles is the Ninth Decile 2349.0

We can notice that the values of Q1 & Q2 are different by using Percentile.exc & Percentile.inc

BoxPlot

1000 1200 1400 1600 1800 2000 2200 2400 2600


Commissions
nd last
fference

Using Using Percentile Using hand


Percentile.INC Computation
1739.5 1739.5 1721
2038 2038 2038
2151 2151 2205

Using Using Percentile Using hand


Percentile.INC Computation
1537.4 1537.4 1466.6
2301.4 2303.8 2348.6

ntile.exc & Percentile.inc

00 2600
Daily Charges
116
121
157
192
207
209
209
229
232
BoxPlot
236
236
239
243
246 100 150 200 250 300 350 400
260
Daily Charges
264
276
281 Interpretation
283 Shape The shape is slightly skewed negatively towards low charges
289 Center The median around 250…
296 Spread Range= the charges range between 115$ & 360$ with a spread of 245$
307 Interquartile range: (Range inside the box)
309 50% of the charges between 225$ and 300$ with a total spread of 75$
312 Outliers There are no outliers - No extreme charges
317 Q1 25% of the daily charges are below 235$
324 Q3 75% of the daily charges are below 300$
341
353
1st quartile Q1 224.00
median Q2 253.00
3rd quartile Q3 298.75

400 OR BY EXCEL COMMAND


Percentile.exc Q1 214 Q2 253 Q3 304.25
Percentile.inc Q1 224 Q2 253 Q3 298.75
Player X Salary ($) x-xbar (x-xbar)^2
John Barbato 507,500 - 7,087,645 50,234,711,646,025
Dellin Betances 507,500 - 7,087,645 50,234,711,646,025
Luis Cessa 507,500 - 7,087,645 50,234,711,646,025
Ronald Torreyes 508,600 - 7,086,545 50,219,120,037,025
Mason Williams 509,700 - 7,085,445 50,203,530,848,025
Kirby Yates 511,900 - 7,083,245 50,172,359,730,025
Bryan Mitchell 516,650 - 7,078,495 50,105,091,465,025
Luis Severino 521,300 - 7,073,845 50,039,283,084,025
Greg Bird 525,300 - 7,069,845 49,982,708,324,025
Chasen Shreve 533,400 - 7,061,745 49,868,242,445,025
Austin Romine 556,000 - 7,039,145 49,549,562,331,025
Aaron Hicks 574,000 - 7,021,145 49,296,477,111,025
Didi Gregorius 2,425,000 - 5,170,145 26,730,399,321,025
Martin Prado 3,000,000 - 4,595,145 21,115,357,571,025
Dustin Ackley 3,200,000 - 4,395,145 19,317,299,571,025
Ivan Nova 4,100,000 - 3,495,145 12,216,038,571,025
Michael Pineda 4,300,000 - 3,295,145 10,857,980,571,025
Nathan Eovaldi 5,600,000 - 1,995,145 3,980,603,571,025
Starlin Castro 7,857,143 261,998 68,642,952,004
Andrew Miller 9,000,000 1,404,855 1,973,617,571,025
Aroldis Chapman 11,325,000 3,729,855 13,911,818,321,025
Chase Headley 13,000,000 5,404,855 29,212,457,571,025
Brett Gardner 13,500,000 5,904,855 34,867,312,571,025
Carlos Beltran 15,000,000 7,404,855 54,831,877,571,025
Brian McCann 17,000,000 9,404,855 88,451,297,571,025
Alex Rodriguez 21,000,000 13,404,855 179,690,137,571,025
Jacoby Ellsbury 21,142,857 13,547,712 183,540,500,434,944
Masahiro Tanaka 22,000,000 14,404,855 207,499,847,571,025
Mark Teixeira 23,125,000 15,529,855 241,176,396,321,025
CC Sabathia 25,000,000 17,404,855 302,928,977,571,025
227,854,350 2,032,511,073,086,648

Mean 7,595,145
Median 3,650,000
std 8231061.23795 8231061.23795032

Pearson's Coefficient of skewness SK 1.44

Software coefficient of skewness SK 0.92


(x-xbar/s) (x-xbar/s)^3
- 0.86 - 0.64
- 0.86 - 0.64
- 0.86 - 0.64
- 0.86 - 0.64
- 0.86 - 0.64
- 0.86 - 0.64
- 0.86 - 0.64
- 0.86 - 0.63
- 0.86 - 0.63
- 0.86 - 0.63
- 0.86 - 0.63
- 0.85 - 0.62
- 0.63 - 0.25
- 0.56 - 0.17
- 0.53 - 0.15
- 0.42 - 0.08
- 0.40 - 0.06
- 0.24 - 0.01
0.03 0.00
0.17 0.00
0.45 0.09
0.66 0.28
0.72 0.37
0.90 0.73
1.14 1.49
1.63 4.32
1.65 4.46
1.75 5.36
1.89 6.72
2.11 9.45
24.94

by using excel as calculator


Age Profit Location Vehicle-Type Previous
46 2,197 Sheffield Sedan 1 Are these data a Bi-varied data?
46 2,646 Tionesta Sedan 2 yes, for the same reason (Age vs profit),
47 1,461 Kane Sedan 0 in another word, each point have an X & Y
47 1,731 Tionesta Compact 0
47 2,230 Tionesta Sedan 1
Profit earned vs age
47 2,341 Sheffield SUV 1
47 3,292 Olean Sedan 2 3,500
48 1,108 Sheffield Sedan 1 3,000
48 1,295 Sheffield SUV 1
48 1,344 Sheffield SUV 0 2,500
48 1,906 Kane Sedan 1 f(x) = 9.62444787398223 x + 151
2,000
48 1,952 Tionesta Compact 1 R² = 0.010548043761224

Profit
48 2,070 Kane SUV 1 1,500
48 2,454 Kane Sedan 1
1,000
49 1,606 Olean Compact 0
49 1,680 Kane SUV 3 500
49 1,827 Tionesta Truck 3
0
49 1,915 Tionesta SUV 1
40 45 50 55 60 65
49 2,084 Tionesta Sedan 0
49 2,639 Sheffield SUV 0 Age
50 842 Kane SUV 0
50 1,963 Sheffield Sedan 1 No relationship whatsoever
50 2,059 Sheffield Sedan 1
50 2,338 Tionesta SUV 0 In scatter plot we have to comment on three things
50 3,043 Kane Sedan 0 Direction Slightly Direct/Positive relationship
51 1,059 Kane SUV 1 Strength the strength is very weak relationship betw
51 1,674 Sheffield Sedan 1 Type Kind of Linear
51 1,807 Tionesta Sedan 1
51 2,056 Sheffield Hybrid 0
51 2,236 Tionesta SUV 2
51 2,928 Kane SUV 0
52 1,269 Tionesta Sedan 1
52 1,717 Sheffield SUV 3
52 1,797 Kane Sedan 1
52 1,955 Olean Hybrid 2
52 2,199 Tionesta SUV 0
52 2,482 Olean Compact 0
52 2,701 Sheffield SUV 0
52 3,210 Olean Truck 4
53 377 Olean SUV 1
53 1,220 Olean Sedan 0
53 1,401 Tionesta SUV 2
53 2,175 Olean Sedan 1
54 1,118 Sheffield Compact 1
54 2,584 Olean Compact 2
54 2,666 Tionesta Truck 0
54 2,991 Tionesta SUV 0
55 934 Sheffield Truck 1
55 2,063 Kane SUV 1
55 2,083 Sheffield Sedan 1
55 2,856 Olean Hybrid 1
55 2,989 Tionesta Compact 1
56 910 Sheffield SUV 0
56 1,536 Kane SUV 0
56 1,957 Sheffield SUV 1
56 2,240 Olean Sedan 0
56 2,695 Kane Sedan 2
57 1,325 Olean Sedan 1
57 2,250 Sheffield Sedan 2
57 2,279 Sheffield Hybrid 1
57 2,626 Sheffield Sedan 2
58 1,501 Sheffield Hybrid 1
58 1,752 Kane Sedan 3
58 2,058 Kane SUV 1
58 2,370 Tionesta Compact 0
58 2,637 Sheffield SUV 1
59 1,426 Sheffield Sedan 0
59 2,944 Olean SUV 2
60 2,147 Olean Compact 2
61 1,973 Kane SUV 3
61 2,502 Olean Sedan 0
62 783 Sheffield Hybrid 1
62 1,538 Olean Truck 1
63 2,339 Olean Compact 1
64 2,700 Kane Truck 0
65 2,222 Kane Truck 1
65 2,597 Sheffield Truck 0
65 2,742 Tionesta SUV 2
68 1,837 Sheffield Sedan 1
69 2,842 Kane SUV 0
70 2,434 Olean Sedan 4
72 1,640 Olean Sedan 1
72 1,821 Tionesta SUV 1
73 2,487 Olean Compact 4
aried data?
ason (Age vs profit),
h point have an X & Y

Profit earned vs age

f(x) = 9.62444787398223 x + 1516.75642594859


R² = 0.010548043761224

50 55 60 65 70 75
Age

ve to comment on three things


rect/Positive relationship
gth is very weak relationship between the two
Location Average - Profit
Kane 2,057
Olean 2,168
Sheffield 1,847
Tionesta 2,169
Total Result 2,044 Total
2200.00

2100.00

2000.00

1900.00

1800.00

1700.00

1600.00
Kane Olean Sheffield Tion
Total

Average - Profit

Kane Olean Sheffield Tionesta Total Result


Count - Profit Category Location
Profit Category Kane Olean Sheffield Tionesta Total Result
0 11 7 16 8 42
1 8 13 10 11 42
Total Result 19 20 26 19 84
Age Profit Median Profit Profit Category Location Vehicle-Type Previous
There are four dealerships in the
46 2,197 2,067 1 Sheffield Sedan 1 profit earned on each vehicle sold
46 2,646 1 Tionesta Sedan 2 between the amount of profit ear
47 1,461 0 Kane Sedan 0
47 1,731 0 Tionesta Compact 0
47 2,230 1 Tionesta Sedan 1
47 2,341 1 Sheffield SUV 1 We cannot use contingency table
(they both have to be qualitative)
47 3,292 1 Olean Sedan 2 change it to qualitative.
48 1,108 0 Sheffield Sedan 1 I have to divide the profit into cat
48 1,295 0 Sheffield SUV 1 Or divide them low, medium, & h
We can compare the profit for th
48 1,344 0 Sheffield SUV 0 but not using the contingency tab
48 1,906 0 Kane Sedan 1
48 1,952 0 Tionesta Compact 1
48 2,070 1 Kane SUV 1
48 2,454 1 Kane Sedan 1 T
49 1,606 0 Olean Compact 0
2200.00
49 1,680 0 Kane SUV 3
49 1,827 0 Tionesta Truck 3 2100.00
49 1,915 0 Tionesta SUV 1
49 2,084 1 Tionesta Sedan 0 2000.00
49 2,639 1 Sheffield SUV 0
1900.00
50 842 0 Kane SUV 0
50 1,963 0 Sheffield Sedan 1 1800.00
50 2,059 0 Sheffield Sedan 1
50 2,338 1 Tionesta SUV 0 1700.00
50 3,043 1 Kane Sedan 0
1600.00
51 1,059 0 Kane SUV 1 Kane Olean Sheffi
51 1,674 0 Sheffield Sedan 1
51 1,807 0 Tionesta Sedan 1 Now I can make a comparaiso
51 2,056 0 Sheffield Hybrid 0
51 2,236 1 Tionesta SUV 2
51 2,928 1 Kane SUV 0
52 1,269 0 Tionesta Sedan 1 Now I have to change the pro
52 1,717 0 Sheffield SUV 3 Insert a column called "Profit
52 1,797 0 Kane Sedan 1
52 1,955 0 Olean Hybrid 2 if function:
52 2,199 1 Tionesta SUV 0 meaning:
52 2,482 1 Olean Compact 0
52 2,701 1 Sheffield SUV 0 Now Profit category & Locatio
52 3,210 1 Olean Truck 4 To do the Contingency table,
53 377 0 Olean SUV 1
53 1,220 0 Olean Sedan 0 Count - Profi
53 1,401 0 Tionesta SUV 2 Profit Catego
53 2,175 1 Olean Sedan 1 0
54 1,118 0 Sheffield Compact 1 1
54 2,584 1 Olean Compact 2 Total Result
54 2,666 1 Tionesta Truck 0
54 2,991 1 Tionesta SUV 0 This is what we called Joint fr
55 934 0 Sheffield Truck 1 What can we say here abou it
55 2,063 0 Kane SUV 1 From the table 19 cars were s
55 2,083 1 Sheffield Sedan 1 At Kane most of the cars mad
55 2,856 1 Olean Hybrid 1
55 2,989 1 Tionesta Compact 1
56 910 0 Sheffield SUV 0
56 1,536 0 Kane SUV 0
56 1,957 0 Sheffield SUV 1
56 2,240 1 Olean Sedan 0
56 2,695 1 Kane Sedan 2
57 1,325 0 Olean Sedan 1
57 2,250 1 Sheffield Sedan 2
57 2,279 1 Sheffield Hybrid 1
57 2,626 1 Sheffield Sedan 2
58 1,501 0 Sheffield Hybrid 1
58 1,752 0 Kane Sedan 3
58 2,058 0 Kane SUV 1
58 2,370 1 Tionesta Compact 0
58 2,637 1 Sheffield SUV 1
59 1,426 0 Sheffield Sedan 0
59 2,944 1 Olean SUV 2
60 2,147 1 Olean Compact 2
61 1,973 0 Kane SUV 3
61 2,502 1 Olean Sedan 0
62 783 0 Sheffield Hybrid 1
62 1,538 0 Olean Truck 1
63 2,339 1 Olean Compact 1
64 2,700 1 Kane Truck 0
65 2,222 1 Kane Truck 1
65 2,597 1 Sheffield Truck 0
65 2,742 1 Tionesta SUV 2
68 1,837 0 Sheffield Sedan 1
69 2,842 1 Kane SUV 0
70 2,434 1 Olean Sedan 4
72 1,640 0 Olean Sedan 1
72 1,821 0 Tionesta SUV 1
73 2,487 1 Olean Compact 4
e are four dealerships in the Applewood Auto Group. Suppose we want to compare the
t earned on each vehicle sold by the particular dealership. Is there a relationship
ween the amount of profit earned and the dealership?

annot use contingency table if one data is qualitative & the other is quantitative,
y both have to be qualitative), unless i make a changes in the quantitative data and
ge it to qualitative.
e to divide the profit into categories (like above the median or below the median
vide them low, medium, & high
an compare the profit for the various dealership
not using the contingency table, we use pivot table

Total
0

0
Average - Profit
0

0
Kane Olean Sheffield Tionesta Total Result

Now I can make a comparaison between various dealerships

Now I have to change the profit from Quantitative to qualitative (above median & below median)
nsert a column called "Profit category" with 0 if below median & 1 if above median

if(B2>$C$2,1,0)
if this value "B2" is greater than median "C2" then this value is 1, otherwise is 0

Now Profit category & Location are both qualitative, so we can do the contingency table
To do the Contingency table, we go to insert pivot table, highlight data

Location
Kane Olean Sheffield Tionesta Total Result
11 7 16 8 42
8 13 10 11 42
19 20 26 19 84

This is what we called Joint frequency table


What can we say here abou it?
From the table 19 cars were sold at Kane, 20 cars were sold at olean….
At Kane most of the cars made more than median profit while at orlean and at Tionesta more cars made above/higher median profit
ve/higher median profit
Frequency Distribution - Quantitative

Price cumulative
lower upper midpoint width frequency percent frequency
120 < 140 130 20 3 2.9 3
140 < 160 150 20 3 2.9 6
160 < 180 170 20 16 15.2 22
180 < 200 190 20 20 19.0 42
200 < 220 210 20 14 13.3 56
220 < 240 230 20 14 13.3 70
240 < 260 250 20 13 12.4 83
260 < 280 270 20 8 7.6 91
280 < 300 290 20 7 6.7 98
300 < 320 310 20 4 3.8 102
320 < 340 330 20 2 1.9 104
340 < 360 350 20 1 1.0 105
105 100.0

Descriptive statistics

Price
count 105
mean 221.103

1st quartile 187.000


median 213.600
3rd quartile 251.400
interquartile range 64.400
mode 188.300
low extremes 0
low outliers 0
high outliers 0
high extremes 0

Descriptive statistics

Price
count 105
sample standard deviation 47.105
sample variance 2,218.919
minimum 125
maximum 345.3
range 220.3
cumulative
percent
2.9
5.7
21.0
40.0
53.3
66.7
79.0
86.7
93.3
97.1
99.0
100.0

Histogram
20
15
Percent

10
5
0
0 0 0 0 0 0 0 0 0 0 0 0 0
12 14 16 18 20 22 24 26 28 30 32 34 36
Price
Price Bedrooms Size Pool Distance Twnship Garage Baths
263.1 4 2300 1 17 5 1 2
182.4 4 2100 0 19 4 0 2
242.1 3 2300 0 12 3 0 2
213.6 2 2200 0 16 2 0 2.5
139.9 2 2100 0 28 1 0 1.5
245.4 2 2100 1 12 1 1 2
327.2 6 2500 0 15 3 1 2
271.8 2 2100 0 9 2 1 2.5
221.1 3 2300 1 18 1 0 1.5
266.6 4 2400 0 13 4 1 2
292.4 4 2100 0 14 3 1 2
209 2 1700 0 8 4 1 1.5
270.8 6 2500 0 7 4 1 2
246.1 4 2100 0 18 3 1 2
194.4 2 2300 0 11 3 0 2
281.3 3 2100 0 16 2 1 2
172.7 4 2200 1 16 3 0 2
207.5 5 2300 1 21 4 0 2.5
198.9 3 2200 1 10 4 1 2
209.3 6 1900 1 15 4 1 2
252.3 4 2600 0 8 4 1 2
192.9 4 1900 1 14 2 1 2.5
209.3 5 2100 0 20 5 0 1.5
345.3 8 2600 0 9 4 1 2
326.3 6 2100 0 11 5 1 3 b)
173.1 2 2200 1 21 5 1 1.5
187 2 1900 0 26 4 0 2
257.2 2 2100 0 9 4 1 2
233 3 2200 0 14 3 1 1.5
180.4 2 2000 0 11 5 0 2
234 2 1700 0 19 3 1 2
247.7 5 2400 0 16 2 1 2
166.2 3 2000 1 16 2 1 2
177.1 2 1900 0 10 5 1 2
182.7 4 2000 1 14 4 0 2.5
216 4 2300 0 19 2 0 2
312.1 6 2600 0 7 5 1 2.5
199.8 3 2100 0 19 3 1 2
273.2 5 2200 0 16 2 1 3
206 3 2100 1 9 3 0 1.5
232.2 3 1900 1 16 1 1 1.5
198.3 4 2100 1 19 1 1 1.5
205.1 3 2000 1 20 4 0 2
175.6 4 2300 1 24 4 1 2
307.8 3 2400 1 21 2 1 3
269.2 5 2200 0 8 5 1 3
224.8 3 2200 0 17 1 1 2.5
171.6 3 2000 1 16 4 0 2
216.8 3 2200 0 15 1 1 2
192.6 6 2200 1 14 1 0 2
236.4 5 2200 0 20 3 1 2
172.4 3 2200 0 23 3 0 2
251.4 3 1900 0 12 2 1 2
246 6 2300 0 7 3 1 3
147.4 6 1700 1 12 1 0 2
176 4 2200 0 15 1 1 2
228.4 3 2300 0 17 5 1 1.5
166.5 3 1600 1 19 3 0 2.5
189.4 4 2200 0 24 1 1 2
312.1 7 2400 0 13 3 1 3
289.8 6 2000 0 21 3 1 3
269.9 5 2200 1 11 4 1 2.5
154.3 2 2000 0 13 2 0 2
222.1 2 2100 0 9 5 1 2
209.7 5 2200 1 13 2 1 2
190.9 3 2200 1 18 3 1 2
254.3 4 2500 1 15 3 1 2
207.5 3 2100 1 10 2 0 2
209.7 4 2200 1 19 2 1 2
294 2 2100 0 13 2 1 2.5
176.3 2 2000 1 17 3 0 2
294.3 7 2400 0 8 4 1 2
224 3 1900 1 6 1 1 2
125 2 1900 0 18 4 0 1.5
236.8 4 2600 1 17 5 1 2
164.1 4 2300 0 19 4 0 2
217.8 3 2500 0 12 3 0 2
192.2 2 2400 0 16 2 0 2.5
125.9 2 2400 0 28 1 0 1.5
220.9 2 2300 1 12 1 1 2
294.5 6 2700 0 15 3 1 2
244.6 2 2300 0 9 2 1 2.5
199 3 2500 1 18 1 0 1.5
240 4 2600 0 13 4 1 2
263.2 4 2300 0 14 3 1 2
188.1 2 1900 0 8 4 1 1.5
243.7 6 2700 0 7 4 1 2
221.5 4 2300 0 18 3 1 2
175 2 2500 0 11 3 0 2
253.2 3 2300 0 16 2 1 2
155.4 4 2400 1 16 3 0 2
186.7 5 2500 1 21 4 0 2.5
179 3 2400 1 10 4 1 2
188.3 6 2100 1 15 4 1 2
227.1 4 2900 0 8 4 1 2
173.6 4 2100 1 14 2 1 2.5
188.3 5 2300 0 20 5 0 1.5
310.8 8 2900 0 9 4 1 2
293.7 6 2400 0 11 5 1 3
179 3 2400 0 8 4 1 2
188.3 6 2100 1 14 2 1 2.5
227.1 4 2900 0 20 5 0 1.5
173.6 4 2100 0 9 4 1 2
188.3 5 2300 0 11 5 1 3
Histogram
20
15
Percent

10
5
0
0 0 0 0 0 0 0 0 0 0 0 0 0
12 14 16 18 20 22 24 26 28 30 32 34 36
Price

a) Shape: Unimodal, Skewed Positively


Center Most Prices fall between 180 and 200$, Half the prices are below 220,000$
On average prices tent to be around 230,000$
Spread: The data range between …

mean 221.103 the overall average price of……….is 221.103$


median 213.600 The average……Also, 50% of home prices
mode 188.300

Better to use median since the data are non-symetric

b) sample standard deviat 47.105 $ Price differs from one to another by 47.1 thousands $
minimum 125 $
maximum 345.3 $
range 220.3 $

Mean -2SD Mean+2SD


126.892048 315.313666 about 95% of home prices in… are between 126.892 & 315.313 thousands$
47.1 thousands $

892 & 315.313 thousands$

You might also like