Assignment DataSet2
Assignment DataSet2
1. Draw a bar chart that shows the average salary for American and national teams.
2. Draw a bar chart that shows the size of stadium of each team, which team has
the largest and smallest stadium size .
3. Draw a bar chart that shows the salary and number of wins for each team,
determine which teams have salary over $80 Mil .
4. Draw a pie chart that shows the average number of errors for teams built in or
before 1990 and team built after 1990.
5. Draw a pie chart that shows the maximum number of wins for teams that has
natural surface and those who have artificial surface.
6. For the size, the salary, attendance, wins, ERA, Batting, and HR, find the mean,
median, mode, standard deviation, range, maximum, minimum.
7. Construct the frequency table for the categorical data: the league, surface.
8. Construct the frequency table for the continuous data: size, salary, attendance,
wins, ERA, HR, stolen, and errors. (hint, use suitable number of classes)
9. Calculate the 95% and 99% confidence interval for the average salary, average
number of attendance, average size, average number of wins, average error.
10. Test the hypothesis that the average number of wins is equal 90. Write the null
and alternative hypothesis in each case.
11. Test the hypothesis that the average salary is equal 65 million dollars
12. Test the hypothesis that “there is no significance between the average salary of
American and national teams”
13. Test the hypothesis that “there is no significance difference between the size of
the stadium that have natural surface and that have artificial surface “
14. Test the hypothesis that “there is no significance difference between the average
salary of the teams built before and after 1990”
15. Test the hypothesis that “there is no significant difference between the ERA of
the teams who win more than 85 times and the teams who win less than or
equal 85 times”.
16. Test the hypothesis that”there is no significant difference between the HR of the
teams built before 1950, teams built during 1950-1970, teams built during 1971-
1990, and teams built in 1991 and more”.
17. Test the hypothesis that ”there is no significant difference between the number
of wins of the teams of salary less than 70 millions and teams of salary greater
than or equal 70 millions”.
18. Test the hypothesis that ”there is no significant difference between the number
of wins of the teams of salary less than 40 millions, teams of salary between 40
and 60 millions, teams of salary between 60 and 80 millions, and teams of salary
greater than 80 million”.
X1 = Team
X2 = League (0 = national, 1 = American)
X3 = Built (Year Stadium Was Built)
X4 = Size (Stadium Capacity)
X5 = Salary (Total 2000 Team Salary $ Mil)
X6 = Attendance (Total 2000 Team Attendance)
X7 = Wins (Number of Wins in 2000)
X8 = ERA (Earned Run Average)
X9 = Batting (Team Batting Average)
X10 = HR (Number of Home Runs for the Team)
X11 = Surface (natural = 0, artificial = 1)
X12 = Stolen (Stolen Bases)
X13 = Errors (Team Errors)