DS342 - Data Analytics Midterm 2023-2024 Model 1
DS342 - Data Analytics Midterm 2023-2024 Model 1
Midterm Exam
Model 1
تعليمات هامة
• حيازة التليفون المحمول مفتوحا داخل لجنة اإل\متحان يعتبر حالة غش تستوجب العقاب وإذا كان ضرورى الدخول بالمحمول
.فيوضع مغلق فى الحقائب
.• ال يسمح بدخول سماعة األذن أو البلوتوث
.• اليسمح بدخول أي كتب أو مالزم أو أوراق داخل اللجنة والمخالفة تعتبر حالة غش
1
8. The limitation of covariance as a descriptive measure of association is that it
a. only captures positive relationships.
b. does not capture the units of the variables.
c. is very sensitive to the units of the variables.
d. is invalid if one of the variables is categorical.
9. Gender and states of residence are examples of ____ data.
a. Discrete b. Categorical c. Continuous d. Ordinal
10. The length of the box in the box plot portrays the
a. mean. b. median. c. range. d. interquartile range.
11. In the following MS Excel spreadsheet, you are given a list of 100 customers. Column A is
their names, B is for customer category, C for payment
category (0 means discounted price, 1 means full price),
and D indicates price that customers pay.
Which of the following formula correctly counts all
customers who are adults and get discounted price?
a. = countif (B2:B101, “ =Adult”, C2:C101, “ =0”)
b. = countif (B2:B101, =Adult, C2:C101, “ =0”)
c. = countifs (B2:B101, “ =Adult”, C2:C101, “ =0”)
d. = countifs (B2:B101, “ =0”, C2:C101, “ = Adult”)
2
20. All nominal data may be treated as ordinal data.
21. Correlation and covariance can be used to examine relationships between numerical
variables and categorical variables that have been coded numerically.
22. Relationships between two variables are more evident when counts are expressed
as percentages of row totals or column totals.
23. There are four quartiles that divide the values in a data set into four equal parts.
24. Unlike histograms, box plots depict only one aspect of a variable.
25. A distribution with a flattened peak has almost all its observations within three
standard deviations of the mean.
26. If you change the data, the chart will change simultaneously.
27. Strongly related variables may have a correlation close to zero if the relationship is
nonlinear.
28. We can use side-by-side boxplots to compare at most 2 distributions of numeric
data.
29. A PivotTable is comprised of three areas: ROW, COLUMN and DATA.
30. Data analysis includes data description, data visualization, data inference, and the
search for relationships in data.
31. An example of a joint category of two variables is the count of all non-drinkers
who are also nonsmokers.
32. Correlation is not useful for describing the strength and direction of linear
relationships.
33. Counts for a categorical variable are often expressed as percentages of the total.
34. The filters field of a pivot table contains the data that you want summarized.
35. When a formula is copied into another cell, the relative references in the formula
keep their relative positions.
Based upon the boxplots, does there seem to be reason to conclude that there is a
difference between the salaries of women and men in this plant?
a. True b. False
3
38. Approximately, how large must a male’s salary be to qualify as an outlier on the high
side?
a. $60,000 b. $70,000 c. $80,000 d. None of previous
39. Which shape is the distribution of annual salary for males that work at Marko
Manufacturing, Inc.
a. Left Skewed b. Right Skewed c. Symmetric
d. Cannot tell
40. Which shape is the distribution of annual salary for females that work at Marko
Manufacturing, Inc.
a. Left Skewed b. Right Skewed c. Symmetric
d. Cannot tell
41. The histogram below represents scores achieved by 250 job applicants on a personality
profile.
44. Seventy percent of the job applicants scored above what value?
a. 45 b. 40 c. 20 d. None of previous
Good Luck
Dr. Marwa Sabry