0% found this document useful (0 votes)
42 views5 pages

Cae 2 Set 1 Original - KEY

1. The document provides information about a data science exam including course outcomes being assessed, exam parts and questions. 2. Part A contains 5 multiple choice questions assessing skills like defining statistical concepts such as percentile rank and degrees of freedom. 3. Part B has longer free response questions worth more marks, including explaining how to describe data with tables and frequency distributions for quantitative data. 4. Part C is a single longer question worth 14 marks explaining the normal curve, standard normal curve and how to calculate a z-score to standardize a variable.

Uploaded by

JANILA J.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
42 views5 pages

Cae 2 Set 1 Original - KEY

1. The document provides information about a data science exam including course outcomes being assessed, exam parts and questions. 2. Part A contains 5 multiple choice questions assessing skills like defining statistical concepts such as percentile rank and degrees of freedom. 3. Part B has longer free response questions worth more marks, including explaining how to describe data with tables and frequency distributions for quantitative data. 4. Part C is a single longer question worth 14 marks explaining the normal curve, standard normal curve and how to calculate a z-score to standardize a variable.

Uploaded by

JANILA J.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 5

Reg.No.

MarEphraemCollegeof EngineeringandTechnology,Elavuvilai
B.E. DEGREE CONTINUOUS ASSESSMENT EXAMINATION II– October 2022
ThirdSemester
Department of Computer Science and Engineering
CS3352 Foundations of Data Science
Time: 1.30hrs. Maximum:50marks
CourseOutcomes(COs)forAssessmentinthisExamination
CO3 Describe data with table, graphs and averages
CO4 Describe variability with standard deviation and normal standard curve
CL-CognitiveLevel;R-Remember;Un-Understand;Ap-Apply;An-Analyze;Ev-Evaluate;Cr-Create
PARTA– (5 X2=10 marks)
1. What is percentile rank? Un CO3
In statistics, the percentile rank (PR) of a given score is the percentage of scores
in its frequency distribution that are less than that score.
2. Give the steps for finding the mean. Ap CO3

1. Add up all the values in the data set.


2. Divide this number by the number of values.

3. Distinguish between histogram and bar graph. Un CO4


The bar graph is the graphical representation of
categorical data. A histogram is the graphical
representation of quantitative data. bar graph - There
is equal space between each pair of consecutive
bars. histogram - There is no space between the
consecutive bars.
4. Define degrees of freedom. R CO4
Degrees of freedom refers to the maximum number
of logically independent values, which are values that
have the freedom to vary, in the data sample.
Degrees of freedom is calculated by subtracting one
from the number of items within the data sample.
5. Give the instructions to find IQR and find the IQR for R CO4
1,3,4,1,0,2,5,8,0,2,3,4,7,11,0,2,3,4
 Step 1: Put the numbers in order.
1, 2, 5, 6, 7, 9, 12, 15, 18, 19, 27.
 Step 2: Find the median.
1, 2, 5, 6, 7, 9, 12, 15, 18, 19, 27.
 Step 3: Place parentheses around the numbers above and below the
median.
Not necessary statistically, but it makes Q1 and Q3 easier to spot.
(1, 2, 5, 6, 7), 9, (12, 15, 18, 19, 27).
 Step 4: Find Q1 and Q3
Think of Q1 as a median in the lower half of the data and think of Q3 as a
median for the upper half of data.
(1, 2, 5, 6, 7),  9, ( 12, 15, 18, 19, 27). Q1 = 5 and Q3 = 18.
 Step 5: Subtract Q1 from Q3 to find the interquartile range.
18 – 5 = 13.

PARTB– (2 X13 =26marks)


6.a Explain about describing data with tables for quantitative data. Ap CO1

Frequency Tables

All of the graphical methods shown in this section are derived


from frequency tables. Table 1 shows a frequency table for the
results of the iMac study; it shows the frequencies of the various
response categories. It also shows the relative frequencies,
which are the proportion of responses in each category. For
example, the relative frequency for “none” of 0.17 = 85/500.
Previous Ownership Frequency Relative Frequency

None 85 0.17

Windows 60 0.12

Macintosh 355 0.71

Total 500   1

Table 1. Frequency Table for the iMac Data.


Below is a table (Table 2) showing a hypothetical distribution of
scores on the Rosenberg Self-Esteem Scale for a sample of 40
college students. The Rosenburg Self-Esteem Scale is one way
to operationalize (define) self-esteem in a quantitative way. 
Participants rate each of the 10-items from strongly disagree to
strongly agree.  All items are then scored yielding an overall self-
esteem score that would be a numerical value to represent one’s
self-esteem.

Column one lists the values of the variable – the possible scores
on the Rosenberg scale
Column two lists the frequency of each score

Self-Esteem
Frequency
Scores

24 3
23 5

22 10

21 8

20 5

19 3

18 3

17 0

16 2

15 1

Table 2. Frequency Table for Rosenburg Self-Esteem Scale


Scores.

Table 2 shows that there were three students who had self-
esteem scores of 24, five who had self-esteem scores of 23, and
so on. From a frequency table like this, one can quickly see
several important aspects of a distribution, including the range of
scores (from 15 to 24), the most and least common scores (22
and 17, respectively), and any extreme scores that stand out
from the rest.
OR

6.b(i) The following frequency distribution shows the annual incomes in dollars for a group of Un CO1
college graduates.

a)Construct a histogram.
b)Construct a frequence polygon
c)Is this distribution balanced or lopsided.
a)histogram – 5marks
b)frequency polygon- 5marks
c)lopsided -3marks
6.b(ii) Construct a stem and display for the following IQ scores obtained from a group of four
year old children

Diagram 5 marks
7.a Calculate the sample standard deviation for the scores 13,10,11,7,9,11,9 using definition Ap CO1
and computation formula. Also give the computation sequence.
definition formula - 5marks
computation formula – 5 marks
computation sequence. – 3 marks
OR

7.b Explain about describing data with averages. Un CO1


Mean- 3marks
Median – 5marks
Mode – 5marks

PARTC– (1 X 14 =14marks)
8.a Explain in detail about normal curve, standard normal curve and z score Un CO2
A z-score is a measure of position that Z-score
indicates the number of standard
deviations a data value lies from the
Calculation:
mean. It is the horizontal scale of
a standard normal distribution.

The z-score is positive if the value lies


above the mean, and negative if it lies
• value being
below the mean. To calculate a z-score, examined x
the population mean and standard • population mean μ
deviation must be known. • population standard
    deviation σ

 Round z-scores to
two
decimal places.

Remember that z-scores tell us how far a value is from the


mean. When you "standardize" a variable, its mean becomes
zero and its standard deviation becomes one.

Preparedby Verifiedby

You might also like