P3 Travis Then Kai Hong FHCT1014 IDA

Download as docx, pdf, or txt
Download as docx, pdf, or txt
You are on page 1of 5

202201 MIDTERM TEST

ANSWER SHEETS

Part 1: To be completed by student

Name (as stated in Student Card) Student ID

Travis Then Kai Hong 2102363

Programme Faculty / Centre


Foundation In Science CFS-SL

Course Code and Course Name Submission Date

FHCT1014 Introduction to Data Analytics 30/03/2022

Part 2: For Examiner’s Use Only

MARK
QUESTION NUMBER
INTERNAL EXTERNAL

Q1

Q2

TOTAL
Midterm Test Declaration

DECLARATION STATEMENT

I Travis Then Kai Hong (Name), Student ID: 2102363


,
hereby solemnly and fully declare and confirm that during my programme of study at
Universiti Tunku Abdul Rahman, I shall abide and comply with all the rules, regulations
and lawful instructions of Universiti Tunku Abdul Rahman and endeavour at all times to
uphold the good name of the University.

I hereby declare that my submission for this Midterm Test is based on my original work,
not plagiarised from any source(s) except for citations and quotations which have been
duly acknowledged. I am fully aware that students who are suspected of violating this
pledge are liable to be referred to the Examination Disciplinary Committee of the
University.

Programme: Foundation in Science

(Digital) Signature:

Student’s I.C. / Passport No.: 020402-13-0675

Date of Submission: 30/03/2022

Page 2 of 5
Comments
and/or
No. Type / Insert pictures in this column only Mark
(Examiner
only)
Q1. (a) i) Quantitative
ii) Categorical
iii) Quantitative
iv) Quantitative
v) Categorical

Q1. (b) i) Volume: E-Fresh Market is able to handle many data


such as various groceries and services by many
customers due to the surge in online market.
Velocity: E-Fresh Market is able to handle many orders
coming at once and responds in real-time to process
purchases and to deliver them to the customers as
customers can order many times a day.
Variety: Text data such as feedbacks and ratings are
collected regarding products and services which are
then reviewed.

ii) The mode on products bought can be used by the


company to recommend products alike them. This can
help satisfy some customers in the case of the modal
product being out of stock.
Furthermore, E-Fresh Market can obtain the range on
the spending of the buyers. This helps the company to
recommend products which falls in between the range
thus increasing the likelihood of successful purchases.
Aside from that, E-Fresh Market can also collect
percentiles of product purchase. This can regulate the
stock number of said product to reduce loss of money

Q1. (c) i) The naming system has standardization errors with


surnames unnecessarily placed before the first name,
along with commas and non-alphabetical values.
The columns have no grid lines
The column named ‘Years with Company’ has error in
data with one row having a non-number value.
The column containing ‘Customer Accounts’ has
unstandardized decimal placing in the data.

ii) For ‘Customer Accounts’, standardize the decimal


placing of the data to 0.
Standardize the naming system of the salespeople and
remove non-name values.

Page 3 of 5
Replace non-number values for the Years with
Company.
Place proper gridlines for the table and columns.

Q2. (a)
i) 7.5-3.5= 4cm
The interquartile range lies between 5.5cm and 7.5cm.
The median is 6cm.
The upper end of the box plot indicates that the largest
rain is 7.5cm

ii) The 25th percentile is 7.5cm.


The 50th percentile is 6cm
The 75th percentile is 5.5cm.
75% of the amount of rain in Vietnam cities is closer to
the average amount of rain in comparison to 25%. The
data is unevenly distributed, favouring to lower amount
of rain.

Q2. (b) i) 1. Ensure the data is of quantitative data. The data must
be quantifiable and is not showcasing characteristics.
2. Ensure the data is used to show the distribution of
variables. The same variable is being distributed to
showcase the change in frequency of the variable.
3. Ensure that bins of the same width is used. This is to
ensure the range of the data can be grouped evenly.

ii) Mean = 12.19


StDev = 6.32
68% of the quality defects reported per hour have
quality defects between (12.19 – 6.32 = 5.87) and
(12.19 + 6.32 = 18.51).

Q2. (c)
i) Clustered Bar Chart. Easier and clearer to distinct the
percentages between the years.
ii) Blue Bird Shipping has the highest Current Year On-
Time Deliveries.
Urbanlink has the lowest Previous Year On-Time
Deliveries.
Urbanlink has the biggest increase in percentages from
previous to current year.
iii) Jajangmyun Logistics should use Tiger LLC. It is the
second most consistent carrier and has a higher on-time
delivery in comparison to the most consistent carrier.

Page 4 of 5
Q.2(d) i) Unstandardized data of different information for each
pie slice.
The chart is styled in a way that is hard-on-the-eyes to
read.

ii) The individual pie slice size should be based on the


percentages of its value to the total value.
The images used on the pie slices should be changed to
simple colours so that the data can be seen clearly.

Page 5 of 5

You might also like