0% found this document useful (0 votes)

45 views7 pages

Universiti Teknologi Mara Test: Confidential 1 CS/FEB 2022/UCS551

Uploaded by

Hakim Razak

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

45 views7 pages

Universiti Teknologi Mara Test: Confidential 1 CS/FEB 2022/UCS551

Uploaded by

Hakim Razak

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 7

CONFIDENTIAL 1 CS/FEB 2022/UCS551

UNIVERSITI TEKNOLOGI MARA TEST

COURSE : INTRODUCTION TO DATA ANALYTICS AND

APPLICATION
COURSE CODE : UCS551
EXAMINATION : FEB 2022
TIME : 3 HOURS
DO NOT TURN THIS PAGE UNTIL YOU ARE TOLD TO DO SO

This examination paper consists of 4 printed pages

© Hak Cipta Universiti Teknologi MARA CONFIDENTIAL

CONFIDENTIAL 2 CS/FEB 2022/UCS551

NAME: AHMAD HAKIM BIN ABDUL RAZAK

CLASS: LG2414A
ID NO: 2020959863

QUESTION 1

1. Briefly describe the term data analytics.

Data analytics is a process of inspecting, cleansing, transforming and modelling data

with the goal of discovering useful information, suggesting conclutions and
supporting decision-making.

(4 marks)

2. Explain FOUR (4) types of data analytics

i. descriptive analytics describes what has happened over a given period of time. Have
the number of views gone up? Are sales stronger this month than last?
ii. diagnostic analytics focuses more on why something happened. This involves more
diverse data inputs and a bit of hypothesizing. Did weather affect beer sales? Did the
latest marketing campaign impact sales?
iii. predictive analytics moves to what is likely going to happen in the near term. What
happened to sales last time we had a hot summer? How many weather models predict a
hot summer this year?
iv. prescriptive analytics moves into the territory of suggesting a course of action. If the
likelihood of a hot summer as measured as an average of these five wheather models is
above 58% then we should add an evening shift to the brewery and rent an additional
tank to increase output.

(8 marks)

3. List FOUR (4) types of data and provide ONE(1) example for each.
i. structure and unstructured: email, documents, images
ii. data structure: vector, array, matrix
iii. level of measurement: nominal, ordinal, interval
iv. univariate data: height

(8 marks)

4. Explain the difference between vector and array.

Vector is a collection of values that all have the same data type, in one-dimensional
array while array is a colloction of elements of the same type placed in contiguous
memory locations that can be individually referenced by using an index to a unique
indentifier.

(4 marks)

5. Describe FOUR (4) data processing techniques that can be used in processing the raw
data

i. Data cleaning. Data cleaning is the process where data gets cleaned. Data in the real
world is normally incomplete, noisy and inconsistent. The data available in data
sources might be lacking attributes values, data of interest etc. Data cleaning involves
number of techniques including filling in the missing values manually, combined
computer and human inspection etc. The output of data cleaning process is adequately
cleaned data.
ii. Data Transformation. Data transformation is the process of transforming and
consolidating the data into different forms that suitable for mining. Data transformation
normally involves normalization, aggregation, generalization etc. After data
transformation, the available data is ready for data mining.
iii. Data Sampling. Data sampling is a statistical analysis technique used to select,
manipulate and analyze a representative subset of data points to identify patterns and
trends in the larger data set being examined. It enables data scientists, preditive
modelers and other data analyst to work with a small, manageable amount of data about
a statistical population to build and run analytical models more quickly, while still
producing accurate findings.
iv. Data Sub-setting and manipulating. Subsetting is the process of retrieving just the
parts of large files which are of interest for a specific purpose. This occurs usually in a
client – server setting, where the extraction of the parts of interest occurs on the server
before the data is sent to the client over a network. The main purpose of subsetting is to
save bandwidth on the network and storage space on the client computer.
(8 marks)

6. Explain how to get the median for odd and even dataset.

Given a set of data, arrange the numbers in ascending order from smallest to largest. If
the number of observations is odd, the number in the middle of the list is the median.
This can be found by taking the value of the (n+1)/2 -th term, where n is the number
of observations. Else, If the number of observations is even, then the median is the
simple average of the middle two numbers. In calculation, the median is the simple
average of the n/2 -th and the (n/2+1)-th terms.

(4 marks)

7. Briefly explain the importance of histogram in data visualization.

A histogram provides a visual representation of the distribution of a dataset: location,

spread and skewness of the data. It also helps to visualize whether the distribution is
symmetric or skewed left or right. In addition, if it is unimodal, bimodal or
multimodal, it can also show any outliers or gaps in the data. Histograms also can
display a large amount of data and the frequency. The function will calculate and
return a frequency distribution. We can use it to get the frequency if values in a
dataset.

(4 marks)

8. List 4 types of AI application and give one example for each type.

i. Government- Public safety and utilities have a particular need for machine learning
since they have multiple sources of data that can be mined for insights.
ii. Financial Services- Banks and other business in the financial industry use machine
learning technology to identify important insights in data, and prevent fraud.
iii. HealthCare- wearable devices and sensors that can use data to assess the patient’s
health in real time.
iv. Oil and Gas- finding new energy sources. Analyzing minerals in the ground.
Predicting refinery sensor failure. Streamlining oil distribution to make it more
efficient and cost effective.

(12 marks)

9. Explain the concept of learning in machine learning.

Learning is one of the fundamental building block of AI solutions. Learning is a
process that improves the knowledge of an AI program by making observations about
its environment. AI learning process focused on processing a collection of input-
output pairs for specific function and predicts the output for new input.

(6 marks)

10. Briefly explain two differences between supervised learning and unsupervised
learning.

The first difference is supervised learning is a process of adjusting weights in a

neural net using learning algorithm while unsupervised learning produce the output
based of input data without labelled responses.
The second difference is supervised learning is designed to perform pattern
classification while unsupervised learning type uses clutter analysis which is used for
exploratory data analysis to find hidden patterns or grouping data.

(8 marks)

11. Describe how the classification task can be performed using a significant example for
this task.

This operator should be used for performance evaluation of only classification tasks.
Many other performance evaluation operators are also available in RapidMiner or
Performance operator, Performance (Binominal Classification) operator, Performance
(Regression) operator. The Performance (Classification) operator is used with
classification tasks only. On the other hand, the Performance operator automatically
determines the learning task type and calculates the most common criteria for that
type. You can use the Performance (User-Based) operator if you want to write your
own performance measure.

Classification is a technique used to predict group membership for data

instances. For example, you may wish to use classification to predict whether the train
on a particular day will be 'on time', 'late' or 'very late'. Predicting whether a number of
people on a particular event would be 'below- average', 'average' or 'above-average' is
another example. For evaluating the statistical performance of a classification model
the data set should be labeled i.e. it should have an attribute with label role and an
attribute with prediction role. The label attribute stores the actual observed values
whereas the prediction attribute stores the values of label predicted by the
classification model under discussion.
(10 marks)
12. Differentiate classification and clustering. ( Give TWO(2) differences)

Classification
i. The number of classes is known.
ii. Popular algorithms for classification include Naïve Bayes Classifier, Decision
Trees and Random Forests.
Clustering
i. The number of classes is unknown.
ii. Popular algorithms used for clustering include K-Means, Mean-Shift
Clustering, and Density-Based Spatial Clustering of Applications with Noise.

(8 marks)

13. Discuss how data analytics can be benefited to these areas:

a. Business
Analyzing data is broadly available at lower cost points. Data analytics can be
beneficial to business areas in order to use it in new levels, using information
technology to shore accurate, stable business experimentation that direct
decision makers and to examine outputs, business models, and regeneration in
customer experience sometimes. Finance establishments are strong
experimenters as well as principal ones who keep amend its methods for
segment credit card customers. Companies in various sectors have acquired
crucial insight from the structured data collected from different enterprise
systems and anatomized by commercial database management systems.

b. Medical
Data analytics in medical organizations can be beneficial to the community.
One of the benefits is that the disease can be detected at an early stage through
the analysis of such huge information and proper care and treatment can be
provided immediately in an effective way to an individual. Data analytics can
provide various measures to be taken to save expenditure in healthcare by the
people and to lead a healthy life by taking initial care through predictable
information. Other areas in which data analytics give enhanced profit are
identifying the patients who use maximum health resources and are at the
greatest risk for adverse outcomes.
(16 marks)

END OF QUESTION PAPER

Nature of Teaching and Teacher Roles
100% (5)
Nature of Teaching and Teacher Roles
23 pages
Always More Than One by Erin Manning
100% (4)
Always More Than One by Erin Manning
41 pages
70 20 10 Framework
100% (1)
70 20 10 Framework
20 pages
Expository Writing Checklist
No ratings yet
Expository Writing Checklist
3 pages
Development of A Program To Increase Personal Happiness: Michael W. Fordyce
No ratings yet
Development of A Program To Increase Personal Happiness: Michael W. Fordyce
11 pages
SEMIOTICS or SIGN LANGUAGE OUTLINE
100% (1)
SEMIOTICS or SIGN LANGUAGE OUTLINE
12 pages
Reviewer-Child and Adult Devpt
No ratings yet
Reviewer-Child and Adult Devpt
6 pages
Expert System
100% (1)
Expert System
54 pages
ED TM1 Trainers Methodology Level I
100% (6)
ED TM1 Trainers Methodology Level I
2 pages
Philosophy of Science & Historical Enquiry (John Losee)
No ratings yet
Philosophy of Science & Historical Enquiry (John Losee)
66 pages
EFL Adult Learners' Perception of Learning English Vocabulary Through Pictures at A Private English Center
No ratings yet
EFL Adult Learners' Perception of Learning English Vocabulary Through Pictures at A Private English Center
8 pages
Memory Distortion
No ratings yet
Memory Distortion
8 pages
Learning About Cause and Effect
No ratings yet
Learning About Cause and Effect
3 pages
Osho Active Meditation Creativity Biography News Contact Ebook Home Page
No ratings yet
Osho Active Meditation Creativity Biography News Contact Ebook Home Page
4 pages
Thick Description
No ratings yet
Thick Description
13 pages
The Subtle Art of Not Giving a F*ck: A Counterintuitive Approach to Living a Good Life
From Everand
The Subtle Art of Not Giving a F*ck: A Counterintuitive Approach to Living a Good Life
Mark Manson
4/5 (6458)
The Effect of Pre-Reading Activities On The Reading Comprehension Performance of Ilami High School Students
No ratings yet
The Effect of Pre-Reading Activities On The Reading Comprehension Performance of Ilami High School Students
7 pages
CCM 211 Topic 4 Notes-Modes of Speech Delivery
No ratings yet
CCM 211 Topic 4 Notes-Modes of Speech Delivery
10 pages
MGT 201 Final Term Paper
No ratings yet
MGT 201 Final Term Paper
21 pages
Icelt Syllabus
100% (1)
Icelt Syllabus
41 pages
b6 Week8 Notes Term 3
No ratings yet
b6 Week8 Notes Term 3
13 pages
Evaluation Form
No ratings yet
Evaluation Form
2 pages
Introduction To Professional Development
No ratings yet
Introduction To Professional Development
2 pages
Principles: Life and Work
From Everand
Principles: Life and Work
Ray Dalio
4/5 (648)
Foreign Language Anxiety and English Medium Instruction Classrooms: An Introduction
No ratings yet
Foreign Language Anxiety and English Medium Instruction Classrooms: An Introduction
10 pages
The Gifts of Imperfection: Let Go of Who You Think You're Supposed to Be and Embrace Who You Are
From Everand
The Gifts of Imperfection: Let Go of Who You Think You're Supposed to Be and Embrace Who You Are
Brené Brown
4/5 (1175)
78721-Article Text-184555-1-10-20230426
No ratings yet
78721-Article Text-184555-1-10-20230426
5 pages
Harley & Hart
No ratings yet
Harley & Hart
3 pages
Never Split the Difference: Negotiating As If Your Life Depended On It
From Everand
Never Split the Difference: Negotiating As If Your Life Depended On It
Chris Voss
4.5/5 (1005)
612le 13 SC Identifypurpose SC
No ratings yet
612le 13 SC Identifypurpose SC
3 pages
Emma Pavydis - Gr-4-Goal Setting Menu
No ratings yet
Emma Pavydis - Gr-4-Goal Setting Menu
2 pages
DRTA
No ratings yet
DRTA
8 pages
Cycles of Decision and Learning
No ratings yet
Cycles of Decision and Learning
28 pages
The Glass Castle: A Memoir
From Everand
The Glass Castle: A Memoir
Jeannette Walls
4.5/5 (1856)
Grit: The Power of Passion and Perseverance
From Everand
Grit: The Power of Passion and Perseverance
Angela Duckworth
4/5 (650)
Sing, Unburied, Sing: A Novel
From Everand
Sing, Unburied, Sing: A Novel
Jesmyn Ward
4/5 (1267)
The Perks of Being a Wallflower
From Everand
The Perks of Being a Wallflower
Stephen Chbosky
4.5/5 (4103)
Her Body and Other Parties: Stories
From Everand
Her Body and Other Parties: Stories
Carmen Maria Machado
4/5 (903)
AI Consultant Resume
No ratings yet
AI Consultant Resume
1 page
Shoe Dog: A Memoir by the Creator of Nike
From Everand
Shoe Dog: A Memoir by the Creator of Nike
Phil Knight
4.5/5 (629)
Hidden Figures: The American Dream and the Untold Story of the Black Women Mathematicians Who Helped Win the Space Race
From Everand
Hidden Figures: The American Dream and the Untold Story of the Black Women Mathematicians Who Helped Win the Space Race
Margot Lee Shetterly
4/5 (1022)
The Hard Thing About Hard Things: Building a Business When There Are No Easy Answers
From Everand
The Hard Thing About Hard Things: Building a Business When There Are No Easy Answers
Ben Horowitz
4.5/5 (361)
Steve Jobs
From Everand
Steve Jobs
Walter Isaacson
4.5/5 (1139)
Elon Musk: Tesla, SpaceX, and the Quest for a Fantastic Future
From Everand
Elon Musk: Tesla, SpaceX, and the Quest for a Fantastic Future
Ashlee Vance
4.5/5 (582)
The Emperor of All Maladies: A Biography of Cancer
From Everand
The Emperor of All Maladies: A Biography of Cancer
Siddhartha Mukherjee
4.5/5 (298)
A Man Called Ove: A Novel
From Everand
A Man Called Ove: A Novel
Fredrik Backman
4.5/5 (5181)
Angela's Ashes: A Memoir
From Everand
Angela's Ashes: A Memoir
Frank McCourt
4.5/5 (943)
The Yellow House: A Memoir (2019 National Book Award Winner)
From Everand
The Yellow House: A Memoir (2019 National Book Award Winner)
Sarah M. Broom
4/5 (100)
Brooklyn: A Novel
From Everand
Brooklyn: A Novel
Colm Toibín
3.5/5 (2141)
Devil in the Grove: Thurgood Marshall, the Groveland Boys, and the Dawn of a New America
From Everand
Devil in the Grove: Thurgood Marshall, the Groveland Boys, and the Dawn of a New America
Gilbert King
4.5/5 (280)
The World Is Flat 3.0: A Brief History of the Twenty-first Century
From Everand
The World Is Flat 3.0: A Brief History of the Twenty-first Century
Thomas L. Friedman
3.5/5 (2289)
A Tree Grows in Brooklyn
From Everand
A Tree Grows in Brooklyn
Betty Smith
4.5/5 (2033)
The Art of Racing in the Rain: A Novel
From Everand
The Art of Racing in the Rain: A Novel
Garth Stein
4/5 (4372)
The Little Book of Hygge: Danish Secrets to Happy Living
From Everand
The Little Book of Hygge: Danish Secrets to Happy Living
Meik Wiking
3.5/5 (464)
Yes Please
From Everand
Yes Please
Amy Poehler
4/5 (2016)
The Woman in Cabin 10
From Everand
The Woman in Cabin 10
Ruth Ware
3.5/5 (2814)
Bad Feminist: Essays
From Everand
Bad Feminist: Essays
Roxane Gay
4/5 (1090)
The Sympathizer: A Novel (Pulitzer Prize for Fiction)
From Everand
The Sympathizer: A Novel (Pulitzer Prize for Fiction)
Viet Thanh Nguyen
4.5/5 (141)
Wolf Hall: A Novel
From Everand
Wolf Hall: A Novel
Hilary Mantel
4/5 (4135)
A Heartbreaking Work Of Staggering Genius: A Memoir Based on a True Story
From Everand
A Heartbreaking Work Of Staggering Genius: A Memoir Based on a True Story
Dave Eggers
3.5/5 (233)
Manhattan Beach: A Novel
From Everand
Manhattan Beach: A Novel
Jennifer Egan
3.5/5 (919)
The Outsider: A Novel
From Everand
The Outsider: A Novel
Stephen King
4/5 (2885)
Team of Rivals: The Political Genius of Abraham Lincoln
From Everand
Team of Rivals: The Political Genius of Abraham Lincoln
Doris Kearns Goodwin
4.5/5 (244)
Rise of ISIS: A Threat We Can't Ignore
From Everand
Rise of ISIS: A Threat We Can't Ignore
Jay Sekulow
3.5/5 (144)
The Light Between Oceans: A Novel
From Everand
The Light Between Oceans: A Novel
M.L. Stedman
4.5/5 (815)
The Unwinding: An Inner History of the New America
From Everand
The Unwinding: An Inner History of the New America
George Packer
4/5 (45)
Fear: Trump in the White House
From Everand
Fear: Trump in the White House
Bob Woodward
3.5/5 (836)
John Adams
From Everand
John Adams
David McCullough
4.5/5 (2546)
On Fire: The (Burning) Case for a Green New Deal
From Everand
On Fire: The (Burning) Case for a Green New Deal
Naomi Klein
4/5 (78)
The Constant Gardener: A Novel
From Everand
The Constant Gardener: A Novel
John le Carré
4/5 (278)
Little Women
From Everand
Little Women
Louisa May Alcott
4.5/5 (2369)

Universiti Teknologi Mara Test: Confidential 1 CS/FEB 2022/UCS551

Uploaded by

Universiti Teknologi Mara Test: Confidential 1 CS/FEB 2022/UCS551

Uploaded by

CONFIDENTIAL 1 CS/FEB 2022/UCS551

UNIVERSITI TEKNOLOGI MARA TEST

COURSE : INTRODUCTION TO DATA ANALYTICS AND

This examination paper consists of 4 printed pages

© Hak Cipta Universiti Teknologi MARA CONFIDENTIAL

CONFIDENTIAL 2 CS/FEB 2022/UCS551

NAME: AHMAD HAKIM BIN ABDUL RAZAK

1. Briefly describe the term data analytics.

Data analytics is a process of inspecting, cleansing, transforming and modelling data

2. Explain FOUR (4) types of data analytics

4. Explain the difference between vector and array.

7. Briefly explain the importance of histogram in data visualization.

A histogram provides a visual representation of the distribution of a dataset: location,

9. Explain the concept of learning in machine learning.

The first difference is supervised learning is a process of adjusting weights in a

Classification is a technique used to predict group membership for data

13. Discuss how data analytics can be benefited to these areas:

END OF QUESTION PAPER

You might also like