COMP1942 Final Exam Questions (2019)
Instructions:
(1) Please answer all questions in the answer sheet.
(2) You can use a calculator.
COMP1942 Question Paper
Q1 (20 Marks)
(a) Given a dataset with the following transactions in binary format, where the support threshold is 2.
R A Y M N
0 1 1 0 0
0 1 1 0 1
1 0 0 0 0
1 1 1 0 1
1 0 1 0 1
After we perform the join step and the prune step in the Apriori algorithm, we obtain a set C of itemsets.
Then, we need to do the counting step for C (i.e., we need to find the frequency of each itemset in C).
Finally, we output all itemsets in C whose frequency is at least the support threshold as part of the final
output. Why do we need to do the counting step? That is, why can we not simply output C as part of the
final output? You can use the above dataset for illustration.
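As an illustration of why the counting step matters, the sketch below runs the join step on the dataset above (the prune step removes nothing here, since every 1-subset of each candidate is frequent) and then checks each candidate's actual frequency; one candidate survives join and prune but is not frequent.

```python
from itertools import combinations

# Transactions from the table above (the items with value 1 in each row).
transactions = [
    {"A", "Y"},
    {"A", "Y", "N"},
    {"R"},
    {"R", "A", "Y", "N"},
    {"R", "Y", "N"},
]
min_support = 2

def support(itemset):
    return sum(1 for t in transactions if itemset <= t)

# Frequent 1-itemsets (M is dropped: it never appears).
L1 = [frozenset([i]) for i in sorted({i for t in transactions for i in t})
      if support(frozenset([i])) >= min_support]

# Join step: combine frequent 1-itemsets into candidate 2-itemsets.
C2 = [a | b for a, b in combinations(L1, 2)]

# Counting step: without it, every candidate in C2 would wrongly be output.
L2 = [c for c in C2 if support(c) >= min_support]
infrequent = [set(c) for c in C2 if support(c) < min_support]
print(infrequent)  # {A, R} survives join and prune but has support 1 < 2
```

So C (here C2) can contain itemsets, such as {A, R}, whose every proper subset is frequent but which are not themselves frequent; only counting can filter them out.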
(b) [Figure: a tree with node counts d:8, k:5, a:3, f:2 and item labels k, a, f]
Q2 (20 Marks)
The Bayesian Belief Network has four nodes: Smoke (S), Asthma (A), Lung Cancer (LC) and P, where S is the parent of A, and A is the parent of both LC and P. Its conditional probability tables are:
P(S = Yes) = 0.4
P(A = Yes | S = Yes) = 0.35, P(A = Yes | S = No) = 0.25
P(LC = Yes | A = Yes) = 0.85, P(LC = Yes | A = No) = 0.2
P(P = Yes | A = Yes) = 0.75, P(P = Yes | A = No) = 0.3
Please use the Bayesian classifier with the use of this Bayesian Belief Network to predict whether a given
person is likely to smoke.
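The person's observed evidence did not survive in this copy of the paper, so the sketch below assumes, purely for illustration, that LC = Yes and P = Yes are observed. It computes the posterior P(S = Yes | evidence) by enumerating over the hidden variable A, which is the mechanics the question asks for.

```python
# CPTs from the network above: S -> A -> {LC, P}.
p_s = {True: 0.4, False: 0.6}
p_a_given_s = {True: 0.35, False: 0.25}   # P(A = Yes | S)
p_lc_given_a = {True: 0.85, False: 0.2}   # P(LC = Yes | A)
p_p_given_a = {True: 0.75, False: 0.3}    # P(P = Yes | A)

def joint(s, a, lc, p):
    """P(S=s, A=a, LC=lc, P=p) by the chain rule over the network."""
    pr = p_s[s]
    pr *= p_a_given_s[s] if a else 1 - p_a_given_s[s]
    pr *= p_lc_given_a[a] if lc else 1 - p_lc_given_a[a]
    pr *= p_p_given_a[a] if p else 1 - p_p_given_a[a]
    return pr

# Assumed evidence (hypothetical): LC = Yes and P = Yes.
num = sum(joint(True, a, True, True) for a in (True, False))
den = sum(joint(s, a, True, True) for s in (True, False) for a in (True, False))
posterior = num / den   # P(S = Yes | LC = Yes, P = Yes)
print(round(posterior, 4))
```

Under this assumed evidence the posterior is below 0.5, so the prediction would be S = No; with the question's actual evidence the same enumeration applies.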
Q3 (20 Marks)
(a) We know how to compute the impurity measurement of an attribute A under the ID3 decision tree,
denoted by Imp-ID3(A). We also know how to compute the impurity measurement of an attribute A
under the CART decision tree, denoted by Imp-CART(A). Consider two attributes A and B. Is it always
true that if Imp-CART(A) > Imp-CART(B), then Imp-ID3(A) > Imp-ID3(B)? If yes, please show that it
is true. Otherwise, please give a counter example showing that this is not true and then explain it.
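For reference, ID3 uses entropy and CART uses the Gini index, each weighted by branch size. The sketch below only illustrates how the two measurements are computed, on hypothetical class counts that are not taken from any table in this paper; it is not the answer to the question.

```python
from math import log2

def entropy(counts):
    """Entropy impurity used by ID3 (in bits)."""
    n = sum(counts)
    return -sum(c / n * log2(c / n) for c in counts if c > 0)

def gini(counts):
    """Gini impurity used by CART."""
    n = sum(counts)
    return 1 - sum((c / n) ** 2 for c in counts)

def weighted(impurity, branches):
    """Impurity of an attribute: branch impurities weighted by branch size."""
    total = sum(sum(b) for b in branches)
    return sum(sum(b) / total * impurity(b) for b in branches)

# Hypothetical splits: attribute A sends the class counts to branches
# [3, 1] and [1, 3]; attribute B sends them to [2, 2] and [4, 0].
split_A = [[3, 1], [1, 3]]
split_B = [[2, 2], [4, 0]]
print(weighted(entropy, split_A), weighted(entropy, split_B))
print(weighted(gini, split_A), weighted(gini, split_B))
```

Whether the two measurements always rank attributes in the same order is exactly what the question asks you to decide.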
(b) In XLMiner, suppose that we want to perform k-means clustering. We have to specify some parameters
in the following input dialog box shown here (and other input dialog boxes not shown here). Note that
there is an unknown number “A” and another unknown number “B” in the following input dialog box.
Both number “A” and number “B” are inputted by a user.
After that, we execute XLMiner on Raymond's machine and obtain the following output O1 from
XLMiner.
Q4 (20 Marks)
(a) In class, we learnt "Sequential K-means Clustering" and "Forgetful Sequential K-means Clustering".
In what scenario or application is "Forgetful Sequential K-means Clustering" better used
compared with "Sequential K-means Clustering"?
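As a reminder of the difference, the sketch below contrasts the two update rules on a 1-dimensional stream with a single centre. The sequential rule absorbs a new point x with step 1/n (all past points weigh equally), while the forgetful rule uses a fixed rate a, so old points decay geometrically; the constant a = 0.3 below is an arbitrary choice for illustration.

```python
# Sequential k-means: centre m keeps a count n; step size shrinks as 1/n.
def sequential_update(m, n, x):
    n += 1
    m += (x - m) / n
    return m, n

# Forgetful variant: a fixed rate a in (0, 1); old points fade away.
def forgetful_update(m, x, a=0.3):
    return m + a * (x - m)

stream = [1.0, 1.2, 0.8, 1.1, 9.0, 9.2, 8.8]   # the distribution jumps mid-stream

m_seq, n = stream[0], 1
m_for = stream[0]
for x in stream[1:]:
    m_seq, n = sequential_update(m_seq, n, x)
    m_for = forgetful_update(m_for, x)

print(m_seq, m_for)  # the forgetful centre follows the jump more closely
```

The contrast in the final centres hints at the kind of scenario the question is after.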
(b) Consider eight data points.
The following matrix shows the pairwise distances between any two points.
     1   2   3   4   5   6   7   8
1    0
2   11   0
3    5  13   0
4   12   2  14   0
5    7  17   1  18   0
6   13   4  15   5  20   0
7    9  15  12  16  15  19   0
8   11  20  12  21  17  22  30   0
Please use the agglomerative approach with the group average linkage to group these points.
Draw the corresponding dendrogram for the clustering. You are required to specify the distance metric
in the dendrogram.
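The procedure can be checked mechanically: repeatedly merge the two clusters with the smallest group-average distance, recording the merge distance at each step. The sketch below runs this on the matrix above; each printed line is one level of the dendrogram.

```python
from itertools import combinations

# Distances from the matrix above, keyed by (smaller id, larger id).
d = {
    (1, 2): 11, (1, 3): 5, (1, 4): 12, (1, 5): 7, (1, 6): 13, (1, 7): 9, (1, 8): 11,
    (2, 3): 13, (2, 4): 2, (2, 5): 17, (2, 6): 4, (2, 7): 15, (2, 8): 20,
    (3, 4): 14, (3, 5): 1, (3, 6): 15, (3, 7): 12, (3, 8): 12,
    (4, 5): 18, (4, 6): 5, (4, 7): 16, (4, 8): 21,
    (5, 6): 20, (5, 7): 15, (5, 8): 17,
    (6, 7): 19, (6, 8): 22,
    (7, 8): 30,
}

def avg_link(c1, c2):
    """Group average linkage: mean pairwise distance between two clusters."""
    pairs = [(min(a, b), max(a, b)) for a in c1 for b in c2]
    return sum(d[p] for p in pairs) / len(pairs)

clusters = [frozenset([i]) for i in range(1, 9)]
merges = []                        # (cluster 1, cluster 2, merge distance)
while len(clusters) > 1:
    c1, c2 = min(combinations(clusters, 2), key=lambda p: avg_link(*p))
    merges.append((set(c1), set(c2), avg_link(c1, c2)))
    clusters = [c for c in clusters if c not in (c1, c2)] + [c1 | c2]

for c1, c2, h in merges:           # one line per dendrogram level
    print(sorted(c1), sorted(c2), h)
```

The recorded merge distances are exactly the heights you would label on the dendrogram.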
Q5 (20 Marks)
(a) An insurance company is given a table with five input attributes, namely Race, Gender, Married, Income
and Child, and one target attribute, namely Insurance. Based on this table, the insurance company
constructed three classifiers based on different criteria, namely Classifier 1, Classifier 2 and Classifier 3.
Classifier 1
  root (split on Income)
    income = high → split on Child
      child = yes → Prediction: 100% Yes, 0% No
      child = no → Prediction: 0% Yes, 100% No
    income = low → Prediction: 0% Yes, 100% No

Classifier 2
  root (split on Gender)
    gender = male → split on Race
      race = white → Prediction: 0% Yes, 100% No
      race = black → Prediction: 100% Yes, 0% No
    gender = female → Prediction: 100% Yes, 0% No

Classifier 3
  root (split on Income)
    income = high → split on Married
      married = yes → Prediction: 100% Yes, 0% No
      married = no → Prediction: 0% Yes, 100% No
    income = low → Prediction: 100% Yes, 0% No
Consider a group of these 3 classifiers, called an "ensemble", as studied in class. Consider a new customer.
All input attribute values of this new customer are known to the insurance company. The company uses
this ensemble to do the prediction and predicts that this new customer will not buy an insurance policy.
Suppose that we are very curious about the input attribute values of this new customer. All we know about
the new customer is that he or she has a low income and that the predicted result is that this new customer
will not buy an insurance policy. We also know the 3 exact classifiers used by the insurance company. Is it
possible for us to find some of the input attribute values of this customer? If yes, please state (1) all the
input attribute values that can be found and (2) all input attribute(s) whose values cannot be found.
Otherwise, please write down the reason why we could not find those values.
(b) Consider that we want to conduct an experiment on a particular chemical. We want to test whether this
chemical has any reaction with a fixed amount of another chemical when the temperature is kept at a
certain value and the weight of this chemical is adjusted to another certain value. The following table
shows the experimental results. This table contains 2 numeric attributes, namely temperature and weight,
and one binary attribute, namely react. Each record in the following table corresponds to a chemical test.
We want to predict whether the chemical will have any reaction when the temperature is equal to 10 and
the weight of this chemical is equal to 4. Suppose that we use a 3-nearest-neighbor classifier and adopt
the Euclidean distance as the distance measure between two given points/records. What is the prediction?
Please write down the prediction (i.e., Yes or No) and the record IDs of the corresponding 3 nearest
neighbors.
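The table of experimental results is not reproduced in this copy, so the records below are hypothetical placeholders; what matters is the procedure: rank the records by Euclidean distance to the query point (temperature 10, weight 4), take the 3 nearest, and vote.

```python
from math import dist  # Euclidean distance (Python 3.8+)

# Hypothetical records: (record ID, temperature, weight, react).
records = [
    (1, 8.0, 3.0, "Yes"),
    (2, 12.0, 5.0, "Yes"),
    (3, 30.0, 4.0, "No"),
    (4, 9.0, 10.0, "No"),
    (5, 11.0, 3.5, "Yes"),
]
query = (10.0, 4.0)

# Rank records by Euclidean distance to the query point.
ranked = sorted(records, key=lambda r: dist((r[1], r[2]), query))
neighbors = ranked[:3]

# Majority vote among the 3 nearest neighbors.
votes = [r[3] for r in neighbors]
prediction = max(set(votes), key=votes.count)
print([r[0] for r in neighbors], prediction)
```

With the question's actual table, the same two steps give the required record IDs and the Yes/No prediction.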
Q6 (20 Marks)
Consider the following table with three attributes, where "No. of Phones" and "No. of Laptops" are input
attributes and "Buy_NintendoSwitch" is the target attribute. Each tuple corresponds to a customer. The
attribute "Record ID" denotes the ID of each record.
Record ID  No. of Phones  No. of Laptops  Buy_NintendoSwitch
1 0 0 No
2 0 1 No
3 1 0 No
4 1 1 Yes
(a) Rewrite the above table such that the values "Yes" and "No" in attribute "Buy_NintendoSwitch" are
mapped to values 1 and 0, respectively.
(b) Consider a neural network containing a single neuron where x1 = "No. of Phones", x2 = "No. of
Laptops" and y = "Buy_NintendoSwitch".
[Figure: a single neuron with inputs x1 and x2, weights w1 and w2, and output y]
Initially, we set the values of w1, w2 and b to be 0.1 where b is a bias value in the neuron.
What are the final values of w1, w2 and b after training on the instances above?
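The copy of the paper does not state the activation function or learning rate, so the sketch below assumes a step activation (fire iff the weighted sum exceeds 0) and a learning rate of 0.1, and applies the perceptron learning rule to the four records (which encode the AND function) until the weights stabilize.

```python
# Training data from the table, with Yes/No mapped to 1/0 as in part (a).
data = [((0, 0), 0), ((0, 1), 0), ((1, 0), 0), ((1, 1), 1)]

w1, w2, b = 0.1, 0.1, 0.1       # initial values given in the question
lr = 0.1                         # assumed learning rate (not stated here)

def predict(x1, x2):
    # Assumed step activation: output 1 iff the weighted sum exceeds 0.
    return 1 if w1 * x1 + w2 * x2 + b > 0 else 0

# Perceptron learning rule; AND is linearly separable, so this converges.
for _ in range(20):
    for (x1, x2), y in data:
        err = y - predict(x1, x2)
        w1 += lr * err * x1
        w2 += lr * err * x2
        b += lr * err

print(w1, w2, b)
assert all(predict(x1, x2) == y for (x1, x2), y in data)
```

The specific final values depend on the assumed activation and learning rate; the question's own conventions from class should be substituted in.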
Q7 (20 Marks)
Suppose that c is a positive real number whose exact value we do not know.
(a) Consider the four 2-dimensional data points:
We can make use of PCA for dimensionality reduction. In dimensionality reduction, given an
L-dimensional data point, we want to transform this point to a K-dimensional data point, where K < L,
such that the information loss during the transformation is minimized. Suppose that L = 2 and K = 1.
Please illustrate with the above example.
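The four data points (which involve the unknown constant c) are not reproduced in this copy, so the sketch below uses hypothetical points; it walks through the PCA steps for L = 2 and K = 1: centre the data, form the 2x2 covariance matrix, take the leading eigenvector, and project each point onto it.

```python
# Hypothetical 2-D points (not the question's actual points).
points = [(1.0, 2.1), (2.0, 3.9), (3.0, 6.2), (4.0, 7.8)]
n = len(points)

# Step 1: centre the data at the mean.
mx = sum(x for x, _ in points) / n
my = sum(y for _, y in points) / n
centred = [(x - mx, y - my) for x, y in points]

# Step 2: 2x2 covariance matrix [[sxx, sxy], [sxy, syy]].
sxx = sum(x * x for x, _ in centred) / n
syy = sum(y * y for _, y in centred) / n
sxy = sum(x * y for x, y in centred) / n

# Step 3: leading eigenvalue of a symmetric 2x2 matrix (quadratic formula)
# and its eigenvector, i.e. the first principal component.
lam = (sxx + syy + ((sxx - syy) ** 2 + 4 * sxy ** 2) ** 0.5) / 2
vx, vy = sxy, lam - sxx          # eigen-direction; valid when sxy != 0
norm = (vx * vx + vy * vy) ** 0.5
vx, vy = vx / norm, vy / norm

# Step 4: each 2-D point becomes one coordinate along that direction.
transformed = [x * vx + y * vy for x, y in centred]
print(transformed)
```

The variance of the transformed coordinates equals the leading eigenvalue, which is why this direction minimizes the information loss.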
(b) Consider the four 2-dimensional data points:
We can make use of PCA for dimensionality reduction. In dimensionality reduction, given an
L-dimensional data point, we want to transform this point to a K-dimensional data point, where K < L,
such that the information loss during the transformation is minimized. Suppose that L = 2 and K = 1.
Can we make use of the answers in part (a) to perform the dimensionality reduction? If yes, please write
down each transformed data point. If no, please write down the reasons why we cannot make use of the
answers of part (a).
(c) Consider the four 2-dimensional data points:
We can make use of PCA for dimensionality reduction. In dimensionality reduction, given an
L-dimensional data point, we want to transform this point to a K-dimensional data point, where K < L,
such that the information loss during the transformation is minimized. Suppose that L = 2 and K = 1.
Can we make use of the answers in part (a) to perform the dimensionality reduction? If yes, please write
down each transformed data point. If no, please write down the reasons why we cannot make use of the
answers of part (a).
Q8 (20 Marks)
We are given the following adjacency matrix according to four sites, namely p, q, r and s.
    p  q  r  s
p   1  1  1  0
q   1  0  0  1
r   0  1  1  0
s   0  1  0  0
(a) Is it possible to find the corresponding stochastic matrix? If yes, write down the stochastic matrix.
Otherwise, please explain why not.
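One common convention (the one used in PageRank; whether your course normalizes rows or columns may differ, so treat this as an assumption) is to set entry (i, j) of the stochastic matrix to 1/outdeg(j) when site j links to site i, so that every column sums to 1. A sketch on the matrix above, reading each row as the out-links of that site:

```python
sites = ["p", "q", "r", "s"]
A = [
    [1, 1, 1, 0],   # links out of p
    [1, 0, 0, 1],   # links out of q
    [0, 1, 1, 0],   # links out of r
    [0, 1, 0, 0],   # links out of s
]

n = len(A)
outdeg = [sum(row) for row in A]

# M[i][j] = 1/outdeg(j) if j links to i, else 0; columns then sum to 1.
M = [[(A[j][i] / outdeg[j]) if outdeg[j] else 0.0 for j in range(n)]
     for i in range(n)]

for name, row in zip(sites, M):
    print(name, [round(v, 3) for v in row])
```

Note that the construction only fails when some site has no out-link at all (a zero row), which is the case part (a) asks you to check for.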
(b) We are given the following 12 webpages, namely w1, w2, …, w12.
[Figure: a web graph over the 12 pages; the visible labels are the names Raymond Chan, Linfei Chen and
Hao Liu and the pages w1, w2, ..., w9]
The query terms typed by the user are "Raymond" and "Wong".
(i) What is the root set in this query? Please list the webpages in this set.
(ii) What is the base set in this query? Please list the webpages in this set.
Q9 (20 Marks)
In class, we were given the following table describing the scenario that "parts are bought from
suppliers and then sold to customers at a sale price SP".
Then, we could answer a query like "for each customer, find the sum of the sale prices (SP) (i.e., find
SUM(SP))". Suppose the total number of records in the output of this query is 0.1M. In class, we learnt
that we represent this query and its output by "c 0.1M". We also learnt how to derive (or obtain) the output
of query "c" from the output of query "sc", and how to construct the following graph (or figure) based on
this derivation. Suppose that we materialize the outputs of all queries.
psc 6M
pc 4M ps 0.8M sc 2M
none 1
(a) In this question, consider another scenario that we are given the following transactions. Suppose that the
support threshold is equal to 1.
A B C D
1 1 0 1
1 1 0 1
0 0 1 0
1 1 1 1
1 0 1 0
Now, we would like to answer a query like "find all frequent itemsets such that each frequent itemset
contains at least one item from the set {A, B, D}". Two examples in this output are {A, B} and {B, C}, but {C}
is not in this output. Note that the frequency of each frequent itemset is not needed in the output of this
query. Suppose that the total number of frequent itemsets in this output is a number x (to be found by you in
this question). Then, similar to what we learnt in class, we represent this query and its output by "{A, B,
D} x". Based on the concept we learnt in class, we could derive (or obtain) the output of query {A, D} from
the output of query {A, B, D}. We could construct the following graph due to this derivation. Note that each
variable x (with a subscript) in the following corresponds to a number to be found by you in this question.
Suppose that we materialize the outputs of all queries.
{A, B, C, D} xABCD
{A, B, C} xABC   {A, B, D} xABD   {A, C, D} xACD   {B, C, D} xBCD
{A, B} xAB   {A, C} xAC   {A, D} xAD   {B, C} xBC   {B, D} xBD   {C, D} xCD
{A} xA   {B} xB   {C} xC   {D} xD
none xnone
(i) Please state all frequent itemsets (i.e., query “{A, B, C, D}”).
You are not required to give the frequency of each itemset.
(ii) Please find the value of each variable x (e.g., xABCD, xABC and xnone).
(iii) Assume that we do not consider “none xnone”. Now, suppose that 4 views (instead of all views) are to
be materialized (other than the top view). Apply the greedy algorithm and find the resulting views.
(Note: For each iteration/selection in the greedy algorithm, if there are ties, just pick the query in the
lexicographical order (or alphabetical order) (e.g., {A, B} is ordered before {A, C} in this ordering)).
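As a reminder of the mechanics, the greedy algorithm repeatedly materializes the view with the largest benefit, where the benefit of a view v sums, over every view w derivable from v, the saving max(0, cost of w's current cheapest materialized ancestor minus the size of v). The sketch below runs it on the p/s/c lattice from the introduction; the sizes of views p and s are not given in this copy, so the values 0.2M and 0.01M are assumptions for illustration only.

```python
# Sizes in millions; psc, pc, ps, sc, c are from the text, p and s assumed.
size = {"psc": 6.0, "pc": 4.0, "ps": 0.8, "sc": 2.0,
        "p": 0.2, "s": 0.01, "c": 0.1}
# For each view, the set of views derivable from it (its descendants + itself).
below = {
    "psc": {"psc", "pc", "ps", "sc", "p", "s", "c"},
    "pc": {"pc", "p", "c"}, "ps": {"ps", "p", "s"}, "sc": {"sc", "s", "c"},
    "p": {"p"}, "s": {"s"}, "c": {"c"},
}

materialized = {"psc"}            # the top view is always materialized

def cost(view):
    """Size of the cheapest materialized view that can answer `view`."""
    return min(size[m] for m in materialized if view in below[m])

def benefit(v):
    return sum(max(0.0, cost(w) - size[v]) for w in below[v])

picked = []
for _ in range(3):                # materialize 3 extra views greedily
    best = None
    for v in sorted(size):        # sorted + strict ">" breaks ties
        if v in materialized:     # lexicographically, as the note requires
            continue
        if best is None or benefit(v) > benefit(best):
            best = v
    materialized.add(best)
    picked.append(best)
print(picked)
```

The same loop, run for 4 iterations over the itemset lattice with the x values you computed, answers part (iii).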
(b) In Part (a), we did not consider any Apriori property to construct the graph. Now, we want to use the
Apriori property learnt in class to “reduce” the total size of the storage.
(i) We know that the Apriori property is in the form of “if <itemset 1> is frequent, then <itemset 2> is
frequent.” What is the relationship between <itemset 1> and <itemset 2>?
(ii) Consider this transactional dataset only. Due to this Apriori property, we do not need to store all
frequent itemsets for each query. We could store fewer frequent itemsets for each query. For example,
before we use the Apriori property, for a particular query, we store the set S of all frequent itemsets
for this query, and when we answer this query, we obtain set S and return it as the output. However,
after we use this Apriori property, for this query, we store a subset S’ of S for this query, and when
we answer this query, we obtain set S’, derive S’’ based on S’ and return this derived set S’’ as the
output (where S’’ is equal to S). In the above, x (with a subscript) denotes the total number of all
frequent itemsets for each query. Let y (with a subscript) denote the smallest possible total number of
all frequent itemsets “stored” for each query. Please give the value of each variable y (e.g., yABCD,
yABC and ynone).
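One natural choice for S' (an assumption here, not something the question states) is the set of maximal frequent itemsets: since the Apriori property guarantees every nonempty subset of a frequent itemset is frequent, S can be regenerated from its maximal members. A sketch on a hypothetical downward-closed set S:

```python
from itertools import combinations

def subsets(itemset):
    """All nonempty subsets of an itemset."""
    return {frozenset(c) for r in range(1, len(itemset) + 1)
            for c in combinations(itemset, r)}

# A hypothetical set S of all frequent itemsets (downward closed, as the
# Apriori property guarantees; not this question's actual answer set).
S = {frozenset(x) for x in
     [("A",), ("B",), ("D",), ("A", "B"), ("A", "D"), ("B", "D"),
      ("A", "B", "D"), ("C",)]}

# Store only S': the maximal frequent itemsets (no frequent proper superset).
S_prime = {s for s in S if not any(s < t for t in S)}

# Answering the query: derive S'' by expanding every stored maximal itemset.
S_double_prime = set().union(*(subsets(s) for s in S_prime))

print(sorted(tuple(sorted(s)) for s in S_prime))
assert S_double_prime == S    # nothing is lost by the compression
```

For the constrained queries in part (a), the derived set would additionally be filtered by the query's item constraint before being returned.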
Q10 (20 Marks)
Consider a classification problem where the target attribute contains two possible values, "Yes" and "No".
We are given a training dataset and generate a classifier based on this dataset. We find that there are exactly
30 tuples whose target attribute values are predicted as "Yes", and exactly 20 tuples whose target attribute
values are predicted as "No". We also know that the f-measure of this classifier is 16/22 and the accuracy of
this classifier is 0.70 (out of 1.0) (i.e., 70%).
(a) Is it a must that we can find the number of false positives? If yes, please write down the number of
false positives. Otherwise, please elaborate why we cannot find it.
(b) Is it a must that we can find the precision of this classifier? If yes, please write down the precision of
this classifier. Otherwise, please elaborate why we cannot find it.
(c) Is it a must that we can find the recall of this classifier? If yes, please write down the recall of this
classifier. Otherwise, please elaborate why we cannot find it.
(d) Is it a must that we can find the specificity of this classifier? If yes, please write down the specificity
of this classifier. Otherwise, please elaborate why we cannot find it.
(e) Is it a must that we can find the decile-wise lift chart of this classifier? If yes, please write down the
decile-wise lift chart of this classifier. Otherwise, please elaborate why we cannot find it.
End of Paper