
Data Mining: Assignment Week 8: Regression

(Each question carries 1 mark)

1. Regression is used in:

A. predictive data mining

B. exploratory data mining

C. descriptive data mining

D. explanative data mining

Ans: A

Explanation: Regression is used for prediction.

2. In the regression equation Y = 21 - 3X, the slope is

A. 21
B. -21
C. 3
D. -3

Ans: D
Explanation: The slope-intercept form of a line is y = mx + c, where m is the slope. Comparing with Y = 21 - 3X gives m = -3.

3. The output of a regression algorithm is usually a:

A. real variable

B. integer variable

C. character variable

D. string variable

Ans: A

Explanation: Regression predicts a real-valued (continuous) output.


4. Regression finds the model parameters that produce the least squared error between:

A. input value and output value

B. input value and target value

C. output value and target value

D. model parameters and output value

Ans: C

Explanation: Regression finds the model parameters that minimise the error between the output value (the model's prediction) and the target value.
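As a minimal Python sketch of this idea (the data below is hypothetical, not from the assignment), ordinary least squares chooses the parameters of y = a0 + a1x that minimise the squared error between the model's output values and the target values:

import numpy as np

# Hypothetical input values and target values, assumed for illustration.
x = np.array([1.0, 2.0, 3.0, 4.0])
t = np.array([1.1, 1.9, 3.2, 3.9])

# Design matrix: a column of ones for the intercept a0, then x for a1.
X = np.column_stack([np.ones_like(x), x])

# Least squares finds the parameters minimising sum((output - target)^2).
(a0, a1), *_ = np.linalg.lstsq(X, t, rcond=None)
print(f"a0 = {a0:.3f}, a1 = {a1:.3f}")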

5. The linear regression model y = a0 + a1x is applied to the data in the table
shown below. What is the value of the sum squared error function S(a0, a1),
when a0 = 1, a1 = 2?

x y
1 1
2 1
4 6
3 2

A. 0.0
B. 27
C. 13.5
D. 54

Ans: D
Explanation: y’ is the predicted output.

y’ = 1+2x

x y y’
1 1 3
2 1 5
4 6 9
3 2 7

sum of squared errors = (1-3)² + (1-5)² + (6-9)² + (2-7)² = 4 + 16 + 9 + 25 = 54
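The same computation as a short Python check (an assumed helper, not part of the original solution):

import numpy as np

x = np.array([1, 2, 4, 3])
y = np.array([1, 1, 6, 2])

y_pred = 1 + 2 * x                # y' = a0 + a1*x with a0 = 1, a1 = 2
sse = np.sum((y - y_pred) ** 2)   # sum of squared errors
print(sse)                        # 54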


6. Consider x1, x2 to be the independent variables and y the dependent
variable, which of the following represents a linear regression model?

A. y = a0 + a1/x1 + a2/x2

B. y = a0 + a1x1 + a2x2

C. y = a0 + a1x1 + a2x2²

D. y = a0 + a1x1² + a2x2

Ans: B

Explanation: In option B, y is a linear function of the independent variables x1 and x2. The other options apply nonlinear transformations (reciprocals or squares) to the independent variables.

7. Find all the eigenvalues of the following matrix A.

[Matrix A: a triangular matrix with main-diagonal entries 1, 2 and 3]

A. 1,3
B. 2,3
C. 1,2,3
D. Eigenvalues cannot be found.
Ans: C
Explanation: If A is an n × n triangular matrix (upper triangular, lower
triangular, or diagonal), then the eigenvalues of A are entries of the main
diagonal of A. Therefore, eigenvalues are 1,2,3.
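A quick numerical check in Python; since the original matrix is not reproduced, the off-diagonal entries below are placeholders, which for a triangular matrix do not affect the eigenvalues:

import numpy as np

# Lower-triangular matrix with main diagonal 1, 2, 3; the off-diagonal
# entries are made up and do not change the eigenvalues.
A = np.array([[1.0, 0.0, 0.0],
              [4.0, 2.0, 0.0],
              [5.0, 6.0, 3.0]])

print(np.linalg.eigvals(A))  # 1, 2 and 3 (possibly in a different order)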

8. In the figures below, the training instances for regression problems are described by dots. The blue dotted lines indicate the actual functions and the red lines indicate the regression models. Which of the following statements is correct?
A. Figure 1 represents overfitting and Figure 2 represents underfitting

B. Figure 1 represents underfitting and Figure 2 represents overfitting

C. Both Figure 1 and Figure 2 represent underfitting

D. Both Figure 1 and Figure 2 represent overfitting

Ans: B

Explanation: In Figure 1 the model is too simple to capture the actual function (underfitting); in Figure 2 the model follows the training instances so closely that it fits the noise as well (overfitting).
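A minimal Python sketch of the distinction, using assumed synthetic data rather than the figures from the question: a degree-1 polynomial is too simple for a sinusoidal function (underfitting), while a high-degree polynomial also fits the noise (overfitting).

import numpy as np

rng = np.random.default_rng(0)
x = np.linspace(0.0, 1.0, 20)
y = np.sin(2 * np.pi * x) + rng.normal(scale=0.2, size=x.size)  # noisy samples

under = np.polyfit(x, y, deg=1)  # too simple: misses the actual function
over = np.polyfit(x, y, deg=9)   # too flexible: fits the noise as well

for name, coeffs in (("degree 1", under), ("degree 9", over)):
    residuals = y - np.polyval(coeffs, x)
    print(name, "training SSE:", np.sum(residuals ** 2))

# The overfitted model has a much lower training error but generalises
# worse to new points drawn from the same function.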

9. In principal component analysis, the projected lower-dimensional space corresponds to:

A. subset of the original co-ordinate axis

B. eigenvectors of the data covariance matrix

C. eigenvectors of the data distance matrix

D. orthogonal vectors to the original co-ordinate axis

Ans: B

Explanation: We must first subtract the mean of each variable from the dataset to centre the data around the origin. Then we compute the covariance matrix of the data and calculate the eigenvalues and corresponding eigenvectors of this covariance matrix. Each of the orthogonal eigenvectors is then normalised to become a unit vector. Once this is done, each of the mutually orthogonal unit eigenvectors can be interpreted as an axis of the ellipsoid fitted to the data. This choice of basis transforms the covariance matrix into a diagonalised form, with the diagonal elements representing the variance along each axis.
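The recipe above as a minimal Python sketch, on assumed random data:

import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 3))           # 100 samples, 3 variables (toy data)

Xc = X - X.mean(axis=0)                 # centre the data around the origin
cov = np.cov(Xc, rowvar=False)          # data covariance matrix
eigvals, eigvecs = np.linalg.eigh(cov)  # eigh: the covariance matrix is symmetric

# Project onto the eigenvectors with the two largest eigenvalues.
order = np.argsort(eigvals)[::-1]
components = eigvecs[:, order[:2]]      # columns are orthonormal eigenvectors
X_proj = Xc @ components                # data in the lower-dimensional space
print(X_proj.shape)                     # (100, 2)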
10. A time series prediction problem is often best solved using:

A. Multivariate regression

B. Autoregression

C. Logistic regression

D. Sinusoidal regression

Ans: B

Explanation: Autoregression is a time series model that uses observations from previous time steps as input to a regression equation to predict the value at the next time step.
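A minimal Python sketch of this idea on an assumed synthetic series: fit y[t] = a0 + a1*y[t-1] by least squares, then use the regression equation for a one-step-ahead prediction.

import numpy as np

rng = np.random.default_rng(0)

# Synthetic AR(1) series: each value depends on the previous one plus noise.
y = np.zeros(200)
for t in range(1, y.size):
    y[t] = 0.8 * y[t - 1] + rng.normal(scale=0.1)

# Regression equation y[t] = a0 + a1*y[t-1], fitted by least squares.
X = np.column_stack([np.ones(y.size - 1), y[:-1]])
(a0, a1), *_ = np.linalg.lstsq(X, y[1:], rcond=None)

print(f"a0 = {a0:.3f}, a1 = {a1:.3f}")  # a1 should be close to 0.8
next_value = a0 + a1 * y[-1]            # predicted value at the next time step
print(f"next value: {next_value:.3f}")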
