Data Mining

The document consists of a series of true/false questions related to concepts in supervised learning, regression analysis, data visualization, and data mining. It covers topics such as the importance of target variables, the curse of dimensionality, principal component analysis, and the characteristics of big data. Additionally, it addresses misconceptions in statistical modeling and provides insights into the properties of various analytical techniques.

Uploaded by

Rani Raut


Question 1 - Supervised learning MUST have a target/output variable - True
Question 2 - The curse of dimensionality is the affliction caused by adding variables to multivariate data models - True
Question 3 - A matrix plot is an example of a multi-dimensional plot - True
Question 4 - In logistic regression models, errors in functional form will create bias - True
Question 5 - Principal component analysis is a dimensionality reduction technique that retains much of the variation present in the data set - True
Question 6 - Data visualization is used for prediction and not exploration - False
Question 7 - A time series plot can be visually inspected to determine seasonality in the data - True
Question 8 - A negative covariance between variables X and Y means they tend to move in opposite directions - True
Question 9 - In a linear regression model of the form Y = B0 + B1X, the estimate of B1 is biased if it is different from the true parameter B1 - False
Question 10 - Data mining is the confluence of the fields of statistics and machine learning - True
Question 11 - A histogram is an example of a basic plot - False
Question 12 - In predictive modeling, the p-value of a coefficient is the most important measure - False
Question 13 - The odds and the log odds in the context of logistic regression mean the same thing - False
Question 14 - A linear probability model is nothing but a linear regression with the fitted values restricted between 0 and 1 - False
Question 15 - Linear regression cannot be used when the outcome variable is categorical - True
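
Question 9 above turns on the difference between a single estimate and the estimator's expected value. The following is a minimal simulation sketch, assuming NumPy is available; the true parameters, sample size, and the helper name `simulate_slope_estimates` are illustrative choices, not part of the original quiz. Each individual estimate of B1 differs from the true value, yet their average sits very close to it, which is what unbiasedness refers to.

```python
import numpy as np

def simulate_slope_estimates(true_b0=1.0, true_b1=2.0, n=50, trials=2000, seed=0):
    """Fit Y = B0 + B1*X by least squares on many simulated samples.

    All parameter values here are hypothetical, chosen only for illustration.
    """
    rng = np.random.default_rng(seed)
    estimates = np.empty(trials)
    for t in range(trials):
        x = rng.uniform(0.0, 10.0, size=n)
        u = rng.normal(0.0, 1.0, size=n)          # error term, independent of x
        y = true_b0 + true_b1 * x + u
        slope, intercept = np.polyfit(x, y, deg=1)  # highest degree first
        estimates[t] = slope
    return estimates

estimates = simulate_slope_estimates()
print("first three estimates of B1:", estimates[:3])
print("mean of all estimates:", estimates.mean())
```

No single estimate equals the true slope exactly, but that alone does not make the estimator biased.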

16. If the data is homoscedastic, the parameters in the linear regression cannot be trusted?
A. False
17. In any given dataset, the covariance of any two variables can never be zero?
A. False
18. An odds of -0.5 means that the probabilities of winning and losing are equal?
A. False
19. In a standard linear regression equation, the coefficient b1 represents the intercept of the regression line on the axis of the outcome variable Y?
A. False
20. R-squared is a goodness-of-fit measure that adjusts the results based on the number of predictors?
A. False
21. Let X and Y be two variables in a dataset. You have calculated the two corresponding principal components, Z1 and Z2. Which of the following is ALWAYS true?
A. Cov(Z1, Z2) = 0
22. In the logistic regression equation ln(p/(1-p)) = B0 + B1X, let B1 be equal to 1. Then a one-unit change in X results in:
A. Increasing the odds by a factor of 2.718 (e)
23. The error term in a linear regression equation:
A. Contains all factors affecting the outcome variable Y……
24. Which of the following is a characteristic of BIG DATA?
A. All of the above
25. The first principal component in the PCA algorithm has:
A. Highest variance
26. Which of the following is not a step in explanatory modelling?
A. Focus in on Y-hat
27. Which of the following is not a step in the principal component analysis methodology?
A. Square the covariance matrix
28. Which of the following is not a technique to deal with missing values?
A. Use linear regression to predict values
29. In the OLS solution in class, we rely on the following assumption:
A. Two of the other choices
30. Which of the following is not an iterative search algorithm for determining predictors?
A. Maximum likelihood estimation
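
Questions 21, 25 and 27 above concern properties of principal components. The sketch below is one common way to compute them (eigen-decomposition of the covariance matrix), assuming NumPy; the simulated data and the helper name `principal_components` are illustrative assumptions. It checks that the two component scores are uncorrelated and that the first component carries the largest variance.

```python
import numpy as np

def principal_components(data):
    """Return (scores, eigenvalues), components ordered by decreasing variance."""
    centered = data - data.mean(axis=0)             # center each variable
    cov = np.cov(centered, rowvar=False)            # covariance matrix
    eigenvalues, eigenvectors = np.linalg.eigh(cov) # eigh returns ascending order
    order = np.argsort(eigenvalues)[::-1]           # reorder to descending
    eigenvalues = eigenvalues[order]
    eigenvectors = eigenvectors[:, order]
    scores = centered @ eigenvectors                # project data onto components
    return scores, eigenvalues

# Hypothetical correlated variables X and Y
rng = np.random.default_rng(1)
x = rng.normal(size=500)
y = 0.8 * x + 0.3 * rng.normal(size=500)

scores, eigenvalues = principal_components(np.column_stack([x, y]))
z1, z2 = scores[:, 0], scores[:, 1]
print("Cov(Z1, Z2):", np.cov(z1, z2)[0, 1])
print("Var(Z1), Var(Z2):", z1.var(ddof=1), z2.var(ddof=1))
```

Note that the covariance matrix is decomposed, not squared, which is why "square the covariance matrix" is not one of the steps.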

31. In the algebraic solution to OLS, which of the following are true?
A. Two of the other choices
32. In a linear probability model, the predicted probability:
A. Can be below 0 and above 1
33. The mean of a data sample is a measure of?
A. Central tendency
34. Heat maps are used to visualize ______________ and _____________?
A. Correlation and missing data
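
To illustrate question 32 above, here is a small sketch, assuming NumPy, that fits an ordinary straight line to a binary outcome. The ten data points are made up for illustration. Because nothing constrains the fitted line, its predictions fall below 0 at one end and above 1 at the other.

```python
import numpy as np

# Hypothetical binary outcome y observed at predictor values x
x = np.array([0.0, 1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0, 9.0])
y = np.array([0, 0, 0, 0, 1, 0, 1, 1, 1, 1], dtype=float)

# Linear probability model: plain least squares on the 0/1 outcome
slope, intercept = np.polyfit(x, y, deg=1)
fitted = intercept + slope * x

print("smallest fitted value:", fitted.min())
print("largest fitted value:", fitted.max())
```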

35. Consider a regression that gives a coefficient with a p-value of 0.555. Which of the following statements is true? (Use a statistical significance threshold of < 0.001.)
Ans: There is not enough evidence to reject the null hypothesis

36. Which of the following is not a property of eigenvectors?
Ans: For an M x N matrix there are M eigenvectors

37. An odds of 0.5 means the probability of winning is
Ans: Lower than losing
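
The arithmetic behind question 37 (and questions 13 and 18) can be sketched in a few lines of Python. The helper name `probability_from_odds` is an illustrative assumption. Odds are defined as p/(1-p), so p = odds/(1+odds); odds cannot be negative, and equal chances of winning and losing correspond to odds of 1, not 0.5.

```python
import math

def probability_from_odds(odds):
    """Convert odds = p / (1 - p) back to the probability p."""
    if odds < 0:
        raise ValueError("odds cannot be negative")
    return odds / (1.0 + odds)

p_win = probability_from_odds(0.5)   # probability of winning at odds 0.5
p_lose = 1.0 - p_win
log_odds = math.log(0.5)             # log odds is a different quantity from odds

print("P(win):", p_win, "P(lose):", p_lose)
print("odds: 0.5, log odds:", log_odds)
print("P(win) at odds 1:", probability_from_odds(1.0))
```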

38. Data visualization supports?
Ans: All of the above

39. Which of the following is a reason NOT to select predictors from the full data set?
Ans: Parsimony is important

40. Jittering in scatter plots is the
Ans: Adding of noise to unstack markers that hide data points underneath
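
As a rough sketch of the jittering described in question 40, assuming NumPy: a small amount of random noise is added to each coordinate so that markers sharing the same value no longer sit exactly on top of each other. The function name `jitter`, the noise scale, and the sample ratings are illustrative assumptions.

```python
import numpy as np

def jitter(values, scale=0.1, seed=0):
    """Add uniform noise in [-scale, scale] to each value before plotting."""
    rng = np.random.default_rng(seed)
    values = np.asarray(values, dtype=float)
    return values + rng.uniform(-scale, scale, size=values.shape)

ratings = [1, 1, 1, 2, 2, 3]          # repeated values would overlap in a plot
jittered = jitter(ratings)
print(jittered)
```

The jittered values are meant for display only; any analysis would still use the original data.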

41. Which of the following is not part of the 4-step data mining cycle?
Ans: None of the above

42. Which of the following is not a property of predictive modeling?
Ans: Performance is measured by how well the model approximates the training data set

43. Which of the following is not a step in the ordinary least squares algorithm?
Ans: Find the eigenvectors

44. Which of the following is not a data reduction technique?
Ans: Classification trees

45. Which of the following is not a step in data mining?
Ans: Overfit data

46. The diagonal of a 2-dimensional covariance matrix represents the following
Ans: The variance of each variable

47. Cov(X, Y) is always equal to
Ans: Cov(Y, X)
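
Questions 46 and 47 can be checked numerically. The sketch below assumes NumPy and uses simulated data in which Y tends to fall as X rises (the setup in question 8); the data-generating choices are illustrative only. `np.cov` returns the 2 x 2 covariance matrix, whose diagonal holds the variances and whose off-diagonal entries are equal.

```python
import numpy as np

# Hypothetical variables with a negative relationship
rng = np.random.default_rng(2)
x = rng.normal(size=200)
y = -0.5 * x + rng.normal(size=200)

cov = np.cov(x, y)   # rows/columns ordered as (X, Y)
print(cov)
print("Var(X):", x.var(ddof=1), "Var(Y):", y.var(ddof=1))
```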

48. Multidimensional visualization is the
Ans: Adding of colour, size and multiple panels to convey richer information

49. In a linear regression of the form Y = B0 + B1*X + U, homoskedasticity means the following
Ans: Var(U | X) is constant, i.e. the error variance does not depend on X

50. In a logistic regression equation of the form ln[P/(1-P)] = B0 + B1X,
Ans: p are unrelated
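
For the logistic form ln[P/(1-P)] = B0 + B1X used in questions 22 and 50, the effect of a one-unit change in X on the odds can be worked out directly. This is a minimal sketch in plain Python; the function name `odds_at` and the values of B0 and X are arbitrary illustrative choices. Exponentiating both sides gives odds = exp(B0 + B1X), so raising X by one multiplies the odds by exp(B1).

```python
import math

def odds_at(b0, b1, x):
    """Odds implied by ln(odds) = b0 + b1 * x."""
    return math.exp(b0 + b1 * x)

b0, x = -2.0, 3.0   # hypothetical intercept and starting value of X

# B1 = 1: odds are multiplied by e for each one-unit increase in X
factor_b1_one = odds_at(b0, 1.0, x + 1) / odds_at(b0, 1.0, x)

# B1 = 0: odds do not change with X at all
factor_b1_zero = odds_at(b0, 0.0, x + 1) / odds_at(b0, 0.0, x)

print("odds factor when B1 = 1:", factor_b1_one)
print("odds factor when B1 = 0:", factor_b1_zero)
```

The factor does not depend on the chosen B0 or on the starting X, only on B1.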
