0% found this document useful (0 votes)
23 views1 page

Data Mining

The document discusses different statistical techniques, data mining techniques and machine learning concepts including regression, supervised learning, decision trees, clustering, data pre-processing, predictive modelling and association rule mining.

Uploaded by

Lynch George
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as TXT, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
23 views1 page

Data Mining

The document discusses different statistical techniques, data mining techniques and machine learning concepts including regression, supervised learning, decision trees, clustering, data pre-processing, predictive modelling and association rule mining.

Uploaded by

Lynch George
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as TXT, PDF, TXT or read online on Scribd
You are on page 1/ 1

Statistical technique used for investigating and modelling the relationship between

two or more variables is: ~ Regression


What is the type of learning where a function is inferred to describe hidden
structure from unlabeled data ~ Supervised
Probability of theft in an area is 0.03 with expected loss of 20% or 30% of things
with probabilities 0.55 and 0.45. Insurance policy from A costs $150 pa with 100%
repayment. Policy with B, costs $100 pa and first $500 of any loss has to be paid
by the owner. Which data mining technique can be used to choose the policy? ~
Decision Tree
If time is used as an independent variable in a simple linear regression analysis,
which of the following assumptions could be violated? ~ Successive
Which statistical technique deals with finding a structure in a collection of
unlabeled data? ~ Clusturing
Which of the following activities is performed as part of data pre processing?
~ All Detect Missing Values
Noisy values are the values that are valid for the dataset, but are incorrectly
recorded ~ TRUE
Which of the following modelling type should be used for Labelled data? ~
Predictive
What is the other name for Data Preparation stage of Knowledge Discovery Process
~ ETL
Which of the following role is responsible for performing validation on analysis
datasets ~ Statistacian
The process of extracting valid, useful, unknown info from data and using it to
make proactive knowledge driven business is called ~ Data Mining
Which of the following is not applicable to Data Mining ~ Ivolves
working with known information
~
Associate rule is known as� ~ Affinity
Which data mining method groups together objects that are similar to each other and
dissimilar to the other objects? ~ Clustering
Which of the following are Multi-class Classification problem ~ Movie
_________ are the values that mark the boundaries of the confidence interval.
~ Confidence Limits
Regression is typically carried out to develop a mathematical model of the process
~ TRUE
Machine learning task of inferring a function from labelled training data is known
as ~ Supervised
Simulations are carried out to develop a mathematical model of the process ~
FALSE

You might also like