0% found this document useful (1 vote)
1K views2 pages

Data Mining-All Correct

Data integration aids in unifying data from different sources. The dependent variable in the apartment example is price, as the number of bedrooms, bathrooms, and floor determine price. The learning step of classification contributes to constructing the learning model.

Uploaded by

Amit Kumar
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as TXT, PDF, TXT or read online on Scribd
0% found this document useful (1 vote)
1K views2 pages

Data Mining-All Correct

Data integration aids in unifying data from different sources. The dependent variable in the apartment example is price, as the number of bedrooms, bathrooms, and floor determine price. The learning step of classification contributes to constructing the learning model.

Uploaded by

Amit Kumar
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as TXT, PDF, TXT or read online on Scribd
You are on page 1/ 2

Which process of KDD aids in unifying data from different sources?

-> Data
Integration
Consider an example of an apartment: The number of bedrooms, bathrooms, and the
floor of an apartment determines its price. Which is the dependent variable in this
example?-> Price
__________ step of classification contributes to the construction of learning
model.->Learning Step

Consider an example of an apartment: The number of bedrooms, bathrooms, and the


floor of an apartment determines its price.

Which is/are the predictor variable(s) in this example?-. All the options(Number of
bedrooms,Floor in which the apartment is located & Number of bathrooms)
Response variable is a __________.->Dependent Variable
Explanatory variable is a __________.-> Predictor Variable
__________ is the problem of identifying a category to which a new observation
belongs to, based on a training set of data containing observations whose
categories are already known.-> Classification
__________ term portrays the process of discovering small pieces from a large
volume of raw material.-> Mining

Collective outlier significantly deviates from the entire dataset.-> False


Derived relationships in Association Rule Mining are represented in the form of
__________.-> Rules
Jacard Index distance measure is used on __________.-> Non-numeric dataset
Clustering process works on _________ measure.-> Distance
__________ step of KDD process helps in identifying valuable patterns.-> Pattern
Evaluation
Identify the Unsupervised Learning method.-> Clustering

Which of the following association measure helps in identifying how frequently the
item appears in a dataset?-> Support
__________association measure compares the confidence with the expected
confidence.-> Lift
Which of the following helps in measuring the central tendency of the dataset?->
All the options(Mode,Mean&Median)
Distance measure(s) used in clustering process of Numeric Dataset is/are
__________.-> All the options(Hamming,Minkowski&Manhattan Distance)
__________ aids in identifying associations, correlations, and frequent patterns in
data.-> Association Rule Mining
Which among the following is/are (an) Ensemble Classifier?-> All the options(Random
Forest,Boosting&Bagging)
__________ parameter of regression helps in identifying the direction of
relationship between variables.-> Regression Coefficient

Classification is a __________ task.-> Data Analysis


__________ stage of data science process helps in converting raw data into a
machine-readable format.-> Data Cleaning
__________ stage of data science process helps in exploring and determining the
patterns from the data.-> Exploratory Data Analysis
Identify the algorithm that works based on the concept of clustering.-> K-Means
Identify the algorithm that works based on the concept of classification.-> None of
the options(Decision Tree,Random Forest & SVM)

Descriptive statistics is used in __________ datasets.->Sample


__________statistics provides inferences on population.-> Inferential
__________statistics provides the summary statistics of the data.-> Descriptive
Inferential statistics is used in __________ datasets.->Population

Identify the application of the Outlier Detection Method.-> Intrusion Detection


Regression can be used in predicting/forecasting Applications.-> True
Classification predicts the value of __________ variable.-> Categorical
__________ outlier significantly deviates based on the context selected.->
Contextual Outlier
Which of the following helps in measuring the dispersion range of the data?-> All
the options(Standard Deviation,Range,Variance&interquartile range)
Which among the following is/are (an) outlier detection method(s)?-> All the
options(Proximity-based approach,Clustering-based approach,Statistical
approach&Classification approach)
The science of collecting, interpreting, and analyzing data is known as
__________.-> Statistics
In Association Rules, the Antecedent and Consequent form a disjoint set.-> True

You might also like