Practice

The document is a question bank for a data mining and warehousing course, covering various topics such as data mining tasks, the relationship between data warehousing and data mining, and the differences between exploratory and predictive data mining. It also discusses the application areas of data mining, key issues, benefits, limitations, and the importance of human intervention in the data mining process. Additionally, it includes specific questions related to practical applications in banking and healthcare.

Uploaded by

822c2z899d
DATA MINING & WAREHOUSING QUESTION BANK


Semester: 8th
Q.1 Explain different data mining tasks.
Predictive Tasks
• Predictive modelling is similar to the human learning experience.
• The model is developed using a supervised learning approach, which has two phases:
1. Training builds a model from a large sample of historical data, called a training set, in which the class labels are known.
2. Testing applies the model to data it has not seen before to determine its accuracy and performance characteristics.
Descriptive Tasks
• Find human-interpretable patterns that describe the data.
• Use an unsupervised learning approach:
- The class labels of the training data are unknown.
- The goal is to establish the existence of relationships, classes, or clusters in the data.
Q.2 What is the relation between data warehousing and data
mining?
Data warehousing involves storing and organizing large volumes of
structured data for analysis, while data mining extracts patterns,
insights, and useful information from that stored data. Data
warehousing provides the foundation for data mining.
Q.3 Explain the differences between “Explorative Data Mining”
and “Predictive Data Mining” and give one example of each.
- Explorative Data Mining focuses on discovering patterns, relationships, or insights in data without predefined hypotheses.
Example: Identifying customer segments based on purchasing behavior.

- Predictive Data Mining aims to forecast future outcomes using historical data and models.
Example: Predicting customer churn based on past usage patterns.

Q.4 What are the application areas of data mining?

- Healthcare
- Finance
- Retail
- Marketing
- Education
- Manufacturing
- Telecommunications
- E-commerce
- Social Media
- Energy

Q.5 Explain the differences between knowledge discovery and data mining.
- Knowledge discovery is the overall process of extracting useful information from data, including steps such as data preprocessing, data mining, and interpretation.

- Data mining is one step within knowledge discovery that focuses specifically on applying algorithms to identify patterns or insights in data.
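
The relationship between the two can be sketched as a small pipeline in which data mining is only the middle function. This is an illustrative sketch under assumptions: the shopping-basket data, the pair-counting "algorithm", the 0.5 support threshold, and the function names `preprocess`, `mine`, and `interpret` are all hypothetical.

```python
from collections import Counter

def preprocess(raw_baskets):
    """Preprocessing: normalise item names and drop empty records."""
    return [
        sorted({item.strip().lower() for item in basket if item.strip()})
        for basket in raw_baskets
        if basket
    ]

def mine(baskets):
    """Data mining step: count how often each pair of items occurs together."""
    pairs = Counter()
    for basket in baskets:
        for i, first in enumerate(basket):
            for second in basket[i + 1:]:
                pairs[(first, second)] += 1
    return pairs

def interpret(pairs, n_baskets, min_support=0.5):
    """Interpretation: keep only pairs frequent enough to be worth acting on."""
    return {
        pair: count / n_baskets
        for pair, count in pairs.items()
        if count / n_baskets >= min_support
    }

raw = [["Bread", "milk "], ["bread", "Milk", "eggs"], [],
       ["eggs", "bread"], ["milk", "Bread"]]

baskets = preprocess(raw)
patterns = interpret(mine(baskets), len(baskets))
print(patterns)
```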

Q.6 How is a data warehouse different from a database? How are they similar?
- Differences: A database is designed for day-to-day operations and stores current transactional data. A data warehouse is designed for analysis and stores large volumes of historical data, optimized for querying and reporting.

- Similarities: Both store and manage data, both can be queried with similar technologies such as SQL, and both must ensure data integrity and security.
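
The difference in workload can be illustrated with Python's built-in `sqlite3` module. This is only a sketch: a real data warehouse would be a separate system, and the table names, columns, and figures below are invented for the example. The first part shows the small, current-state writes typical of an operational database; the second shows the aggregate query over historical data typical of a warehouse.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
cur = conn.cursor()

# Operational (database-style) workload: small writes to current records.
cur.execute("CREATE TABLE accounts (id INTEGER PRIMARY KEY, balance REAL)")
cur.execute("INSERT INTO accounts VALUES (1, 500.0)")
cur.execute("UPDATE accounts SET balance = balance - 120.0 WHERE id = 1")
current_balance = cur.execute(
    "SELECT balance FROM accounts WHERE id = 1"
).fetchone()[0]

# Analytical (warehouse-style) workload: aggregate over historical facts.
cur.execute("CREATE TABLE sales_history (year INTEGER, region TEXT, amount REAL)")
cur.executemany(
    "INSERT INTO sales_history VALUES (?, ?, ?)",
    [(2022, "North", 100.0), (2022, "South", 150.0),
     (2023, "North", 200.0), (2023, "South", 250.0)],
)
yearly_totals = cur.execute(
    "SELECT year, SUM(amount) FROM sales_history GROUP BY year ORDER BY year"
).fetchall()

print(current_balance)
print(yearly_totals)
conn.close()
```

Both parts use the same SQL interface, which reflects the similarity noted above.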

Q.7 What types of benefit might you hope to get from data mining?
- Discovering patterns, improving decision-making, enhancing customer insights, predicting trends, detecting fraud, and optimizing processes.

Q.8 What are the key issues in data mining?
- Data quality, privacy concerns, scalability, algorithm selection, interpretability, and handling complex data.

Q.9 How can data mining help a business analyst?
- It helps identify trends, predict customer behavior, improve marketing strategies, optimize operations, and support data-driven decisions.

Q.10 What are the limitations of data mining?
- It requires high-quality data, can be computationally expensive, may raise privacy issues, and produces results that need expert interpretation.

Q.11 Discuss the need for human intervention in the data mining process.
- Humans are needed to interpret results, select relevant data, choose appropriate algorithms, and ensure ethical use of insights.

Q.12 As a bank manager, how would you decide whether or not to give a loan to an applicant?
- Assess the applicant's credit history, income, debt-to-income ratio, and past loan performance using data analytics to determine the risk.

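
The factors listed above can be combined into a simple rule-based screen. This is a hedged sketch, not a real lending policy: the `Applicant` fields, the minimum score of 650, and the 40% debt-to-income limit are assumptions chosen only for illustration.

```python
from dataclasses import dataclass

@dataclass
class Applicant:
    credit_score: int       # hypothetical score scale
    monthly_income: float
    monthly_debt: float     # existing monthly debt payments
    past_defaults: int      # number of earlier loans that defaulted

def debt_to_income(applicant):
    """Share of monthly income already committed to debt payments."""
    if applicant.monthly_income <= 0:
        return float("inf")
    return applicant.monthly_debt / applicant.monthly_income

def assess(applicant, min_score=650, max_dti=0.40):
    """Return a decision plus the risk factors that triggered it."""
    reasons = []
    if applicant.credit_score < min_score:
        reasons.append("low credit score")
    if debt_to_income(applicant) > max_dti:
        reasons.append("high debt-to-income ratio")
    if applicant.past_defaults > 0:
        reasons.append("past loan default")
    decision = "approve" if not reasons else "refer for review"
    return decision, reasons

good = Applicant(credit_score=720, monthly_income=5000.0,
                 monthly_debt=1500.0, past_defaults=0)
risky = Applicant(credit_score=600, monthly_income=4000.0,
                  monthly_debt=2000.0, past_defaults=1)

print(assess(good))
print(assess(risky))
```

In practice the thresholds would come from a model trained on past loan performance rather than being fixed by hand.
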
Q.13 What steps would you follow to identify fraud for a credit card company?
- Analyze transaction patterns, detect unusual spending behaviors, and use anomaly detection algorithms to flag potential fraud.
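
One simple form of anomaly detection is a z-score check against a cardholder's own spending history. The sketch below assumes made-up transaction amounts and a threshold of three standard deviations; the function name `flag_unusual` is hypothetical, and production systems use far richer features than the amount alone.

```python
from statistics import mean, stdev

def flag_unusual(history, new_amounts, threshold=3.0):
    """Flag new amounts that lie far from the customer's usual spending."""
    mu = mean(history)
    sigma = stdev(history)
    if sigma == 0:
        # Every past amount was identical, so any different amount is unusual.
        return [amount for amount in new_amounts if amount != mu]
    return [
        amount for amount in new_amounts
        if abs(amount - mu) / sigma > threshold
    ]

# Hypothetical past transactions for one cardholder.
history = [20.0, 25.0, 30.0, 22.0, 28.0, 24.0, 26.0, 25.0]
new_amounts = [27.0, 950.0, 23.0]

flagged = flag_unusual(history, new_amounts)
print(flagged)
```

Flagged transactions are candidates for review, not confirmed fraud.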

Q.14 What is data mining?
- Data mining refers to extracting or "mining" knowledge from large amounts of data. The term is actually a misnomer: what is mined is knowledge, not data. It would therefore have been more appropriately named "knowledge mining", with the emphasis on mining from large amounts of data.

Q.15 State three different applications for which data mining techniques seem appropriate. Informally explain each application.
- Healthcare: Predict patient diseases based on historical data.
- Retail: Recommend products based on customer preferences.
- Banking: Detect fraudulent transactions using transaction analysis.

Q.16 Explain briefly the differences between “classification” and “clustering” and give an informal example of an application that would benefit from each technique.
- Classification: Categorizes data into predefined groups (e.g., spam vs. non-spam emails).
- Clustering: Groups similar data points together without predefined categories (e.g., customer segments based on purchasing behavior).
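
The contrast can be shown side by side. In this sketch the classifier is a one-nearest-neighbour lookup over labelled examples and the clustering is plain k-means on one-dimensional data; both are stand-ins chosen for brevity, and the data (link counts per email, customer spend) is invented.

```python
from statistics import mean

def classify_1nn(labeled, value):
    """Classification: copy the label of the closest labelled example."""
    _, label = min(labeled, key=lambda pair: abs(pair[0] - value))
    return label

def kmeans_1d(values, centroids, iterations=10):
    """Clustering: groups emerge from the data; no labels are supplied."""
    clusters = []
    for _ in range(iterations):
        clusters = [[] for _ in centroids]
        for v in values:
            nearest = min(range(len(centroids)),
                          key=lambda i: abs(v - centroids[i]))
            clusters[nearest].append(v)
        # Keep the old centroid if a cluster ends up empty.
        centroids = [mean(c) if c else centroids[i]
                     for i, c in enumerate(clusters)]
    return centroids, clusters

# Labelled data: (number of links in an email, predefined category).
emails = [(1, "non-spam"), (2, "non-spam"), (40, "spam"), (55, "spam")]
label = classify_1nn(emails, 45)

# Unlabelled data: customer spend, with two arbitrary starting centroids.
spend = [12, 15, 14, 90, 95, 88]
centroids, clusters = kmeans_1d(spend, centroids=[0, 100])

print(label)
print(clusters)
```

The classifier needs the categories up front, while the clustering step discovers the two spending groups on its own.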

Q.17 What do you mean by data processing?
- Data processing is the transformation of raw data into meaningful information through methods such as cleaning, analyzing, and summarizing.

Q.18 Explain data cleaning.
- Data cleaning involves removing inaccuracies, correcting errors, and handling missing or inconsistent data to improve dataset quality.

Q.19 Describe different data cleaning approaches.

- Removing duplicates
- Handling missing values
- Correcting errors
- Standardizing data formats
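
The four approaches can be applied together in one pass over a small record set. This is a sketch under assumptions: the records, the two accepted date formats, and the valid age range of 0 to 120 are invented for the example, and `clean` and `standardize_date` are hypothetical helper names.

```python
from datetime import datetime

raw = [
    {"name": " Alice ", "city": "new york", "joined": "2023-01-15", "age": 34},
    {"name": "Alice", "city": "New York", "joined": "15/01/2023", "age": 34},
    {"name": "Bob", "city": "Boston", "joined": "2022-07-01", "age": -5},
    {"name": "Cara", "city": "chicago", "joined": "2021-03-09", "age": None},
]

def standardize_date(text):
    """Standardizing formats: accept two input formats, emit ISO dates."""
    for fmt in ("%Y-%m-%d", "%d/%m/%Y"):
        try:
            return datetime.strptime(text, fmt).date().isoformat()
        except ValueError:
            continue
    return None

def clean(records):
    cleaned, seen = [], set()
    for r in records:
        age = r["age"]
        row = {
            "name": r["name"].strip(),
            "city": r["city"].strip().title(),
            "joined": standardize_date(r["joined"]),
            # Correcting errors: an impossible age is treated as missing.
            "age": age if age is not None and 0 <= age <= 120 else None,
        }
        # Removing duplicates: compare records after standardization.
        key = (row["name"], row["city"], row["joined"])
        if key in seen:
            continue
        seen.add(key)
        cleaned.append(row)
    return cleaned

cleaned = clean(raw)
for row in cleaned:
    print(row)
```

Missing ages are left as `None` here so that a separate imputation step can decide how to fill them.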

Q.20 How can we handle missing values?
- Replace missing values with the mean or median, use interpolation, or exclude incomplete records, depending on the context.
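
The options named above can be compared on one small series. This is a minimal sketch using only the standard library; the readings are made up, missing values are represented as `None`, and the helper names are hypothetical.

```python
from statistics import mean, median

def fill_with(values, statistic):
    """Replace each missing value with a statistic of the known values."""
    known = [v for v in values if v is not None]
    fill = statistic(known)
    return [fill if v is None else v for v in values]

def interpolate(values):
    """Linear interpolation for gaps with known neighbours on both sides.

    Gaps at the very start or end of the series are left as None.
    """
    result = list(values)
    known = [i for i, v in enumerate(values) if v is not None]
    for left, right in zip(known, known[1:]):
        step = (values[right] - values[left]) / (right - left)
        for i in range(left + 1, right):
            result[i] = values[left] + step * (i - left)
    return result

def drop_missing(values):
    """Exclude incomplete entries entirely."""
    return [v for v in values if v is not None]

readings = [10.0, None, 14.0, 16.0, None, 40.0]

print(fill_with(readings, mean))
print(fill_with(readings, median))
print(interpolate(readings))
print(drop_missing(readings))
```

Mean and median give different fills when the data is skewed, and interpolation only makes sense when the order of the values carries meaning, as in a time series.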
