The document is a question bank for a data mining and warehousing course, covering various topics such as data mining tasks, the relationship between data warehousing and data mining, and the differences between exploratory and predictive data mining. It also discusses the application areas of data mining, key issues, benefits, limitations, and the importance of human intervention in the data mining process. Additionally, it includes specific questions related to practical applications in banking and healthcare.
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0 ratings0% found this document useful (0 votes)
14 views5 pages
Practice
The document is a question bank for a data mining and warehousing course, covering various topics such as data mining tasks, the relationship between data warehousing and data mining, and the differences between exploratory and predictive data mining. It also discusses the application areas of data mining, key issues, benefits, limitations, and the importance of human intervention in the data mining process. Additionally, it includes specific questions related to practical applications in banking and healthcare.
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 5
Practice
DATA MINING & WAREHOUSING QUESTION BANK
Semester :8th Q.1 Explain different data mining tasks. Predictive Tasks • Predictive modelling is Similar to the human learning experience. • Model is developed using a Supervised learning approach, which has two phases: 1. Training builds a model using a large sample of historical data called a training set where the class labels are known. 2. Testing determine its accuracy and physical performance characteristics. Descriptive Tasks Find human-interpretable patterns that describe the data. • Use Unsupervised learning approach : - The class labels of training data is unknown. - establishing the existence of relationships, classes or clusters in the data . Q.2 What is the relation between data warehousing and data mining? Data warehousing involves storing and organizing large volumes of structured data for analysis, while data mining extracts patterns, insights, and useful information from that stored data. Data warehousing provides the foundation for data mining. from that stored data. Data warehousing provides the foundation for data mining. Q.3 Explain the differences between “Explorative Data Mining” and “Predictive Data Mining” and give one example of each. - Explorative Data Mining focuses on discovering patterns, relationships, or insights in data without predefined hypotheses. Example: Identifying customer segments based on purchasing behavior.
Predictive Data Mining aims to forecast future outcomes using
historical data and models. Example: Predicting customer churn based on past usage patterns.
Q.4 What are the application areas of data Mining?
- Healthcare Finance Retail Marketing Education Manufacturing Telecommunications E-commerce Social Media Energy
Q.5 Explain the differences between Knowledge discovery and
data mining. - -Knowledge Discovery is the overall process of extracting useful information from data, including steps like data preprocessing, data mining, and interpretation.
-Data Mining is a step within knowledge discovery that focuses
specifically on applying algorithms to identify patterns or insights in data
Q.6 How is data warehouse different from a database? How are
they similar? -Differences: A database is designed for day-to-day operations, storing current transactional data. A data warehouse is designed for analysis, storing large volumes of historical data optimized for querying and reporting.
-Similarities: Both store and manage data. Both use similar
technologies like SQL for querying. Both ensure data integrity and security.
Q.7 What type of benefit you might hope to get from data mining? Discover patterns, improve decision-making, enhance customer insights, predict trends, detect fraud, and optimize processes.
-It helps identify trends, predict customer behavior, improve marketing strategies, optimize operations, and support data-driven decisions.
Q.10 What are the limitations of data Mining?
-Requires high-quality data, can be computationally expensive, may lead to privacy issues, and results need expert interpretation.
Q.11 Discuss the need of human intervention in data mining
process. -Humans are needed to interpret results, select relevant data, choose appropriate algorithms, and ensure ethical use of insights.
Q.12As a bank manager, how would you decide whether to give
loan to an applicant or not? . -Assess applicant’s credit history, income, debt-to-income ratio, and past loan performance using data analytics to determine risk Q.13 What steps you would follow to identify a fraud for a credit card company. -Analyze transaction patterns, detect unusual spending behaviors, and use anomaly detection algorithms to flag potential fraud.
Q.14 What is data Mining?
-Data mining refers to extracting or mining knowledge from large amounts of data. The term is actually a misnomer. Thus, data mining should have been more appropriately named as knowledge mining which emphasis on mining from large amounts of data Q.15 State three different application for which data mining techniques seem appropriate. Informally explain each application. -Healthcare: Predict patient diseases based on historical data. Retail: Recommend products based on customer preferences. Banking: Detect fraudulent transactions using transaction analysis.
Q.16 Explain briefly the differences between “classification”
and ‘’clustering” and give an informal example of an application that would benefit from each technique -Classification: Categorizes data into predefined groups (e.g., spam vs. non-spam emails). Clustering: Groups similar data points tog by ether without predefined categories
Q.17 What do you mean by Data Processing?
-Data processing is the transformation of raw data into meaningful information through various methods like cleaning, analyzing, and summarizing
Q.18 Explain data cleaning
.- Data cleaning involves removing inaccuracies, correcting errors, and handling missing or inconsistent data to improve dataset quality.
A Review A Review of Financial Accounting Fraud Detection Based On Data Mining Techniquesof Financial Accounting Fraud Detection Based On Data Mining Techniques