DMBI Questions

The document discusses data mining and business intelligence concepts across 6 modules. It covers topics such as data warehousing, OLAP, data preprocessing, classification algorithms like decision trees and naive bayes, clustering techniques including k-means and DBSCAN, association rule mining with the apriori algorithm, and applications of data mining in business intelligence.

Uploaded by

Manthan

DMBI

Module 1
• Draw the data warehousing architecture.
• Compare and contrast between OLTP and OLAP.
• What is data mining? Explain the KDD process with a diagram.
• Compare star schema, snowflake schema and fact
constellation.
• Short note on Dimensional Modeling.
• Define data warehouse. Describe different OLAP operations in
detail.
• Compare star schema, snowflake schema and fact
constellation.
• Explain OLAP operations with examples.
• Explain the knowledge discovery process with diagram.
• What are the major issues in data mining?
• What is data mining? Explain KDD process with diagram.
• Demonstrate with a diagram the process of KDD.
Module 2
• What is noisy data? How can noisy data be handled? (2)
• Consider the ages of 29 participants in a survey, given in
sorted order: 5, 10, 13, 15, 16, 16, 20, 20, 21, 22, 22, 25, 25,
25, 25, 30, 33, 33, 35, 35, 35, 35, 36, 40, 45, 46, 52, 70, 85.
Explain how to calculate the mean, median, standard deviation, 1st
and 3rd quartiles for the given data, and compute them.
Show the box-and-whisker plot for this data.
• (2) Suppose the data for analysis includes the attribute age. The
age values for the data tuples are (in increasing order):
13, 15, 16, 16, 19, 20, 20, 21, 22, 22, 25, 25, 25, 25, 30, 33, 33,
35, 35, 35, 35, 36, 40, 45, 46, 52, 70
i) What is the mean of the data? What is the median of the data?
ii) What is the mode of the data? Comment on the data's modality.
iii) What is the midrange of the data?
iv) Give the five-number summary of the data.
v) Show a box plot of the data.

• Describe any two methods of data reduction.

• Use the following normalization methods to normalize the group
of data: 200, 300, 400, 600, 1000.
Use min-max normalization with min = 0 and max = 1, and
z-score normalization.
• Give any two techniques of data preprocessing.
• Suppose a group of 12 sales price records has been stored as
follows: 30, 36, 47, 50, 52, 52, 56, 60, 63, 70, 70, 110.
Find the mean, median, mode and interquartile range (IQR).
• What is an attribute? Explain its types.
• Describe the different types of attributes one may come across
in data mining with two examples of each.
• Find the mean, median and mode for the given data. Show the box plot.
11, 13, 13, 15, 15, 16, 19, 20, 20, 21, 21, 22, 23, 24, 30, 40, 45, 45, 45.
• What is the need for pre-processing? Explain the different steps
involved in data pre-processing.
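
As a way to check hand calculations for two of the Module 2 questions, here is a minimal Python sketch. It uses the 12 sales price records and the normalization data exactly as given in the questions. Two choices are assumptions, not something the question bank specifies: quartiles are taken as the medians of the lower and upper halves (textbooks and libraries use several conventions, so Q1 and Q3 may differ slightly from other methods), and the z-score uses the population standard deviation. The helper name `quartiles` is made up for this illustration.

```python
from statistics import mean, median, multimode, pstdev


def quartiles(sorted_values):
    """Return (Q1, Q3) as medians of the lower and upper halves.

    Assumption: median-of-halves convention; other conventions
    (for example interpolation) can give slightly different values.
    """
    n = len(sorted_values)
    lower = sorted_values[: n // 2]
    upper = sorted_values[(n + 1) // 2:]
    return median(lower), median(upper)


# Sales price records from the question (already sorted).
prices = [30, 36, 47, 50, 52, 52, 56, 60, 63, 70, 70, 110]
q1, q3 = quartiles(prices)
summary = {
    "mean": mean(prices),
    "median": median(prices),
    "modes": multimode(prices),  # every value tied for highest count
    "q1": q1,
    "q3": q3,
    "iqr": q3 - q1,
}

# Data from the normalization question.
values = [200, 300, 400, 600, 1000]
lo, hi = min(values), max(values)
# Min-max normalization onto the range [0, 1].
min_max = [(v - lo) / (hi - lo) for v in values]
# Z-score normalization (assumption: population standard deviation).
mu, sigma = mean(values), pstdev(values)
z_scores = [(v - mu) / sigma for v in values]

print(summary)
print(min_max)
print([round(z, 4) for z in z_scores])
```

If a course expects the sample standard deviation instead, `statistics.stdev` can replace `pstdev`; the z-scores then shrink slightly in magnitude.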

Module 3
• Explain the concepts of information gain and Gini index used in
the decision tree algorithm.
• Consider the training dataset given below. Use the Naive Bayes
algorithm to determine whether it is advisable to play tennis on
a day with hot temperature, rainy outlook, high humidity and
no wind.
• Short note on Random Forest technique.
• Short note on Decision tree induction.
• Short note on cross validation.
• Apply the Naive Bayes classifier algorithm to the dataset given
below, and classify the unknown data sample.
Given all the previous patients I've seen (below are their
symptoms and their diagnoses):

Do I believe that a patient with the following symptoms has the flu?

• Briefly explain Bagging and Boosting of Classifiers.
• Write and explain Bayes Classification algorithm.
• Write the steps of the AdaBoost algorithm.
• Describe the classification performance evaluation measures
that are obtained from a confusion matrix.
• Explain the confusion matrix with one example.
• Write a short note on Naïve Bayesian classification.
• Explain bagging technique.
• Explain Confusion Matrix. Calculate Accuracy, Precision and
Recall for the following Confusion Matrix.

• Explain regression. Explain linear regression with example.

• Using the given training dataset, classify the following tuple
using the Naïve Bayes algorithm:
<Homeowner: No, Marital Status: Married, Job experience: 3>
• Illustrate any one classification technique for the following
dataset. Show how we can classify the new
tuple (HOMEOWNER=Yes, Status=Employed, Income=Average).

• Explain different methods that can be used to evaluate and
compare the accuracy of different classification algorithms.
• Explain simple linear regression with example.
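
The confusion matrix referred to in the Module 3 questions did not survive in this copy, so the following Python sketch uses hypothetical counts (TP = 40, FN = 10, FP = 5, TN = 45) purely to illustrate how accuracy, precision and recall are derived from a two-class confusion matrix. The function name `classification_metrics` is also an illustrative assumption.

```python
def classification_metrics(tp, fn, fp, tn):
    """Accuracy, precision and recall from two-class confusion matrix counts."""
    total = tp + fn + fp + tn
    return {
        # Fraction of all samples that were classified correctly.
        "accuracy": (tp + tn) / total,
        # Of the samples predicted positive, the fraction truly positive.
        "precision": tp / (tp + fp),
        # Of the truly positive samples, the fraction predicted positive.
        "recall": tp / (tp + fn),
    }


# Hypothetical counts; the original question's matrix is not available here.
metrics = classification_metrics(tp=40, fn=10, fp=5, tn=45)
print(metrics)
```

With the actual matrix from an exam paper, only the four counts in the final call need to change.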

Module 4
• What is an outlier? Explain various methods for performing
outlier analysis.
• Cluster the following eight points (with (x, y) representing
locations) into three clusters: A1(2, 10), A2(2, 5), A3(8, 4),
A4(5, 8), A5(7, 5), A6(6, 4), A7(1, 2), A8(4, 9). Assume the initial
cluster centers are A1(2, 10), A4(5, 8) and A7(1, 2). The
distance function between two points a = (x1, y1) and b = (x2, y2)
is defined as P(a, b) = |x2 - x1| + |y2 - y1|.
Use the K-Means algorithm to find the three cluster centres after
the second iteration.
• Short note on DBSCAN Algorithm.
• Suppose we have six objects named A, B, C, D, E, F. Apply
single linkage clustering and draw the dendrogram for the given data.

• What is an outlier? Describe methods used for outlier analysis.

• Give the overview of partition clustering methods.
• Give the steps of K means clustering algorithm.
• Explain concept hierarchy with example.
• Explain density based clustering.
• What do you mean by an outlier? Give its types.
• Apply the K-means algorithm to divide the given set of values
{2, 3, 6, 8, 9, 12, 15, 18, 22} into 3 clusters.
• Suppose we have five objects named A, B, C, D and E. Apply
single linkage clustering and draw the dendrogram for the given
data.

• What is an outlier? Describe methods that are used for outlier
analysis.
• Use k-means clustering to cluster the following data into 2
clusters: 2, 3, 4, 10, 11, 12, 20, 25, 30.

• Explain DBSCAN algorithm with example.
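
For the eight-point K-Means question in this module, the two iterations can be traced with a short Python sketch. It follows the question as stated: points are assigned to the nearest centre by the given distance |x2 - x1| + |y2 - y1|, and each centre is then updated to the mean of its assigned points. The function names and the decision to leave an empty cluster's centre unchanged are assumptions made for this illustration.

```python
def manhattan(a, b):
    """Distance from the question: |x2 - x1| + |y2 - y1|."""
    return abs(a[0] - b[0]) + abs(a[1] - b[1])


def k_means(points, centers, iterations):
    """Run a fixed number of assign-then-update iterations."""
    clusters = []
    for _ in range(iterations):
        # Assignment step: each point joins its nearest centre.
        clusters = [[] for _ in centers]
        for p in points:
            nearest = min(range(len(centers)),
                          key=lambda i: manhattan(p, centers[i]))
            clusters[nearest].append(p)
        # Update step: each centre moves to the mean of its points.
        # Assumption: an empty cluster keeps its previous centre.
        centers = [
            (sum(x for x, _ in c) / len(c), sum(y for _, y in c) / len(c))
            if c else centers[i]
            for i, c in enumerate(clusters)
        ]
    return centers, clusters


points = [(2, 10), (2, 5), (8, 4), (5, 8), (7, 5), (6, 4), (1, 2), (4, 9)]
initial = [(2, 10), (5, 8), (1, 2)]  # A1, A4, A7

centers, clusters = k_means(points, initial, iterations=2)
print(centers)  # [(3.0, 9.5), (6.5, 5.25), (1.5, 3.5)]
```

After the first iteration the centres are (2, 10), (6, 6) and (1.5, 3.5); in the second iteration A8 moves to the first cluster, which gives the centres printed above.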

Module 5
• Explain market basket analysis with an example. (3)
• Use the Apriori algorithm to identify the frequent itemsets in
the following database. Then extract the strong association
rules from these sets. Assume min. support = 50% and min.
confidence = 75%.

• Explain multi-level and multi-dimensional association rules with
an example. (3)
• For the table given, apply the Apriori algorithm and show the
frequent itemsets and strong association rules. Assume a minimum
support of 30% and a minimum confidence of 70%.

• How can we further improve the efficiency of Apriori-based
mining?
• Explain how the efficiency of the Apriori algorithm is improved.
• Consider the transaction database given in the table below. Apply
the Apriori algorithm with a minimum support of 50% and
confidence of 50%. Find all frequent itemsets and all the
association rules.

• What is market basket analysis? Give the Apriori algorithm.
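
The transaction tables for the Apriori questions are missing from this copy, so the Python sketch below runs on a small hypothetical database of four transactions. It is meant only as an illustration of the level-wise idea (count candidates, keep those meeting minimum support, join and prune to form the next level, then derive rules meeting minimum confidence). All function names and the sample data are assumptions.

```python
from itertools import combinations


def apriori(transactions, min_support):
    """Return {itemset: support} for all frequent itemsets."""
    n = len(transactions)
    items = sorted({i for t in transactions for i in t})
    candidates = [frozenset([i]) for i in items]
    frequent = {}
    k = 1
    while candidates:
        # Count each candidate and keep those meeting minimum support.
        level = {}
        for c in candidates:
            support = sum(1 for t in transactions if c <= t) / n
            if support >= min_support:
                level[c] = support
        frequent.update(level)
        k += 1
        # Join step: unions of frequent itemsets that have size k.
        joined = {a | b for a in level for b in level if len(a | b) == k}
        # Prune step: every (k-1)-subset must itself be frequent.
        candidates = [c for c in joined
                      if all(frozenset(s) in level
                             for s in combinations(c, k - 1))]
    return frequent


def strong_rules(frequent, min_confidence):
    """Return (lhs, rhs, confidence) for rules meeting min_confidence."""
    rules = []
    for itemset, support in frequent.items():
        for r in range(1, len(itemset)):
            for lhs in combinations(sorted(itemset), r):
                lhs = frozenset(lhs)
                confidence = support / frequent[lhs]
                if confidence >= min_confidence:
                    rules.append((lhs, itemset - lhs, confidence))
    return rules


# Hypothetical transactions, not taken from any question in this bank.
transactions = [{"A", "B", "C"}, {"A", "C"}, {"A", "D"}, {"B", "E", "F"}]
frequent = apriori(transactions, min_support=0.5)
rules = strong_rules(frequent, min_confidence=0.75)
print(frequent)
print(rules)
```

On this sample data the frequent itemsets are {A}, {B}, {C} and {A, C}, and the only strong rule is C -> A with confidence 1.0; A -> C is rejected because its confidence is 2/3.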

Module 6
• What is Business Intelligence (BI)? Explain its architecture in
detail.
• How is data mining used in Business Intelligence (BI)?
• What is BI? Define decision support system.
• Explain Business Intelligence issues.
• Define BI and give its architecture. Explain any business
application where data mining can be used.
