Explain Multirelational Data Mining Concept in Detail

Multirelational data mining aims to discover patterns involving multiple tables from a relational database. It includes tasks like multirelational classification, clustering, and frequent pattern mining. Spatial data mining applies data mining techniques to spatial data models and uses geographical information to produce business intelligence. It involves techniques to transform geographic data into useful formats and extract non-trivial patterns rather than just visualizing data. An example is using density-based clustering on fatal car accident data to detect systemic safety issues near road segments with dense accident clusters.

Uploaded by

anirudh devaraj


JSS SCIENCE AND TECHNOLOGY UNIVERSITY

MYSURU-570006
Department of Information Science and Engineering

Advanced Data Mining Techniques Assignment

Submitted by:
Anirudh D (01JST19PSE001)
1. Explain Multirelational Data mining concept in detail.

Multirelational data mining (MRDM) searches for patterns that involve multiple tables (relations) from a
relational database. It aims to discover knowledge directly from
relational data. There are different multirelational data mining tasks, including multirelational
classification, clustering, and frequent pattern mining. Multirelational classification aims to
build a classification model that utilizes information in different relations. Multirelational
clustering aims to group tuples into clusters using their own attributes as well as tuples
related to them in different relations. Multirelational frequent pattern mining aims at finding
patterns involving interconnected items in different relations.

Relational databases are the most popular repository for structured data. In a relational
database, multiple relations are linked together via entity-relationship links. Many
classification approaches (such as neural networks and support vector machines) can only be
applied to data represented in a single table. While most existing data mining approaches look
for patterns in a single data table, multirelational data mining approaches look for patterns
that involve multiple tables (relations) from a relational database. A relational database
consists of a collection of named tables, often referred to as relations, each of which
individually behaves like the single table that is the subject of propositional data mining.

Fig 1: Multirelational framework

Approaches supported by multirelational data mining:

• Inductive Logic Programming (ILP)
• Multi-relational Clustering
• Probabilistic Relational Model

MRDM is a multidisciplinary field that deals with knowledge discovery from relational
databases consisting of a number of relations. It is a framework that gathers data about the
data (metadata) from a database and chooses the best approach to obtain optimal results. MRDM
aims to integrate results from existing fields such as ILP, KDD, statistics, and machine
learning. Data understanding means gathering the metadata from the database, which in turn
indicates the best approach for the analysis.

Ex: Consider the relational database. Arrows go from primary keys to corresponding foreign
keys. Suppose the target relation is Loan. Each target tuple is either positive or negative,
indicating whether the loan is paid on time. The task of multirelational classification is to
build a hypothesis that distinguishes positive from negative target tuples, using information
in different relations. The most popular form of hypothesis for multirelational classification
is a set of rules.

In a database for multirelational data mining, there is one target relation, whose tuples are
called target tuples and are associated with class labels. The other relations are nontarget
relations. Each relation may have one primary key (which uniquely identifies tuples in the
relation) and several foreign keys (where a primary key in one relation can be linked to the
foreign key in another). If we assume a two-class problem, then we pick one class as the
positive class and the other as the negative class. MRDM approaches can be implemented within
actual database management systems, using the query optimization techniques of the DBMS to
improve efficiency.
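
The Loan example can be sketched in code. The snippet below is only an illustration under stated assumptions: the Account relation, the column names, the rows, and the rule itself are all hypothetical and do not come from the figure. It shows how a single rule over two relations is evaluated inside the database by joining the target relation to a nontarget relation along a foreign key and counting the positive and negative target tuples the rule covers.

```python
import sqlite3

# Hypothetical schema and rows, invented purely for illustration.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE Account (account_id INTEGER PRIMARY KEY, frequency TEXT);
    CREATE TABLE Loan (
        loan_id INTEGER PRIMARY KEY,
        account_id INTEGER REFERENCES Account(account_id),
        amount INTEGER,
        paid_on_time INTEGER  -- class label: 1 = positive, 0 = negative
    );
    INSERT INTO Account VALUES (1, 'monthly'), (2, 'weekly'), (3, 'monthly');
    INSERT INTO Loan VALUES
        (10, 1, 1000, 1), (11, 1, 5000, 1), (12, 2, 2000, 0),
        (13, 3, 1500, 1), (14, 2, 800, 1), (15, 3, 9000, 0);
""")


def rule_coverage(conn):
    """Count the target tuples covered by the hypothetical rule
    Loan(+) :- Loan.account_id = Account.account_id, Account.frequency = 'monthly'.
    Returns (covered positives, covered negatives)."""
    row = conn.execute("""
        SELECT SUM(l.paid_on_time), SUM(1 - l.paid_on_time)
        FROM Loan AS l
        JOIN Account AS a ON l.account_id = a.account_id
        WHERE a.frequency = 'monthly'
    """).fetchone()
    return row[0], row[1]


positives, negatives = rule_coverage(conn)
print(f"covered: {positives} positive, {negatives} negative")
```

A rule learner would generate many such candidate rules and keep those that cover many positive and few negative target tuples. Because the join and the counting run as one SQL query, the DBMS query optimizer does the heavy lifting.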

2. Illustrate Multidimensional Analysis and Descriptive Mining of Complex Data Objects

A major limitation of many commercial data warehouse and OLAP tools for
multidimensional database analysis is their restriction on the allowable data types for
dimensions and measures. Most data cube implementations confine dimensions to
nonnumeric data and measures to simple aggregated values. To introduce data mining and
multidimensional data analysis for complex objects, we examine how to perform
generalization on complex structured objects and construct object cubes for OLAP and
mining in object databases. The storage and access of complex structured data have been
studied in object-relational and object-oriented database systems. These systems organize a
large set of complex data objects into classes, which are in turn organized into class/subclass
hierarchies. Each object in a class is associated with:

1) An object identifier

2) A set of attributes that may contain sophisticated data structures, set- or list-valued data,
class composition hierarchies, and multimedia data

3) A set of methods that specify the computational routines or rules associated with the object
class.

To facilitate generalization and induction in object-relational and object-oriented databases,
it is important to know how the generalized data can be used for multidimensional data
analysis and data mining. Descriptive analysis is applied to discover interesting
subgroups within the bulk of the data.

Thus, the extraction of meaningful feature representations yields a variety of different views
on the same set of data objects. Each of these views or representations might focus on a
different aspect and may offer another notion of similarity. However, in almost any
application there is no universal feature representation that can be used to express similarity
between all possible objects in a meaningful way. Thus, recent data mining approaches
employ multiple representations to achieve more general results that are based on a variety of
aspects.

Descriptive mining is used to mine data and provide up-to-date information on past or
recent events. It identifies what happened in the past by analysing stored data and reporting
on it accurately. Some practical analysis methods here are standard reporting, query/drill-down,
and ad hoc reporting. Descriptive mining focuses on the summarization and conversion of the
data into meaningful information for reporting and monitoring.

An example application for multi-represented objects is data mining in protein data. A protein
can be described by multiple feature transformations based upon its amino acid sequence, its
secondary or its three-dimensional structure. Another example is data mining in image data
which might be represented by texture features, color histograms or text annotations. Mining
multi-represented objects yields advantages because more information can be incorporated
into the mining process. On the other hand, the additional information has to be used
carefully since too much information might distort the derived patterns. We can
distinguish two problems when clustering multi-represented objects: comparability and
semantics. The comparability problem subsumes several effects that arise when comparing
features, distances, or statements from different representations.
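
One face of the comparability problem is that distances from different representations live on different scales. The following is a minimal sketch, not a method taken from the text: it assumes each representation is a numeric feature vector, scales each representation's pairwise distances by that representation's largest distance, and averages the scaled values. The image names and feature values are hypothetical.

```python
import math


def euclidean(u, v):
    """Plain Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def normalized_distances(objects, key):
    """Pairwise distances in one representation, divided by the largest
    distance in that representation so that values lie in [0, 1]."""
    names = sorted(objects)
    raw = {
        (p, q): euclidean(objects[p][key], objects[q][key])
        for i, p in enumerate(names)
        for q in names[i + 1:]
    }
    largest = max(raw.values()) or 1.0  # avoid dividing by zero
    return {pair: d / largest for pair, d in raw.items()}


def combined_distances(objects, keys):
    """Average the normalized distances over all representations."""
    per_rep = [normalized_distances(objects, k) for k in keys]
    return {
        pair: sum(rep[pair] for rep in per_rep) / len(per_rep)
        for pair in per_rep[0]
    }


# Hypothetical images with two representations on very different scales.
images = {
    "a": {"color": [0.1, 0.9], "texture": [100.0, 200.0]},
    "b": {"color": [0.2, 0.8], "texture": [110.0, 190.0]},
    "c": {"color": [0.9, 0.1], "texture": [400.0, 50.0]},
}
dist = combined_distances(images, ["color", "texture"])
for pair, d in sorted(dist.items()):
    print(pair, round(d, 3))
```

Without the scaling step, the texture distances (in the hundreds) would swamp the color distances (below 2), so the combined result would effectively ignore one representation.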

3. Paraphrase on spatial data mining using an example.

Spatial data mining (SDM) is the application of data mining to spatial models. In spatial data
mining, analysts use geographical or spatial information to produce business intelligence or
other results. This requires specific techniques and resources to get the geographical data
into relevant and useful formats. The term generally refers to finding useful and non-trivial
patterns in data. In other words, just setting up a visual map of geographic data may not be
considered spatial data mining by experts. The core goal of a spatial data mining project is to
distinguish the information needed to build real, actionable patterns to present, excluding
things like statistical coincidence, randomized spatial modelling, or irrelevant results. SDM
aims to improve the human ability to extract knowledge and insights from large and complex
collections of digital data. It efficiently extracts previously unknown, potentially useful, and
ultimately understandable knowledge from these huge datasets for a given task. The SDM
method relies not only on the traditional theories of mathematical statistics, machine learning,
pattern recognition, neural networks, and artificial intelligence, but also on newer
methods, such as data fields, cloud models, and decision trees.

Fig 2. Spatial data mining process

Ex: Consider exploring fatal car accident data with the spatial data mining
process. Finding and prioritizing locations where systemic issues result in multiple fatal
car accidents is a crucial need for transportation agencies that run operations and guide safety
policy. A spatial approach can help us expand beyond our basic understanding of where fatal
car accidents occur and start detecting patterns.

Many fatal car accidents result from seemingly random events, but a dense cluster of fatal car
accidents near a specific road segment can suggest the presence of a systemic problem or
human-driven process that can greatly benefit from targeted safety measures. This
methodology uses a Spatial Statistics tool named Density-Based Clustering to find dense
clusters of fatal car accidents.
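
To make the idea of density-based clustering concrete, here is a minimal sketch of DBSCAN, one well-known density-based algorithm. It is an assumption that this matches what the named tool does internally; the coordinates (treated as projected positions in metres), `eps`, and `min_pts` are hypothetical values chosen for illustration.

```python
import math


def dbscan(points, eps, min_pts):
    """Label each point with a cluster id (0, 1, ...) or -1 for noise."""

    def neighbors(i):
        # Indices of all points within eps of point i (including i itself).
        return [j for j, q in enumerate(points) if math.dist(points[i], q) <= eps]

    labels = [None] * len(points)
    cluster = -1
    for i in range(len(points)):
        if labels[i] is not None:
            continue
        seeds = neighbors(i)
        if len(seeds) < min_pts:
            labels[i] = -1  # provisional noise; may become a border point later
            continue
        cluster += 1
        labels[i] = cluster
        queue = [j for j in seeds if j != i]
        while queue:
            j = queue.pop()
            if labels[j] == -1:
                labels[j] = cluster  # noise reachable from a core point is a border point
            if labels[j] is not None:
                continue
            labels[j] = cluster
            nbrs = neighbors(j)
            if len(nbrs) >= min_pts:  # j is a core point, so keep expanding
                queue.extend(nbrs)
    return labels


# Hypothetical fatal-accident locations: two dense groups and one isolated accident.
accidents = [(0, 0), (10, 5), (5, 12), (8, 3),
             (500, 500), (505, 510), (498, 507),
             (2000, 100)]
labels = dbscan(accidents, eps=30, min_pts=3)
print(labels)
```

The isolated accident is labelled as noise, which mirrors the distinction drawn above between seemingly random events and dense clusters that hint at a systemic problem.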

Find Clusters (Density-Based Clustering) → Categorize Clusters → Prioritize Clusters (Normalizing and Indexing)

To organize and prioritize each cluster as a candidate safety measure project, we then
consider the characteristics of each detected cluster and the transportation network:

• The elements of the transportation network at the cluster’s location, such as the
presence of an intersection or the posted speed limit, become the basis for classifying the
clusters into groups that can be addressed by common safety measures. An example
group would be “Intersection and Traffic Light Clusters”.
• The number of fatal accidents at the cluster and the traffic counts at the location then
become the basis of prioritization rankings.
• These clusters become our candidate priority locations, and we then add
further characteristics to each cluster to categorize and prioritize it.
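
The prioritization step can be sketched as follows. The text does not give the exact formula, so this is an assumption: each cluster's fatal accident count is normalized by its traffic count, and the resulting rates are min-max scaled into a 0 to 1 index used for ranking. Cluster names and figures are hypothetical.

```python
def prioritize(clusters):
    """Rank clusters by fatal accidents per unit of traffic,
    min-max scaled to a 0-1 priority index (highest first)."""
    rates = {
        name: c["fatal_accidents"] / c["daily_traffic"]
        for name, c in clusters.items()
    }
    low, high = min(rates.values()), max(rates.values())
    span = (high - low) or 1.0  # avoid dividing by zero when all rates are equal
    index = {name: (rate - low) / span for name, rate in rates.items()}
    return sorted(index.items(), key=lambda item: item[1], reverse=True)


# Hypothetical clusters with their accident counts and traffic counts.
clusters = {
    "A": {"fatal_accidents": 6, "daily_traffic": 30000},
    "B": {"fatal_accidents": 4, "daily_traffic": 8000},
    "C": {"fatal_accidents": 3, "daily_traffic": 20000},
}
ranking = prioritize(clusters)
for name, score in ranking:
    print(name, round(score, 3))
```

Normalizing by traffic keeps a busy road from outranking a quiet one merely because more vehicles pass through it: cluster B has fewer accidents than A but a higher rate, so it ranks first.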
