Reading Assignment 1
Purpose of study:
The primary goal of this study is to survey technological advances in data mining across domains such as engineering, business analytics, and medicine, and to analyze the surveyed approaches in terms of their capability for knowledge discovery. Several previously published approaches are examined in this respect, and the study gives a good picture of their advantages and potential applications.
The authors identified several problems during the research work, as follows:
According to Vanahalli et al., bioinformatics routinely produces high-dimensional datasets, the result of a large number of features combined with a small number of samples. Traditional algorithms spend most of their running time mining a vast number of sparsely distributed small and mid-sized itemsets that carry no important or notable information. The work therefore concentrated on mining high-cardinality itemsets, commonly referred to as colossal itemsets, which are crucial in many domains, including bioinformatics. Existing frequent colossal itemset mining algorithms have failed to identify the complete set of significant frequent colossal itemsets.
Related Literature:
In 2018, Bai et al. described high utility itemset mining (HUIM) as a fresh area of data mining. Their investigation and analysis identified the following issues:
1. Periodization of frequent itemsets based on rating, and the mining of positive-negative rules, are absent.
2. Little research has been done on particular projections of the database using upper- and lower-bound pruning techniques and mean-value abundance.
3. Prior research lacked combined data on the number of candidate sets, the generation of frequent sets, and accuracy.
4. Better approaches to knowledge discovery are needed for negative rule mining and threshold-level tasks.
5. Mining weighted frequent itemsets in data streams requires further research.
In 2016, Shrivastava et al. noted that mining frequent itemsets is central to association rule mining (ARM), which is used in a variety of industries, including education, market basket analysis, banking, retail, and more. Rare itemsets, in particular, can be both interesting and useful. Using the utility pattern rare itemset (UPRI) approach, they examined high-utility rare itemsets and were able to identify them.
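The paper does not detail UPRI's internals, but the two quantities such a method trades off, utility and support, can be illustrated with a minimal Python sketch. All transactions, quantities, and unit profits below are assumed for illustration only:

```python
# Minimal sketch of itemset utility and support, the two quantities a
# high-utility rare itemset miner such as UPRI trades off. All data
# values here are illustrative assumptions, not from the paper.

# Each transaction maps item -> purchased quantity.
transactions = [
    {"a": 2, "b": 1},
    {"a": 1, "c": 3},
    {"b": 2, "c": 1},
]
profit = {"a": 5, "b": 3, "c": 1}  # assumed unit profit per item

def utility(itemset, db):
    """Sum of quantity * unit profit over transactions containing the whole itemset."""
    return sum(
        sum(t[i] * profit[i] for i in itemset)
        for t in db
        if all(i in t for i in itemset)
    )

def support(itemset, db):
    """Fraction of transactions that contain the itemset."""
    return sum(all(i in t for i in itemset) for t in db) / len(db)

# {a, b} appears in only one of three transactions (rare) yet has high utility.
print(utility({"a", "b"}, transactions), support({"a", "b"}, transactions))
```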
In 2016, Li et al. suggested that useful knowledge can be mined from rare itemsets. Their 2L-XMMMS model is distinctive in that it assigns each item one of two minimum supports, which lets it mine both frequent and rare itemsets.
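The paper does not spell out the 2L-XMMMS procedure, so the sketch below only illustrates the two-level minimum support idea, under the assumed MSApriori-style convention that an itemset is judged against the smallest threshold among its items:

```python
# Sketch of two-level minimum supports: frequent items are judged at a
# high threshold, rare items at a low one. The judging convention
# (minimum of the member items' thresholds) is an MSApriori-style assumption.

transactions = [{"a", "b"}, {"a", "c"}, {"a", "b", "c"}, {"d"}]

HIGH_MIS, LOW_MIS = 0.5, 0.25               # the two support levels
mis = {"a": HIGH_MIS, "b": HIGH_MIS, "c": LOW_MIS, "d": LOW_MIS}

def support(itemset):
    return sum(itemset <= t for t in transactions) / len(transactions)

def passes(itemset):
    """Keep an itemset if its support meets the smallest MIS among its items."""
    return support(itemset) >= min(mis[i] for i in itemset)

print(passes({"a", "b"}))  # judged at 0.5: support 0.5 -> kept
print(passes({"d"}))       # rare item judged at 0.25: support 0.25 -> kept
```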
In 2017, Ghorbani noted that standard methodologies for finding frequent itemsets presuppose static datasets, with constraints imposed uniformly across the whole dataset. This does not hold when the data change over time. The main goal of their study is to improve the efficiency of mining frequent itemsets on temporal data.
In 2017, He et al. argued that data mining is crucial for big data. To increase mining productivity, they put forward MAFIM, an approach based on the FP-tree and MapReduce. Data distribution is carried out using MapReduce, and the FP-tree is used to compute frequent itemsets. Once the local mining results have been collected, the center node uses MapReduce to build the global frequent itemsets. Their findings show that the MAFIM algorithm is fast and well structured.
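As a rough illustration of the distribute-count-merge pattern this description implies, here is a single-process Python sketch. It simulates MapReduce partitions with plain lists and counts itemsets directly rather than building FP-trees, so it is a simplification of the idea, not an implementation of MAFIM:

```python
# Single-process sketch of the MapReduce pattern behind MAFIM:
# each (simulated) node counts itemsets in its local partition, then a
# center step merges local counts into global frequent itemsets.
# Real MAFIM builds FP-trees on distributed nodes; this sketch does not.
from collections import Counter
from itertools import combinations

partitions = [                      # data split across simulated nodes
    [{"a", "b"}, {"a", "c"}],
    [{"a", "b", "c"}, {"b", "c"}],
]

def map_count(part, k=2):
    """'Map' step: count k-itemsets within one partition."""
    local = Counter()
    for t in part:
        local.update(combinations(sorted(t), k))
    return local

def reduce_merge(counters):
    """'Reduce' step at the center node: merge local counts."""
    total = Counter()
    for c in counters:
        total.update(c)
    return total

global_counts = reduce_merge(map_count(p) for p in partitions)
min_count = 2
print([set(s) for s, n in global_counts.items() if n >= min_count])
```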
In 2017, Phuong and Duy introduced the EHAUI-Tree approach for mining high average-utility itemsets, which can incorporate new database records without restarting the system. The utility of the updated data is computed first; then, based on the updated utility values and the previous high average-utility upper bound (HAUUB), the affected itemsets are recalculated and refreshed.
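The summary leaves the bound itself implicit; the sketch below shows the standard average-utility upper bound computation commonly used under this name, with illustrative data, while the EHAUI-Tree's incremental update machinery is omitted:

```python
# Sketch of the average-utility upper bound: an item's bound is the sum,
# over transactions containing it, of that transaction's maximum
# single-item utility (tmu). Data values are illustrative assumptions.

profit = {"a": 4, "b": 2, "c": 1}       # assumed unit profits
transactions = [
    {"a": 1, "b": 2},   # item utilities a=4, b=4 -> tmu 4
    {"b": 3, "c": 2},   # b=6, c=2               -> tmu 6
    {"a": 2, "c": 5},   # a=8, c=5               -> tmu 8
]

def tmu(t):
    """Transaction maximum utility: the largest single-item utility in t."""
    return max(q * profit[i] for i, q in t.items())

def auub(item):
    """Upper bound on the average utility of any itemset containing `item`."""
    return sum(tmu(t) for t in transactions if item in t)

for item in profit:
    print(item, auub(item))   # prune items whose bound falls below threshold
```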
In 2017, Zulkurnain and Shah observed that a flood of data arises in various industries, including banking, telecommunications, scientific operations, and so on. Data mining can be used to extract usable information from this flood, and by obtaining useful information from large datasets it supports decision-making processes.
In 2017, Hong et al. proposed employing erasable-itemset (EI) mining to find itemsets that can be dropped without harming factory profitability. For collections of erasable items, they offered an incremental mining method, built on the fast-update (FUP) concept. In an environment of intermittent data updates, their results reveal that the suggested method runs faster than the batch technique.
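A minimal sketch of the erasability test itself (the FUP-style incremental maintenance is not reproduced): products carry profits and component sets, and a set of components is erasable when dropping it forfeits at most a set fraction of total profit. All data values are assumed:

```python
# Sketch of the erasable-itemset test: components in `itemset` become
# unavailable, so every product needing any of them loses its profit.
# The itemset is erasable if that loss stays within a threshold.
# Product data are illustrative; the paper's incremental FUP-based
# maintenance is not modeled here.

products = [            # (profit, components required to manufacture)
    (100, {"m1", "m2"}),
    (200, {"m2", "m3"}),
    (50,  {"m4"}),
]
TOTAL_PROFIT = sum(p for p, _ in products)

def gain(itemset):
    """Profit forfeited if the components in `itemset` are dropped."""
    return sum(p for p, comps in products if comps & itemset)

def erasable(itemset, threshold=0.2):
    return gain(itemset) <= threshold * TOTAL_PROFIT

print(erasable({"m4"}))   # losing m4 costs 50/350 (~14%) -> erasable
print(erasable({"m2"}))   # losing m2 costs 300/350      -> not erasable
```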
In 2017, Ismail et al. suggested that mining high-utility patterns is a method for finding groups of profitable goods that can deliver a strong advantage from a customer database.
In 2017, Jiang and He unveiled a more functional data structure and the non-recursive FPNR-growth technique. Their experimental results indicate that the FPNR-growth algorithm outperforms the FP-growth approach, being efficient in both mining time and storage.
In 2017, Mohammed et al. suggested that the most frequent itemset in a database should be the one that is not subsumed by other itemsets. The Honey Bee Algorithm they employ is a simple, robust, population-based stochastic algorithm modeled on the foraging behavior of honey bees.
In 2017, Wang et al. argued that extracting frequent patterns from data streams is a crucial component of data mining. Their article discusses their solutions and how these relate to frequent itemsets and sliding windows.
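To make the sliding-window idea concrete, here is a minimal Python sketch that keeps only the most recent transactions and recounts itemsets on each arrival. Production stream miners maintain compressed summaries instead, so this illustrates the windowing concept rather than Wang et al.'s method:

```python
# Sketch of sliding-window frequent-itemset counting over a stream:
# only the `window` most recent transactions contribute support, so
# old patterns expire as new transactions arrive.
from collections import Counter, deque
from itertools import combinations

def stream_frequent(stream, window=3, min_count=2, k=2):
    win = deque(maxlen=window)          # oldest transaction falls out
    for t in stream:
        win.append(t)
        counts = Counter()
        for tx in win:
            counts.update(combinations(sorted(tx), k))
        yield {s for s, n in counts.items() if n >= min_count}

stream = [{"a", "b"}, {"a", "b", "c"}, {"b", "c"}, {"c", "d"}]
for step, frequent in enumerate(stream_frequent(stream)):
    print(step, frequent)
```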
SECTION 2:
Overall Assessment:
This paper is easy to read and understand. It gives clear explanations, with examples for every topic, which made it easy for me to follow. The author did a great job of elaborating the pros and cons of the problems raised by other authors, and also explained future directions. Through publication, the study, including its scientific and practical contributions, is shared with others in the field. This raises awareness of the new understanding among researchers and practitioners with similar interests, thereby advancing both comprehension and its application.
Future Research:
In the future, correlations between comparable data can be explored and cutoff conditions evaluated. More research is required on candidate generation and refinement. A variety of hybrid combinations of DM methods have been proposed.
SECTION 3:
Questions:
1. What solutions and practical approaches exist for the problems the authors faced?
2. What approaches are there other than ILP?