0% found this document useful (0 votes)

33 views10 pages

Indira 2011

The document discusses using genetic algorithms to mine association rules from datasets. Genetic algorithm parameters like population size, crossover rate, and fitness function affect the accuracy of association rule mining. The study compares how changing these parameters impacts performance on three datasets.

Uploaded by

saubackchodi

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

33 views10 pages

Indira 2011

Uploaded by

saubackchodi

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 10

Association Rule Mining Using Genetic Algorithm:

The Role of Estimation Parameters

K. Indira1 and S. Kanmani2

1
Research Scholar, Department of Computer Science,
Pondicherry Engineering College, Puducherry, India
[email protected]
2
Professor, Department of Information Technology,
Pondicherry Engineering College, Puducherry, India
[email protected]

Abstract. Genetic Algorithms (GA) have emerged as practical, robust optimi-

zation and search methods to generate accurate and reliable Association Rules.
The performance of GA for mining association rules greatly depends on the GA
parameters namely population size, crossover rate, mutation rate, fitness func-
tion adopted and selection method. The objective of this paper is to compare the
performance of the Genetic algorithm for association rule mining by varying
these parameters. The algorithm when tested on three datasets namely Lenses,
Iris and Haberman indicates that the accuracy depends mainly on the fitness
function which is the key parameter of GA. The population size is affected by
the size of the dataset under study. The crossover probability brings changes in
convergence rate with minimal changes in accuracy. The size of the dataset and
relationship between its attributes also plays a role in achieving the optimum
accuracy.

Keywords: Association rules, Genetic Algorithm, Population size, Crossover

rate, Fitness function.

1 Introduction
Data mining, also referred as knowledge discovery in database, means a process of
nontrivial extraction of implicit, previously unknown and potentially useful informa-
tion (such as knowledge rules, constraints, regularities) from data in database. Data
mining combines theory and technology of several domains which include artificial
intelligence, machine learning, statistics, neural network and so on. Association rule
mining is a major area in data mining that discovers the relations between different
attributes by analyzing and disposing data in the database.
Many algorithms for generating association rules were developed over time. Some
of the well known algorithms are Apriori, Eclat and FP-Growth tree. Many existing
algorithms traverse the database many times so the I/O overhead and computational
complexity becomes very high and cannot meet the requirements of large-scale data-
base mining. Genetic algorithm is an algorithm which based on the biological theory
of evolution and molecular genetics of the global random search, the algorithm has a

A. Abraham et al. (Eds.): ACC 2011, Part I, CCIS 190, pp. 639–648, 2011.
© Springer-Verlag Berlin Heidelberg 2011
640 K. Indira and S. Kanmani

strong randomness, robust and implicit parallelism and can quickly and effectively
search for global optimization, in an effective way to deal with large-scale data sets.
At present, genetic algorithm-based data mining methods have yielded some progress,
and based on genetic algorithms classification system has also yielded some results.
This paper analyses the mining of Association Rules by applying Genetic Algo-
rithms. There have been several attempts for mining association rules using Genetic
Algorithm. Robert Cattral et al. [1] describe the evolution of hierarchy of rule using
genetic algorithm with chromosomes of varying length and macro mutations. The
initial population is seeded rather than random selection. Manish Saggar et al. [2]
proposes an algorithm with binary encoding and the fitness function was generated
based on confusion matrix. The individuals are represented using the Michigan’s Ap-
proach. Roulette Wheel selection is done by first normalizing the values of all candi-
dates.
Genetic algorithm based on the concept of strength of implication of rules was pre-
sented by Zhou et al. [3]. The properties of independence and correlation of descrip-
tions in rules are taken up for fitness calculation. Genxiang et al. [4] introduced
dynamic immune evolution, and biometric mechanism in Engineering immune com-
puting namely immune recognition, immune memory and immune regulation to GA
for mining association rules.
Gonzales. E et al. [5] introduced the Genetic Relation Algorithm (GRA) based on
evaluating the distances between rules. The distance is calculated using both matching
criteria namely complete match and partial match. Genetic algorithm easily leads to
premature convergence or takes too much time to converge during evolution process.
Hong Lei et al. [6] propose GA where the fitness function is based on predictive ac-
curacy, comprehensibility and interestingness factor. The selection method is based
on elitist recombination.
In Haiying Ma et al. [7] the encoding of data is done with gene string structure
where the complexity concepts are mapped to form linear symbols. The fitness func-
tion is the measure of the overall performance of the process rather than that of
individual rules when the bit strings were interpreted as a complex process. Adaptive
exchange probability (Pc) and mutation probability (Pm) are adopted in this paper.
Hong Guo et al. [8] adopt the method of adaptive mutation rate to avoid excessive
variation causing non-convergence, or into a local optimal solution. A sort of indi-
vidual-based selection method is applied to the evolution in genetic algorithm, in or-
der to prevent the high-fitness individuals converging early by the rapid growth of the
number of individual.
As the parameters of the genetic algorithm and the fitness function are found to be
the major area of interest in the above studies, this paper tries to explore on the effects
of the genetic parameters and the controlling variables of fitness function on three
different datasets.
A brief introduction about Association Rule Mining and GA is given in Section 2,
followed by methodology in section 3, which describes the basic implementation de-
tails of Association Rule Mining with GA. In section 4 the parameters that decides on
efficiency of the algorithm is presented. Section 5 presents the experimental results
followed by conclusion in the last section.
Association Rule Mining Using Genetic Algorithm: The role of Estimation Parameters 641

2 Association Rules and Genetic Algorithms

2.1 Association Rules

Association rule is a popular and well researched method for discovering interesting
relations between variables in large databases. It studies the frequency of items occur-
ring together in transactional databases, and based on a threshold called support, iden-
tifies the frequent item sets. Another threshold, confidence, which is the conditional
probability that an item appears in a transaction when another item appears, is used to
pinpoint association rules.
The discovered association rules are of the form: P Q [s, c], where P and Q are
conjunctions of attribute value-pairs, and s (for support) is the probability that P and
Q appear together in a transaction and c (for confidence) is the conditional probability
that Q appears in a transaction when P is present.

2.2 Genetic Algorithm

A Genetic Algorithm (GA) is a procedure used to find approximate solutions to

search problems through the application of the principles of evolutionary biology.
Genetic algorithms use biologically inspired techniques such as genetic inheritance,
natural selection, mutation, and sexual reproduction (recombination, or crossover).
Genetic algorithms are typically implemented using computer simulations in
which an optimization problem is specified. For this problem, members of a space of
candidate solutions, called individuals, are represented using abstract representations
called chromosomes. The GA consists of an iterative process that evolves a working
set of individuals called a population towards an objective function, or fitness func-
tion. Traditionally, solutions are represented using fixed length strings especially bi-
nary strings, but alternative encodings have also been developed.

3 Methodology
The evolutionary process of GA is a highly simplified and stylized simulation of the
biological version. It starts from a population of individuals randomly generated ac-
cording to some probability distribution, usually uniform and updates this population
in steps called generations. In each generation, multiple individuals are randomly se-
lected from the current population based on application of fitness, crossover, and
modified through mutation to form a new population.

A. [Start] Generate random population of n chromosomes.

B. [Fitness] Evaluate the fitness f(x) of each chromosome x in the population.
C. [New population] Create a new population by repeating the following steps until
the new population is complete.
i. [Selection] Select two parent chromosomes from a population according
to their fitness.
ii. [Crossover] With a crossover probability alter the parents to form a new
offspring.
642 K. Indira and S. Kanmani

iii. [Mutation] With a mutation probability mutate new offspring at each lo-
cus.
iv. [Accepting] Place new offspring in a new population
D. [Replace] Use newly generated population for a further run of the algorithm
E. [Test] If the end condition is satisfied, stop, and return the best solution in cur-
rent population
F. [Loop] Go to step B

4 Parameters in Genetic Algorithm

The GA parameters are the key components enabling the system to achieve good
enough solution for possible terminating conditions.

4.1 Encoding

Encoding is the process of representing individual solutions. The most common way
of encoding is binary encoding. Here each chromosome encodes a binary string where
each bit in the string represents some characteristics of the solution. Other encoding
schemes are octal, hexadecimal, permutation value and tree encoding.

4.2 Population

Population refers to the number of chromosomes taken up for optimization. A chro-

mosome is the raw genetic information that the GA deals with. If there are too few
chromosomes, GA has few possibilities to perform crossover and only a small part of
search space is explored. On the other hand, if there are too many chromosomes, GA
slows down. The initial population generation and population size are the two aspects
of population. The initial population is either selected randomly from the data or se-
lected with prior knowledge on the data.
The population size is calculated by

(1)

Where = number of chromosomes in data and k is the average size of the schema of
interest. If uniform crossover is adopted we can most likely get with population size
at least twice as small as the number of instances in the dataset.

4.3 Selection

During each successive generation, a proportion of the existing population is selected

to breed a new generation. Individuals are selected through a fitness-based process,
where fitter solutions as measured by a fitness function are typically more likely to be
selected. The Tournament, Roulette Wheel, Random, Rank and Boltzmann selection
are the commonly used selection methods. Elitism and stochastic universal sampling
significantly improves the GA’s performance.
Association Rule Mining Using Genetic Algorithm: The role of Estimation Parameters 643

4.4 Fitness Function

A fitness function is a particular type of objective function that prescribes the optimal-
ity of a chromosome in a genetic algorithm, so that the particular chromosome may be
ranked against all the other chromosomes [9, 10]. An ideal fitness function correlates
closely with the algorithm's goal, and yet may be computed quickly. Speed of execu-
tion is very important, as a typical genetic algorithm must be iterated many times in
order to produce an usable result for a non-trivial problem.
This paper adopts minimum support and minimum confidence for filtering rules.
Then correlative degree is confirmed in rules which satisfy minimum support-degree
and minimum confidence-degree. After support-degree and confidence-degree are
synthetically taken into account, fit degree function is defined as follows.

. . (2)

，
In the above formula, Rs + Rc =1 (Rs ≥0 Rc ≥ 0) and Suppmin, Confmin are respec-
tive values of minimum support and minimum confidence. By all appearances if the
Suppmin and Confmin are set to higher values, then the value of fitness function is also
found to be high.

4.5 Crossover Operator

Crossover entails choosing two individuals to swap segments of their code, producing
artificial "offspring" that are combinations of their parents. This process is intended to
simulate the analogous process of recombination that occurs to chromosomes during
sexual reproduction. Common forms of crossover include single-point crossover, in
which a point of exchange is set at a random location in the two individual genomes,
where one individual contributes all its code till the point of crossover, the second
individual contributes all its code after the point of crossover to produce an offspring,
and uniform crossover, in which the value at any given location in the offspring's ge-
nome is either the value of one parent's genome at that location or the value of the
other parent's genome at that location, chosen with 50/50 probability[8].

4.6 Mutation Operator

Partial gene values of individuals are adjusted by using mutation operation [5]. This
part of the genetic algorithm, require great care, here there are two probabilities, one
usually called as Pm, this probability will be used to judge whether mutation has to be
done or not, when the candidate fulfills this criterion it will be fed to another probabil-
ity, the locus probability that is on which point of the candidate the mutation has to be
done.

4.7 Number of Generations

The generational process of mining association rules by Genetic algorithm is repeated

until a termination condition has been reached. Common terminating conditions are:
644 K. Indira and S. Kanmani

A solution is found that satisfies minimum criteria.

• Fixed number of generations reached.
• Allocated budget (computation time/money) reached.
• The highest ranking solution's fitness is reaching or has reached a plateau
such that successive iterations no longer produce better results.
• Manual inspection.
• Combinations of the above.

5 Experimental Studies
The objective of this study is to compare the accuracy achieved in datasets by varying
the GA Parameters. The encoding of chromosome is binary encoding with fixed
length. As the crossover is performed on attribute level the mutation rate is set to zero
so as to retain the original attribute values. The selection method used is tournament
selection. The fitness function adopted is as given in equation (1).
Three datasets namely Lenses, Haberman survival and Iris Data Set from UCI Ma-
chine Learning Repository have been taken up for experimentation. Lenses dataset
has 4 attributes with 24 instances. Haberman's Survival data Set has 3 attributes and
306 instances and Iris dataset has 5 attributes and 150 instances. The Algorithm is
implemented using MATLAB R2008a simulation package. The flow of the system is
as shown in flowchart below.

/ŶŝƚŝĂůŝǌĞWŽƉƵůĂƚŝŽŶ

ǀĂůƵĂƚĞĨŝƚŶĞƐƐ

zĞƐ

^ĂƚŝƐĨǇŽŶƐƚƌĂŝŶƚƐ

EŽ

^ĞůĞĐƚ^ƵƌǀŝǀŽƌƐ ƌŽƐƐŽǀĞƌ

KƵƚƉƵƚZĞƐƵůƚƐ

Fig. 1. Flow chart of the GA

The default values set for the GA parameters are given in Table 1.
The accuracy and the convergence rate by controlling the GA parameters are rec-
orded in the table 2. Accuracy is the count of dataset matching between the original
dataset and resulting population divided by the number of instances in dataset. The
convergence rate is the generation at which the fitness value becomes fixed. The pop-
ulation size is varied for the three dataset, from the size of the dataset to one and half
times the dataset size while keeping the other parameters fixed.
Association Rule Mining Using Genetic Algorithm: The role of Estimation Parameters 645

Table 1. Default GA Parameters

Parameter Value
Population Size Instances * 1.5
Crossover Rate 0.5
Mutation Rate 0.0
Selection Method Tournament Selection
Minimum Support 0.2
Minimum Confidence 0.8

Table 2. Comparison based on variation in population Size

No. of Instances No. of Instances * 1.25 No. of Instances *1.5

Accuracy No. of Accuracy No. of Accuracy No. of
% Generations % Generations % Generations
Lenses 75 7 82 12 95 17
Haberman 71 114 68 88 64 70
Iris 77 88 87 53 82 45

It could be seen from Table 2 that for the Lenses dataset whose size is small, an op-
timal accuracy is achieved, when the population size is one and half times the size of
the dataset whereas for the larger dataset, Haberman the accuracy is maximum when
the population size is equivalent to dataset size. For the Iris dataset of moderate size
the population has to be set to 1.25 times the size of the dataset to achieve optimum
result.
As the fitness function is considered to be the crucial factor for the GA, variations
are introduced in the fitness function while other parameters remain unchanged. In
Table 3 the minimum confidence and support values are altered when others are at
default values and the results are recorded.
From the Table 3 it is clear that the variation in minimum support and confidence
brings greater changes in accuracy. When the values of minimum support and confi-
dence are set to minimum, the accuracy if found to be low regardless of the size of the
dataset. The same is noted when both the values are set to maximum. Optimum accu-
racy is achieved when a tradeoff value between minimum confidence and minimum
support is set.

Table 3. Comparison based on variation in Minimum Support and Confidence

Minimum Support & Minimum Confidence

Sup = 0.4 & Sup =0.9 & Sup = 0.9 & Sup = 0.2 &
con =0.4 con =0.9 con = 0.2 con = 0.9
Accuracy No. Accuracy No. Accuracy No. Accuracy No.
% of % of % of % of
Gen. Gen. Gen. Gen.
Lenses 22 20 49 11 70 21 95 18
Haberman 45 68 58 83 71 90 62 75
Iris 40 28 59 37 78 48 87 55
646 K. Indira and S. Kanm
mani

When the parameters Rs and Rc are altered in the fitness function, minimum alteera-
tions in accuracy are noted and hence their impact is not taken up for analysis.
In Table 4 the crossover probability is altered when other GA parameters are seet to
default values and the results observed are recorded.

Table 4. Com
mparison based on variation in Crossover Probability

Cross Over
Pc = .2
25 Pc = .5 Pc = .75
Accuracy No. of Accuracy No. of Accuracy % No. of
% Generations % Generations Generationns
Lenses 95 8 95 16 95 13
Haberman 69 77 71 83 70 80
Iris 84 45 86 51 87 55

From the Table 4 it is evvident that the accuracy achieved is almost same for all the
three datasets whatever the crossover probability adopted. The effect of the crossoover
probability on convergencee rate is noticeable, the data size and population size beeing
set also alters the convergen
nce rate.
The results observed arre compared for the three datasets as shown in figurees 2
and 3.

Fig. 2. Population Size Vs

V Accuracy Fig. 3. Minimum Support and Confidennce
Vs Accuracy

The values of the GA parameters

p set for the three datasets when maximum eeffi-
ciency is achieved is shown
n in Table 5.

Table 5. Comparison of the optimum value of Parameters for maximum Accuracy achieveed

Dataset No. of No. of Minimum Minimum Crossover Accuraccy

Instances attrributes Support confidence rate in %
Lenses 24 4 0.2 0.9 0.25 95
Haberman 306 3 0.9 0.2 0.5 71
Iris 150 5 0.2 0.9 0.75 87
Association Rule Mining Using Genetic Algorithm: The role of Estimation Parameters 647

It is observed from the experimental analysis that the choice of optimum popula-
tion size for better accuracy depends upon the number of instances in dataset. If data-
set size is larger, then the population size same as the number of instances in dataset is
found to produce better accuracy.
Setting up values for minimum support and confidence depends on the dataset and
their relationship between attributes. Tradeoff between minimum confidence and min-
imum support has to be scored to attain optimum results. Cross over rate affects the
convergence rate of the system mainly and has minimum effect on the accuracy of the
system.

6 Conclusion
Genetic Algorithms have been used to solve difficult optimization problems in a
number of fields and have proved to produce optimum results in mining Association
rules. When Genetic algorithm is used for mining association rules the GA parameters
decides the efficiency of the system. Minimum support, minimum confidence and
population size are the key parameters deciding the accuracy of the system. The set-
ting of the population size is based on the size of the problem under study, whereas
the minimum confidence and minimum support to be set depends upon the problem
under study. The optimum value of crossover rate leads to earlier convergence while
playing minimum role in achieving better accuracy. The setting of optimum value of
the GA parameters varies from data to data and the fitness function plays a major role
in optimizing the results. The size of the dataset and relationship between attributes in
data contributes to the setting up of the parameters. The efficiency of the methodolo-
gy could be further explored on more datasets with varying attribute sizes.

References
1. Cattral, R., Oppacher, F., Deugo, D.: Rule Acquisition with a Genetic Algorithm. In: Pro-
ceedings of the 1999 Congress on Evolutionary Computation, CEC 1999 (1999)
2. Saggar, M., Agrawal, A.K., Lad, A.: Optimization of Association Rule Mining. In: IEEE
International Conference on Systems, Man and Cybernetics, vol. 4, pp. 3725–3729 (2004)
3. Zhou, J., Li, S.-y., Mei, H.-y., Liu, H.-x.: A Method for Finding Implicating Rules Based
on the Genetic Algorithm. In: Third International Conference on Natural Computation,,
vol. 3, pp. 400–405 (2007)
4. Zhang, H. Chen. : Immune Optimization Based Genetic Algorithm for Incremental Asso-
ciation Rules Mining. In : International Conference on Artificial Intelligence and Compu-
tational Intelligence, AICI ’09, Volume: 4, Page(s): 341 – 345, 2009.
5. Gonzales, E., Mabu, S., Taboada, K., Shimada, K., Hirasawa, K.: Mining Multi-class Da-
tasets using Genetic Relation Algorithm for Rule Reduction. In: IEEE Congress on Evolu-
tionary Computation, CEC 2009, pp. 3249–3255 (2009)
6. Shi, X.-J., Lei, H.: Genetic Algorithm-Based Approach for Classification Rule Discovery.
In: International Conference on Information Management, Innovation Management and
Industrial Engineering, ICIII 2008, vol. 1, pp. 175–178 (2008)
7. Ma, H., Li, X.: Application of Data Mining in Preventing Credit Card Fraud. In: Interna-
tional Conference on Management and Service Science, MASS 2009, pp. 1–6 (2009)
648 K. Indira and S. Kanmani

8. Guo, H., Zhou, Y.: An Algorithm for Mining Association Rules Based on Improved Ge-
netic Algorithm and its Application. In: 3rd International Conference on Genetic and Evo-
lutionary Computing, WGEC 2009, pp. 117–120 (2009)
9. Tang, H., Lu, J.: Hybrid Algorithm Combined Genetic Algorithm with Information Entro-
py for Data Mining. In: 2nd IEEE Conference on Industrial Electronics and Applications,
pp. 753–757 (2007)
10. Dou, W., Hu, J., Hirasawa, K., Wu, G.: Quick Response Data Mining Model using Genet-
ic Algorithm. In: SICE Annual Conference, pp. 1214–1219 (2008)

Genetic Algorithm Seminar Report
No ratings yet
Genetic Algorithm Seminar Report
17 pages
ML MCQ 250
100% (1)
ML MCQ 250
44 pages
Association Rule Mining Using Genetic Algorithm: The Role of Estimation Parameters
No ratings yet
Association Rule Mining Using Genetic Algorithm: The Role of Estimation Parameters
10 pages
Ga Parameters
No ratings yet
Ga Parameters
10 pages
Abstract - Genetic Algorithms (GA) Have Emerged As Practical, Robust Optimization and Search
No ratings yet
Abstract - Genetic Algorithms (GA) Have Emerged As Practical, Robust Optimization and Search
10 pages
Mining Association Rules Using Genetic Algorithm: The Role of Estimation Parameters
No ratings yet
Mining Association Rules Using Genetic Algorithm: The Role of Estimation Parameters
9 pages
Rule Acquisition in Data Mining Using Genetic Algorithm
No ratings yet
Rule Acquisition in Data Mining Using Genetic Algorithm
9 pages
Ga Perf Analysis
No ratings yet
Ga Perf Analysis
19 pages
Rule Acquisition in Data Mining Using A Self Adaptive Genetic Algorithm
No ratings yet
Rule Acquisition in Data Mining Using A Self Adaptive Genetic Algorithm
8 pages
DC Meet Second
No ratings yet
DC Meet Second
21 pages
Rule Acquisition in Data Mining Using A Self Adaptive Genetic Algorithm
No ratings yet
Rule Acquisition in Data Mining Using A Self Adaptive Genetic Algorithm
8 pages
Framework For Comparison of Association Rule Mining Using Genetic Algorithm
No ratings yet
Framework For Comparison of Association Rule Mining Using Genetic Algorithm
8 pages
Survey On GA and Rules
No ratings yet
Survey On GA and Rules
15 pages
Name: Survey1 Title: Genetic Algorithm Based On Evolution Strategy and The Alication in Data Mining 2.issue
No ratings yet
Name: Survey1 Title: Genetic Algorithm Based On Evolution Strategy and The Alication in Data Mining 2.issue
24 pages
P3Permutation Encoding TSP
No ratings yet
P3Permutation Encoding TSP
6 pages
Genetic Algorithms For Association Rule Mining: A Comparative Study
No ratings yet
Genetic Algorithms For Association Rule Mining: A Comparative Study
7 pages
A Genetic Algorithm For Discovering Classification Rules in Data Mining
No ratings yet
A Genetic Algorithm For Discovering Classification Rules in Data Mining
13 pages
Mining Fuzzy Association Rules From The Asp Database: Wang Bin Xie Qing-Sheng
No ratings yet
Mining Fuzzy Association Rules From The Asp Database: Wang Bin Xie Qing-Sheng
6 pages
Rule Acquisition in Data Mining Using Self Adaptive Genetic Algorithm
No ratings yet
Rule Acquisition in Data Mining Using Self Adaptive Genetic Algorithm
4 pages
Study of Genetic Algorithm An Evolutionary Approach
No ratings yet
Study of Genetic Algorithm An Evolutionary Approach
4 pages
Genetic Algorithm
No ratings yet
Genetic Algorithm
4 pages
Genetic Algorithms For Association Rule Mining: A Comparative Study
No ratings yet
Genetic Algorithms For Association Rule Mining: A Comparative Study
4 pages
Genetic Algorithm Group 5 Official - 101533 - Quincy Scott
No ratings yet
Genetic Algorithm Group 5 Official - 101533 - Quincy Scott
25 pages
Dcmeet Third
No ratings yet
Dcmeet Third
72 pages
Survey On Mining Ars Using Genetic Algorithm
No ratings yet
Survey On Mining Ars Using Genetic Algorithm
4 pages
Limitation of Apriori Algo
No ratings yet
Limitation of Apriori Algo
7 pages
A Study On Genetic Algorithm and Its Applications: Related Papers
No ratings yet
A Study On Genetic Algorithm and Its Applications: Related Papers
6 pages
GADataMining CNA
No ratings yet
GADataMining CNA
73 pages
Machine Learning Programming
No ratings yet
Machine Learning Programming
4 pages
The Genetic Algorithm (GA) in Relation To Natural Evolution
No ratings yet
The Genetic Algorithm (GA) in Relation To Natural Evolution
8 pages
A Quantum Swarm Evolutionary Algorithm For Mining Association Rules in Large Databases
No ratings yet
A Quantum Swarm Evolutionary Algorithm For Mining Association Rules in Large Databases
6 pages
Genetic Algorithm
No ratings yet
Genetic Algorithm
40 pages
Genetic Algorithms ML
No ratings yet
Genetic Algorithms ML
40 pages
Unit 5,6 Ot Notes
No ratings yet
Unit 5,6 Ot Notes
14 pages
Unit 5 Machine Learning Aktu
No ratings yet
Unit 5 Machine Learning Aktu
7 pages
Unit 2, 4,5,3 SC
No ratings yet
Unit 2, 4,5,3 SC
23 pages
SC Unit 4
No ratings yet
SC Unit 4
23 pages
Decision Tree Classifiers With Ga Based Feature Selection
No ratings yet
Decision Tree Classifiers With Ga Based Feature Selection
10 pages
Expert Systems With Applications: Hamid Reza Qodmanan, Mahdi Nasiri, Behrouz Minaei-Bidgoli
No ratings yet
Expert Systems With Applications: Hamid Reza Qodmanan, Mahdi Nasiri, Behrouz Minaei-Bidgoli
11 pages
Genetic Algorithms
No ratings yet
Genetic Algorithms
35 pages
Role and Working of Genetic Algorithm in Computer Science: Manik Sharma
No ratings yet
Role and Working of Genetic Algorithm in Computer Science: Manik Sharma
6 pages
Non Traditional Optimization Methods: Chapter-4
No ratings yet
Non Traditional Optimization Methods: Chapter-4
24 pages
Data Mining 2
No ratings yet
Data Mining 2
96 pages
Unit-3 Evolutionary Computing & GA
No ratings yet
Unit-3 Evolutionary Computing & GA
39 pages
Genetic Algothim AI
No ratings yet
Genetic Algothim AI
16 pages
Financial Forecasting Using Genetic Algorithms
No ratings yet
Financial Forecasting Using Genetic Algorithms
13 pages
Genetic Algorithm
No ratings yet
Genetic Algorithm
22 pages
09 Chapter2 PDF
No ratings yet
09 Chapter2 PDF
23 pages
Selection Methods For Genetic Algorithms: December 2013
No ratings yet
Selection Methods For Genetic Algorithms: December 2013
13 pages
Fuzzy Genetic Algorithm
No ratings yet
Fuzzy Genetic Algorithm
20 pages
Genetic Algorithm and Data Mining
No ratings yet
Genetic Algorithm and Data Mining
2 pages
Identifying Interesting Association Rules
No ratings yet
Identifying Interesting Association Rules
20 pages
Genetic Algorithm
No ratings yet
Genetic Algorithm
8 pages
Soumen Paul Department of Computer Science and Informatics Haldia Institute of Technology ICARE Complex, HIT Campus, P.O - HIT, PIN 721657 Haldia, West Bengal
No ratings yet
Soumen Paul Department of Computer Science and Informatics Haldia Institute of Technology ICARE Complex, HIT Campus, P.O - HIT, PIN 721657 Haldia, West Bengal
56 pages
4.1 Genetic Algorithms
No ratings yet
4.1 Genetic Algorithms
25 pages
Matecconf Bisstech2016 03002
No ratings yet
Matecconf Bisstech2016 03002
5 pages
1GAWO
No ratings yet
1GAWO
11 pages
Genetic Algorithm Report
No ratings yet
Genetic Algorithm Report
26 pages
Performance Evaluation of A Genetic Algorithm Based Approach To Network Intrusion Detection System
No ratings yet
Performance Evaluation of A Genetic Algorithm Based Approach To Network Intrusion Detection System
17 pages
ML Unit Iv Part Ii
No ratings yet
ML Unit Iv Part Ii
9 pages
Genetic Algorithm: Fundamentals and Applications
From Everand
Genetic Algorithm: Fundamentals and Applications
Fouad Sabry
No ratings yet
2.1.2. SC-Lecture-Unit-II-Ch1
No ratings yet
2.1.2. SC-Lecture-Unit-II-Ch1
17 pages
Digital Design Engine: Based On Vedic Sciences
No ratings yet
Digital Design Engine: Based On Vedic Sciences
8 pages
Automatic Cell Image Segmentation Using Genetic Algorithms
No ratings yet
Automatic Cell Image Segmentation Using Genetic Algorithms
5 pages
Genetic Algorithm For Variable Selection: Jennifer Pittman
No ratings yet
Genetic Algorithm For Variable Selection: Jennifer Pittman
27 pages
A More Realistic Example
No ratings yet
A More Realistic Example
12 pages
Optimization & Relaibility - Machine Design
No ratings yet
Optimization & Relaibility - Machine Design
2 pages
LP IV Lab Zdvzmanual Sem II fbsccAY 2019-20z 20-Convxvzzerted
No ratings yet
LP IV Lab Zdvzmanual Sem II fbsccAY 2019-20z 20-Convxvzzerted
96 pages
CSC 325 AI Lecture05 Local Search Fall2024 29092024 101846am
No ratings yet
CSC 325 AI Lecture05 Local Search Fall2024 29092024 101846am
61 pages
Genetic Algorithms For Object Recognition IN A: Complex Scene
No ratings yet
Genetic Algorithms For Object Recognition IN A: Complex Scene
4 pages
Permanent Magnet Synchronous Motor Dynamic Modeling With Genetic Algorithm Performance Improvement
No ratings yet
Permanent Magnet Synchronous Motor Dynamic Modeling With Genetic Algorithm Performance Improvement
14 pages
BCS 3701 - Artificial Intelligence
No ratings yet
BCS 3701 - Artificial Intelligence
10 pages
10 Ga
No ratings yet
10 Ga
20 pages
CS361 Soft Computing CSE Syllabus-Semesters - 5 PDF
No ratings yet
CS361 Soft Computing CSE Syllabus-Semesters - 5 PDF
3 pages
Structural Optimization of Jacket Platform Based o PDF
No ratings yet
Structural Optimization of Jacket Platform Based o PDF
8 pages
Data Discretization vs. OLAP
No ratings yet
Data Discretization vs. OLAP
33 pages
Application of Genetic Algorithm in Intrusion Detection System
No ratings yet
Application of Genetic Algorithm in Intrusion Detection System
9 pages
Genetic: Introduction To Genetic Algorithms
No ratings yet
Genetic: Introduction To Genetic Algorithms
44 pages
Local Search and Optimization
No ratings yet
Local Search and Optimization
28 pages
Artificial Intelligence-Enabled Deep Learning Model For Multimodal Biometric Fusion
No ratings yet
Artificial Intelligence-Enabled Deep Learning Model For Multimodal Biometric Fusion
24 pages
ICPCES2012 1STLFReview
No ratings yet
ICPCES2012 1STLFReview
11 pages
The Universal Generating Function in Reliability Analysis and Optimization (Springer Series in Reliability Engineering) PDF
No ratings yet
The Universal Generating Function in Reliability Analysis and Optimization (Springer Series in Reliability Engineering) PDF
458 pages
Computers Are From Mars, Organisms Are From Venus: Interrelationship Guide To Biology and Computer Science
No ratings yet
Computers Are From Mars, Organisms Are From Venus: Interrelationship Guide To Biology and Computer Science
8 pages
Engineering Applications of Arti Ficial Intelligence: Kwong C.K., Jiang Huimin, Luo X.G
No ratings yet
Engineering Applications of Arti Ficial Intelligence: Kwong C.K., Jiang Huimin, Luo X.G
12 pages
Algoritma Genetika Untuk Penjadwalan Matakuliah
No ratings yet
Algoritma Genetika Untuk Penjadwalan Matakuliah
14 pages
AI Lecture 11
No ratings yet
AI Lecture 11
9 pages
An Efficient Algorithm For Solving Nonograms: Hui-Lung Lee Ling-Hwei Chen
No ratings yet
An Efficient Algorithm For Solving Nonograms: Hui-Lung Lee Ling-Hwei Chen
14 pages
Fms With Arena
No ratings yet
Fms With Arena
6 pages
Exam Scheduling: A Case Study: February 2017
No ratings yet
Exam Scheduling: A Case Study: February 2017
7 pages
IJE - Volume 33 - Issue 7 - Pages 1257-1265
No ratings yet
IJE - Volume 33 - Issue 7 - Pages 1257-1265
9 pages

Indira 2011

Uploaded by

Indira 2011

Uploaded by

Association Rule Mining Using Genetic Algorithm:

The Role of Estimation Parameters

K. Indira1 and S. Kanmani2

Abstract. Genetic Algorithms (GA) have emerged as practical, robust optimi-

Keywords: Association rules, Genetic Algorithm, Population size, Crossover

2 Association Rules and Genetic Algorithms

2.1 Association Rules

2.2 Genetic Algorithm

A Genetic Algorithm (GA) is a procedure used to find approximate solutions to

A. [Start] Generate random population of n chromosomes.

4 Parameters in Genetic Algorithm

Population refers to the number of chromosomes taken up for optimization. A chro-

During each successive generation, a proportion of the existing population is selected

4.4 Fitness Function

4.5 Crossover Operator

4.6 Mutation Operator

4.7 Number of Generations

The generational process of mining association rules by Genetic algorithm is repeated

A solution is found that satisfies minimum criteria.

Fig. 1. Flow chart of the GA

Table 1. Default GA Parameters

Table 2. Comparison based on variation in population Size

No. of Instances No. of Instances * 1.25 No. of Instances *1.5

Table 3. Comparison based on variation in Minimum Support and Confidence

Minimum Support & Minimum Confidence

Fig. 2. Population Size Vs

The values of the GA parameters

Dataset No. of No. of Minimum Minimum Crossover Accuraccy

You might also like