Genetic Algorithms (GA) : Group 4
Genetic Algorithms (GA) : Group 4
Group 4
What is a Genetic Algorithm?
• It is a randomized search
and optimization technique
How are Genetic Algorithms
used?
How are Genetic Algorithms
used?
• Find genes by looking for consensus sequences
• Estimate which genes or segments are used
given a DNA fragment or RNA primer
• Find similar proteins between species to find
evolutionary links
• Find binding sites between protein sequences
Purpose of Genetic Algorithms
• Simplify sequences
into statistical
measures
• Identify trends
...more specifically
• Experimental data
• Microarrays
• Databases
Microarrays
• To classify test data
▫ knowing which genes you want
• Quickly provide a clinical diagnosis for cancer
• The algorithm finds the minimal set of genes to
test
• Too few genes limits the accuracy,
• Too many genes creates too much noise
Microarrays
• Discriminant analysis:
▫ method of exploring gene-expression microarray
data.
▫ requires gene selection
▫ easier to handle the large data volume,
▫ aids understanding diseases and functions of
certain genes.
Algorithms
• Pattern classification
▫ practical in medical diagnosis
• Neural networks
▫ fine tuning solutions
• Provide a compact gene set
• Understand the functions of particular genes and plan the
diagnosis process.
• Genetic Algorithms provide
▫ more flexibility
▫ but more time consuming
Gene selection methods:
• A good feature selection method:
– low generalization error
– low computational cost
– small number of evaluations
• Feature selection method’s components
• search procedure
• evaluation function
• stopping criterion
• validation procedure
Genetic algorithm (GA)
• Pros:
▫ commonly used
▫ high-quality solution
▫ good time frame
• Cons:
▫ long time to locate the local optimum
▫ may not find the optimum with precision
Feature selection methods
• Marginal filters
– individual feature ranking methods
– the most efficient
– The selected feature subsets are sub-optimal
• Wrapper method
– a classifier is the evaluation criterion
– wrapper and embedded methods are often closely
related to each other.
– guaranteed to discover the optimal feature subset
with respect to the evaluation criterion
Issues
•Accuracy
•Data Management
•Computing Power
Accuracy
• How is a genetic algorithm assessed for accuracy?
• An algorithm's generalization
(ability to serve multiple functions)is one measure
Data Management?