Optimization of A Classical Algorithm For The Alignment of Genomic Sequences With Artificial Bee Colony
Optimization of A Classical Algorithm For The Alignment of Genomic Sequences With Artificial Bee Colony
doi: 10.17265/2159-5275/2022.02.004
D DAVID PUBLISHING
Raul Magdaleno Peñaloza, Andrea Magadan Salazar and Gerardo Reyes Salgado
Department of Computer Science, Centro de Investigacion y Desarrollo Tecnologico, Cuernavaca Morelos 62490, Mexico
Abstract: This article shows genomic alignment methods using the classic “Needleman” and “Smith-Waterman” algorithms, the latter
they were optimized by the ABC (artificial bee colony) algorithm. In the genomic alignment, a goal state is not presented, the experiments
that are carried out show alternative alignments by ABC were proposed. Different types of alignments could exist within the classical
algorithm, based on a horizontal, vertical, diagonal and inverse search mechanism on a match value table. Our ABC-Smith Waterman
algorithm was generated from the genomic sequences written in rows and columns for the search for similarities that will provide
values that ABC uses to process and provide more results of alignments that can be used by scientists for their experiments and research.
DNA undergoes mutations over the years and in one of the sequences as the disappearance of a
through their descendants, more time that passes since residue in another. In an alignment, when there is no
the last common ancestor, the more different the coincidence and in order not to move the whole
sequences will be. sequence, it is left in the closest place as shown in Fig.
2.
3. Sequences Alignment
Sales et al. [6] show that in each position of the
Sequence alignment is a basic tool that allows the alignment there will be two identical characters
extraction of functional, structural and evolutionary (MATCH), different (No MATCH) or one character
information contained in biological sequences. aligned with a gap. What cannot be are two gaps
The main goals of comparing two or more genomic aligned.
sequences are: Two sequences can be aligned in many ways. To
Determine and quantify the degree of similarity determine which is the best alignment, a scoring
between them. system is used that gives each pair of characters a
Determine if there is some kind of relationship different value depending on whether they are the
between them or if the resemblance is simply the same, different or whether there is a gap. The score
result of chance. of an alignment is calculated by adding the score of
Detect the presence of conserved structural and each of the positions helping to determine if the
functional motifs. sequences are really related or if their similarity is
Build phylogenetic trees that reflect their due to chance.
evolutionary relationships. The alignment that gets the highest score is called
It is proposed that in order to find the degree of the optimal alignment, shown in Fig. 3.
similarity in two sequences, the first thing to do is to
4. Needleman-Wunch Algorithm
look for similar characters, which consists of writing
one sequence, on top of the other so that the number Needleman and Wunch [2] introduced an approach
of symbols that coincide in the same position be in 1970 to calculate the optimal global alignment of
maximum [5]. These matches are displayed in Fig. 1. two sequences.
If necessary, gaps can be introduced in the The algorithm [2] is a way to massively reduce the
sequences and considered as the insertion of a residue number of possibilities to consider finding new alignment.
Fig. 12 Fourth result with ABC optimization. Fig. 13 Fifth result with ABC optimization.
The yellow color shows the first alignment made a different path by the alignment of the classical
with the classic Smith-Waterman algorithm. The method in reverse.
green color shows an alignment made in the reverse Figs. 14 and 15 show an example of the different
way. In the section where the colors are born, those alignment results generated with the aforementioned
are the possibilities found by the bees in case of taking algorithms.
Optimization of a Classical Algorithm for the Alignment of Genomic Sequences 63
with Artificial Bee Colony