Comparing DNA Sequences To Understand Evolutionary Relationships With Blast
Comparing DNA Sequences To Understand Evolutionary Relationships With Blast
How can bioinformatics be used as a tool to determine evolutionary relationships and to better understand
genetic diseases?
■■BACKGROUND
Between 1990–2003, scientists working on an international research project known as the Human Genome Project
were able to identify and map the 20,000–25,000 genes that define a human being. The project also successfully
mapped the genomes of other species, including the fruit fly, mouse, and Escherichia coli. The location and
complete sequence of the genes in each of these species are available for anyone in the world to access via the
Internet.
Why is this information important? Being able to identify the precise location and sequence of human genes will
allow us to better understand genetic diseases. In addition, learning about the sequence of genes in other species
helps us understand evolutionary relationships among organisms. Many of our genes are identical or similar to
those found in other species.
Suppose you identify a single gene that is responsible for a particular disease in fruit flies. Is that same gene found in
humans? Does it cause a similar disease? It would take you nearly 10 years to read through the entire human
genome to try to locate the same sequence of bases as that in fruit flies. This definitely isn’t practical, so a
sophisticated technological method is needed.
Note that the cladogram is treelike, with the endpoints of each branch representing a specific species. The closer
two species are located to each other, the more recently they share a common ancestor. For example,
Selaginella(spikemoss) and Isoetes(quillwort) share a more recent common ancestor than the common ancestor
that is shared by all three organisms.
■■Learning Objectives
To create cladograms that depict evolutionary relationships
To analyze biological data with a sophisticated bioinformatics online tool
To use cladograms and bioinformatics tools to ask other questions of your own and to test your ability to
apply concepts you know relating to genetics and evolution
Table 2. Percentage Similarity Between the GAPDH Gene and Protein in Humans and
Other Species
Gene Percentage Protein Percentage
Species
Similarity Similarity
Chimpanzee (Pan troglodytes) 99.6% 100%
Dog (Canis lupus familiaris) 91.3% 95.2%
Fruit fly (Drosophila melanogaster) 72.4% 76.7%
Roundworm (Caenorhabditiselegans) 68.2% 74.3%
a. Why is the percentage similarity in the gene always lower than the percentage similarity in the protein for each of
the species? (Hint: Recall how a gene is expressed to produce a protein.)
b. Draw a cladogram depicting the evolutionary relationships among all five species (including humans) according
to their percentage similarity in the GAPDH gene.