0% found this document useful (0 votes)

103 views40 pages

Phylogenetic Tree Reconstruction: I519 Introduction To Bioinformatics, 2012

This document discusses phylogenetic tree reconstruction and evolution. It covers key topics such as speciation, the molecular clock hypothesis, phylogenetic trees, and morphological versus molecular analysis. Methods for phylogenetic tree reconstruction include distance-based approaches like UPGMA and NJ that take pairwise distances as input, and character-based approaches that take a multiple sequence alignment as input. Factors that can complicate phylogenetic analysis are also reviewed, such as horizontal gene transfer, gene duplication and loss, long branch attraction, and lack of resolution.

Uploaded by

Junegreg Cual

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

103 views40 pages

Phylogenetic Tree Reconstruction: I519 Introduction To Bioinformatics, 2012

Uploaded by

Junegreg Cual

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

You are on page 1/ 40

I519 Introduction to Bioinformatics, 2012

Phylogenetic Tree Reconstruction

Evolution theory
 Speciation
– Evolution of new organisms is driven by
• Mutations
– The DNA sequence can be changed due to single base changes, deletion/insertion
of DNA segments, etc.
• Selection bias
– Speciation events lead to creation of different species.
– Speciation caused by physical separation into groups where
different genetic variants become dominant
 Any two species share a (possibly distant) common
ancestor
 The molecular clock hypothesis
 Indiana University (Michael Lynch, Jeff Palmer, Matt
Hann et al)
Lynch: The Origins of Genome Complexity
Phylogeny (phylogenetic tree)
 A phylogenetic tree is a graph reflecting the approximate
distances between a set of objects (species, genes,
proteins, families) in a hierarchical fashion

taxon

Aardvark Bison Chimp Dog Elephant

 Leaves – current species; sequences in current species
 Internal nodes - hypothetical common ancestors
 Branches (Edges) length - “time” from one speciation to the next
(branching represents speciation into two new species)
Phylogenetic trees (binary trees)
Rooted tree Rooted tree satisfying “molecular clock”
hypothesis: all leaves at same distance from the root.
root
6 root
8 7 time
7
6
8
3
1 2 5
4 1 2 3 4 5 UMPGA

Unrooted tree: Note: 1-5 are called leaves, or

3
leaf nodes. 6-8 are inferred nodes
4
corresponding to ancestral species or
8
6
molecules. Branches are also called
2 7 5 edges. The edge lengths reflect
evolutionary distances.
1
Morphological vs. molecular
 Classical phylogenetic analysis: morphological
features: presence or absence of fins, number of
legs, lengths of legs, etc.

 Modern biological methods allow to use

molecular features
– Gene sequences
– Protein sequences

 Species tree and gene tree

From sequences to phylogenetic tree
Rat QEPGGLVVPPTDA..
Rabbit QEPGGMVVPPTDA..
Gorilla QEPGGLVVPPTDA..
Cat REPGGLVVPPTEG..

There are many possible types of

sequences to use (e.g. Mitochondrial vs
Nuclear proteins).
How to choose the best tree?
 To decide which tree is best we can use an optimality
criterion.
 Parsimony is one such criterion (the other criteria:
Maximum likelihood, minimum evolution, bayesian)
 It chooses the tree which requires the fewest mutations
to explain the data.
 The Principle of Parsimony is the general scientific
principle that accepts the simplest of two explanations
as preferable.
Principle of Parsimony
Parsimony based methods look for a tree with the
minimum total number of substitutions of symbols
between species and their ancestor in the phylogenetic
tree.
AAA 1 AAA
AAA AGA AAA AAA
1 1 1 1 2
AAA AGA AGA GGA
AAG GGA AAG AAA
Total #substitutions = 3 Total #substitutions = 4

The left tree is preferred over the right tree.

Maximum parsimony
S1 CACCCCTT
S2 AACCCCAT
S3 CACTGCTT
S4 AACTGCTA
(S1,S2),(S3,S4) 20011011 6 
(S1,S3),(S2,S4) 10022011 7
S1 C C S3 S1 C A S2

C A

S2 A mutation=2 mutation=1
A S4 S3 C A S4
(S1,S2), (S3, S4) (S1,S3), (S2, S4)
We can EASILY get different trees
(the “reality check” paper)

 Input sequences
 Multiple alignment programs
 Substitution models
 Phylogenetic tree reconstruction methods
Trees – what might they mean?
Species A
Species B
Species tree
Species C

Species D

Seq A
Gene tree
Seq D
Seq C
Seq B
Lack of resolution

Seq A
Seq D
Seq C

Seq B

Weak support, 40% bootstrap for bipartition (AD)(CB)

(typical >80%)
Long branch attraction (LBA)
the two longest branches join together
Seq A
Seq D
Seq C

Seq B

Strong support, e.g., 100% bootstrap for (AD)(CB)

Horizontal gene transfer
species A
species B
Species tree Gene Transfer
species C
species D

Seq A
Gene tree
Seq D
Seq C

speciation Seq B
gene transfer
Gene duplication & loss
species A
Species tree
species B
species C
Duplication species D
Seq A
Loss
 Seq B
Gene tree  Seq C A
D
Seq D C
Seq B’
B
Seq C’
 Seq D’
Orthologs and paralogs
(important for function annotation)
Seq A
Orthologs:
Seq B sequences diverged after
a speciation event
Seq C
Paralogs:
Seq D sequences diverged after
gene Seq B a duplication event
duplication Xenologs:
Seq C
sequences diverged after
Seq D a horizontal transfer

Duplicated genes may have different functions!!

Phylogeny (phylogenetic tree)
reconstruction: overview
 Tree topology & branch lengths
 Computational challenge
– Huge number of tree topology
3 sequences: 1 (unrooted)
4 sequences: 3
5 sequences: 15
10 sequences: 2,027,025
20 sequences: 221,643,095,476,699,771,875
n sequences (unrooted & rooted) ??
 Most methods are heuristic
 Two types of methods
– Distance based (input: distance matrix; UPGMA & NJ)
– Character based (input: multiple alignment)
Models of evolutionary distance
1. Simplest case: Jukes-Cantor model
-- equal probability of change to any nucleotide

2. Other models take into account transitions vs. transversion frequencies

-- Kimura: different probabilities for transitions, transversions
-- HKY: different probabilities for transitions, transversions &
takes into account genomic nucleotide biases
Transition: R to R
b Y to Y
A G

  Transversion: R to Y
Y to R
C T
b where R = A,G
Y = C,T
Distance based phylogeny
reconstruction
 Phylogeny reconstruction for 3 sequences is
EASY
– There is a single tree topology
– The branch lengths can be calculated as follows:
To compute: branch lengths a, b and c, such that

A
a
c C
Input: DAB, DBC and DAC (pairwise distances) b
B
Output:
Fitch-Margoliash (FM) algorithm
 For phylogeny reconstruction with more than 3 sequences
 For example, given 5 sequences, A, B, C, D and E. The
tree can be reconstructed as follows
– First choose the closest sequence pair, suppose it is D and E
(based on the input pairwise distances; e.g., DDE=10)
– To calculate the branch lengths from D and E to their common
ancestor (denoted as d and e), we combine the remaining
three sequences (A, B and C) and treat them as a single
composite sequence (and define and so on)
-- so again we are dealing with 3 sequences, and we can
easily calculate the branch lengths
– Then merge D and E into a cluster and treat it as a composite
sequence, and update the distance table so that
and so on.
– Repeat the above steps until no more clusters to merge
Distance based method: UPGMA
 UPGMA: Unweighted Pair Group Method with Arithmetic
Mean
 Assume same rate evolution (molecular clock hypothesis)
 The length from root to each leaf is the same (ultra
metric).
 It is similar to Fitch-Margoliash algorithm (merge two most
similar sequences or clusters first); but the calculation of
branch lengths is even simpler.
 For example, for the same example shown above with
five sequences, d=e=5 (d and e are the branch lengths
from sequence D and E to their common ancestor)
Neighbor-joining (NJ)
(Seitou & Nei algorithm)
 Minimum evolution -- the least total branch
length (distance-based)
 Bottom-up clustering method
 Does not assume same rate evolution
 Fast & produce reasonable trees
NJ method
3 3

2 2
X 4 Y X 4

1 1
5 5

NJ looks for two sequences (clusters) to merge that minimizes:

Where

C1 and C2 are most similar to each other, while they are most
dissimilar to the other clusters (far from others)
Comparison of FM, UPGMA and
NJ methods
 All are hierarchical clustering methods
 All define the distance between two clusters as the
average pairwise distance

 FM and NJ do not assume “molecular clock”; UPGMA

does and it uses a simpler way to calculate branch
lengths.
 UPGMA and FM choose and merge the closest sequence
(cluster) pair first, but NJ looks for two sequences
(clusters) that are not only close to each other (as in
UPGMA and FM) but also far apart from the rest
Parsimony based reconstruction

1. A procedure to find the minimum number of

changes needed to explain the data for a given
tree topology, where species are assigned to
leaves. (Small Parsimony Problem)

2. A search through the space of trees. (hard

problem!) (Large Parsimony Problem)
Small Parsimony Problem
 Compute the minimum number of mutations on a
GIVEN tree
 Fitch algorithm
 Sankoff algorithm (subtree; DP)
The Fitch Algorithm
 Pick an arbitrary root to work towards (for unrooted tree)
 Work from the tips of the tree towards the root. Label each node with the
intersection of the states of its child nodes.
 If the intersection is empty label the node with the union and add one to the
cost
TC +1 T
+1
C C

AGC +1 Cost=4 C
CT +1 GC +1 C C
CT= +1 +1 +1
C T G C A T C T G C A T
Calculate Fitch score Internal node labeling
Sankoff Algorithm
 More general than the Fitch algorithm.
 Assumes we have a table of costs cab for all possible
changes between states a and b (A, T, C or G for DNA)
 For each node i in the tree we compute S(i,a) the minimal
cost given that node i is assigned state a.
 In particular we can compute the minimum value over a
for S(root,a) which will be the cost of the tree.
Sankoff algorithm: DP
G C T
inf inf 0 inf inf inf inf 0
A G inf 0 inf inf
T
0 inf inf inf inf inf 0 inf inf inf inf 0
S(i,A) = 
S(i,G)=0 S(i,T) = 0

1 4 1 4 4 1 4 1
Initialization:
S(i,a) = 0 i labeled by a, or
S(i,a) = 
2 5 1 5
5 2 5 1
a{A,T,C,G}
cab A C G T Iteration:
A 0 2 1 2 S(i,a) = minx(S(j,x)+c(a,x))
C 2 0 2 1 + miny(S(k,y)+c(a,y))
G 1 2 0 2 Termination:
5 5 4 4
T 2 1 2 0 minaS(root,a)
Large Parsimony Problem
 The small parsimony problem – to find the score
of a given tree - can be solved in linear time in
the size of the tree.
 The large parsimony problem is to find the tree
with minimum score.
 It is known to be NP-Hard.
Tree search strategies
 Exact search
– possible for small n only
 Branch and Bound
– Use “cleaver” rules to avoid some branches of trees
– up to ~20 (25) taxa

 Local search - Heuristics

– Pick a good starting tree and use moves within a
“neighbourhood” to find a better tree; e.g., nearest-neighbor
interchanges (NNIs)
 Meta-heuristics
– Genetic algorithms
– Simulated annealing
Branch and bound

http://evolution.gs.washington.edu/gs541/2005/lecture25.pdf
Branch and bound

This branch
can be safely
“neglected”!
Nearest neighbor interchange
Probabilistic approaches to
phylogeny
 Rank trees according to their likelihood
P(data|tree), or, posterior probability P(tree|data)
(Bayesian)
 Maximum likelihood methods
 Sampling methods
Calculate likelihood
Felsenstein’s algorithm for likelihood
 Initialization: i=2n-1
 Recursion: Compute P(Li|a) for all a as follows
if i is leaf
P(Li|a)=1 if a is the label of the leaf, otherwise 0
else
Compute P(Lj|a), P(Lk|a) for all a i
P(Li|a)=b,cP(b|a,tj)P(Lj|b)P(c|a,tk)P(Lk|c) a
j k
b c
How can one tell if a tree is
significant
Biological knowledge

Bootstrapping: how dependent is the tree on the dataset

1. Randomly choose n objects from your dataset of n, with replacement
(picking columns from the alignment at random with replacement)
2. Rebuild the tree based on the subset of the data
3. Repeat many (1,000 – 10,000) times
4. How often are the same children joined?
Jackknifing: how dependent is the tree on the dataset
1. Randomly choose k objects from your dataset of n, without replacement
2. Rebuild the tree based on the subset of the data
3. Repeat many (1,000 – 10,000) times
4. How often are the same children joined?
Commonly used phylogeny
packages
 369 phylogeny packages
(http://evolution.gs.washington.edu/phylip/software.html) and 54
free servers (as of Sep 30, 2011)
– Phylip (general package, protdist, NJ, parsimony, maximum
likelihood, etc)
– PAUP (parsimony)
– PAML (maximum likelihood)
– TreePuzzle (quartet based)
– PhyML (maximum likelihood)
– MyBayes
– MEGA (biologist-centric)
What a surprise
 “..because of their intrinsic reliance on summary
statistics, distance-matrix methods are assumed to
be less accurate than likelihood-based
approaches. In this paper [Science, 327:1376–
1379, 2010], pairwise sequence comparisons are
shown to be more powerful than previously
hypothesized. A statistical analysis of certain
distance-based techniques indicates that their data
requirement for large evolutionary trees essentially
matches the conjectured performance of maximum
likelihood methods—challenging the idea that
summary statistics lead to suboptimal analyses. ”
Readings
 Chapter 7 (Revealing Evolutionary History)
 Chapter 8 (Building Phylogenetic Trees)

Maximum Parsimony and Likelihood
No ratings yet
Maximum Parsimony and Likelihood
34 pages
Phylogenetic Analysis1
No ratings yet
Phylogenetic Analysis1
62 pages
Phylogenetic Tree
No ratings yet
Phylogenetic Tree
25 pages
Phylogenetics
No ratings yet
Phylogenetics
49 pages
Slides 9
No ratings yet
Slides 9
62 pages
Computational Methods in Phylogenetic Analysis: Tutorial at CSB 2004 Tandy Warnow
No ratings yet
Computational Methods in Phylogenetic Analysis: Tutorial at CSB 2004 Tandy Warnow
89 pages
Lec8 - Phylogenetic Trees - Till NJ
No ratings yet
Lec8 - Phylogenetic Trees - Till NJ
36 pages
Ultrametricity
No ratings yet
Ultrametricity
35 pages
Intro Phylo Notes
No ratings yet
Intro Phylo Notes
36 pages
Phylogeny
No ratings yet
Phylogeny
22 pages
Recpad2024 - Poster
No ratings yet
Recpad2024 - Poster
1 page
4 Phylogenetics
No ratings yet
4 Phylogenetics
43 pages
Molecular Phylogeny
No ratings yet
Molecular Phylogeny
78 pages
Slides Week03
No ratings yet
Slides Week03
49 pages
Molecular Evolution and Phylogenetics Session.3
No ratings yet
Molecular Evolution and Phylogenetics Session.3
37 pages
Lec 10 Phylogenetics
No ratings yet
Lec 10 Phylogenetics
51 pages
Peter Duerre - Handbook On Clostridia-CRC Press (2004) PDF
100% (2)
Peter Duerre - Handbook On Clostridia-CRC Press (2004) PDF
1,156 pages
Ceng465 Week8
No ratings yet
Ceng465 Week8
40 pages
Phylogenetics
No ratings yet
Phylogenetics
108 pages
Lecture 9 - Phylogenetic Tree
No ratings yet
Lecture 9 - Phylogenetic Tree
16 pages
Computational Biology B. Tech - Bio-Tech (VI Semester)
No ratings yet
Computational Biology B. Tech - Bio-Tech (VI Semester)
40 pages
BTC 506 Phylogenetic Analysis
No ratings yet
BTC 506 Phylogenetic Analysis
58 pages
Phylogenic Tree
No ratings yet
Phylogenic Tree
42 pages
Phylogenetic Trees
No ratings yet
Phylogenetic Trees
11 pages
Intro To Phyl o Genetics
No ratings yet
Intro To Phyl o Genetics
44 pages
Disclaimer
No ratings yet
Disclaimer
36 pages
Phylogenetic Tree
No ratings yet
Phylogenetic Tree
31 pages
Phyl o Genetics
No ratings yet
Phyl o Genetics
58 pages
Phylogenetic Tree Constructions Methods and Programmes - L 11 - 12
No ratings yet
Phylogenetic Tree Constructions Methods and Programmes - L 11 - 12
27 pages
Phylogenetic Tree
No ratings yet
Phylogenetic Tree
25 pages
Final 2
No ratings yet
Final 2
85 pages
College of Agriculture, Rajendranagar, Hyderabad-500030: Professor Jayashankar Telangana State Agricultural University
No ratings yet
College of Agriculture, Rajendranagar, Hyderabad-500030: Professor Jayashankar Telangana State Agricultural University
34 pages
College of Agriculture, Rajendranagar, Hyderabad-500030: Professor Jayashankar Telangana State Agricultural University
No ratings yet
College of Agriculture, Rajendranagar, Hyderabad-500030: Professor Jayashankar Telangana State Agricultural University
34 pages
4 - Phylogenetics
No ratings yet
4 - Phylogenetics
30 pages
Molecular Phylogeny - Introduction
No ratings yet
Molecular Phylogeny - Introduction
12 pages
L13 PhylogenyTrees
No ratings yet
L13 PhylogenyTrees
19 pages
Phylogenetics Basics
No ratings yet
Phylogenetics Basics
28 pages
PHYLOGENY
No ratings yet
PHYLOGENY
17 pages
Phylogenetic Analysis Extra
No ratings yet
Phylogenetic Analysis Extra
13 pages
BDMH Phylogenetic
No ratings yet
BDMH Phylogenetic
32 pages
Phylogenetic Tree Construction
No ratings yet
Phylogenetic Tree Construction
6 pages
Module 2 Unit - 2 EVOLUTIONARY TREES AND PHYLOGENY
No ratings yet
Module 2 Unit - 2 EVOLUTIONARY TREES AND PHYLOGENY
39 pages
Assignment5 BI12-223
No ratings yet
Assignment5 BI12-223
9 pages
Phylogenetic Tree
No ratings yet
Phylogenetic Tree
9 pages
Phylogenetic Analysis
No ratings yet
Phylogenetic Analysis
11 pages
Phylogenetic Tree Construction - Methods
No ratings yet
Phylogenetic Tree Construction - Methods
7 pages
Week 3c - Phylogenetic - Tree - ConstructionMai PDF
No ratings yet
Week 3c - Phylogenetic - Tree - ConstructionMai PDF
19 pages
Bscol 7
No ratings yet
Bscol 7
29 pages
Introduction To Molecular Evolution: Mike Thomas October 3, 2002
No ratings yet
Introduction To Molecular Evolution: Mike Thomas October 3, 2002
32 pages
A Review: Phylogeny Construction Methods: Priyanka Shaktawat, Parvati Bhurani
No ratings yet
A Review: Phylogeny Construction Methods: Priyanka Shaktawat, Parvati Bhurani
4 pages
Phylogenetic Analyses2
No ratings yet
Phylogenetic Analyses2
16 pages
Phylogenetic Tree Bioinformatics
No ratings yet
Phylogenetic Tree Bioinformatics
4 pages
Bioinformatics Session16!17!25102021
No ratings yet
Bioinformatics Session16!17!25102021
39 pages
Class16-Introduction To Molecular Phylogenetics
No ratings yet
Class16-Introduction To Molecular Phylogenetics
14 pages
Phylogeny Review Worksheet 2017
100% (1)
Phylogeny Review Worksheet 2017
6 pages
Phylogenetics PDF by Matti Ullah KHan NIazi
No ratings yet
Phylogenetics PDF by Matti Ullah KHan NIazi
4 pages
Lab 3
No ratings yet
Lab 3
6 pages
Molecular Phylogenetic Analysis: - Humans-flies-Mollusks - Common Phenotype?
No ratings yet
Molecular Phylogenetic Analysis: - Humans-flies-Mollusks - Common Phenotype?
35 pages
Phylogenetic Trees (BIOINFORMATICS)
No ratings yet
Phylogenetic Trees (BIOINFORMATICS)
7 pages
Phylogeny and Systematics: Powerpoint Lectures For
No ratings yet
Phylogeny and Systematics: Powerpoint Lectures For
35 pages
Phylogenetics
100% (1)
Phylogenetics
51 pages
Phylogeny Analysis
No ratings yet
Phylogeny Analysis
49 pages
Cladistics - The Theory and Practice of Parsimony Analysis)
No ratings yet
Cladistics - The Theory and Practice of Parsimony Analysis)
240 pages
Lehtonen (2017) - Splitting Caldesia in Favour of Albidella (Alismataceae)
No ratings yet
Lehtonen (2017) - Splitting Caldesia in Favour of Albidella (Alismataceae)
7 pages
Phylogeny of Clematis
No ratings yet
Phylogeny of Clematis
43 pages
Instant Ebooks Textbook Molecular Evolution A Statistical Approach 1st Edition Ziheng Yang Download All Chapters
No ratings yet
Instant Ebooks Textbook Molecular Evolution A Statistical Approach 1st Edition Ziheng Yang Download All Chapters
71 pages
Pag-Ibig Sa Tinubuang Lupa
No ratings yet
Pag-Ibig Sa Tinubuang Lupa
10 pages
Cosacov Et. Al. 2009 Phylogenetic Relationships Calceolaria
No ratings yet
Cosacov Et. Al. 2009 Phylogenetic Relationships Calceolaria
16 pages
Computational Phylogenetics
No ratings yet
Computational Phylogenetics
18 pages
Ardiyani 2003
No ratings yet
Ardiyani 2003
318 pages
2 Aspergillus Section Circumdati
No ratings yet
2 Aspergillus Section Circumdati
22 pages
Mycoscience - Geastrum Melanocephalum Triplex
No ratings yet
Mycoscience - Geastrum Melanocephalum Triplex
17 pages
de Pinna, M. A New Family of Neotropical Freshwater Fishes From Deep Fossorial Amazonian Habitat, With A Reappraisal of Morphological Characiform Phylogeny (Teleostei Ostariophysi)
No ratings yet
de Pinna, M. A New Family of Neotropical Freshwater Fishes From Deep Fossorial Amazonian Habitat, With A Reappraisal of Morphological Characiform Phylogeny (Teleostei Ostariophysi)
31 pages
Protein Sequence Analysis
No ratings yet
Protein Sequence Analysis
44 pages
Board Games Are Like Plant and Animal Species
No ratings yet
Board Games Are Like Plant and Animal Species
12 pages
Aporosa Biogeography
No ratings yet
Aporosa Biogeography
12 pages
Cbe 513
No ratings yet
Cbe 513
11 pages
Acid Base Chemistry
No ratings yet
Acid Base Chemistry
23 pages
Systematics - BIO 615: History
No ratings yet
Systematics - BIO 615: History
9 pages
Walter Zimmermann and The Growth of Phylogene
No ratings yet
Walter Zimmermann and The Growth of Phylogene
13 pages
Phylogeny of The Bangiophycidae
No ratings yet
Phylogeny of The Bangiophycidae
11 pages
MC Article 56168 en 1
No ratings yet
MC Article 56168 en 1
13 pages
Phylogenetic Position of Neotropical Bursera-Specialist Mistletoes The Evolution of Deciduousness and Succulent Leaves in Psittacanthus (Loranthaceae)
No ratings yet
Phylogenetic Position of Neotropical Bursera-Specialist Mistletoes The Evolution of Deciduousness and Succulent Leaves in Psittacanthus (Loranthaceae)
19 pages
Agnarsson and Miller 2008 Acctran Vs Deltran
No ratings yet
Agnarsson and Miller 2008 Acctran Vs Deltran
7 pages
Narcissus Phylogeny Marques
No ratings yet
Narcissus Phylogeny Marques
23 pages
Aromatic Chemistry Reviewer
No ratings yet
Aromatic Chemistry Reviewer
24 pages
Chap 4
No ratings yet
Chap 4
18 pages
Reviewer
No ratings yet
Reviewer
14 pages
Activation of Bacterial Spores. A Review': I G Rmina
No ratings yet
Activation of Bacterial Spores. A Review': I G Rmina
7 pages
Rafflesia Aurantia Rafflesiaceae A New Species From PDF
No ratings yet
Rafflesia Aurantia Rafflesiaceae A New Species From PDF
12 pages
Spanish Pigeon Breeds
No ratings yet
Spanish Pigeon Breeds
15 pages
Aphyosemion and Fundulopanchax 1999
No ratings yet
Aphyosemion and Fundulopanchax 1999
10 pages
Gene
No ratings yet
Gene
5 pages
Cual, Junegreg A. Systematics Lecture Journal Review Assignment
No ratings yet
Cual, Junegreg A. Systematics Lecture Journal Review Assignment
2 pages
Animal2 PDF
No ratings yet
Animal2 PDF
1 page
Beef Kaldereta Recipe: Ingredients
No ratings yet
Beef Kaldereta Recipe: Ingredients
1 page
Pork Mechado Recipe: Ingredients
No ratings yet
Pork Mechado Recipe: Ingredients
1 page
24th April 2013 Humba Bisaya (Original Pinoy Recipe)
No ratings yet
24th April 2013 Humba Bisaya (Original Pinoy Recipe)
1 page
Beef Kaldereta Recipe: Ingredients
No ratings yet
Beef Kaldereta Recipe: Ingredients
1 page
Research Method: Applying Parsimony To A Problem in Molecular Systematics
No ratings yet
Research Method: Applying Parsimony To A Problem in Molecular Systematics
1 page
Pol Ypedat Es L Eucomust At: Taxa A Di Agnosi S
No ratings yet
Pol Ypedat Es L Eucomust At: Taxa A Di Agnosi S
1 page
Plants 2 PDF
No ratings yet
Plants 2 PDF
1 page
Topical Guidebook For GCE O Level Biology 3 Part 2
From Everand
Topical Guidebook For GCE O Level Biology 3 Part 2
Esther Chen
5/5 (1)

Phylogenetic Tree Reconstruction: I519 Introduction To Bioinformatics, 2012

Uploaded by

Phylogenetic Tree Reconstruction: I519 Introduction To Bioinformatics, 2012

Uploaded by

I519 Introduction to Bioinformatics, 2012

Phylogenetic Tree Reconstruction

Aardvark Bison Chimp Dog Elephant

Unrooted tree: Note: 1-5 are called leaves, or

 Modern biological methods allow to use

 Species tree and gene tree

There are many possible types of

The left tree is preferred over the right tree.

Weak support, 40% bootstrap for bipartition (AD)(CB)

Strong support, e.g., 100% bootstrap for (AD)(CB)

Duplicated genes may have different functions!!

2. Other models take into account transitions vs. transversion frequencies

NJ looks for two sequences (clusters) to merge that minimizes:

 FM and NJ do not assume “molecular clock”; UPGMA

1. A procedure to find the minimum number of

2. A search through the space of trees. (hard

 Local search - Heuristics

Bootstrapping: how dependent is the tree on the dataset

You might also like