phylogenetic trees

54
Phylogenetic trees

Upload: rodney

Post on 18-Jan-2016

21 views

Category:

Documents


0 download

DESCRIPTION

Phylogenetic trees. Trees. Gorilla. Human. Chimp. Human. Chimp. Gorilla. =. Gorilla. Chimp. Human. =. =. Chimp. Human. Gorilla. Terminology. A branch = An edge. The root. Internal nodes. Chicken. Gorilla. Human. Chimp. External node - leaf. Ingroup / Outgroup:. Chicken. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Phylogenetic trees

Phylogenetic trees

Page 2: Phylogenetic trees

Chimp HumanGorillaHuman ChimpGorilla

=

Chimp GorillaHuman

= =

Human GorillaChimp

Trees

Page 3: Phylogenetic trees

A branch =An edge

External node - leaf

Human ChimpChicken Gorilla

The root

Internal nodes

Terminology

Page 4: Phylogenetic trees

Human ChimpChicken Gorilla

INGROUPOUTGROUP

Ingroup / Outgroup:

Page 5: Phylogenetic trees

The maximum parsimony principle.

(The shortest path)

Modified from Inferring Phylogenies (Book),Author: Prof. Joe Felsenstein

Page 6: Phylogenetic trees

Genes: 0 = absence, 1 = presence

speciesg1g2g3g4g5g6

s1100110

s2001000

s3110000

s4110111

s5001110

Page 7: Phylogenetic trees

s1 s4 s3 s2 s5

Evaluate this tree…

Page 8: Phylogenetic trees

s1 s4 s3 s2 s5

1

Page 9: Phylogenetic trees

s1 s4 s3 s2 s5

01

Page 10: Phylogenetic trees

s1 s4 s3 s2 s5

11 0

Page 11: Phylogenetic trees

s1 s4 s3 s2 s5

1 1 1 0 0

Gene number 1

Page 12: Phylogenetic trees

s1 s4 s3 s2 s5

Gene number 1.

The most parsimonious ancestral character states

1 1 1 0 0

10

1

Page 13: Phylogenetic trees

s1 s4 s3 s2 s5

Gene number 1, Option number 1.

1 1 1 0 0

1

0

1

1

Page 14: Phylogenetic trees

s1 s4 s3 s2 s5

Gene number 1, Option number 2.

Minimal number of changes for gene 1 (character 1) = 1

1 1 1 0 0

1

0

0

1

Page 15: Phylogenetic trees

s1 s4 s3 s2 s5

0 0

Gene number 2,

Page 16: Phylogenetic trees

s1 s4 s3 s2 s5

Gene number 2, Option number 1.

0 1 1 0 0

1

0

0

1

Page 17: Phylogenetic trees

s1 s4 s3 s2 s5

Gene number 2, Option number 2.

0 1 1 0 0

1

0

1

1

Gene number 2, Option number 2.

Page 18: Phylogenetic trees

s1 s4 s3 s2 s5

0 1 1 0 0

0

0

0

0Number of changes for gene 2 (character 2) = 2

Gene number 2, Option number 3.

Page 19: Phylogenetic trees

Sum of changes = 9

Genes: 0 = absence, 1 = presence

speciesg1g2g3g4g5g6

s1100110

s2001000

s3110000

s4110111

s5001110

Total number of changes

given the tree

121221

Page 20: Phylogenetic trees

Can we do better?

Sum of changes = 9

Page 21: Phylogenetic trees

YES WE CAN!

Sum of changes = 8

Sum of changes = 9

The MP (most parsimonious) tree:

Page 22: Phylogenetic trees

s1 s4 s3 s2 s5

The MP (most parsimonious) tree:

Sum of changes for this tree topology = 8

Page 23: Phylogenetic trees

Intermediate Summary

MP tree = one for which minimal number of changes are needed to explain the data

We can now search for the best tree under the MP criterion

Page 24: Phylogenetic trees

Challenges

Evaluating big tree “by hand” can be problematic. We want the computer to do it.

Going over all the trees? How many trees are there?

Can we generalize to nucleotides? To amino acids?

Is the parsimony criterion ideal?

Page 26: Phylogenetic trees

Positions :

speciesp1p2p3p4p5p6

s1AAGTAA

s2CAAAAC

s3CAGGAA

s4AAATAC

s5GCGCCA

s1 AAGTAA

s2 CAAAAC

s3 CAGGAA

s4 AAATAC

s5 GCGCCA

Page 27: Phylogenetic trees

s1 s4 s3 s2 s5

G

Position number 1

A A C C

Page 28: Phylogenetic trees

s1 s4 s3 s2 s5

G

Position number 1

A

A

C CA

C

C

C Number of changes for position 1 = 2

Page 29: Phylogenetic trees

GACA GGGACAAG GCGAGAAA

Human ChimpChicken GorillaDuck

Find the MP score of the tree for these sequences

Exercise

Page 30: Phylogenetic trees

How to efficiently compute the MP score of a tree

Page 31: Phylogenetic trees

A GC CA

Human ChimpChicken GorillaDuck

{A,G}

{A,C,G}

{A,C}

{A,C}

Postorder tree scan. In each node, if the intersection between the leaves is empty: we apply a union operator. Otherwise, an intersection.

The Fitch algorithm (1971):

Page 32: Phylogenetic trees

A GC CA

Human ChimpChicken GorillaDuck

{A,G}

{A,C,G}

{A,C}

{A,C}

Total number of changes = number of union operators.

Page 35: Phylogenetic trees

Positions :

speciesp1p2p3p4p5p6

HumanAAGTAA

ChimpAATTAC

GorillaACATAA

A A A A A AA A A

C H G G C HH C G

Total number of changes = 0

For all 3 possible tree topologies

Page 36: Phylogenetic trees

Positions :

speciesp1p2p3p4p5p6

HumanAAGTAA

ChimpAATTAC

GorillaACATAA

A A C C A AA A C

C H G G C HH C G

Total number of changes = 1

For all 3 possible tree topologies

Page 37: Phylogenetic trees

Positions :

speciesp1p2p3p4p5p6

HumanAAGTAA

ChimpAATTAC

GorillaACATAA

T G A A T GG T A

C H G G C HH C G

Total number of changes = 2

For all 3 possible tree topologies

Page 38: Phylogenetic trees

Positions :

speciesp1p2p3p4p5p6

HumanAAGTAA

ChimpAATTAC

GorillaACATAA

C H G G C HH C G

Total number of changes is always the same

for all 3 possible tree topologies

Page 40: Phylogenetic trees

G O HC H C GOO C HG

G H CO H O CGO H GC

G C OH H O GCO C GH O C GH

O H GC

O C HG

C H GO

C O HG

C O GH

Page 41: Phylogenetic trees

G O HC H C GO

O C GH O C GH

C O HG

Page 42: Phylogenetic trees

G O HCH C GO

O C GH

C

C GH

C O HG

1

5

4 3

2

O

OG

H

Page 43: Phylogenetic trees

The position of the root does not affect the MP score.

Conclusion

Page 44: Phylogenetic trees

Chimp

Orangutan

Gorilla

Human

C

GC A

G

G

G

G

G

G

A

G

After “bending” the trees, the association of changes and branches does not change!

Rooting does not change MP score

G

Page 45: Phylogenetic trees

Chimp

Orangutan

Gorilla

Human

C

GC C

G

G

G

C

C

G

C

G

C

After “bending” the trees, the association of changes and branches does not change!

Rooting does not change MP score

Page 49: Phylogenetic trees

Human

Chimp

Chicken

Gorilla

Human

Gorilla

Chimp

Chicken

Human

Chicken

Chimp

Gorilla

With 4 taxa, there are 3 difference unrooted trees.

Page 50: Phylogenetic trees

Human

Chimp

Chicken

Gorilla

Human

Gorilla

Chimp

Chicken

Human

Chicken

Chimp

Gorilla

One tree gets a better score (less changes) than the other trees.

Page 51: Phylogenetic trees

Human

Chimp

Chicken

Gorilla

We then use an external knowledge, that chicken is the outgroup and get a rooted tree

Page 52: Phylogenetic trees

C

X

Y

H

X

O

CHY O

Can you root the unrooted tree to obtain the tree below?

Exercise

Page 53: Phylogenetic trees

How many rooted trees result from an unrooted tree with n taxa?

Exercise

Page 54: Phylogenetic trees

Assume you have three sequences and the MP score of the unrooted tree is X. You now add another sequence. Can the score of the 4-taxa tree be lower than that of the 3 taxa tree?

Exercise