a statistic summarizing: relative timing in trees

10
A statistic summarizing: Relative timing in trees Tanja Gernhard joint work with Erick Matsen, Daniel Ford

Upload: lieu

Post on 01-Feb-2016

29 views

Category:

Documents


0 download

DESCRIPTION

A statistic summarizing: Relative timing in trees. Tanja Gernhard joint work with Erick Matsen, Daniel Ford. Tree statistics. Investigate the properties of reconstructed trees Colless [Colless, 1982] summarizes the information of the shape of the tree (i.e. balanced vs. unbalanced) - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: A statistic summarizing: Relative timing in trees

A statistic summarizing:

Relative timing in trees

             

Tanja Gernhard

joint work with Erick Matsen, Daniel Ford

Page 2: A statistic summarizing: Relative timing in trees

Tree statistics

• Investigate the properties of reconstructed trees

• Colless [Colless, 1982] summarizes the information of the shape of the tree (i.e. balanced vs. unbalanced)

• Gamma [Pybus, Harvey 2000] summarizes the time between speciation events (graphical representation in LTT plots)

Page 3: A statistic summarizing: Relative timing in trees

Relative Timing

Neutrality: Each ranking equally likely

Page 4: A statistic summarizing: Relative timing in trees

How to summarize rank information?

Each ranking equally likely equivalent to

For each vertex: Each shuffle equally likely

shuffles on a vertex

RRRRLL RRLRRL

Page 5: A statistic summarizing: Relative timing in trees

Runs summarize shuffle

• Runs: Number of sequences in shuffle with same letter

• Consider (2,4)-shuffle RRLRRL: 4 runs• Run distribuition of (m,n)-shuffle

well-known from statistics:

P 5i=0 xi

PfXm;n = 2k+1g=

¡m¡ 1

k

¢¡n¡ 1k¡ 1

¢+

¡m¡ 1k¡ 1

¢¡n¡ 1

k

¢

¡m+n

m

¢

PfXm;n = 2kg=2¡m¡ 1k¡ 1

¢¡n¡ 1k¡ 1

¢

¡m+n

m

¢

Page 6: A statistic summarizing: Relative timing in trees

Combining all shuffles in tree

• Runs of a tree, R(T): Sum of runs of all shuffles in tree

• Distribution for runs in tree T:

PfR(T) = kg=kX

i=1

PfXm;n = igk¡ iX

j =0

PfR(L) = jgPfR(R) = k¡ i ¡ jg

Page 7: A statistic summarizing: Relative timing in trees

Application to Ant lineages

• We investigated the tree of Ants [Moreau et al, 2007]

• LTT plot show substantial increase during Late Cretacceous to Early Eocene

Page 8: A statistic summarizing: Relative timing in trees

10

0500406aRA0385RA0278RA02940501448aRA03761 2 run0.5 shuffle0.52 3 run0.5 shuffle0.53 4 run0.5 shuffle0.5CS0458RA02294 2 run0.5 shuffle0.5

5 6 run0.75 shuffle0.75

RA0283CS03306 2 run0.5 shuffle0.57 8 run0.8333 shuffle0.6666

0500172aCS0278RA02808 2 run0.5 shuffle0.59 3 run0.5 shuffle0.5RA0237CS0243RA039010 2 run0.5 shuffle0.511 3 run0.5 shuffle0.5

12 6 run0.5 shuffle0.5

RA0395RA0340RA040213 2 run0.5 shuffle0.514 3 run0.5 shuffle0.515 9 run0.5317 shuffle0.5238

RA0240RA0230RA036616 2 run0.5 shuffle0.517 3 run0.5 shuffle0.5RA02650500523a18 2 run0.5 shuffle0.5

19 5 run0.3333 shuffle0.3333RA0330RA024220 2 run0.5 shuffle0.5

21 7 run0.5333 shuffle0.7

RA0256RA0277RA0296RA038222 2 run0.5 shuffle0.523 3 run0.5 shuffle0.524 4 run0.5 shuffle0.525 11 run0.9206 shuffle0.9404

CS0249RA0392RA0325RA021126 2 run0.5 shuffle0.527 3 run0.5 shuffle0.528 4 run0.5 shuffle0.529 15 run0.9736 shuffle0.8531

30 24 run0.9248 shuffle0.6417

RA0219RA0341RA034631 2 run0.5 shuffle0.532 3 run0.5 shuffle0.5CS0294RA034833 2 run0.5 shuffle0.5

34 5 run0.3333 shuffle0.3333RA0254CS0258RA040535 2 run0.5 shuffle0.536 3 run0.5 shuffle0.5

37 8 run0.4888 shuffle0.6

38 32 run0.9816 shuffle0.9581

39 40 run0.9867 shuffle0.5977

RA0266CS0501RA0351RA027140 2 run0.5 shuffle0.541 3 run0.5 shuffle0.542 4 run0.5 shuffle0.5CS0384CS0308CS0319RA024743 2 run0.5 shuffle0.544 3 run0.5 shuffle0.545 4 run0.5 shuffle0.5

46 8 run0.2 shuffle0.2

RA0248CS038747 2 run0.5 shuffle0.5CS0161CS041648 2 run0.5 shuffle0.549 4 run0.5 shuffle0.5

50 12 run0.2416 shuffle0.4333

51 52 run0.9841 shuffle0.8770

RA0264RA0387RA038852 2 run0.5 shuffle0.5RA0239CS0271a53 2 run0.5 shuffle0.554 4 run0.5 shuffle0.5

55 5 run0.5 shuffle0.5

RA0371RA0276RA0322RA0290CS026056 2 run0.5 shuffle0.557 3 run0.5 shuffle0.558 4 run0.5 shuffle0.559 5 run0.5 shuffle0.5

RA0292RA0270RA021760 2 run0.5 shuffle0.561 3 run0.5 shuffle0.562 8 run0.6 shuffle0.6

63 13 run0.8954 shuffle0.8939

RA0374RA036964 2 run0.5 shuffle0.5RA0372RA030365 2 run0.5 shuffle0.566 4 run0.5 shuffle0.5

67 17 run0.7443 shuffle0.2626

CS0297RA0306RA027268 2 run0.5 shuffle0.569 3 run0.5 shuffle0.5RA0223RA0233CS026870 2 run0.5 shuffle0.571 3 run0.5 shuffle0.5

72 6 run0.8333 shuffle0.8333

73 23 run0.8445 shuffle0.5729

CS0424CS010874 2 run0.5 shuffle0.5

75 25 run0.8509 shuffle0.5434

76 77 run0.9863 shuffle0.6365

RA0310CS047677 2 run0.5 shuffle0.5CS04480501441a78 2 run0.5 shuffle0.579 4 run0.5 shuffle0.5RA0384RA036280 2 run0.5 shuffle0.5

81 6 run0.75 shuffle0.75

82 83 run0.9797 shuffle0.1353

RA0258RA0342CS039083 2 run0.5 shuffle0.5RA0308RA0245RA0327RA030584 2 run0.5 shuffle0.5RA0312RA0311RA0307RA030985 2 run0.5 shuffle0.586 3 run0.5 shuffle0.587 4 run0.5 shuffle0.588 6 run0.75 shuffle0.75

89 7 run0.75 shuffle0.5

CS0355RA0226CS026290 2 run0.5 shuffle0.591 3 run0.5 shuffle0.5

92 10 run0.6428 shuffle0.464293 11 run0.6428 shuffle0.5

94 13 run0.6915 shuffle0.5909

RA0286RA0249CS021495 2 run0.5 shuffle0.596 3 run0.5 shuffle0.5

97 16 run0.7959 shuffle0.6978

98 17 run0.7959 shuffle0.5

99 100 run0.9868 shuffle0.6548

RA0301CS0280CS0241100 2 run0.5 shuffle0.5101 3 run0.5 shuffle0.5

102 103 run0.9871 shuffle0.5294

RA0267RA0328103 2 run0.5 shuffle0.5RA0363RA0364104 2 run0.5 shuffle0.5105 4 run0.5 shuffle0.5CS0532aCS0320106 2 run0.5 shuffle0.5CS0468CS0272107 2 run0.5 shuffle0.5108 4 run0.5 shuffle0.5

109 8 run0.8 shuffle0.8

RA0403RA0359110 2 run0.5 shuffle0.5RA0241RA0397RA0398111 2 run0.5 shuffle0.5112 3 run0.5 shuffle0.5113 5 run0.8333 shuffle0.8333

114 13 run0.6139 shuffle0.2378

115 116 run0.9574 shuffle0.0607

RA0250RA0263CS0274RA0297RA0352RA0243116 2 run0.5 shuffle0.5117 3 run0.5 shuffle0.5118 4 run0.5 shuffle0.5RA0350RA0261CS0421119 2 run0.5 shuffle0.5120 3 run0.5 shuffle0.5RA0108RA0246CS0233121 2 run0.5 shuffle0.5122 3 run0.5 shuffle0.5

123 6 run0.5 shuffle0.5124 10 run0.7857 shuffle0.8214

125 11 run0.7857 shuffle0.5126 12 run0.7857 shuffle0.5

RA0399CS0328127 2 run0.5 shuffle0.5

128 14 run0.8095 shuffle0.5833

RA01200501443a0500456a0500145a129 2 run0.5 shuffle0.5130 3 run0.5 shuffle0.5131 4 run0.5 shuffle0.5RA02100500058a132 2 run0.5 shuffle0.50500384a0500097a133 2 run0.5 shuffle0.5134 4 run0.5 shuffle0.5

135 8 run0.8 shuffle0.8

0500378a0500366a0500379a136 2 run0.5 shuffle0.5137 3 run0.5 shuffle0.5138 11 run0.4583 shuffle0.1527

139 25 run0.7366 shuffle0.6150

140 141 run0.9215 shuffle0.2059

RA0401RA0400141 2 run0.5 shuffle0.5

142 143 run0.9217 shuffle0.5070

0501449aRA03860500407a143 2 run0.5 shuffle0.5144 3 run0.5 shuffle0.5

145 146 run0.9052 shuffle0.0276

146 147 run0.9052 shuffle0.5

147 148 run0.9052 shuffle0.5

Page 9: A statistic summarizing: Relative timing in trees

Some comments

• Testing the posterior of Bayesian methods for neutrality (phylogenies or populations)

• Population size might change arbitrary

• All programs available from my homepage in the Python package Casspython Stats.dat YourTrees.tre

Page 10: A statistic summarizing: Relative timing in trees

Dankeschön