american statistical association october 23 minneapolis presentation part 2

18
Size distribution of m icrotum ors 0 10 20 30 40 50 60 70 80 90 20000 60000 100000 140000 180000 220000 260000 300000 340000 380000 420000 460000 500000 540000 580000 620000 660000 700000 740000 780000 820000 860000 900000 940000 Size lim it(μm 2 ) Num berofm icrotum ors A range of sizes and morphologies observed Microtumors Num berand Average Size ofAggregates and Microtum ors 0 20000 40000 60000 80000 100000 120000 140000 160000 ALL STRUCTURES AGGREGATES NO T MELANIZED MICROTUMORS NO T MELANIZED MICROTUMORS PARTIALLY MELANIZED MICRO TUMO RS STRONGLY MELANIZED P rojection A rea (µm ²) (n=932) (n=513) (n=268) (n=55) (n=96) Ubc9 - dif - dl - - F-actin Microtumo r Small Microtum or 419 Projection >10,000 m 2 Estimated volume: 0.5 mm 3 -1 mm 3 932 513 Fruitfly Tumors:

Upload: double-check-consulting

Post on 05-Dec-2014

313 views

Category:

Documents


0 download

DESCRIPTION

Validation and Actionable Analytical solutions from a small cancer data set.

TRANSCRIPT

Page 1: American Statistical Association October 23 Minneapolis Presentation Part 2

Size distribution of microtumors

0

10

20

30

40

50

60

70

80

90

2000

0

6000

0

1000

00

1400

00

1800

00

2200

00

2600

00

3000

00

3400

00

3800

00

4200

00

4600

00

5000

00

5400

00

5800

00

6200

00

6600

00

7000

00

7400

00

7800

00

8200

00

8600

00

9000

00

9400

00

Size limit (μm2)

Num

ber

of m

icro

tum

ors

A range of sizes and morphologies observed: MicrotumorsNumber and Average Size of Aggregates and Microtumors

0

20000

40000

60000

80000

100000

120000

140000

160000

ALLSTRUCTURES

AGGREGATESNOT MELANIZED

MICROTUMORSNOT MELANIZED

MICROTUMORSPARTIALLYMELANIZED

MICROTUMORSSTRONGLYMELANIZED

Pro

jection A

rea (µm

²) .

(n=932)

(n=513)

(n=268)

(n=55)

(n=96)

Ubc9- dif-dl-

-

F-actin

Microtumor

SmallMicrotumor

419Projection >10,000 m2

Estimated volume: 0.5 mm3 -1 mm3 932

513

1. Fruitfly Tumors:

Page 2: American Statistical Association October 23 Minneapolis Presentation Part 2

Compensatory Response hypotheses: Case 1 Simulation-Excess Frequency of 25K is reduced by 88 in Bc and 176 in lwr(-).

Page 3: American Statistical Association October 23 Minneapolis Presentation Part 2

Compensatory Behavior: Case 2 Simulation

11 25000 -88 10 25000 -176 11 50000 11 10 50000 22 11 75000 11 10 75000 22 11 100000 11 10 100000 22 11 250000 -8 10 250000 -8 11 275000 -8 10 275000 -8 11 300000 -8 10 300000 -8

Page 4: American Statistical Association October 23 Minneapolis Presentation Part 2

Compensatory Behavior: Case 3 Simulation…

11 25000 -88 10 25000 -176 11 50000 11 10 50000 22 11 75000 11 10 75000 22 11 100000 11 10 100000 22 11 225000 -8 10 225000 -16 11 250000 -8 10 250000 -8 11 275000 -8 10 275000 -8 11 300000 -8 10 300000 -8

Page 5: American Statistical Association October 23 Minneapolis Presentation Part 2

Compensatory Behavior: Case4 Simulation…

11 25000 -88 10 25000 -176 11 50000 11 10 50000 22 11 75000 11 10 75000 22 11 100000 11 10 100000 22 11 125000 11 10 125000 22 11 150000 11 10 150000 22 11 175000 11 10 175000 22 11 200000 11 10 200000 22 11 225000 -8 10 225000 -16 11 250000 -8 10 250000 -16 11 275000 -8 10 275000 -16 11 300000 -8 10 300000 -16

Page 6: American Statistical Association October 23 Minneapolis Presentation Part 2

Egger’s Paradigmsmall studies tend to have large effect sizes than

would be expected!Size effect greater than what a sample size of N=24

can sustain…

Egger M, Davey Smith G, Schneider M, Minder C. Bias inmeta-analysis detected by a simple, graphical test. BMJ 1997;

315: 629–634.

Peters JL, Sutton AJ, Jones DR, Abrams KR, Rushton L.Comparison of two methods to detect publication bias in

meta-analysis. JAMA 2006; 295: 676–680.

Page 7: American Statistical Association October 23 Minneapolis Presentation Part 2

Compensatory Response Behavior: Case 4- Uncorrected VS Corrected Reverse Engineering Simulation

Page 8: American Statistical Association October 23 Minneapolis Presentation Part 2

PRESENCE of Compensatory Behavior: Simulations by Numerical Data and Visual Analytics

Data Type Linear R-sq. Quad. R-sq. Cubic R-sq. Original Data 0.3816 0.6781 0.8468 Case 1 0.4389 0.7431 0.8941 Case 2 0.506 0.7885 0.9158 Case 3 0.5103 0.7952 0.914 Case 4 0.5385 0.7724 0.8948

• Gain in Linear R-sq.= 16% =Between Tumor area size competition reduction. • Size asymmetry where largest cell size have increased genetic fitness even at highest

compensatory adjustments (0.5385 VS 0.8948; case 4). • Case 1 is at equilibrium; about 5% competition in both, Inter and Intra-size area limit (Cubic

R-sq.) is lessened from Original data. • Case 2 VS Observed data shows alleviation of 6.9% intra-size competition results into 12.4%

reduction in inter-size competition. • Decrease in competition between inter tumor sizes are not as pronounced later by

compensatory adjustments because intra-tumor area size competition has increased (Case 4) by 2% .

Page 9: American Statistical Association October 23 Minneapolis Presentation Part 2

25K frequency is above threshold as well as increased fitness of by asymmetrical competition by both genotypes?? WRONG!

Ho: Increased size asymmetry frequency allows Bc to maintain increased genetic fitness in absence of fitness challenges or in situ fitness is high (explanation of PLOS publication).

VISUAL ANALYTICS: Meta Modeling: Above threshold frequency of 25K lwr(-) at the expense of decreased fitness of largest tumor cell size (lack of asymmetrical

competition resulting into increased aggregates/ small tumor frequencies competition).

Page 10: American Statistical Association October 23 Minneapolis Presentation Part 2

• Why learn compensatory behavior-no applied value.•The residual plot shows 75K and 100K in Bc and 75K and 100K in lwr (-) is most suppressed. •The same residual plot shows 25K frequency having greatest deviations from expected frequencies. •Just reduce 25 K frequency in Bc and lwr(-) by 90% or greater.•Just increase the frequency of 75K and / or 100K in lwr(-) and Bc for drug discovery. •Make sure to use highest sample size available (N=48). •Factors mentioned here are dynamical properties of Brain Tumor gene expression, not the compensatory response behavior hypotheses.

VALUE of Compensatory Behavior Hypotheses for Drug Discovery

Page 11: American Statistical Association October 23 Minneapolis Presentation Part 2

Candida Ferreira. 2003. www.gene-expression-programming.com/author.asp- equation 3.5: (N = 48);

≈90% decrease in small tumors frequencies.

Page 12: American Statistical Association October 23 Minneapolis Presentation Part 2

Simulated Frequencies: biased R-sq. and too much scatter- no interpretation…

Page 13: American Statistical Association October 23 Minneapolis Presentation Part 2

Corrected (CF) Candida Ferreira. 2003. www.gene-expression-programming.com/author.asp- equation 3.5: Good

news R-sq. shrinkage but bad news R-sq. problem…

Page 14: American Statistical Association October 23 Minneapolis Presentation Part 2

Corrected (CF) Reverse Engineering Algorithm

Page 15: American Statistical Association October 23 Minneapolis Presentation Part 2

LOW BRAIN TUMOR FREQUENCIES DOES NOT MEAN R-SQ. NEEDS TO BE LOW; SIMULATED DATA POINTS NEED TO FIT BETTER TO HAVE AN INTERPRETATION!

VALUE ($): IT IS MUCH EASIER TO SIMULATE FREQUENCIES OF TUMOR CELL SIZES THAT ARE RESPONSIBLE FOR COMPENSATORY BEHAVIOR, BUT THE BOTTLENECK IN DRUG DISCOVERY WILL BE CREATED BY OBSERVED FREQUENCIES OF ALL SUPPRESSED CELL SIZES- A CHALLENGING SIMULATION!

VALIDATION

Page 16: American Statistical Association October 23 Minneapolis Presentation Part 2

Summary Text: 1APPLICATION OF MAXIMUM LIKELIHOOD IS VALID ON LOGNORMAL DATA, FOR EXAMPLE, PARKIN, T.B. 1993: EVALUATION OF STATISTICAL METHODS FOR DETERMINING DIFFERENCES BETWEEN SAMPLES FROM LOGNORMAL POPULATIONS. AGRONOMY JOURNAL 85:747-753 .

CONVERT THE NON-NORMAL DISTRIBUTION DATA INTO NORMAL DISTRIBUTION BY TAKING THE LOG OF IT. THEN YOU CALCULATE THE MEAN, VARIANCE ON LOG-TRANSFORMED DATA . MAXIMUM LIKELIHOOD ESTIMATORS MEAN = EXP(µ + (SIGMA)2/2) VARIANCE = µ2 (EXP (SIGMA)2 -1) CI= EXP (µ ± SIGMA * T (1-Α/2)/SQRT N) WHERE µ AND SIGMA-SQUARE ARE MEAN AND VARIANCE OF LOG-TRANSFORMED SAMPLE DATA. SIGMA = STANDARD DEVIATION OF LOG TRANSFORMED DATA AND T (1-Α/2)= 1.96= P<.05 PROBABILITY LEVEL.

THE CI METHOD HERE BY SOKAL AND RHOLF (1969:PHYLOGENETICS) IS CONSERVATIVE METHOD. IT PRODUCE CI OF SMALL WIDTH. IF THE CI WIDTH IS LONG BY THIS METHOD, LIKE YOU HAVE FOR 25K- 300K RECESSIVE MUTANT, THEN THERE IS REALLY AN EFFECT. THIS WAY YOU KNOW THAT HIGH NUMBERS OF MICRO-TUMORS WOULD BE OBSERVED IN ANY REPLICATED STUDY.

Page 17: American Statistical Association October 23 Minneapolis Presentation Part 2

Summary Text: 2 1.We analyzed a random sample of ≈900 structures from 58 animals in BC and lwr(-) genotypes. We call this population of measure structures microtumors, because the size of the largest structure is calculated to be less than 1 mm3. We analyzed their distribution as follows:

2. We plotted the frequency of occurrence (Y axis) as a function of tumor size (X axis).3. There are two distinct populations 25k-300k and 300k-600k4. In the first population (25-300 k), we observe an exponential reduction in the frequency of occurrence as the tumor size increases. (b) This reduction can be understood using a dynamic simulation model described by the equation y = x1 + x2+x3 + x4 (where = = and x = ). This distribution predicts mitosis is driving the growth of tumors; and once tumors achieve a certain size, they stop growing mitotically because in situ genetic fitness threshold was reached in both Bc and lwr (-).5. Therefore, second population of larger microtumors shows lower frequency (<2 tumors) of occurrence of larger.

Page 18: American Statistical Association October 23 Minneapolis Presentation Part 2

THE END- THANK YOU!!

NAVIN K. SINHAAPPLIED/ ACTIONABLE SOLUTIONS ($):1. BRAIN TUMOR REMISSION FROM CHEMOTHERAPY IS USUALLY SHORT LIVED AND PATIENT DIES IN ACUTE PAIN. THE RESEARCH PRESENTED HERE SHOWS THE MATHEMATICAL ASSOCIATIONS BETWEEN VARIOUS TUMORS; INDICATING GIVEN A TUMOR BEFORE CHEMOTHERAPY, WHAT BRAIN TUMOR TYPE CELLS CAN BE BACK. IF THIS KNOWLEDGE IS USED, INNOVATIVE DRUGS WILL BE PATENTED ($), A BETTER PATIENT MANAGEMENT WILL BE ACHIEVED THROUGH THOSE DRUGS, RESULTING IN COST SAVINGS ($) TO INSURERS AND HUMANE QUALITY OF LIFE FOR PATIENTS.

2. MATHEMATICAL SYSTEMS BIOLOGY APPROACH WILL GENERATE MORE EFFECTIVE MEDICINE AND DATA DRIVEN (INCLUDING A FEW DOMINANT AND SUPPRESSED CANCER CELL TYPE).