computational approaches to analyze big root data grown in ......computational approaches to analyze...

32
Computational Approaches to Analyze Big Root Data Grown in the Field Alexander Bucksch Genomes2Fields Workshop @ Phenome 2020 February 24, 2020

Upload: others

Post on 17-Aug-2020

3 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Computational Approaches to Analyze Big Root Data Grown in ......Computational Approaches to Analyze Big Root Data Grown in the Field Alexander Bucksch Genomes2Fields Workshop @ Phenome

Computational Approaches to Analyze Big Root Data Grown in the Field

Alexander Bucksch

Genomes2Fields Workshop @ Phenome 2020

February 24, 2020

Page 2: Computational Approaches to Analyze Big Root Data Grown in ......Computational Approaches to Analyze Big Root Data Grown in the Field Alexander Bucksch Genomes2Fields Workshop @ Phenome

1. Food source

2. Sequester atmospheric carbon and provide oxygen

3. Construction material for shelter

4. Energy

Human life relies on plants

PHYSIOLOGICAL

NEEDS

SAFETY &

SECURITY

LOVE &

BELONGING

SELF-ESTEEM

SELF-

ACTUALIZATION

Page 3: Computational Approaches to Analyze Big Root Data Grown in ......Computational Approaches to Analyze Big Root Data Grown in the Field Alexander Bucksch Genomes2Fields Workshop @ Phenome

= x

= x

Assumption: If an observed root trait variation is linked

to genes, than the trait is possible to breed

Sa

me

co

mm

on

be

an

ge

no

typ

e

dro

ug

ht

we

ll w

ate

red

Basic idea: Roots change their shape

Phenotype = G + GxE +

E

Page 4: Computational Approaches to Analyze Big Root Data Grown in ......Computational Approaches to Analyze Big Root Data Grown in the Field Alexander Bucksch Genomes2Fields Workshop @ Phenome

Compute

on large

data sets

How to link computing, math &

plants?

Unknow

n

phenoty

pic

pattern

Highly

detailed

phenoty

pe

Page 5: Computational Approaches to Analyze Big Root Data Grown in ......Computational Approaches to Analyze Big Root Data Grown in the Field Alexander Bucksch Genomes2Fields Workshop @ Phenome

Suxing Liu

Measuring every root in the maize root

system

Page 6: Computational Approaches to Analyze Big Root Data Grown in ......Computational Approaches to Analyze Big Root Data Grown in the Field Alexander Bucksch Genomes2Fields Workshop @ Phenome

3D root phenotyping pipeline for field grown maize

Page 7: Computational Approaches to Analyze Big Root Data Grown in ......Computational Approaches to Analyze Big Root Data Grown in the Field Alexander Bucksch Genomes2Fields Workshop @ Phenome

Real maize root vs. 3D root model3D root scanner

Recording high-resolution 3D point clouds

Page 8: Computational Approaches to Analyze Big Root Data Grown in ......Computational Approaches to Analyze Big Root Data Grown in the Field Alexander Bucksch Genomes2Fields Workshop @ Phenome

8 replicates per genotype

12 genotypes to validate measurements

Page 9: Computational Approaches to Analyze Big Root Data Grown in ......Computational Approaches to Analyze Big Root Data Grown in the Field Alexander Bucksch Genomes2Fields Workshop @ Phenome

Traits are measured as averages per genotype

Validation of four traits

Page 10: Computational Approaches to Analyze Big Root Data Grown in ......Computational Approaches to Analyze Big Root Data Grown in the Field Alexander Bucksch Genomes2Fields Workshop @ Phenome

Broad sense heritability of all traits

Page 11: Computational Approaches to Analyze Big Root Data Grown in ......Computational Approaches to Analyze Big Root Data Grown in the Field Alexander Bucksch Genomes2Fields Workshop @ Phenome

Whole root descriptor distinguishes genotypes in 3D

Page 12: Computational Approaches to Analyze Big Root Data Grown in ......Computational Approaches to Analyze Big Root Data Grown in the Field Alexander Bucksch Genomes2Fields Workshop @ Phenome

We can measure a lot – but challenges remain

1. Number of crown root is currently not reliably counted2. Crown root angles are too noisy to be useful3. Number of whorls and distance between whorls sometimes unresolvable

Increase point cloud density throughoptimal positioning of cameras.This means to find an approximationfor the art gallery problem (NP-hard)

Page 13: Computational Approaches to Analyze Big Root Data Grown in ......Computational Approaches to Analyze Big Root Data Grown in the Field Alexander Bucksch Genomes2Fields Workshop @ Phenome

Lots of new technology, but what does it help in future?

Limeng Xie

Page 14: Computational Approaches to Analyze Big Root Data Grown in ......Computational Approaches to Analyze Big Root Data Grown in the Field Alexander Bucksch Genomes2Fields Workshop @ Phenome

Can we discover unknown phenotypic pattern in big root data?

Limeng Xie

Page 15: Computational Approaches to Analyze Big Root Data Grown in ......Computational Approaches to Analyze Big Root Data Grown in the Field Alexander Bucksch Genomes2Fields Workshop @ Phenome

Tagging every

plant

with location

500-1000 times one genotype in a grid

3 genotypes (DOR 364 / L88 57 / SEQ 7)

2 environments water stress / non-limiting

How many root architectures?

Page 16: Computational Approaches to Analyze Big Root Data Grown in ......Computational Approaches to Analyze Big Root Data Grown in the Field Alexander Bucksch Genomes2Fields Workshop @ Phenome

What is this mess?

Analyzing D-curves as DS-curves

Fraction of excavation depth

An

gle

to x

-ax

in r

adia

ns

Page 17: Computational Approaches to Analyze Big Root Data Grown in ......Computational Approaches to Analyze Big Root Data Grown in the Field Alexander Bucksch Genomes2Fields Workshop @ Phenome

Shape curves describe the variation

Fraction of excavation depth

An

gle

to x

-ax

in r

adia

ns

Fraction of excavation depth

An

gle

to x

-ax

in r

adia

ns

Any visible differences between

environments?

Page 18: Computational Approaches to Analyze Big Root Data Grown in ......Computational Approaches to Analyze Big Root Data Grown in the Field Alexander Bucksch Genomes2Fields Workshop @ Phenome

Shape curves describe the variation

Fraction of excavation depth

An

gle

to x

-ax

in r

adia

ns

Fraction of excavation depth

An

gle

to x

-ax

in r

adia

ns

1 genotype + 1 environment = many architectures

Page 19: Computational Approaches to Analyze Big Root Data Grown in ......Computational Approaches to Analyze Big Root Data Grown in the Field Alexander Bucksch Genomes2Fields Workshop @ Phenome

An

gle

to x

-ax

in r

adia

ns

Let’s do some math !

How can we group similar curves?

0

1

2

3

4

0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9

Depth

DS_M

ean

Cluster

0

1

2

3

4

5

6

7

8

9

10

11

Fraction of excavation depth

An

gle

to x

-ax

in r

adia

ns

Fraction of excavation depth

Page 20: Computational Approaches to Analyze Big Root Data Grown in ......Computational Approaches to Analyze Big Root Data Grown in the Field Alexander Bucksch Genomes2Fields Workshop @ Phenome

Result: Number of architectures per data

set

How to group similar curves?

K-needlealgorithm

K-means++clustering

DS-curves From DIRT

Normalized knee plot Optimal k

1 10 20

number of clusters

perc

en

tag

e o

f exp

lain

ed

vari

ation

perc

en

tag

e o

f exp

lain

ed

vari

ation

fraction of clusters

1000 times

Page 21: Computational Approaches to Analyze Big Root Data Grown in ......Computational Approaches to Analyze Big Root Data Grown in the Field Alexander Bucksch Genomes2Fields Workshop @ Phenome

What is an outliers for curves?

Modified Epigraph Index(~ variation within the curve)

Mo

dif

ied

Ban

d D

ep

th

(~d

ista

nce

fro

m m

ean

cu

rve)

Method modified from Arribas-Gil, Ana, and Juan Romo.

"Shape outlier detection and visualization for functional data: the

outliergram.” Biostatistics 15.4 (2014): 603-619.

Shape outliers: Curves not following the “obvious” trend of the clusterMagnitude outliers: Curves that peak out of the “typical” bandwidth

DS-

valu

e

Fraction of excavation depth

An

gle

to x

-ax

in r

adia

ns

Fraction of excavation depth

Page 22: Computational Approaches to Analyze Big Root Data Grown in ......Computational Approaches to Analyze Big Root Data Grown in the Field Alexander Bucksch Genomes2Fields Workshop @ Phenome

Stability 0.66 0.69 0.70 0.66 0.67

Stability 0.55 0.58 0.50 0.66 0.62

Phenotypic Spectrum of L88 57 (2015)

Water stress

Non-limiting

(n=828)

(n=439)

Page 23: Computational Approaches to Analyze Big Root Data Grown in ......Computational Approaches to Analyze Big Root Data Grown in the Field Alexander Bucksch Genomes2Fields Workshop @ Phenome

Stability 0.59 0.59 0.59 0.58 0.60

Stability 0.71 0.70 0.60 0.63 0.74

Phenotypic Spectrum of L88 57 (2016)

Water stress

Non-limiting

(n=304)

(n=328)

Page 24: Computational Approaches to Analyze Big Root Data Grown in ......Computational Approaches to Analyze Big Root Data Grown in the Field Alexander Bucksch Genomes2Fields Workshop @ Phenome

Consensus types across years &

environment

Architecture Type 1 Architecture Type 2

Architecture Type 5 (2016)

Architecture Type 4Architecture Type 3

Architecture Type 5 (2015)

Year difference

Page 25: Computational Approaches to Analyze Big Root Data Grown in ......Computational Approaches to Analyze Big Root Data Grown in the Field Alexander Bucksch Genomes2Fields Workshop @ Phenome

No difference in shoot biomass (2016/non-limiting)

No significant biomass difference

between architecture types (ANOVA),

Except 2016ww_5-2016ww_3 p<0.05

Page 26: Computational Approaches to Analyze Big Root Data Grown in ......Computational Approaches to Analyze Big Root Data Grown in the Field Alexander Bucksch Genomes2Fields Workshop @ Phenome

No difference in shoot biomass (2016/water stress)

No significant biomass difference

between architecture types (ANOVA)

Page 27: Computational Approaches to Analyze Big Root Data Grown in ......Computational Approaches to Analyze Big Root Data Grown in the Field Alexander Bucksch Genomes2Fields Workshop @ Phenome

Real time soil water content data in 3D

Volumetric water content regulated with 128 sensors to control 128 sprayers

6 inch deep

15 inch deep

Frac

tio

n o

f vo

lum

etri

c w

ater

co

nte

nt

Frac

tio

n o

f vo

lum

etri

c w

ater

co

nte

nt

Page 28: Computational Approaches to Analyze Big Root Data Grown in ......Computational Approaches to Analyze Big Root Data Grown in the Field Alexander Bucksch Genomes2Fields Workshop @ Phenome

Compute

on large

data sets

How to link computing, math &

plants?

Unknow

n

phenoty

pic

pattern

Highly

detailed

phenoty

pe

Page 29: Computational Approaches to Analyze Big Root Data Grown in ......Computational Approaches to Analyze Big Root Data Grown in the Field Alexander Bucksch Genomes2Fields Workshop @ Phenome

Collaborators on the presented projects:

Funding for the presented projects:

Acknowledgements

Jonathan Lynch Kathleen Brown Paul Heinemann Dana ChoiJames Burridge

Malcom BennettTony PridmoreSasha Moony

Shawn KaepplerNatalia De Leon

Patompong Saengwilai

Andries TemmeJohn BurkeLisa DonovanJohn Miller

Page 30: Computational Approaches to Analyze Big Root Data Grown in ......Computational Approaches to Analyze Big Root Data Grown in the Field Alexander Bucksch Genomes2Fields Workshop @ Phenome

Jonathan Lynch Kathleen Brown Paul Heinemann Dana ChoiJames Burridge

Malcom BennettTony PridmoreSasha Moony

Shawn KaepplerNatalia De Leon

Andries TemmeThursday, 2/275:00 - 5:15 PMPoster 430

Patompong Saengwilai

Collaborators on the presented projects:

Funding for the presented projects:

Acknowledgements

Page 31: Computational Approaches to Analyze Big Root Data Grown in ......Computational Approaches to Analyze Big Root Data Grown in the Field Alexander Bucksch Genomes2Fields Workshop @ Phenome
Page 32: Computational Approaches to Analyze Big Root Data Grown in ......Computational Approaches to Analyze Big Root Data Grown in the Field Alexander Bucksch Genomes2Fields Workshop @ Phenome

Questions?

Natural History Museum, Mae Rim, TH