[2013.09.27] extracting genomes from metagenomes

Post on 10-May-2015

Extracting genomes from metagenomes

Mads Albertsen

Advanced Bacteriology @ KU

27-09-2013

CENTER FOR MICROBIAL COMMUNITIES

Agenda

Why do we need genomes?

How can we get them?

… and then what?

CENTER FOR MICROBIAL COMMUNITIES | AALBORG UNIVERSITY

Slides:

Who - when, where and why?

Water and wastewater treatment

Diseases and infections

Energy

Local and global challenges

What is a genome?

Genome = Parts list with 3000-5000 items

How do we get the genomes?

Culturing

• Few microorganisms can be easily cultured (<<5%)

• Microorganisms need to be studied in their environment

• What you think you study vs. what you actually study

Single cell genomics

• Only routinely performed in specialized labs

• Very incomplete genomes (mean 40%, range 10-90%)

https://www.bigelow.org/

Metagenomics

What is a genome?

Genome = Parts list of a single species

What is a metagenome?

Metagenome = Parts list of the community

Photo: D. Kunkel; color, E. Latypova

“...functional analysis of the collective genomes of soil microflora, which we term the metagenome of the soil.”

- J. Handelsman et al., 1998

Metagenomics is sexy!

PubMed: metagenom*[Title/Abstract]

Sequencing is cheap

Sequencing costs: http://www.genome.gov/sequencingcosts/

Metagenomics

100++ abundant species (≈3 Mbp each)

DNA extraction → Sequencing → Reads (100-150 bp) → Assembly → Contigs (1000+ bp) → Search against database

• Phylogenetic classification (Who is there?): Bacterium A, Bacterium B, ... Bacterium X

• Functional classification (What can they do?): Gene A, Gene B, ... Gene X

Metagenomics

“If you want to understand the ecosystem you need to understand the individual species in the ecosystem”

Lion + Eagle ≠ Flying Lion

Metagenomics

Why not full genomes?

100++ abundant species (≈3 Mbp each)

DNA extraction → Sequencing → Reads (100-150 bp) → Assembly → Contigs (1000+ bp)

1. Micro-diversity

2. Separation of genomes (Binning)

Extracting genomes

Not 1 strain

Many closely related strains

AAAAAAAAAAAAAA
AAAAAAAAATAAAA
AAAAAAAAACAAAA

Assembly

What you get

AAAAAAAAA
AAAAA
TAAAA
CAAAA
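
The strain example above can be reproduced with a toy sketch. It is not how a real assembler works; it only assumes the strains are already aligned and of equal length, and the helper name `fragment_at_first_variant` is made up for this illustration. The shared region collapses into one piece, and from the first variable position onward each strain yields its own short piece.

```python
def fragment_at_first_variant(strains):
    """Toy model: split aligned strains into one shared prefix and
    the distinct strain-specific remainders after the first variant."""
    length = len(strains[0])
    if any(len(s) != length for s in strains):
        raise ValueError("strains must be aligned and of equal length")
    # First column in which the strains disagree (or the end if none).
    split = next(
        (i for i in range(length) if len({s[i] for s in strains}) > 1),
        length,
    )
    shared = strains[0][:split]
    # Distinct remainders, sorted only to make the output deterministic.
    branches = sorted({s[split:] for s in strains if s[split:]})
    return shared, branches


# The three strains shown on the slide.
strains = ["AAAAAAAAAAAAAA", "AAAAAAAAATAAAA", "AAAAAAAAACAAAA"]
shared, branches = fragment_at_first_variant(strains)
print(shared)    # AAAAAAAAA
print(branches)  # ['AAAAA', 'CAAAA', 'TAAAA']
```

One long sequence becomes four short ones, which is why micro-diversity fragments the assembly.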

Extracting genomes

Short term enrichment

Low micro-diversity

High micro-diversity

Binning

Figure labels: Complex sample; PhD student; “Binning”

Genomic signatures:

• GC / Codon usage

• Tetranucleotide frequency + statistical method

Problems:

• Short pieces of sequence (1-10 kbp)

• Local sequence divergence
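
The genomic signatures named above (GC content and tetranucleotide frequency) can be computed in a few lines. This is a minimal sketch under stated assumptions, not the method used in the talk: the function names are invented for illustration, reverse complements are not merged (many tools do merge them), and plain Euclidean distance stands in for the unspecified "statistical method".

```python
from collections import Counter
from itertools import product

# All 256 possible tetranucleotides in a fixed order (AAAA first).
TETRAMERS = ["".join(p) for p in product("ACGT", repeat=4)]


def gc_content(seq):
    """Fraction of G and C among the unambiguous bases of a sequence."""
    seq = seq.upper()
    acgt = sum(seq.count(base) for base in "ACGT")
    return (seq.count("G") + seq.count("C")) / acgt if acgt else 0.0


def tetranucleotide_frequencies(seq):
    """Relative frequency of each tetranucleotide (sliding window of 4).

    Windows containing ambiguous bases such as N are ignored.
    """
    seq = seq.upper()
    counts = Counter(seq[i:i + 4] for i in range(len(seq) - 3))
    total = sum(counts[t] for t in TETRAMERS)
    return [counts[t] / total if total else 0.0 for t in TETRAMERS]


def euclidean(u, v):
    """Distance between two signature vectors (assumed metric)."""
    return sum((a - b) ** 2 for a, b in zip(u, v)) ** 0.5


# Hypothetical contigs: two GC-rich with a similar pattern, one AT-rich.
contig_a = "GCGCCGGC" * 100
contig_b = "GCGCCGGCGC" * 80
contig_c = "ATTATAAT" * 100

sig_a, sig_b, sig_c = (
    tetranucleotide_frequencies(s) for s in (contig_a, contig_b, contig_c)
)
print(gc_content(contig_a), gc_content(contig_c))
print(euclidean(sig_a, sig_b) < euclidean(sig_a, sig_c))
```

With only 1-10 kbp per contig, each of the 256 frequencies rests on few observations, which is one way to see the "short pieces of sequence" problem listed above.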

Sequence composition-independent binning

[Figure: abundance in Sample 1 and in Sample 2, followed by a plot of Abundance Sample 1 vs. Abundance Sample 2]

1. Reduce micro-diversity

2. Use multiple related samples
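
The idea behind composition-independent binning can be sketched as follows: contigs from the same genome should have similar abundance in Sample 1 and similar abundance in Sample 2, so they cluster together in the abundance plot. The sketch below assumes each contig already has a coverage value per sample and simply selects the contigs inside a rectangular region of that plot. The `Contig` class, the `select_bin` helper and all numbers are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Contig:
    name: str
    length: int          # bp
    cov_sample1: float   # abundance (coverage) in Sample 1
    cov_sample2: float   # abundance (coverage) in Sample 2


def select_bin(contigs, cov1_range, cov2_range):
    """Return contigs whose abundance in both samples falls in the given ranges."""
    lo1, hi1 = cov1_range
    lo2, hi2 = cov2_range
    return [
        c for c in contigs
        if lo1 <= c.cov_sample1 <= hi1 and lo2 <= c.cov_sample2 <= hi2
    ]


# Made-up contigs from two organisms with opposite abundance patterns.
contigs = [
    Contig("c1", 12000, 40.0, 5.0),
    Contig("c2", 8000, 42.5, 4.6),
    Contig("c3", 15000, 6.0, 30.0),
    Contig("c4", 3000, 5.5, 28.0),
    Contig("c5", 9000, 38.0, 5.3),
]

# High in Sample 1, low in Sample 2.
bin_a = select_bin(contigs, cov1_range=(35, 45), cov2_range=(4, 6))
print([c.name for c in bin_a])          # ['c1', 'c2', 'c5']
print(sum(c.length for c in bin_a))     # 29000
```

A single sample gives only one abundance axis, so organisms with similar abundance overlap; a second related sample adds the axis that separates them.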

Binning

• Nitrospira enrichment running for years

• 3 dominant species

• No micro-diversity

H. Daims & C. Dorninger, DOME, University of Vienna

Binning

Short term enrichment

Full-scale EBPR plant

SBR reactor

Days

1. Reduction of (micro)-diversity

2. Two different DNA extraction methods

Albertsen et al., 2013 Nat. Biotech.

Colored using a set of 100 phylogenetic marker genes

TM7-1 (1.6%)

TM7-2 (0.7%)

TM7-3 (0.2%)

TM7-4 (0.06%)

Zoom on target: TM7-2 (0.7%)

PCA on genomic signatures (PC1 vs. PC2): TM7-2

TM7-1 (1.6%)

Candidate phylum TM7

Saccharibacteria

Candidatus Saccharimonas aalborgensis

Albertsen et al., 2013 Nat. Biotech.

Genome validation

• Essential single copy genes

• Assembly inspection

Figure axes: Phyla; Genes (HMM models)

Albertsen et al., 2013 Nat. Biotech.
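
Validation with essential single copy genes can be sketched as a simple count: each essential gene should be found exactly once in a complete, uncontaminated genome bin. The sketch below assumes the genes have already been detected (for example with HMM models) and are given as a list of hits per bin. The function name and the gene names are placeholders, not a real marker set.

```python
from collections import Counter


def validate_bin(marker_hits, essential_genes):
    """Estimate completeness and list duplicated essential genes.

    marker_hits: gene names detected in the bin (repeats allowed)
    essential_genes: set of genes expected once per genome
    """
    counts = Counter(g for g in marker_hits if g in essential_genes)
    completeness = len(counts) / len(essential_genes)
    # A gene seen more than once hints at contamination or mis-binning.
    duplicated = sorted(g for g, n in counts.items() if n > 1)
    return completeness, duplicated


# Placeholder gene names for illustration only.
essential = {"geneA", "geneB", "geneC", "geneD"}
hits = ["geneA", "geneB", "geneB", "geneD", "geneX"]

completeness, duplicated = validate_bin(hits, essential)
print(completeness)  # 0.75
print(duplicated)    # ['geneB']
```

Missing genes suggest an incomplete bin, while duplicates suggest that contigs from more than one organism ended up together.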

Multi-metagenome

http://madsalbertsen.github.io/multi-metagenome/

Short: goo.gl/0ctA3

• Guides

• Workflow scripts

• Example data

• All the code

• Recommendations

Complex samples

...add more samples!

S. M. Karst, AAU

It’s just a potential!

...and a poorly translated description of it.

Understanding ecosystems

DNA: Meta-genomics

mRNA: Meta-transcriptomics

Proteins: Meta-proteomics

Metabolites: Meta-bolomics

Extraction

Data integration

In Situ methods

Community structure

Microbial functions

Microbial needs

Ecology

P-Removal:

N-Removal:

-Removal:

Foaming:

Ethanol production:

Understanding ecosystems

McIlroy and Albertsen et al., 2013. ‘Candidatus Competibacter’-lineage genomes retrieved from metagenomes reveal functional metabolic diversity. ISME J (in press).

• Competibacter has the potential to negatively affect phosphorus removal in wastewater treatment

• 2 genomes obtained from enrichment metagenomes

• Compared to full-scale metagenomes

• Only 1 abundant

• Genomic reconstruction reveals potential for fermentation of glucose

FISH with Competibacter-specific probe

MAR with 3H-labeled glucose

Understanding ecosystems

Genomes enable comprehensive transcriptomics of individual species in complex communities.

(stranded mRNAseq data)

Per H. Nielsen, Simon J. McIlroy, Søren M. Karst, EB group

University of Vienna: C. Dorninger, H. Daims, M. Wagner

University of Queensland: G.W. Tyson, P. Hugenholtz

Questions?

@MadsAlbertsen85

Mads Albertsen

ma@bio.aau.dk