t-coffee multiple alignments of orthologous sequences horizontal gene transfer (phylogenetic trees)...

53
T-COFFEE Multiple Alignments of Orthologous Sequences Horizontal Gene Transfer (Phylogenetic Trees) WebLogo

Upload: madlyn-annabel-mcgee

Post on 28-Dec-2015

218 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: T-COFFEE Multiple Alignments of Orthologous Sequences Horizontal Gene Transfer (Phylogenetic Trees) WebLogo

T-COFFEE Multiple Alignmentsof Orthologous Sequences

Horizontal Gene Transfer(Phylogenetic Trees)

WebLogo

Page 2: T-COFFEE Multiple Alignments of Orthologous Sequences Horizontal Gene Transfer (Phylogenetic Trees) WebLogo

Overview

• T-COFFEE– Tree-based Consistency Objective Function for

alignment Evaluation• Focuses on orthologous gene sequences• Used to generate multiple sequence alignments

• WebLogo• Constructed from multiple sequence alignment

• Phylogenetic Trees• Used to determine if your gene is derived from horizontal

gene transfer

Page 3: T-COFFEE Multiple Alignments of Orthologous Sequences Horizontal Gene Transfer (Phylogenetic Trees) WebLogo

“Click”

Enter ortholog sequencesinto query box –

Where do I get these?

Page 4: T-COFFEE Multiple Alignments of Orthologous Sequences Horizontal Gene Transfer (Phylogenetic Trees) WebLogo

RECALL: What are orthologs?

• Homologs– Orthologs

• Genes duplicated via appearance of new species

– Identical function in different organisms

– Paralogs• Genes duplicated within a species

– Perform slightly different tasks in cell» Can develop new capabilities» Can become pseudogene if

functionality lost but sequencesimilarity retained

Insert Figure 8-41 fromMicrobiology – An Evolving Science

© 2009 W.W. Norton & Company, Inc.

Page 5: T-COFFEE Multiple Alignments of Orthologous Sequences Horizontal Gene Transfer (Phylogenetic Trees) WebLogo

Under Homolog Selection, choose “Paralogs/Orthologs”from drop-down menu

Where do I find orthologs?

Scrolldown

Page 6: T-COFFEE Multiple Alignments of Orthologous Sequences Horizontal Gene Transfer (Phylogenetic Trees) WebLogo

Scroll down to table containing list of orthologs

Page 7: T-COFFEE Multiple Alignments of Orthologous Sequences Horizontal Gene Transfer (Phylogenetic Trees) WebLogo

Select the genes by clicking these boxes

The genes are rankedby ascending E-values

Add the top 5 orthologs to Gene Cart

Notice orthologous genes are from

different organisms

Page 8: T-COFFEE Multiple Alignments of Orthologous Sequences Horizontal Gene Transfer (Phylogenetic Trees) WebLogo

“Click”

Scroll down to bottom of page

Page 9: T-COFFEE Multiple Alignments of Orthologous Sequences Horizontal Gene Transfer (Phylogenetic Trees) WebLogo

Only 5 genes were selected, why are6 genes shown in the Gene Cart?

One of the genes shown is your ASSIGNED gene

(the one you are annotating)

Page 10: T-COFFEE Multiple Alignments of Orthologous Sequences Horizontal Gene Transfer (Phylogenetic Trees) WebLogo

Generate amino acid sequences for orthologs in FASTA format

Select “FASTA Amino Acid

format”

Scroll down to “Export Genes”

“Click”

Page 11: T-COFFEE Multiple Alignments of Orthologous Sequences Horizontal Gene Transfer (Phylogenetic Trees) WebLogo

Amino Acid sequences in FASTA format for all6 genes will appear

Scroll down

Page 12: T-COFFEE Multiple Alignments of Orthologous Sequences Horizontal Gene Transfer (Phylogenetic Trees) WebLogo

Your assigned gene is located at the bottom

of this list(inspect Gene OID number)

Copy / paste all 5 ortholog sequencesinto your notebook for this module

EXCLUDE your gene, whichshould already be in your notebook

Scroll down

Page 13: T-COFFEE Multiple Alignments of Orthologous Sequences Horizontal Gene Transfer (Phylogenetic Trees) WebLogo

Recording results in your notebook

The amino acid sequences in FASTA format for the top 5 orthologs

Add heading and box

Page 14: T-COFFEE Multiple Alignments of Orthologous Sequences Horizontal Gene Transfer (Phylogenetic Trees) WebLogo

STEP 1: Copy / paste the amino acid sequence in FASTA format for your assigned gene into the query box for T-COFFEE

Return to T-COFFEE database

Page 15: T-COFFEE Multiple Alignments of Orthologous Sequences Horizontal Gene Transfer (Phylogenetic Trees) WebLogo

STEP 2: Copy / paste the amino acid sequences in FASTA format for the top 5 orthologs into the same query box as your gene

T-COFFEE database entries

Separate individual sequences by a

hard return

“Click”

Page 16: T-COFFEE Multiple Alignments of Orthologous Sequences Horizontal Gene Transfer (Phylogenetic Trees) WebLogo

Wait a few moments . . .

Page 17: T-COFFEE Multiple Alignments of Orthologous Sequences Horizontal Gene Transfer (Phylogenetic Trees) WebLogo

Select “Start JalView” to examine the multiple

sequence alignment of the ortholog sequences

T-COFFEE Results

Page 18: T-COFFEE Multiple Alignments of Orthologous Sequences Horizontal Gene Transfer (Phylogenetic Trees) WebLogo

Reminder:Light Blue = Low FrequencyDark Blue = High Frequency

Alignment inspection using JalView

Select “Percentage Identity”under “Colour” menu

Compare to consensus sequence

Page 19: T-COFFEE Multiple Alignments of Orthologous Sequences Horizontal Gene Transfer (Phylogenetic Trees) WebLogo

Copy / paste this alignment into your lab

notebook

Return to T-COFFEE Results

Page 20: T-COFFEE Multiple Alignments of Orthologous Sequences Horizontal Gene Transfer (Phylogenetic Trees) WebLogo

Identify organism in alignment by Gene OID

Recording results in your notebook

Page 23: T-COFFEE Multiple Alignments of Orthologous Sequences Horizontal Gene Transfer (Phylogenetic Trees) WebLogo

Copy/paste multiple sequence alignment

Scroll down

Page 24: T-COFFEE Multiple Alignments of Orthologous Sequences Horizontal Gene Transfer (Phylogenetic Trees) WebLogo

1- Select “amino acid” as sequence type

2- Select box for multiline logo

“Click”

Page 25: T-COFFEE Multiple Alignments of Orthologous Sequences Horizontal Gene Transfer (Phylogenetic Trees) WebLogo

WebLogo Results

Zoom in

In IE, save picture as .png file for upload to notebook

Page 26: T-COFFEE Multiple Alignments of Orthologous Sequences Horizontal Gene Transfer (Phylogenetic Trees) WebLogo

Recording results in your notebook

Page 27: T-COFFEE Multiple Alignments of Orthologous Sequences Horizontal Gene Transfer (Phylogenetic Trees) WebLogo

WHAT ARE OUR GOALS?

1. Build a phylogenetic tree

2. Determine if assigned genes are derived from horizontal gene transfer

Page 28: T-COFFEE Multiple Alignments of Orthologous Sequences Horizontal Gene Transfer (Phylogenetic Trees) WebLogo

Phylogenetic Phylogenetic tree of tree of BacteriaBacteria

showing showing established & established &

candidate phylacandidate phyla(organismal (organismal

phylogenyphylogeny))

Domain Phylum Class Order Family Genus Species

Insert Figure 1 from Handelsman (2004) Microbiol. Mol. Biol. Rev. 68: 669-685.

Page 29: T-COFFEE Multiple Alignments of Orthologous Sequences Horizontal Gene Transfer (Phylogenetic Trees) WebLogo

Three bacterial phyla closely related to Three bacterial phyla closely related to Planctomycetes by 23S rRNA analysisPlanctomycetes by 23S rRNA analysis

(organismal phylogeny)(organismal phylogeny)

Insert Figure 4A from Pilhofer et al. (2008)Characterization and Evolution of Cell Division and Cell Wall SynthesisGenes in the Bacterial Phyla Verrucomicrobia, Lentisphaerae, Chlamydiae,and Planctomycetes and Phylogenetic Comparison with rRNA Genes.J Bacteriology 190: 3192-3202.

Page 30: T-COFFEE Multiple Alignments of Orthologous Sequences Horizontal Gene Transfer (Phylogenetic Trees) WebLogo

16S rRNA gene supports the16S rRNA gene supports themonophyletic grouping Planctomycetalesmonophyletic grouping Planctomycetales

(organismal phylogeny by rDNA analysis)(organismal phylogeny by rDNA analysis)

http://www.ncbi.nlm.nih.gov/Taxonomy/Browser/wwwtax.cgi?id=126

Closest phylogenetic relativesof P. limnophilus (same family)

Page 31: T-COFFEE Multiple Alignments of Orthologous Sequences Horizontal Gene Transfer (Phylogenetic Trees) WebLogo

How do we build a phylogenetic tree?

include P. limnophilus gene (first module)

include the top 5 orthologs (second module)

include genes from organisms closely related to P. limnophilus (i.e., same family)

include genes from organisms less closely related to P. limnophilus (i.e., from phyla Verrucomicrobia, Chlamydiae, and Lentisphaerae)

include genes from organisms that are distantly related to P. limnophilus

Page 32: T-COFFEE Multiple Alignments of Orthologous Sequences Horizontal Gene Transfer (Phylogenetic Trees) WebLogo

Recall: We want to include genes from organisms more closely related to P. limnophilus AND genes from organisms that are less closely related to P. limnophilus.

So…depending on the organisms in your top 5 orthologs,there are 2 paths you can take:

Select top 5 orthologs

If top 5 are closely relatedto P. limnophilus. . .

If top 5 are less closely related to P. limnophilus. . .

Choose 5-10less closely related

organisms

Choose 5-10more closely related

organisms

PATH 1 PATH 2

Page 33: T-COFFEE Multiple Alignments of Orthologous Sequences Horizontal Gene Transfer (Phylogenetic Trees) WebLogo

http://www.ncbi.nlm.nih.gov/Taxonomy/Browser/wwwtax.cgi?id=126

Building a phylogenetic tree

EXAMPLE: Organisms closely related to P. limnophilus (i.e., same family)

Page 34: T-COFFEE Multiple Alignments of Orthologous Sequences Horizontal Gene Transfer (Phylogenetic Trees) WebLogo

Building a phylogenetic tree EXAMPLE: Organisms less closely related to P. limnophilus (i.e., from phyla Verrucomicrobia, Chlamydiae, and Lentisphaerae)

Insert Figure 4A from Pilhofer et al. (2008)Characterization and Evolution of Cell Division and Cell Wall SynthesisGenes in the Bacterial Phyla Verrucomicrobia, Lentisphaerae, Chlamydiae,and Planctomycetes and Phylogenetic Comparison with rRNA Genes.J Bacteriology 190: 3192-3202.

Page 35: T-COFFEE Multiple Alignments of Orthologous Sequences Horizontal Gene Transfer (Phylogenetic Trees) WebLogo

Inspect your top 5 orthologs:Which path?

Example: PATH #1 – Most are in same family as P. limnophilus, so choose 5-10 sequences from less closely related organisms

Page 36: T-COFFEE Multiple Alignments of Orthologous Sequences Horizontal Gene Transfer (Phylogenetic Trees) WebLogo

Under Homolog Selection, choose “Paralogs/Orthologs”from drop-down menu

Where do I find sequences?

Scrolldown

Page 37: T-COFFEE Multiple Alignments of Orthologous Sequences Horizontal Gene Transfer (Phylogenetic Trees) WebLogo

Scroll through the ortholog list and select some genes fromless closely related as well as some distantly related organisms

Once 5-10 orthologs are selected, add them

to your gene cart

Page 38: T-COFFEE Multiple Alignments of Orthologous Sequences Horizontal Gene Transfer (Phylogenetic Trees) WebLogo

Generate amino acid sequences for orthologs in FASTA format

Select “FASTA Amino Acid

format”

Scroll down to “Export Genes”

“Click”

Page 39: T-COFFEE Multiple Alignments of Orthologous Sequences Horizontal Gene Transfer (Phylogenetic Trees) WebLogo

Amino Acid sequences in FASTA format

Scroll down

Remember: Your assigned gene is at the

bottom of the list

Page 40: T-COFFEE Multiple Alignments of Orthologous Sequences Horizontal Gene Transfer (Phylogenetic Trees) WebLogo

Recording results in your notebook

Create another box in your lab notebook, and copy/paste

ONLY the 5-10 NEW ortholog FASTA sequences

(i.e., exclude those alreadyin first & second module)

Page 41: T-COFFEE Multiple Alignments of Orthologous Sequences Horizontal Gene Transfer (Phylogenetic Trees) WebLogo

Recording results in your notebook

Page 42: T-COFFEE Multiple Alignments of Orthologous Sequences Horizontal Gene Transfer (Phylogenetic Trees) WebLogo

What if your top 5 orthologs are distantly related to P. limnophilus?

Example: PATH #2 – Most arenot in the same phylum as

P. limnophilus, so choose 5-10 sequences from more closely

related organisms

Page 43: T-COFFEE Multiple Alignments of Orthologous Sequences Horizontal Gene Transfer (Phylogenetic Trees) WebLogo

Scroll through the ortholog list and select some genes fromclosely related as well as other distantly related organisms

Once 5-10 orthologs are selected, add them

to your gene cart

Copy / pasteFASTA format

protein sequencesinto notebook

Page 44: T-COFFEE Multiple Alignments of Orthologous Sequences Horizontal Gene Transfer (Phylogenetic Trees) WebLogo

Use Phylogeny.fr site to createa phylogenetic tree

“Click”

Page 45: T-COFFEE Multiple Alignments of Orthologous Sequences Horizontal Gene Transfer (Phylogenetic Trees) WebLogo

Select“A la Carte” from menu

Creating a phylogenetic tree

Page 46: T-COFFEE Multiple Alignments of Orthologous Sequences Horizontal Gene Transfer (Phylogenetic Trees) WebLogo

1- Select “T-Coffee” for multiple alignment

2- Leave other settings as default

Scroll down

Page 47: T-COFFEE Multiple Alignments of Orthologous Sequences Horizontal Gene Transfer (Phylogenetic Trees) WebLogo

“Click”

Scroll down

Page 48: T-COFFEE Multiple Alignments of Orthologous Sequences Horizontal Gene Transfer (Phylogenetic Trees) WebLogo

Copy/paste sequences in query box.

Scroll down & select “submit”

Your P. limnophilus gene

Your top 5 orthologs

5-10 new orthologs

Page 49: T-COFFEE Multiple Alignments of Orthologous Sequences Horizontal Gene Transfer (Phylogenetic Trees) WebLogo

Results of phylogenetic analysis

Download and save as .png for upload to notebook

Page 50: T-COFFEE Multiple Alignments of Orthologous Sequences Horizontal Gene Transfer (Phylogenetic Trees) WebLogo

Recording results in your notebook

How do I interpret the tree results?

Page 51: T-COFFEE Multiple Alignments of Orthologous Sequences Horizontal Gene Transfer (Phylogenetic Trees) WebLogo

Possible scenarios resulting fromPossible scenarios resulting fromconstruction of a phylogenetic treeconstruction of a phylogenetic tree

P. limnophilus

Carboxydothermus

Carboxydothermus

Bacillus

P. limnophilus

Blastopirellula

P. maris

Pirellula

No HGT since P. limnophilus and Blastopirellula are in the same family and are clustered together (i.e., gene phylogeny matches organismal phylogeny).

Possible HGT since P. limnophilus and Carboxydothermus are very distantly related yet clustered together (i.e., gene phylogeny does NOT match organismal phylogeny).

Bacillus

Clostridium

Carboxydothermus

P. limnophilus

P. maris

Blastopirellula

Clustered

Not clustered

Maybe HGT, but unsure because there is also an unresolved or multifurcating branch

Page 52: T-COFFEE Multiple Alignments of Orthologous Sequences Horizontal Gene Transfer (Phylogenetic Trees) WebLogo

Interpreting your phylogenetic tree If your Planctomyces limnophilus gene is clustered with that from an organism in the P. limnophilus family probably not horizontal gene transfer

Planctomyces limnophilusBlastopirellula marinaPlanctomyces marisPedospheara parvula (Ellin 514)

Lentisphaera araneosaVerrucomicrobium spinosumSorangium cellulosumEscherichia coli K12Rhodopirellula balticaThiobacillus denitrificansMoorella thermoaceticaBrucella canisHydrogenivirga sp.Clostridium perfringensGemmata obscuriglobus

If your Planctomyces limnophilus gene is clustered with that from an organism that is NOT in the P. limnophilus family may be horizontal gene transfer

If your Planctomyces limnophilus gene is clustered with more than one organism in the tree (multifurcating branch) unresolved phylogeny

In the example below, is the gene derived by HGT? Why or why not?

Page 53: T-COFFEE Multiple Alignments of Orthologous Sequences Horizontal Gene Transfer (Phylogenetic Trees) WebLogo

In the “Interpretation” box to your lab notebook

Is there evidence of horizontal gene transfer?

What organisms does your gene cluster with? The same family? Or the three more closely related phyla (Verrucomicrobia, Chlamydiae, Lentisphaerae)?

Do the gene and genome GC content differ by more than 5%?

Do the neighborhoods for the top 5 orthologs look similar or different to that of your gene in P. limnophilus?

Recording results in your notebook