anis karimpour-fard 1, corrella detweiler 2, ryan t. gill 3, and lawrence hunter 1 1 university of...

6
Anis Karimpour-Fard 1 , Corrella Detweiler 2 , Ryan T. Gill 3 , and Lawrence Hunter 1 1 University of Colorado School of Medicine 2 MCD-Biology, University of Colorado, Boulder 3 Department of Chemical and Biological Engineering, University of Colorado, Boulder [email protected] http://compbio.uchsc.edu/Hunte A new method for using co- conservation to predict protein function

Upload: blanche-flynn

Post on 20-Jan-2016

212 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Anis Karimpour-Fard 1, Corrella Detweiler 2, Ryan T. Gill 3, and Lawrence Hunter 1 1 University of Colorado School of Medicine 2 MCD-Biology, University

Anis Karimpour-Fard1, Corrella Detweiler2, Ryan T. Gill3, and Lawrence Hunter1

1University of Colorado School of Medicine2MCD-Biology, University of Colorado, Boulder3Department of Chemical and Biological Engineering, University of Colorado, Boulder [email protected]

http://compbio.uchsc.edu/Hunter

A new method for using co-conservation to predict protein

function

Page 2: Anis Karimpour-Fard 1, Corrella Detweiler 2, Ryan T. Gill 3, and Lawrence Hunter 1 1 University of Colorado School of Medicine 2 MCD-Biology, University

The problem ……Close to 400 Microbial genomes are fully sequence and there is high percent of genes with unknown function

The meaning of protein function

Eisenberg, D. et. al. Nature 2000

S PA

Biochemical view

The function of protein A is its action on Substrate to form a Product

The function of A is the context of its interactions with other proteins in the cell

Post genomic view

A

B

YZ

MDN

32% 30%

45% 30%

32%

40%

0%

10%

20%

30%

40%

50%

organism

unknown function

Bacillus subtilisE.Coli

k12

P. aeruginosa

Burkholderia pseudomallei K96243

Bacillus clausii KSM-K16

X C

Chlamydia trachomatis D/UW-3/CX

Page 3: Anis Karimpour-Fard 1, Corrella Detweiler 2, Ryan T. Gill 3, and Lawrence Hunter 1 1 University of Colorado School of Medicine 2 MCD-Biology, University

Phylogenetic ProfilePellegrini et al. PNAS 96, 4285 (1999)

Pairs of genes that are present or absent together in genomes.

Protein i: 110001111001001110001111Protein j: 11100011110000011000111119 matching bits out of 24

Page 4: Anis Karimpour-Fard 1, Corrella Detweiler 2, Ryan T. Gill 3, and Lawrence Hunter 1 1 University of Colorado School of Medicine 2 MCD-Biology, University

Network of Bacillus color coded based on gene function

Cell envelope and cellular process (red), Intermediary metabolism (green), Information pathway or central dogma (yellow), Uncategorized (gray) ,and others (blue),

Page 5: Anis Karimpour-Fard 1, Corrella Detweiler 2, Ryan T. Gill 3, and Lawrence Hunter 1 1 University of Colorado School of Medicine 2 MCD-Biology, University

The unique differences between an organism of choice and each organism are shown in red and conserved genes across species are shown in gray. Org (organism); org0 (organism of choice); P (protein).

Method

Page 6: Anis Karimpour-Fard 1, Corrella Detweiler 2, Ryan T. Gill 3, and Lawrence Hunter 1 1 University of Colorado School of Medicine 2 MCD-Biology, University

Multiple Species Cluster Co-conservation Network

BacillusSalmonella E. coli K12