anis karimpour-fard 1, corrella detweiler 2, ryan t. gill 3, and lawrence hunter 1 1 university of...
TRANSCRIPT
![Page 1: Anis Karimpour-Fard 1, Corrella Detweiler 2, Ryan T. Gill 3, and Lawrence Hunter 1 1 University of Colorado School of Medicine 2 MCD-Biology, University](https://reader035.vdocuments.us/reader035/viewer/2022070415/5697bf791a28abf838c821f3/html5/thumbnails/1.jpg)
Anis Karimpour-Fard1, Corrella Detweiler2, Ryan T. Gill3, and Lawrence Hunter1
1University of Colorado School of Medicine2MCD-Biology, University of Colorado, Boulder3Department of Chemical and Biological Engineering, University of Colorado, Boulder [email protected]
http://compbio.uchsc.edu/Hunter
A new method for using co-conservation to predict protein
function
![Page 2: Anis Karimpour-Fard 1, Corrella Detweiler 2, Ryan T. Gill 3, and Lawrence Hunter 1 1 University of Colorado School of Medicine 2 MCD-Biology, University](https://reader035.vdocuments.us/reader035/viewer/2022070415/5697bf791a28abf838c821f3/html5/thumbnails/2.jpg)
The problem ……Close to 400 Microbial genomes are fully sequence and there is high percent of genes with unknown function
The meaning of protein function
Eisenberg, D. et. al. Nature 2000
S PA
Biochemical view
The function of protein A is its action on Substrate to form a Product
The function of A is the context of its interactions with other proteins in the cell
Post genomic view
A
B
YZ
MDN
32% 30%
45% 30%
32%
40%
0%
10%
20%
30%
40%
50%
organism
unknown function
Bacillus subtilisE.Coli
k12
P. aeruginosa
Burkholderia pseudomallei K96243
Bacillus clausii KSM-K16
X C
Chlamydia trachomatis D/UW-3/CX
![Page 3: Anis Karimpour-Fard 1, Corrella Detweiler 2, Ryan T. Gill 3, and Lawrence Hunter 1 1 University of Colorado School of Medicine 2 MCD-Biology, University](https://reader035.vdocuments.us/reader035/viewer/2022070415/5697bf791a28abf838c821f3/html5/thumbnails/3.jpg)
Phylogenetic ProfilePellegrini et al. PNAS 96, 4285 (1999)
Pairs of genes that are present or absent together in genomes.
Protein i: 110001111001001110001111Protein j: 11100011110000011000111119 matching bits out of 24
![Page 4: Anis Karimpour-Fard 1, Corrella Detweiler 2, Ryan T. Gill 3, and Lawrence Hunter 1 1 University of Colorado School of Medicine 2 MCD-Biology, University](https://reader035.vdocuments.us/reader035/viewer/2022070415/5697bf791a28abf838c821f3/html5/thumbnails/4.jpg)
Network of Bacillus color coded based on gene function
Cell envelope and cellular process (red), Intermediary metabolism (green), Information pathway or central dogma (yellow), Uncategorized (gray) ,and others (blue),
![Page 5: Anis Karimpour-Fard 1, Corrella Detweiler 2, Ryan T. Gill 3, and Lawrence Hunter 1 1 University of Colorado School of Medicine 2 MCD-Biology, University](https://reader035.vdocuments.us/reader035/viewer/2022070415/5697bf791a28abf838c821f3/html5/thumbnails/5.jpg)
The unique differences between an organism of choice and each organism are shown in red and conserved genes across species are shown in gray. Org (organism); org0 (organism of choice); P (protein).
Method
![Page 6: Anis Karimpour-Fard 1, Corrella Detweiler 2, Ryan T. Gill 3, and Lawrence Hunter 1 1 University of Colorado School of Medicine 2 MCD-Biology, University](https://reader035.vdocuments.us/reader035/viewer/2022070415/5697bf791a28abf838c821f3/html5/thumbnails/6.jpg)
Multiple Species Cluster Co-conservation Network
BacillusSalmonella E. coli K12