pdb-protein data bank scop –protein structure classification cath –protein structure...

34
PDB - Protein Data Bank SCOP Protein structure classification CATH Protein structure classification genTHREADER 3D structure prediction Swiss-Model 3D structure prediction ModBase - A database of 3D struc. Predict. Protein Structure Prediction II

Post on 21-Dec-2015

231 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: PDB-Protein Data Bank SCOP –Protein structure classification CATH –Protein structure classification genTHREADER–3D structure prediction Swiss-Model–3D

• PDB - Protein Data Bank• SCOP – Protein structure classification • CATH – Protein structure classification • genTHREADER – 3D structure prediction• Swiss-Model – 3D structure prediction• ModBase - A database of 3D struc.

Predict.

Protein Structure Prediction II

Page 2: PDB-Protein Data Bank SCOP –Protein structure classification CATH –Protein structure classification genTHREADER–3D structure prediction Swiss-Model–3D
Page 3: PDB-Protein Data Bank SCOP –Protein structure classification CATH –Protein structure classification genTHREADER–3D structure prediction Swiss-Model–3D
Page 4: PDB-Protein Data Bank SCOP –Protein structure classification CATH –Protein structure classification genTHREADER–3D structure prediction Swiss-Model–3D

PDB fileAccession number

Java based visualization tools

Structural Classification

Page 5: PDB-Protein Data Bank SCOP –Protein structure classification CATH –Protein structure classification genTHREADER–3D structure prediction Swiss-Model–3D

PDB provides the atomic coordinates of the structure :

Which can be viewed by different visualization tools

Page 6: PDB-Protein Data Bank SCOP –Protein structure classification CATH –Protein structure classification genTHREADER–3D structure prediction Swiss-Model–3D
Page 7: PDB-Protein Data Bank SCOP –Protein structure classification CATH –Protein structure classification genTHREADER–3D structure prediction Swiss-Model–3D

SCOP: Structural Classification of Proteins

http://scop.mrc-lmb.cam.ac.uk/scop/

•Based on known protein structures

•Manually created by visual inspection

•Hierarchical database structure:–Class, Fold, Superfamily, Family, Protein

and Species

Page 8: PDB-Protein Data Bank SCOP –Protein structure classification CATH –Protein structure classification genTHREADER–3D structure prediction Swiss-Model–3D
Page 9: PDB-Protein Data Bank SCOP –Protein structure classification CATH –Protein structure classification genTHREADER–3D structure prediction Swiss-Model–3D

Parents of node

Childrenof node

Node

Page 10: PDB-Protein Data Bank SCOP –Protein structure classification CATH –Protein structure classification genTHREADER–3D structure prediction Swiss-Model–3D

Parents of node

Childrenof node

Node

Page 11: PDB-Protein Data Bank SCOP –Protein structure classification CATH –Protein structure classification genTHREADER–3D structure prediction Swiss-Model–3D
Page 12: PDB-Protein Data Bank SCOP –Protein structure classification CATH –Protein structure classification genTHREADER–3D structure prediction Swiss-Model–3D
Page 13: PDB-Protein Data Bank SCOP –Protein structure classification CATH –Protein structure classification genTHREADER–3D structure prediction Swiss-Model–3D
Page 14: PDB-Protein Data Bank SCOP –Protein structure classification CATH –Protein structure classification genTHREADER–3D structure prediction Swiss-Model–3D

CATH: Protein Structure Classificationby Class, Architecture, Topology and Homology

http://www.cathdb.info/

•Class: The secondary structure composition: mainly-alpha, mainly-beta and alpha-beta.

• Architecture: The overall shape of the domain structure. Orientations of the secondary structures : e.g. barrel or 3-layer sandwich.

• Topology: Structures are grouped into fold groups at this level depending on both the overall shape and connectivity of the secondary structures.

•Homologous Superfamily: Evolutionary conserved structures

Page 15: PDB-Protein Data Bank SCOP –Protein structure classification CATH –Protein structure classification genTHREADER–3D structure prediction Swiss-Model–3D
Page 16: PDB-Protein Data Bank SCOP –Protein structure classification CATH –Protein structure classification genTHREADER–3D structure prediction Swiss-Model–3D

CATH: Protein Structure Classificationby Class, Architecture, Topology and Homology

Page 17: PDB-Protein Data Bank SCOP –Protein structure classification CATH –Protein structure classification genTHREADER–3D structure prediction Swiss-Model–3D

genTHREADER

Input sequence

Type of Analysis

(PSIPRED,MEMSAT,

genTHREAD)

http://bioinf.cs.ucl.ac.uk/psipred/psiform.html

Page 18: PDB-Protein Data Bank SCOP –Protein structure classification CATH –Protein structure classification genTHREADER–3D structure prediction Swiss-Model–3D

GenTHREADEROutput

Page 19: PDB-Protein Data Bank SCOP –Protein structure classification CATH –Protein structure classification genTHREADER–3D structure prediction Swiss-Model–3D

GenTHREADEROutput

The output sequences show some extent of sequence homologyBut high level of secondary structure conservation

Page 20: PDB-Protein Data Bank SCOP –Protein structure classification CATH –Protein structure classification genTHREADER–3D structure prediction Swiss-Model–3D

SWISS-MODEL

An automated protein modeling server.

http://swissmodel.expasy.org/

Page 21: PDB-Protein Data Bank SCOP –Protein structure classification CATH –Protein structure classification genTHREADER–3D structure prediction Swiss-Model–3D

SWISS-MODEL• The SWISS-MODEL algorithm can be divided into

three steps:

1.Search for suitable templates: the server finds

all similarities of a query sequence to sequences

of known structure. It uses the BLASTP2 program

with the ExNRL-3D database (a derivative of PDB

database, specified for SWISS-MODEL). You get

these partial results as a SwissModel TraceLog

file.

2.Check sequence identity with target: All templates

with sequence identities above 25% are selected

3.Create the model using the ProModII program. You

get this as a SwissModel-Model file.

Page 22: PDB-Protein Data Bank SCOP –Protein structure classification CATH –Protein structure classification genTHREADER–3D structure prediction Swiss-Model–3D

SWISS-MODEL

Get PDB file by E-mail

Load to J-Mol

Page 23: PDB-Protein Data Bank SCOP –Protein structure classification CATH –Protein structure classification genTHREADER–3D structure prediction Swiss-Model–3D

Single StructureHomology Modeling

Page 24: PDB-Protein Data Bank SCOP –Protein structure classification CATH –Protein structure classification genTHREADER–3D structure prediction Swiss-Model–3D

Swiss-Model file

Structures used for the homology

model

query

Page 25: PDB-Protein Data Bank SCOP –Protein structure classification CATH –Protein structure classification genTHREADER–3D structure prediction Swiss-Model–3D

Comparative Modeling• Accuracy of the comparative model is related to the sequence identity on which it is based

>50% sequence identity = high accuracy 30%-50% sequence identity= 90% modeled

<30% sequence identity =low accuracy (many errors)

Page 26: PDB-Protein Data Bank SCOP –Protein structure classification CATH –Protein structure classification genTHREADER–3D structure prediction Swiss-Model–3D

ModBaseA Homology Model Database

http://modbase.compbio.ucsf.edu/modbase-cgi/index.cgi

Page 27: PDB-Protein Data Bank SCOP –Protein structure classification CATH –Protein structure classification genTHREADER–3D structure prediction Swiss-Model–3D
Page 28: PDB-Protein Data Bank SCOP –Protein structure classification CATH –Protein structure classification genTHREADER–3D structure prediction Swiss-Model–3D
Page 29: PDB-Protein Data Bank SCOP –Protein structure classification CATH –Protein structure classification genTHREADER–3D structure prediction Swiss-Model–3D

Ligand Binding Site

Page 30: PDB-Protein Data Bank SCOP –Protein structure classification CATH –Protein structure classification genTHREADER–3D structure prediction Swiss-Model–3D

Excersize The sequence below belongs to the Prion that causes the “mad cow” disease. This protein becomes toxic

when it gets into the brain and misfolds causing native cellular prions to deform and aggregate. In structural terms, the prion toxicity in leaded by a folding change into an instable structure.

 

>PRION_1ag2

GLGGYMLGSAMSRPMIHFGNDWEDRYYRENMYRYPNQVYYRPVDQYSNQNNFVHDCVNITIKQHTVTTTTKGENFTETDVKMMERVVEQMCVTQYQKESQAYY

 

Use PSIpred, geneTHREADER and PROFsec in order to predict its secondary and tertiary structures. Based on the secondary and tertiary structure predictions

1. Can you suggest the region which could be responsible for the structural instability?

2. What is the secondary structure in the real solved structure?

3. What is the expected structural change in this region?

 

Page 31: PDB-Protein Data Bank SCOP –Protein structure classification CATH –Protein structure classification genTHREADER–3D structure prediction Swiss-Model–3D

PSIPRED

Page 32: PDB-Protein Data Bank SCOP –Protein structure classification CATH –Protein structure classification genTHREADER–3D structure prediction Swiss-Model–3D

geneTHREADER

Page 33: PDB-Protein Data Bank SCOP –Protein structure classification CATH –Protein structure classification genTHREADER–3D structure prediction Swiss-Model–3D

PROFsec

Page 34: PDB-Protein Data Bank SCOP –Protein structure classification CATH –Protein structure classification genTHREADER–3D structure prediction Swiss-Model–3D

PROFsec

Answer : alpha helix geneTHREADER turn into B-sheet PSIPRED anf PROFsec, prediction in this area are not consistent in the different tools.