protein data bank
TRANSCRIPT
![Page 1: Protein Data Bank](https://reader036.vdocuments.us/reader036/viewer/2022081507/55501aa7b4c90535638b50d2/html5/thumbnails/1.jpg)
WWW.RCSB.ORG/PDB
Protein Data Bank(PDB)
![Page 2: Protein Data Bank](https://reader036.vdocuments.us/reader036/viewer/2022081507/55501aa7b4c90535638b50d2/html5/thumbnails/2.jpg)
CONTENTS
• Introduction• Supported and funded by• History• PDB Holdings list• Member organizations• Task forces• PDB ID• PDB File format• Browse to WWW.RCSB.ORG/PDB/
![Page 3: Protein Data Bank](https://reader036.vdocuments.us/reader036/viewer/2022081507/55501aa7b4c90535638b50d2/html5/thumbnails/3.jpg)
“The repository reservoir data bank to store the authenticated structures of Protein and Nucleic acid”
Single worldwide database and hundreds of secondary databases categorize the data differently.
Key resource in the area of structural biology, stores 3D structural data of large biological molecules such as Proteins and Nucleic acids.
Data is submitted by Biologists and Biochemists from all around the world to be freely accessible on internet via its member organizations’ websites and is updated weekly.
The mission is to maintain a single Protein Data Bank Archive of Macromolecular Structural data.
PROTEIN DATA BANKPDB
![Page 4: Protein Data Bank](https://reader036.vdocuments.us/reader036/viewer/2022081507/55501aa7b4c90535638b50d2/html5/thumbnails/4.jpg)
The Protein Data Bank (PDB) is operated by:
Rutgers, The State University of New Jersey.
The San Diego Supercomputer Center at the University of California, San Diego.
RCSB-the Research Collaborator for Structural Bioinformatics
The PDB is supported by funds from the National Science Foundation, the Department of Energy, and the National Institutes of Health.
SUPPORTED AND FUNDED
![Page 5: Protein Data Bank](https://reader036.vdocuments.us/reader036/viewer/2022081507/55501aa7b4c90535638b50d2/html5/thumbnails/5.jpg)
Two forces to initiate PDB:
Growing collection of sets of protein structural data by X-Ray diffraction.
NMR-nuclear Magnetic Resonance method to visualize protein structures in 3D, emerged in 1968.
In 1969, Dr Edger Meyer began to write software to store atomic coordinates files in a common format to make them available for geometric and graphical evaluation.
In 1971, one of Dr Meyer’s programs- SEARCH- enabled networking i.e enabled the researchers to access information from database to study protein structures offline.
PDB HISTORY
![Page 6: Protein Data Bank](https://reader036.vdocuments.us/reader036/viewer/2022081507/55501aa7b4c90535638b50d2/html5/thumbnails/6.jpg)
In 1973, upon Hamilton’s death, Dr Tom Koetzle took over direction of PDB for 20 years.
mmCIF project completed and Structural genomics began in 1970s.
In 1980s, IUCr guidelines established, number of structures deposited increases and independent biological databases established – e.g., the NDB.
In Oct, 1998; PDB was transferred to Research Collaboratory for Structural Bioinformatics (RCSB), complete transfer since 1999. Dr Helen M Berman of Rutgers University was the new director.
In 2003, with the formation of wwPDB, the PDB became an international organization having three member organizations.
In 2006, the BMRB joined PDB.
![Page 7: Protein Data Bank](https://reader036.vdocuments.us/reader036/viewer/2022081507/55501aa7b4c90535638b50d2/html5/thumbnails/7.jpg)
ExperimentalMethod
Proteins Nucleic AcidsProtein/Nucleic Acid
complexesOther Total
X-ray diffraction 62750 1323 3050 2 67125
NMR 7962 960 179 7 9108
Electron microscopy 262 22 96 0 380
Hybrid 41 3 1 1 46
Other 133 4 5 13 155
Total: 71148 2312 3331 23 76814
PDB HOLDINGS LIST ( as of 25 Oct, 2011)
![Page 8: Protein Data Bank](https://reader036.vdocuments.us/reader036/viewer/2022081507/55501aa7b4c90535638b50d2/html5/thumbnails/8.jpg)
Act as Data deposition, Data processing and Distribution centers for PDB data.
Three are founding member organizations:
PDBe…Protein Data Bank in Europe. PDBj…Protein Data Bank in Japan. RCSB…Research Collaboratory for Structural Bioinformatics.
The Biological Magnetic Resonance Data Bank (BMRB) joined later in 2006.
Another organization Worldwide Protein Data Bank (wwPDB) oversees PDB. wwPDB reviews and annotates each submitted entry and then it is automatically checked for plausibility( the source code) for validation software is available.
MEMBER ORGANIZATIONS
![Page 9: Protein Data Bank](https://reader036.vdocuments.us/reader036/viewer/2022081507/55501aa7b4c90535638b50d2/html5/thumbnails/9.jpg)
WW
PDB
![Page 10: Protein Data Bank](https://reader036.vdocuments.us/reader036/viewer/2022081507/55501aa7b4c90535638b50d2/html5/thumbnails/10.jpg)
![Page 11: Protein Data Bank](https://reader036.vdocuments.us/reader036/viewer/2022081507/55501aa7b4c90535638b50d2/html5/thumbnails/11.jpg)
![Page 12: Protein Data Bank](https://reader036.vdocuments.us/reader036/viewer/2022081507/55501aa7b4c90535638b50d2/html5/thumbnails/12.jpg)
![Page 13: Protein Data Bank](https://reader036.vdocuments.us/reader036/viewer/2022081507/55501aa7b4c90535638b50d2/html5/thumbnails/13.jpg)
![Page 14: Protein Data Bank](https://reader036.vdocuments.us/reader036/viewer/2022081507/55501aa7b4c90535638b50d2/html5/thumbnails/14.jpg)
X-Ray diffraction :-Spacing of atoms determined by location intensities spot on photographic plate by X-Ray e.g lyzozyme.
Limited to just crystal structures only
NMR (about 15% e.g., hemoglobin)…estimations of distances between pairs of atoms of proteins. Final conformation is obtained after solving distance geometry problem.
Illuminate dynamic side,conformatonal changes, protein folding as well
TASK FORCES
![Page 15: Protein Data Bank](https://reader036.vdocuments.us/reader036/viewer/2022081507/55501aa7b4c90535638b50d2/html5/thumbnails/15.jpg)
![Page 16: Protein Data Bank](https://reader036.vdocuments.us/reader036/viewer/2022081507/55501aa7b4c90535638b50d2/html5/thumbnails/16.jpg)
![Page 17: Protein Data Bank](https://reader036.vdocuments.us/reader036/viewer/2022081507/55501aa7b4c90535638b50d2/html5/thumbnails/17.jpg)
Each structure published in PDB receives a four character alphanumeric identifier or accession number. Like, 1ANG or 4hhb.
However, this cant be used as an identifier for biomolecules. Because several structures for the same molecule in different environments or conformations-are contained in PDB with different PDB IDs.
PDB IDENTIFIER (PDB ID)
HAEMOGLOBIN(2DN2)
![Page 18: Protein Data Bank](https://reader036.vdocuments.us/reader036/viewer/2022081507/55501aa7b4c90535638b50d2/html5/thumbnails/18.jpg)
Standard data representation…encoded in data dictionary. The metadata model supporting this representation is used by all PDB data processing and database software tools.
1. PDB file format was restricted to 80 characters per line initially.
2. In 1996, macromolecular Crystallographic Information File (mmCIF) format started.
3. In 2005, XML version called as PDBML, was described.
PDB FILE FORMAT
![Page 19: Protein Data Bank](https://reader036.vdocuments.us/reader036/viewer/2022081507/55501aa7b4c90535638b50d2/html5/thumbnails/19.jpg)
The Protein Data Bank (pdb) file format is a textual file format
describing the three dimensional structures of molecules held in the Protein Data Bank.
provides description and annotation structure atomic coordinates, side chains, secondary structure, as well as atomic connectivity
Water , ions, nucleic acids, ligands…
PDB file format
![Page 20: Protein Data Bank](https://reader036.vdocuments.us/reader036/viewer/2022081507/55501aa7b4c90535638b50d2/html5/thumbnails/20.jpg)
![Page 21: Protein Data Bank](https://reader036.vdocuments.us/reader036/viewer/2022081507/55501aa7b4c90535638b50d2/html5/thumbnails/21.jpg)
mmCIF is the acronym for the macromolecular Crystallographic Information File.
mmCIF is based on a subset of the syntax rules for the Self Defining Text Archive (STAR) file.
A Dictionary Description Language (DDL) defines the structure of mmCIF dictionaries. Dictionaries provide the metadata which define the content of mmCIF data files.
mmCIF data files, dictionaries and DDLs are all expressed in a common syntax.
mmCIF
![Page 22: Protein Data Bank](https://reader036.vdocuments.us/reader036/viewer/2022081507/55501aa7b4c90535638b50d2/html5/thumbnails/22.jpg)
![Page 23: Protein Data Bank](https://reader036.vdocuments.us/reader036/viewer/2022081507/55501aa7b4c90535638b50d2/html5/thumbnails/23.jpg)
basic information, more detailed
description of PDB, PDBML and mmCIF file formats
can be found at Protein Data Bank web sites.highly recommended to get familiar with all
rules of PDB format (such as gaps between columns)
BEACAUSE…
PDBML
![Page 24: Protein Data Bank](https://reader036.vdocuments.us/reader036/viewer/2022081507/55501aa7b4c90535638b50d2/html5/thumbnails/24.jpg)
![Page 25: Protein Data Bank](https://reader036.vdocuments.us/reader036/viewer/2022081507/55501aa7b4c90535638b50d2/html5/thumbnails/25.jpg)
put either a search term (for example, a protein name) or a PDB number
Point your browser to: www.rcsb.org/pdb/
![Page 26: Protein Data Bank](https://reader036.vdocuments.us/reader036/viewer/2022081507/55501aa7b4c90535638b50d2/html5/thumbnails/26.jpg)
RCSB PDB.htm
![Page 27: Protein Data Bank](https://reader036.vdocuments.us/reader036/viewer/2022081507/55501aa7b4c90535638b50d2/html5/thumbnails/27.jpg)
2DN2
![Page 28: Protein Data Bank](https://reader036.vdocuments.us/reader036/viewer/2022081507/55501aa7b4c90535638b50d2/html5/thumbnails/28.jpg)
![Page 29: Protein Data Bank](https://reader036.vdocuments.us/reader036/viewer/2022081507/55501aa7b4c90535638b50d2/html5/thumbnails/29.jpg)
RCSB PDB - 2DN2 Structure Summary.htm
![Page 30: Protein Data Bank](https://reader036.vdocuments.us/reader036/viewer/2022081507/55501aa7b4c90535638b50d2/html5/thumbnails/30.jpg)
UMINHAEMOGLOBIN
![Page 31: Protein Data Bank](https://reader036.vdocuments.us/reader036/viewer/2022081507/55501aa7b4c90535638b50d2/html5/thumbnails/31.jpg)
![Page 32: Protein Data Bank](https://reader036.vdocuments.us/reader036/viewer/2022081507/55501aa7b4c90535638b50d2/html5/thumbnails/32.jpg)
RCSB PDB - Query Results_HAEMOGLOBIN.htm
![Page 33: Protein Data Bank](https://reader036.vdocuments.us/reader036/viewer/2022081507/55501aa7b4c90535638b50d2/html5/thumbnails/33.jpg)
![Page 34: Protein Data Bank](https://reader036.vdocuments.us/reader036/viewer/2022081507/55501aa7b4c90535638b50d2/html5/thumbnails/34.jpg)
![Page 35: Protein Data Bank](https://reader036.vdocuments.us/reader036/viewer/2022081507/55501aa7b4c90535638b50d2/html5/thumbnails/35.jpg)
![Page 36: Protein Data Bank](https://reader036.vdocuments.us/reader036/viewer/2022081507/55501aa7b4c90535638b50d2/html5/thumbnails/36.jpg)
RCSB PDB - Query Results_homosepiense.htm
![Page 37: Protein Data Bank](https://reader036.vdocuments.us/reader036/viewer/2022081507/55501aa7b4c90535638b50d2/html5/thumbnails/37.jpg)
RCSB PDB - 2DN2 Structure Summary.htm
CLICK
![Page 38: Protein Data Bank](https://reader036.vdocuments.us/reader036/viewer/2022081507/55501aa7b4c90535638b50d2/html5/thumbnails/38.jpg)
If the contents of the PDB are thought of as primary data,
THENhundreds of derived (i.e., secondary)
databases categorize the data differently. For exampleSCOP & CATH :
categorize structures according to type of structure and assumed evolutionary relations;
GO categorize structures based on genes.
Protein Annotation
![Page 39: Protein Data Bank](https://reader036.vdocuments.us/reader036/viewer/2022081507/55501aa7b4c90535638b50d2/html5/thumbnails/39.jpg)
CLICK
![Page 40: Protein Data Bank](https://reader036.vdocuments.us/reader036/viewer/2022081507/55501aa7b4c90535638b50d2/html5/thumbnails/40.jpg)
The Structural Classification of Proteins (SCOP) database is a largely manual classification of protein structural domains based on similarities of their structures and amino acid sequences
![Page 41: Protein Data Bank](https://reader036.vdocuments.us/reader036/viewer/2022081507/55501aa7b4c90535638b50d2/html5/thumbnails/41.jpg)
Class:the overall secondary-structure content of the domainArchitecture:high structural similarity but no evidence of homology.Topology:a large-scale grouping of topologies which share particular structural featuresHomologous superfamily:indicative of a demonstrable evolutionary relationship.
![Page 42: Protein Data Bank](https://reader036.vdocuments.us/reader036/viewer/2022081507/55501aa7b4c90535638b50d2/html5/thumbnails/42.jpg)
Pfam is a database of protein families that includes their annotations and multiple sequence alignment generated using hidden Markov models
![Page 43: Protein Data Bank](https://reader036.vdocuments.us/reader036/viewer/2022081507/55501aa7b4c90535638b50d2/html5/thumbnails/43.jpg)
![Page 44: Protein Data Bank](https://reader036.vdocuments.us/reader036/viewer/2022081507/55501aa7b4c90535638b50d2/html5/thumbnails/44.jpg)
CLICK
![Page 45: Protein Data Bank](https://reader036.vdocuments.us/reader036/viewer/2022081507/55501aa7b4c90535638b50d2/html5/thumbnails/45.jpg)
Select your desire method
![Page 46: Protein Data Bank](https://reader036.vdocuments.us/reader036/viewer/2022081507/55501aa7b4c90535638b50d2/html5/thumbnails/46.jpg)
CLICK
RCSB PDB - 2DN2 Literature Report.htm
![Page 47: Protein Data Bank](https://reader036.vdocuments.us/reader036/viewer/2022081507/55501aa7b4c90535638b50d2/html5/thumbnails/47.jpg)
CLICK
RCSB PDB - 2DN2 Biology and Chemistry Report.htm
![Page 48: Protein Data Bank](https://reader036.vdocuments.us/reader036/viewer/2022081507/55501aa7b4c90535638b50d2/html5/thumbnails/48.jpg)
CLICK
![Page 49: Protein Data Bank](https://reader036.vdocuments.us/reader036/viewer/2022081507/55501aa7b4c90535638b50d2/html5/thumbnails/49.jpg)
CLICK
RCSB PDB - 2DN2 Geometry Report.htm
![Page 50: Protein Data Bank](https://reader036.vdocuments.us/reader036/viewer/2022081507/55501aa7b4c90535638b50d2/html5/thumbnails/50.jpg)
& FINALLY
![Page 51: Protein Data Bank](https://reader036.vdocuments.us/reader036/viewer/2022081507/55501aa7b4c90535638b50d2/html5/thumbnails/51.jpg)
Structural View of BiologyShow the gradual updating released entries
![Page 52: Protein Data Bank](https://reader036.vdocuments.us/reader036/viewer/2022081507/55501aa7b4c90535638b50d2/html5/thumbnails/52.jpg)
![Page 53: Protein Data Bank](https://reader036.vdocuments.us/reader036/viewer/2022081507/55501aa7b4c90535638b50d2/html5/thumbnails/53.jpg)
![Page 54: Protein Data Bank](https://reader036.vdocuments.us/reader036/viewer/2022081507/55501aa7b4c90535638b50d2/html5/thumbnails/54.jpg)
![Page 55: Protein Data Bank](https://reader036.vdocuments.us/reader036/viewer/2022081507/55501aa7b4c90535638b50d2/html5/thumbnails/55.jpg)
![Page 56: Protein Data Bank](https://reader036.vdocuments.us/reader036/viewer/2022081507/55501aa7b4c90535638b50d2/html5/thumbnails/56.jpg)
![Page 57: Protein Data Bank](https://reader036.vdocuments.us/reader036/viewer/2022081507/55501aa7b4c90535638b50d2/html5/thumbnails/57.jpg)
You can also select different display view
![Page 58: Protein Data Bank](https://reader036.vdocuments.us/reader036/viewer/2022081507/55501aa7b4c90535638b50d2/html5/thumbnails/58.jpg)
can also download in different view
![Page 59: Protein Data Bank](https://reader036.vdocuments.us/reader036/viewer/2022081507/55501aa7b4c90535638b50d2/html5/thumbnails/59.jpg)
Text file can be viewed or modified in editor. Structure files may be viewed using various free and commercial
visualizations programs and Web browsers plug-ins like OPEN SOURCE PDB SOFTWERES Jmol Molekel MeshLab (able to import PDB data set and buildup surfaces
from them) QuteMol Avogadro OPEN BUT NOT FREE PYMOL , RASMOL, VIST PROT 3DS & STAR BIOCHEM
Viewing the data
![Page 60: Protein Data Bank](https://reader036.vdocuments.us/reader036/viewer/2022081507/55501aa7b4c90535638b50d2/html5/thumbnails/60.jpg)
http://www.rcsb.org/pdb/static.do?p=software/software_links/molecular_graphics.html
The RCSB PDB website contains an extensive list of both free and commercial molecule visualization programs and web browser plug-in.
RCSB PDB_softwere.htm
![Page 61: Protein Data Bank](https://reader036.vdocuments.us/reader036/viewer/2022081507/55501aa7b4c90535638b50d2/html5/thumbnails/61.jpg)
![Page 62: Protein Data Bank](https://reader036.vdocuments.us/reader036/viewer/2022081507/55501aa7b4c90535638b50d2/html5/thumbnails/62.jpg)
haemoglobin.pdb
![Page 63: Protein Data Bank](https://reader036.vdocuments.us/reader036/viewer/2022081507/55501aa7b4c90535638b50d2/html5/thumbnails/63.jpg)
central archive of experimentally solved bimolecular structures. But
only allows data retrieval does not provide collaboration or user feedback.
In contrast, PDBWiki allows for sharing expert knowledge about structures deposited in the PDB.
provides tools for discussing and annotating proteins in a collaborative way.
Limitation
![Page 64: Protein Data Bank](https://reader036.vdocuments.us/reader036/viewer/2022081507/55501aa7b4c90535638b50d2/html5/thumbnails/64.jpg)
The Protein Data Bank (PDB) is the central archive of experimentally solved bimolecular structures. However, the PDB only allows data retrieval and does not provide functionality for collaboration or user feedback.
In contrast, PDBWiki allows for sharing expert knowledge about structures deposited in the PDB. It provides tools for discussing and annotating proteins in a collaborative way. The goal is to create a central and freely-accessible repository of user-contributed information that will be useful for anyone working with PDB structures. As such PDBWiki can be considered a part of a wider effort in community-based biological databases curation.
Limitation
![Page 65: Protein Data Bank](https://reader036.vdocuments.us/reader036/viewer/2022081507/55501aa7b4c90535638b50d2/html5/thumbnails/65.jpg)
Lo0k there is something more for you…..
![Page 66: Protein Data Bank](https://reader036.vdocuments.us/reader036/viewer/2022081507/55501aa7b4c90535638b50d2/html5/thumbnails/66.jpg)