phyre2 dr. lawrence kelley structural bioinformatics group imperial college london

Post on 30-Mar-2015

220 Views

Category:

Documents

2 Downloads

Preview:

Click to see full reader

TRANSCRIPT

Phyre2

Dr. Lawrence KelleyStructural Bioinformatics Group

Imperial College London

SVYDAAAQLTADVKKDLRDSWKVIGSDKKGNGVALMTTLFADNQETIGYFKRLGNVSQGMANDKLRGHSITLMYALQNFIDQLDNPDSLDLVCS…….

Predict the 3D structure adopted by a user-supplied protein sequence

Phyre2

How does Phyre2 work?

ARDLVIPMIYCGHGY

Search the 10 million known sequences for homologues using PSI-Blast.

Phyre2

Homologous sequences

User sequence

ARDLVIPMIYCGHGY HMM

PSI-Blast

Phyre2

Hidden Markov model

Capture the mutational propensities at each position in the protein

An evolutionary fingerprint

User sequence

~ 65,000 known 3D structures

Phyre2

~ 65,000 known 3D structures

Phyre2

~ 65,000 known 3D structures

Phyre2

HAPTLVRDC…….

Extract sequence

~ 65,000 known 3D structures

Phyre2

HAPTLVRDC…….

PSI-Blast

Extract sequence

~ 65,000 known 3D structures

Phyre2

HAPTLVRDC…….

HMM

PSI-Blast

Hidden Markov modelfor sequence of KNOWN structure

Extract sequence

~ 65,000 known 3D structures

Phyre2

HMM HMM HMM

~ 65,000 hidden Markov models

~ 65,000 known 3D structures

Phyre2

Hidden Markov Model Database of

KNOWNSTRUCTURES

ARDLVIPMIYCGHGY HMM

PSI-Blast

Phyre2

Hidden Markov model

Capture the mutational propensities at each position in the protein

An evolutionary fingerprint

ARDLVIPMIYCGHGY HMM

PSI-Blast

Hidden Markov Model DB of

KNOWNSTRUCTURES

HMM-HMM matching

Phyre2

Alignments of user sequence to known structuresranked by confidence.

ARDL--VIPMIYCGHGYAFDLCDLIPV--CGMAY

Sequence of known structure

ARDLVIPMIYCGHGY HMM

PSI-Blast

Hidden Markov Model DB of

KNOWNSTRUCTURES

HMM-HMM matching

Phyre2

ARDL--VIPMIYCGHGYAFDLCDLIPV--CGMAY

Sequence of known structure

3D-Model

ARDLVIPMIYCGHGY HMM

PSI-Blast

Hidden Markov Model DB of

KNOWNSTRUCTURES

HMM-HMM matching

Phyre2

ARDL--VIPMIYCGHGYAFDLCDLIPV--CGMAY

Sequence of known structure

Very powerful – able to reliably detect extremely remote homology

Routinely creates accurate models even when sequence identity is <15%

3D-Model

top related