genome revolution: compsci 004g 8.1 blast l what is blast? what is it good for? basic
DESCRIPTION
Genome Revolution: COMPSCI 004G 8.3 Global Alignment COELACANTH P E L I C A NTRANSCRIPT
![Page 1: Genome Revolution: COMPSCI 004G 8.1 BLAST l What is BLAST? What is it good for? Basic](https://reader037.vdocuments.us/reader037/viewer/2022090107/5a4d1bf07f8b9ab0599e5910/html5/thumbnails/1.jpg)
Genome Revolution: COMPSCI 004G 8.1
BLASThttp://www.ncbi.nlm.nih.gov/Education/BLASTinfo/information3.html What is BLAST? What is it good for?
Basic Local Alignment Search Tools Given query (DNA or Protein) find
“matches” What is a match? How do judge a good
one?
Two kinds of alignment or matches Global alignment (sequence to sequence) Local alignment (subseq to subseq)
![Page 2: Genome Revolution: COMPSCI 004G 8.1 BLAST l What is BLAST? What is it good for? Basic](https://reader037.vdocuments.us/reader037/viewer/2022090107/5a4d1bf07f8b9ab0599e5910/html5/thumbnails/2.jpg)
Genome Revolution: COMPSCI 004G 8.2
Global Alignment Words explain (see O’Reilly BLAST)
Align ‘coelacanth’ and ‘pelican’ Score +1 for match, -1 for mismatch, -1
gap
coelacanth coelacanthp-elican-- -pelican--
What are scores of these matches? What’s the best score? Needleman-Wunsch algorithm
![Page 3: Genome Revolution: COMPSCI 004G 8.1 BLAST l What is BLAST? What is it good for? Basic](https://reader037.vdocuments.us/reader037/viewer/2022090107/5a4d1bf07f8b9ab0599e5910/html5/thumbnails/3.jpg)
Genome Revolution: COMPSCI 004G 8.3
Global Alignment
0 -1 -2 -3 -4 -5 -6 -7 -8 -9 -10
-1 -1 -2
-2
-3 -2 -2 -1
-4
-5
-6
-7 0
C O E L A C A N T H
PELICAN
![Page 4: Genome Revolution: COMPSCI 004G 8.1 BLAST l What is BLAST? What is it good for? Basic](https://reader037.vdocuments.us/reader037/viewer/2022090107/5a4d1bf07f8b9ab0599e5910/html5/thumbnails/4.jpg)
Genome Revolution: COMPSCI 004G 8.4
Local Alignment Subsequence alignment rather than global
Advantages? Tradeoffs? Score +1 for match, -1 for mismatch, -1
gap
(co)ELECAN(th) (p)ELICAN
Smith-Waterman: initialize to zero, only score positive, trace-back from highest score
![Page 5: Genome Revolution: COMPSCI 004G 8.1 BLAST l What is BLAST? What is it good for? Basic](https://reader037.vdocuments.us/reader037/viewer/2022090107/5a4d1bf07f8b9ab0599e5910/html5/thumbnails/5.jpg)
Genome Revolution: COMPSCI 004G 8.5
Local Alignment
0 0 0 0 0 0 0 0 0 000 10 2 10 1000 4
C O E L A C A N T H
PELICAN
![Page 6: Genome Revolution: COMPSCI 004G 8.1 BLAST l What is BLAST? What is it good for? Basic](https://reader037.vdocuments.us/reader037/viewer/2022090107/5a4d1bf07f8b9ab0599e5910/html5/thumbnails/6.jpg)
Genome Revolution: COMPSCI 004G 8.6
Analysis How long does this algorithm take to
execute? How do we measure the complexity/size? Time v. Memory
We need a different measure of “gap match” and mismatch? Just using +1 or -1 doesn’t provide domain
specific analysis In practice use scoring matrix, see ncbi site
![Page 7: Genome Revolution: COMPSCI 004G 8.1 BLAST l What is BLAST? What is it good for? Basic](https://reader037.vdocuments.us/reader037/viewer/2022090107/5a4d1bf07f8b9ab0599e5910/html5/thumbnails/7.jpg)
Genome Revolution: COMPSCI 004G 8.7
BLOSUM 62 scoring matrix http://www.ncbi.nlm.nih.gov/books/bv.fcgi?rid=sef.figgrp.194