![Page 1: Nahla Bakhamis, MSc - fac.ksu.edu.safac.ksu.edu.sa/sites/default/files/bioinformatics.pdf · What is bioinformatics ? •Computational management & analysis of biological data •Coined](https://reader036.vdocuments.us/reader036/viewer/2022062604/5fbbe181c5dfa9655e2c29e6/html5/thumbnails/1.jpg)
Bioinformatics
Nahla Bakhamis, MSc
OUTLINE
• What is bioinformatics?
• Why bioinformatics?
• Types of Data
• Applications
• OMIM workshop
• Primer design workshop
11/19/2015 NAHLA BAKHAMIS, MSc
What is bioinformatics?
• Computational management & analysis of biological data
• Term coined by Paulien Hogeweg and Ben Hesper in 1970
• Used in genomics and genetics since the 1980s
• Also called: biocomputing, systems biology, computational biology
Aims
• To store the maximum amount of data on the internet
• Efficient access to and management of data
• Increase understanding of biological processes
• Increase research efforts in the field
Why bioinformatics?
We have the sequence; what does it mean?
Not all genes are active
Genes interact with each other
Common activities in bioinformatics
1. Mapping & analysing DNA & protein sequences
2. Aligning and comparing different DNA & protein sequences
3. Creating & viewing 3D models of protein structures
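As a rough illustration of activity 2, two sequences can be compared by scoring their best global alignment with dynamic programming (the Needleman-Wunsch approach). The function name and the scoring values below (match +1, mismatch -1, gap -2) are assumptions chosen for illustration; they do not come from the slides, and real tools use substitution matrices and more refined gap penalties.

```python
def global_alignment_score(a: str, b: str,
                           match: int = 1,
                           mismatch: int = -1,
                           gap: int = -2) -> int:
    """Needleman-Wunsch global alignment score of sequences a and b.

    Scoring values are illustrative assumptions, not recommended defaults.
    """
    # prev[j] = best score for aligning the processed prefix of a with b[:j]
    prev = [j * gap for j in range(len(b) + 1)]
    for i in range(1, len(a) + 1):
        curr = [i * gap]  # aligning a[:i] against an empty prefix of b
        for j in range(1, len(b) + 1):
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            up = prev[j] + gap        # gap inserted in b
            left = curr[j - 1] + gap  # gap inserted in a
            curr.append(max(diag, up, left))
        prev = curr
    return prev[-1]


if __name__ == "__main__":
    print(global_alignment_score("GATTACA", "GATTACA"))  # identical: 7
    print(global_alignment_score("ACGT", "AGT"))         # one gap: 1
```

Only the score is returned here; recovering the alignment itself would need a traceback over the full score matrix.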
Data classification
• Primary data:
Raw/basic data, e.g. DNA or amino acid (aa) sequences (building blocks)
• Secondary data:
arrangement of amino acids in a protein
• Tertiary data:
more complicated, related to 3D structure of proteins
Unit of information
• DNA (Genome)
• RNA (transcriptome)
• Proteins (Proteome)
• Or genetic, genomic and metabolic
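The three units are linked by transcription (DNA to RNA) and translation (RNA to protein). The sketch below is a minimal illustration, assuming the input DNA is the coding strand and using the standard genetic code; the names `transcribe`, `translate` and `CODON_TABLE` are illustrative and not taken from the slides.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = ("FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG")
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def transcribe(dna: str) -> str:
    """Return the mRNA for a coding-strand DNA sequence (T becomes U)."""
    return dna.upper().replace("T", "U")


def translate(mrna: str) -> str:
    """Translate mRNA from its first base until a stop codon or the end.

    Assumes only A, C, G and U; ambiguous bases raise KeyError.
    """
    protein = []
    for i in range(0, len(mrna) - 2, 3):
        codon = mrna[i:i + 3].upper().replace("U", "T")
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    mrna = transcribe("ATGGCCTAA")
    print(mrna)             # AUGGCCUAA
    print(translate(mrna))  # MA
```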
DNA
• Simple sequence analysis (database searching)
• Regulatory regions
• Gene finding
• Whole genome annotations
• Comparative genomics between species and strains
DNA
• Raw DNA sequence:
coding or non-coding?
can it be parsed into genes?
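One simple first pass at the coding/non-coding question is to scan a raw sequence for open reading frames (ORFs): an ATG start codon followed, in the same frame, by a stop codon. The sketch below is an assumption-laden illustration (forward strand only, no introns, hypothetical `find_orfs` name and `min_codons` threshold); real gene finders use far richer models.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def find_orfs(seq: str, min_codons: int = 2):
    """Return (start, end, frame) tuples for ATG...stop ORFs.

    Only the forward strand is scanned. `end` is exclusive and includes
    the stop codon. `min_codons` counts the codons before the stop
    (start codon included) and is an illustrative threshold.
    """
    seq = seq.upper()
    orfs = []
    for frame in range(3):
        start = None
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon == "ATG":
                start = i  # first start codon opens the ORF
            elif start is not None and codon in STOP_CODONS:
                if (i - start) // 3 >= min_codons:
                    orfs.append((start, i + 3, frame))
                start = None  # look for the next ORF in this frame
    return orfs


if __name__ == "__main__":
    dna = "CCATGAAATAGGG"
    for start, end, frame in find_orfs(dna):
        print(frame, dna[start:end])  # 2 ATGAAATAG
```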
Whole genomes
RNA
• Tissue specific expression
• Structure
• Single gene analysis
• Experimental data
• Microarray and expression array analyses
Protein
• Proteome of an organism
• Mass spectrometry
• 2D, 3D and 4D structures (interactions)
Other integrative data
• Metabolic pathways
• Regulatory networks
• Whole-organism phylogeny
• Environments, habitats, ecology
Applications
• Medical
understanding life processes in healthy & diseased states
SNPs
• Pharmaceutical and biotech industry
developing new drugs; gene- or structure-based drug design
• Agricultural applications
higher-yield crops
Biological problems computers can help with
Software and tools
• Range from simple command-line tools to more complex programs
• Web services available
Databases
• 4 major types:
1. Nucleotide databases
2. Protein databases
3. Whole-genome databases, e.g. Ensembl
4. Specialized databases
Nucleotide databases
INSDC: International Nucleotide Sequence Database Collaboration
• EMBL
European Molecular Biology Laboratory (Germany)
• GenBank (US)
• DDBJ
DNA Data Bank of Japan
The three collaborate through an international advisory meeting
Protein databases
• 3 major types:
1. Sequence (primary)
UniProt, Swiss-Prot and PIR
2. Structure
PDB, SCOP
3. Interactions
Specialized databases
1. Inherited disease databases
• OMIM (Online Mendelian Inheritance in Man)
• Funded by NHGRI, supported by JHM (copyright)
• Originally developed by Dr. Victor A. McKusick in the 1960s
2. Microarray databases
OMIM
• Focuses on single-gene Mendelian diseases/disorders/phenotypes
e.g. cystic fibrosis (CF), sickle cell anemia
• Complex diseases with a significant single-gene contribution
e.g. complement factor H and age-related macular degeneration
• Descriptions of recurrent deletion and duplication syndromes
e.g. Potocki-Shaffer syndrome, chromosome 10q26 deletion syndrome
OMIM (workshop)
• http://omim.org
• http://www.openhelix.com/OMIM
OMIM (workshop)
• You will learn about:
Basic search
Phenotype result
Genotype result
Gene map information
Advanced search
Additional features
Exercises
Primer design
Primer
• A strand of nucleic acid that serves as the starting point for DNA/RNA synthesis
• Why is it required?
• Polymerase starts replication at the 3’ end of the primer
• Used in PCR and DNA sequencing
Primer design
• NCBI (National Center for Biotechnology Information)
• Primer3
• dbSNP (Database of Single Nucleotide Polymorphisms)
• UCSC Genome Browser
• Ensembl Genome Browser
Primer design (NCBI)
• Tutorial
Primer design
• Points to take into consideration:
1. Avoid mononucleotide repeats (loop formation)
2. Avoid primer dimers
3. The reverse primer should be the reverse complement of the given sequence
4. In TA cloning, efficiency can be increased by adding an AG tail to the 3’ & 5’ ends
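Points 1 to 3 can be checked with a few lines of code. The sketch below is a simplified illustration: the function names and the `max_len` window are assumptions, and the dimer check only looks for perfect complementarity between the two 3’ ends, whereas tools such as Primer3 use thermodynamic scoring.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA sequence (point 3)."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def longest_homopolymer(seq: str) -> int:
    """Length of the longest mononucleotide run, e.g. 4 for ...GGGG... (point 1)."""
    longest = run = 0
    prev = None
    for base in seq.upper():
        run = run + 1 if base == prev else 1
        longest = max(longest, run)
        prev = base
    return longest


def three_prime_overlap(primer_a: str, primer_b: str, max_len: int = 8) -> int:
    """Longest perfectly complementary overlap of the two 3' ends (point 2).

    A long overlap suggests the primers may anneal to each other and
    form a primer dimer. `max_len` is an illustrative search window.
    """
    a, b = primer_a.upper(), primer_b.upper()
    best = 0
    for k in range(1, min(len(a), len(b), max_len) + 1):
        # The 3' end of a pairs antiparallel with the 3' end of b.
        if reverse_complement(a[-k:]) == b[-k:]:
            best = k
    return best


if __name__ == "__main__":
    template = "ATGCCGTTAGGCATTCAGGA"  # made-up example sequence
    reverse_primer = reverse_complement(template[-6:])
    print(reverse_primer)                                 # TCCTGA
    print(longest_homopolymer("ACGGGGTA"))                # 4
    print(three_prime_overlap("AAAACGCG", "TTTTCGCG"))    # 4
```

What counts as too long a run or too long an overlap depends on the assay, so no cut-off is hard-coded here.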