bioinformatics “half a year in the lab can easily save you an afternoon in front of the...
Post on 19-Dec-2015
217 views
TRANSCRIPT
![Page 1: Bioinformatics “half a year in the lab can easily save you an afternoon in front of the computer….” (unknown)](https://reader036.vdocuments.us/reader036/viewer/2022062320/56649d375503460f94a10698/html5/thumbnails/1.jpg)
Bioinformatics
• “half a year in the lab can easily save you an afternoon in front of the computer….”
(unknown)
![Page 2: Bioinformatics “half a year in the lab can easily save you an afternoon in front of the computer….” (unknown)](https://reader036.vdocuments.us/reader036/viewer/2022062320/56649d375503460f94a10698/html5/thumbnails/2.jpg)
Genome analysis - Fall 2002
• Home page: www.birc.dk/studies
• Book: “Bioinformatics: Sequence and Genome Analysis” by D. W. Mount
• Teachers: Leif Schauser, Christian Storm and others
• Instructors: Niels Høgslund Jørgensen
![Page 3: Bioinformatics “half a year in the lab can easily save you an afternoon in front of the computer….” (unknown)](https://reader036.vdocuments.us/reader036/viewer/2022062320/56649d375503460f94a10698/html5/thumbnails/3.jpg)
![Page 4: Bioinformatics “half a year in the lab can easily save you an afternoon in front of the computer….” (unknown)](https://reader036.vdocuments.us/reader036/viewer/2022062320/56649d375503460f94a10698/html5/thumbnails/4.jpg)
Genome projectsOrganism Year Size (Mbp) # of Genes
Saccharomyces cerevisiae
1996 12 5.200
Caenorhabditis elegans
1998 100 19.000
Drosophila melanogaster
2000 115 13.000
Arabidopsis thaliana
2000 115 25.000
Human 2001 2700 39.000
![Page 5: Bioinformatics “half a year in the lab can easily save you an afternoon in front of the computer….” (unknown)](https://reader036.vdocuments.us/reader036/viewer/2022062320/56649d375503460f94a10698/html5/thumbnails/5.jpg)
Genomes
• First genomes were selected in order to reflect biological diversity.
• Database contains 20 10^9 bp
• Doubling time: 15 month– CPU doubling time 18 month
• Effective tools for sequence analysis needed
![Page 6: Bioinformatics “half a year in the lab can easily save you an afternoon in front of the computer….” (unknown)](https://reader036.vdocuments.us/reader036/viewer/2022062320/56649d375503460f94a10698/html5/thumbnails/6.jpg)
Bioinformatics Master
Masters Thesis
Algorithms in Bioinformatics
Complex systems Protein structure
Algorithms and Datastructure
Molecular Population Genetics and Evolution
Biostatistics
Basics in Programming
Mathematics basic
Molecular biology basics
Intro: Bioinformatics Genome analysis
![Page 7: Bioinformatics “half a year in the lab can easily save you an afternoon in front of the computer….” (unknown)](https://reader036.vdocuments.us/reader036/viewer/2022062320/56649d375503460f94a10698/html5/thumbnails/7.jpg)
Discuss with your neighbour
• What do you expect to learn during this course?
![Page 8: Bioinformatics “half a year in the lab can easily save you an afternoon in front of the computer….” (unknown)](https://reader036.vdocuments.us/reader036/viewer/2022062320/56649d375503460f94a10698/html5/thumbnails/8.jpg)
Topics
• Substitution matrices
• Pairwise alignment
• Multiple alignment
• RNA secondary structure prediction
• Phylogenetic analysis
• Database searching
![Page 9: Bioinformatics “half a year in the lab can easily save you an afternoon in front of the computer….” (unknown)](https://reader036.vdocuments.us/reader036/viewer/2022062320/56649d375503460f94a10698/html5/thumbnails/9.jpg)
Objectives
• Overview: understanding of topics and techniques– Motivation / principles– Mathematical and statistical models– Algorithms
• User-focus– When and how to use applications
![Page 10: Bioinformatics “half a year in the lab can easily save you an afternoon in front of the computer….” (unknown)](https://reader036.vdocuments.us/reader036/viewer/2022062320/56649d375503460f94a10698/html5/thumbnails/10.jpg)
Non-objectives
• To learn how to write programs
• To construct mathematical and statistical models
• To improve algorithms
![Page 11: Bioinformatics “half a year in the lab can easily save you an afternoon in front of the computer….” (unknown)](https://reader036.vdocuments.us/reader036/viewer/2022062320/56649d375503460f94a10698/html5/thumbnails/11.jpg)
Alignment: a central problem
• Alignments are basis of many analysis– Predicting RNA secondary structure– Phylogeny reconstruction– Database searches– Genome analysis mm.
![Page 12: Bioinformatics “half a year in the lab can easily save you an afternoon in front of the computer….” (unknown)](https://reader036.vdocuments.us/reader036/viewer/2022062320/56649d375503460f94a10698/html5/thumbnails/12.jpg)
Why alignment?
• Discuss with your neighbour:– Which principle does an alignment represent
![Page 13: Bioinformatics “half a year in the lab can easily save you an afternoon in front of the computer….” (unknown)](https://reader036.vdocuments.us/reader036/viewer/2022062320/56649d375503460f94a10698/html5/thumbnails/13.jpg)
Why alignment?
• Biological sequences are related– Common ancestors– Duplication, mutation, speciation, variation– Principle of evolution
![Page 14: Bioinformatics “half a year in the lab can easily save you an afternoon in front of the computer….” (unknown)](https://reader036.vdocuments.us/reader036/viewer/2022062320/56649d375503460f94a10698/html5/thumbnails/14.jpg)
Why alignment?
![Page 15: Bioinformatics “half a year in the lab can easily save you an afternoon in front of the computer….” (unknown)](https://reader036.vdocuments.us/reader036/viewer/2022062320/56649d375503460f94a10698/html5/thumbnails/15.jpg)
What is a Sequence What is a Sequence Alignment? (cont.)Alignment? (cont.)
HIGHLY RELATED:HBA_HUMAN GSAQVKGHGKKVADALTNAVAHVDDMPNALSALSDLHAHKL
G+ +VK+HGKKV A+++++AH+D++ +++++LS+LH KLHBB_HUMAN GNPKVKAHGKKVLGAFSDGLAHLDNLKGTFATLSELHCDKL
RELATED:HBA_HUMAN GSAQVKGHGKKVADALTNAVAHV---D--DMPNALSALSDLHAHKL
++ ++++H+ KV + +A ++ +L L+++H+ KLGB2_LUPLU NNPELQAHAGKVFKLVYEAAIQLQVTGVVVTDATLKNLGSVHVSKG
SPURIOUS ALIGNMENT:HBA_HUMAN GSAQVKGHGKKVADALTNAVAHVDDMPNALSALSD----LHAHKL
GS+ + G + +D L ++ H+ D+ A +AL D ++AH+F11G11.2 GSGYLVGDSLTFVDLL--VAQHTADLLAANAALLDEFPQFKAHQE
How to filter out the last one & pick up the second?
![Page 16: Bioinformatics “half a year in the lab can easily save you an afternoon in front of the computer….” (unknown)](https://reader036.vdocuments.us/reader036/viewer/2022062320/56649d375503460f94a10698/html5/thumbnails/16.jpg)
Which species are the closest living relatives of modern humans?
Mitochondrial DNA, most nuclear DNA-encoded genes, and DNA/DNA hybridization all show that bonobos and chimpanzees are related more closely to humans than either are to gorillas.
The pre-molecular view was that the great apes (chimpanzees, gorillas and orangutans) formed a clade separate from humans, and that humans diverged from the apes at least 15-30 MYA.
MYA
Chimpanzees
Orangutans Humans
Bonobos
GorillasHumans
Bonobos
Gorillas Orangutans
Chimpanzees
MYA015-30014
![Page 17: Bioinformatics “half a year in the lab can easily save you an afternoon in front of the computer….” (unknown)](https://reader036.vdocuments.us/reader036/viewer/2022062320/56649d375503460f94a10698/html5/thumbnails/17.jpg)
Ribosome structure
![Page 18: Bioinformatics “half a year in the lab can easily save you an afternoon in front of the computer….” (unknown)](https://reader036.vdocuments.us/reader036/viewer/2022062320/56649d375503460f94a10698/html5/thumbnails/18.jpg)
Rimosome rRNA
![Page 19: Bioinformatics “half a year in the lab can easily save you an afternoon in front of the computer….” (unknown)](https://reader036.vdocuments.us/reader036/viewer/2022062320/56649d375503460f94a10698/html5/thumbnails/19.jpg)
Conclusions
• Bioinformatic methods are motivated by the explosion of sequence data
• This course gives a broad introduction to a number of analysis tools
• Most of these tools rely on the principle of evolution
![Page 20: Bioinformatics “half a year in the lab can easily save you an afternoon in front of the computer….” (unknown)](https://reader036.vdocuments.us/reader036/viewer/2022062320/56649d375503460f94a10698/html5/thumbnails/20.jpg)
Schedule
• Lectures: Tuesdays 9-11, auditorium D1, Department of Mathematical Sciences and on Fridays 9-10, also auditorium D1
• Computer / theoretical exercises take place on Wednesdays 14-17 (HOLD 1), and Thursday 11-14 (HOLD 2) at the Department of Computer Sciences, building 540, room K.16.
![Page 21: Bioinformatics “half a year in the lab can easily save you an afternoon in front of the computer….” (unknown)](https://reader036.vdocuments.us/reader036/viewer/2022062320/56649d375503460f94a10698/html5/thumbnails/21.jpg)
![Page 22: Bioinformatics “half a year in the lab can easily save you an afternoon in front of the computer….” (unknown)](https://reader036.vdocuments.us/reader036/viewer/2022062320/56649d375503460f94a10698/html5/thumbnails/22.jpg)
Daimi: some practical things
• Unix introduction
• Computer exercises
• Username / password
• Sign web & user ‘brugererklaeringer’
• Keys / Copy accounts
![Page 23: Bioinformatics “half a year in the lab can easily save you an afternoon in front of the computer….” (unknown)](https://reader036.vdocuments.us/reader036/viewer/2022062320/56649d375503460f94a10698/html5/thumbnails/23.jpg)
What is a Sequence Alignment?
• HIGHLY RELATED:HBA_HUMAN GSAQVKGHGKKVADALTNAVAHVDDMPNALSALSDLHAHKL
G+ +VK+HGKKV A+++++AH+D++ +++++LS+LH KLHBB_HUMAN GNPKVKAHGKKVLGAFSDGLAHLDNLKGTFATLSELHCDKL
• RELATED:HBA_HUMAN GSAQVKGHGKKVADALTNAVAHV---D--DMPNALSALSDLHAHKL
++ ++++H+ KV + +A ++ +L L+++H+ K
LGB2_LUPLU NNPELQAHAGKVFKLVYEAAIQLQVTGVVVTDATLKNLGSVHVSKG
• SPURIOUS ALIGNMENT:HBA_HUMAN GSAQVKGHGKKVADALTNAVAHVDDMPNALSALSD----LHAHKL
GS+ + G + +D L ++ H+ D+ A +AL D ++AH+F11G11.2 GSGYLVGDSLTFVDLL--VAQHTADLLAANAALLDEFPQFKAHQE
• How to filter out the last one & pick up the second?