cnit final presentation
CNIT Final Presentation
Chris Thompson
April 18th, 2013
CNIT 227
Table of Contents
Introduction
Materials
Methods
Results and Conclusion
INTRODUCTION
Bioinformatics
Bioinformatics – an interdisciplinary field that develops and improves upon methods for storing, retrieving, organizing, and analyzing biological data.
Bioinformatics is important because without the technologies produced and developed through it, many of the experiments and assays we do today would not be possible.
CNIT
CNIT is the bioinformatics course at Purdue, focused on annotating the genomes of mycobacteriophages.
The overall goal is to annotate the genome of the RiverMonster phage so that other researchers can use it in the future.
Bacteriophages
• A virus that infects and replicates in bacteria
• One of the most common and populous organisms in existence
• Many have a mosaic genome
• Wide range of potential uses
• Mycobacteriophages infect M. smegmatis
Clusters
• System to organize bacteriophages
• Phages sorted by factors such as genome length, presence of certain genes, organization of genome, GC content, and plaque size and characteristics
• A, B, C, D, E, F, G, H, I, J, K, L, M, N, O, P, Q, R, S, Singleton, and T
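GC content, one of the sorting factors listed above, is simply the fraction of G and C bases in a genome. The sketch below shows one way to compute it; the function name and the toy sequence are illustrative assumptions, not RiverMonster data or part of any cluster-assignment tool.

```python
def gc_content(sequence: str) -> float:
    """Return the fraction of G and C bases in a DNA sequence (0.0 to 1.0)."""
    seq = sequence.upper()
    # Count only unambiguous bases so N or gap characters do not skew the result.
    counted = [base for base in seq if base in "ACGT"]
    if not counted:
        return 0.0
    gc = sum(1 for base in counted if base in "GC")
    return gc / len(counted)


if __name__ == "__main__":
    toy_genome = "ATGCGCGCTTAGGCCGATCGGCTA"  # made-up sequence for illustration only
    print(f"GC content: {gc_content(toy_genome):.1%}")
```

On its own, GC content would not assign a phage to a cluster; it is one signal among the several factors on this slide.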
RiverMonster
• Discovered in 2010 in West Lafayette
• Mycobacteriophage
• Cluster E
• 144 genes in total
• Many protein products are unknown
• Overall geographical presence is unknown
Through CNIT and bioinformatics we are trying to answer some of the unknowns about RiverMonster
MATERIALS
Bioinformatics Tools
• DNA Master
• Phamerator
• Glimmer
• GeneMark
• NCBI and BLAST
• EverNote
DNA Master
• Designed and written by Dr. Jeffrey Lawrence
• Annotation program
• Can auto-annotate entire genomes
• Uses information from Glimmer and GeneMark
• Can locally BLAST genes
Phamerator
• Developed in 2011
• Linux-based bioinformatic program
• Used for comparative phage genomics
• Can visualize entire phage genomes
• Separates phages into “phams”
Glimmer
• Stands for Gene Locator and Interpolated Markov ModelER
• Used for finding genes in microbial DNA
• Uses models and algorithms to distinguish between coding and non-coding DNA
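To give a feel for how a Markov model can separate coding from non-coding DNA, here is a simplified sketch. It is not Glimmer's algorithm: Glimmer interpolates between Markov models of several orders, while this example uses a single fixed order and a log-odds score. The function names, the order of 2, the pseudocount, and the tiny training strings are all assumptions made for illustration.

```python
import math
from collections import defaultdict

BASES = "ACGT"


def train_markov(sequences, order=2, pseudocount=1.0):
    """Estimate P(next base | previous `order` bases) from training sequences."""
    counts = defaultdict(lambda: defaultdict(float))
    for seq in sequences:
        seq = seq.upper()
        for i in range(order, len(seq)):
            context, base = seq[i - order:i], seq[i]
            if base in BASES and all(c in BASES for c in context):
                counts[context][base] += 1

    def prob(context, base):
        # Pseudocounts keep unseen contexts from producing zero probabilities.
        ctx = counts.get(context, {})
        total = sum(ctx.values()) + pseudocount * len(BASES)
        return (ctx.get(base, 0.0) + pseudocount) / total

    return prob


def log_odds(seq, coding_prob, noncoding_prob, order=2):
    """Sum of log(P_coding / P_noncoding) over the sequence; positive favors coding."""
    seq = seq.upper()
    score = 0.0
    for i in range(order, len(seq)):
        context, base = seq[i - order:i], seq[i]
        if base not in BASES or any(c not in BASES for c in context):
            continue  # skip ambiguous bases
        score += math.log(coding_prob(context, base) / noncoding_prob(context, base))
    return score


if __name__ == "__main__":
    # Toy training data, not real gene sequences.
    coding_model = train_markov(["GCCGCCGCCGCCGCC"])
    noncoding_model = train_markov(["ATATATATATATAT"])
    for candidate in ("GCCGCCGCC", "ATATATAT"):
        print(candidate, round(log_odds(candidate, coding_model, noncoding_model), 2))
```

A real gene finder would train on thousands of known genes and also account for reading frame, which this sketch ignores.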
GeneMark
• A family of gene prediction programs developed at the Georgia Institute of Technology
• Determines the protein-coding potential of a DNA sequence
• Uses many of the same algorithms and models as Glimmer
NCBI and BLAST
• National Center for Biotechnology Information
• Basic Local Alignment Search Tool
• Program that compares DNA sequences with a large database of known sequences
• Used to find similar gene sequences
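The idea of comparing a query against a database of known sequences can be sketched with a plain Smith-Waterman local alignment score. This is not how BLAST itself works internally (BLAST uses a faster seed-and-extend heuristic to approximate this kind of search), and the scoring values, function names, and toy database below are assumptions chosen for illustration.

```python
def local_alignment_score(query, subject, match=2, mismatch=-1, gap=-2):
    """Best Smith-Waterman local alignment score between two sequences."""
    query, subject = query.upper(), subject.upper()
    cols = len(subject) + 1
    prev = [0] * cols  # previous row of the dynamic-programming table
    best = 0
    for i in range(1, len(query) + 1):
        curr = [0] * cols
        for j in range(1, cols):
            diag = prev[j - 1] + (match if query[i - 1] == subject[j - 1] else mismatch)
            # A local alignment may restart anywhere, so scores never drop below 0.
            curr[j] = max(0, diag, prev[j] + gap, curr[j - 1] + gap)
            best = max(best, curr[j])
        prev = curr
    return best


def best_hit(query, database):
    """Return (name, score) of the database entry most similar to the query."""
    scored = {name: local_alignment_score(query, seq) for name, seq in database.items()}
    name = max(scored, key=scored.get)
    return name, scored[name]


if __name__ == "__main__":
    # Hypothetical database entries, not real phage genes.
    toy_database = {"gene_a": "TTTTTTTT", "gene_b": "GGACGTGG"}
    print(best_hit("ACGT", toy_database))
```

Real BLAST results also report statistical measures such as E-values, which this sketch does not attempt to compute.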
EverNote
• Started in 2008
• Designed for note-taking and archiving
• Used as an online lab notebook for CNIT
METHODS
Organization
• Genome split into two sections
• Genes 0 to 65 by Jon and Bill
• Genes 66 to 144 by Chris and Nyema
• Split again into four sections
• 0 to 23 by Jon
• 24 to 65 by Bill
• 100 to 123 by Chris
• 124 to 144 by Nyema
Process
• Documented the auto-annotated gene call
• Ran the Shine-Dalgarno test
• BLASTed the gene and compared scores
• Compared homologous genes in Phamerator
• Made the final call
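The Shine-Dalgarno step checks whether a ribosome binding site sits just upstream of a candidate start codon. The sketch below is a simplified stand-in, not DNA Master's scoring: it counts matches to the consensus AGGAGG in a window upstream of the start. The function name, the 20-base window, and the toy sequence are assumptions.

```python
SD_CONSENSUS = "AGGAGG"  # commonly cited Shine-Dalgarno consensus


def best_sd_match(genome, start_index, window=20, consensus=SD_CONSENSUS):
    """Find the best consensus match upstream of a candidate start codon.

    start_index is the 0-based position of the first base of the start codon.
    Returns (matches, spacer), where spacer is the number of bases between the
    end of the matched site and the start codon, or (0, None) if nothing matches.
    """
    genome = genome.upper()
    upstream = genome[max(0, start_index - window):start_index]
    k = len(consensus)
    best = (0, None)
    for i in range(len(upstream) - k + 1):
        site = upstream[i:i + k]
        matches = sum(1 for a, b in zip(site, consensus) if a == b)
        spacer = len(upstream) - (i + k)
        if matches > best[0]:  # on ties, the site furthest upstream is kept
            best = (matches, spacer)
    return best


if __name__ == "__main__":
    # Made-up sequence: a perfect site, a 7-base spacer, then an ATG start.
    leader = "TTTTAGGAGGTTTTTTT"
    toy = leader + "ATGAAA"
    print(best_sd_match(toy, len(leader)))
```

When a gene has several possible start codons, running this for each candidate gives one piece of evidence to weigh alongside the BLAST and Phamerator comparisons in the process above.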
First Section
• Genes 66 to 144
• Split up evens and odds
• I had the even-numbered genes
• No outstandingly tricky gene calls
• Gene 88 seems to belong to a family of kinases, many of them hypothetical
• Gene 92 belongs to a family of RNA ligases
• Gene 94 is transcription factor WhiB
Second Section
• Genes 101 to 123
• Every gene
• Gene 101 belongs to a protease family
• Gene 112 contains genes for polymerases
• Genes 116 and 117 were reverse genes
• Gene 117 had many inconsistencies and was difficult to call
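Reverse genes are read from the opposite strand, so their sequence is the reverse complement of the genome text between their coordinates. A minimal sketch follows; the helper names, the 1-based inclusive coordinate convention, and the toy genome are assumptions for illustration, not DNA Master's interface.

```python
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def extract_gene(genome: str, start: int, stop: int, strand: str) -> str:
    """Return a gene's sequence in its own reading direction.

    start and stop are 1-based inclusive positions on the forward strand
    (an assumed convention); strand is '+' for forward or '-' for reverse.
    """
    region = genome[start - 1:stop]
    if strand == "-":
        return reverse_complement(region)
    return region


if __name__ == "__main__":
    toy_genome = "AAACATTT"  # made-up sequence
    # On the reverse strand, positions 3 to 6 read as a gene beginning with ATG.
    print(extract_gene(toy_genome, 3, 6, "-"))
```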
RESULTS AND CONCLUSION
Accomplishments
• Personally called 39 genes
• Called 144 genes as a class
• Analyzed protein products
• Completed a final draft of the RiverMonster genome
Significance
• Genome can be used by future scientists
• Proves validity of undergraduate research
• Learned about bioinformatics, bacteriophages, genomes, annotation, and biotechnology
Future Work
• Check and finalize all gene calls
• Compile the DNA Master file
• Send to HHMI and SEA-PHAGES to be put in Phamerator
The End