big data in biology
TRANSCRIPT
![Page 1: Big data in biology](https://reader035.vdocuments.us/reader035/viewer/2022070521/58efbdbe1a28abd92b8b4583/html5/thumbnails/1.jpg)
Big Data in Biology
Future Scope and Study
![Page 2: Big data in biology](https://reader035.vdocuments.us/reader035/viewer/2022070521/58efbdbe1a28abd92b8b4583/html5/thumbnails/2.jpg)
WHAT IS BIG DATA??
![Page 3: Big data in biology](https://reader035.vdocuments.us/reader035/viewer/2022070521/58efbdbe1a28abd92b8b4583/html5/thumbnails/3.jpg)
So, how is this data produced??● The data produced by the social
media in a single minute is
astounding!
● All this data is stored and
analyzed for many obvious
reasons.
![Page 4: Big data in biology](https://reader035.vdocuments.us/reader035/viewer/2022070521/58efbdbe1a28abd92b8b4583/html5/thumbnails/4.jpg)
how is it related to biology??
● DNA-DeoxyriboNucleicAcid
● DNA carries all the genetic information
in our body.
● It drives the human body.
● Genome is an organism’s complete set
of DNA.
![Page 5: Big data in biology](https://reader035.vdocuments.us/reader035/viewer/2022070521/58efbdbe1a28abd92b8b4583/html5/thumbnails/5.jpg)
human genome project● It is an international scientific research project.
● The goal of the project is to determine the sequence of chemical base
pairs that make up human DNA.
● The project was successfully completed in 2003 and 90% of the human
genome was sequenced.
● This was just a start of a new era of sequencing.
![Page 6: Big data in biology](https://reader035.vdocuments.us/reader035/viewer/2022070521/58efbdbe1a28abd92b8b4583/html5/thumbnails/6.jpg)
![Page 7: Big data in biology](https://reader035.vdocuments.us/reader035/viewer/2022070521/58efbdbe1a28abd92b8b4583/html5/thumbnails/7.jpg)
Why do we need sequencing?
Bacterial Lights in Paris Genetically modified mosquitoes
![Page 8: Big data in biology](https://reader035.vdocuments.us/reader035/viewer/2022070521/58efbdbe1a28abd92b8b4583/html5/thumbnails/8.jpg)
Why do we need sequencing?
CRISPR/Cas9
![Page 9: Big data in biology](https://reader035.vdocuments.us/reader035/viewer/2022070521/58efbdbe1a28abd92b8b4583/html5/thumbnails/9.jpg)
Big data parkingClouds are a solution, but they also throw up fresh challenges. Ironically, their proliferation
can cause a bottleneck if data end up parked on several clouds and thus still need to
be moved to be shared.
And clouds means entrusting valuable data to a distant service provider who may be
subject to power changes or other disruptions.
Scientists experiment with different constellations to suit their needs and trust levels.
Clouds can be used for both data storage and computing.
This reduces the overhead of transferring the data into a local machine and computing it on
the local machine.
![Page 10: Big data in biology](https://reader035.vdocuments.us/reader035/viewer/2022070521/58efbdbe1a28abd92b8b4583/html5/thumbnails/10.jpg)
![Page 11: Big data in biology](https://reader035.vdocuments.us/reader035/viewer/2022070521/58efbdbe1a28abd92b8b4583/html5/thumbnails/11.jpg)
Databases storing these genomics
![Page 12: Big data in biology](https://reader035.vdocuments.us/reader035/viewer/2022070521/58efbdbe1a28abd92b8b4583/html5/thumbnails/12.jpg)
● The information necessary to build and control any living organism is written in its
genome and it took 13 years to decipher.
● A single decade later sequencing a genome takes a few hours on a machine that fits on
a tabletop.
● The tsunami of biological data generates new problems, it needs to be analysed
properly to unearth and retrieve the exciting knowledge it contains.
● Getting the most from the data requires interpreting them in light of all the relevant prior
knowledge.
● That means scientists have to store a large data sets, and analyse, compare and share
them - not simple tasks.
What are we concerned about??
![Page 13: Big data in biology](https://reader035.vdocuments.us/reader035/viewer/2022070521/58efbdbe1a28abd92b8b4583/html5/thumbnails/13.jpg)
It is estimated that by 2025 , exabytes(1018) of genomics data will be produced globally
and will far exceed from twitter and facebook.
Moreover, the genomics data being produced roughly doubles every year and will
require new solutions in precision and accuracy for storage, analysis and sharing.
The European Bioinformatics Institute(EBI), UK, part of the European Molecular Biology
Laboratory and one of the world’s largest biology-data repositories, currently stores
20 petabytes(20*1015) of data and back-ups about genes, proteins and small
molecules.
Genomic data accounts for 2 petabytes of that, a number that more than doubles every
year.
Data Explosion
![Page 14: Big data in biology](https://reader035.vdocuments.us/reader035/viewer/2022070521/58efbdbe1a28abd92b8b4583/html5/thumbnails/14.jpg)
![Page 16: Big data in biology](https://reader035.vdocuments.us/reader035/viewer/2022070521/58efbdbe1a28abd92b8b4583/html5/thumbnails/16.jpg)
Storing data using dna molecules● This could be the future of the data
storage.
● DNA molecules can store huge amounts
of data.
● DNA storage is robust.
![Page 17: Big data in biology](https://reader035.vdocuments.us/reader035/viewer/2022070521/58efbdbe1a28abd92b8b4583/html5/thumbnails/17.jpg)
How do we do it??
![Page 18: Big data in biology](https://reader035.vdocuments.us/reader035/viewer/2022070521/58efbdbe1a28abd92b8b4583/html5/thumbnails/18.jpg)
Microsoft has already started storing some of its data using DNA.
The first phase of demonstration was successfully completed.
Microsoft partnered with the startup “Twist Bioscience” which produced oligonucleotides for
them and arranged them in the sequence specified.
One of the drawbacks of this storage is it cannot be commercialised.
![Page 19: Big data in biology](https://reader035.vdocuments.us/reader035/viewer/2022070521/58efbdbe1a28abd92b8b4583/html5/thumbnails/19.jpg)
Thank You. Have a good day.