bioinformatics analysis of nucleotide sequences

105
BIOINFORMATICS ANALYSIS OF NUCLEOTIDE SEQUENCES Members: Avellaneda Vergara Adrian Gustavo Yarasca Cerna Withney Aracely Zegarra Aguinaga Janeth Alexandra Farfan Hernandez Kevin Jhonny Graham Angeles Laura Andrea

Upload: adrian-gustavo-avellaneda-vergara

Post on 05-Jul-2015

320 views

Category:

Science


0 download

DESCRIPTION

DNA sequencing is a technique that provides a detailed analysis of the structure of DNA and consists of a set of techniques and biochemical methods that allow us to determine the sequence of nucleotides (A, C, G, and T) analysis is DNA. In the mid-1970s happened a revolution in technology for identifying DNA sequence. In 1977 was published the complete nucleotide sequence of a viral genome (φ X174, 5375 nucleotides long). This milestone in molecular biology occurred in the laboratory of Frederick Sanger, who identified the amino acid sequence of the polypeptide (insulin) 25 years earlier. Bioinformatics is the application of computer technology to information in molecular biology, encompassing aspects of the acquisition, processing, distribution, analysis, interpretation and integration of biological information. There are several databases that organize information and they are often used, which are presented in the following bioinformatics centers: GenBank (NCBI) and BOLD Systems The NCBI database (established in 1988) has a public database, with three components. Creating databases (store biological data), development of algorithms and statistics to determine relationships between databases, and use these tools to analyze and interpret various types of biological data (sequences of DNA, RNA, protein, protein structure, gene expression, biochemical pathways) The Barcode of Life Data Systems (BOLD) is an informatics workbench aiding the acquisition, storage, analysis, and publication of DNA barcode records. By assembling molecular, morphological, and distributional data, it bridges a traditional bioinformatics chasm. BOLD is freely available to any researcher with interests in DNA barcoding. By providing specialized services, it aids the assembly of records that meet the standards needed to gain BARCODE designation in the global sequence databases. Because of its web-based delivery and flexible data security model, it is also well positioned to support projects that involve broad research alliances.

TRANSCRIPT

Page 1: Bioinformatics Analysis of Nucleotide Sequences

BIOINFORMATICS ANALYSIS OF NUCLEOTIDE SEQUENCES

Members:

• Avellaneda Vergara Adrian Gustavo• Yarasca Cerna Withney Aracely • Zegarra Aguinaga Janeth Alexandra• Farfan Hernandez Kevin Jhonny• Graham Angeles Laura Andrea

Page 2: Bioinformatics Analysis of Nucleotide Sequences

INTRODUCTION

• DNA sequencing is a technique that provides a detailed analysis of the structure of DNA and

consists of a set of techniques and biochemical methods that allow us to determine the

sequence of nucleotides (A, C, G, and T) analysis is DNA.

• In the mid-1970s happened a revolution in technology for identifying DNA sequence. In

1977 was published the complete nucleotide sequence of a viral genome (φ X174, 5375

nucleotides long). This milestone in molecular biology occurred in the laboratory of

Frederick Sanger, who identified the amino acid sequence of the polypeptide (insulin) 25

years earlier.

• Bioinformatics is the application of computer technology to information in molecular

biology, encompassing aspects of the acquisition, processing, distribution, analysis,

interpretation and integration of biological information. There are several databases that

organize information and they are often used, which are presented in the following

bioinformatics centers: GenBank (NCBI) and BOLD Systems

Page 3: Bioinformatics Analysis of Nucleotide Sequences

• The NCBI database (established in 1988) has a public database, with three components.

Creating databases (store biological data), development of algorithms and statistics to

determine relationships between databases, and use these tools to analyze and interpret

various types of biological data (sequences of DNA, RNA, protein, protein structure, gene

expression, biochemical pathways)

• The Barcode of Life Data Systems (BOLD) is an informatics workbench aiding the

acquisition, storage, analysis, and publication of DNA barcode records. By assembling

molecular, morphological, and distributional data, it bridges a traditional bioinformatics

chasm. BOLD is freely available to any researcher with interests in DNA barcoding. By

providing specialized services, it aids the assembly of records that meet the standards

needed to gain BARCODE designation in the global sequence databases. Because of its web-

based delivery and flexible data security model, it is also well positioned to support projects

that involve broad research alliances.

Page 4: Bioinformatics Analysis of Nucleotide Sequences

PROCEDURE(SCREENSHOTS)

Page 5: Bioinformatics Analysis of Nucleotide Sequences

To do the analysis of nucleotides we must follow many steps.

STEP 1

Page 6: Bioinformatics Analysis of Nucleotide Sequences
Page 7: Bioinformatics Analysis of Nucleotide Sequences

STEP 2

Then we have to go to Edit > reverse + complement

Page 8: Bioinformatics Analysis of Nucleotide Sequences
Page 9: Bioinformatics Analysis of Nucleotide Sequences

STEP 3

Page 10: Bioinformatics Analysis of Nucleotide Sequences
Page 11: Bioinformatics Analysis of Nucleotide Sequences

STEP 4

Page 12: Bioinformatics Analysis of Nucleotide Sequences
Page 13: Bioinformatics Analysis of Nucleotide Sequences

• We still in the same sequence, but we are modifying other parts.

STEP 5

Page 14: Bioinformatics Analysis of Nucleotide Sequences
Page 15: Bioinformatics Analysis of Nucleotide Sequences

STEP 6

Page 16: Bioinformatics Analysis of Nucleotide Sequences
Page 17: Bioinformatics Analysis of Nucleotide Sequences

STEP 7

Page 18: Bioinformatics Analysis of Nucleotide Sequences
Page 19: Bioinformatics Analysis of Nucleotide Sequences

STEP 8

Page 20: Bioinformatics Analysis of Nucleotide Sequences
Page 21: Bioinformatics Analysis of Nucleotide Sequences

STEP 9

Page 22: Bioinformatics Analysis of Nucleotide Sequences
Page 23: Bioinformatics Analysis of Nucleotide Sequences

• After saved all our sequences in the notepad we have to go to BioEdit

program.

STEP 10

Page 24: Bioinformatics Analysis of Nucleotide Sequences
Page 25: Bioinformatics Analysis of Nucleotide Sequences

STEP 11

Page 26: Bioinformatics Analysis of Nucleotide Sequences
Page 27: Bioinformatics Analysis of Nucleotide Sequences

STEP 12

Page 28: Bioinformatics Analysis of Nucleotide Sequences
Page 29: Bioinformatics Analysis of Nucleotide Sequences

• Then we must create the consensus sequence, so we go to the

option aligment

STEP 13

Page 30: Bioinformatics Analysis of Nucleotide Sequences
Page 31: Bioinformatics Analysis of Nucleotide Sequences

• We get the consensus sequence and the differtens between some

nucleotides

STEP 14

Page 32: Bioinformatics Analysis of Nucleotide Sequences
Page 33: Bioinformatics Analysis of Nucleotide Sequences

STEP 15

Page 34: Bioinformatics Analysis of Nucleotide Sequences
Page 35: Bioinformatics Analysis of Nucleotide Sequences

• Finally we have all the step to do the research of our specie in the

different data bases that exist (NCBI & BOLDsystem)

STEP 16

Page 36: Bioinformatics Analysis of Nucleotide Sequences
Page 37: Bioinformatics Analysis of Nucleotide Sequences

STEP 17

Page 38: Bioinformatics Analysis of Nucleotide Sequences
Page 39: Bioinformatics Analysis of Nucleotide Sequences

STEP 18

Page 40: Bioinformatics Analysis of Nucleotide Sequences
Page 41: Bioinformatics Analysis of Nucleotide Sequences

• Then we have our result…

STEP 19

Page 42: Bioinformatics Analysis of Nucleotide Sequences
Page 43: Bioinformatics Analysis of Nucleotide Sequences

• Whit this information we find out which kind of living beings is.

STEP 20

Page 44: Bioinformatics Analysis of Nucleotide Sequences
Page 45: Bioinformatics Analysis of Nucleotide Sequences

• Here we have all the possibilities for our specie.

STEP 21

Page 46: Bioinformatics Analysis of Nucleotide Sequences
Page 47: Bioinformatics Analysis of Nucleotide Sequences

STEP 22

Page 48: Bioinformatics Analysis of Nucleotide Sequences
Page 49: Bioinformatics Analysis of Nucleotide Sequences

• After foud all the information about our specie we must go to

BOLDsystem to get detailed information

STEP 23

Page 50: Bioinformatics Analysis of Nucleotide Sequences
Page 51: Bioinformatics Analysis of Nucleotide Sequences

STEP 24

Page 52: Bioinformatics Analysis of Nucleotide Sequences
Page 53: Bioinformatics Analysis of Nucleotide Sequences

STEP 25

Page 54: Bioinformatics Analysis of Nucleotide Sequences
Page 55: Bioinformatics Analysis of Nucleotide Sequences

STEP 26

Page 56: Bioinformatics Analysis of Nucleotide Sequences
Page 57: Bioinformatics Analysis of Nucleotide Sequences

Then we select, in this case 20, differet sequences that are similar

with our consensu sequence. We can see the similarity in the ítem

IDENT that is in red.

STEP 27

Page 58: Bioinformatics Analysis of Nucleotide Sequences
Page 59: Bioinformatics Analysis of Nucleotide Sequences

STEP 28

Page 60: Bioinformatics Analysis of Nucleotide Sequences
Page 61: Bioinformatics Analysis of Nucleotide Sequences

• Then we go to NotePad

STEP 29

Page 62: Bioinformatics Analysis of Nucleotide Sequences
Page 63: Bioinformatics Analysis of Nucleotide Sequences

STEP 30

Page 64: Bioinformatics Analysis of Nucleotide Sequences
Page 65: Bioinformatics Analysis of Nucleotide Sequences

Then open the BioEdit program with the other nucleotides

sequences

STEP 31

Page 66: Bioinformatics Analysis of Nucleotide Sequences
Page 67: Bioinformatics Analysis of Nucleotide Sequences

STEP 32

Page 68: Bioinformatics Analysis of Nucleotide Sequences
Page 69: Bioinformatics Analysis of Nucleotide Sequences

STEP 33

Page 70: Bioinformatics Analysis of Nucleotide Sequences
Page 71: Bioinformatics Analysis of Nucleotide Sequences

STEP 34

Page 72: Bioinformatics Analysis of Nucleotide Sequences
Page 73: Bioinformatics Analysis of Nucleotide Sequences

STEP 35

Page 74: Bioinformatics Analysis of Nucleotide Sequences
Page 75: Bioinformatics Analysis of Nucleotide Sequences

STEP 36

Page 76: Bioinformatics Analysis of Nucleotide Sequences
Page 77: Bioinformatics Analysis of Nucleotide Sequences

STEP 37

Page 78: Bioinformatics Analysis of Nucleotide Sequences
Page 79: Bioinformatics Analysis of Nucleotide Sequences

STEP 38

Page 80: Bioinformatics Analysis of Nucleotide Sequences
Page 81: Bioinformatics Analysis of Nucleotide Sequences

STEP 39

Page 82: Bioinformatics Analysis of Nucleotide Sequences
Page 83: Bioinformatics Analysis of Nucleotide Sequences

STEP 40

Page 84: Bioinformatics Analysis of Nucleotide Sequences
Page 85: Bioinformatics Analysis of Nucleotide Sequences

STEP 41

Page 86: Bioinformatics Analysis of Nucleotide Sequences
Page 87: Bioinformatics Analysis of Nucleotide Sequences

STEP 42

Page 88: Bioinformatics Analysis of Nucleotide Sequences
Page 89: Bioinformatics Analysis of Nucleotide Sequences

STEP 43

Page 90: Bioinformatics Analysis of Nucleotide Sequences
Page 91: Bioinformatics Analysis of Nucleotide Sequences

STEP 44

Page 92: Bioinformatics Analysis of Nucleotide Sequences
Page 93: Bioinformatics Analysis of Nucleotide Sequences

STEP 45

Page 94: Bioinformatics Analysis of Nucleotide Sequences
Page 95: Bioinformatics Analysis of Nucleotide Sequences

STEP 46

Page 96: Bioinformatics Analysis of Nucleotide Sequences
Page 97: Bioinformatics Analysis of Nucleotide Sequences

STEP 47

Page 98: Bioinformatics Analysis of Nucleotide Sequences
Page 99: Bioinformatics Analysis of Nucleotide Sequences

STEP 48

Page 100: Bioinformatics Analysis of Nucleotide Sequences
Page 101: Bioinformatics Analysis of Nucleotide Sequences

STEP 49

Page 102: Bioinformatics Analysis of Nucleotide Sequences
Page 103: Bioinformatics Analysis of Nucleotide Sequences

STEP 50

Page 104: Bioinformatics Analysis of Nucleotide Sequences
Page 105: Bioinformatics Analysis of Nucleotide Sequences

• The Analysis of result and the conclusions are in the other file because is too long.