bioinformatics analysis of nucleotide sequences

Post on 05-Jul-2015

320 Views

Category:

Science

0 Downloads

Preview:

Click to see full reader

DESCRIPTION

DNA sequencing is a technique that provides a detailed analysis of the structure of DNA and consists of a set of techniques and biochemical methods that allow us to determine the sequence of nucleotides (A, C, G, and T) analysis is DNA. In the mid-1970s happened a revolution in technology for identifying DNA sequence. In 1977 was published the complete nucleotide sequence of a viral genome (φ X174, 5375 nucleotides long). This milestone in molecular biology occurred in the laboratory of Frederick Sanger, who identified the amino acid sequence of the polypeptide (insulin) 25 years earlier. Bioinformatics is the application of computer technology to information in molecular biology, encompassing aspects of the acquisition, processing, distribution, analysis, interpretation and integration of biological information. There are several databases that organize information and they are often used, which are presented in the following bioinformatics centers: GenBank (NCBI) and BOLD Systems The NCBI database (established in 1988) has a public database, with three components. Creating databases (store biological data), development of algorithms and statistics to determine relationships between databases, and use these tools to analyze and interpret various types of biological data (sequences of DNA, RNA, protein, protein structure, gene expression, biochemical pathways) The Barcode of Life Data Systems (BOLD) is an informatics workbench aiding the acquisition, storage, analysis, and publication of DNA barcode records. By assembling molecular, morphological, and distributional data, it bridges a traditional bioinformatics chasm. BOLD is freely available to any researcher with interests in DNA barcoding. By providing specialized services, it aids the assembly of records that meet the standards needed to gain BARCODE designation in the global sequence databases. Because of its web-based delivery and flexible data security model, it is also well positioned to support projects that involve broad research alliances.

TRANSCRIPT

BIOINFORMATICS ANALYSIS OF NUCLEOTIDE SEQUENCES

Members:

• Avellaneda Vergara Adrian Gustavo• Yarasca Cerna Withney Aracely • Zegarra Aguinaga Janeth Alexandra• Farfan Hernandez Kevin Jhonny• Graham Angeles Laura Andrea

INTRODUCTION

• DNA sequencing is a technique that provides a detailed analysis of the structure of DNA and

consists of a set of techniques and biochemical methods that allow us to determine the

sequence of nucleotides (A, C, G, and T) analysis is DNA.

• In the mid-1970s happened a revolution in technology for identifying DNA sequence. In

1977 was published the complete nucleotide sequence of a viral genome (φ X174, 5375

nucleotides long). This milestone in molecular biology occurred in the laboratory of

Frederick Sanger, who identified the amino acid sequence of the polypeptide (insulin) 25

years earlier.

• Bioinformatics is the application of computer technology to information in molecular

biology, encompassing aspects of the acquisition, processing, distribution, analysis,

interpretation and integration of biological information. There are several databases that

organize information and they are often used, which are presented in the following

bioinformatics centers: GenBank (NCBI) and BOLD Systems

• The NCBI database (established in 1988) has a public database, with three components.

Creating databases (store biological data), development of algorithms and statistics to

determine relationships between databases, and use these tools to analyze and interpret

various types of biological data (sequences of DNA, RNA, protein, protein structure, gene

expression, biochemical pathways)

• The Barcode of Life Data Systems (BOLD) is an informatics workbench aiding the

acquisition, storage, analysis, and publication of DNA barcode records. By assembling

molecular, morphological, and distributional data, it bridges a traditional bioinformatics

chasm. BOLD is freely available to any researcher with interests in DNA barcoding. By

providing specialized services, it aids the assembly of records that meet the standards

needed to gain BARCODE designation in the global sequence databases. Because of its web-

based delivery and flexible data security model, it is also well positioned to support projects

that involve broad research alliances.

PROCEDURE(SCREENSHOTS)

To do the analysis of nucleotides we must follow many steps.

STEP 1

STEP 2

Then we have to go to Edit > reverse + complement

STEP 3

STEP 4

• We still in the same sequence, but we are modifying other parts.

STEP 5

STEP 6

STEP 7

STEP 8

STEP 9

• After saved all our sequences in the notepad we have to go to BioEdit

program.

STEP 10

STEP 11

STEP 12

• Then we must create the consensus sequence, so we go to the

option aligment

STEP 13

• We get the consensus sequence and the differtens between some

nucleotides

STEP 14

STEP 15

• Finally we have all the step to do the research of our specie in the

different data bases that exist (NCBI & BOLDsystem)

STEP 16

STEP 17

STEP 18

• Then we have our result…

STEP 19

• Whit this information we find out which kind of living beings is.

STEP 20

• Here we have all the possibilities for our specie.

STEP 21

STEP 22

• After foud all the information about our specie we must go to

BOLDsystem to get detailed information

STEP 23

STEP 24

STEP 25

STEP 26

Then we select, in this case 20, differet sequences that are similar

with our consensu sequence. We can see the similarity in the ítem

IDENT that is in red.

STEP 27

STEP 28

• Then we go to NotePad

STEP 29

STEP 30

Then open the BioEdit program with the other nucleotides

sequences

STEP 31

STEP 32

STEP 33

STEP 34

STEP 35

STEP 36

STEP 37

STEP 38

STEP 39

STEP 40

STEP 41

STEP 42

STEP 43

STEP 44

STEP 45

STEP 46

STEP 47

STEP 48

STEP 49

STEP 50

• The Analysis of result and the conclusions are in the other file because is too long.

top related