applied genomics: toward personalized medicinegenomics nutrigenomics bioinformatics proteomics...

27
Dr Rana Bazzi PhD in Genetics University of Nottingham, UK Applied genomics: toward personalized medicine

Upload: others

Post on 01-Sep-2021

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Applied genomics: toward personalized medicineGenomics Nutrigenomics Bioinformatics Proteomics Genomic variations Relationship between genomic variation, variation in RNA and protein

Dr Rana Bazzi

PhD in Genetics

University of Nottingham, UK

Applied genomics: toward

personalized medicine

Page 2: Applied genomics: toward personalized medicineGenomics Nutrigenomics Bioinformatics Proteomics Genomic variations Relationship between genomic variation, variation in RNA and protein

Contents

• Introduction to the HG

• DNA polymorphisms

• SNP microarrays

• Genetic polymorphism and pharmacogenetics

Page 3: Applied genomics: toward personalized medicineGenomics Nutrigenomics Bioinformatics Proteomics Genomic variations Relationship between genomic variation, variation in RNA and protein

The sequence of the human genome

• Draft annouced in 2000 by IHGSC and celera

• The euchromatin part was sequenced (92%)

• Difficulties to sequence heterochromatin parts

Page 4: Applied genomics: toward personalized medicineGenomics Nutrigenomics Bioinformatics Proteomics Genomic variations Relationship between genomic variation, variation in RNA and protein

The Human Genome

• human genome = 3.3 billion bases

• For most organisms, less than 5% of the total genome has a

genetic function.

• Only 1.2% DNA encodes protein!

• Many genes produce more than one transcript

• Junk DNA-what a rumour?!

Page 5: Applied genomics: toward personalized medicineGenomics Nutrigenomics Bioinformatics Proteomics Genomic variations Relationship between genomic variation, variation in RNA and protein

Genomic revolution

• Discovery of the double-helical structure of DNA 1953

• The Human Genome Project (HGP) 2003:the “project Apollo of biology".

• Advances in genome-based analysis methods increases our ability to explore genome function:

Genomic sequence database, cDNA sequencing, proteomics, databases and bioinformatics.

• Genomics has become a central discipline of biomedical research

• Applying the genomic information to improve human health is a big challenge!!!

Page 6: Applied genomics: toward personalized medicineGenomics Nutrigenomics Bioinformatics Proteomics Genomic variations Relationship between genomic variation, variation in RNA and protein

Genomic disciplines

Toxicogenomics

Structural Genomics

Chemical Genomics

Gene therapy

Functional Genomics

Pharmacogenomics

Comparative

Genomics

Nutrigenomics

Bioinformatics

Proteomics

Page 7: Applied genomics: toward personalized medicineGenomics Nutrigenomics Bioinformatics Proteomics Genomic variations Relationship between genomic variation, variation in RNA and protein

Genomic variations

Relationship between genomic variation, variation in RNA

and protein expression and the biological effects and clinical

effects. Some – but not all – mechanisms of regulation are

also indicated

Page 8: Applied genomics: toward personalized medicineGenomics Nutrigenomics Bioinformatics Proteomics Genomic variations Relationship between genomic variation, variation in RNA and protein

Online Mendelian Inheritance in Man (OMIM)

OMIM reported:

• > 387 human genes of known sequence with a known

phenotype, and

• > 2,310 human phenotypes with a known molecular basis.

• >1,621 confirmed Mendelian phenotypes for which the

molecular basis is not known.

• > 2,084 phenotypes for which a Mendelian basis is suspected

but has not been fully established, or that may exhibit overlap

with other characterized phenotypes

Page 9: Applied genomics: toward personalized medicineGenomics Nutrigenomics Bioinformatics Proteomics Genomic variations Relationship between genomic variation, variation in RNA and protein

• Ensembl is one of the world's primary resources for

genomic research

• Completely sequenced genomes as of January,

2009) http://www.genomesonline.org.

• http://www.ensembl.org/info/website/tutorials/cour

sebook.pdf

Page 10: Applied genomics: toward personalized medicineGenomics Nutrigenomics Bioinformatics Proteomics Genomic variations Relationship between genomic variation, variation in RNA and protein

Structure of a human gene

Page 11: Applied genomics: toward personalized medicineGenomics Nutrigenomics Bioinformatics Proteomics Genomic variations Relationship between genomic variation, variation in RNA and protein

Human DNA polymorphisms

• Usually involve population “polymorphisms”

• – occurrence at a locus of identifiable alternative

alleles each at an appreciable frequency in the

population

• Polymorphic markers

• – Two or more alternative alleles each at frequency >

1%

• – marker alleles can be used to follow the alternative

forms of a “trait / phenotype / disease” gene in family

and population studies

• Alleles <0.01 are “rare variants“

• Potential genetic markers

Page 12: Applied genomics: toward personalized medicineGenomics Nutrigenomics Bioinformatics Proteomics Genomic variations Relationship between genomic variation, variation in RNA and protein

Human DNA polymorphisms

•Genetic polymorphisms have been studied at various levels:

•Phenotypic assessment (such as tests of colour vision)

•Cellular and serological tests (such as the determination of blood groups and HLA types ),

•Metabolic analysis (such as the acetylator status ),

•Direct examination of the nuclear and mitochondrial DNA itself.

•Drug response

•Disease susceptibility

Page 13: Applied genomics: toward personalized medicineGenomics Nutrigenomics Bioinformatics Proteomics Genomic variations Relationship between genomic variation, variation in RNA and protein

DNA Microarray

• Microarray analysis involves the use of what are commonly

called "gene chips" to determine the expression of a large set of

genes at the same time in a single experiment.

• Gene chips can be purchased from several different companies,

eg Affymetrix, or they can be custom prepared in laboratories

with the proper equipment.

• Affymetrix gene chips are created through the covalent

attachment of synthetic oligonucleotides (oligos) to a small

surface. In general, there are 20 or more different oligos on the

chip corresponding to different regions of each gene to be

analyzed. In addition, a set of oligos that each contain a

nucleotide mismatch are included as negative controls for each

gene. The technology of creating gene chips is such that there can

be 10's of thousands of different genes represented on a single

chip approximately 2cm square.

Page 14: Applied genomics: toward personalized medicineGenomics Nutrigenomics Bioinformatics Proteomics Genomic variations Relationship between genomic variation, variation in RNA and protein

Gene Chip

Page 15: Applied genomics: toward personalized medicineGenomics Nutrigenomics Bioinformatics Proteomics Genomic variations Relationship between genomic variation, variation in RNA and protein

Genetic polymorphism

• By definition, a polymorphism is the coexistence of more than one genetic variant in a population and is typically described in terms of wild-type versus mutant alleles.

• Polymorphisms result from heritable mutations that become stable genotypes in a population and appear at an appreciable frequency (>1%).

• Pharmacogenetic polymorphism occurs when the 1% frequency of the mutant allele causes a different drug response or phenotype.

• Functional polymorphism: the genetic variants have functional importance in that they are linked with a variant phenotype.

Page 16: Applied genomics: toward personalized medicineGenomics Nutrigenomics Bioinformatics Proteomics Genomic variations Relationship between genomic variation, variation in RNA and protein

Microsatellites

• Also known as simple tanem repeats (STRs)

• Consist of multiple copies of tandemly repeated sequences

of two (TGTG … TG), three (CAACAA … CAA), or four

nucleotides (AAATAAAT … AAAT)

• They tend to occur in non-coding regions of the DNA and to

a less extent in coding regions

• >10 000 microsatellite polymorphic loci are known

randomly distributed throughout the human genome

• Being small in size they can be readily genotyped (PCR)

Page 17: Applied genomics: toward personalized medicineGenomics Nutrigenomics Bioinformatics Proteomics Genomic variations Relationship between genomic variation, variation in RNA and protein

The use of microsatellites in Genetics

• Being highly polymorphic and highly specific,

they are useful genetic markers that allows:

- Mapping genomes (genetic linkage)

- Biomedical diagnosis (markers for certain disease

conditions).

- DNA testing in forensics

- Parentage analysis/ relatedness of individuals,

etc..

Page 18: Applied genomics: toward personalized medicineGenomics Nutrigenomics Bioinformatics Proteomics Genomic variations Relationship between genomic variation, variation in RNA and protein

STR

Page 19: Applied genomics: toward personalized medicineGenomics Nutrigenomics Bioinformatics Proteomics Genomic variations Relationship between genomic variation, variation in RNA and protein

Microsatellites and

fingerprinting

Page 20: Applied genomics: toward personalized medicineGenomics Nutrigenomics Bioinformatics Proteomics Genomic variations Relationship between genomic variation, variation in RNA and protein

Single Nucleotide Polymorphisms (SNPs)

• Single nucleotide polymorphisms

• More frequent than microsatellites

• Frequencies>1%

• Random distribution / codominant

• Less variable than microsatellites

• Generally only two alleles per site

• Can be identified by various methods (comparisons of

sequences/heteroduplex hybridizations/PCR/DNA chips

• Go to dbSNP (http://www.ncbi.nlm.nih.gov/projects/SNP/)

and find the "dbSNP

Page 21: Applied genomics: toward personalized medicineGenomics Nutrigenomics Bioinformatics Proteomics Genomic variations Relationship between genomic variation, variation in RNA and protein

And their number is still increasing!

• 2001…… 1.42 million SNPs

• 2004……. 4

• 2009…….10 million

• 2011…….13 million!

Page 23: Applied genomics: toward personalized medicineGenomics Nutrigenomics Bioinformatics Proteomics Genomic variations Relationship between genomic variation, variation in RNA and protein

SNP characteristics

• It has the following three characteristics

1) very common in the human genome (a SNP

occurs every 100 to 300 bases along the 3-

billion-base human genome)

2) among the SNPs, two of every three SNPs are the

variations from cytosine (C) to thymine (T)

3) very stable from generation to generation.

Page 24: Applied genomics: toward personalized medicineGenomics Nutrigenomics Bioinformatics Proteomics Genomic variations Relationship between genomic variation, variation in RNA and protein

Applications

• This technique is used:

- In forensics medicine,

- Paternity testing

- Fingerprinting

- Gene mapping (genetic linkage)

- Biomedical diagnostics (markers for certain genetic

disorders)

- Pharmacogenomics

Page 25: Applied genomics: toward personalized medicineGenomics Nutrigenomics Bioinformatics Proteomics Genomic variations Relationship between genomic variation, variation in RNA and protein
Page 26: Applied genomics: toward personalized medicineGenomics Nutrigenomics Bioinformatics Proteomics Genomic variations Relationship between genomic variation, variation in RNA and protein

SNPs

• Partial DNA sequence of exon 7 of African–American patient (top) and wild type

CYP2C9 (lower). Direct sequencing of the sense strand of exon 7 shows the patient

is heterozygous for a G1078A mutation (numbering based on the cDNA).

Page 27: Applied genomics: toward personalized medicineGenomics Nutrigenomics Bioinformatics Proteomics Genomic variations Relationship between genomic variation, variation in RNA and protein

HapMap Project

• The international HapMap Project provides many

kinds of data for researchers, such as the HapMap

genotype data and the phased haplotype data.

• The phased haplotype data describes SNP alleles

on a chromosome inherited from one of father and

mother, while

• The genotype data describes SNP alleles on both

chromosomes inherited from parents