some views on microarray experimental design rainer breitling molecular plant science group &...

23
Some views on microarray experimental design Rainer Breitling Molecular Plant Science Group & Bioinformatics Research Centre University of Glasgow, Scotland, UK

Upload: clementine-pierce

Post on 30-Dec-2015

217 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Some views on microarray experimental design Rainer Breitling Molecular Plant Science Group & Bioinformatics Research Centre University of Glasgow, Scotland,

Some views on microarray experimental design

Rainer BreitlingMolecular Plant Science Group & Bioinformatics Research Centre

University of Glasgow, Scotland, UK

Page 2: Some views on microarray experimental design Rainer Breitling Molecular Plant Science Group & Bioinformatics Research Centre University of Glasgow, Scotland,

Personal Background

• University of Glasgow, Scotland, UK

• Molecular Plant Sciences Group

• Bioinformatics Research Centre

• Functional Genomics Facility

Page 3: Some views on microarray experimental design Rainer Breitling Molecular Plant Science Group & Bioinformatics Research Centre University of Glasgow, Scotland,

Some common questions in microarray experimental design

• How many arrays will I need?

• Should I pool my samples?

• Which arrays should I choose?

• Which samples should I put together on one array?

Page 4: Some views on microarray experimental design Rainer Breitling Molecular Plant Science Group & Bioinformatics Research Centre University of Glasgow, Scotland,

Why are microarrays special?

• produce large amounts of data instantaneously

• can look for unexpected effects

• are still quite expensive

almost never repeated

careful design necessary before you start

Page 5: Some views on microarray experimental design Rainer Breitling Molecular Plant Science Group & Bioinformatics Research Centre University of Glasgow, Scotland,

How many replicates?

• as many as possible

Statistics says: The more replicates, the better your estimate of expression (that’s an asymptotic process, so if you add at least a few replicates, the effect will be really strong)

Page 6: Some views on microarray experimental design Rainer Breitling Molecular Plant Science Group & Bioinformatics Research Centre University of Glasgow, Scotland,

How many replicates?

2

212/1

)/(

)(4

zz

n

•α significance level (probability of detecting FP)

•1-β power to detect differences (probability of detecting TP)

•σ standard deviation of the log-ratios

•δ detectable difference between class mean log-ratios

•z percentile of standard normal distribution

n required number of arrays (reference design)

Page 7: Some views on microarray experimental design Rainer Breitling Molecular Plant Science Group & Bioinformatics Research Centre University of Glasgow, Scotland,

How many replicates?

• Five

Experience shows: For most common experiments you get a reasonable list of differentially expressed genes with 5 replicates

Page 8: Some views on microarray experimental design Rainer Breitling Molecular Plant Science Group & Bioinformatics Research Centre University of Glasgow, Scotland,

How many replicates?

• Three

One to convince yourself, one to convince your boss, one just in case...

Page 9: Some views on microarray experimental design Rainer Breitling Molecular Plant Science Group & Bioinformatics Research Centre University of Glasgow, Scotland,

How many replicates?

• It depends on– the quality of the sample– the magnitude of the expected effect– the experimental design– the method of analysis

Page 10: Some views on microarray experimental design Rainer Breitling Molecular Plant Science Group & Bioinformatics Research Centre University of Glasgow, Scotland,

The quality of the sample

• smaller samples (single cells) are more noisy than large samples (tissue homogenates)

• cell cultures are less noisy than patient biopsies

• sample pooling can decrease noise – if individual variation is not of interest

Page 11: Some views on microarray experimental design Rainer Breitling Molecular Plant Science Group & Bioinformatics Research Centre University of Glasgow, Scotland,

The magnitude of the effect

• Microarrays are very sensitive

• To keep effects small:– use early time points, gentle stimuli– never compare dogs and donuts

• if you get a list of 2000 genes that are significantly changed, your experiment failed!

Page 12: Some views on microarray experimental design Rainer Breitling Molecular Plant Science Group & Bioinformatics Research Centre University of Glasgow, Scotland,

The magnitude of the effect

• some problematic cases– stably transfected cell lines (are they still the

same cells?)– knock-out organisms (even the same tissue

can be a different)– local changes may be diluted cell

isolation will increase noise

Page 13: Some views on microarray experimental design Rainer Breitling Molecular Plant Science Group & Bioinformatics Research Centre University of Glasgow, Scotland,

The experimental design

• Three major options:– reference design (flexible)

– balanced block design (efficient)

– loop design (elegant)

Page 14: Some views on microarray experimental design Rainer Breitling Molecular Plant Science Group & Bioinformatics Research Centre University of Glasgow, Scotland,

The experimental design

• loop designs can save samples...

• ...but they can cause interpretation nightmares in less simple cases (use for large studies, if you have a full-time statistician in the team)

A B

CD

A B C D

R R R R

Page 15: Some views on microarray experimental design Rainer Breitling Molecular Plant Science Group & Bioinformatics Research Centre University of Glasgow, Scotland,

The method of analysis

• Golub et al. (1999) data set

• 38 leukemia patient bone marrow samples, hybridized individually to Affymetrix microarrays

• Differential expression between two leukemia types was examined, using random subsets of the complete dataset

Page 16: Some views on microarray experimental design Rainer Breitling Molecular Plant Science Group & Bioinformatics Research Centre University of Glasgow, Scotland,

The method of analysis  0h 9.5h 11.5h 13.5h 15.5h 18.5h 20.5h

   6144 - purine base metabolism

6099 - tricarboxylic acid cycle

6099 - tricarboxylic acid cycle

3773 - heat shock protein activity

6099 - tricarboxylic acid cycle

     9277 - cell wall (sensu Fungi)

3773 - heat shock protein activity

5749 - respiratory chain complex II (sensu Eukarya)

6099 - tricarboxylic acid cycle

3773 - heat shock protein activity

     297 - spermine transporter activity

6950 - response to stress

6121 - oxidative phosphorylation, succinate to ubiquinone

5977 - glycogen metabolism

5749 - respiratory chain complex II (sensu Eukarya)

     15846 - polyamine transport

297 - spermine transporter activity

8177 - succinate dehydrogenase (ubiquinone) activity

6950 - response to stress

6121 - oxidative phosphorylation, succinate to ubiquinone

       4373 - glycogen (starch) synthase activity

3773 - heat shock protein activity

4373 - glycogen (starch) synthase activity

8177 - succinate dehydrogenase (ubiquinone) activity

       15846 - polyamine transport

4373 - glycogen (starch) synthase activity

4129 - cytochrome c oxidase activity

6537 - glutamate biosynthesis

       5353 - fructose transporter activity

7039 - vacuolar protein catabolism

5751 - respiratory chain complex IV (sensu Eukarya)

6097 - glyoxylate cycle

       15578 - mannose transporter activity

6950 - response to stress

5749 - respiratory chain complex II (sensu Eukarya)

5750 - respiratory chain complex III (sensu Eukarya)

       7039 - vacuolar protein catabolism

4129 - cytochrome c oxidase activity

6121 - oxidative phosphorylation, succinate to ubiquinone

9060 - aerobic respiration

       8645 - hexose transport

5751 - respiratory chain complex IV (sensu Eukarya)

8177 - succinate dehydrogenase (ubiquinone) activity

4129 - cytochrome c oxidase activity

iterativeGroupAnalysis(iGA)

Page 17: Some views on microarray experimental design Rainer Breitling Molecular Plant Science Group & Bioinformatics Research Centre University of Glasgow, Scotland,

glyoxylate

cycle

citrate (TCA) cycle

oxidative phosphorylation

(complex V)

respiratory chaincomplex III

respiratory chaincomplex II

Graph-based iterative

GroupAnalysis (GiGA)

Page 18: Some views on microarray experimental design Rainer Breitling Molecular Plant Science Group & Bioinformatics Research Centre University of Glasgow, Scotland,

What is a good replicate?

The experiment your competitor at the other side of the globe would do to see if your results are reproducible

Vary “all” parameters – challenge your results

Prepare new samples, from new cultures, using new buffers and new graduate students

Remember to produce matched controls

Page 19: Some views on microarray experimental design Rainer Breitling Molecular Plant Science Group & Bioinformatics Research Centre University of Glasgow, Scotland,

What is a “bad” replicate?

• technical replicates (i.e. hybridizing the same sample repeatedly)

• dye-swapping experiments (usually gene-specific dye bias is not a big issue, and dye balancing is more efficient anyway)

• pooled samples, hybridized repeatedly

• the same preparation, only labelled twice

Page 20: Some views on microarray experimental design Rainer Breitling Molecular Plant Science Group & Bioinformatics Research Centre University of Glasgow, Scotland,

Should samples be pooled?

• most samples are already pooled – they come from multiple cells

• pool to increase amount of mRNA, but only as much as necessary

• prepare independent pools to assess variation

• problems: bias, “contamination”, outliers, information loss...

Page 21: Some views on microarray experimental design Rainer Breitling Molecular Plant Science Group & Bioinformatics Research Centre University of Glasgow, Scotland,

Which arrays are the best?

• Standard arrayscompare and exchange data easily

• Whole-genome arraysdetect unexpected effects, increase confidence

• Single-color arrays (Affymetrix GeneChip)for more complex comparisons

• Annotated arrays

Page 22: Some views on microarray experimental design Rainer Breitling Molecular Plant Science Group & Bioinformatics Research Centre University of Glasgow, Scotland,

Further reading

• Dobbin, Shih & Simon (2003) J. Natl. Cancer Inst. 95: 1362.

• Yang & Speed (2002) Nature Rev. Genet. 3: 579.

• Breitling (2004) http://www.brc.dcs.gla.ac.uk/~rb106x/microarray_tips.htm

Page 23: Some views on microarray experimental design Rainer Breitling Molecular Plant Science Group & Bioinformatics Research Centre University of Glasgow, Scotland,

Contact

Rainer Breitling

Bioinformatics Research Centre

Davidson Building A416

[email protected]

http://www.brc.dcs.gla.ac.uk/~rb106x