economic impact of bioinformatics - esp · bioinformatics is the application of information...

120
Economic Impact of Bioinformatics Robert J. Robbins Fred Hutchinson Cancer Research Center 1100 Fairview Avenue North, LV-101 Seattle, Washington 98109 [email protected] (206) 667 2920 http://www.esp.org/rjr/atcc.pdf

Upload: builiem

Post on 16-Feb-2019

235 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

Economic Impact of Bioinformatics

Robert J. RobbinsFred Hutchinson Cancer Research Center

1100 Fairview Avenue North, LV-101Seattle, Washington 98109

[email protected](206) 667 2920

http://www.esp.org/rjr/atcc.pdf

Page 2: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

Abstract

Biotechnology, the "magic technology" of the 21st Century, depends on genomic research and genomics inturn depends upon information technology. Computers are the instruments that allow us to "see" genomes,just as electron microscopes allow us to see viruses.

When information technology becomes critical for any activity, the economics of that field changessubstantially. In the last 25 years, field after field of human endeavor has been transformed when therelentless effects of Moore's Law, delivering exponentially better computers at exponentially lower prices,eventually provide sufficient computational power at affordable prices.

Bioinformatics is the application of information technology to the problems of biology. With $2500 desktopPCs now delivering more raw computing power than the first Cray, bioinformatics is rapidly becoming thecritical technology for 21st Century biology.

DNA is legitimately seen as a biological mass-storage device, making bioinformatics a sine qua non foreffective genomic research. Others areas of biological investigation are also information rich -- for example,an exhaustive tabulation of the Earth's biodiversity would involve a cross-index of the millions of knownspecies against the approximately 500,000,000,000,000 square meters of the Earth's surface.

Access to bioinformatics support and logistics skills are emerging as rate limiting steps for much biomedicalresearch. As 21st Century biology becomes increasing dependent upon bioinformatics and logistics,competitive dynamics will change, possibly leading to the movement of substantial areas of biologicalresearch from the public to the private sector.

Abstract

Biotechnology, the "magic technology" of the 21st Century, depends on genomic research and genomics inturn depends upon information technology. Computers are the instruments that allow us to "see" genomes,just as electron microscopes allow us to see viruses.

When information technology becomes critical for any activity, the economics of that field changessubstantially. In the last 25 years, field after field of human endeavor has been transformed when therelentless effects of Moore's Law, delivering exponentially better computers at exponentially lower prices,eventually provide sufficient computational power at affordable prices.

Bioinformatics is the application of information technology to the problems of biology. With $2500 desktopPCs now delivering more raw computing power than the first Cray, bioinformatics is rapidly becoming thecritical technology for 21st Century biology.

DNA is legitimately seen as a biological mass-storage device, making bioinformatics a sine qua non foreffective genomic research. Others areas of biological investigation are also information rich -- for example,an exhaustive tabulation of the Earth's biodiversity would involve a cross-index of the millions of knownspecies against the approximately 500,000,000,000,000 square meters of the Earth's surface.

Access to bioinformatics support and logistics skills are emerging as rate limiting steps for much biomedicalresearch. As 21st Century biology becomes increasing dependent upon bioinformatics and logistics,competitive dynamics will change, possibly leading to the movement of substantial areas of biologicalresearch from the public to the private sector.

Page 3: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

3

Topics

• Biotechnology will be the magic technology of the 21stCentury.

• Biotechnology will be the magic technology of the 21stCentury.

Page 4: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

4

Topics

• Biotechnology will be the magic technology of the 21stCentury.

• Moore’s Law constantly transforms IT (and everythingelse).

• Biotechnology will be the magic technology of the 21stCentury.

• Moore’s Law constantly transforms IT (and everythingelse).

Page 5: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

5

Topics

• Biotechnology will be the magic technology of the 21stCentury.

• Moore’s Law constantly transforms IT (and everythingelse).

• Information Technology (IT) has a special relationshipwith biology, especially genetics and genomics.

• Biotechnology will be the magic technology of the 21stCentury.

• Moore’s Law constantly transforms IT (and everythingelse).

• Information Technology (IT) has a special relationshipwith biology, especially genetics and genomics.

Page 6: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

6

Topics

• Biotechnology will be the magic technology of the 21stCentury.

• Moore’s Law constantly transforms IT (and everythingelse).

• Information Technology (IT) has a special relationshipwith biology, especially genetics and genomics.

• 21st-Century biology will be based on bioinformaticsand powered by logistics skills.

• Biotechnology will be the magic technology of the 21stCentury.

• Moore’s Law constantly transforms IT (and everythingelse).

• Information Technology (IT) has a special relationshipwith biology, especially genetics and genomics.

• 21st-Century biology will be based on bioinformaticsand powered by logistics skills.

Page 7: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

7

Topics

• Biotechnology will be the magic technology of the 21stCentury.

• Moore’s Law constantly transforms IT (and everythingelse).

• Information Technology (IT) has a special relationshipwith biology, especially genetics and genomics.

• 21st-Century biology will be based on bioinformaticsand powered by logistics skills.

• Currently, support for public bio-informationinfrastructure seems inadequate.

• Biotechnology will be the magic technology of the 21stCentury.

• Moore’s Law constantly transforms IT (and everythingelse).

• Information Technology (IT) has a special relationshipwith biology, especially genetics and genomics.

• 21st-Century biology will be based on bioinformaticsand powered by logistics skills.

• Currently, support for public bio-informationinfrastructure seems inadequate.

Page 8: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

8

Topics

• Biotechnology will be the magic technology of the 21stCentury.

• Moore’s Law constantly transforms IT (and everythingelse).

• Information Technology (IT) has a special relationshipwith biology, especially genetics and genomics.

• 21st-Century biology will be based on bioinformaticsand powered by logistics skills.

• Currently, support for public bio-informationinfrastructure seems inadequate.

• In the future, much current public research may moveinto the private sector.

• Biotechnology will be the magic technology of the 21stCentury.

• Moore’s Law constantly transforms IT (and everythingelse).

• Information Technology (IT) has a special relationshipwith biology, especially genetics and genomics.

• 21st-Century biology will be based on bioinformaticsand powered by logistics skills.

• Currently, support for public bio-informationinfrastructure seems inadequate.

• In the future, much current public research may moveinto the private sector.

Page 9: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

Biotechnology

21st Century Magic

Page 10: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

10

Magic

To a person from 1897, much currenttechnology would seem like magic.

Page 11: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

11

Magic

To a person from 1897, much currenttechnology would seem like magic.

What technology of 2097 would seemmagical to a person from 1997?

Page 12: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

12

Magic

To a person from 1897, much currenttechnology would seem like magic.

What technology of 2097 would seemmagical to a person from 1997?

Candidate: Biotechnology so advancedthat the distinction between living andnon-living is blurred.

Candidate: Biotechnology so advancedthat the distinction between living andnon-living is blurred.

Page 13: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

Moore’s Law

Transforms InfoTech(and everything else)

Page 14: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

14

Moore’s Law: The Statement

Every eighteen months, thenumber of transistors that canbe placed on a chip doubles.

Gordon Moore, co-founder of Intel...Gordon Moore, co-founder of Intel...

Page 15: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

15

Moore’s Law: The EffectP

erfo

rman

ce(c

on

stan

t co

st)

Time

100,000

10,000

1,000

100

10

Page 16: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

16

Moore’s Law: The EffectP

erfo

rman

ce(c

on

stan

t co

st)

Time

100,000

10,000

1,000

100

10

P

Page 17: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

17

Moore’s Law: The EffectP

erfo

rman

ce(c

on

stan

t co

st)

Cos

t(c

on

stan

t p

erf

orm

an

ce)

Time

100,000 100,000

10,000

1,000

100

10

10,000

1,000

100

10

P

Page 18: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

18

Moore’s Law: The Effect

Three Phases of Novel IT Applications

• It’s Impossible

Three Phases of Novel IT Applications

• It’s Impossible

Page 19: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

19

Moore’s Law: The Effect

Three Phases of Novel IT Applications

• It’s Impossible

• It’s Impractical

Three Phases of Novel IT Applications

• It’s Impossible

• It’s Impractical

Page 20: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

20

Moore’s Law: The Effect

Three Phases of Novel IT Applications

• It’s Impossible

• It’s Impractical

• It’s Overdue

Three Phases of Novel IT Applications

• It’s Impossible

• It’s Impractical

• It’s Overdue

Page 21: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

21

Moore’s Law: The EffectP

erfo

rman

ce(c

on

stan

t co

st)

Cos

t(c

on

stan

t p

erf

orm

an

ce)

Time

100,000 100,000

10,000

1,000

100

10

10,000

1,000

100

10

P

Page 22: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

22

Moore’s Law: The EffectP

erfo

rman

ce(c

on

stan

t co

st)

Cos

t(c

on

stan

t p

erf

orm

an

ce)

Time

100,000 100,000

10,000

1,000

100

10

10,000

1,000

100

10

P

Page 23: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

23

Moore’s Law: The EffectP

erfo

rman

ce(c

on

stan

t co

st)

Cos

t(c

on

stan

t p

erf

orm

an

ce)

Time

100,000 100,000

10,000

1,000

100

10

10,000

1,000

100

10

P

Page 24: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

24

Moore’s Law: The EffectP

erfo

rman

ce(c

on

stan

t co

st)

Cos

t(c

on

stan

t p

erf

orm

an

ce)

Time

100,000 100,000

10,000

1,000

100

10

10,000

1,000

100

10

D

P

C

Page 25: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

25

Moore’s Law: The EffectP

erfo

rman

ce(c

on

stan

t co

st)

Cos

t(c

on

stan

t p

erf

orm

an

ce)

Time

100,000 100,000

10,000

1,000

100

10

10,000

1,000

100

10

D

P

C

Page 26: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

26

Moore’s Law: The EffectP

erfo

rman

ce(c

on

stan

t co

st)

Cos

t(c

on

stan

t p

erf

orm

an

ce)

Time

100,000 100,000

10,000

1,000

100

10

10,000

1,000

100

10

D

P

AA

C

Page 27: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

27

Moore’s Law: The EffectP

erfo

rman

ce(c

on

stan

t co

st)

Cos

t(c

on

stan

t p

erf

orm

an

ce)

Time

100,000 100,000

10,000

1,000

100

10

10,000

1,000

100

10

D

P

A

C

Page 28: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

28

Moore’s Law: The Effect

• The first to understand and deploy new ITcapabilities often seize great competitiveadvantage.

• The first to understand and deploy new ITcapabilities often seize great competitiveadvantage.

Page 29: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

29

Moore’s Law: The Effect

• The first to understand and deploy new ITcapabilities often seize great competitiveadvantage.

• As improved uses of technology aredeveloped, “business” processes change.

• The first to understand and deploy new ITcapabilities often seize great competitiveadvantage.

• As improved uses of technology aredeveloped, “business” processes change.

Page 30: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

30

Moore’s Law: The Effect

• The first to understand and deploy new ITcapabilities often seize great competitiveadvantage.

• As improved uses of technology aredeveloped, “business” processes change.

• Ultimately, access to appropriate ITbecomes essential for simple existence.

• The first to understand and deploy new ITcapabilities often seize great competitiveadvantage.

• As improved uses of technology aredeveloped, “business” processes change.

• Ultimately, access to appropriate ITbecomes essential for simple existence.

Page 31: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

31

Moore’s Law: The Effect

• The first to understand and deploy new ITcapabilities often seize great competitiveadvantage.

• As improved uses of technology aredeveloped, “business” processes change.

• Ultimately, access to appropriate ITbecomes essential for simple existence.

• The first to understand and deploy new ITcapabilities often seize great competitiveadvantage.

• As improved uses of technology aredeveloped, “business” processes change.

• Ultimately, access to appropriate ITbecomes essential for simple existence.

Relevance for biology?Relevance for biology?

Page 32: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

32

Cost (constant performance)

1975 1980 1985 1990 1995 2000 2005

1,000

10,000

100,000

1,000,000

10,000,000 UniversityPurchase

Page 33: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

33

Cost (constant performance)

1975 1980 1985 1990 1995 2000 2005

1,000

10,000

100,000

1,000,000

10,000,000 UniversityPurchase

DepartmentPurchase

Page 34: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

34

Cost (constant performance)

1975 1980 1985 1990 1995 2000 2005

1,000

10,000

100,000

1,000,000

10,000,000

RO1 GrantPurchase

UniversityPurchase

DepartmentPurchase

Page 35: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

35

Cost (constant performance)

1975 1980 1985 1990 1995 2000 2005

1,000

10,000

100,000

1,000,000

10,000,000

PersonalPurchase

RO1 GrantPurchase

UniversityPurchase

DepartmentPurchase

Page 36: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

36

Cost (constant performance)

1975 1980 1985 1990 1995 2000 2005

1,000

10,000

100,000

1,000,000

10,000,000

PersonalPurchase

RO1 GrantPurchase

UniversityPurchase

DepartmentPurchase

UnplannedPurchases

Page 37: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

IT-BiologySynergismIT-BiologySynergism

Page 38: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

38

IT is Special

Information Technology:

• affects the performance and themanagement of tasks

Information Technology:

• affects the performance and themanagement of tasks

Page 39: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

39

IT is Special

Information Technology:

• affects the performance and themanagement of tasks

• allows the manipulation of hugeamounts of highly complex data

Information Technology:

• affects the performance and themanagement of tasks

• allows the manipulation of hugeamounts of highly complex data

Page 40: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

40

IT is Special

Information Technology:

• affects the performance and themanagement of tasks

• allows the manipulation of hugeamounts of highly complex data

• is incredibly plastic

Information Technology:

• affects the performance and themanagement of tasks

• allows the manipulation of hugeamounts of highly complex data

• is incredibly plastic(programming and poetry are both exercises in pure thought)

Page 41: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

41

IT is Special

Information Technology:

• affects the performance and themanagement of tasks

• allows the manipulation of hugeamounts of highly complex data

• is incredibly plastic

Information Technology:

• affects the performance and themanagement of tasks

• allows the manipulation of hugeamounts of highly complex data

• is incredibly plastic(programming and poetry are both exercises in pure thought)

• improves exponentially(Moore’s Law)

Page 42: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

42

Biology is Special

Life is Characterized by:

• individuality

Life is Characterized by:

• individuality

Page 43: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

43

Biology is Special

Life is Characterized by:

• individuality

• historicity

Life is Characterized by:

• individuality

• historicity

Page 44: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

44

Biology is Special

Life is Characterized by:

• individuality

• historicity

• contingency

Life is Characterized by:

• individuality

• historicity

• contingency

Page 45: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

45

Biology is Special

Life is Characterized by:

• individuality

• historicity

• contingency

• high (digital) information content

Life is Characterized by:

• individuality

• historicity

• contingency

• high (digital) information content

Page 46: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

46

Biology is Special

Life is Characterized by:

• individuality

• historicity

• contingency

• high (digital) information content

Life is Characterized by:

• individuality

• historicity

• contingency

• high (digital) information content

No law of large numbers...No law of large numbers...

Page 47: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

47

Biology is Special

Life is Characterized by:

• individuality

• historicity

• contingency

• high (digital) information content

Life is Characterized by:

• individuality

• historicity

• contingency

• high (digital) information content

No law of large numbers, since everyliving thing is genuinely unique.No law of large numbers, since everyliving thing is genuinely unique.

Page 48: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

48

IT-Biology Synergism

• Physics needs calculus, the method formanipulating information aboutstatistically large numbers of vanishinglysmall, independent, equivalent things.

• Physics needs calculus, the method formanipulating information aboutstatistically large numbers of vanishinglysmall, independent, equivalent things.

Page 49: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

49

IT-Biology Synergism

• Physics needs calculus, the method formanipulating information aboutstatistically large numbers of vanishinglysmall, independent, equivalent things.

• Biology needs information technology, themethod for manipulating informationabout large numbers of dependent,historically contingent, individual things.

• Physics needs calculus, the method formanipulating information aboutstatistically large numbers of vanishinglysmall, independent, equivalent things.

• Biology needs information technology, themethod for manipulating informationabout large numbers of dependent,historically contingent, individual things.

Page 50: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

50

Biology is Special

For it is in relation to the statistical point of viewthat the structure of the vital parts of livingorganisms differs so entirely from that of anypiece of matter that we physicists and chemistshave ever handled in our laboratories ormentally at our writing desks.

For it is in relation to the statistical point of viewthat the structure of the vital parts of livingorganisms differs so entirely from that of anypiece of matter that we physicists and chemistshave ever handled in our laboratories ormentally at our writing desks.

Erwin Schrödinger. 1944. What is Life.Erwin Schrödinger. 1944. What is Life.

Page 51: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

51

Genetics as Code

[The] chromosomes ... contain in some kind of code-script the entire pattern of the individual's futuredevelopment and of its functioning in the mature state.... [By] code-script we mean that the all-penetratingmind, once conceived by Laplace, to which everycausal connection lay immediately open, could tellfrom their structure whether [an egg carrying them]would develop, under suitable conditions, into a blackcock or into a speckled hen, into a fly or a maize plant,a rhodo-dendron, a beetle, a mouse, or a woman.

[The] chromosomes ... contain in some kind of code-script the entire pattern of the individual's futuredevelopment and of its functioning in the mature state.... [By] code-script we mean that the all-penetratingmind, once conceived by Laplace, to which everycausal connection lay immediately open, could tellfrom their structure whether [an egg carrying them]would develop, under suitable conditions, into a blackcock or into a speckled hen, into a fly or a maize plant,a rhodo-dendron, a beetle, a mouse, or a woman.

Erwin Schrödinger. 1944. What is Life.Erwin Schrödinger. 1944. What is Life.

Page 52: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

52

One Human SequenceWe now know thatSchrödinger’s mysterioushuman “code-script”consists of 3.3 billionbase pairs of DNA.

Page 53: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

53

One Human Sequence

Typed in 10-pitch font, one human sequence would stretch for morethan 5,000 miles. Digitally formatted, it could be stored on one CD-ROM. Biologically encoded, it fits easily within a single cell.

We now know thatSchrödinger’s mysterioushuman “code-script”consists of 3.3 billionbase pairs of DNA.

Page 54: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

54

Bio-digital Information

DNA is a highly efficient digital storage device:

• There is more mass-storage capacity in theDNA of a side of beef than in all the hard drivesof all the world’s computers.

DNA is a highly efficient digital storage device:

• There is more mass-storage capacity in theDNA of a side of beef than in all the hard drivesof all the world’s computers.

Page 55: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

55

Bio-digital Information

DNA is a highly efficient digital storage device:

• There is more mass-storage capacity in theDNA of a side of beef than in all the hard drivesof all the world’s computers.

• Storing all of the (redundant) information in allof the world’s DNA on computer hard diskswould require that the entire surface of the Earthbe covered to a depth of three miles in Conner1.0 gB drives.

DNA is a highly efficient digital storage device:

• There is more mass-storage capacity in theDNA of a side of beef than in all the hard drivesof all the world’s computers.

• Storing all of the (redundant) information in allof the world’s DNA on computer hard diskswould require that the entire surface of the Earthbe covered to a depth of three miles in Conner1.0 gB drives.

Page 56: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

Genomics:An ExampleGenomics:

An Example

Page 57: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

57

Human Genome Project - Goals– construction of a high-resolution genetic map of the human

genome;– construction of a high-resolution genetic map of the human

genome;

USDOE. 1990. Understanding Our Genetic Inheritance.The U.S. Human Genome Project: The First Five Years.

USDOE. 1990. Understanding Our Genetic Inheritance.The U.S. Human Genome Project: The First Five Years.

Page 58: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

58

Human Genome Project - Goals– construction of a high-resolution genetic map of the human

genome;

– production of a variety of physical maps of all humanchromosomes and of the DNA of selected modelorganisms;

– construction of a high-resolution genetic map of the humangenome;

– production of a variety of physical maps of all humanchromosomes and of the DNA of selected modelorganisms;

USDOE. 1990. Understanding Our Genetic Inheritance.The U.S. Human Genome Project: The First Five Years.

USDOE. 1990. Understanding Our Genetic Inheritance.The U.S. Human Genome Project: The First Five Years.

Page 59: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

59

Human Genome Project - Goals– construction of a high-resolution genetic map of the human

genome;

– production of a variety of physical maps of all humanchromosomes and of the DNA of selected modelorganisms;

– determination of the complete sequence of human DNA andof the DNA of selected model organisms;

– construction of a high-resolution genetic map of the humangenome;

– production of a variety of physical maps of all humanchromosomes and of the DNA of selected modelorganisms;

– determination of the complete sequence of human DNA andof the DNA of selected model organisms;

USDOE. 1990. Understanding Our Genetic Inheritance.The U.S. Human Genome Project: The First Five Years.

USDOE. 1990. Understanding Our Genetic Inheritance.The U.S. Human Genome Project: The First Five Years.

Page 60: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

60

Human Genome Project - Goals– construction of a high-resolution genetic map of the human

genome;

– production of a variety of physical maps of all humanchromosomes and of the DNA of selected modelorganisms;

– determination of the complete sequence of human DNA andof the DNA of selected model organisms;

– development of capabilities for collecting, storing,distributing, and analyzing the data produced;

– construction of a high-resolution genetic map of the humangenome;

– production of a variety of physical maps of all humanchromosomes and of the DNA of selected modelorganisms;

– determination of the complete sequence of human DNA andof the DNA of selected model organisms;

– development of capabilities for collecting, storing,distributing, and analyzing the data produced;

USDOE. 1990. Understanding Our Genetic Inheritance.The U.S. Human Genome Project: The First Five Years.

USDOE. 1990. Understanding Our Genetic Inheritance.The U.S. Human Genome Project: The First Five Years.

Page 61: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

61

Human Genome Project - Goals– construction of a high-resolution genetic map of the human

genome;

– production of a variety of physical maps of all humanchromosomes and of the DNA of selected modelorganisms;

– determination of the complete sequence of human DNA andof the DNA of selected model organisms;

– development of capabilities for collecting, storing,distributing, and analyzing the data produced;

– creation of appropriate technologies necessary to achievethese objectives.

– construction of a high-resolution genetic map of the humangenome;

– production of a variety of physical maps of all humanchromosomes and of the DNA of selected modelorganisms;

– determination of the complete sequence of human DNA andof the DNA of selected model organisms;

– development of capabilities for collecting, storing,distributing, and analyzing the data produced;

– creation of appropriate technologies necessary to achievethese objectives.

USDOE. 1990. Understanding Our Genetic Inheritance.The U.S. Human Genome Project: The First Five Years.

USDOE. 1990. Understanding Our Genetic Inheritance.The U.S. Human Genome Project: The First Five Years.

Page 62: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

62

Infrastructure and the HGP

Progress towards all of the [Genome Project]goals will require the establishment of well-funded centralized facilities, including a stockcenter for the cloned DNA fragmentsgenerated in the mapping and sequencingeffort and a data center for the computer-basedcollection and distribution of large amounts ofDNA sequence information.

Progress towards all of the [Genome Project]goals will require the establishment of well-funded centralized facilities, including a stockcenter for the cloned DNA fragmentsgenerated in the mapping and sequencingeffort and a data center for the computer-basedcollection and distribution of large amounts ofDNA sequence information.

National Research Council. 1988. Mapping and Sequencing theHuman Genome. Washington, DC: National Academy Press. p. 3

National Research Council. 1988. Mapping and Sequencing theHuman Genome. Washington, DC: National Academy Press. p. 3

Page 63: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

63

GenBank Totals (Release 103)

DIVISION

Phage Sequences (PHG)Viral Sequences (VRL)

Bacteria (BCT)

Plant, Fungal, and Algal Sequences (PLN)

Invertebrate Sequences (INV)

Rodent Sequences (ROD)Primate Sequences (PRI1–2)

Other Mammals (MAM)Other Vertebrate Sequences (VRT)

High-Throughput Genome Sequences (HTG)

Genome Survey Sequences (GSS)Structural RNA Sequences (RNA)

Sequence Tagged Sites Sequences (STS)Patent Sequences (PAT)

Synthetic Sequences (SYN)Unannotated Sequences (UNA)

EST1-17

TOTALS

Entries

1,313 45,355 38,023

44,553

29,657

36,967 75,587 12,744 17,713

1,120

42,628 4,802 52,824 87,767 2,577 2,480

1,269,737

1,765,847

Base Pairs

2,138,810 44,484,848 88,576,641

92,259,434

105,703,550

45,437,309 134,944,314 12,358,310 17,040,159

72,064,395

22,783,326 2,487,397 18,161,532 27,593,724 5,698,945 1,933,676

466,634,317

1,160,300,687

Per Cent

0.184%3.834%7.634%

7.951%

9.110%

3.916%11.630%

1.065%1.469%

6.211%

1.964%0.214%1.565%2.378%0.491%0.167%

40.217%

100.000%

Per Cent

0.074%2.568%2.153%

2.523%

1.679%

2.093%4.280%0.722%1.003%

0.063%

2.414%0.272%2.991%4.970%0.146%0.140%

71.905%

100.000%

Page 64: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

64

Base Pairs in GenBank

0

2 0 0 ,0 0 0 ,0 0 0

4 0 0 ,0 0 0 ,0 0 0

6 0 0 ,0 0 0 ,0 0 0

8 0 0 ,0 0 0 ,0 0 0

1 ,0 0 0 ,0 0 0 ,0 0 0

1 ,2 0 0 ,0 0 0 ,0 0 0

0 5 10 15 20 25 30 35 40 45 50 55 60 65 70 75 80 85 90 95 100 105

GenBank Release Numbers

9493929190898887 95 96 97

Page 65: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

65

Base Pairs in GenBank

0

2 0 0 ,0 0 0 ,0 0 0

4 0 0 ,0 0 0 ,0 0 0

6 0 0 ,0 0 0 ,0 0 0

8 0 0 ,0 0 0 ,0 0 0

1 ,0 0 0 ,0 0 0 ,0 0 0

1 ,2 0 0 ,0 0 0 ,0 0 0

0 5 10 15 20 25 30 35 40 45 50 55 60 65 70 75 80 85 90 95 100 105

GenBank Release Numbers

9493929190898887 95 96 97

Growth in GenBank is exponential.More data were added in the last tenweeks than were added in the first tenyears of the project.

Growth in GenBank is exponential.More data were added in the last tenweeks than were added in the first tenyears of the project.

Page 66: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

66

Base Pairs in GenBank

0

2 0 0 ,0 0 0 ,0 0 0

4 0 0 ,0 0 0 ,0 0 0

6 0 0 ,0 0 0 ,0 0 0

8 0 0 ,0 0 0 ,0 0 0

1 ,0 0 0 ,0 0 0 ,0 0 0

1 ,2 0 0 ,0 0 0 ,0 0 0

0 5 10 15 20 25 30 35 40 45 50 55 60 65 70 75 80 85 90 95 100 105

GenBank Release Numbers

9493929190898887 95 96 97

Growth in GenBank is exponential.More data were added in the last tenweeks than were added in the first tenyears of the project.

Growth in GenBank is exponential.More data were added in the last tenweeks than were added in the first tenyears of the project.

At this rate, what’s next...At this rate, what’s next...

Page 67: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

67

ABI Bass-o-Matic Sequencer

In with the sample, out with the sequence...

TGCGCATCGCGTATCGATAG

speed

gB/sec

EnterDefrost

7 8 9

4 5 6

1 2 3

0

+

-

Page 68: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

68

What’s Really Next

The post-genome era in biologicalresearch will take for granted readyaccess to huge amounts of genomicdata.

The challenge will be understandingthose data and using the understandingto solve real-world problems...

Page 69: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

69

Base Pairs in GenBank (Percent Increase)

0 .0 0 %

1 0 .0 0 %

2 0 .0 0 %

3 0 .0 0 %

4 0 .0 0 %

5 0 .0 0 %

6 0 .0 0 %

7 0 .0 0 %

8 0 .0 0 %

9 0 .0 0 %

1 0 0 .0 0 %

85 86 87 88 89 90 91 92 93 94 95 96

Year

Percent Increaseaverage = 56%

Percent Increaseaverage = 56%

Page 70: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

70

Projected Base Pairs

11 0

1 0 01 ,0 0 0

1 0 ,0 0 01 0 0 ,0 0 0

1 ,0 0 0 ,0 0 01 0 ,0 0 0 ,0 0 0

1 0 0 ,0 0 0 ,0 0 01 ,0 0 0 ,0 0 0 ,0 0 0

1 0 ,0 0 0 ,0 0 0 ,0 0 01 0 0 ,0 0 0 ,0 0 0 ,0 0 0

1 ,0 0 0 ,0 0 0 ,0 0 0 ,0 0 01 0 ,0 0 0 ,0 0 0 ,0 0 0 ,0 0 0

1 0 0 ,0 0 0 ,0 0 0 ,0 0 0 ,0 0 0

90 95 0 5 10 15 20 25

Year

Assumed annual growth rate: 50%(less than current rate)

Page 71: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

71

Projected Base Pairs

11 0

1 0 01 ,0 0 0

1 0 ,0 0 01 0 0 ,0 0 0

1 ,0 0 0 ,0 0 01 0 ,0 0 0 ,0 0 0

1 0 0 ,0 0 0 ,0 0 01 ,0 0 0 ,0 0 0 ,0 0 0

1 0 ,0 0 0 ,0 0 0 ,0 0 01 0 0 ,0 0 0 ,0 0 0 ,0 0 0

1 ,0 0 0 ,0 0 0 ,0 0 0 ,0 0 01 0 ,0 0 0 ,0 0 0 ,0 0 0 ,0 0 0

1 0 0 ,0 0 0 ,0 0 0 ,0 0 0 ,0 0 0

90 95 0 5 10 15 20 25

Year

Assumed annual growth rate: 50%(less than current rate)

Is this crazy?One trillion bp by 2015100 trillion by 2025

Is this crazy?One trillion bp by 2015100 trillion by 2025

Page 72: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

72

Projected Base Pairs

11 0

1 0 01 ,0 0 0

1 0 ,0 0 01 0 0 ,0 0 0

1 ,0 0 0 ,0 0 01 0 ,0 0 0 ,0 0 0

1 0 0 ,0 0 0 ,0 0 01 ,0 0 0 ,0 0 0 ,0 0 0

1 0 ,0 0 0 ,0 0 0 ,0 0 01 0 0 ,0 0 0 ,0 0 0 ,0 0 0

1 ,0 0 0 ,0 0 0 ,0 0 0 ,0 0 01 0 ,0 0 0 ,0 0 0 ,0 0 0 ,0 0 0

1 0 0 ,0 0 0 ,0 0 0 ,0 0 0 ,0 0 0

90 95 0 5 10 15 20 25

Year

Is this crazy?One trillion bp by 2015100 trillion by 2025

Is this crazy?One trillion bp by 2015100 trillion by 2025

500,00050,0005,000

Maybe not...Maybe not...

Projected database size,indicated as the number ofbase pairs per individualmedical record in the US.

Page 73: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

21st CenturyBiology

Post-Genome Era

Page 74: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

74

The Post-Genome Era

Post-genome research involves:

• applying genomic tools and knowledge to moregeneral problems

• asking new questions, tractable only to genomicor post-genomic analysis

• moving beyond the structural genomics of thehuman genome project and into the functionalgenomics of the post-genome era

Post-genome research involves:

• applying genomic tools and knowledge to moregeneral problems

• asking new questions, tractable only to genomicor post-genomic analysis

• moving beyond the structural genomics of thehuman genome project and into the functionalgenomics of the post-genome era

Page 75: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

75

The Post-Genome Era

Suggested definition:

• functional genomics = biology

Suggested definition:

• functional genomics = biology

Page 76: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

76

The Post-Genome Era

An early analysis:An early analysis:

Walter Gilbert. 1991. Towards a paradigmshift in biology. Nature, 349:99.

Page 77: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

77

Paradigm Shift in Biology

To use [the] flood of knowledge, which will pouracross the computer networks of the world,biologists not only must become computerliterate, but also change their approach to theproblem of understanding life.

To use [the] flood of knowledge, which will pouracross the computer networks of the world,biologists not only must become computerliterate, but also change their approach to theproblem of understanding life.

Walter Gilbert. 1991. Towards a paradigm shift in biology. Nature, 349:99.Walter Gilbert. 1991. Towards a paradigm shift in biology. Nature, 349:99.

Page 78: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

78

Paradigm Shift in Biology

The new paradigm, now emerging, is that all the‘genes’ will be known (in the sense of beingresident in databases available electronically),and that the starting point of a biologicalinvestigation will be theoretical. An individualscientist will begin with a theoretical conjecture,only then turning to experiment to follow or testthat hypothesis.

The new paradigm, now emerging, is that all the‘genes’ will be known (in the sense of beingresident in databases available electronically),and that the starting point of a biologicalinvestigation will be theoretical. An individualscientist will begin with a theoretical conjecture,only then turning to experiment to follow or testthat hypothesis.

Walter Gilbert. 1991. Towards a paradigm shift in biology. Nature, 349:99.Walter Gilbert. 1991. Towards a paradigm shift in biology. Nature, 349:99.

Page 79: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

79

Paradigm Shift in Biology

Case of Microbiology

< 5,000 known and described bacteria

5,000,000 base pairs per genome

25,000,000,000 TOTAL base pairs

If a full, annotated sequence were available for all known bacteria, the practiceof microbiology would match Gilbert’s prediction.

If a full, annotated sequence were available for all known bacteria, the practiceof microbiology would match Gilbert’s prediction.

Page 80: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

21st CenturyBiology

The Science

Page 81: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

81

Fundamental Dogma

The fundamental dogma of molecular biologyis that genes act to create phenotypes througha flow of information from DNA to RNA toproteins, to interactions among proteins(regulatory circuits and metabolic pathways),and ultimately to phenotypes.

Collections of individual phenotypes, ofcourse, constitute a population.

DNA

RNA

Proteins

Circuits

Phenotypes

Populations

Page 82: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

82

Fundamental DogmaDNA

RNA

Proteins

Circuits

Phenotypes

Populations

GenBankEMBLDDBJ

MapDatabases

SwissPROTPIR

PDB

Although a few databases already existto distribute molecular information,

Although a few databases already existto distribute molecular information,

Page 83: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

83

Fundamental DogmaDNA

RNA

Proteins

Circuits

Phenotypes

Populations

GenBankEMBLDDBJ

MapDatabases

SwissPROTPIR

PDB

Gene Expression?

Clinical Data ?

Regulatory Pathways?Metabolism?

Biodiversity?

Neuroanatomy?

Development ?

Molecular Epidemiology?

Comparative Genomics?

the post-genomic era will need manymore to collect, manage, and publishthe coming flood of new findings.

the post-genomic era will need manymore to collect, manage, and publishthe coming flood of new findings.

Although a few databases already existto distribute molecular information,

Although a few databases already existto distribute molecular information,

Page 84: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

21st CenturyBiology

The People

Page 85: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

85

Human Resources Issues

• Reduction in need for non-IT staff• Reduction in need for non-IT staff

Page 86: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

86

Human Resources Issues

• Reduction in need for non-IT staff

• Increase in need for IT staff, especially“information engineers”

• Reduction in need for non-IT staff

• Increase in need for IT staff, especially“information engineers”

Page 87: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

87

Human Resources Issues

• Reduction in need for non-IT staff

• Increase in need for IT staff, especially“information engineers”

• Reduction in need for non-IT staff

• Increase in need for IT staff, especially“information engineers”

In modern biology, a general trend is toconvert expert work into staff work andfinally into computation. New expertise isrequired to design, carry out, and interpretcontinuing work.

In modern biology, a general trend is toconvert expert work into staff work andfinally into computation. New expertise isrequired to design, carry out, and interpretcontinuing work.

Page 88: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

88

Human Resources Issues

Elbert Branscomb: “You must recognize thatsome day you may need as many computerscientists as biologists in your labs.”

Elbert Branscomb: “You must recognize thatsome day you may need as many computerscientists as biologists in your labs.”

Page 89: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

89

Human Resources Issues

Elbert Branscomb: “You must recognize thatsome day you may need as many computerscientists as biologists in your labs.”

Craig Venter: “At TIGR, we already havetwice as many computer scientists on ourstaff.”

Elbert Branscomb: “You must recognize thatsome day you may need as many computerscientists as biologists in your labs.”

Craig Venter: “At TIGR, we already havetwice as many computer scientists on ourstaff.”

Exchange at DOE workshop on high-throughput sequencing.Exchange at DOE workshop on high-throughput sequencing.

Page 90: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

Funding forBio-InformationInfrastructure

Funding forBio-InformationInfrastructure

Page 91: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

91

Call for Change

Among the many new tools that are or will be needed (for 21st-century biology), some of those having the highest priority are:

• bioinformatics

• computational biology

• functional imaging tools using biosensors and biomarkers

• transformation and transient expression technologies

• nanotechnologies

Among the many new tools that are or will be needed (for 21st-century biology), some of those having the highest priority are:

• bioinformatics

• computational biology

• functional imaging tools using biosensors and biomarkers

• transformation and transient expression technologies

• nanotechnologies

Impact of Emerging Technologies on the Biological Sciences: Report of aWorkshop. NSF-supported workshop, held 26-27 June 1995, Washington, DC.

Impact of Emerging Technologies on the Biological Sciences: Report of aWorkshop. NSF-supported workshop, held 26-27 June 1995, Washington, DC.

Page 92: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

92

The Problem

• IT moves at “Internet Speed” and respondsrapidly to market forces.

• IT moves at “Internet Speed” and respondsrapidly to market forces.

Page 93: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

93

The Problem

• IT moves at “Internet Speed” and respondsrapidly to market forces.

• IT will play a central role in 21st Centurybiology.

• IT moves at “Internet Speed” and respondsrapidly to market forces.

• IT will play a central role in 21st Centurybiology.

Page 94: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

94

The Problem

• IT moves at “Internet Speed” and respondsrapidly to market forces.

• IT will play a central role in 21st Centurybiology.

• Current levels of support for public bio-information infrastructure are too low.

• IT moves at “Internet Speed” and respondsrapidly to market forces.

• IT will play a central role in 21st Centurybiology.

• Current levels of support for public bio-information infrastructure are too low.

Page 95: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

95

The Problem

• IT moves at “Internet Speed” and respondsrapidly to market forces.

• IT will play a central role in 21st Centurybiology.

• Current levels of support for public bio-information infrastructure are too low.

• Reallocation of federal funding is difficult,and subject to political pressures.

• IT moves at “Internet Speed” and respondsrapidly to market forces.

• IT will play a central role in 21st Centurybiology.

• Current levels of support for public bio-information infrastructure are too low.

• Reallocation of federal funding is difficult,and subject to political pressures.

Page 96: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

96

The Problem

• IT moves at “Internet Speed” and respondsrapidly to market forces.

• IT will play a central role in 21st Centurybiology.

• Current levels of support for public bio-information infrastructure are too low.

• Reallocation of federal funding is difficult,and subject to political pressures.

• Federal-funding decision processes areponderously slow and inefficient.

• IT moves at “Internet Speed” and respondsrapidly to market forces.

• IT will play a central role in 21st Centurybiology.

• Current levels of support for public bio-information infrastructure are too low.

• Reallocation of federal funding is difficult,and subject to political pressures.

• Federal-funding decision processes areponderously slow and inefficient.

Page 97: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

97

Federal Funding of Bio-Databases

The challenges:The challenges:

Page 98: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

98

Federal Funding of Bio-Databases

The challenges:

• providing adequate funding levels

The challenges:

• providing adequate funding levels

Page 99: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

99

Federal Funding of Bio-Databases

The challenges:

• providing adequate funding levels

• making timely, efficient decisions

The challenges:

• providing adequate funding levels

• making timely, efficient decisions

Page 100: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

IT Budgets

A Reality Check

Page 101: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

101

Rhetorical Question

Which is likely to be more complex:

• identifying, documenting, and tracking thewhereabouts of all parcels in transit in the US atone time

Which is likely to be more complex:

• identifying, documenting, and tracking thewhereabouts of all parcels in transit in the US atone time

Page 102: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

102

Rhetorical Question

Which is likely to be more complex:

• identifying, documenting, and tracking thewhereabouts of all parcels in transit in the US atone time

• identifying, documenting, and analyzing thestructure and function of all individual genes inall economically significant organisms; thenanalyzing all significant gene-gene and gene-environment interactions in those organismsand their environments

Which is likely to be more complex:

• identifying, documenting, and tracking thewhereabouts of all parcels in transit in the US atone time

• identifying, documenting, and analyzing thestructure and function of all individual genes inall economically significant organisms; thenanalyzing all significant gene-gene and gene-environment interactions in those organismsand their environments

Page 103: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

103

Business Factoids

United Parcel Service:

• uses two redundant 3 Terabyte (yes, 3000 GB)databases to track all packages in transit.

• has 4,000 full-time employees dedicated to IT

• spends one billion dollars per year on IT

• has an income of 1.1 billion dollars, againstrevenues of 22.4 billion dollars

United Parcel Service:

• uses two redundant 3 Terabyte (yes, 3000 GB)databases to track all packages in transit.

• has 4,000 full-time employees dedicated to IT

• spends one billion dollars per year on IT

• has an income of 1.1 billion dollars, againstrevenues of 22.4 billion dollars

Page 104: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

104

Business ComparisonsCompany Revenues IT Budget Pct

Bristol-Myers Squibb 15,065,000,000 440,000,000 2.92 %

Pfizer 11,306,000,000 300,000,000 2.65 %

Pacific Gas & Electric 10,000,000,000 250,000,000 2.50 %

K-Mart 31,437,000,000 130,000,000 0.41 %

Wal-Mart 104,859,000,000 550,000,000 0.52 %

Sprint 14,235,000,000 873,000,000 6.13 %

MCI 18,500,000,000 1,000,000,000 5.41 %

United Parcel 22,400,000,000 1,000,000,000 4.46 %

AMR Corporation 17,753,000,000 1,368,000,000 7.71 %

IBM 75,947,000,000 4,400,000,000 5.79 %

Microsoft 11,360,000,000 510,000,000 4.49 %

Chase-Manhattan 16,431,000,000 1,800,000,000 10.95 %

Nation’s Bank 17,509,000,000 1,130,000,000 6.45 %

Page 105: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

105

Federal Funding of Biomedical-IT

Appropriate funding level:

• approx. 5-10% of research funding

• i.e., 1 - 2 billion dollars per year

Appropriate funding level:

• approx. 5-10% of research funding

• i.e., 1 - 2 billion dollars per year

Page 106: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

106

Federal Funding of Biomedical-IT

Appropriate funding level:

• approx. 5-10% of research funding

• i.e., 1 - 2 billion dollars per year

Appropriate funding level:

• approx. 5-10% of research funding

• i.e., 1 - 2 billion dollars per year

Source of estimate:

- Experience of IT-transformed industries.

- Current support for IT-rich biological research.

Source of estimate:

- Experience of IT-transformed industries.

- Current support for IT-rich biological research.

Page 107: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

Private Sector

Future Reality

Page 108: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

Who is this man?

Page 109: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

Who is this man?

And why should you care?

Page 110: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

TIGR

Page 111: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing
Page 112: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

112

Celera

We want this new company to be the definitive source ofgenomic and related medical information, which scientists canuse to better understand human biology and to deliver improvedhealthcare options.

We want this new company to be the definitive source ofgenomic and related medical information, which scientists canuse to better understand human biology and to deliver improvedhealthcare options.

Tony White, chief executive officer of Perkin-Elmer,Teleconference, May 11, 1998Tony White, chief executive officer of Perkin-Elmer,Teleconference, May 11, 1998

Page 113: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

113

Celera

We want this new company to be the definitive source ofgenomic and related medical information, which scientists canuse to better understand human biology and to deliver improvedhealthcare options. We believe that this company combinescompelling technology with unique sequencing strategies,resulting in a genomics sequencing facility with an expectedcapacity ...

We want this new company to be the definitive source ofgenomic and related medical information, which scientists canuse to better understand human biology and to deliver improvedhealthcare options. We believe that this company combinescompelling technology with unique sequencing strategies,resulting in a genomics sequencing facility with an expectedcapacity ...

Tony White, chief executive officer of Perkin-Elmer,Teleconference, May 11, 1998Tony White, chief executive officer of Perkin-Elmer,Teleconference, May 11, 1998

Page 114: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

114

Celera

We want this new company to be the definitive source ofgenomic and related medical information, which scientists canuse to better understand human biology and to deliver improvedhealthcare options. We believe that this company combinescompelling technology with unique sequencing strategies,resulting in a genomics sequencing facility with an expectedcapacity greater than that of the current combined world output.

We want this new company to be the definitive source ofgenomic and related medical information, which scientists canuse to better understand human biology and to deliver improvedhealthcare options. We believe that this company combinescompelling technology with unique sequencing strategies,resulting in a genomics sequencing facility with an expectedcapacity greater than that of the current combined world output.

Tony White, chief executive officer of Perkin-Elmer,Teleconference, May 11, 1998Tony White, chief executive officer of Perkin-Elmer,Teleconference, May 11, 1998

Page 115: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

115

Celera

We are not a philanthropic organisation, wehave a revenue model for this. We are surepeople will want to buy the information.

We are not a philanthropic organisation, wehave a revenue model for this. We are surepeople will want to buy the information.

Tony White, chief executive officer of Perkin-Elmer,quoted in The Guardian, 13 May 1998Tony White, chief executive officer of Perkin-Elmer,quoted in The Guardian, 13 May 1998

Page 116: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing
Page 117: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

117

Millenium Pharmaceuticals

Millenium’s bugs

Mark Levin is an engineer. That makes him ideally qualifiedto be a successful biotechnology entrepreneur

Millenium’s bugs

Mark Levin is an engineer. That makes him ideally qualifiedto be a successful biotechnology entrepreneur

TheEconomist

TheEconomist

SEPTEMBER 26TH - OCTOBER 2ND 1998

Page 118: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

118

Millenium PharmaceuticalsMany other biotech firms, increasingly desperate for cash asinvestors have shied away from their shares, have queued up todo deals with the pharmaceutical industry at almost any price. MrLevin, though, has been able to dictate his own terms. Uniquelyamong biotechnologists, he seems to have mastered the art ofhaving his cake and eating it.

Except that Mr Levin is not really a biotechnologist at all:he is a chemical engineer. He has worked in process control forcompanies as varied as Miller Brewing and Genentech, a firm ofbiotech pioneers. Whereas biologists tend to see biotech as thesearch for a compound, Mr Levin thinks of it as a complexproduction process. While they concentrate on the bio, he alsothinks hard about the technology.

Many other biotech firms, increasingly desperate for cash asinvestors have shied away from their shares, have queued up todo deals with the pharmaceutical industry at almost any price. MrLevin, though, has been able to dictate his own terms. Uniquelyamong biotechnologists, he seems to have mastered the art ofhaving his cake and eating it.

Except that Mr Levin is not really a biotechnologist at all:he is a chemical engineer. He has worked in process control forcompanies as varied as Miller Brewing and Genentech, a firm ofbiotech pioneers. Whereas biologists tend to see biotech as thesearch for a compound, Mr Levin thinks of it as a complexproduction process. While they concentrate on the bio, he alsothinks hard about the technology.

TheEconomist

TheEconomist

SEPTEMBER 26TH - OCTOBER 2ND 1998

Page 119: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

119

Millenium PharmaceuticalsMr Levin focuses on trying to make each link in the discoverychain as efficient as possible. He has assembled an impressivearray of technologies -- including robotics and informationsystems as well as molecular biology. He then enhances themand links them together in novel ways to create what the engineerin him likes to call "technology platforms". The idea is that theseplatforms should help drug searchers to travel rapidly on theirlong and tortuous journey from gene to treatment. Mr Levin'sgoal is to boost the productivity of drug discovery by 50%,which would lead to many more new drugs coming on to themarket each year.

Mr Levin focuses on trying to make each link in the discoverychain as efficient as possible. He has assembled an impressivearray of technologies -- including robotics and informationsystems as well as molecular biology. He then enhances themand links them together in novel ways to create what the engineerin him likes to call "technology platforms". The idea is that theseplatforms should help drug searchers to travel rapidly on theirlong and tortuous journey from gene to treatment. Mr Levin'sgoal is to boost the productivity of drug discovery by 50%,which would lead to many more new drugs coming on to themarket each year.

TheEconomist

TheEconomist

SEPTEMBER 26TH - OCTOBER 2ND 1998

Page 120: Economic Impact of Bioinformatics - ESP · Bioinformatics is the application of information technology to the problems of biology. With $2500 desktop PCs now delivering more raw computing

120

Slides:

http://www.esp.org/rjr/atcc.pdfhttp://www.esp.org/rjr/atcc.pdf