seeing things differently: innovation in computational ...€¦ · seeing things differently:...

49
Seeing things differently: Innovation in Computational Mass Spectrometry Rob Smith, Ph.D. Associate Professor Department of Computer Science University of Montana “Pain-free MS data processing” Founder | CEO

Upload: others

Post on 26-Jul-2020

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Seeing things differently: Innovation in Computational ...€¦ · Seeing things differently: Innovation in Computational Mass Spectrometry Rob Smith, Ph.D. ... live in different

Seeing things differently: Innovation in Computational

Mass SpectrometryRob Smith, Ph.D.

Associate ProfessorDepartment of Computer ScienceUniversity of Montana

“Pain-free MS data processing”

Founder | CEO

Page 2: Seeing things differently: Innovation in Computational ...€¦ · Seeing things differently: Innovation in Computational Mass Spectrometry Rob Smith, Ph.D. ... live in different

“I undertook something that not everyone may undertake: I descended into the depths, I bored into the foundations.”

—Nietzche, “Dawn of Morning”

Page 3: Seeing things differently: Innovation in Computational ...€¦ · Seeing things differently: Innovation in Computational Mass Spectrometry Rob Smith, Ph.D. ... live in different

Overview: Where is

the innovation?

Page 4: Seeing things differently: Innovation in Computational ...€¦ · Seeing things differently: Innovation in Computational Mass Spectrometry Rob Smith, Ph.D. ... live in different

Innovation

Innovation

Innovation

Page 5: Seeing things differently: Innovation in Computational ...€¦ · Seeing things differently: Innovation in Computational Mass Spectrometry Rob Smith, Ph.D. ... live in different

InnovationInnovation

Innovation

Innovation

Innovation

Page 6: Seeing things differently: Innovation in Computational ...€¦ · Seeing things differently: Innovation in Computational Mass Spectrometry Rob Smith, Ph.D. ... live in different

Current Limits

Current Limits

Cur

rent

Lim

itsC

urrent Limits

InnovationInnovation

Innovation

Innovation

Innovation

Page 7: Seeing things differently: Innovation in Computational ...€¦ · Seeing things differently: Innovation in Computational Mass Spectrometry Rob Smith, Ph.D. ... live in different

Current Limits

Current Limits

Cur

rent

Lim

itsC

urrent Limits

InnovationInnovation

Innovation

Innovation

Innovation

Page 8: Seeing things differently: Innovation in Computational ...€¦ · Seeing things differently: Innovation in Computational Mass Spectrometry Rob Smith, Ph.D. ... live in different

Current Limits

Current Limits

Cur

rent

Lim

itsC

urrent Limits

InnovationInnovation

Innovation

Innovation

Innovation

Page 9: Seeing things differently: Innovation in Computational ...€¦ · Seeing things differently: Innovation in Computational Mass Spectrometry Rob Smith, Ph.D. ... live in different

Why don’t we go there?

Page 10: Seeing things differently: Innovation in Computational ...€¦ · Seeing things differently: Innovation in Computational Mass Spectrometry Rob Smith, Ph.D. ... live in different

Current Limits

Current Limits

Cur

rent

Lim

itsC

urrent Limits

InnovationInnovation

Innovation

Innovation

Innovation

Page 11: Seeing things differently: Innovation in Computational ...€¦ · Seeing things differently: Innovation in Computational Mass Spectrometry Rob Smith, Ph.D. ... live in different

Why don’t we go there?

• Need to identify the limits.

• Need to take risks.

Page 12: Seeing things differently: Innovation in Computational ...€¦ · Seeing things differently: Innovation in Computational Mass Spectrometry Rob Smith, Ph.D. ... live in different

What does the journey look like?

Page 13: Seeing things differently: Innovation in Computational ...€¦ · Seeing things differently: Innovation in Computational Mass Spectrometry Rob Smith, Ph.D. ... live in different

You are on the right track

when…

Page 14: Seeing things differently: Innovation in Computational ...€¦ · Seeing things differently: Innovation in Computational Mass Spectrometry Rob Smith, Ph.D. ... live in different

a) The old guard says, “why would you want to

do that?”

Page 15: Seeing things differently: Innovation in Computational ...€¦ · Seeing things differently: Innovation in Computational Mass Spectrometry Rob Smith, Ph.D. ... live in different

Innovation

Innovation

Innovation

Their world looks like this:

Page 16: Seeing things differently: Innovation in Computational ...€¦ · Seeing things differently: Innovation in Computational Mass Spectrometry Rob Smith, Ph.D. ... live in different

Current Limits

Current Limits

Cur

rent

Lim

itsC

urrent Limits

InnovationInnovation

Innovation

Innovation

Innovation

Not this:

Page 17: Seeing things differently: Innovation in Computational ...€¦ · Seeing things differently: Innovation in Computational Mass Spectrometry Rob Smith, Ph.D. ... live in different

Inside the Box

“There are no unsolved problems.” - A Developer

“Conversations with 100 scientists in the field reveal a bifurcated perception of the state of mass spectrometry software.” R. Smith, Journal of Proteome Research, 2018.

Page 18: Seeing things differently: Innovation in Computational ...€¦ · Seeing things differently: Innovation in Computational Mass Spectrometry Rob Smith, Ph.D. ... live in different

Inside the Box

“How could you possibly make significant improvements to the state of

the art?!” - A Bigwig

Page 19: Seeing things differently: Innovation in Computational ...€¦ · Seeing things differently: Innovation in Computational Mass Spectrometry Rob Smith, Ph.D. ... live in different

Outside the Box

• “All scientific software sucks. It is idosyncratic, it makes no sense, it has glitches, it is a pain in the ass!” - A User

• “[There are ] a few mediocre ones, the rest are absolute crap.” - A User

• “They are complete trash.” - A User

“Conversations with 100 scientists in the field reveal a bifurcated perception of the state of mass spectrometry software.” R. Smith, Journal of Proteome Research, 2018.

Page 20: Seeing things differently: Innovation in Computational ...€¦ · Seeing things differently: Innovation in Computational Mass Spectrometry Rob Smith, Ph.D. ... live in different

b) You ask “why not,” and you find there isn’t a sufficiently good reason.

Page 21: Seeing things differently: Innovation in Computational ...€¦ · Seeing things differently: Innovation in Computational Mass Spectrometry Rob Smith, Ph.D. ... live in different

Current Limits

Current Limits

Cur

rent

Lim

its

Current Lim

its

InnovationInnovation

Innovation

Innovation

Innovation

Why not?

The limits of the possible can only be defined by going beyond them into the impossible.

-Arthur C. Clarke

Page 22: Seeing things differently: Innovation in Computational ...€¦ · Seeing things differently: Innovation in Computational Mass Spectrometry Rob Smith, Ph.D. ... live in different

c) You need a new vocabulary to describe

your solution.

Page 23: Seeing things differently: Innovation in Computational ...€¦ · Seeing things differently: Innovation in Computational Mass Spectrometry Rob Smith, Ph.D. ... live in different

Innovation occurs in the space between reality and the

language we use to describe it.

Page 24: Seeing things differently: Innovation in Computational ...€¦ · Seeing things differently: Innovation in Computational Mass Spectrometry Rob Smith, Ph.D. ... live in different

d) You are able to see and measure limitations

in the status quo.

Page 25: Seeing things differently: Innovation in Computational ...€¦ · Seeing things differently: Innovation in Computational Mass Spectrometry Rob Smith, Ph.D. ... live in different

Outline

• The old guard says, “why would you want to do that?”

• You ask “why not,” and you find there isn’t a sufficiently good reason.

• You need a new vocabulary to describe your solution.

• You are able to see and measure limitations in the status quo.

Page 26: Seeing things differently: Innovation in Computational ...€¦ · Seeing things differently: Innovation in Computational Mass Spectrometry Rob Smith, Ph.D. ... live in different

Outline

• The old guard says, “why would you want to do that?”

• You ask “why not,” and you find there isn’t a sufficiently good reason.

• You need a new vocabulary to describe your solution.

• You are able to see and measure limitations in the status quo.

Page 27: Seeing things differently: Innovation in Computational ...€¦ · Seeing things differently: Innovation in Computational Mass Spectrometry Rob Smith, Ph.D. ... live in different

Part 1: Words and Concepts

Page 28: Seeing things differently: Innovation in Computational ...€¦ · Seeing things differently: Innovation in Computational Mass Spectrometry Rob Smith, Ph.D. ... live in different

“Many problems are caused by the difference between how things actually work, and the

language / tools / paradigms / tropes we use to describe and engage with them.”

-Gregory Bateson

Page 29: Seeing things differently: Innovation in Computational ...€¦ · Seeing things differently: Innovation in Computational Mass Spectrometry Rob Smith, Ph.D. ... live in different

“Language allows you to have ideas otherwise un-haveable, and that by

extension people who own different words live in different conceptual worlds.”

-Joshua Hartshorne

Page 30: Seeing things differently: Innovation in Computational ...€¦ · Seeing things differently: Innovation in Computational Mass Spectrometry Rob Smith, Ph.D. ... live in different

Innovation occurs in the space between reality and the

language we use to describe it.

Page 31: Seeing things differently: Innovation in Computational ...€¦ · Seeing things differently: Innovation in Computational Mass Spectrometry Rob Smith, Ph.D. ... live in different

You can’t code what you can’t describe.

“Current controlled vocabularies are insufficient to uniquely map molecular entities to mass

spectrometry signal” Smith et al., BMC Bioinformatics 16(7), 2015.

Page 32: Seeing things differently: Innovation in Computational ...€¦ · Seeing things differently: Innovation in Computational Mass Spectrometry Rob Smith, Ph.D. ... live in different
Page 33: Seeing things differently: Innovation in Computational ...€¦ · Seeing things differently: Innovation in Computational Mass Spectrometry Rob Smith, Ph.D. ... live in different
Page 34: Seeing things differently: Innovation in Computational ...€¦ · Seeing things differently: Innovation in Computational Mass Spectrometry Rob Smith, Ph.D. ... live in different

Part 2: Asking different

questions

Page 35: Seeing things differently: Innovation in Computational ...€¦ · Seeing things differently: Innovation in Computational Mass Spectrometry Rob Smith, Ph.D. ... live in different
Page 36: Seeing things differently: Innovation in Computational ...€¦ · Seeing things differently: Innovation in Computational Mass Spectrometry Rob Smith, Ph.D. ... live in different

• What we think we are asking

• What we are actually asking

• What we should be asking

Page 37: Seeing things differently: Innovation in Computational ...€¦ · Seeing things differently: Innovation in Computational Mass Spectrometry Rob Smith, Ph.D. ... live in different

• What we think we are asking

• What we are actually asking

• What we should be asking } Not the

same!

Page 38: Seeing things differently: Innovation in Computational ...€¦ · Seeing things differently: Innovation in Computational Mass Spectrometry Rob Smith, Ph.D. ... live in different

What we think we are doing What we are actually doing

p(x) p(x|a,b,c,….)}

What we want to measure

}Our assumptions

An analog or estimate

Page 39: Seeing things differently: Innovation in Computational ...€¦ · Seeing things differently: Innovation in Computational Mass Spectrometry Rob Smith, Ph.D. ... live in different

p(x|a,b,c,….)}

Our assumptions

EASIER TO CALCULATE

But what if a,b,c,…. are wrong?

Page 40: Seeing things differently: Innovation in Computational ...€¦ · Seeing things differently: Innovation in Computational Mass Spectrometry Rob Smith, Ph.D. ... live in different

What we think we are doing What we are actually doing

Given: -a spectrum -context

…what do I have?

Assuming: a single species.

the most abundant ions are from the same species.

ion abundance = parent abundance.

there are little to no modifications.

database contains the correct match.

…what matches best?

Page 41: Seeing things differently: Innovation in Computational ...€¦ · Seeing things differently: Innovation in Computational Mass Spectrometry Rob Smith, Ph.D. ... live in different

What we think we are doing What we are actually doingWhat is the likelihood that a match is correct (FDR)?

Assumes:Target/decoy accurately simulates

the likelihood of a false positive match.

Decoy sequences are dissimilar to target sequences.

The database size is chosen such that the FDR is accurate.

What is the similarity between matched spectra and shuffled or reversed spectra?

Page 42: Seeing things differently: Innovation in Computational ...€¦ · Seeing things differently: Innovation in Computational Mass Spectrometry Rob Smith, Ph.D. ... live in different

What we think we are doing What we are actually doing

Correspondence Alignment

Elution order never changes

MS/MS ID rates are high

m/z doesn’t shift

RT shifts are monotonic

Assumes:

“LC-MS alignment in theory and practice: a comprehensive algorithmic review.” Smith et al. Briefings in Bioinformatics 16(1), 2015.

Page 43: Seeing things differently: Innovation in Computational ...€¦ · Seeing things differently: Innovation in Computational Mass Spectrometry Rob Smith, Ph.D. ... live in different

What we think we are doing What we are actually doing

Which PTMs are in this sample? Does this sample contain this particular PTM?

At what index are these peptides modified?

At what index is this particular PTM found?

One modification at a time.

Only the modification we are looking for.

Page 44: Seeing things differently: Innovation in Computational ...€¦ · Seeing things differently: Innovation in Computational Mass Spectrometry Rob Smith, Ph.D. ... live in different

What we think we are doing What we are actually doingValidating accuracy w/ CV Measuring consistency w/ CV

Sameness -> correctnessCorrect peak integration

Page 45: Seeing things differently: Innovation in Computational ...€¦ · Seeing things differently: Innovation in Computational Mass Spectrometry Rob Smith, Ph.D. ... live in different

What we think we are doing What we are actually doing

Validating algorithms Measuring agreement between algorithms

Sameness -> correctness

Page 46: Seeing things differently: Innovation in Computational ...€¦ · Seeing things differently: Innovation in Computational Mass Spectrometry Rob Smith, Ph.D. ... live in different

If we had more time…

• Signal to noise is meaningful.

• DIA >> DDA

• 2-dimensional signals should be used (XICs, TICs, etc.)

• Predicting spectra is hard; machine learning can make it easier.

Page 47: Seeing things differently: Innovation in Computational ...€¦ · Seeing things differently: Innovation in Computational Mass Spectrometry Rob Smith, Ph.D. ... live in different

Summarizing

• Analogs are not the same as equals.

• We ignore massive and often provably incorrect assumptions.

• Bad assumptions = incorrect results.

Page 48: Seeing things differently: Innovation in Computational ...€¦ · Seeing things differently: Innovation in Computational Mass Spectrometry Rob Smith, Ph.D. ... live in different

Summarizing

• What is the space between reality and the language we use to describe it?

• Are our estimates actually any good?

• Can estimates be improved?

• Can we actually measure what we are currently only estimating?

Page 49: Seeing things differently: Innovation in Computational ...€¦ · Seeing things differently: Innovation in Computational Mass Spectrometry Rob Smith, Ph.D. ... live in different

Acknowledgements

www.primelabs.ms

Smith Computational Mass Spectrometry Lab

NSF Career Award 1552240

NSF SBIR 1819290

NSF I-Corps 1741270

MTBRCT 19-51-031

Funding:

“Pain-free MS data processing”

[email protected]