us atlas sctpm machine learning and atlas paolo calafiura what is ml: data-driven statistical...
TRANSCRIPT
![Page 1: US ATLAS SCTPM Machine Learning and ATLAS Paolo Calafiura What is ML: data-driven statistical modeling of complex systems Why now: GPU/MIC/FPGA/Neuromorphic](https://reader038.vdocuments.us/reader038/viewer/2022103100/56649e7c5503460f94b7dcc4/html5/thumbnails/1.jpg)
US ATLAS SCTPM
Machine Learning and ATLAS
Paolo Calafiura
What is ML: data-driven statistical modeling of complex systems
Why now: GPU/MIC/FPGA/Neuromorphic hardware evolution pushes us towards computing with many simple elements
Applications: many! Today will focus on
• Neuromorphic computing for low-power scalable tracking
• Deep Neural Networks for data analysis, data movement
1
![Page 2: US ATLAS SCTPM Machine Learning and ATLAS Paolo Calafiura What is ML: data-driven statistical modeling of complex systems Why now: GPU/MIC/FPGA/Neuromorphic](https://reader038.vdocuments.us/reader038/viewer/2022103100/56649e7c5503460f94b7dcc4/html5/thumbnails/2.jpg)
US ATLAS SCTPM
Motivating Example: Tracking
During Run 4 we have to process ~60M tracks/s (~20x Run 2)
I/O will likely constrain us to run full tracking on line, and to write only an xAOD-like format
Guesstimating x86 cost & performance evolution, we should find x2-5 CPU offline, (much) more @ Point 1
Surely we can parallelize our way out of trouble?
2
![Page 3: US ATLAS SCTPM Machine Learning and ATLAS Paolo Calafiura What is ML: data-driven statistical modeling of complex systems Why now: GPU/MIC/FPGA/Neuromorphic](https://reader038.vdocuments.us/reader038/viewer/2022103100/56649e7c5503460f94b7dcc4/html5/thumbnails/3.jpg)
US ATLAS SCTPM
Why is it so hard to do particle tracking in parallel?
Algorithms: Iterative (propagation, fitting), irregular (combinatorial searches with lots of branch points)
Data: sparse (hits), non-local access (B-field integration)
Can ML allow us to train Neural Networks (NN) that use regular, trivial algorithms, and a naturally data parallel approach?
3
![Page 4: US ATLAS SCTPM Machine Learning and ATLAS Paolo Calafiura What is ML: data-driven statistical modeling of complex systems Why now: GPU/MIC/FPGA/Neuromorphic](https://reader038.vdocuments.us/reader038/viewer/2022103100/56649e7c5503460f94b7dcc4/html5/thumbnails/4.jpg)
US ATLAS SCTPM
Neural Networks:Computing with simple elements
‘neuron’
by themselves, limited functional repertoire.
Simple computing elements…
(Kristofer Bouchard, LBNL)
![Page 5: US ATLAS SCTPM Machine Learning and ATLAS Paolo Calafiura What is ML: data-driven statistical modeling of complex systems Why now: GPU/MIC/FPGA/Neuromorphic](https://reader038.vdocuments.us/reader038/viewer/2022103100/56649e7c5503460f94b7dcc4/html5/thumbnails/5.jpg)
US ATLAS SCTPM
Feed-forward NN: Classification
1
‘neuron’
2
as a network, learn to perform diverse functions
Simple computing elements…
Flow of information
Classification
![Page 6: US ATLAS SCTPM Machine Learning and ATLAS Paolo Calafiura What is ML: data-driven statistical modeling of complex systems Why now: GPU/MIC/FPGA/Neuromorphic](https://reader038.vdocuments.us/reader038/viewer/2022103100/56649e7c5503460f94b7dcc4/html5/thumbnails/6.jpg)
US ATLAS SCTPM
Convolutional NN: Feature Extraction
Feature Extraction Classification
‘neuron’
60
Simple computing elements…
as a network, learn to perform diverse functions
![Page 7: US ATLAS SCTPM Machine Learning and ATLAS Paolo Calafiura What is ML: data-driven statistical modeling of complex systems Why now: GPU/MIC/FPGA/Neuromorphic](https://reader038.vdocuments.us/reader038/viewer/2022103100/56649e7c5503460f94b7dcc4/html5/thumbnails/7.jpg)
US ATLAS SCTPM
Recurrent NN: Time-varying Functions
F(t)
‘neuron’ Simple computing elements…
Feature Extraction Classification Time-varying Functions
Dynamics
as a network, learn to perform diverse functions
![Page 8: US ATLAS SCTPM Machine Learning and ATLAS Paolo Calafiura What is ML: data-driven statistical modeling of complex systems Why now: GPU/MIC/FPGA/Neuromorphic](https://reader038.vdocuments.us/reader038/viewer/2022103100/56649e7c5503460f94b7dcc4/html5/thumbnails/8.jpg)
US ATLAS SCTPM
Tracking Kaggle Challenge
Inspired by the very successful Higgs Challenge
Competition among ML experts:
• Problem: Given a list of space-points produce a list of track candidates
• Figure of merit: efficiency for given fake rate and CPU budget (still under discussion)
More next week during C&S plenary
8
![Page 9: US ATLAS SCTPM Machine Learning and ATLAS Paolo Calafiura What is ML: data-driven statistical modeling of complex systems Why now: GPU/MIC/FPGA/Neuromorphic](https://reader038.vdocuments.us/reader038/viewer/2022103100/56649e7c5503460f94b7dcc4/html5/thumbnails/9.jpg)
US ATLAS SCTPM
LHCb Trigger Retina Processor
FPGA implementation1mus trackingOffline-quality performanceCertainly good enough for seeding
Track parameter space22K bins, one “receptor” per bin
![Page 10: US ATLAS SCTPM Machine Learning and ATLAS Paolo Calafiura What is ML: data-driven statistical modeling of complex systems Why now: GPU/MIC/FPGA/Neuromorphic](https://reader038.vdocuments.us/reader038/viewer/2022103100/56649e7c5503460f94b7dcc4/html5/thumbnails/10.jpg)
US ATLAS SCTPM
Neuromorphic Computing
“Spikey” from Electronic Visions group in Heidelberg
Qualcomm’s NPU’s for robots.
SpiNNaker’s 1B neuron machine
Stanford’s Neurogrid
Intel’s concept design...
IBM’s TrueNorth
(Peter Nugent, LBNL)
![Page 11: US ATLAS SCTPM Machine Learning and ATLAS Paolo Calafiura What is ML: data-driven statistical modeling of complex systems Why now: GPU/MIC/FPGA/Neuromorphic](https://reader038.vdocuments.us/reader038/viewer/2022103100/56649e7c5503460f94b7dcc4/html5/thumbnails/11.jpg)
US ATLAS SCTPM
IBM TrueNorth
• 1 million programmable neurons• 256 million synapses• 4096 neurosynaptic cores• Uses 70mW per chip• 5.4 billion transistors• Spiking rate >1000Hz
A single chip can process color video in real-time while consuming 176,000 times less energy than a current Intel chip performing the exact same analysis. Note the Intel chip can not do this analysis in real-time and is in fact 300 times slower! Merolla+ Science (2014)
![Page 12: US ATLAS SCTPM Machine Learning and ATLAS Paolo Calafiura What is ML: data-driven statistical modeling of complex systems Why now: GPU/MIC/FPGA/Neuromorphic](https://reader038.vdocuments.us/reader038/viewer/2022103100/56649e7c5503460f94b7dcc4/html5/thumbnails/12.jpg)
US ATLAS SCTPM
Neuromorphic Kalman Filters (LBNL LDRD FY16 proposal)
Paolo Calafiura (CRD), Kristofer Bouchard (Life Sciences),David Donofrio (CRD), Rebecca Carney (Physics),
Maurice Garcia-Sciveres (Physics), Craig E. Tull (CRD)
Implement Kalman filters on neuromorphic chips for low-power, high-throughput, real-time data processing
Brain-machine interfaces Charged particle tracking
![Page 13: US ATLAS SCTPM Machine Learning and ATLAS Paolo Calafiura What is ML: data-driven statistical modeling of complex systems Why now: GPU/MIC/FPGA/Neuromorphic](https://reader038.vdocuments.us/reader038/viewer/2022103100/56649e7c5503460f94b7dcc4/html5/thumbnails/13.jpg)
US ATLAS SCTPM
Deep Learning for Data Analysis
Peter Sadowski (UCI)
![Page 14: US ATLAS SCTPM Machine Learning and ATLAS Paolo Calafiura What is ML: data-driven statistical modeling of complex systems Why now: GPU/MIC/FPGA/Neuromorphic](https://reader038.vdocuments.us/reader038/viewer/2022103100/56649e7c5503460f94b7dcc4/html5/thumbnails/14.jpg)
US ATLAS SCTPM
Optimizing Higgs Detection
Peter Sadowski (UCI)
![Page 15: US ATLAS SCTPM Machine Learning and ATLAS Paolo Calafiura What is ML: data-driven statistical modeling of complex systems Why now: GPU/MIC/FPGA/Neuromorphic](https://reader038.vdocuments.us/reader038/viewer/2022103100/56649e7c5503460f94b7dcc4/html5/thumbnails/15.jpg)
US ATLAS SCTPM
Optimizing Higgs Detection
Peter Sadowski (UCI)
![Page 16: US ATLAS SCTPM Machine Learning and ATLAS Paolo Calafiura What is ML: data-driven statistical modeling of complex systems Why now: GPU/MIC/FPGA/Neuromorphic](https://reader038.vdocuments.us/reader038/viewer/2022103100/56649e7c5503460f94b7dcc4/html5/thumbnails/16.jpg)
US ATLAS SCTPM
dianahep (NSF S2I2 project) Improve ML tools in ROOT
![Page 17: US ATLAS SCTPM Machine Learning and ATLAS Paolo Calafiura What is ML: data-driven statistical modeling of complex systems Why now: GPU/MIC/FPGA/Neuromorphic](https://reader038.vdocuments.us/reader038/viewer/2022103100/56649e7c5503460f94b7dcc4/html5/thumbnails/17.jpg)
US ATLAS SCTPM
Machine Learning and Data Management
Vast and growing amount of data on user access patterns. Combine engineered and learned features to:
• Pre-fetch data and pre-allocate resources
• Optimize data clustering and replication
• Suggest related data sets
17
![Page 18: US ATLAS SCTPM Machine Learning and ATLAS Paolo Calafiura What is ML: data-driven statistical modeling of complex systems Why now: GPU/MIC/FPGA/Neuromorphic](https://reader038.vdocuments.us/reader038/viewer/2022103100/56649e7c5503460f94b7dcc4/html5/thumbnails/18.jpg)
US ATLAS SCTPM
Conclusions
• ML will be predominant in Run 4 analysis (wager #1)• Deep neural networks in tracking, jet reco, and clustering will
allow us to exploit GPUs and FPGAs, and possible neuromorphic architectures (wager #2)
• 2025 grad-students will wonder why we wrote all that C++ junk instead of training a few good networks (wager #3)
ML learning experts are not formed overnight (and command high 6-digits salaries, so tend to disappear fast)
US ATLAS should start developing ML expertise by supporting pilot projects in all relevant areas, e.g. data analysis, reconstruction (tracking), and data/job management
18
![Page 19: US ATLAS SCTPM Machine Learning and ATLAS Paolo Calafiura What is ML: data-driven statistical modeling of complex systems Why now: GPU/MIC/FPGA/Neuromorphic](https://reader038.vdocuments.us/reader038/viewer/2022103100/56649e7c5503460f94b7dcc4/html5/thumbnails/19.jpg)
US ATLAS SCTPM
Thanks
• Kristofer Bouchard• David Clark• Kyle Cranmer• Maurice Garcia-Sciveres• Peter Nugent• Peter Sadowski• Tracking Kaggle group• …
]
![Page 20: US ATLAS SCTPM Machine Learning and ATLAS Paolo Calafiura What is ML: data-driven statistical modeling of complex systems Why now: GPU/MIC/FPGA/Neuromorphic](https://reader038.vdocuments.us/reader038/viewer/2022103100/56649e7c5503460f94b7dcc4/html5/thumbnails/20.jpg)
US ATLAS SCTPM
Backup
![Page 21: US ATLAS SCTPM Machine Learning and ATLAS Paolo Calafiura What is ML: data-driven statistical modeling of complex systems Why now: GPU/MIC/FPGA/Neuromorphic](https://reader038.vdocuments.us/reader038/viewer/2022103100/56649e7c5503460f94b7dcc4/html5/thumbnails/21.jpg)
US ATLAS SCTPM
Possible Tracking Network
F(t)
Seeding Selection Fitting
Dynamics
Track
Candidates
![Page 22: US ATLAS SCTPM Machine Learning and ATLAS Paolo Calafiura What is ML: data-driven statistical modeling of complex systems Why now: GPU/MIC/FPGA/Neuromorphic](https://reader038.vdocuments.us/reader038/viewer/2022103100/56649e7c5503460f94b7dcc4/html5/thumbnails/22.jpg)
US ATLAS SCTPM
Kalman Filters and Recurrent NNs
Classic Data Assimilation algorithm (1960, NASA)Iteratively track evolution of a dynamic system
StateData
KalmanFilter
Dynamics
Data
Dyn
State
RecurrentNeuralNetwork