david p. anderson space sciences laboratory university of california – berkeley...
TRANSCRIPT
![Page 1: David P. Anderson Space Sciences Laboratory University of California – Berkeley davea@ssl.berkeley.edu Designing Middleware for Volunteer Computing](https://reader035.vdocuments.us/reader035/viewer/2022062713/56649f4d5503460f94c6e1e9/html5/thumbnails/1.jpg)
David P. AndersonSpace Sciences Laboratory
University of California – [email protected]
Designing Middleware for Volunteer Computing
![Page 2: David P. Anderson Space Sciences Laboratory University of California – Berkeley davea@ssl.berkeley.edu Designing Middleware for Volunteer Computing](https://reader035.vdocuments.us/reader035/viewer/2022062713/56649f4d5503460f94c6e1e9/html5/thumbnails/2.jpg)
Why volunteer computing?
● 2006: 1 billion PCs, 55% privately owned● If 100M people participate:
– 100 PetaFLOPs, 1 Exabyte (10^18) storage● Consumer products drive technology
– GPUs (NVIDIA, Sony Cell)
your computers
academicbusine
ss
home PCs
![Page 3: David P. Anderson Space Sciences Laboratory University of California – Berkeley davea@ssl.berkeley.edu Designing Middleware for Volunteer Computing](https://reader035.vdocuments.us/reader035/viewer/2022062713/56649f4d5503460f94c6e1e9/html5/thumbnails/3.jpg)
Volunteer computing history
95 96 97 98 99 00 01 02 03 04 05 06
GIMPS, distributed.net
SETI@home, folding@home
commercial projects
climateprediction.net
BOINC
Einstein@homeRosetta@homePredictor@homeLHC@homeBURPPrimeGrid...
![Page 4: David P. Anderson Space Sciences Laboratory University of California – Berkeley davea@ssl.berkeley.edu Designing Middleware for Volunteer Computing](https://reader035.vdocuments.us/reader035/viewer/2022062713/56649f4d5503460f94c6e1e9/html5/thumbnails/4.jpg)
Scientific computing paradigms
Grid computing
Supercomputers
Volunteer computing
Cluster computing
ControlBang/buckleast
leastmost
most
![Page 5: David P. Anderson Space Sciences Laboratory University of California – Berkeley davea@ssl.berkeley.edu Designing Middleware for Volunteer Computing](https://reader035.vdocuments.us/reader035/viewer/2022062713/56649f4d5503460f94c6e1e9/html5/thumbnails/5.jpg)
BOINC
SETI physicsClimate biomedical
Joe Alice Jens
volunteers
projects
![Page 6: David P. Anderson Space Sciences Laboratory University of California – Berkeley davea@ssl.berkeley.edu Designing Middleware for Volunteer Computing](https://reader035.vdocuments.us/reader035/viewer/2022062713/56649f4d5503460f94c6e1e9/html5/thumbnails/6.jpg)
Participation in >1 project
● Better short-term resource utilization– communicate/compute in parallel– match applications to resources
● Better long-term resource utilization– project A works while project B thinks
projectcomputingneeds think
work
think
work
time
![Page 7: David P. Anderson Space Sciences Laboratory University of California – Berkeley davea@ssl.berkeley.edu Designing Middleware for Volunteer Computing](https://reader035.vdocuments.us/reader035/viewer/2022062713/56649f4d5503460f94c6e1e9/html5/thumbnails/7.jpg)
Server performance
How many clients can a project support?
![Page 8: David P. Anderson Space Sciences Laboratory University of California – Berkeley davea@ssl.berkeley.edu Designing Middleware for Volunteer Computing](https://reader035.vdocuments.us/reader035/viewer/2022062713/56649f4d5503460f94c6e1e9/html5/thumbnails/8.jpg)
Task server architecture
MySQLTransitionerScheduler
Feeder
File deleter
DB purger
Assimilator
Validator
Work creator
Shared mem
clients
![Page 9: David P. Anderson Space Sciences Laboratory University of California – Berkeley davea@ssl.berkeley.edu Designing Middleware for Volunteer Computing](https://reader035.vdocuments.us/reader035/viewer/2022062713/56649f4d5503460f94c6e1e9/html5/thumbnails/9.jpg)
Server load (CPU)
Create Send Validate Assimilate File delete DB purge0
50
100
150
200
250
300
350
CPU seconds per 100,000 tasks
ApplicationMySQL
![Page 10: David P. Anderson Space Sciences Laboratory University of California – Berkeley davea@ssl.berkeley.edu Designing Middleware for Volunteer Computing](https://reader035.vdocuments.us/reader035/viewer/2022062713/56649f4d5503460f94c6e1e9/html5/thumbnails/10.jpg)
Server load (disk I/O)
Create Send Validate Assimilate File delete DB purge0
50
100
150
200
250
300
350
400
Disk I/O (MB) per 100,000 tasks
![Page 11: David P. Anderson Space Sciences Laboratory University of California – Berkeley davea@ssl.berkeley.edu Designing Middleware for Volunteer Computing](https://reader035.vdocuments.us/reader035/viewer/2022062713/56649f4d5503460f94c6e1e9/html5/thumbnails/11.jpg)
Server limits
● Single server (2X Xeon, 100 Mbps disk)– 8.8 million tasks/day– 4.4 PetaFLOPS (if 12 hrs on 1 GFLOPS CPU)– CPU is bottleneck (2.5% disk utilization)– 8.2 Mbps network (if 10K request/reply)
● Multiple servers (1 MySQL, 2 for others)– 23.6 million tasks/day– MySQL CPU is bottleneck– 21.9 Mbps network
![Page 12: David P. Anderson Space Sciences Laboratory University of California – Berkeley davea@ssl.berkeley.edu Designing Middleware for Volunteer Computing](https://reader035.vdocuments.us/reader035/viewer/2022062713/56649f4d5503460f94c6e1e9/html5/thumbnails/12.jpg)
Credit
![Page 13: David P. Anderson Space Sciences Laboratory University of California – Berkeley davea@ssl.berkeley.edu Designing Middleware for Volunteer Computing](https://reader035.vdocuments.us/reader035/viewer/2022062713/56649f4d5503460f94c6e1e9/html5/thumbnails/13.jpg)
Credit display
![Page 14: David P. Anderson Space Sciences Laboratory University of California – Berkeley davea@ssl.berkeley.edu Designing Middleware for Volunteer Computing](https://reader035.vdocuments.us/reader035/viewer/2022062713/56649f4d5503460f94c6e1e9/html5/thumbnails/14.jpg)
Credit system goals
● Retain participants– fair between users, across projects– understandable– cheat-resistant
● Maximize utility to projects– hardware upgrades– assignment of projects to computers
![Page 15: David P. Anderson Space Sciences Laboratory University of California – Berkeley davea@ssl.berkeley.edu Designing Middleware for Volunteer Computing](https://reader035.vdocuments.us/reader035/viewer/2022062713/56649f4d5503460f94c6e1e9/html5/thumbnails/15.jpg)
Credit system● Computation credit
– benchmark-based– application benchmarks– application operation counting– cheat-resistance: redundancy
● Other resources– network, disk storage, RAM
● Other behaviors– recruitment– other participation
![Page 16: David P. Anderson Space Sciences Laboratory University of California – Berkeley davea@ssl.berkeley.edu Designing Middleware for Volunteer Computing](https://reader035.vdocuments.us/reader035/viewer/2022062713/56649f4d5503460f94c6e1e9/html5/thumbnails/16.jpg)
![Page 17: David P. Anderson Space Sciences Laboratory University of California – Berkeley davea@ssl.berkeley.edu Designing Middleware for Volunteer Computing](https://reader035.vdocuments.us/reader035/viewer/2022062713/56649f4d5503460f94c6e1e9/html5/thumbnails/17.jpg)
Benchmarks not whole story
![Page 18: David P. Anderson Space Sciences Laboratory University of California – Berkeley davea@ssl.berkeley.edu Designing Middleware for Volunteer Computing](https://reader035.vdocuments.us/reader035/viewer/2022062713/56649f4d5503460f94c6e1e9/html5/thumbnails/18.jpg)
Limits of Volunteer Computing
● How much processing/disk/RAM is out there?
● Combinations of resources● Data from 330,000 SETI@home
participants
![Page 19: David P. Anderson Space Sciences Laboratory University of California – Berkeley davea@ssl.berkeley.edu Designing Middleware for Volunteer Computing](https://reader035.vdocuments.us/reader035/viewer/2022062713/56649f4d5503460f94c6e1e9/html5/thumbnails/19.jpg)
![Page 20: David P. Anderson Space Sciences Laboratory University of California – Berkeley davea@ssl.berkeley.edu Designing Middleware for Volunteer Computing](https://reader035.vdocuments.us/reader035/viewer/2022062713/56649f4d5503460f94c6e1e9/html5/thumbnails/20.jpg)
Operating system
Number of hosts
GFLOPS per host
GFLOPS total
Windows total 292,688 1.676 490,545 XP 229,555 1.739 399,196 2000 42,830 1.310 56,107 2003 10,367 2.690 27,887 98 6,591 0.680 4,482 Millennium 1,973 0.789 1,557 NT 1,249 0.754 942 Longhorn 86 2.054 177 95 37 0.453 17 Linux 21,042 1.148 24,156 Darwin 15,830 1.150 18,205 SunOS 1,091 0.852 930 Others 1,134 1.364 1,547 Total 331,785 1.613 535,169
![Page 21: David P. Anderson Space Sciences Laboratory University of California – Berkeley davea@ssl.berkeley.edu Designing Middleware for Volunteer Computing](https://reader035.vdocuments.us/reader035/viewer/2022062713/56649f4d5503460f94c6e1e9/html5/thumbnails/21.jpg)
![Page 22: David P. Anderson Space Sciences Laboratory University of California – Berkeley davea@ssl.berkeley.edu Designing Middleware for Volunteer Computing](https://reader035.vdocuments.us/reader035/viewer/2022062713/56649f4d5503460f94c6e1e9/html5/thumbnails/22.jpg)
![Page 23: David P. Anderson Space Sciences Laboratory University of California – Berkeley davea@ssl.berkeley.edu Designing Middleware for Volunteer Computing](https://reader035.vdocuments.us/reader035/viewer/2022062713/56649f4d5503460f94c6e1e9/html5/thumbnails/23.jpg)
![Page 24: David P. Anderson Space Sciences Laboratory University of California – Berkeley davea@ssl.berkeley.edu Designing Middleware for Volunteer Computing](https://reader035.vdocuments.us/reader035/viewer/2022062713/56649f4d5503460f94c6e1e9/html5/thumbnails/24.jpg)
Goals of BOINC
● > 100 projects, some churn● Handle big data better
– BitTorrent integration– Use GPUs and other resources– DAGs
● Participation– 10-100 million– multiple projects per participant