petapg
TRANSCRIPT
![Page 1: PetaPG](https://reader038.vdocuments.us/reader038/viewer/2022100602/557f0db0d8b42a01678b478a/html5/thumbnails/1.jpg)
Dispatching Petabyteswith PostgreSQL
Andrew [email protected]
![Page 2: PetaPG](https://reader038.vdocuments.us/reader038/viewer/2022100602/557f0db0d8b42a01678b478a/html5/thumbnails/2.jpg)
15M media objects3PB raw data
storage, streaming, processing
![Page 3: PetaPG](https://reader038.vdocuments.us/reader038/viewer/2022100602/557f0db0d8b42a01678b478a/html5/thumbnails/3.jpg)
HDFS? Isilon?custom solution
![Page 4: PetaPG](https://reader038.vdocuments.us/reader038/viewer/2022100602/557f0db0d8b42a01678b478a/html5/thumbnails/4.jpg)
1000s hard drivesfile system per drive
filename = sha256(file)
![Page 5: PetaPG](https://reader038.vdocuments.us/reader038/viewer/2022100602/557f0db0d8b42a01678b478a/html5/thumbnails/5.jpg)
dispatchingingestion, rebalancing
encoding, analysis
![Page 6: PetaPG](https://reader038.vdocuments.us/reader038/viewer/2022100602/557f0db0d8b42a01678b478a/html5/thumbnails/6.jpg)
PostgreSQL!(of course)
![Page 7: PetaPG](https://reader038.vdocuments.us/reader038/viewer/2022100602/557f0db0d8b42a01678b478a/html5/thumbnails/7.jpg)
entitiessha (asset), hdd, chassismetadata, actions, status
![Page 8: PetaPG](https://reader038.vdocuments.us/reader038/viewer/2022100602/557f0db0d8b42a01678b478a/html5/thumbnails/8.jpg)
15M master objects25M derivatives
70M copies
![Page 9: PetaPG](https://reader038.vdocuments.us/reader038/viewer/2022100602/557f0db0d8b42a01678b478a/html5/thumbnails/9.jpg)
200GB core500GB XML processing
2TB+ overall
![Page 10: PetaPG](https://reader038.vdocuments.us/reader038/viewer/2022100602/557f0db0d8b42a01678b478a/html5/thumbnails/10.jpg)
custom typesenum
native/wrappers
![Page 11: PetaPG](https://reader038.vdocuments.us/reader038/viewer/2022100602/557f0db0d8b42a01678b478a/html5/thumbnails/11.jpg)
hashtypesshatypes
+ crc32, bugfixes
![Page 12: PetaPG](https://reader038.vdocuments.us/reader038/viewer/2022100602/557f0db0d8b42a01678b478a/html5/thumbnails/12.jpg)
actionsfully async, fail-over
dumb polling
![Page 13: PetaPG](https://reader038.vdocuments.us/reader038/viewer/2022100602/557f0db0d8b42a01678b478a/html5/thumbnails/13.jpg)
smart lockingupdate set t=now()where t old
update returning
![Page 14: PetaPG](https://reader038.vdocuments.us/reader038/viewer/2022100602/557f0db0d8b42a01678b478a/html5/thumbnails/14.jpg)
XMLthird-party metadata
stored, processed in PG
![Page 15: PetaPG](https://reader038.vdocuments.us/reader038/viewer/2022100602/557f0db0d8b42a01678b478a/html5/thumbnails/15.jpg)
researchlarge-scale action logging
![Page 16: PetaPG](https://reader038.vdocuments.us/reader038/viewer/2022100602/557f0db0d8b42a01678b478a/html5/thumbnails/16.jpg)
productionaggregated views of dispatcher
![Page 17: PetaPG](https://reader038.vdocuments.us/reader038/viewer/2022100602/557f0db0d8b42a01678b478a/html5/thumbnails/17.jpg)
distributed logicdispatcher, XML processing,
production, researchfull-mesh data exchange
![Page 18: PetaPG](https://reader038.vdocuments.us/reader038/viewer/2022100602/557f0db0d8b42a01678b478a/html5/thumbnails/18.jpg)
table data transferslow or inflexible
simple custom scripts, diff
![Page 19: PetaPG](https://reader038.vdocuments.us/reader038/viewer/2022100602/557f0db0d8b42a01678b478a/html5/thumbnails/19.jpg)
dream industriesdisruptive innovation lab
funding, collaboratinginviting, hiring