clustrx: a new generation operating system designed for hpc · optimized for hpc purposes (97.1%...
TRANSCRIPT
![Page 1: Clustrx: A new generation operating system designed for HPC · Optimized for HPC purposes (97.1% local LINPACK) Binary RHEL compatibility Supports legacy 32bit applications Set of](https://reader033.vdocuments.us/reader033/viewer/2022042114/5e90d37d59e30262e36dce71/html5/thumbnails/1.jpg)
Clustrx: A new generationoperating systemdesigned for HPC
HPC 2010, June 21-25, CetraroDmitry Tkachev, R&D Director
![Page 2: Clustrx: A new generation operating system designed for HPC · Optimized for HPC purposes (97.1% local LINPACK) Binary RHEL compatibility Supports legacy 32bit applications Set of](https://reader033.vdocuments.us/reader033/viewer/2022042114/5e90d37d59e30262e36dce71/html5/thumbnails/2.jpg)
The Petascale ChallengeMSU Installation: 0.5PFlop ~ 5000 nodesFor installation of 10-100 PetaFlop:
~10000 or 100000 or 1 million nodes
…as a single resource
What challenges must the Operating System address?
![Page 3: Clustrx: A new generation operating system designed for HPC · Optimized for HPC purposes (97.1% local LINPACK) Binary RHEL compatibility Supports legacy 32bit applications Set of](https://reader033.vdocuments.us/reader033/viewer/2022042114/5e90d37d59e30262e36dce71/html5/thumbnails/3.jpg)
The 10 plagues of OS1. Reliability: Constantly changing
configuration of the system2. Resource management: What
nodes/processors can be used ?3. Management of the nodes: Switch on/off4. Monitoring: What’s the temperature? What
is used/not-used?5. Electricity: can parts be switched off?6. File System: Where is the data?7. Provisioning: What version of
Linux/Windows/etc is/should install8. Remote load: how to load the nodes in short
time?9 Keeping track of the
![Page 4: Clustrx: A new generation operating system designed for HPC · Optimized for HPC purposes (97.1% local LINPACK) Binary RHEL compatibility Supports legacy 32bit applications Set of](https://reader033.vdocuments.us/reader033/viewer/2022042114/5e90d37d59e30262e36dce71/html5/thumbnails/4.jpg)
The 10th plague
10. Does it scale?
![Page 5: Clustrx: A new generation operating system designed for HPC · Optimized for HPC purposes (97.1% local LINPACK) Binary RHEL compatibility Supports legacy 32bit applications Set of](https://reader033.vdocuments.us/reader033/viewer/2022042114/5e90d37d59e30262e36dce71/html5/thumbnails/5.jpg)
Solution definitionThe node Operating System (Linux, SUSE, RedServer, Cray OS, etc...)
Is NOT your solution (local resource allocation and utilization)
System Management software is the solution:
The (Cluster) Operating System
![Page 6: Clustrx: A new generation operating system designed for HPC · Optimized for HPC purposes (97.1% local LINPACK) Binary RHEL compatibility Supports legacy 32bit applications Set of](https://reader033.vdocuments.us/reader033/viewer/2022042114/5e90d37d59e30262e36dce71/html5/thumbnails/6.jpg)
Clustrx – The (Cluster) Operating System Light weight real time monitoring of ALL
resources• Computational nodes, switches, electricity,
cooling, etcClustrx is responsible for full life cycle
of the system:• Emergency switching on/off on particular
failures• Switching off to conserve electricityResource allocation for computation• Match of program needs to available
![Page 7: Clustrx: A new generation operating system designed for HPC · Optimized for HPC purposes (97.1% local LINPACK) Binary RHEL compatibility Supports legacy 32bit applications Set of](https://reader033.vdocuments.us/reader033/viewer/2022042114/5e90d37d59e30262e36dce71/html5/thumbnails/7.jpg)
Clustrx’s architecture
![Page 8: Clustrx: A new generation operating system designed for HPC · Optimized for HPC purposes (97.1% local LINPACK) Binary RHEL compatibility Supports legacy 32bit applications Set of](https://reader033.vdocuments.us/reader033/viewer/2022042114/5e90d37d59e30262e36dce71/html5/thumbnails/8.jpg)
Clustrx Subsystems1. Clustrx Watch - monitoring and control
1. dConf - Cluster-wide, decentralized distributed storage for configuration data
1. Resource manager - POSIX-compliant, modular, scalable, GRID-ready
1. Network boot & provisioning -infrastructure to support any number of computing nodes
![Page 9: Clustrx: A new generation operating system designed for HPC · Optimized for HPC purposes (97.1% local LINPACK) Binary RHEL compatibility Supports legacy 32bit applications Set of](https://reader033.vdocuments.us/reader033/viewer/2022042114/5e90d37d59e30262e36dce71/html5/thumbnails/9.jpg)
1. Clustrx WatchNear real-time monitoring of 10000’s of nodes (no practical limit)Full support of heterogeneous architectures Monitoring agents for all major OSes with low CPU loadIPMI monitoring and management SNMP monitoring Cluster state visualizationAutomated actions (including emergency shutdown)3rd parties’ SW monitoring through open APIAdaptive schemes of task management and cluster controls
![Page 10: Clustrx: A new generation operating system designed for HPC · Optimized for HPC purposes (97.1% local LINPACK) Binary RHEL compatibility Supports legacy 32bit applications Set of](https://reader033.vdocuments.us/reader033/viewer/2022042114/5e90d37d59e30262e36dce71/html5/thumbnails/10.jpg)
Clustrx Watch (2)Cluster state visualization
![Page 11: Clustrx: A new generation operating system designed for HPC · Optimized for HPC purposes (97.1% local LINPACK) Binary RHEL compatibility Supports legacy 32bit applications Set of](https://reader033.vdocuments.us/reader033/viewer/2022042114/5e90d37d59e30262e36dce71/html5/thumbnails/11.jpg)
Clustrx Watch (3)
![Page 12: Clustrx: A new generation operating system designed for HPC · Optimized for HPC purposes (97.1% local LINPACK) Binary RHEL compatibility Supports legacy 32bit applications Set of](https://reader033.vdocuments.us/reader033/viewer/2022042114/5e90d37d59e30262e36dce71/html5/thumbnails/12.jpg)
2. Clustrx dConf
Distributed scalable database to keep configuration
data
Tight integrated with other subsystems
Unified access to configuration data for systems
admin
Queries over dConf, automated actions to perform
massive changes, rollbacks
![Page 13: Clustrx: A new generation operating system designed for HPC · Optimized for HPC purposes (97.1% local LINPACK) Binary RHEL compatibility Supports legacy 32bit applications Set of](https://reader033.vdocuments.us/reader033/viewer/2022042114/5e90d37d59e30262e36dce71/html5/thumbnails/13.jpg)
3. Clustrx resource managerModular resource management for clusters
Open API for external plug-ins
POSIX-compliant
Tight integration with Clustrx Watch & dConf
Adaptive power management for green-
computing features to any hardware
Topology-based resource allocation
![Page 14: Clustrx: A new generation operating system designed for HPC · Optimized for HPC purposes (97.1% local LINPACK) Binary RHEL compatibility Supports legacy 32bit applications Set of](https://reader033.vdocuments.us/reader033/viewer/2022042114/5e90d37d59e30262e36dce71/html5/thumbnails/14.jpg)
Clustrx resource manager (2)
![Page 15: Clustrx: A new generation operating system designed for HPC · Optimized for HPC purposes (97.1% local LINPACK) Binary RHEL compatibility Supports legacy 32bit applications Set of](https://reader033.vdocuments.us/reader033/viewer/2022042114/5e90d37d59e30262e36dce71/html5/thumbnails/15.jpg)
4. Infrastructure management
Network boot infrastructure for disk & diskless
nodes
Supports all major OSes including Win HPC 2008
Linearly scalable
Administrative interface to create and manage
bootable OS images
Accounting of network loads and usage of images
![Page 16: Clustrx: A new generation operating system designed for HPC · Optimized for HPC purposes (97.1% local LINPACK) Binary RHEL compatibility Supports legacy 32bit applications Set of](https://reader033.vdocuments.us/reader033/viewer/2022042114/5e90d37d59e30262e36dce71/html5/thumbnails/16.jpg)
Clustrx green
User-defined rule-based policies for power
management
Decreasing frequency on computing nodes
Hibernating on demand
Powering on and off on demand
Automated power-down of idle hardware
components
![Page 17: Clustrx: A new generation operating system designed for HPC · Optimized for HPC purposes (97.1% local LINPACK) Binary RHEL compatibility Supports legacy 32bit applications Set of](https://reader033.vdocuments.us/reader033/viewer/2022042114/5e90d37d59e30262e36dce71/html5/thumbnails/17.jpg)
Heterogeneous architectures
Architecture-independent system
management
Hybrid MPI
Supports accelerated nodes
Main direction of further development
![Page 18: Clustrx: A new generation operating system designed for HPC · Optimized for HPC purposes (97.1% local LINPACK) Binary RHEL compatibility Supports legacy 32bit applications Set of](https://reader033.vdocuments.us/reader033/viewer/2022042114/5e90d37d59e30262e36dce71/html5/thumbnails/18.jpg)
Clustrx CNL
Optimized for HPC purposes (97.1% local
LINPACK)
Binary RHEL compatibility
Supports legacy 32bit applications
Set of memory managers and CPU
schedulers, selectable by resource manager
Kernel level monitoring and management
agent
![Page 19: Clustrx: A new generation operating system designed for HPC · Optimized for HPC purposes (97.1% local LINPACK) Binary RHEL compatibility Supports legacy 32bit applications Set of](https://reader033.vdocuments.us/reader033/viewer/2022042114/5e90d37d59e30262e36dce71/html5/thumbnails/19.jpg)
Our differentiationUniformed, holistic approach
HPC-grade kernel
A good part of code was developed from
scratch to address Petaflop-scale
challenges
Scalable to 10000s and more nodes
Cross-platform management
![Page 20: Clustrx: A new generation operating system designed for HPC · Optimized for HPC purposes (97.1% local LINPACK) Binary RHEL compatibility Supports legacy 32bit applications Set of](https://reader033.vdocuments.us/reader033/viewer/2022042114/5e90d37d59e30262e36dce71/html5/thumbnails/20.jpg)
Future directionsGrid-specific features (tested with CERN`s gLite 3.x,…)Extended billing, Monitoring and billing reports, export statistical dataVirtual clustersAdaptive task management based on real-time profilingGPU and other accelerators virtualizationOpen MPI tuning and optimizationCluster segmentationCheckpointing, reliable task runTransparent node image migration
![Page 21: Clustrx: A new generation operating system designed for HPC · Optimized for HPC purposes (97.1% local LINPACK) Binary RHEL compatibility Supports legacy 32bit applications Set of](https://reader033.vdocuments.us/reader033/viewer/2022042114/5e90d37d59e30262e36dce71/html5/thumbnails/21.jpg)
The Clustrx Operating System
Scalable and Reliable Next GenerationOperating System
for Petaflop and Exaflop computing