The REGALE architecture will be implemented as an end-to-end, ready-to-use software stack for extreme-scale HPC systems. REGALE partners bring in the project a complete toolset capable of supporting the implementation of the proposed architecture. Within REGALE the incoming tools will be promoted to modules of a complete system following an open architecture.
REGALE tools (open source)
Tool | Partner | Role within REGALE Strawman Architecture | Repository |
OAR | UGA | System Manager | https://github.com/oar-team/oar |
DCDB | LRZ | Monitor, Monitoring Data | https://gitlab.lrz.de/dcdb/dcdb |
EAR | BSC | Monitor, Node Manager, Job Manager, Monitoring Data | https://github.com/eas4dc/EAR |
Melissa | UGA | Workflow Engine | https://gitlab.inria.fr/melissa |
RYAX | RYAX | Workflow Engine | https://github.com/RyaxTech |
Examon | E4/UNIBO | Monitor, Monitoring Data | https://github.com/EEESlab/examon |
COUNTDOWN | CINECA/UNIBO | Job Manager | https://github.com/EEESlab/countdown |
Control Pulp | UNIBO | Node Manager | https://github.com/pulp-platform/control-pulp |
BeBiDa | RYAX | System Manager | https://gitlab.inria.fr/mmercier/bebida |
REGALE tools (closed source)
Closed source software used in the REGALE project are BEO (Bull Energy Optimizer) and BDPO (Bull Dynamic Power Optimizer). More information:
REGALE prototypes
Prototype | Link |
Integration Scenario 1 | closed source |
Integration Scenario 2 | different tools with their own repository |
Integration Scenario 3 | https://gitlab.bsc.es/ear_team/cluster_sim/ |
Integration Scenario 4 | closed source |
Integration Scenario 5 | to be updated |
REGALE Library | to be updated |
REGALE pilots
Sophistication artifacts
Artifacts | Link |
Minimizing resource waste | closed source |
Moldability for energy efficiency | https://github.com/cslab-ntua/oar3/tree/mold |
Node level power controls | closed source |
Thermal and power control on a node level | closed source |
Application aware power-capping | closed source |
Data-aware resource allocation | closed source |
Co-scheduling at the node level | https://github.com/cslab-ntua/oar3/tree/dev |
Co-scheduling within multicore processing units | In the design phase |
Elastic resource management | https://github.com/RyaxTech/bebida-optimization-service/ |
HazardNet: A Thermal Hazard Prediction Framework for Datacenters | https://github.com/MSKazemi/HazardNet |