Conference: ICPE 2023

The International Conference on Performance Engineering (ICPE) 2023 was held in Coimbra, Portugal from April 15 – 19, 2023. REGALE researchers presented a paper: Martin Molan, Junaid Ahmed Khan, Andrea Borghesi, Andrea Bartolini: Graph Neural Networks for Anomaly Anticipation in HPC Systems.

IWD: Interview with Varvara Asouti (NTUA)

With International Women’s Day just around the corner, we asked one of our female scientists, Varvara Asouti, some questions about “women in science/STEM”. Varvara leads Work Package 4 “Exascaling the REGALE pilots” in the REGALE project. The work in her work package aims to achieve exascale performance for the REGALE pilots, leveraging the capabilities of … Read more

Journal Paper

Andrea Borghesi; Alessio Burrello; Andrea Bartolini: ExaMon-X: a Predictive Maintenance Framework for Automatic Monitoring in Industrial IoT Systems IEEE Internet of Things Journal, 2023 Link: https://cris.unibo.it/bitstream/11585/861933/3/postPrintVersion_plus_editorialReference.pdf

EuroHPC malleability hackathon

The EuroHPC projects ADMIRE, DEEP-SEA, TIME-X and REGALE organised the 1st EuroHPC malleability hackathon at the University Grenoble Alpes in Grenoble, France. The event took place from 23rd – 27th January 2023. REGALE was represented by several researchers working on dynamic resource utilization for “traditional HPC workload”. More: https://sites.google.com/view/1st-eurohpc-mall-hackathon/

Workshop Paper

Eishi Arima; Minjoon Kang; Issa Saba; Josef Weidendorfer; Carsten Trinitis; Martin Schulz: Optimizing Hardware Resource Partitioning and Job Allocations on Modern GPUs under Power Caps ICPP Workshops ’22: 51st International Conference on Parallel Processing Workshop Link: https://arxiv.org/abs/2405.03838

Workshop Paper

Eishi Arima; Isaías A Comprés; Martin Schulz: On the Convergence of Malleability and the HPC PowerStack: Exploiting Dynamism in Over-Provisioned and Power-Constrained HPC Systems High Performance Computing: ISC High Performance 2022 International Workshops Link: https://arxiv.org/abs/2405.03847

Thesis

Mohsen Seyedkazemi Ardebili: Monitoring and Prediction of Thermal Emergencies in High Performance Computing Systems Link: https://amsdottorato.unibo.it/9891/

Conference Paper

Issa Saba; Eishi Arima; Dai Liu; Martin Schulz: Orchestrated Co-Scheduling, Resource Partitioning, and Power Capping on CPU-GPU Heterogeneous Systems via Machine Learning 35th International Conference on Architecture of Computing Systems, ARCS 2022 Link: https://arxiv.org/abs/2405.03831

Conference Paper

Quentin Guilloteau, Jonathan Bleuzen, Millian Poquet, Olivier Richard: “Painless Transposition of Reproducible Distributed Environments with NixOS Compose”. IEEE CLUSTER 2022. Link: https://hal.archives-ouvertes.fr/hal-03723771v1/document

Workshop Paper

Lucas Meyer, Alejandro Ribes, Bruno Raffin: “Simulation-Based Parallel Training”. AI for Science Workshop, Neurips 2022. Link: https://hal.archives-ouvertes.fr/hal-03842106