Repository logo
 

HPC digital twins for evaluating scheduling policies, incentive structures and their impact on power and cooling

dc.contributor.authorMaiterth, Matthias, author
dc.contributor.authorBrewer, Wesley H., author
dc.contributor.authorKuruvella, Jaya S., author
dc.contributor.authorDey, Arunavo, author
dc.contributor.authorIslam, Tanzima Z., author
dc.contributor.authorKabir, Rashadul, author
dc.contributor.authorMenear, Kevin, author
dc.contributor.authorDuplyakin, Dmitry, author
dc.contributor.authorPatki, Tapasya, author
dc.contributor.authorJones, Terry, author
dc.contributor.authorWang, Feiyi, author
dc.contributor.authorACM, publisher
dc.date.accessioned2025-12-22T19:11:59Z
dc.date.available2025-12-22T19:11:59Z
dc.date.issued2025-11-16
dc.description.abstractSchedulers are critical for optimal resource utilization in high-performance computing. Traditional methods to evaluate schedulers are limited to post-deployment analysis, or simulators, which do not model associated infrastructure. In this work, we present the first-of-its-kind integration of scheduling and digital twins in HPC. This enables what-if studies to understand the impact of parameter configurations and scheduling decisions on the physical assets, even before deployment, or regarching changes not easily realizable in production. We (1) provide the first digital twin framework extended with scheduling capabilities, (2) integrate various top-tier HPC systems given their publicly available datasets, (3) implement extensions to integrate external scheduling simulators. Finally, we show how to (4) implement and evaluate incentive structures, as-well-as (5) evaluate machine learning based scheduling, in such novel digital-twin based meta-framework to prototype scheduling. Our work enables what-if scenarios of HPC systems to evaluate sustainability, and the impact on the simulated system.
dc.format.mediumborn digital
dc.format.mediumarticles
dc.identifier.bibliographicCitationMatthias Maiterth,Wesley H. Brewer, Jaya S. Kuruvella, Arunavo Dey, Tanzima Z. Islam, Rashadul Kabir, Kevin Menear, Dmitry Duplyakin, Tapasya Patki, Terry Jones, and Feiyi Wang. 2025. HPC Digital Twins for Evaluating Scheduling Policies, Incentive Structures and their Impact on Power and Cooling. In Workshops of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC Workshops '25), November 16–21, 2025, St Louis, MO, USA. ACM, New York, NY, USA, 11 pages. https://doi.org/10.1145/3731599.3767559
dc.identifier.doihttps://doi.org/10.1145/3731599.3767559
dc.identifier.urihttps://hdl.handle.net/10217/242558
dc.languageEnglish
dc.language.isoeng
dc.publisherColorado State University. Libraries
dc.relation.ispartofPublications
dc.relation.ispartofACM DL Digital Library
dc.rights.licenseThis work is licensed under a Creative Commons Attribution 4.0 International License.
dc.rights.urihttps://creativecommons.org/licenses/by/4.0
dc.subjectscheduling simulators
dc.subjectdigital twin
dc.subjectdata center digital twin
dc.subjectsystem simulator
dc.subjectdistributed systems simulation
dc.subjectbatch scheduling
dc.titleHPC digital twins for evaluating scheduling policies, incentive structures and their impact on power and cooling
dc.typeText
dc.typeImage

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
FACF_ACMOA_3731599.3767559.pdf
Size:
2.53 MB
Format:
Adobe Portable Document Format

Collections