Nowadays, evaluating the performance of a vehicle before the production phase is challenging and important. In the automotive industry, many virtual simulations are needed to model the vehicle behavior in the best possible way. However, these simulations require a lot of time without the user knowing their runtime in advance. Knowing the required time in advance would allow the user to manage the simulations more effectively and choose the best strategy to use the available computational resources. For this reason, we present an innovative data-driven method to estimate in advance the execution time of simulations. Our approach integrates unsupervised techniques, such as constrained k-means clustering, with classification and regression algorithms based on tree structures. In this paper, we present an innovative and hierarchical data-driven method for estimating the execution time of jobs. Numerous experiments were conducted on a real dataset to verify the effectiveness of the proposed approach. The experimental results show that the proposed method is promising.

Predicting job execution time on a high-performance computing cluster using a hierarchical data-driven methodology / Bethaz, Paolo; Vacchetti, Bartolomeo; Capitelli, Enrica; Nosenzo, Vladi; Chiosso, Luca; Cerquitelli, Tania. - (2022). (Intervento presentato al convegno EDBT/ICDT Workshop, 6th International workshop on Data Analytics solutions for Real-LIfe APplications tenutosi a Edinburgh, UK nel 29th March-1st April, 2022).

Predicting job execution time on a high-performance computing cluster using a hierarchical data-driven methodology

bethaz, paolo;vacchetti, bartolomeo;cerquitelli, tania
2022

Abstract

Nowadays, evaluating the performance of a vehicle before the production phase is challenging and important. In the automotive industry, many virtual simulations are needed to model the vehicle behavior in the best possible way. However, these simulations require a lot of time without the user knowing their runtime in advance. Knowing the required time in advance would allow the user to manage the simulations more effectively and choose the best strategy to use the available computational resources. For this reason, we present an innovative data-driven method to estimate in advance the execution time of simulations. Our approach integrates unsupervised techniques, such as constrained k-means clustering, with classification and regression algorithms based on tree structures. In this paper, we present an innovative and hierarchical data-driven method for estimating the execution time of jobs. Numerous experiments were conducted on a real dataset to verify the effectiveness of the proposed approach. The experimental results show that the proposed method is promising.
File in questo prodotto:
File Dimensione Formato  
Darli_AP_2022 (1).pdf

accesso aperto

Tipologia: 2a Post-print versione editoriale / Version of Record
Licenza: Creative commons
Dimensione 884.42 kB
Formato Adobe PDF
884.42 kB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11583/2961273