Nowadays, evaluating the performance of a vehicle before the production phase is challenging and important. In the automotive industry, many virtual simulations are needed to model the vehicle behavior in the best possible way. However, these simulations require a lot of time without the user knowing their runtime in advance. Knowing the required time in advance would allow the user to manage the simulations more effectively and choose the best strategy to use the available computational resources. For this reason, we present an innovative data-driven method to estimate in advance the execution time of simulations. Our approach integrates unsupervised techniques, such as constrained k-means clustering, with classification and regression algorithms based on tree structures. In this paper, we present an innovative and hierarchical data-driven method for estimating the execution time of jobs. Numerous experiments were conducted on a real dataset to verify the effectiveness of the proposed approach. The experimental results show that the proposed method is promising.
Predicting job execution time on a high-performance computing cluster using a hierarchical data-driven methodology / Bethaz, Paolo; Vacchetti, Bartolomeo; Capitelli, Enrica; Nosenzo, Vladi; Chiosso, Luca; Cerquitelli, Tania. - (2022). (Intervento presentato al convegno EDBT/ICDT Workshop, 6th International workshop on Data Analytics solutions for Real-LIfe APplications tenutosi a Edinburgh, UK nel 29th March-1st April, 2022).
Predicting job execution time on a high-performance computing cluster using a hierarchical data-driven methodology
bethaz, paolo;vacchetti, bartolomeo;cerquitelli, tania
2022
Abstract
Nowadays, evaluating the performance of a vehicle before the production phase is challenging and important. In the automotive industry, many virtual simulations are needed to model the vehicle behavior in the best possible way. However, these simulations require a lot of time without the user knowing their runtime in advance. Knowing the required time in advance would allow the user to manage the simulations more effectively and choose the best strategy to use the available computational resources. For this reason, we present an innovative data-driven method to estimate in advance the execution time of simulations. Our approach integrates unsupervised techniques, such as constrained k-means clustering, with classification and regression algorithms based on tree structures. In this paper, we present an innovative and hierarchical data-driven method for estimating the execution time of jobs. Numerous experiments were conducted on a real dataset to verify the effectiveness of the proposed approach. The experimental results show that the proposed method is promising.File | Dimensione | Formato | |
---|---|---|---|
Darli_AP_2022 (1).pdf
accesso aperto
Tipologia:
2a Post-print versione editoriale / Version of Record
Licenza:
Creative commons
Dimensione
884.42 kB
Formato
Adobe PDF
|
884.42 kB | Adobe PDF | Visualizza/Apri |
Pubblicazioni consigliate
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.
https://hdl.handle.net/11583/2961273